Tharindu Kumarage
9 papers · 2023–2026 · 5 top CS/AI conferences
Achievements
Cross-Pollinator (14)
Keyword Pioneer
Hot Topic Early Bird
Conference Polyglot (5)
Interdisciplinary Bridge
Taxonomy Completionist (10)
Prolific Year (6)
The Questioner (2)
Conferences
ACL (3)
AACL (2)
IJCNLP (2)
EMNLP (1)
NAACL (1)
Keywords
large language model (2)
fact verification (1)
news analysis (1)
ai safety (1)
safety alignment (1)
reinforcement learning from human feedback (1)
knowledge graph (1)
adversarial attack (1)
language model (1)
reward model (1)
adversarial prompt (1)
soft prompt (1)
red teaming (1)
harmful content detection (1)
prompt tuning (1)
text detection (1)
misinformation detection (1)
knowledge augmentation (1)
multi-agent system (1)
news classification (1)
Papers
ARES: Adaptive Red-Teaming and End-to-End Repair of Policy-Reward System
ACL 2026
Towards Safety Reasoning in LLMs: AI-agentic Deliberation for Policy-embedded CoT Data Creation
ACL 2025
Can Knowledge Graphs Reduce Hallucinations in LLMs? : A Survey
NAACL 2024
J-Guard: Journalism Guided Adversarially Robust Detection of AI-generated News
AACL 2023
J-Guard: Journalism Guided Adversarially Robust Detection of AI-generated News
IJCNLP 2023
ConDA: Contrastive Domain Adaptation for AI-generated Text Detection
IJCNLP 2023
How Reliable Are AI-Generated-Text Detectors? An Assessment Framework Using Evasive Soft Prompts
EMNLP 2023
ConDA: Contrastive Domain Adaptation for AI-generated Text Detection
AACL 2023
Towards Detecting Harmful Agendas in News Articles
ACL 2023