conftrace_

Christopher Parisien

9 papers · 2008–2025 · 4 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+3 more ↓

🐣 Hot Topic Early Bird 🧭 Keyword Pioneer 🌍 Conference Polyglot (4) 🏃 Academic Marathon (17) 🐝 Cross-Pollinator (12)

🗺️ Taxonomy Completionist (24) 🌉 Interdisciplinary Bridge 🏆 Keyword Champion (2)

Conferences

EMNLP (6) ACL (1) CONLL (1) NAACL (1)

Top co-authors

Makesh Narsimhan Sreedhar (6) Traian Rebedea (6) Shaona Ghosh (4) Razvan Dinu (1) Aishwarya Padmakumar (1) Yulia Tsvetkov (1) Liwei Jiang (1) Makesh Sreedhar (1) Yftah Ziser (1) Jibin Rajan Varghese (1)

Keywords

large language model (5) dialogue system (4) ai safety (3) content moderation (2) canonical form (2) instruction tuning (1) domain adaptation (1) adversarial learning (1) responsible ai (1) model safety (1) neural network interpretability (1) prompt learning (1) model security (1) intent classification (1) topic coherence (1) safety alignment (1) adversarial attack (1) language model (1) controllable generation (1) task-oriented dialogue (1)

Papers

Guardrails and Security for LLMs: Safe, Secure and Controllable Steering of LLM Applications ACL 2025 A Simple Yet Effective Method for Non-Refusing Context Relevant Fine-grained Safety Steering in LLMs EMNLP 2025 Safety Through Reasoning: An Empirical Study of Reasoning Guardrail Models EMNLP 2025 AEGIS2.0: A Diverse AI Safety Dataset and Risks Taxonomy for Alignment of LLM Guardrails NAACL 2025 Unsupervised Extraction of Dialogue Policies from Conversations EMNLP 2024 CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues EMNLP 2024 NeMo Guardrails: A Toolkit for Controllable and Safe LLM Applications with Programmable Rails EMNLP 2023 Prompt Learning for Domain Adaptation in Task-Oriented Dialogue EMNLP 2022 An Incremental Bayesian Model for Learning Syntactic Categories CONLL 2008