Christopher Parisien
9 papers · 2008–2025 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+3 more ↓ Show less ↑
π£ Hot Topic Early Bird π§ Keyword Pioneer π Conference Polyglot (4) π Academic Marathon (17) π Cross-Pollinator (12)
πΊοΈ
Taxonomy Completionist
(24)
π
Interdisciplinary Bridge
π
Keyword Champion
(2)
Conferences
EMNLP (6)
ACL (1)
CONLL (1)
NAACL (1)
Top co-authors
Keywords
large language model
(5)
dialogue system
(4)
ai safety
(3)
content moderation
(2)
canonical form
(2)
instruction tuning
(1)
domain adaptation
(1)
adversarial learning
(1)
responsible ai
(1)
model safety
(1)
neural network interpretability
(1)
prompt learning
(1)
model security
(1)
intent classification
(1)
topic coherence
(1)
safety alignment
(1)
adversarial attack
(1)
language model
(1)
controllable generation
(1)
task-oriented dialogue
(1)
Papers
Guardrails and Security for LLMs: Safe, Secure and Controllable Steering of LLM Applications
ACL 2025
A Simple Yet Effective Method for Non-Refusing Context Relevant Fine-grained Safety Steering in LLMs
EMNLP 2025
Safety Through Reasoning: An Empirical Study of Reasoning Guardrail Models
EMNLP 2025
AEGIS2.0: A Diverse AI Safety Dataset and Risks Taxonomy for Alignment of LLM Guardrails
NAACL 2025
Unsupervised Extraction of Dialogue Policies from Conversations
EMNLP 2024
CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues
EMNLP 2024
NeMo Guardrails: A Toolkit for Controllable and Safe LLM Applications with Programmable Rails
EMNLP 2023
Prompt Learning for Domain Adaptation in Task-Oriented Dialogue
EMNLP 2022
An Incremental Bayesian Model for Learning Syntactic Categories
CONLL 2008