Jiahe Guo
7 papers · 2025–2026 · 3 top CS/AI conferences
Achievements
Interdisciplinary Bridge
Keyword Pioneer
Conference Polyglot (2)
Cross-Pollinator (12)
Taxonomy Completionist (16)
Rising Star (5)
Conferences
ACL (4)
EMNLP (2)
AAAI (1)
Keywords
large language model (5)
safety alignment (3)
preference learning (2)
emotional support conversation (2)
dialogue generation (1)
model safety (1)
ai safety (1)
monte carlo tree search (1)
preference modeling (1)
adversarial defense (1)
supervised fine-tuning (1)
jailbreak attack (1)
inference cost (1)
jailbreak defense (1)
strategy optimization (1)
multilingual learning (1)
attack success rate (1)
adaptive reasoning (1)
activation steering (1)
role-play fine-tuning (1)
Papers
Trade-offs in Large Reasoning Models: An Empirical Analysis of Deliberative and Adaptive Reasoning over Foundational Capabilities
AAAI 2026
When Personalization Legitimizes Risks: Uncovering Safety Vulnerabilities in Personalized Dialogue Agents
ACL 2026
TEA-Bench: A Systematic Benchmarking of Tool-enhanced Emotional Support Dialogue Agent
ACL 2026
Beware of Your Po! Measuring and Mitigating AI Safety Risks in Role-Play Fine-Tuning of LLMs
ACL 2025
Chain of Strategy Optimization Makes Large Language Models Better Emotional Supporter
EMNLP 2025
MPO: Multilingual Safety Alignment via Reward Gap Optimization
ACL 2025
AdaSteer: Your Aligned LLM is Inherently an Adaptive Jailbreak Defender
EMNLP 2025