Shihan Dou
34 papers · 2022–2026 · 6 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+8 more ↓ Show less ↑
🐝 Cross-Pollinator (10) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (6) 🌈 Renaissance Researcher (8)
🌍
Conference Polyglot
(6)
🐣
Hot Topic Early Bird
👥
Mega-Team
(27)
🤝
Dynamic Duo
(23)
🔥
Unstoppable
(5)
💎
Century Club
(25)
🗃️
Keyword Collector
(113)
⚡
Prolific Year
(12)
Conferences
ACL (19)
EMNLP (6)
COLING (3)
AAAI (2)
ICLR (2)
ICML (2)
Top co-authors
Research topics
Keywords
large language model
(12)
reinforcement learning
(6)
reward model
(5)
reinforcement learning from human feedback
(3)
language model
(3)
text classification
(3)
mutual information
(3)
distribution shift
(3)
instruction tuning
(2)
knowledge retention
(2)
question answering
(2)
preference alignment
(2)
compiler feedback
(2)
code generation
(2)
representation learning
(2)
out-of-distribution generalization
(2)
sequential decision making
(1)
model robustness
(1)
reward modeling
(1)
object detection
(1)
Papers
Beyond Scaling: Measuring and Predicting the Upper Bound of Knowledge Retention in Language Model Pre-Training
ACL 2026
JanusMM: A Benchmark for Self-Deprecation Understanding in Real-World Multimodal Conversations
ACL 2026
VRPO: Rethinking Value Modeling for Robust RL under Noisy Supervision in LLM Post-Training
ACL 2026
PRISM: Probabilistic Reward Model with Inherent Structural Modeling
ACL 2026
LLMEval-Fair: A Large-Scale Longitudinal Study on Robust and Fair Evaluation of Large Language Models
ACL 2026
DARM: Distribution-Aware Reward Modeling by Alleviating Biases from Low Preference-Context Dependency Data
ACL 2026
OctoBench: Benchmarking Scaffold-Aware Instruction Following in Repository-Grounded Agentic Coding
ACL 2026
MetaAct-RL: Training Language Models for Reasoning Through Meta-Action-Based Reinforcement Learning
AAAI 2026
Enhancing LLM-based Search Agents via Contribution Weighted Group Relative Policy Optimization
ACL 2026
Alleviating Shifted Distribution in Human Preference Alignment through Meta-Learning
AAAI 2025
Lost in the Context: Insufficient and Distracted Attention to Contexts in Preference Modeling
ACL 2025
Measuring Data Diversity for Instruction Tuning: A Systematic Analysis and A Reliable Metric
ACL 2025
Multi-Programming Language Sandbox for LLMs
ACL 2025
PFDial: A Structured Dialogue Instruction Fine-tuning Method Based on UML Flowcharts
ACL 2025
DocFusion: A Unified Framework for Document Parsing Tasks
ACL 2025
ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios
COLING 2025
Revisiting Jailbreaking for Large Language Models: A Representation Engineering Perspective
COLING 2025
Governance in Motion: Co-evolution of Constitutions and AI models for Scalable Safety
EMNLP 2025
UPLex: Fine-Grained Personality Control in Large Language Models via Unsupervised Lexical Modulation
EMNLP 2025
LLMEval-Med: A Real-world Clinical Benchmark for Medical LLMs with Physician Validation
EMNLP 2025
RMB: Comprehensively benchmarking reward models in LLM alignment
ICLR 2025
LoRAMoE: Alleviating World Knowledge Forgetting in Large Language Models via MoE-Style Plugin
ACL 2024
StepCoder: Improving Code Generation with Reinforcement Learning from Compiler Feedback
ACL 2024
Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and Feedback
ICML 2024
Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning
ICML 2024
Improving Generalization of Alignment with Human Preferences through Group Invariant Learning
ICLR 2024
TransferTOD: A Generalizable Chinese Multi-Domain Task-Oriented Dialogue System with Transfer Capabilities
EMNLP 2024
DSRM: Boost Textual Adversarial Training with Distribution Shift Risk Minimization
ACL 2023
Loose lips sink ships: Mitigating Length Bias in Reinforcement Learning from Human Feedback
EMNLP 2023
On the Universal Adversarial Perturbations for Efficient Data-free Adversarial Detection
ACL 2023
Detecting Adversarial Samples through Sharpness of Loss Landscape
ACL 2023
MINER: Improving Out-of-Vocabulary Named Entity Recognition from an Information Theoretic Perspective
ACL 2022
Decorrelate Irrelevant, Purify Relevant: Overcome Textual Spurious Correlations from a Feature Perspective
COLING 2022
Kernel-Whitening: Overcome Dataset Bias with Isotropic Sentence Embedding
EMNLP 2022