Shihan Dou

34 papers · 2022–2026 · 6 conferences · across top CS/AI conferences

Achievements

+8 more ↓

🐝 Cross-Pollinator (10) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (6) 🌈 Renaissance Researcher (8)

🌍 Conference Polyglot (6) 🐣 Hot Topic Early Bird 👥 Mega-Team (27) 🤝 Dynamic Duo (23) 🔥 Unstoppable (5) 💎 Century Club (25) 🗃️ Keyword Collector (113) ⚡ Prolific Year (12)

Conferences

ACL (19) EMNLP (6) COLING (3) AAAI (2) ICLR (2) ICML (2)

Top co-authors

Qi Zhang (30) Xuanjing Huang (28) Tao Gui (24) Zhiheng Xi (13) Rui Zheng (12) Xiao Wang (9) Shichun Liu (9) Junjie Ye (9) Ming Zhang (8) Songyang Gao (8)

Research topics

Privacy (1)

Keywords

large language model (12) reinforcement learning (6) reward model (5) reinforcement learning from human feedback (3) language model (3) text classification (3) mutual information (3) distribution shift (3) instruction tuning (2) knowledge retention (2) question answering (2) preference alignment (2) compiler feedback (2) code generation (2) representation learning (2) out-of-distribution generalization (2) sequential decision making (1) model robustness (1) reward modeling (1) object detection (1)

Papers

Beyond Scaling: Measuring and Predicting the Upper Bound of Knowledge Retention in Language Model Pre-Training ACL 2026 JanusMM: A Benchmark for Self-Deprecation Understanding in Real-World Multimodal Conversations ACL 2026 VRPO: Rethinking Value Modeling for Robust RL under Noisy Supervision in LLM Post-Training ACL 2026 PRISM: Probabilistic Reward Model with Inherent Structural Modeling ACL 2026 LLMEval-Fair: A Large-Scale Longitudinal Study on Robust and Fair Evaluation of Large Language Models ACL 2026 DARM: Distribution-Aware Reward Modeling by Alleviating Biases from Low Preference-Context Dependency Data ACL 2026 OctoBench: Benchmarking Scaffold-Aware Instruction Following in Repository-Grounded Agentic Coding ACL 2026 MetaAct-RL: Training Language Models for Reasoning Through Meta-Action-Based Reinforcement Learning AAAI 2026 Enhancing LLM-based Search Agents via Contribution Weighted Group Relative Policy Optimization ACL 2026 Alleviating Shifted Distribution in Human Preference Alignment through Meta-Learning AAAI 2025 Lost in the Context: Insufficient and Distracted Attention to Contexts in Preference Modeling ACL 2025 Measuring Data Diversity for Instruction Tuning: A Systematic Analysis and A Reliable Metric ACL 2025 Multi-Programming Language Sandbox for LLMs ACL 2025 PFDial: A Structured Dialogue Instruction Fine-tuning Method Based on UML Flowcharts ACL 2025 DocFusion: A Unified Framework for Document Parsing Tasks ACL 2025 ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios COLING 2025 Revisiting Jailbreaking for Large Language Models: A Representation Engineering Perspective COLING 2025 Governance in Motion: Co-evolution of Constitutions and AI models for Scalable Safety EMNLP 2025 UPLex: Fine-Grained Personality Control in Large Language Models via Unsupervised Lexical Modulation EMNLP 2025 LLMEval-Med: A Real-world Clinical Benchmark for Medical LLMs with Physician Validation EMNLP 2025 RMB: Comprehensively benchmarking reward models in LLM alignment ICLR 2025 LoRAMoE: Alleviating World Knowledge Forgetting in Large Language Models via MoE-Style Plugin ACL 2024 StepCoder: Improving Code Generation with Reinforcement Learning from Compiler Feedback ACL 2024 Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and Feedback ICML 2024 Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning ICML 2024 Improving Generalization of Alignment with Human Preferences through Group Invariant Learning ICLR 2024 TransferTOD: A Generalizable Chinese Multi-Domain Task-Oriented Dialogue System with Transfer Capabilities EMNLP 2024 DSRM: Boost Textual Adversarial Training with Distribution Shift Risk Minimization ACL 2023 Loose lips sink ships: Mitigating Length Bias in Reinforcement Learning from Human Feedback EMNLP 2023 On the Universal Adversarial Perturbations for Efficient Data-free Adversarial Detection ACL 2023 Detecting Adversarial Samples through Sharpness of Loss Landscape ACL 2023 MINER: Improving Out-of-Vocabulary Named Entity Recognition from an Information Theoretic Perspective ACL 2022 Decorrelate Irrelevant, Purify Relevant: Overcome Textual Spurious Correlations from a Feature Perspective COLING 2022 Kernel-Whitening: Overcome Dataset Bias with Isotropic Sentence Embedding EMNLP 2022