Ke Zeng
13 papers · 2024–2026 · 3 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+2 more ↓ Show less ↑
π Renaissance Researcher (5) π Conference Polyglot (2) π Cross-Pollinator (13) π§ Keyword Pioneer π Interdisciplinary Bridge
πΊοΈ
Taxonomy Completionist
(27)
β
The Questioner
Conferences
ACL (9)
EMNLP (3)
AAAI (1)
Top co-authors
Keywords
large language model
(8)
reinforcement learning
(6)
policy optimization
(3)
catastrophic forgetting
(1)
anomaly detection
(1)
curriculum learning
(1)
knowledge transfer
(1)
preference learning
(1)
information gain
(1)
knowledge distillation
(1)
instruction following
(1)
paraphrase generation
(1)
computational efficiency
(1)
instruction tuning
(1)
importance sampling
(1)
reinforcement learning from human feedback
(1)
event detection
(1)
adaptive computation
(1)
mathematical reasoning
(1)
multi-task learning
(1)
Papers
Harmonizing Dense and Sparse Signals in Multi-turn RL: Dual-Horizon Credit Assignment for Industrial Sales Agents
ACL 2026
Rectify Evaluation Preference: Improving LLMsβ Critique on Math Reasoning via Perplexity-aware Reinforcement Learning
AAAI 2026
PaTaRM: Bridging Pairwise and Pointwise Signals via Preference-Aware Task-Adaptive Reward Modeling
ACL 2026
SILO-BENCH: A Scalable Environment for Evaluating Distributed Coordination in Multi-Agent LLM Systems
ACL 2026
Turning Failures into Value: Negative Experience Replay for RLVR via Confidence Gating and Boundary Failure Sampling
ACL 2026
MASPO: Unifying Gradient Utilization, Probability Mass, and Signal Reliability for Robust and Sample-Efficient LLM Reasoning
ACL 2026
From log π to π: Taming Divergence in Soft Clipping via Bilateral Decoupled Decay of Probability Gradient Weight
ACL 2026
Donβt Half-listen: Capturing Key-part Information in Continual Instruction Tuning
ACL 2025
Enhancing Efficiency and Exploration in Reinforcement Learning for LLMs
EMNLP 2025
A Reasoner for Real-World Event Detection: Scaling Reinforcement Learning via Adaptive Perplexity-Aware Sampling Strategy
EMNLP 2025
When to Continue Thinking: Adaptive Thinking Mode Switching for Efficient Reasoning
EMNLP 2025
Dual-Stage Multi-Task Syntax-Oriented Pre-Training for Syntactically Controlled Paraphrase Generation
ACL 2024
Learning or Self-aligning? Rethinking Instruction Fine-tuning
ACL 2024