Qingyang Li
7 papers · 2014–2025 · 4 top CS/AI conferences
Achievements
Interdisciplinary Bridge
Keyword Pioneer
Conference Polyglot (4)
Academic Marathon (11)
Cross-Pollinator (7)
Renaissance Researcher (6)
Taxonomy Completionist (22)
Hot Topic Early Bird
Conferences
EMNLP (3)
ACL (2)
ICML (1)
NIPS (1)
Keywords
prompt engineering (2)
large language model (2)
offline reinforcement learning (1)
mathematical reasoning (1)
preference learning (1)
self-agreement (1)
centroid-based clustering (1)
resource allocation (1)
dialogue generation (1)
preference optimization (1)
policy learning (1)
reinforcement learning from human feedback (1)
markov decision process (1)
model-based reinforcement learning (1)
linear programming (1)
language model reasoning (1)
policy gradient (1)
clustering algorithm (1)
bias mitigation (1)
reinforcement learning (1)
Papers
Towards Reward Fairness in RLHF: From a Resource Allocation Perspective
ACL 2025
SPPD: Self-training with Process Preference Learning Using Dynamic Value Margin
EMNLP 2025
MoCoKGC: Momentum Contrast Entity Encoding for Knowledge Graph Completion
EMNLP 2024
Just Ask One More Time! Self-Agreement Improves Reasoning of Language Models in (Almost) All Scenarios
ACL 2024
Synthetic Dialogue Dataset Generation using LLM Agents
EMNLP 2023
Offline Model-based Adaptable Policy Learning
NIPS 2021
A Highly Scalable Parallel Algorithm for Isotropic Total Variation Models
ICML 2014