Senjie Jin
9 papers · 2023–2026 · 5 conferences · across top CS/AI conferences
Achievements
Cross-Pollinator (15)
Keyword Pioneer
Conference Polyglot (3)
Renaissance Researcher (5)
Interdisciplinary Bridge
Taxonomy Completionist (24)
Mega-Team (21)
The Questioner (2)
Conferences
AAAI (3)
EMNLP (3)
ACL (1)
CVPR (1)
ICML (1)
Keywords
reinforcement learning (5)
preference alignment (2)
large language model (2)
mathematical reasoning (2)
language model (2)
safety alignment (1)
sequential decision making (1)
benchmark evaluation (1)
text generation (1)
chain-of-thought reasoning (1)
program synthesis (1)
language model alignment (1)
policy optimization (1)
reinforcement learning from human feedback (1)
vision language model (1)
preference modeling (1)
prompt engineering (1)
cross-modal alignment (1)
reward model (1)
reward modeling (1)
Papers
Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination
AAAI 2026
What Makes a Good Speech Tokenizer for LLM-Centric Speech Generation? A Systematic Study
AAAI 2026
MetaAct-RL: Training Language Models for Reasoning Through Meta-Action-Based Reinforcement Learning
AAAI 2026
VRPO: Rethinking Value Modeling for Robust RL under Noisy Supervision in LLM Post-Training
ACL 2026
SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Models
CVPR 2025
Parrot: A Training Pipeline Enhances Both Program CoT and Natural Language CoT for Reasoning
EMNLP 2025
Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning
ICML 2024
Improving Discriminative Capability of Reward Models in RLHF Using Contrastive Learning
EMNLP 2024
Self-Polish: Enhance Reasoning in Large Language Models via Problem Refinement
EMNLP 2023