Kaiyan Zhang
25 papers · 2021–2026 · 9 conferences · across top CS/AI conferences
Achievements
Cross-Pollinator (12)
Conference Polyglot (9)
Keyword Pioneer
Hot Topic Early Bird
Academic Marathon (5)
Taxonomy Completionist (49)
Dynamic Duo (18)
Grand Slam
Keyword Champion (3)
Prolific Year (13)
The Questioner
Keyword Collector (94)
Century Club (22)
Conferences
ACL (7)
EMNLP (5)
ICML (4)
AAAI (3)
ICCV (2)
ICLR (1)
IJCNLP (1)
NAACL (1)
NIPS (1)
Keywords
large language model (7)
reinforcement learning (3)
test-time scaling (3)
reinforcement learning from human feedback (2)
knowledge distillation (2)
language model (2)
knowledge retrieval (2)
foundation model (2)
code generation (2)
instruction tuning (2)
supervised fine-tuning (2)
persona-based dialogue (2)
chain-of-thought reasoning (2)
response generation (2)
multi-agent system (2)
preference optimization (2)
video generation (1)
question answering (1)
image generation (1)
representation learning (1)
Papers
Can AI Be a Good Peer Reviewer? A Survey of Peer Review Process, Evaluation, and the Future
ACL 2026
GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning
AAAI 2026
MARS2: Scaling Multi-Agent Tree Search via Reinforcement Learning for Code Generation
ACL 2026
Scalability of LLM-Based Multi-Agent Systems for Scientific Code Generation: A Preliminary Study
EMNLP 2025
Retrieval-Augmented Visual Question Answering via Built-in Autoregressive Search Engines
AAAI 2025
ReviewRL: Towards Automated Scientific Review with RL
EMNLP 2025
SciSketch: An Open-source Framework for Automated Schematic Diagram Generation in Scientific Papers
EMNLP 2025
Video-T1: Test-time Scaling for Video Generation
ICCV 2025
AdsQA: Towards Advertisement Video Understanding
ICCV 2025
OpenPRM: Building Open-domain Process-based Reward Models with Preference Trees
ICLR 2025
Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization
ICML 2025
Free Process Rewards without Process Labels
ICML 2025
How to Synthesize Text Data without Model Collapse?
ICML 2025
MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding
ICML 2025
Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process
ACL 2025
Fusing Highly Specialized Language Models for Comprehensive Expertise
ACL 2025
SMR: State Memory Replay for Long Sequence Modeling
ACL 2024
Generative Multi-Modal Knowledge Retrieval with Large Language Models
AAAI 2024
PaD: Program-aided Distillation Can Teach Small Models Reasoning Better than Chain-of-thought Fine-tuning
NAACL 2024
CoGenesis: A Framework Collaborating Large and Small Language Models for Secure Context-Aware Instruction Following
ACL 2024
UltraMedical: Building Specialized Generalists in Biomedicine
NIPS 2024
Scalable Efficient Training of Large Language Models with Low-dimensional Projected Attention
EMNLP 2024
CRaSh: Clustering, Removing, and Sharing Enhance Fine-tuning without Full Large Language Model
EMNLP 2023
BoB: BERT Over BERT for Training Persona-based Dialogue Models from Limited Personalized Data
IJCNLP 2021
BoB: BERT Over BERT for Training Persona-based Dialogue Models from Limited Personalized Data
ACL 2021