Jiaheng Liu
65 papers · 2019–2026 · 12 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+16 more ↓ Show less ↑
π§ Keyword Pioneer π Conference Polyglot (11) πΊοΈ Taxonomy Completionist (21) π Interdisciplinary Bridge π Academic Marathon (6)
πΊοΈ
Taxonomy Completionist
(21)
π§
Keyword Pioneer
π£
Hot Topic Early Bird
π
Conference Loyalist
(21)
π
Keyword Champion
(4)
π
Triple Crown
π
Grand Slam
π₯
Mega-Team
(28)
π¬
Deep Specialist
(18)
π§¬
Topic Evolution
π€
Dynamic Duo
(19)
β‘
Prolific Year
(18)
π
Century Club
(56)
π₯
Unstoppable
(7)
β
The Questioner
(2)
ποΈ
Keyword Collector
(64)
Conferences
ACL (26)
AAAI (7)
EMNLP (6)
ICCV (5)
ICLR (5)
NIPS (4)
ECCV (3)
NAACL (3)
CVPR (2)
EACL (2)
COLING (1)
ICML (1)
Top co-authors
Keywords
large language model
(23)
benchmark evaluation
(10)
multimodal large language model
(7)
knowledge distillation
(6)
model compression
(4)
factuality evaluation
(4)
multimodal learning
(4)
representation learning
(4)
instruction tuning
(4)
reward modeling
(4)
metric learning
(3)
contrastive learning
(3)
chinese language
(3)
language model
(3)
visual question answering
(3)
instruction following
(3)
reinforcement learning
(3)
face recognition
(3)
preference learning
(3)
feature embedding
(3)
Papers
Long-form RewardBench: Evaluating Reward Models for Long-form Generation
AAAI 2026
COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values
EACL 2026
MMRA: A Benchmark for Evaluating Multi-Granularity and Multi-Image Relational Association Capabilities in Large Visual Language Models
EACL 2026
USB: A COMPREHENSIVE AND UNIFIED SAFETY EVALUATION BENCHMARK FOR MULTIMODAL LARGE LANGUAGE MODELS
ACL 2026
When Agents Look the Same: Quantifying Distillation-Induced Similarity in Tool-Use Behaviors
ACL 2026
CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization
ACL 2026
Think-J: Learning to Think for Generative LLM-as-a-Judge
AAAI 2026
Thinking-Based Non-Thinking: Solving the Reward Hacking Problem in Training Hybrid Reasoning Models via Reinforcement Learning
ACL 2026
ReLook: Vision-Grounded RL with a Multimodal LLM Critic for Agentic Web Coding
ACL 2026
PopAlign: Diversifying Contrasting Patterns for a More Comprehensive Alignment
ACL 2025
XCOT: Cross-lingual Instruction Tuning for Cross-lingual Chain-of-Thought Reasoning
AAAI 2025
TableBench: A Comprehensive and Complex Benchmark for Table Question Answering
AAAI 2025
Quantification of Large Language Model Distillation
ACL 2025
MuSC: Improving Complex Instruction Following with Multi-granularity Self-Contrastive Training
ACL 2025
Can MLLMs Understand the Deep Implication Behind Chinese Images?
ACL 2025
Chinese SafetyQA: A Safety Short-form Factuality Benchmark for Large Language Models
ACL 2025
M2RC-EVAL: Massively Multilingual Repository-level Code Completion Evaluation
ACL 2025
Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning?
ACL 2025
Chinese SimpleQA: A Chinese Factuality Evaluation for Large Language Models
ACL 2025
OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models
ACL 2025
ProgCo: Program Helps Self-Correction of Large Language Models
ACL 2025
VidCapBench: A Comprehensive Benchmark of Video Captioning for Controllable Text-to-Video Generation
ACL 2025
LIME: Less Is More for MLLM Evaluation
ACL 2025
See the World, Discover Knowledge: A Chinese Factuality Evaluation for Large Vision Language Models
ACL 2025
MIO: A Foundation Model on Multimodal Tokens
EMNLP 2025
AIR: Complex Instruction Generation via Automatic Iterative Refinement
EMNLP 2025
OAgents: An Empirical Study of Building Effective Agents
EMNLP 2025
SimpleVQA: Multimodal Factuality Evaluation for Multimodal Large Language Models
ICCV 2025
KOR-Bench: Benchmarking Language Models on Knowledge-Orthogonal Reasoning Tasks
ICLR 2025
MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models
ICLR 2025
MuPT: A Generative Symbolic Music Pretrained Transformer
ICLR 2025
McEval: Massively Multilingual Code Evaluation
ICLR 2025
DREAM: Disentangling Risks to Enhance Safety Alignment in Multimodal Large Language Models
NAACL 2025
2D-DPO: Scaling Direct Preference Optimization with 2-Dimensional Supervision
NAACL 2025
II-Bench: An Image Implication Understanding Benchmark for Multimodal Large Language Models
NIPS 2024
DDK: Distilling Domain Knowledge for Efficient Large Language Models
NIPS 2024
Compressing Large Language Models by Joint Sparsification and Quantization
ICML 2024
OWL: A Large Language Model for IT Operations
ICLR 2024
LogFormer: A Pre-train and Tuning Pipeline for Log Anomaly Detection
AAAI 2024
"Segment, Lift and Fit: Automatic 3D Shape Labeling from 2D Prompts"
ECCV 2024
D-CPT Law: Domain-specific Continual Pre-Training Scaling Law for Large Language Models
NIPS 2024
GraphReader: Building Graph-based Agent to Enhance Long-Context Abilities of Large Language Models
EMNLP 2024
RoleAgent: Building, Interacting, and Benchmarking High-quality Role-Playing Agents from Scripts
NIPS 2024
Towards Real-world Scenario: Imbalanced New Intent Discovery
ACL 2024
UniCoder: Scaling Code Large Language Model via Universal Code
ACL 2024
LTA-PCS: Learnable Task-Agnostic Point Cloud Sampling
CVPR 2024
MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues
ACL 2024
Emulated Disalignment: Safety Alignment for Large Language Models May Backfire!
ACL 2024
E2-LLM: Efficient and Extreme Length Extension of Large Language Models
ACL 2024
ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large Language Models
ACL 2024
RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models
ACL 2024
m3P: Towards Multimodal Multilingual Translation with Multimodal Prompt
COLING 2024
M2C: Towards Automatic Multimodal Manga Complement
EMNLP 2023
ICD-Face: Intra-class Compactness Distillation for Face Recognition
ICCV 2023
GD-MAE: Generative Decoder for MAE Pre-Training on LiDAR Point Clouds
CVPR 2023
Adaptive Contrastive Knowledge Distillation for BERT Compression
ACL 2023
CoupleFace: Relation Matters for Face Recognition Distillation
ECCV 2022
LVP-M3: Language-aware Visual Prompt for Multilingual Multimodal Machine Translation
EMNLP 2022
AnchorFace: Boosting TAR@FAR for Practical Face Recognition
AAAI 2022
Cross-Lingual Cross-Modal Consolidation for Effective Multilingual Video Corpus Moment Retrieval
NAACL 2022
OneFace: One Threshold for All
ECCV 2022
DAM: Discrepancy Alignment Metric for Face Recognition
ICCV 2021
Learning to Auto Weight: Entirely Data-Driven and Highly Efficient Weighting Framework
AAAI 2020
Correlation Congruence for Knowledge Distillation
ICCV 2019
Knowledge Distillation via Route Constrained Optimization
ICCV 2019