Bo Zheng
94 papers · 2013–2026 · 16 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+16 more ↓ Show less ↑
π§ Keyword Pioneer πΊοΈ Taxonomy Completionist (29) π Interdisciplinary Bridge π Renaissance Researcher (8) π Conference Polyglot (15)
π
Renaissance Researcher
(8)
π
Interdisciplinary Bridge
π
Academic Marathon
(12)
π
Keyword Trendsetter Combo
(3)
π
Conference Loyalist
(21)
π€
Dynamic Duo
(17)
π
Grand Slam
π¬
Deep Specialist
(16)
π§¬
Topic Evolution
π
Trend Setter
π₯
Unstoppable
(6)
π
Conference Pioneer
β‘
Prolific Year
(8)
π
Century Club
(77)
β
The Questioner
(2)
ποΈ
Keyword Collector
(80)
Conferences
ACL (33)
EMNLP (13)
NIPS (9)
AAAI (7)
CVPR (7)
ICLR (6)
ECCV (4)
ICCV (3)
CONLL (2)
IJCAI (2)
IJCNLP (2)
NAACL (2)
COLING (1)
EACL (1)
ICML (1)
RSS (1)
Top co-authors
Keywords
large language model
(25)
reinforcement learning
(9)
benchmark evaluation
(8)
multimodal learning
(6)
question answering
(5)
multimodal large language model
(5)
image generation
(5)
cross-lingual language model
(4)
cross-lingual transfer
(4)
3d reconstruction
(3)
data augmentation
(3)
direct preference optimization
(3)
vision-language model
(3)
factuality evaluation
(3)
visual question answering
(3)
information retrieval
(3)
consistency regularization
(3)
chinese language
(3)
in-context learning
(2)
knowledge distillation
(2)
Papers
MIRAGE: Towards AI-Generated Image Detection in the Wild
AAAI 2026
Navigating the Infinite Dynamic Web Space: Effective In-Context Exploration via Cognitive Multi-Agent Collaboration
EACL 2026
CoMeT: Collaborative Memory Transformer for Efficient Long Context Modeling
ACL 2026
ShopSimulator: Evaluating and Exploring RL-Driven LLM Agent for Shopping Assistants
ACL 2026
Mobile-R1: Towards Interactive Capability for VLM-Based Mobile Agent via Systematic Training
ACL 2026
Towards Interpretable Tabular Reasoning: Enhancing LLM Reasoning on Tabular Data with Pre-Constructed Logic Graph
ACL 2026
Read As Human: Compressing Context via Parallelizable Close Reading and Skimming
ACL 2026
Enabling Agents to Communicate Entirely in Latent Space
ACL 2026
USB: A COMPREHENSIVE AND UNIFIED SAFETY EVALUATION BENCHMARK FOR MULTIMODAL LARGE LANGUAGE MODELS
ACL 2026
Learning from the Irrecoverable: Error-Localized Policy Optimization for Tool-Integrated LLM Reasoning
ACL 2026
Unified Thinker: A General Reasoning Core for Image Generation
ACL 2026
SELECting over Tokens: Curating Pre-training Data at Scale via Token Classification
ACL 2026
All Languages Matter: Understanding and Mitigating Language Bias in Multilingual RAG
ACL 2026
GRAPHIA: Harnessing Social Graph Data to Enhance LLM-Based Social Simulation
ACL 2026
DeepPhy: Benchmarking Agentic VLMs on Physical Reasoning
AAAI 2026
Contribution-aware Token Compression for Efficient Video Understanding via Reinforcement Learning
AAAI 2026
Trimming the Fat: Redundancy-Aware Acceleration Framework for DGNNs
AAAI 2026
Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models
ACL 2025
RHanDS: Refining Malformed Hands for Generated Images with Decoupled Structure and Style Guidance
AAAI 2025
LongDocURL: a Comprehensive Multimodal Long Document Benchmark Integrating Understanding, Reasoning, and Locating
ACL 2025
HiddenDetect: Detecting Jailbreak Attacks against Multimodal Large Language Models via Monitoring Hidden States
ACL 2025
Chinese SafetyQA: A Safety Short-form Factuality Benchmark for Large Language Models
ACL 2025
M2RC-EVAL: Massively Multilingual Repository-level Code Completion Evaluation
ACL 2025
Do not Abstain! Identify and Solve the Uncertainty
ACL 2025
Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning?
ACL 2025
Chinese SimpleQA: A Chinese Factuality Evaluation for Large Language Models
ACL 2025
ProgCo: Program Helps Self-Correction of Large Language Models
ACL 2025
AIGuard: A Benchmark and Lightweight Detection for E-commerce AIGC Risks
ACL 2025
PGPO: Enhancing Agent Reasoning via Pseudocode-style Planning Guided Preference Optimization
ACL 2025
See the World, Discover Knowledge: A Chinese Factuality Evaluation for Large Vision Language Models
ACL 2025
MPPO: Multi Pair-wise Preference Optimization for LLMs with Arbitrary Negative Samples
COLING 2025
PosterMaker: Towards High-Quality Product Poster Generation with Accurate Text Rendering
CVPR 2025
RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness
CVPR 2025
VC4VG: Optimizing Video Captions for Text-to-Video Generation
EMNLP 2025
LeTS: Learning to Think-and-Search via Process-and-Outcome Reward Hybridization
EMNLP 2025
How to inject knowledge efficiently? Knowledge Infusion Scaling Law for Pre-training Large Language Models
EMNLP 2025
SMEC:Rethinking Matryoshka Representation Learning for Retrieval Embedding Compression
EMNLP 2025
AIR: Complex Instruction Generation via Automatic Iterative Refinement
EMNLP 2025
GSID: Generative Semantic Indexing for E-Commerce Product Understanding
EMNLP 2025
Token Preference Optimization with Self-Calibrated Visual-Anchored Rewards for Hallucination Mitigation
EMNLP 2025
CombatVLA: An Efficient Vision-Language-Action Model for Combat Tasks in 3D Action Role-Playing Games
ICCV 2025
INTER: Mitigating Hallucination in Large Vision-Language Models by Interaction Guidance Sampling
ICCV 2025
MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models
ICLR 2025
Small Models are LLM Knowledge Triggers for Medical Tabular Prediction
ICLR 2025
OmniKV: Dynamic Context Selection for Efficient Long-Context LLMs
ICLR 2025
Enhancing Document Understanding with Group Position Embedding: A Novel Approach to Incorporate Layout Information
ICLR 2025
Minimal Impact ControlNet: Advancing Multi-ControlNet Integration
ICLR 2025
Differentiable Solver Search for Fast Diffusion Sampling
ICML 2025
D3: Diversity, Difficulty, and Dependability-Aware Data Selection for Sample-Efficient LLM Instruction Tuning
IJCAI 2025
2D-DPO: Scaling Direct Preference Optimization with 2-Dimensional Supervision
NAACL 2025
D-CPT Law: Domain-specific Continual Pre-Training Scaling Law for Large Language Models
NIPS 2024
Exploring DCN-like architecture for fast image generation with arbitrary resolution
NIPS 2024
DDK: Distilling Domain Knowledge for Efficient Large Language Models
NIPS 2024
AuctionNet: A Novel Benchmark for Decision-Making in Large-Scale Games
NIPS 2024
Dr.Hair: Reconstructing Scalp-Connected Hair Strands without Pre-Training via Differentiable Rendering of Line Segments
CVPR 2024
MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues
ACL 2024
SEGMENT+: Long Text Processing with Short-Context Language Models
EMNLP 2024
GraphReader: Building Graph-based Agent to Enhance Long-Context Abilities of Large Language Models
EMNLP 2024
GeoGPT4V: Towards Geometric Multi-modal Large Language Models with Geometric Image Generation
EMNLP 2024
Accelerating Image Generation with Sub-path Linear Approximation Model
ECCV 2024
Turbo: Informativity-Driven Acceleration Plug-In for Vision-Language Large Models
ECCV 2024
Percentile Risk-Constrained Budget Pacing for Guaranteed Display Advertising in Online Optimization
AAAI 2024
EMAGE: Towards Unified Holistic Co-Speech Gesture Generation via Expressive Masked Audio Gesture Modeling
CVPR 2024
E2-LLM: Efficient and Extreme Length Extension of Large Language Models
ACL 2024
ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large Language Models
ACL 2024
Making Pre-trained Language Models Great on Tabular Prediction
ICLR 2024
Synchronized Video Storytelling: Generating Video Narrations with Structured Storyline
ACL 2024
Demystify Mamba in Vision: A Linear Attention Perspective
NIPS 2024
Combating Bilateral Edge Noise for Robust Link Prediction
NIPS 2023
Co-optimization of Morphology and Behavior of Modular Robots via Hierarchical Deep Reinforcement Learning
RSS 2023
HIT-SCIR at MMNLU-22: Consistency Regularization for Multilingual Spoken Language Understanding
EMNLP 2022
StableMoE: Stable Routing Strategy for Mixture of Experts
ACL 2022
XLM-E: Cross-lingual Language Model Pre-training via ELECTRA
ACL 2022
Sustainable Online Reinforcement Learning for Auto-bidding
NIPS 2022
GBA: A Tuning-free Approach to Switch between Synchronous and Asynchronous Training for Recommendation Models
NIPS 2022
CREATER: CTR-driven Advertising Text Generation with Controlled Pre-Training and Contrastive Fine-Tuning
NAACL 2022
APG: Adaptive Parameter Generation Network for Click-Through Rate Prediction
NIPS 2022
BEAT: A Large-Scale Semantic and Emotional Multi-modal Dataset for Conversational Gestures Synthesis
ECCV 2022
Improving Pretrained Cross-Lingual Language Models via Self-Labeled Word Alignment
ACL 2021
Consistency Regularization for Cross-Lingual Fine-Tuning
IJCNLP 2021
Consistency Regularization for Cross-Lingual Fine-Tuning
ACL 2021
Allocating Large Vocabulary Capacity for Cross-Lingual Language Model Pre-Training
EMNLP 2021
4DComplete: Non-Rigid Motion Estimation Beyond the Observable Surface
ICCV 2021
Improving Pretrained Cross-Lingual Language Models via Self-Labeled Word Alignment
IJCNLP 2021
Document Modeling with Graph Attention Networks for Multi-grained Machine Reading Comprehension
ACL 2020
Efficient Spatio-Temporal Recurrent Neural Network for Video Deblurring
ECCV 2020
RPM-Oriented Query Rewriting Framework for E-commerce Keyword-Based Sponsored Search (Student Abstract)
AAAI 2020
Stereoscopic Flash and No-Flash Photography for Shape and Albedo Recovery
CVPR 2020
An AMR Aligner Tuned by Transition-based Parser
EMNLP 2018
Towards Better UD Parsing: Deep Contextualized Word Embeddings, Ensemble, and Treebank Concatenation
CONLL 2018
Efficient Mechanism Design for Online Scheduling (Extended Abstract)
IJCAI 2017
The HIT-SCIR System for End-to-End Parsing of Universal Dependencies
CONLL 2017
Robust 3D Features for Matching between Distorted Range Scans Captured by Moving Systems
CVPR 2014
Beyond Point Clouds: Scene Understanding by Reasoning Geometry and Physics
CVPR 2013