Chao Zhang
224 papers · 2009–2026 · 17 top CS/AI conferences
Achievements
Taxonomy Completionist (43)
Keyword Pioneer
Interdisciplinary Bridge
Renaissance Researcher (9)
Hot Topic Early Bird
Keyword Trendsetter Combo (4)
Conference Loyalist (28)
Keyword Champion
Dynamic Duo (23)
Grand Slam
Triple Crown
Mega-Team (34)
Deep Specialist (21)
Keyword Collector (134)
Unstoppable (17)
Conference Pioneer
Century Club (209)
The Questioner (2)
Trend Setter
Prolific Year (31)
Conferences
AAAI (29)
ACL (28)
NIPS (28)
EMNLP (26)
INTERSPEECH (24)
CVPR (17)
ICML (15)
IJCAI (11)
ICLR (11)
ICCV (11)
NAACL (9)
AISTATS (6)
IJCNLP (3)
ECCV (2)
EACL (2)
COLING (1)
UAI (1)
Keywords
large language model (24)
automatic speech recognition (10)
uncertainty quantification (9)
contrastive learning (8)
model compression (8)
named entity recognition (8)
reinforcement learning (7)
speech recognition (7)
question answering (7)
diffusion model (7)
graph neural network (7)
multimodal learning (7)
bayesian inference (6)
convolutional neural network (6)
knowledge distillation (6)
few-shot learning (6)
text classification (6)
transfer learning (5)
zero-shot learning (5)
vision transformer (5)
Papers
WorkForceAgent-R1: Incentivizing Reasoning Capability in LLM-based Web Agents via Reinforcement Learning
EACL 2026
GLIER: Generative Legal Inference and Evidence Ranking for Legal Case Retrieval
ACL 2026
S2G-RAG: Structured Sufficiency and Gap Judging for Iterative Retrieval-Augmented QA
ACL 2026
Beyond Pedagogical Principles: Multi-Horizon Preference Optimization for Efficient Socratic Tutoring
ACL 2026
Revisiting the Reliability of Language Models in Instruction-Following
ACL 2026
RFKG-CoT: Relation-Driven Adaptive Hop-count Selection and Few-Shot Path Guidance for Knowledge-Aware QA
AAAI 2026
Look as You Think: Unifying Reasoning and Visual Evidence Attribution for Verifiable Document RAG via Reinforcement Learning
AAAI 2026
Mamba-Driven Multi-View Discriminative Clustering via Global-Local Cross-View Sequence Modeling
AAAI 2026
Semantic-Augmented Image Clustering via Adaptive Multi-Modal Collaboration
AAAI 2026
Semantic-Aware Feature Enhancement for Partial Label Learning
AAAI 2026
MMAU-Pro: A Challenging and Comprehensive Benchmark for Holistic Evaluation of Audio General Intelligence
AAAI 2026
Online Cross-Modal Hashing with Expanding Label Space
AAAI 2026
BrainHGT: A Hierarchical Graph Transformer for Interpretable Brain Network Analysis
AAAI 2026
Mass Concept Erasure in Diffusion Models with Concept Hierarchy
AAAI 2026
BDLF-Qwen3: Enhanced Cross-Architecture Binary Function Similarity Detection Through Binary Dynamic Layer Fusion
AAAI 2026
Video2Roleplay: A Multimodal Dataset and Framework for Video-Guided Role-playing Agents
EMNLP 2025
"I've Decided to Leak": Probing Internals Behind Prompt Leakage Intents
EMNLP 2025
Minimal, Local, and Robust: Embedding-Only Edits for Implicit Bias in T2I Models
EMNLP 2025
An Engorgio Prompt Makes Large Language Model Babble on
ICLR 2025
Diffusion Models as Constrained Samplers for Optimization with Unknown Constraints
AISTATS 2025
Bayesian WeakS-to-Strong from Text Classification to Generation
ICLR 2025
WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning
EMNLP 2025
Audio-centric Video Understanding Benchmark without Text Shortcut
EMNLP 2025
Think Wider, Detect Sharper: Reinforced Reference Coverage for Document-Level Self-Contradiction Detection
EMNLP 2025
Your Scale Factors are My Weapon: Targeted Bit-Flip Attacks on Vision Transformers via Scale Factor Manipulation
CVPR 2025
Audio Large Language Models Can Be Descriptive Speech Quality Evaluators
ICLR 2025
FG-OrIU: Towards Better Forgetting via Feature-Gradient Orthogonality for Incremental Unlearning
ICCV 2025
Dataset Distillation via Vision-Language Category Prototype
ICCV 2025
Hephaestus: Improving Fundamental Agent Capabilities of Large Language Models through Continual Pre-Training
NAACL 2025
RenderBender: A Survey on Adversarial Attacks Using Differentiable Rendering
IJCAI 2025
Community-Aware Graph Transformer for Brain Disorder Identification
IJCAI 2025
Cowpox: Towards the Immunity of VLM-based Multi-Agent Systems
ICML 2025
Efficiently Access Diffusion Fisher: Within the Outer Product Span Space
ICML 2025
LLM-Augmented Chemical Synthesis and Design Decision Programs
ICML 2025
video-SALMONN-o1: Reasoning-enhanced Audio-visual Large Language Model
ICML 2025
Improving LLM Video Understanding with 16 Frames Per Second
ICML 2025
Ensembles of Low-Rank Expert Adapters
ICLR 2025
A Benchmark for Semantic Sensitive Information in LLMs Outputs
ICLR 2025
Efficient Evolutionary Search Over Chemical Space with Large Language Models
ICLR 2025
Unleashing High-Quality Image Generation in Diffusion Sampling Using Second-Order Levenberg-Marquardt-Langevin
ICCV 2025
Hybrid Layout Control for Diffusion Transformer: Fewer Annotations, Superior Aesthetics
ICCV 2025
DORM: Preference Data Weights Optimization for Reward Modeling in LLM Alignment
EMNLP 2025
DecoupledESC: Enhancing Emotional Support Generation via Strategy-Response Decoupled Preference Optimization
EMNLP 2025
DF$^2$: Distribution-Free Decision-Focused Learning
UAI 2025
Adapting LLM Agents with Universal Communication Feedback
NAACL 2025
Self-Generated Critiques Boost Reward Modeling for Language Models
NAACL 2025
TextToucher: Fine-Grained Text-to-Touch Generation
AAAI 2025
MMGDreamer: Mixed-Modality Graph for Geometry-Controllable 3D Indoor Scene Generation
AAAI 2025
Fast Incomplete Multi-view Clustering with Adaptive Similarity Completion and Reconstruction
AAAI 2025
Incomplete Multi-view Clustering via Diffusion Contrastive Generation
AAAI 2025
DNCASR: End-to-End Training for Speaker-Attributed ASR
ACL 2025
AutoMixAlign: Adaptive Data Mixing for Multi-Task Preference Optimization in LLMs
ACL 2025
QualiSpeech: A Speech Quality Assessment Dataset with Natural Language Reasoning and Descriptions
ACL 2025
Streamlining the Collaborative Chain of Models into A Single Forward Pass in Generation-Based Tasks
ACL 2025
Review-Instruct: A Review-Driven Multi-Turn Conversations Generation Method for Large Language Models
ACL 2025
DecompileBench: A Comprehensive Benchmark for Evaluating Decompilers in Real-World Scenarios
ACL 2025
MASTER: Multi-Agent Security Through Exploration of Roles and Topological Structures - A Comprehensive Framework
EMNLP 2025
Whisper-PMFA: Partial Multi-Scale Feature Aggregation for Speaker Verification using Whisper Models
INTERSPEECH 2024
D-LLM: A Token Adaptive Computing Resource Allocation Strategy for Large Language Models
NIPS 2024
Self-Taught Recognizer: Toward Unsupervised Adaptation for Speech Foundation Models
NIPS 2024
Aligning Large Language Models with Representation Editing: A Control Perspective
NIPS 2024
BELM: Bidirectional Explicit Linear Multi-step Sampler for Exact Inversion in Diffusion Models
NIPS 2024
Solving Zero-Sum Markov Games with Continuous State via Spectral Dynamic Embedding
NIPS 2024
Time-MMD: Multi-Domain Multimodal Dataset for Time Series Analysis
NIPS 2024
HYDRA: Model Factorization Framework for Black-Box LLM Personalization
NIPS 2024
RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs
NIPS 2024
An Improved Empirical Fisher Approximation for Natural Gradient Descent
NIPS 2024
Mask-Homo: Pseudo Plane Mask-Guided Unsupervised Multi-Homography Estimation
AAAI 2024
Towards Modeling Uncertainties of Self-Explaining Neural Networks via Conformal Prediction
AAAI 2024
GAD-PVI: A General Accelerated Dynamic-Weight Particle-Based Variational Inference Framework
AAAI 2024
Learning Cluster-Wise Anchors for Multi-View Clustering
AAAI 2024
Handling Ambiguity in Emotion: From Out-of-Domain Detection to Distribution Estimation
ACL 2024
Virtual Compiler Is All You Need For Assembly Code Search
ACL 2024
ARL2: Aligning Retrievers with Black-box Large Language Models via Self-guided Adaptive Relevance Labeling
ACL 2024
M3AV: A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset
ACL 2024
Explanation-aware Soft Ensemble Empowers Large Language Model In-context Learning
ACL 2024
Modelling Variability in Human Annotator Simulation
ACL 2024
Speech-based Slot Filling using Large Language Models
ACL 2024
PLaD: Preference-based Large Language Model Distillation with Pseudo-Preference Pairs
ACL 2024
ProgGen: Generating Named Entity Recognition Datasets Step-by-step with Self-Reflexive Large Language Models
ACL 2024
Two Birds with One Stone: Enhancing Uncertainty Quantification and Interpretability with Graph Functional Neural Process
AISTATS 2024
Semantic Map-based Generation of Navigation Instructions
COLING 2024
APISR: Anime Production Inspired Real-World Anime Super-Resolution
CVPR 2024
DiaLoc: An Iterative Approach to Embodied Dialog Localization
CVPR 2024
HiGen: Hierarchy-Aware Sequence Generation for Hierarchical Text Classification
EACL 2024
Retrieve-Plan-Generation: An Iterative Planning and Answering Framework for Knowledge-Intensive LLM Generation
EMNLP 2024
EAGLE-2: Faster Inference of Language Models with Dynamic Draft Trees
EMNLP 2024
Bayesian Example Selection Improves In-Context Learning for Speech, Text and Visual Modalities
EMNLP 2024
BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers
EMNLP 2024
Data Diversity Matters for Robust Instruction Tuning
EMNLP 2024
A Simple but Effective Approach to Improve Structured Language Model Output for Information Extraction
EMNLP 2024
ToolChain*: Efficient Action Space Navigation in Large Language Models with A* Search
ICLR 2024
SALMONN: Towards Generic Hearing Abilities for Large Language Models
ICLR 2024
Large Language Models are Efficient Learners of Noise-Robust Speech Recognition
ICLR 2024
RAIN: Your Language Models Can Align Themselves without Finetuning
ICLR 2024
GeminiFusion: Efficient Pixel-wise Multimodal Fusion for Vision Transformer
ICML 2024
EAGLE: Speculative Sampling Requires Rethinking Feature Uncertainty
ICML 2024
Time-Series Forecasting for Out-of-Distribution Generalization Using Invariant Learning
ICML 2024
video-SALMONN: Speech-Enhanced Audio-Visual Large Language Models
ICML 2024
BBox-Adapter: Lightweight Adapting for Black-Box Large Language Models
ICML 2024
Continual Multi-View Clustering with Consistent Anchor Guidance
IJCAI 2024
SOT Triggered Neural Clustering for Speaker Attributed ASR
INTERSPEECH 2024
SAML: Speaker Adaptive Mixture of LoRA Experts for End-to-End ASR
INTERSPEECH 2024
Spontaneous Speech-Based Suicide Risk Detection Using Whisper and Large Language Models
INTERSPEECH 2024
Confidence Estimation for Automatic Detection of Depression and Alzheimer's Disease Based on Clinical Interviews
INTERSPEECH 2024
Can Large Language Models Understand Spatial Audio?
INTERSPEECH 2024
Assessing Logical Puzzle Solving in Large Language Models: Insights from a Minesweeper Case Study
NAACL 2024
POLYIE: A Dataset of Information Extraction from Polymer Material Scientific Literature
NAACL 2024
Boosting Low-Data Instance Segmentation by Unsupervised Pre-Training With Saliency Prompt
CVPR 2023
A Neural Time Alignment Module for End-to-End Automatic Speech Recognition
INTERSPEECH 2023
TrajectoryFormer: 3D Object Tracking Transformer with Predictive Trajectory Hypotheses
ICCV 2023
One-bit Flip is All You Need: When Bit-flip Attack Meets Model Training
ICCV 2023
Graph Reasoning for Question Answering with Triplet Retrieval
ACL 2023
Context-Aware Query Rewriting for Improving Users' Search Experience on E-commerce Websites
ACL 2023
Estimating the Uncertainty in Emotion Attributes using Deep Evidential Regression
ACL 2023
Cold-Start Data Selection for Better Few-shot Language Model Fine-tuning: A Prompt-based Uncertainty Propagation Approach
ACL 2023
Robust Graph Dictionary Learning
ICLR 2023
Robust Multi-Agent Reinforcement Learning via Adversarial Regularization: Theoretical Foundation and Stable Algorithms
NIPS 2023
AdaPlanner: Adaptive Planning from Feedback with Language Models
NIPS 2023
Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias
NIPS 2023
ToolQA: A Dataset for LLM Question Answering with External Tools
NIPS 2023
Can Contextual Biasing Remain Effective with Whisper and GPT-2?
INTERSPEECH 2023
Enhanced Tensor Low-Rank and Sparse Representation Recovery for Incomplete Multi-View Clustering
AAAI 2023
Neighborhood-Regularized Self-Training for Learning with Few Labels
AAAI 2023
Towards Optimal Randomized Strategies in Adversarial Example Game
AAAI 2023
Pushing the Limits of Unsupervised Unit Discovery for SSL Speech Representation
INTERSPEECH 2023
Obstructive Sleep Apnea Detection using Pre-trained Speech Representations
INTERSPEECH 2023
Model-Aware Contrastive Learning: Towards Escaping the Dilemmas
ICML 2023
Autoregressive Diffusion Model for Graph Generation
ICML 2023
SMURF-THP: Score Matching-based UnceRtainty quantiFication for Transformer Hawkes Process
ICML 2023
Rank-DETR for High Quality Object Detection
NIPS 2023
CDMA: A Practical Cross-Device Federated Learning Algorithm for General Minimax Problems
AAAI 2023
Knowledge-Selective Pretraining for Attribute Value Extraction
EMNLP 2023
May the Force be with You: Unified Force-Centric Pre-Training for 3D Molecular Conformations
NIPS 2023
Improving Consistency for Text Summarization with Energy Functions
EMNLP 2023
DETRs With Hybrid Matching
CVPR 2023
ReGen: Zero-Shot Text Classification via Training Data Generation with Progressive Dense Retrieval
ACL 2023
Extracting Shopping Interest-Related Product Types from the Web
ACL 2023
Integrating Emotion Recognition with Speech Recognition and Speaker Diarisation for Conversations
INTERSPEECH 2023
Turn-Taking Prediction for Natural Conversational Speech
INTERSPEECH 2022
DPVI: A Dynamic-Weight Particle-Based Variational Inference Framework
IJCAI 2022
Expediting Large-Scale Vision Transformer for Dense Prediction without Fine-tuning
NIPS 2022
RoChBert: Towards Robust BERT Fine-tuning for Chinese
EMNLP 2022
PLATO-Ad: A Unified Advertisement Text Generation Framework with Multi-Task Prompt Learning
EMNLP 2022
End-to-end Stochastic Optimization with Energy-based Model
NIPS 2022
COCO-DR: Combating the Distribution Shift in Zero-Shot Dense Retrieval with Contrastive and Distributionally Robust Learning
EMNLP 2022
ReSel: N-ary Relation Extraction from Scientific Text and Tables by Learning to Retrieve and Select
EMNLP 2022
FlowFormer: A Transformer Architecture for Optical Flow
ECCV 2022
CERES: Pretraining of Graph-Conditioned Transformer for Semi-Structured Session Data
NAACL 2022
Learning Efficient Vision Transformers via Fine-Grained Manifold Distillation
NIPS 2022
Tree-constrained Pointer Generator with Graph Neural Network Encodings for Contextual Speech Recognition
INTERSPEECH 2022
UnfoldML: Cost-Aware and Uncertainty-Based Dynamic 2D Prediction for Multi-Stage Classification
NIPS 2022
Streaming End-to-End Multilingual Speech Recognition with Joint Language Identification
INTERSPEECH 2022
FORCE: A Framework of Rule-Based Conversational Recommender System
AAAI 2022
Learning a Structured Latent Space for Unsupervised Point Cloud Completion
CVPR 2022
Recurring the Transformer for Video Action Recognition
CVPR 2022
Abandoning the Bayer-Filter To See in the Dark
CVPR 2022
From One to All: Learning to Match Heterogeneous and Partially Overlapped Graphs
AAAI 2022
Self-Training with Differentiable Teacher
NAACL 2022
PRBoost: Prompt-Based Rule Discovery and Boosting for Interactive Weakly-Supervised Learning
ACL 2022
AcTune: Uncertainty-Based Active Self-Training for Active Fine-Tuning of Pretrained Language Models
NAACL 2022
Tandem Multitask Training of Speaker Diarisation and Speech Recognition for Meeting Transcription
INTERSPEECH 2022
Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self-Training Approach
NAACL 2021
BERTifying the Hidden Markov Model for Multi-Source Weakly Supervised Named Entity Recognition
ACL 2021
A Hybrid Stochastic Gradient Hamiltonian Monte Carlo Method
AAAI 2021
SHPOS: A Theoretical Guaranteed Accelerated Particle Optimization Sampling Method
IJCAI 2021
When in Doubt: Neural Non-Parametric Uncertainty Quantification for Epidemic Forecasting
NIPS 2021
BERTifying the Hidden Markov Model for Multi-Source Weakly Supervised Named Entity Recognition
IJCNLP 2021
Variable Frame Rate Acoustic Models Using Minimum Error Reinforcement Learning
INTERSPEECH 2021
HRFormer: High-Resolution Vision Transformer for Dense Prediction
NIPS 2021
Transfer Learning of Graph Neural Networks with Ego-graph Information Maximization
NIPS 2021
Positive-Unlabeled Data Purification in the Wild for Object Detection
CVPR 2021
Semantic Scene Completion via Integrating Instances and Scene In-the-Loop
CVPR 2021
Learning from Language Description: Low-shot Named Entity Recognition via Decomposed Framework
EMNLP 2021
Efficient Projection-Free Online Methods with Stochastic Recursive Gradient
AAAI 2020
Efficient WaveGlow: An Improved WaveGlow Vocoder with Enhanced Speed
INTERSPEECH 2020
Denoising Multi-Source Weak Supervision for Neural Text Classification
EMNLP 2020
Hit-Detector: Hierarchical Trinity Architecture Search for Object Detection
CVPR 2020
Density-Aware Feature Embedding for Face Clustering
CVPR 2020
Self-Adaptive Training: beyond Empirical Risk Minimization
NIPS 2020
The JD AI Speaker Verification System for the FFSVC 2020 Challenge
INTERSPEECH 2020
Improving Replay Detection System with Channel Consistency DenseNeXt for the ASVspoof 2019 Challenge
INTERSPEECH 2020
Sound Event Localization and Detection Based on Multiple DOA Beamforming and Multi-Task Learning
INTERSPEECH 2020
Text Classification Using Label Names Only: A Language Model Self-Training Approach
EMNLP 2020
SeqMix: Augmenting Active Sequence Labeling via Sequence Mixup
EMNLP 2020
Accelerating Stratified Sampling SGD by Reconstructing Strata
IJCAI 2020
Argot: Generating Adversarial Readable Chinese Texts
IJCAI 2020
Aggregated Gradient Langevin Dynamics
AAAI 2020
SDE-Net: Equipping Deep Neural Networks with Uncertainty Estimates
ICML 2020
Accelerating Primal Solution Findings for Mixed Integer Programs Based on Solution Prediction
AAAI 2020
Calibrated Language Model Fine-Tuning for In- and Out-of-Distribution Data
EMNLP 2020
Decentralized Gradient Tracking for Continuous DR-Submodular Maximization
AISTATS 2019
Multi-Span Acoustic Modelling Using Raw Waveform Signals
INTERSPEECH 2019
Direct-Path Signal Cross-Correlation Estimation for Sound Source Localization in Reverberation
INTERSPEECH 2019
A Gradual, Semi-Discrete Approach to Generative Network Training via Explicit Wasserstein Minimization
ICML 2019
Beyond Human Parts: Dual Part-Aligned Representations for Person Re-Identification
ICCV 2019
Orientation-Aware Semantic Segmentation on Icosahedron Spheres
ICCV 2019
Spherical Text Embedding
NIPS 2019
C3AE: Exploring the Limits of Compact Model for Age Estimation
CVPR 2019
Weakly-Supervised Hierarchical Text Classification
AAAI 2019
Deep Joint-Semantics Reconstructing Hashing for Large-Scale Unsupervised Cross-Modal Retrieval
ICCV 2019
Speaker Adaptation and Adaptive Training for Jointly Optimised Tandem Systems
INTERSPEECH 2018
Learning Environmental Calibration Actions for Policy Self-Evolution
IJCAI 2018
Factorizable Net: An Efficient Subgraph-based Framework for Scene Graph Generation
ECCV 2018
Greedy Hash: Towards Fast Optimization for Accurate Hash Coding in CNN
NIPS 2018
Semi-tied Units for Efficient Gating in LSTM and Highway Networks
INTERSPEECH 2018
Sparse DNNs with Improved Adversarial Robustness
NIPS 2018
JUMP: a Jointly Predictor for User Click and Dwell Time
IJCAI 2018
Joint Sub-bands Learning with Clique Structures for Wavelet Domain Super-Resolution
NIPS 2018
Towards Memory-Friendly Deterministic Incremental Gradient Method
AISTATS 2018
Tensor Completion with Side Information: A Riemannian Manifold Approach
IJCAI 2017
Hard-Aware Deeply Cascaded Embedding
ICCV 2017
Detailed, Accurate, Human Shape Estimation From Clothed 3D Scan Sequences
CVPR 2017
Accelerated Doubly Stochastic Gradient Algorithm for Large-scale Empirical Risk Minimization
IJCAI 2017
Functional Faces: Groupwise Dense Correspondence Using Functional Maps
CVPR 2016
Shell PCA: Statistical Shape Modelling in Shell Space
ICCV 2015
Discrete Hyper-Graph Matching
CVPR 2015
A Study on Cross-Population Age Estimation
CVPR 2014
Bootstrapping Large-scale Named Entities using URL-Text Hybrid Patterns
IJCNLP 2013
Generalization Bounds for Domain Adaptation
NIPS 2012
Generalization Bound for Infinitely Divisible Empirical Process
AISTATS 2011
Risk Bounds for Levy Processes in the PAC-Learning Framework
AISTATS 2010
Query Segmentation Based on Eigenspace Similarity
IJCNLP 2009
Query Segmentation Based on Eigenspace Similarity
ACL 2009