conftrace_

Chao Zhang

224 papers · 2009–2026 · 17 conferences · across top CS/AI conferences

Achievements

🗺️ Taxonomy Completionist (43) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (9) 🐣 Hot Topic Early Bird 🌟 Keyword Trendsetter Combo (4) 🏠 Conference Loyalist (28) 🏆 Keyword Champion 🤝 Dynamic Duo (23) 🏆 Grand Slam 👑 Triple Crown 👥 Mega-Team (34) 🔬 Deep Specialist (21) 🗃️ Keyword Collector (134) 🔥 Unstoppable (17) 🚀 Conference Pioneer 💎 Century Club (209) ❓ The Questioner (2) 📈 Trend Setter ⚡ Prolific Year (31)

Conferences

AAAI (29) ACL (28) NIPS (28) EMNLP (26) INTERSPEECH (24) CVPR (17) ICML (15) IJCAI (11) ICLR (11) ICCV (11) NAACL (9) AISTATS (6) IJCNLP (3) ECCV (2) EACL (2) COLING (1) UAI (1)

Papers

WorkForceAgent-R1: Incentivizing Reasoning Capability in LLM-based Web Agents via Reinforcement Learning · EACL 2026
GLIER: Generative Legal Inference and Evidence Ranking for Legal Case Retrieval · ACL 2026
S2G-RAG: Structured Sufficiency and Gap Judging for Iterative Retrieval-Augmented QA · ACL 2026
Beyond Pedagogical Principles: Multi-Horizon Preference Optimization for Efficient Socratic Tutoring · ACL 2026
Revisiting the Reliability of Language Models in Instruction-Following · ACL 2026
RFKG-CoT: Relation-Driven Adaptive Hop-count Selection and Few-Shot Path Guidance for Knowledge-Aware QA · AAAI 2026
Look as You Think: Unifying Reasoning and Visual Evidence Attribution for Verifiable Document RAG via Reinforcement Learning · AAAI 2026
Mamba-Driven Multi-View Discriminative Clustering via Global-Local Cross-View Sequence Modeling · AAAI 2026
Semantic-Augmented Image Clustering via Adaptive Multi-Modal Collaboration · AAAI 2026
Semantic-Aware Feature Enhancement for Partial Label Learning · AAAI 2026
MMAU-Pro: A Challenging and Comprehensive Benchmark for Holistic Evaluation of Audio General Intelligence · AAAI 2026
Online Cross-Modal Hashing with Expanding Label Space · AAAI 2026
BrainHGT: A Hierarchical Graph Transformer for Interpretable Brain Network Analysis · AAAI 2026
Mass Concept Erasure in Diffusion Models with Concept Hierarchy · AAAI 2026
BDLF-Qwen3: Enhanced Cross-Architecture Binary Function Similarity Detection Through Binary Dynamic Layer Fusion · AAAI 2026
Video2Roleplay: A Multimodal Dataset and Framework for Video-Guided Role-playing Agents · EMNLP 2025
“I’ve Decided to Leak”: Probing Internals Behind Prompt Leakage Intents · EMNLP 2025
Minimal, Local, and Robust: Embedding-Only Edits for Implicit Bias in T2I Models · EMNLP 2025
An Engorgio Prompt Makes Large Language Model Babble on · ICLR 2025
Diffusion Models as Constrained Samplers for Optimization with Unknown Constraints · AISTATS 2025
Bayesian WeakS-to-Strong from Text Classification to Generation · ICLR 2025
WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning · EMNLP 2025
Audio-centric Video Understanding Benchmark without Text Shortcut · EMNLP 2025
Think Wider, Detect Sharper: Reinforced Reference Coverage for Document-Level Self-Contradiction Detection · EMNLP 2025
Your Scale Factors are My Weapon: Targeted Bit-Flip Attacks on Vision Transformers via Scale Factor Manipulation · CVPR 2025
Audio Large Language Models Can Be Descriptive Speech Quality Evaluators · ICLR 2025
FG-OrIU: Towards Better Forgetting via Feature-Gradient Orthogonality for Incremental Unlearning · ICCV 2025
Dataset Distillation via Vision-Language Category Prototype · ICCV 2025
Hephaestus: Improving Fundamental Agent Capabilities of Large Language Models through Continual Pre-Training · NAACL 2025
RenderBender: A Survey on Adversarial Attacks Using Differentiable Rendering · IJCAI 2025
Community-Aware Graph Transformer for Brain Disorder Identification · IJCAI 2025
Cowpox: Towards the Immunity of VLM-based Multi-Agent Systems · ICML 2025
Efficiently Access Diffusion Fisher: Within the Outer Product Span Space · ICML 2025
LLM-Augmented Chemical Synthesis and Design Decision Programs · ICML 2025
video-SALMONN-o1: Reasoning-enhanced Audio-visual Large Language Model · ICML 2025
Improving LLM Video Understanding with 16 Frames Per Second · ICML 2025
Ensembles of Low-Rank Expert Adapters · ICLR 2025
A Benchmark for Semantic Sensitive Information in LLMs Outputs · ICLR 2025
Efficient Evolutionary Search Over Chemical Space with Large Language Models · ICLR 2025
Unleashing High-Quality Image Generation in Diffusion Sampling Using Second-Order Levenberg-Marquardt-Langevin · ICCV 2025
Hybrid Layout Control for Diffusion Transformer: Fewer Annotations, Superior Aesthetics · ICCV 2025
DORM: Preference Data Weights Optimization for Reward Modeling in LLM Alignment · EMNLP 2025
DecoupledESC: Enhancing Emotional Support Generation via Strategy-Response Decoupled Preference Optimization · EMNLP 2025
DF$^2$: Distribution-Free Decision-Focused Learning · UAI 2025
Adapting LLM Agents with Universal Communication Feedback · NAACL 2025
Self-Generated Critiques Boost Reward Modeling for Language Models · NAACL 2025
TextToucher: Fine-Grained Text-to-Touch Generation · AAAI 2025
MMGDreamer: Mixed-Modality Graph for Geometry-Controllable 3D Indoor Scene Generation · AAAI 2025
Fast Incomplete Multi-view Clustering with Adaptive Similarity Completion and Reconstruction · AAAI 2025
Incomplete Multi-view Clustering via Diffusion Contrastive Generation · AAAI 2025
DNCASR: End-to-End Training for Speaker-Attributed ASR · ACL 2025
AutoMixAlign: Adaptive Data Mixing for Multi-Task Preference Optimization in LLMs · ACL 2025
QualiSpeech: A Speech Quality Assessment Dataset with Natural Language Reasoning and Descriptions · ACL 2025
Streamlining the Collaborative Chain of Models into A Single Forward Pass in Generation-Based Tasks · ACL 2025
Review-Instruct: A Review-Driven Multi-Turn Conversations Generation Method for Large Language Models · ACL 2025
DecompileBench: A Comprehensive Benchmark for Evaluating Decompilers in Real-World Scenarios · ACL 2025
MASTER: Multi-Agent Security Through Exploration of Roles and Topological Structures - A Comprehensive Framework · EMNLP 2025
Whisper-PMFA: Partial Multi-Scale Feature Aggregation for Speaker Verification using Whisper Models · INTERSPEECH 2024
D-LLM: A Token Adaptive Computing Resource Allocation Strategy for Large Language Models · NIPS 2024
Self-Taught Recognizer: Toward Unsupervised Adaptation for Speech Foundation Models · NIPS 2024
Aligning Large Language Models with Representation Editing: A Control Perspective · NIPS 2024
BELM: Bidirectional Explicit Linear Multi-step Sampler for Exact Inversion in Diffusion Models · NIPS 2024
Solving Zero-Sum Markov Games with Continuous State via Spectral Dynamic Embedding · NIPS 2024
Time-MMD: Multi-Domain Multimodal Dataset for Time Series Analysis · NIPS 2024
HYDRA: Model Factorization Framework for Black-Box LLM Personalization · NIPS 2024
RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs · NIPS 2024
An Improved Empirical Fisher Approximation for Natural Gradient Descent · NIPS 2024
Mask-Homo: Pseudo Plane Mask-Guided Unsupervised Multi-Homography Estimation · AAAI 2024
Towards Modeling Uncertainties of Self-Explaining Neural Networks via Conformal Prediction · AAAI 2024
GAD-PVI: A General Accelerated Dynamic-Weight Particle-Based Variational Inference Framework · AAAI 2024
Learning Cluster-Wise Anchors for Multi-View Clustering · AAAI 2024
Handling Ambiguity in Emotion: From Out-of-Domain Detection to Distribution Estimation · ACL 2024
Virtual Compiler Is All You Need For Assembly Code Search · ACL 2024
ARL2: Aligning Retrievers with Black-box Large Language Models via Self-guided Adaptive Relevance Labeling · ACL 2024
M3AV: A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset · ACL 2024
Explanation-aware Soft Ensemble Empowers Large Language Model In-context Learning · ACL 2024
Modelling Variability in Human Annotator Simulation · ACL 2024
Speech-based Slot Filling using Large Language Models · ACL 2024
PLaD: Preference-based Large Language Model Distillation with Pseudo-Preference Pairs · ACL 2024
ProgGen: Generating Named Entity Recognition Datasets Step-by-step with Self-Reflexive Large Language Models · ACL 2024
Two Birds with One Stone: Enhancing Uncertainty Quantification and Interpretability with Graph Functional Neural Process · AISTATS 2024
Semantic Map-based Generation of Navigation Instructions · COLING 2024
APISR: Anime Production Inspired Real-World Anime Super-Resolution · CVPR 2024
DiaLoc: An Iterative Approach to Embodied Dialog Localization · CVPR 2024
HiGen: Hierarchy-Aware Sequence Generation for Hierarchical Text Classification · EACL 2024
Retrieve-Plan-Generation: An Iterative Planning and Answering Framework for Knowledge-Intensive LLM Generation · EMNLP 2024
EAGLE-2: Faster Inference of Language Models with Dynamic Draft Trees · EMNLP 2024
Bayesian Example Selection Improves In-Context Learning for Speech, Text and Visual Modalities · EMNLP 2024
BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers · EMNLP 2024
Data Diversity Matters for Robust Instruction Tuning · EMNLP 2024
A Simple but Effective Approach to Improve Structured Language Model Output for Information Extraction · EMNLP 2024
ToolChain*: Efficient Action Space Navigation in Large Language Models with A* Search · ICLR 2024
SALMONN: Towards Generic Hearing Abilities for Large Language Models · ICLR 2024
Large Language Models are Efficient Learners of Noise-Robust Speech Recognition · ICLR 2024
RAIN: Your Language Models Can Align Themselves without Finetuning · ICLR 2024
GeminiFusion: Efficient Pixel-wise Multimodal Fusion for Vision Transformer · ICML 2024
EAGLE: Speculative Sampling Requires Rethinking Feature Uncertainty · ICML 2024
Time-Series Forecasting for Out-of-Distribution Generalization Using Invariant Learning · ICML 2024
video-SALMONN: Speech-Enhanced Audio-Visual Large Language Models · ICML 2024
BBox-Adapter: Lightweight Adapting for Black-Box Large Language Models · ICML 2024
Continual Multi-View Clustering with Consistent Anchor Guidance · IJCAI 2024
SOT Triggered Neural Clustering for Speaker Attributed ASR · INTERSPEECH 2024
SAML: Speaker Adaptive Mixture of LoRA Experts for End-to-End ASR · INTERSPEECH 2024
Spontaneous Speech-Based Suicide Risk Detection Using Whisper and Large Language Models · INTERSPEECH 2024
Confidence Estimation for Automatic Detection of Depression and Alzheimer’s Disease Based on Clinical Interviews · INTERSPEECH 2024
Can Large Language Models Understand Spatial Audio? · INTERSPEECH 2024
Assessing Logical Puzzle Solving in Large Language Models: Insights from a Minesweeper Case Study · NAACL 2024
POLYIE: A Dataset of Information Extraction from Polymer Material Scientific Literature · NAACL 2024
Boosting Low-Data Instance Segmentation by Unsupervised Pre-Training With Saliency Prompt · CVPR 2023
A Neural Time Alignment Module for End-to-End Automatic Speech Recognition · INTERSPEECH 2023
TrajectoryFormer: 3D Object Tracking Transformer with Predictive Trajectory Hypotheses · ICCV 2023
One-bit Flip is All You Need: When Bit-flip Attack Meets Model Training · ICCV 2023
Graph Reasoning for Question Answering with Triplet Retrieval · ACL 2023
Context-Aware Query Rewriting for Improving Users’ Search Experience on E-commerce Websites · ACL 2023
Estimating the Uncertainty in Emotion Attributes using Deep Evidential Regression · ACL 2023
Cold-Start Data Selection for Better Few-shot Language Model Fine-tuning: A Prompt-based Uncertainty Propagation Approach · ACL 2023
Robust Graph Dictionary Learning · ICLR 2023
Robust Multi-Agent Reinforcement Learning via Adversarial Regularization: Theoretical Foundation and Stable Algorithms · NIPS 2023
AdaPlanner: Adaptive Planning from Feedback with Language Models · NIPS 2023
Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias · NIPS 2023
ToolQA: A Dataset for LLM Question Answering with External Tools · NIPS 2023
Can Contextual Biasing Remain Effective with Whisper and GPT-2? · INTERSPEECH 2023
Enhanced Tensor Low-Rank and Sparse Representation Recovery for Incomplete Multi-View Clustering · AAAI 2023
Neighborhood-Regularized Self-Training for Learning with Few Labels · AAAI 2023
Towards Optimal Randomized Strategies in Adversarial Example Game · AAAI 2023
Pushing the Limits of Unsupervised Unit Discovery for SSL Speech Representation · INTERSPEECH 2023
Obstructive Sleep Apnea Detection using Pre-trained Speech Representations · INTERSPEECH 2023
Model-Aware Contrastive Learning: Towards Escaping the Dilemmas · ICML 2023
Autoregressive Diffusion Model for Graph Generation · ICML 2023
SMURF-THP: Score Matching-based UnceRtainty quantiFication for Transformer Hawkes Process · ICML 2023
Rank-DETR for High Quality Object Detection · NIPS 2023
CDMA: A Practical Cross-Device Federated Learning Algorithm for General Minimax Problems · AAAI 2023
Knowledge-Selective Pretraining for Attribute Value Extraction · EMNLP 2023
May the Force be with You: Unified Force-Centric Pre-Training for 3D Molecular Conformations · NIPS 2023
Improving Consistency for Text Summarization with Energy Functions · EMNLP 2023
DETRs With Hybrid Matching · CVPR 2023
ReGen: Zero-Shot Text Classification via Training Data Generation with Progressive Dense Retrieval · ACL 2023
Extracting Shopping Interest-Related Product Types from the Web · ACL 2023
Integrating Emotion Recognition with Speech Recognition and Speaker Diarisation for Conversations · INTERSPEECH 2023
Turn-Taking Prediction for Natural Conversational Speech · INTERSPEECH 2022
DPVI: A Dynamic-Weight Particle-Based Variational Inference Framework · IJCAI 2022
Expediting Large-Scale Vision Transformer for Dense Prediction without Fine-tuning · NIPS 2022
RoChBert: Towards Robust BERT Fine-tuning for Chinese · EMNLP 2022
PLATO-Ad: A Unified Advertisement Text Generation Framework with Multi-Task Prompt Learning · EMNLP 2022
End-to-end Stochastic Optimization with Energy-based Model · NIPS 2022
COCO-DR: Combating the Distribution Shift in Zero-Shot Dense Retrieval with Contrastive and Distributionally Robust Learning · EMNLP 2022
ReSel: N-ary Relation Extraction from Scientific Text and Tables by Learning to Retrieve and Select · EMNLP 2022
FlowFormer: A Transformer Architecture for Optical Flow · ECCV 2022
CERES: Pretraining of Graph-Conditioned Transformer for Semi-Structured Session Data · NAACL 2022
Learning Efficient Vision Transformers via Fine-Grained Manifold Distillation · NIPS 2022
Tree-constrained Pointer Generator with Graph Neural Network Encodings for Contextual Speech Recognition · INTERSPEECH 2022
UnfoldML: Cost-Aware and Uncertainty-Based Dynamic 2D Prediction for Multi-Stage Classification · NIPS 2022
Streaming End-to-End Multilingual Speech Recognition with Joint Language Identification · INTERSPEECH 2022
FORCE: A Framework of Rule-Based Conversational Recommender System · AAAI 2022
Learning a Structured Latent Space for Unsupervised Point Cloud Completion · CVPR 2022
Recurring the Transformer for Video Action Recognition · CVPR 2022
Abandoning the Bayer-Filter To See in the Dark · CVPR 2022
From One to All: Learning to Match Heterogeneous and Partially Overlapped Graphs · AAAI 2022
Self-Training with Differentiable Teacher · NAACL 2022
PRBoost: Prompt-Based Rule Discovery and Boosting for Interactive Weakly-Supervised Learning · ACL 2022
AcTune: Uncertainty-Based Active Self-Training for Active Fine-Tuning of Pretrained Language Models · NAACL 2022
Tandem Multitask Training of Speaker Diarisation and Speech Recognition for Meeting Transcription · INTERSPEECH 2022
Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self-Training Approach · NAACL 2021
BERTifying the Hidden Markov Model for Multi-Source Weakly Supervised Named Entity Recognition · ACL 2021
A Hybrid Stochastic Gradient Hamiltonian Monte Carlo Method · AAAI 2021
SHPOS: A Theoretical Guaranteed Accelerated Particle Optimization Sampling Method · IJCAI 2021
When in Doubt: Neural Non-Parametric Uncertainty Quantification for Epidemic Forecasting · NIPS 2021
BERTifying the Hidden Markov Model for Multi-Source Weakly Supervised Named Entity Recognition · IJCNLP 2021
Variable Frame Rate Acoustic Models Using Minimum Error Reinforcement Learning · INTERSPEECH 2021
HRFormer: High-Resolution Vision Transformer for Dense Predict · NIPS 2021
Transfer Learning of Graph Neural Networks with Ego-graph Information Maximization · NIPS 2021
Positive-Unlabeled Data Purification in the Wild for Object Detection · CVPR 2021
Semantic Scene Completion via Integrating Instances and Scene In-the-Loop · CVPR 2021
Learning from Language Description: Low-shot Named Entity Recognition via Decomposed Framework · EMNLP 2021
Efficient Projection-Free Online Methods with Stochastic Recursive Gradient · AAAI 2020
Efficient WaveGlow: An Improved WaveGlow Vocoder with Enhanced Speed · INTERSPEECH 2020
Denoising Multi-Source Weak Supervision for Neural Text Classification · EMNLP 2020
Hit-Detector: Hierarchical Trinity Architecture Search for Object Detection · CVPR 2020
Density-Aware Feature Embedding for Face Clustering · CVPR 2020
Self-Adaptive Training: beyond Empirical Risk Minimization · NIPS 2020
The JD AI Speaker Verification System for the FFSVC 2020 Challenge · INTERSPEECH 2020
Improving Replay Detection System with Channel Consistency DenseNeXt for the ASVspoof 2019 Challenge · INTERSPEECH 2020
Sound Event Localization and Detection Based on Multiple DOA Beamforming and Multi-Task Learning · INTERSPEECH 2020
Text Classification Using Label Names Only: A Language Model Self-Training Approach · EMNLP 2020
SeqMix: Augmenting Active Sequence Labeling via Sequence Mixup · EMNLP 2020
Accelerating Stratified Sampling SGD by Reconstructing Strata · IJCAI 2020
Argot: Generating Adversarial Readable Chinese Texts · IJCAI 2020
Aggregated Gradient Langevin Dynamics · AAAI 2020
SDE-Net: Equipping Deep Neural Networks with Uncertainty Estimates · ICML 2020
Accelerating Primal Solution Findings for Mixed Integer Programs Based on Solution Prediction · AAAI 2020
Calibrated Language Model Fine-Tuning for In- and Out-of-Distribution Data · EMNLP 2020
Decentralized Gradient Tracking for Continuous DR-Submodular Maximization · AISTATS 2019
Multi-Span Acoustic Modelling Using Raw Waveform Signals · INTERSPEECH 2019
Direct-Path Signal Cross-Correlation Estimation for Sound Source Localization in Reverberation · INTERSPEECH 2019
A Gradual, Semi-Discrete Approach to Generative Network Training via Explicit Wasserstein Minimization · ICML 2019
Beyond Human Parts: Dual Part-Aligned Representations for Person Re-Identification · ICCV 2019
Orientation-Aware Semantic Segmentation on Icosahedron Spheres · ICCV 2019
Spherical Text Embedding · NIPS 2019
C3AE: Exploring the Limits of Compact Model for Age Estimation · CVPR 2019
Weakly-Supervised Hierarchical Text Classification · AAAI 2019
Deep Joint-Semantics Reconstructing Hashing for Large-Scale Unsupervised Cross-Modal Retrieval · ICCV 2019
Speaker Adaptation and Adaptive Training for Jointly Optimised Tandem Systems · INTERSPEECH 2018
Learning Environmental Calibration Actions for Policy Self-Evolution · IJCAI 2018
Factorizable Net: An Efficient Subgraph-based Framework for Scene Graph Generation · ECCV 2018
Greedy Hash: Towards Fast Optimization for Accurate Hash Coding in CNN · NIPS 2018
Semi-tied Units for Efficient Gating in LSTM and Highway Networks · INTERSPEECH 2018
Sparse DNNs with Improved Adversarial Robustness · NIPS 2018
JUMP: a Jointly Predictor for User Click and Dwell Time · IJCAI 2018
Joint Sub-bands Learning with Clique Structures for Wavelet Domain Super-Resolution · NIPS 2018
Towards Memory-Friendly Deterministic Incremental Gradient Method · AISTATS 2018
Tensor Completion with Side Information: A Riemannian Manifold Approach · IJCAI 2017
Hard-Aware Deeply Cascaded Embedding · ICCV 2017
Detailed, Accurate, Human Shape Estimation From Clothed 3D Scan Sequences · CVPR 2017
Accelerated Doubly Stochastic Gradient Algorithm for Large-scale Empirical Risk Minimization · IJCAI 2017
Functional Faces: Groupwise Dense Correspondence Using Functional Maps · CVPR 2016
Shell PCA: Statistical Shape Modelling in Shell Space · ICCV 2015
Discrete Hyper-Graph Matching · CVPR 2015
A Study on Cross-Population Age Estimation · CVPR 2014
Bootstrapping Large-scale Named Entities using URL-Text Hybrid Patterns · IJCNLP 2013
Generalization Bounds for Domain Adaptation · NIPS 2012
Generalization Bound for Infinitely Divisible Empirical Process · AISTATS 2011
Risk Bounds for Levy Processes in the PAC-Learning Framework · AISTATS 2010
Query Segmentation Based on Eigenspace Similarity · IJCNLP 2009
Query Segmentation Based on Eigenspace Similarity · ACL 2009