conftrace_

Dong Yu

218 papers · 2007–2026 · 19 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+18 more ↓

🗺️ Taxonomy Completionist (31) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (5) 🌍 Conference Polyglot (19)

🗺️ Taxonomy Completionist (31) 🌈 Renaissance Researcher (5) 🌉 Interdisciplinary Bridge 🏠 Conference Loyalist (47) 🌟 Keyword Trendsetter Combo (6) 🏆 Keyword Champion (6) 👑 Triple Crown 🔬 Deep Specialist (11) 🧬 Topic Evolution 🤝 Dynamic Duo (36) 🏆 Grand Slam ❓ The Questioner (6) 🗃️ Keyword Collector (112) 📈 Trend Setter 🔥 Unstoppable (11) 🚀 Conference Pioneer ⚡ Prolific Year (35) 💎 Century Club (202)

Conferences

ACL (50) INTERSPEECH (47) EMNLP (46) AAAI (13) NAACL (13) ICLR (10) COLING (7) SEMEVAL (6) EACL (5) IJCNLP (5) ICML (3) IJCAI (3) AACL (2) CONLL (2) NIPS (2) UAI (1) ICCV (1) ECCV (1) CVPR (1)

Top co-authors

Hongming Zhang (38) Linfeng Song (31) Haitao Mi (29) Dian Yu (28) Dan Su (26) Jianshu Chen (23) Meng Yu (22) Xiaoyang Wang (22) Kaiqiang Song (21) Wenlin Yao (21)

Keywords

large language model (24) question answering (13) speech separation (12) attention mechanism (12) speech recognition (10) language model (10) reinforcement learning (10) knowledge distillation (9) machine reading comprehension (9) text generation (9) zero-shot learning (8) benchmark evaluation (7) recurrent neural network (7) transfer learning (7) multimodal learning (6) speech enhancement (6) speech synthesis (6) weakly supervised learning (5) retrieval-augmented generation (5) contrastive learning (5)

Papers

EconProver: Towards More Economical Test-Time Scaling for Automated Theorem Proving ACL 2026 Too Correct to Learn: Reinforcement Learning on Saturated Reasoning Data ACL 2026 DegVoC: Revisiting Neural Vocoder from a Degradation Perspective AAAI 2026 Beyond Euclidean Assumptions: Geometry-Aware Adaptive Routing for Remote Sensing Segmentation AAAI 2026 WebRollback: Enhancing Web Agents with Explicit Rollback Mechanisms EACL 2026 Crossing the Reward Bridge: Expanding Reinforcement Learning with Verifiable Rewards Across Diverse Domains ACL 2026 Your Reasoning Model is Secretly a Reward Model - Optimization-Free Verification from Experience ACL 2026 Measure Twice, Click Once: Co-evolving Proposer and Visual Critic via Reinforcement Learning for GUI Grounding ACL 2026 WebAggregator: Enhancing Compositional Reasoning Capabilities of Deep Research Agent Foundation Models ACL 2026 SPAGBias: Uncovering and Tracing Structured Spatial Gender Bias in Large Language Models ACL 2026 Beyond Detection: Evaluating Fallacy Awareness of LLMs in Interactive Scenarios ACL 2026 Audio-Thinker: Guiding Large Audio Language Model When and How to Think via Reinforcement Learning AAAI 2026 UniCUE: Unified Recognition and Generation Framework for Chinese Cued Speech Video-to-Speech Generation AAAI 2026 Enhancing Stability and Fidelity for Zero-Shot TTS with a Multi-Level Evaluator AAAI 2026 Revisiting Audio-language Pretraining for Learning General-purpose Audio Representation ACL 2026 Retrieval-augmented GUI Agents with Generative Guidelines EMNLP 2025 Investigating Value-Reasoning Reliability in Small Large Language Models EMNLP 2025 Recall with Reasoning: Chain-of-Thought Distillation for Mamba’s Long-Context Memory and Extrapolation EMNLP 2025 Router-Tuning: A Simple and Effective Approach for Dynamic Depth EMNLP 2025 Atomic Calibration of LLMs in Long-Form Generations AACL 2025 DocBench: A Benchmark for Evaluating LLM-based Document Reading Systems NAACL 2025 Cognitive Kernel: An Open-source Agent System towards Generalist Autopilots NAACL 2025 LiteSearch: Efficient Tree Search with Dynamic Exploration Budget for Math Reasoning AAAI 2025 A Silver Bullet or a Compromise for Full Attention? A Comprehensive Study of Gist Token-based Context Compression ACL 2025 Attention Entropy is a Key Factor: An Analysis of Parallel Context Encoding with Full-attention-based Pre-trained Language Models ACL 2025 LoGU: Long-form Generation with Uncertainty Expressions ACL 2025 Don’t Get Lost in the Trees: Streamlining LLM Reasoning by Overcoming Tree Search Exploration Pitfalls ACL 2025 OpenWebVoyager: Building Multimodal Web Agents via Iterative Real-World Exploration, Feedback and Optimization ACL 2025 Low-Bit Quantization Favors Undertrained LLMs ACL 2025 DeFine: Decision-Making with Analogical Reasoning over Factor Profiles ACL 2025 What’s the most important value? INVP: INvestigating the Value Priorities of LLMs through Decision-making in Social Scenarios COLING 2025 Entropy Guided Extrapolative Decoding to Improve Factuality in Large Language Models COLING 2025 Atomic Calibration of LLMs in Long-Form Generations IJCNLP 2025 BridgeVoC: Neural Vocoder with Schrödinger Bridge IJCAI 2025 Do NOT Think That Much for 2+3=? On the Overthinking of Long Reasoning Models ICML 2025 DSBench: How Far Are Data Science Agents from Becoming Data Science Experts? ICLR 2025 RepoGraph: Enhancing AI Software Engineering with Repository-level Code Graph ICLR 2025 LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory ICLR 2025 Iterative Nash Policy Optimization: Aligning LLMs with General Preferences via No-Regret Learning ICLR 2025 DOTS: Learning to Reason Dynamically in LLMs via Optimal Reasoning Trajectories Search ICLR 2025 Understanding and Enhancing Mamba-Transformer Hybrids for Memory Recall and Language Modeling EMNLP 2025 Attribution and Application of Multiple Neurons in Multimodal Large Language Models EMNLP 2025 DivScene: Towards Open-Vocabulary Object Navigation with Large Vision Language Models in Diverse Scenes EMNLP 2025 WebCoT: Enhancing Web Agent Reasoning by Reconstructing Chain-of-Thought in Reflection, Branching, and Rollback EMNLP 2025 UNCLE: Benchmarking Uncertainty Expressions in Long-Form Generation EMNLP 2025 WebEvolver: Enhancing Web Agent Self-Improvement with Co-evolving World Model EMNLP 2025 Neural Network Augmented Kalman Filter for Robust Acoustic Howling Suppression INTERSPEECH 2024 Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment ICML 2024 Prompt-guided Precise Audio Editing with Diffusion Models ICML 2024 Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing NIPS 2024 Skills-in-Context: Unlocking Compositionality in Large Language Models EMNLP 2024 Inconsistent dialogue responses and how to recover from them EACL 2024 Multi-Channel Multi-Speaker ASR Using Target Speaker’s Solo Segment INTERSPEECH 2024 RIR-SF: Room Impulse Response Based Spatial Feature for Target Speech Recognition in Multi-Channel Multi-Speaker Scenarios INTERSPEECH 2024 Comparing Discrete and Continuous Space LLMs for Speech Recognition INTERSPEECH 2024 A Knowledge Plug-and-Play Test Bed for Open-domain Dialogue Generation COLING 2024 MinT: Boosting Generalization in Mathematical Reasoning via Multi-view Fine-tuning COLING 2024 Polarity Calibration for Opinion Summarization NAACL 2024 From Language Modeling to Instruction Following: Understanding the Behavior Shift in LLMs after Instruction Tuning NAACL 2024 Sub-Sentence Encoder: Contrastive Learning of Propositional Semantic Representations NAACL 2024 MMC: Advancing Multimodal Chart Understanding with Large-scale Instruction Tuning NAACL 2024 A Closer Look at the Self-Verification Abilities of Large Language Models in Logical Reasoning NAACL 2024 Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning EMNLP 2024 Chain-of-Note: Enhancing Robustness in Retrieval-Augmented Language Models EMNLP 2024 When Reasoning Meets Information Aggregation: A Case Study with Sports Narratives EMNLP 2024 Dense X Retrieval: What Retrieval Granularity Should We Use? EMNLP 2024 Abstraction-of-Thought Makes Language Models Better Reasoners EMNLP 2024 Evaluating Moral Beliefs across LLMs through a Pluralistic Framework EMNLP 2024 Self-Consistency Boosts Calibration for Math Reasoning EMNLP 2024 Event Semantic Classification in Context EACL 2024 SportsMetrics: Blending Text and Numerical Data to Understand Information Fusion in LLMs ACL 2024 Generative Pre-trained Speech Language Model with Efficient Hierarchical Transformer ACL 2024 WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models ACL 2024 Make-A-Voice: Revisiting Voice Large Language Models as Scalable Multilingual and Multitask Learners ACL 2024 CLOMO: Counterfactual Logical Modification with Large Language Models ACL 2024 Improving LLM Generations via Fine-Grained Self-Endorsement ACL 2024 Fact-and-Reflection (FaR) Improves Confidence Calibration of Large Language Models ACL 2024 MatPlotAgent: Method and Evaluation for LLM-Based Agentic Scientific Data Visualization ACL 2024 MM-LLMs: Recent Advances in MultiModal Large Language Models ACL 2024 InFoBench: Evaluating Instruction Following Ability in Large Language Models ACL 2024 The Trickle-down Impact of Reward Inconsistency on RLHF ICLR 2024 Thrust: Adaptively Propels Large Language Models with External Knowledge NIPS 2023 Unsupervised Multi-document Summarization with Holistic Inference AACL 2023 SafeConv: Explaining and Correcting Conversational Unsafe Behavior ACL 2023 Generating User-Engaging News Headlines ACL 2023 Faithful Question Answering with Monte-Carlo Planning ACL 2023 Going Beyond Sentence Embeddings: A Token-Level Matching Algorithm for Calculating Semantic Textual Similarity ACL 2023 Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks ACL 2023 OASum: Large-Scale Open Domain Aspect-based Summarization ACL 2023 Prosody-TTS: Improving Prosody with Masked Autoencoder and Conditional Diffusion Model For Expressive Text-to-Speech ACL 2023 Bi-level Finetuning with Task-dependent Similarity Structure for Low-resource Training ACL 2023 Friend-training: Learning from Models of Different but Related Tasks EACL 2023 How do Words Contribute to Sentence Semantics? Revisiting Sentence Embeddings with a Perturbation Method EACL 2023 Bridging the Gap between Synthetic and Authentic Images for Multimodal Machine Translation EMNLP 2023 Bridging Continuous and Discrete Spaces: Interpretable Sentence Representation Learning via Compositional Operations EMNLP 2023 More Than Spoken Words: Nonverbal Message Extraction and Generation EMNLP 2023 On the Dimensionality of Sentence Embeddings EMNLP 2023 PIVOINE: Instruction Tuning for Open-world Entity Profiling EMNLP 2023 BAYES RISK CTC: CONTROLLABLE CTC ALIGNMENT IN SEQUENCE-TO-SEQUENCE TASKS ICLR 2023 Knowledge-in-Context: Towards Knowledgeable Semi-Parametric Language Models ICLR 2023 Unsupervised Multi-document Summarization with Holistic Inference IJCNLP 2023 Multi-mode Neural Speech Coding Based on Deep Generative Networks INTERSPEECH 2023 Hybrid AHS: A Hybrid of Kalman Filter and Deep Learning for Acoustic Howling Suppression INTERSPEECH 2023 Text-Only Domain Adaptation for End-to-End Speech Recognition through Down-Sampling Acoustic Representation INTERSPEECH 2023 Compressed MoE ASR Model Based on Knowledge Distillation and Quantization INTERSPEECH 2023 Bayes Risk Transducer: Transducer with Controllable Alignment Prediction INTERSPEECH 2023 Zoneformer: On-device Neural Beamformer For In-car Multi-zone Speech Separation, Enhancement and Echo Cancellation INTERSPEECH 2023 From Polarity to Intensity: Mining Morality from Semantic Space COLING 2022 Learning a Grammar Inducer from Massive Uncurated Instructional Videos EMNLP 2022 Z-LaVI: Zero-Shot Language Solver Fueled by Visual Imagination EMNLP 2022 Hierarchical Context Tagging for Utterance Rewriting AAAI 2022 Automatic Prosody Annotation with Pre-Trained Text-Speech Model INTERSPEECH 2022 LAE: Language-Aware Encoder for Monolingual and Multilingual ASR INTERSPEECH 2022 Towards Improved Zero-shot Voice Conversion with Conditional DSVAE INTERSPEECH 2022 Joint Neural AEC and Beamforming with Double-Talk Detection INTERSPEECH 2022 BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis ICLR 2022 MetaLogic: Logical Reasoning Explanations with Fine-Grained Structure EMNLP 2022 Salience Allocation as Guidance for Abstractive Summarization EMNLP 2022 FlowEval: A Consensus-Based Dialogue Evaluation Framework Using Segment Act Flows EMNLP 2022 Cross-lingual Text-to-SQL Semantic Parsing with Representation Mixup EMNLP 2022 Efficient Zero-shot Event Extraction with Context-Definition Alignment EMNLP 2022 Meta-learning without data via Wasserstein distributionally-robust model fusion UAI 2022 Variational Graph Autoencoding as Cheap Supervision for AMR Coreference Resolution ACL 2022 Towards Abstractive Grounded Summarization of Podcast Transcripts ACL 2022 Improving Machine Reading Comprehension with Contextualized Commonsense Knowledge ACL 2022 Learning-by-Narrating: Narrative Pre-Training for Zero-Shot Dialogue Comprehension ACL 2022 C-MORE: Pretraining to Answer Open-Domain Questions by Consulting Millions of References ACL 2022 End-to-End Chinese Speaker Identification NAACL 2022 FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech Synthesis IJCAI 2022 Toward Unifying Text Segmentation and Long Document Summarization EMNLP 2022 Self-Teaching Machines to Read and Comprehend with Large-Scale Multi-Subject Question-Answering Data EMNLP 2021 Connect-the-Dots: Bridging Semantics between Words and Definitions via Aligning Word Sense Inventories EMNLP 2021 Instance-adaptive training with noise-robust losses against noisy labels EMNLP 2021 RAST: Domain-Robust Dialogue Rewriting as Sequence Tagging EMNLP 2021 Exophoric Pronoun Resolution in Dialogues with Topic Regularization EMNLP 2021 Improving Weakly Supervised Visual Grounding by Contrastive Knowledge Distillation CVPR 2021 Importance-based Neuron Allocation for Multilingual Neural Machine Translation IJCNLP 2021 TexSmart: A System for Enhanced Natural Language Understanding IJCNLP 2021 Raw Waveform Encoder with Multi-Scale Globally Attentive Locally Recurrent Networks for End-to-End Speech Recognition INTERSPEECH 2021 TeCANet: Temporal-Contextual Attention Network for Environment-Aware Speech Dereverberation INTERSPEECH 2021 MIMO Self-Attentive RNN Beamformer for Multi-Speaker Speech Separation INTERSPEECH 2021 SpeechMoE: Scaling to Large Acoustic Models with Dynamic Routing Mixture of Experts INTERSPEECH 2021 MetricNet: Towards Improved Modeling For Non-Intrusive Speech Quality Assessment INTERSPEECH 2021 Generalized Spatio-Temporal RNN Beamformer for Target Speech Separation INTERSPEECH 2021 Multi-Channel Speaker Verification for Single and Multi-Talker Speech INTERSPEECH 2021 NaturalConv: A Chinese Dialogue Dataset Towards Multi-turn Topic-driven Conversation AAAI 2021 Tune-In: Training Under Negative Environments with Interference for Attention Networks Simulating Cocktail Party Effect AAAI 2021 Video-aided Unsupervised Grammar Induction NAACL 2021 TexSmart: A System for Enhanced Natural Language Understanding ACL 2021 Importance-based Neuron Allocation for Multilingual Neural Machine Translation ACL 2021 TenTrans Large-Scale Multilingual Machine Translation System for WMT21 EMNLP 2021 Investigating Robustness of Adversarial Samples Detection for Automatic Speaker Verification INTERSPEECH 2020 DurIAN: Duration Informed Attention Network for Speech Synthesis INTERSPEECH 2020 BLCU-NLP at SemEval-2020 Task 5: Data Augmentation for Efficient Counterfactual Detecting COLING 2020 SHIKEBLCU at SemEval-2020 Task 2: An External Knowledge-enhanced Matrix for Multilingual and Cross-Lingual Lexical Entailment COLING 2020 BLCU-NLP at SemEval-2020 Task 5: Data Augmentation for Efficient Counterfactual Detecting SEMEVAL 2020 SHIKEBLCU at SemEval-2020 Task 2: An External Knowledge-enhanced Matrix for Multilingual and Cross-Lingual Lexical Entailment SEMEVAL 2020 Semantic Role Labeling Guided Multi-turn Dialogue ReWriter EMNLP 2020 Audio-Visual Multi-Channel Recognition of Overlapped Speech INTERSPEECH 2020 Transferring Source Style in Non-Parallel Voice Conversion INTERSPEECH 2020 Dialogue-Based Relation Extraction ACL 2020 Comprehensive Image Captioning via Scene Graph Decomposition ECCV 2020 MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning ACL 2020 Towards Faithful Neural Table-to-Text Generation with Content-Matching Constraints ACL 2020 Neural Spatio-Temporal Beamformer for Target Speech Separation INTERSPEECH 2020 End-to-End Multi-Look Keyword Spotting INTERSPEECH 2020 Token-level Adaptive Training for Neural Machine Translation EMNLP 2020 Better Highlighting: Creating Sub-Sentence Summary Highlights EMNLP 2020 ZPR2: Joint Zero Pronoun Recovery and Resolution using Multi-Task Learning and BERT ACL 2020 Recurrent Chunking Mechanisms for Long-Text Machine Reading Comprehension ACL 2020 Structural Information Preserving for Graph-to-Text Generation ACL 2020 Minimum Bayes Risk Training of RNN-Transducer for End-to-End Speech Recognition INTERSPEECH 2020 Peking Opera Synthesis via Duration Informed Attention Network INTERSPEECH 2020 DurIAN-SC: Duration Informed Attention Network Based Singing Voice Conversion System INTERSPEECH 2020 Coordinated Reasoning for Cross-Lingual Knowledge Graph Alignment AAAI 2020 Joint Parsing and Generation for Abstractive Summarization AAAI 2020 Relation Extraction Exploiting Full Dependency Forests AAAI 2020 Modeling Fluency and Faithfulness for Diverse Neural Machine Translation AAAI 2020 Improving Question Answering with External Knowledge EMNLP 2019 Cross-lingual Knowledge Graph Alignment via Graph Matching Neural Network ACL 2019 BLCU-NLP at COIN-Shared Task1: Stagewise Fine-tuning BERT for Commonsense Inference in Everyday Narrations EMNLP 2019 Evidence Sentence Extraction for Machine Reading Comprehension CONLL 2019 Reliability-aware Dynamic Feature Composition for Name Tagging ACL 2019 Knowledge-aware Pronoun Coreference Resolution ACL 2019 Improving Pre-Trained Multilingual Model with Vocabulary Expansion CONLL 2019 Multiplex Word Embeddings for Selectional Preference Acquisition IJCNLP 2019 Improving Machine Reading Comprehension with General Reading Strategies NAACL 2019 Unsupervised Speech Recognition via Segmental Empirical Output Distribution Matching ICLR 2019 Unsupervised Neural Aspect Extraction with Sememes IJCAI 2019 Multiplex Word Embeddings for Selectional Preference Acquisition EMNLP 2019 Generating Diverse Story Continuations with Controllable Semantics EMNLP 2019 Multi-Document Summarization with Determinantal Point Processes and Contextualized Representations EMNLP 2019 A Fast and Accurate One-Stage Approach to Visual Grounding ICCV 2019 BLCU_NLP at SemEval-2019 Task 7: An Inference Chain-based GPT Model for Rumour Evaluation SEMEVAL 2019 BLCU_NLP at SemEval-2019 Task 8: A Contextual Knowledge-enhanced GPT Model for Fact Checking SEMEVAL 2019 A Comprehensive Study of Speech Separation: Spectrogram vs Waveform Separation INTERSPEECH 2019 Large Margin Training for Attention Based End-to-End Speech Recognition INTERSPEECH 2019 Improved Speaker-Dependent Separation for CHiME-5 Challenge INTERSPEECH 2019 Disambiguation of Chinese Polyphones in an End-to-End Framework with Semantic Features Extracted by Pre-Trained BERT INTERSPEECH 2019 Extract, Adapt and Recognize: An End-to-End Neural Network for Corrupted Monaural Speech Recognition INTERSPEECH 2019 Neural Spatial Filter: Target Speaker Speech Separation Assisted with Directional Information INTERSPEECH 2019 Rapid Style Adaptation Using Residual Error Embedding for Expressive Speech Synthesis INTERSPEECH 2018 Deep Extractor Network for Target Speaker Recovery from Single Channel Speech Mixtures INTERSPEECH 2018 Improving Attention Based Sequence-to-Sequence Models for End-to-End English Conversational Speech Recognition INTERSPEECH 2018 A Multistage Training Framework for Acoustic-to-Word Model INTERSPEECH 2018 Monaural Multi-Talker Speech Recognition with Attention Mechanism and Gated Convolutional Networks INTERSPEECH 2018 BLCU_NLP at SemEval-2018 Task 12: An Ensemble Model for Argument Reasoning Based on Hierarchical Attention SEMEVAL 2018 Deep Discriminative Embeddings for Duration Robust Speaker Verification INTERSPEECH 2018 Text-Dependent Speech Enhancement for Small-Footprint Robust Keyword Detection INTERSPEECH 2018 Permutation Invariant Training of Generative Adversarial Network for Monaural Speech Separation INTERSPEECH 2018 XL-NBT: A Cross-lingual Neural Belief Tracking Framework EMNLP 2018 Empirical Evaluation of Parallel Training Algorithms on Acoustic Modeling INTERSPEECH 2017 Recognizing Multi-Talker Speech with Permutation Invariant Training INTERSPEECH 2017 Deep Convolutional Neural Networks with Layer-Wise Context Expansion and Attention INTERSPEECH 2016 Recurrent Support Vector Machines For Slot Tagging In Spoken Language Understanding NAACL 2016 An End-to-end Approach to Learning Semantic Frames with Feedforward Neural Network NAACL 2016 BLCUNLP: Corpus Pattern Analysis for Verbs Based on Dependency Chain SEMEVAL 2015 Voice-Rate: A Dialog System for Consumer Ratings NAACL 2007