Dong Yu
218 papers · 2007–2026 · 19 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+18 more ↓ Show less ↑
๐บ๏ธ Taxonomy Completionist (31) ๐งญ Keyword Pioneer ๐ Interdisciplinary Bridge ๐ Renaissance Researcher (5) ๐ Conference Polyglot (19)
๐บ๏ธ
Taxonomy Completionist
(31)
๐
Renaissance Researcher
(5)
๐
Interdisciplinary Bridge
๐
Conference Loyalist
(47)
๐
Keyword Trendsetter Combo
(6)
๐
Keyword Champion
(6)
๐
Triple Crown
๐ฌ
Deep Specialist
(11)
๐งฌ
Topic Evolution
๐ค
Dynamic Duo
(36)
๐
Grand Slam
โ
The Questioner
(6)
๐๏ธ
Keyword Collector
(112)
๐
Trend Setter
๐ฅ
Unstoppable
(11)
๐
Conference Pioneer
โก
Prolific Year
(35)
๐
Century Club
(202)
Conferences
ACL (50)
INTERSPEECH (47)
EMNLP (46)
AAAI (13)
NAACL (13)
ICLR (10)
COLING (7)
SEMEVAL (6)
EACL (5)
IJCNLP (5)
ICML (3)
IJCAI (3)
AACL (2)
CONLL (2)
NIPS (2)
UAI (1)
ICCV (1)
ECCV (1)
CVPR (1)
Top co-authors
Keywords
large language model
(24)
question answering
(13)
speech separation
(12)
attention mechanism
(12)
speech recognition
(10)
language model
(10)
reinforcement learning
(10)
knowledge distillation
(9)
machine reading comprehension
(9)
text generation
(9)
zero-shot learning
(8)
benchmark evaluation
(7)
recurrent neural network
(7)
transfer learning
(7)
multimodal learning
(6)
speech enhancement
(6)
speech synthesis
(6)
weakly supervised learning
(5)
retrieval-augmented generation
(5)
contrastive learning
(5)
Papers
EconProver: Towards More Economical Test-Time Scaling for Automated Theorem Proving
ACL 2026
Too Correct to Learn: Reinforcement Learning on Saturated Reasoning Data
ACL 2026
DegVoC: Revisiting Neural Vocoder from a Degradation Perspective
AAAI 2026
Beyond Euclidean Assumptions: Geometry-Aware Adaptive Routing for Remote Sensing Segmentation
AAAI 2026
WebRollback: Enhancing Web Agents with Explicit Rollback Mechanisms
EACL 2026
Crossing the Reward Bridge: Expanding Reinforcement Learning with Verifiable Rewards Across Diverse Domains
ACL 2026
Your Reasoning Model is Secretly a Reward Model - Optimization-Free Verification from Experience
ACL 2026
Measure Twice, Click Once: Co-evolving Proposer and Visual Critic via Reinforcement Learning for GUI Grounding
ACL 2026
WebAggregator: Enhancing Compositional Reasoning Capabilities of Deep Research Agent Foundation Models
ACL 2026
SPAGBias: Uncovering and Tracing Structured Spatial Gender Bias in Large Language Models
ACL 2026
Beyond Detection: Evaluating Fallacy Awareness of LLMs in Interactive Scenarios
ACL 2026
Audio-Thinker: Guiding Large Audio Language Model When and How to Think via Reinforcement Learning
AAAI 2026
UniCUE: Unified Recognition and Generation Framework for Chinese Cued Speech Video-to-Speech Generation
AAAI 2026
Enhancing Stability and Fidelity for Zero-Shot TTS with a Multi-Level Evaluator
AAAI 2026
Revisiting Audio-language Pretraining for Learning General-purpose Audio Representation
ACL 2026
Retrieval-augmented GUI Agents with Generative Guidelines
EMNLP 2025
Investigating Value-Reasoning Reliability in Small Large Language Models
EMNLP 2025
Recall with Reasoning: Chain-of-Thought Distillation for Mambaโs Long-Context Memory and Extrapolation
EMNLP 2025
Router-Tuning: A Simple and Effective Approach for Dynamic Depth
EMNLP 2025
Atomic Calibration of LLMs in Long-Form Generations
AACL 2025
DocBench: A Benchmark for Evaluating LLM-based Document Reading Systems
NAACL 2025
Cognitive Kernel: An Open-source Agent System towards Generalist Autopilots
NAACL 2025
LiteSearch: Efficient Tree Search with Dynamic Exploration Budget for Math Reasoning
AAAI 2025
A Silver Bullet or a Compromise for Full Attention? A Comprehensive Study of Gist Token-based Context Compression
ACL 2025
Attention Entropy is a Key Factor: An Analysis of Parallel Context Encoding with Full-attention-based Pre-trained Language Models
ACL 2025
LoGU: Long-form Generation with Uncertainty Expressions
ACL 2025
Donโt Get Lost in the Trees: Streamlining LLM Reasoning by Overcoming Tree Search Exploration Pitfalls
ACL 2025
OpenWebVoyager: Building Multimodal Web Agents via Iterative Real-World Exploration, Feedback and Optimization
ACL 2025
Low-Bit Quantization Favors Undertrained LLMs
ACL 2025
DeFine: Decision-Making with Analogical Reasoning over Factor Profiles
ACL 2025
Whatโs the most important value? INVP: INvestigating the Value Priorities of LLMs through Decision-making in Social Scenarios
COLING 2025
Entropy Guided Extrapolative Decoding to Improve Factuality in Large Language Models
COLING 2025
Atomic Calibration of LLMs in Long-Form Generations
IJCNLP 2025
BridgeVoC: Neural Vocoder with Schrรถdinger Bridge
IJCAI 2025
Do NOT Think That Much for 2+3=? On the Overthinking of Long Reasoning Models
ICML 2025
DSBench: How Far Are Data Science Agents from Becoming Data Science Experts?
ICLR 2025
RepoGraph: Enhancing AI Software Engineering with Repository-level Code Graph
ICLR 2025
LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory
ICLR 2025
Iterative Nash Policy Optimization: Aligning LLMs with General Preferences via No-Regret Learning
ICLR 2025
DOTS: Learning to Reason Dynamically in LLMs via Optimal Reasoning Trajectories Search
ICLR 2025
Understanding and Enhancing Mamba-Transformer Hybrids for Memory Recall and Language Modeling
EMNLP 2025
Attribution and Application of Multiple Neurons in Multimodal Large Language Models
EMNLP 2025
DivScene: Towards Open-Vocabulary Object Navigation with Large Vision Language Models in Diverse Scenes
EMNLP 2025
WebCoT: Enhancing Web Agent Reasoning by Reconstructing Chain-of-Thought in Reflection, Branching, and Rollback
EMNLP 2025
UNCLE: Benchmarking Uncertainty Expressions in Long-Form Generation
EMNLP 2025
WebEvolver: Enhancing Web Agent Self-Improvement with Co-evolving World Model
EMNLP 2025
Neural Network Augmented Kalman Filter for Robust Acoustic Howling Suppression
INTERSPEECH 2024
Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment
ICML 2024
Prompt-guided Precise Audio Editing with Diffusion Models
ICML 2024
Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing
NIPS 2024
Skills-in-Context: Unlocking Compositionality in Large Language Models
EMNLP 2024
Inconsistent dialogue responses and how to recover from them
EACL 2024
Multi-Channel Multi-Speaker ASR Using Target Speakerโs Solo Segment
INTERSPEECH 2024
RIR-SF: Room Impulse Response Based Spatial Feature for Target Speech Recognition in Multi-Channel Multi-Speaker Scenarios
INTERSPEECH 2024
Comparing Discrete and Continuous Space LLMs for Speech Recognition
INTERSPEECH 2024
A Knowledge Plug-and-Play Test Bed for Open-domain Dialogue Generation
COLING 2024
MinT: Boosting Generalization in Mathematical Reasoning via Multi-view Fine-tuning
COLING 2024
Polarity Calibration for Opinion Summarization
NAACL 2024
From Language Modeling to Instruction Following: Understanding the Behavior Shift in LLMs after Instruction Tuning
NAACL 2024
Sub-Sentence Encoder: Contrastive Learning of Propositional Semantic Representations
NAACL 2024
MMC: Advancing Multimodal Chart Understanding with Large-scale Instruction Tuning
NAACL 2024
A Closer Look at the Self-Verification Abilities of Large Language Models in Logical Reasoning
NAACL 2024
Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning
EMNLP 2024
Chain-of-Note: Enhancing Robustness in Retrieval-Augmented Language Models
EMNLP 2024
When Reasoning Meets Information Aggregation: A Case Study with Sports Narratives
EMNLP 2024
Dense X Retrieval: What Retrieval Granularity Should We Use?
EMNLP 2024
Abstraction-of-Thought Makes Language Models Better Reasoners
EMNLP 2024
Evaluating Moral Beliefs across LLMs through a Pluralistic Framework
EMNLP 2024
Self-Consistency Boosts Calibration for Math Reasoning
EMNLP 2024
Event Semantic Classification in Context
EACL 2024
SportsMetrics: Blending Text and Numerical Data to Understand Information Fusion in LLMs
ACL 2024
Generative Pre-trained Speech Language Model with Efficient Hierarchical Transformer
ACL 2024
WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models
ACL 2024
Make-A-Voice: Revisiting Voice Large Language Models as Scalable Multilingual and Multitask Learners
ACL 2024
CLOMO: Counterfactual Logical Modification with Large Language Models
ACL 2024
Improving LLM Generations via Fine-Grained Self-Endorsement
ACL 2024
Fact-and-Reflection (FaR) Improves Confidence Calibration of Large Language Models
ACL 2024
MatPlotAgent: Method and Evaluation for LLM-Based Agentic Scientific Data Visualization
ACL 2024
MM-LLMs: Recent Advances in MultiModal Large Language Models
ACL 2024
InFoBench: Evaluating Instruction Following Ability in Large Language Models
ACL 2024
The Trickle-down Impact of Reward Inconsistency on RLHF
ICLR 2024
Thrust: Adaptively Propels Large Language Models with External Knowledge
NIPS 2023
Unsupervised Multi-document Summarization with Holistic Inference
AACL 2023
SafeConv: Explaining and Correcting Conversational Unsafe Behavior
ACL 2023
Generating User-Engaging News Headlines
ACL 2023
Faithful Question Answering with Monte-Carlo Planning
ACL 2023
Going Beyond Sentence Embeddings: A Token-Level Matching Algorithm for Calculating Semantic Textual Similarity
ACL 2023
Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks
ACL 2023
OASum: Large-Scale Open Domain Aspect-based Summarization
ACL 2023
Prosody-TTS: Improving Prosody with Masked Autoencoder and Conditional Diffusion Model For Expressive Text-to-Speech
ACL 2023
Bi-level Finetuning with Task-dependent Similarity Structure for Low-resource Training
ACL 2023
Friend-training: Learning from Models of Different but Related Tasks
EACL 2023
How do Words Contribute to Sentence Semantics? Revisiting Sentence Embeddings with a Perturbation Method
EACL 2023
Bridging the Gap between Synthetic and Authentic Images for Multimodal Machine Translation
EMNLP 2023
Bridging Continuous and Discrete Spaces: Interpretable Sentence Representation Learning via Compositional Operations
EMNLP 2023
More Than Spoken Words: Nonverbal Message Extraction and Generation
EMNLP 2023
On the Dimensionality of Sentence Embeddings
EMNLP 2023
PIVOINE: Instruction Tuning for Open-world Entity Profiling
EMNLP 2023
BAYES RISK CTC: CONTROLLABLE CTC ALIGNMENT IN SEQUENCE-TO-SEQUENCE TASKS
ICLR 2023
Knowledge-in-Context: Towards Knowledgeable Semi-Parametric Language Models
ICLR 2023
Unsupervised Multi-document Summarization with Holistic Inference
IJCNLP 2023
Multi-mode Neural Speech Coding Based on Deep Generative Networks
INTERSPEECH 2023
Hybrid AHS: A Hybrid of Kalman Filter and Deep Learning for Acoustic Howling Suppression
INTERSPEECH 2023
Text-Only Domain Adaptation for End-to-End Speech Recognition through Down-Sampling Acoustic Representation
INTERSPEECH 2023
Compressed MoE ASR Model Based on Knowledge Distillation and Quantization
INTERSPEECH 2023
Bayes Risk Transducer: Transducer with Controllable Alignment Prediction
INTERSPEECH 2023
Zoneformer: On-device Neural Beamformer For In-car Multi-zone Speech Separation, Enhancement and Echo Cancellation
INTERSPEECH 2023
From Polarity to Intensity: Mining Morality from Semantic Space
COLING 2022
Learning a Grammar Inducer from Massive Uncurated Instructional Videos
EMNLP 2022
Z-LaVI: Zero-Shot Language Solver Fueled by Visual Imagination
EMNLP 2022
Hierarchical Context Tagging for Utterance Rewriting
AAAI 2022
Automatic Prosody Annotation with Pre-Trained Text-Speech Model
INTERSPEECH 2022
LAE: Language-Aware Encoder for Monolingual and Multilingual ASR
INTERSPEECH 2022
Towards Improved Zero-shot Voice Conversion with Conditional DSVAE
INTERSPEECH 2022
Joint Neural AEC and Beamforming with Double-Talk Detection
INTERSPEECH 2022
BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis
ICLR 2022
MetaLogic: Logical Reasoning Explanations with Fine-Grained Structure
EMNLP 2022
Salience Allocation as Guidance for Abstractive Summarization
EMNLP 2022
FlowEval: A Consensus-Based Dialogue Evaluation Framework Using Segment Act Flows
EMNLP 2022
Cross-lingual Text-to-SQL Semantic Parsing with Representation Mixup
EMNLP 2022
Efficient Zero-shot Event Extraction with Context-Definition Alignment
EMNLP 2022
Meta-learning without data via Wasserstein distributionally-robust model fusion
UAI 2022
Variational Graph Autoencoding as Cheap Supervision for AMR Coreference Resolution
ACL 2022
Towards Abstractive Grounded Summarization of Podcast Transcripts
ACL 2022
Improving Machine Reading Comprehension with Contextualized Commonsense Knowledge
ACL 2022
Learning-by-Narrating: Narrative Pre-Training for Zero-Shot Dialogue Comprehension
ACL 2022
C-MORE: Pretraining to Answer Open-Domain Questions by Consulting Millions of References
ACL 2022
End-to-End Chinese Speaker Identification
NAACL 2022
FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech Synthesis
IJCAI 2022
Toward Unifying Text Segmentation and Long Document Summarization
EMNLP 2022
Self-Teaching Machines to Read and Comprehend with Large-Scale Multi-Subject Question-Answering Data
EMNLP 2021
Connect-the-Dots: Bridging Semantics between Words and Definitions via Aligning Word Sense Inventories
EMNLP 2021
Instance-adaptive training with noise-robust losses against noisy labels
EMNLP 2021
RAST: Domain-Robust Dialogue Rewriting as Sequence Tagging
EMNLP 2021
Exophoric Pronoun Resolution in Dialogues with Topic Regularization
EMNLP 2021
Improving Weakly Supervised Visual Grounding by Contrastive Knowledge Distillation
CVPR 2021
Importance-based Neuron Allocation for Multilingual Neural Machine Translation
IJCNLP 2021
TexSmart: A System for Enhanced Natural Language Understanding
IJCNLP 2021
Raw Waveform Encoder with Multi-Scale Globally Attentive Locally Recurrent Networks for End-to-End Speech Recognition
INTERSPEECH 2021
TeCANet: Temporal-Contextual Attention Network for Environment-Aware Speech Dereverberation
INTERSPEECH 2021
MIMO Self-Attentive RNN Beamformer for Multi-Speaker Speech Separation
INTERSPEECH 2021
SpeechMoE: Scaling to Large Acoustic Models with Dynamic Routing Mixture of Experts
INTERSPEECH 2021
MetricNet: Towards Improved Modeling For Non-Intrusive Speech Quality Assessment
INTERSPEECH 2021
Generalized Spatio-Temporal RNN Beamformer for Target Speech Separation
INTERSPEECH 2021
Multi-Channel Speaker Verification for Single and Multi-Talker Speech
INTERSPEECH 2021
NaturalConv: A Chinese Dialogue Dataset Towards Multi-turn Topic-driven Conversation
AAAI 2021
Tune-In: Training Under Negative Environments with Interference for Attention Networks Simulating Cocktail Party Effect
AAAI 2021
Video-aided Unsupervised Grammar Induction
NAACL 2021
TexSmart: A System for Enhanced Natural Language Understanding
ACL 2021
Importance-based Neuron Allocation for Multilingual Neural Machine Translation
ACL 2021
TenTrans Large-Scale Multilingual Machine Translation System for WMT21
EMNLP 2021
Investigating Robustness of Adversarial Samples Detection for Automatic Speaker Verification
INTERSPEECH 2020
DurIAN: Duration Informed Attention Network for Speech Synthesis
INTERSPEECH 2020
BLCU-NLP at SemEval-2020 Task 5: Data Augmentation for Efficient Counterfactual Detecting
COLING 2020
SHIKEBLCU at SemEval-2020 Task 2: An External Knowledge-enhanced Matrix for Multilingual and Cross-Lingual Lexical Entailment
COLING 2020
BLCU-NLP at SemEval-2020 Task 5: Data Augmentation for Efficient Counterfactual Detecting
SEMEVAL 2020
SHIKEBLCU at SemEval-2020 Task 2: An External Knowledge-enhanced Matrix for Multilingual and Cross-Lingual Lexical Entailment
SEMEVAL 2020
Semantic Role Labeling Guided Multi-turn Dialogue ReWriter
EMNLP 2020
Audio-Visual Multi-Channel Recognition of Overlapped Speech
INTERSPEECH 2020
Transferring Source Style in Non-Parallel Voice Conversion
INTERSPEECH 2020
Dialogue-Based Relation Extraction
ACL 2020
Comprehensive Image Captioning via Scene Graph Decomposition
ECCV 2020
MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning
ACL 2020
Towards Faithful Neural Table-to-Text Generation with Content-Matching Constraints
ACL 2020
Neural Spatio-Temporal Beamformer for Target Speech Separation
INTERSPEECH 2020
End-to-End Multi-Look Keyword Spotting
INTERSPEECH 2020
Token-level Adaptive Training for Neural Machine Translation
EMNLP 2020
Better Highlighting: Creating Sub-Sentence Summary Highlights
EMNLP 2020
ZPR2: Joint Zero Pronoun Recovery and Resolution using Multi-Task Learning and BERT
ACL 2020
Recurrent Chunking Mechanisms for Long-Text Machine Reading Comprehension
ACL 2020
Structural Information Preserving for Graph-to-Text Generation
ACL 2020
Minimum Bayes Risk Training of RNN-Transducer for End-to-End Speech Recognition
INTERSPEECH 2020
Peking Opera Synthesis via Duration Informed Attention Network
INTERSPEECH 2020
DurIAN-SC: Duration Informed Attention Network Based Singing Voice Conversion System
INTERSPEECH 2020
Coordinated Reasoning for Cross-Lingual Knowledge Graph Alignment
AAAI 2020
Joint Parsing and Generation for Abstractive Summarization
AAAI 2020
Relation Extraction Exploiting Full Dependency Forests
AAAI 2020
Modeling Fluency and Faithfulness for Diverse Neural Machine Translation
AAAI 2020
Improving Question Answering with External Knowledge
EMNLP 2019
Cross-lingual Knowledge Graph Alignment via Graph Matching Neural Network
ACL 2019
BLCU-NLP at COIN-Shared Task1: Stagewise Fine-tuning BERT for Commonsense Inference in Everyday Narrations
EMNLP 2019
Evidence Sentence Extraction for Machine Reading Comprehension
CONLL 2019
Reliability-aware Dynamic Feature Composition for Name Tagging
ACL 2019
Knowledge-aware Pronoun Coreference Resolution
ACL 2019
Improving Pre-Trained Multilingual Model with Vocabulary Expansion
CONLL 2019
Multiplex Word Embeddings for Selectional Preference Acquisition
IJCNLP 2019
Improving Machine Reading Comprehension with General Reading Strategies
NAACL 2019
Unsupervised Speech Recognition via Segmental Empirical Output Distribution Matching
ICLR 2019
Unsupervised Neural Aspect Extraction with Sememes
IJCAI 2019
Multiplex Word Embeddings for Selectional Preference Acquisition
EMNLP 2019
Generating Diverse Story Continuations with Controllable Semantics
EMNLP 2019
Multi-Document Summarization with Determinantal Point Processes and Contextualized Representations
EMNLP 2019
A Fast and Accurate One-Stage Approach to Visual Grounding
ICCV 2019
BLCU_NLP at SemEval-2019 Task 7: An Inference Chain-based GPT Model for Rumour Evaluation
SEMEVAL 2019
BLCU_NLP at SemEval-2019 Task 8: A Contextual Knowledge-enhanced GPT Model for Fact Checking
SEMEVAL 2019
A Comprehensive Study of Speech Separation: Spectrogram vs Waveform Separation
INTERSPEECH 2019
Large Margin Training for Attention Based End-to-End Speech Recognition
INTERSPEECH 2019
Improved Speaker-Dependent Separation for CHiME-5 Challenge
INTERSPEECH 2019
Disambiguation of Chinese Polyphones in an End-to-End Framework with Semantic Features Extracted by Pre-Trained BERT
INTERSPEECH 2019
Extract, Adapt and Recognize: An End-to-End Neural Network for Corrupted Monaural Speech Recognition
INTERSPEECH 2019
Neural Spatial Filter: Target Speaker Speech Separation Assisted with Directional Information
INTERSPEECH 2019
Rapid Style Adaptation Using Residual Error Embedding for Expressive Speech Synthesis
INTERSPEECH 2018
Deep Extractor Network for Target Speaker Recovery from Single Channel Speech Mixtures
INTERSPEECH 2018
Improving Attention Based Sequence-to-Sequence Models for End-to-End English Conversational Speech Recognition
INTERSPEECH 2018
A Multistage Training Framework for Acoustic-to-Word Model
INTERSPEECH 2018
Monaural Multi-Talker Speech Recognition with Attention Mechanism and Gated Convolutional Networks
INTERSPEECH 2018
BLCU_NLP at SemEval-2018 Task 12: An Ensemble Model for Argument Reasoning Based on Hierarchical Attention
SEMEVAL 2018
Deep Discriminative Embeddings for Duration Robust Speaker Verification
INTERSPEECH 2018
Text-Dependent Speech Enhancement for Small-Footprint Robust Keyword Detection
INTERSPEECH 2018
Permutation Invariant Training of Generative Adversarial Network for Monaural Speech Separation
INTERSPEECH 2018
XL-NBT: A Cross-lingual Neural Belief Tracking Framework
EMNLP 2018
Empirical Evaluation of Parallel Training Algorithms on Acoustic Modeling
INTERSPEECH 2017
Recognizing Multi-Talker Speech with Permutation Invariant Training
INTERSPEECH 2017
Deep Convolutional Neural Networks with Layer-Wise Context Expansion and Attention
INTERSPEECH 2016
Recurrent Support Vector Machines For Slot Tagging In Spoken Language Understanding
NAACL 2016
An End-to-end Approach to Learning Semantic Frames with Feedforward Neural Network
NAACL 2016
BLCUNLP: Corpus Pattern Analysis for Verbs Based on Dependency Chain
SEMEVAL 2015
Voice-Rate: A Dialog System for Consumer Ratings
NAACL 2007