Haifeng Wang
169 papers · 2004–2026 · 16 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+15 more ↓ Show less ↑
๐ฃ Hot Topic Early Bird ๐บ๏ธ Taxonomy Completionist (14) ๐งญ Keyword Pioneer ๐ Interdisciplinary Bridge ๐ Conference Polyglot (16)
๐
Interdisciplinary Bridge
๐
Academic Marathon
(21)
๐บ๏ธ
Taxonomy Completionist
(14)
๐
Conference Loyalist
(56)
๐
Keyword Trendsetter Combo
(5)
๐ค
Dynamic Duo
(112)
๐งฌ
Topic Evolution
๐ฌ
Deep Specialist
(15)
๐
Trend Setter
๐ฅ
Unstoppable
(22)
๐
Conference Pioneer
โก
Prolific Year
(6)
โ
The Questioner
๐๏ธ
Keyword Collector
(419)
๐
Century Club
(163)
Conferences
ACL (61)
EMNLP (33)
IJCNLP (23)
COLING (19)
AAAI (8)
NAACL (7)
IJCAI (6)
CONLL (3)
MICCAI (2)
AACL (1)
CVPR (1)
ICCV (1)
ICLR (1)
ICML (1)
INTERSPEECH (1)
JMLR (1)
Top co-authors
Research topics
Keywords
transfer learning
(9)
large language model
(9)
dialogue system
(8)
neural machine translation
(7)
multi-task learning
(7)
dialogue generation
(7)
simultaneous translation
(6)
reinforcement learning
(6)
pre-trained language model
(6)
knowledge graph
(6)
machine reading comprehension
(6)
information retrieval
(6)
question answering
(5)
machine translation
(5)
representation learning
(5)
speech translation
(4)
speech-to-text translation
(4)
knowledge distillation
(4)
language model
(4)
multi-document summarization
(4)
Papers
Uncertainty-Aware Routing for Principled Alignment with MoE Dynamics
ACL 2026
Distributional Clarity: The Hidden Driver of RL-Friendliness in Large Language Models
ACL 2026
PEAP: Proactive Embodied Action Sequence Planning with Joint Understanding of Vision and Audio Perception
ACL 2026
Reinforced Informativeness Optimization for Long-Form Retrieval-Augmented Generation
ACL 2026
RRAtention: Dynamic Block Sparse Attention via Per-Head Round-Robin Shifts for Long-Context Inference
ACL 2026
BEE-RAG: Balanced Entropy Engineering for Retrieval-Augmented Generation
AAAI 2026
Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation
COLING 2025
BeamLoRA: Beam-Constraint Low-Rank Adaptation
ACL 2025
Flexibly Distilled 3D Rectified Flow with Anatomical Constraints for Developmental Infant Brain MRI Prediction
MICCAI 2025
Controllable Flow Matching for 3D Contrast-Enhanced Brain MRI Synthesis from Non-Contrast Scans
MICCAI 2025
RepoDebug: Repository-Level Multi-Task and Multi-Language Debugging Evaluation of Large Language Models
EMNLP 2025
HomeBench: Evaluating LLMs in Smart Homes with Valid and Invalid Instructions Across Single and Multiple Devices
ACL 2025
Curiosity-Driven Reinforcement Learning from Human Feedback
ACL 2025
Inner Thinking Transformer: Leveraging Dynamic Depth Scaling to Foster Adaptive Internal Thinking
ACL 2025
TransBench: Breaking Barriers for Transferable Graphical User Interface Agents in Dynamic Digital Environments
ACL 2025
ToolSpectrum: Towards Personalized Tool Utilization for Large Language Models
ACL 2025
FlashMask: Efficient and Rich Mask Extension of FlashAttention
ICLR 2025
SafeToolBench: Pioneering a Prospective Benchmark to Evaluating Tool Utilization Safety in LLMs
EMNLP 2025
Mixture of Hidden-Dimensions: Not All Hidden-Statesโ Dimensions are Needed in Transformer
ICML 2025
An Empirical Study of Consistency Regularization for End-to-End Speech-to-Text Translation
NAACL 2024
BASES: Large-scale Web Search User Simulation with Large Language Model based Agents
EMNLP 2024
Learning Multilingual Sentence Representations with Cross-lingual Consistency Regularization
EMNLP 2023
Towards Boosting the Open-Domain Chatbot with Human Feedback
ACL 2023
TOME: A Two-stage Approach for Model-based Retrieval
ACL 2023
XDailyDialog: A Multilingual Parallel Dialogue Corpus
ACL 2023
Towards Zero-Shot Persona Dialogue Generation with In-Context Learning
ACL 2023
Improving Zero-shot Multilingual Neural Machine Translation by Leveraging Cross-lingual Consistency Regularization
ACL 2023
Implicit Regularization and Entrywise Convergence of Riemannian Optimization for Low Tucker-Rank Tensor Completion
JMLR 2023
Less Learn Shortcut: Analyzing and Mitigating Learning of Spurious Feature-Label Correlation
IJCAI 2023
Dual Meta-Learning with Longitudinally Consistent Regularization for One-Shot Brain Tissue Segmentation Across the Human Lifespan
ICCV 2023
A Thorough Examination on Zero-shot Dense Retrieval
EMNLP 2023
ERNIE-ViLG 2.0: Improving Text-to-Image Diffusion Model With Knowledge-Enhanced Mixture-of-Denoising-Experts
CVPR 2023
A Fine-grained Interpretability Evaluation Benchmark for Neural NLP
CONLL 2022
DuReader-Retrieval: A Large-scale Chinese Benchmark for Passage Retrieval from Web Search Engine
EMNLP 2022
DuQM: A Chinese Dataset of Linguistically Perturbed Natural Questions for Evaluating the Robustness of Question Matching Models
EMNLP 2022
PLATO-Ad: A Unified Advertisement Text Generation Framework with Multi-Task Prompt Learning
EMNLP 2022
Clip-Tuning: Towards Derivative-free Prompt Learning with a Mixture of Rewards
EMNLP 2022
ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document Understanding
EMNLP 2022
A Fine-grained Interpretability Evaluation Benchmark for Neural NLP
EMNLP 2022
Where to Go for the Holidays: Towards Mixed-Type Dialogs for Clarification of User Goals
ACL 2022
Bi-SimCut: A Simple Strategy for Boosting Neural Machine Translation
NAACL 2022
Findings of the Third Workshop on Automatic Simultaneous Translation
NAACL 2022
UNIMO-2: End-to-End Unified Vision-Language Grounded Learning
ACL 2022
Is Discourse Role Important for Emotion Recognition in Conversation?
AAAI 2022
PLATO-XL: Exploring the Large-scale Pre-training of Dialogue Generation
AACL 2022
Long Time No See! Open-Domain Conversation with Long-Term Persona Memory
ACL 2022
DuReadervis: A Chinese Dataset for Open-domain Document Visual Question Answering
ACL 2022
Learning Adaptive Segmentation Policy for End-to-End Simultaneous Translation
ACL 2022
DuReader_robust: A Chinese Dataset Towards Evaluating Robustness and Generalization of Machine Reading Comprehension in Real-World Applications
IJCNLP 2021
ERNIE-ViL: Knowledge Enhanced Vision-Language Representations through Scene Graphs
AAAI 2021
Discovering Dialog Structure Graph for Coherent Dialog Generation
ACL 2021
UNIMO: Towards Unified-Modal Understanding and Generation via Cross-Modal Contrastive Learning
ACL 2021
ERNIE-Doc: A Retrospective Long-Document Modeling Transformer
ACL 2021
BASS: Boosting Abstractive Summarization with Unified Semantic Graph
ACL 2021
DuReader_robust: A Chinese Dataset Towards Evaluating Robustness and Generalization of Machine Reading Comprehension in Real-World Applications
ACL 2021
Link Prediction on N-ary Relational Facts: A Graph-based Approach
ACL 2021
PAIR: Leveraging Passage-Centric Similarity Relation for Improving Dense Passage Retrieval
ACL 2021
Correcting Chinese Spelling Errors with Phonetic Pre-training
ACL 2021
PLATO-2: Towards Building an Open-Domain Chatbot via Curriculum Learning
ACL 2021
ERNIE-M: Enhanced Multilingual Representation by Aligning Cross-lingual Semantics with Monolingual Corpora
EMNLP 2021
RocketQAv2: A Joint Training Method for Dense Passage Retrieval and Passage Re-ranking
EMNLP 2021
SgSum:Transforming Multi-document Summarization into Sub-graph Selection
EMNLP 2021
DuRecDial 2.0: A Bilingual Parallel Corpus for Conversational Recommendation
EMNLP 2021
Data Augmentation with Hierarchical SQL-to-Question Generation for Cross-domain Text-to-SQL Parsing
EMNLP 2021
Mixup Decoding for Diverse Machine Translation
EMNLP 2021
PLATO-KAG: Unsupervised Knowledge-Grounded Conversation via Joint Modeling
EMNLP 2021
Discovering Dialog Structure Graph for Coherent Dialog Generation
IJCNLP 2021
UNIMO: Towards Unified-Modal Understanding and Generation via Cross-Modal Contrastive Learning
IJCNLP 2021
ERNIE-Doc: A Retrospective Long-Document Modeling Transformer
IJCNLP 2021
BASS: Boosting Abstractive Summarization with Unified Semantic Graph
IJCNLP 2021
Link Prediction on N-ary Relational Facts: A Graph-based Approach
IJCNLP 2021
PAIR: Leveraging Passage-Centric Similarity Relation for Improving Dense Passage Retrieval
IJCNLP 2021
Correcting Chinese Spelling Errors with Phonetic Pre-training
IJCNLP 2021
PLATO-2: Towards Building an Open-Domain Chatbot via Curriculum Learning
IJCNLP 2021
ERNIE-Gram: Pre-Training with Explicitly N-Gram Masked Language Modeling for Natural Language Understanding
NAACL 2021
RocketQA: An Optimized Training Approach to Dense Passage Retrieval for Open-Domain Question Answering
NAACL 2021
BSTC: A Large-Scale Chinese-English Speech Translation Dataset
NAACL 2021
Findings of the Second Workshop on Automatic Simultaneous Translation
NAACL 2021
Enhancing Dialog Coherence with Event Graph Grounded Content Planning
IJCAI 2020
Learning Adaptive Segmentation Policy for Simultaneous Translation
EMNLP 2020
Knowledge Graph Grounded Goal Planning for Open-Domain Conversation Generation
AAAI 2020
Synchronous Speech Recognition and Speech-to-Text Translation with Interactive Decoding
AAAI 2020
Towards Conversational Recommendation over Multi-Type Dialogs
ACL 2020
PLATO: Pre-trained Dialogue Generation Model with Discrete Latent Variable
ACL 2020
DuSQL: A Large-Scale and Pragmatic Chinese Text-to-SQL Dataset
EMNLP 2020
ERNIE 2.0: A Continual Pre-Training Framework for Language Understanding
AAAI 2020
ERNIE-GEN: An Enhanced Multi-Flow Pre-training and Fine-tuning Framework for Natural Language Generation
IJCAI 2020
Conversational Graph Grounded Policy Learning for Open-Domain Conversation Generation
ACL 2020
SKEP: Sentiment Knowledge Enhanced Pre-training for Sentiment Analysis
ACL 2020
Leveraging Graph to Improve Abstractive Multi-Document Summarization
ACL 2020
End-to-End Speech Translation with Knowledge Distillation
INTERSPEECH 2019
Proactive Human-Machine Conversation with Explicit Conversation Goal
ACL 2019
STACL: Simultaneous Translation with Implicit Anticipation and Controllable Latency using Prefix-to-Prefix Framework
ACL 2019
Joint Extraction of Entities and Overlapping Relations Using Position-Attentive Sequence Labeling
AAAI 2019
Multi-agent Learning for Neural Machine Translation
IJCNLP 2019
Baidu Neural Machine Translation Systems for WMT19
ACL 2019
Knowledge Aware Conversation Generation with Explainable Reasoning over Augmented Graphs
IJCNLP 2019
Multi-agent Learning for Neural Machine Translation
EMNLP 2019
Knowledge Aware Conversation Generation with Explainable Reasoning over Augmented Graphs
EMNLP 2019
D-NET: A Pre-Training and Fine-Tuning Framework for Improving the Generalization of Machine Reading Comprehension
EMNLP 2019
Modeling Coherence for Discourse Neural Machine Translation
AAAI 2019
DuReader: a Chinese Machine Reading Comprehension Dataset from Real-world Applications
ACL 2018
Improving Entity Recommendation with Search Log and Multi-Task Learning
IJCAI 2018
Multi-Passage Machine Reading Comprehension with Cross-Passage Answer Verification
ACL 2018
Multi-task Attention-based Neural Networks for Implicit Discourse Relationship Representation and Identification
EMNLP 2017
Learning to Explain Entity Relationships by Pairwise Ranking with Convolutional Neural Networks
IJCAI 2017
Active Learning for Dependency Parsing with Partial Annotation
ACL 2016
A Unified Architecture for Semantic Role Labeling and Relation Classification
COLING 2016
Chinese Poetry Generation with Planning based Neural Network
COLING 2016
A Universal Framework for Inductive Transfer Parsing across Multi-typed Treebanks
COLING 2016
Generating Recommendation Evidence Using Translation Model
IJCAI 2016
Cross-lingual Dependency Parsing Based on Distributed Representations
IJCNLP 2015
Multi-Task Learning for Multiple Language Translation
ACL 2015
Cross-lingual Dependency Parsing Based on Distributed Representations
ACL 2015
Multi-Task Learning for Multiple Language Translation
IJCNLP 2015
Learning Sense-specific Word Embeddings By Exploiting Bilingual Resources
COLING 2014
Policy Learning for Domain Selection in an Extensible Multi-domain Spoken Dialogue System
EMNLP 2014
Revisiting Embedding Features for Simple Semi-supervised Learning
EMNLP 2014
Improve Statistical Machine Translation with Context-Sensitive Bilingual Semantic Embedding Model
EMNLP 2014
Transformation from Discontinuous to Continuous Word Alignment Improves Translation Quality
EMNLP 2014
Improving Pivot-Based Statistical Machine Translation by Pivoting the Co-occurrence Count of Phrase Pairs
EMNLP 2014
Learning Semantic Hierarchies via Word Embeddings
ACL 2014
Bootstrapping Large-scale Named Entities using URL-Text Hybrid Patterns
IJCNLP 2013
Improving Pivot-Based Statistical Machine Translation Using Random Walk
EMNLP 2013
A Hierarchical Semantics-Aware Distributional Similarity Scheme
IJCNLP 2013
Translation Model Adaptation for Statistical Machine Translation with Monolingual Topic Information
ACL 2012
Improve SMT Quality with Automatically Extracted Paraphrase Rules
ACL 2012
User Behaviors Lend a Helping Hand: Learning Paraphrase Query Patterns from Search Log Sessions
COLING 2012
Enriching SMT Training Data via Paraphrasing
IJCNLP 2011
Automatically Generating Questions from Queries for Community-based Question Answering
IJCNLP 2011
Harvesting Related Entities with a Search Engine
IJCNLP 2011
Reordering with Source Language Collocations
ACL 2011
Proceedings of 5th International Joint Conference on Natural Language Processing
IJCNLP 2011
Coling 2010: Paraphrases and ApplicationsโTutorial notes
COLING 2010
Paraphrasing with Search Engine Query Logs
COLING 2010
Paraphrases and Applications
COLING 2010
Improving Statistical Machine Translation with Monolingual Collocation
ACL 2010
Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics: Tutorial Abstracts
ACL 2010
Leveraging Multiple MT Engines for Paraphrase Generation
COLING 2010
Dependency Based Chinese Sentence Realization
IJCNLP 2009
Revisiting Pivot Language Approach for Machine Translation
ACL 2009
Dependency Based Chinese Sentence Realization
ACL 2009
Exploiting Heterogeneous Treebanks for Parsing
IJCNLP 2009
Revisiting Pivot Language Approach for Machine Translation
IJCNLP 2009
Exploiting Heterogeneous Treebanks for Parsing
ACL 2009
Collocation Extraction Using Monolingual Word Alignment Method
EMNLP 2009
Pivot Approach for Extracting Paraphrase Patterns from Bilingual Corpora
ACL 2008
Dependency-Based N-Gram Models for General Purpose Sentence Realisation
COLING 2008
Prediction of Maximal Projection for Semantic Role Labeling
COLING 2008
Domain Adaptation for Statistical Machine Translation with Domain Dictionary and Monolingual Corpora
COLING 2008
Using RBMT Systems to Produce Bilingual Corpus for SMT
CONLL 2007
Recovering Non-Local Dependencies for Chinese
CONLL 2007
Pivot Language Approach for Phrase-Based Statistical Machine Translation
ACL 2007
Using RBMT Systems to Produce Bilingual Corpus for SMT
EMNLP 2007
Recovering Non-Local Dependencies for Chinese
EMNLP 2007
Discriminative Pruning of Language Models for Chinese Word Segmentation
ACL 2006
The Effect of Translation Quality in MT-Based Cross-Language Information Retrieval
ACL 2006
An Equivalent Pseudoword Solution to Chinese Word Sense Disambiguation
ACL 2006
An Equivalent Pseudoword Solution to Chinese Word Sense Disambiguation
COLING 2006
The Effect of Translation Quality in MT-Based Cross-Language Information Retrieval
COLING 2006
Discriminative Pruning of Language Models for Chinese Word Segmentation
COLING 2006
Word Alignment for Languages with Scarce Resources Using Bilingual Corpora of Other Language Pairs
COLING 2006
Boosting Statistical Word Alignment Using Labeled and Unlabeled Data
COLING 2006
Boosting Statistical Word Alignment Using Labeled and Unlabeled Data
ACL 2006
Word Alignment for Languages with Scarce Resources Using Bilingual Corpora of Other Language Pairs
ACL 2006
Improving Statistical Word Alignment with Ensemble Methods
IJCNLP 2005
Alignment Model Adaptation for Domain-Specific Word Alignment
ACL 2005
Improving Statistical Word Alignment with a Rule-Based Machine Translation System
COLING 2004
Improving Domain-Specific Word Alignment for Computer Assisted Translation
ACL 2004