Hua Wu
172 papers · 2000–2026 · 15 conferences · across top CS/AI conferences
Achievements
Hot Topic Early Bird
Taxonomy Completionist (19)
Interdisciplinary Bridge
Keyword Pioneer
Conference Polyglot (15)
Cross-Pollinator (9)
Conference Loyalist (65)
Keyword Trendsetter Combo (6)
Dynamic Duo (112)
Topic Evolution
Grand Slam
Deep Specialist (19)
Keyword Champion
Unstoppable (23)
Trend Setter
Conference Pioneer
Prolific Year (18)
Century Club (167)
Keyword Collector (52)
Conferences
ACL (69)
EMNLP (44)
IJCNLP (16)
COLING (9)
AAAI (8)
NAACL (8)
IJCAI (7)
ICLR (3)
CONLL (2)
AACL (1)
CVPR (1)
ICML (1)
INTERSPEECH (1)
JMLR (1)
NIPS (1)
Keywords
dialogue system (12)
large language model (10)
neural machine translation (9)
pre-trained language model (9)
reinforcement learning (9)
neural network (8)
attention mechanism (7)
machine reading comprehension (7)
knowledge distillation (7)
dialogue generation (6)
response generation (6)
language model (6)
question answering (6)
simultaneous translation (6)
transfer learning (6)
knowledge graph (5)
graph neural network (5)
text generation (5)
information retrieval (5)
machine translation (5)
Papers
AttnPO: Attention-Guided Process Supervision for Efficient Reasoning
ACL 2026
Reinforced Informativeness Optimization for Long-Form Retrieval-Augmented Generation
ACL 2026
Distributional Clarity: The Hidden Driver of RL-Friendliness in Large Language Models
ACL 2026
Uncertainty-Aware Routing for Principled Alignment with MoE Dynamics
ACL 2026
BEE-RAG: Balanced Entropy Engineering for Retrieval-Augmented Generation
AAAI 2026
AlignX: Advancing Multilingual Large Language Models with Multilingual Representation Alignment
EMNLP 2025
BeamLoRA: Beam-Constraint Low-Rank Adaptation
ACL 2025
HFT: Half Fine-Tuning for Large Language Models
ACL 2025
Curiosity-Driven Reinforcement Learning from Human Feedback
ACL 2025
Inner Thinking Transformer: Leveraging Dynamic Depth Scaling to Foster Adaptive Internal Thinking
ACL 2025
Upcycling Instruction Tuning from Dense to Mixture-of-Experts via Parameter Merging
ACL 2025
Mixture of Hidden-Dimensions: Not All Hidden-States' Dimensions are Needed in Transformer
ICML 2025
MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions
ICLR 2025
Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation
COLING 2025
Weights-Rotated Preference Optimization for Large Language Models
EMNLP 2025
On Training Data Influence of GPT Models
EMNLP 2024
An Empirical Study of Consistency Regularization for End-to-End Speech-to-Text Translation
NAACL 2024
QDMR-based Planning-and-Solving Prompting for Complex Reasoning Tasks
COLING 2024
NACL: A General and Effective KV Cache Eviction Framework for LLM at Inference Time
ACL 2024
LEMON: Reviving Stronger and Smaller LMs from Larger LMs with Linear Parameter Fusion
ACL 2024
Tool-Augmented Reward Modeling
ICLR 2024
BASES: Large-scale Web Search User Simulation with Large Language Model based Agents
EMNLP 2024
Autoregressive Pre-Training on Pixels and Texts
EMNLP 2024
ERNIE-Music: Text-to-Waveform Music Generation with Diffusion Models
IJCNLP 2023
Learning In-context Learning for Named Entity Recognition
ACL 2023
Towards Zero-Shot Persona Dialogue Generation with In-Context Learning
ACL 2023
ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for Programming Languages
ACL 2023
Improving Zero-shot Multilingual Neural Machine Translation by Leveraging Cross-lingual Consistency Regularization
ACL 2023
SQLFlow: An Extensible Toolkit Integrating DB and AI
JMLR 2023
TOME: A Two-stage Approach for Model-based Retrieval
ACL 2023
Less Learn Shortcut: Analyzing and Mitigating Learning of Spurious Feature-Label Correlation
IJCAI 2023
A Thorough Examination on Zero-shot Dense Retrieval
EMNLP 2023
IAEval: A Comprehensive Evaluation of Instance Attribution on Natural Language Understanding
EMNLP 2023
Learning Multilingual Sentence Representations with Cross-lingual Consistency Regularization
EMNLP 2023
IBADR: an Iterative Bias-Aware Dataset Refinement Framework for Debiasing NLU models
EMNLP 2023
ERNIE-ViLG 2.0: Improving Text-to-Image Diffusion Model With Knowledge-Enhanced Mixture-of-Denoising-Experts
CVPR 2023
Universal Information Extraction as Unified Semantic Matching
AAAI 2023
Query Enhanced Knowledge-Intensive Conversation via Unsupervised Joint Modeling
ACL 2023
Towards Boosting the Open-Domain Chatbot with Human Feedback
ACL 2023
Q-TOD: A Query-driven Task-oriented Dialogue System
EMNLP 2022
DuQM: A Chinese Dataset of Linguistically Perturbed Natural Questions for Evaluating the Robustness of Question Matching Models
EMNLP 2022
PLATO-Ad: A Unified Advertisement Text Generation Framework with Multi-Task Prompt Learning
EMNLP 2022
Clip-Tuning: Towards Derivative-free Prompt Learning with a Mixture of Rewards
EMNLP 2022
FRSUM: Towards Faithful Abstractive Summarization via Enhancing Factual Robustness
EMNLP 2022
ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document Understanding
EMNLP 2022
A Fine-grained Interpretability Evaluation Benchmark for Neural NLP
EMNLP 2022
Bi-SimCut: A Simple Strategy for Boosting Neural Machine Translation
NAACL 2022
Non-Autoregressive Chinese ASR Error Correction with Phonological Training
NAACL 2022
UNIMO-2: End-to-End Unified Vision-Language Grounded Learning
ACL 2022
PLATO-XL: Exploring the Large-scale Pre-training of Dialogue Generation
AACL 2022
Findings of the Third Workshop on Automatic Simultaneous Translation
NAACL 2022
Where to Go for the Holidays: Towards Mixed-Type Dialogs for Clarification of User Goals
ACL 2022
PLANET: Dynamic Content Planning in Autoregressive Transformers for Long-form Text Generation
ACL 2022
Unified Structure Generation for Universal Information Extraction
ACL 2022
Learning Adaptive Segmentation Policy for End-to-End Simultaneous Translation
ACL 2022
DuReader_vis: A Chinese Dataset for Open-domain Document Visual Question Answering
ACL 2022
Syntax-guided Contrastive Learning for Pre-trained Language Model
ACL 2022
DU-VLG: Unifying Vision-and-Language Generation via Dual Sequence-to-Sequence Pre-training
ACL 2022
DuReader-Retrieval: A Large-scale Chinese Benchmark for Passage Retrieval from Web Search Engine
EMNLP 2022
CDConv: A Benchmark for Contradiction Detection in Chinese Conversations
EMNLP 2022
Long Time No See! Open-Domain Conversation with Long-Term Persona Memory
ACL 2022
A Fine-grained Interpretability Evaluation Benchmark for Neural NLP
CONLL 2022
Fine-grained Entity Typing via Label Reasoning
EMNLP 2021
ERNIE-ViL: Knowledge Enhanced Vision-Language Representations through Scene Graphs
AAAI 2021
Discovering Dialog Structure Graph for Coherent Dialog Generation
ACL 2021
UNIMO: Towards Unified-Modal Understanding and Generation via Cross-Modal Contrastive Learning
ACL 2021
ERNIE-Doc: A Retrospective Long-Document Modeling Transformer
ACL 2021
BASS: Boosting Abstractive Summarization with Unified Semantic Graph
ACL 2021
DuReader_robust: A Chinese Dataset Towards Evaluating Robustness and Generalization of Machine Reading Comprehension in Real-World Applications
ACL 2021
PAIR: Leveraging Passage-Centric Similarity Relation for Improving Dense Passage Retrieval
ACL 2021
Correcting Chinese Spelling Errors with Phonetic Pre-training
ACL 2021
PLATO-2: Towards Building an Open-Domain Chatbot via Curriculum Learning
ACL 2021
ERNIE-M: Enhanced Multilingual Representation by Aligning Cross-lingual Semantics with Monolingual Corpora
EMNLP 2021
RocketQAv2: A Joint Training Method for Dense Passage Retrieval and Passage Re-ranking
EMNLP 2021
SgSum: Transforming Multi-document Summarization into Sub-graph Selection
EMNLP 2021
DuRecDial 2.0: A Bilingual Parallel Corpus for Conversational Recommendation
EMNLP 2021
Learning with Noisy Correspondence for Cross-modal Matching
NIPS 2021
Data Augmentation with Hierarchical SQL-to-Question Generation for Cross-domain Text-to-SQL Parsing
EMNLP 2021
Mixup Decoding for Diverse Machine Translation
EMNLP 2021
Amendable Generation for Dialogue State Tracking
EMNLP 2021
PLATO-KAG: Unsupervised Knowledge-Grounded Conversation via Joint Modeling
EMNLP 2021
Weakly Supervised Dense Video Captioning via Jointly Usage of Knowledge Distillation and Cross-modal Matching
IJCAI 2021
Discovering Dialog Structure Graph for Coherent Dialog Generation
IJCNLP 2021
UNIMO: Towards Unified-Modal Understanding and Generation via Cross-Modal Contrastive Learning
IJCNLP 2021
ERNIE-Doc: A Retrospective Long-Document Modeling Transformer
IJCNLP 2021
BASS: Boosting Abstractive Summarization with Unified Semantic Graph
IJCNLP 2021
DuReader_robust: A Chinese Dataset Towards Evaluating Robustness and Generalization of Machine Reading Comprehension in Real-World Applications
IJCNLP 2021
PAIR: Leveraging Passage-Centric Similarity Relation for Improving Dense Passage Retrieval
IJCNLP 2021
Correcting Chinese Spelling Errors with Phonetic Pre-training
IJCNLP 2021
PLATO-2: Towards Building an Open-Domain Chatbot via Curriculum Learning
IJCNLP 2021
ERNIE-Gram: Pre-Training with Explicitly N-Gram Masked Language Modeling for Natural Language Understanding
NAACL 2021
RocketQA: An Optimized Training Approach to Dense Passage Retrieval for Open-Domain Question Answering
NAACL 2021
BSTC: A Large-Scale Chinese-English Speech Translation Dataset
NAACL 2021
Findings of the Second Workshop on Automatic Simultaneous Translation
NAACL 2021
ERNIE-GEN: An Enhanced Multi-Flow Pre-training and Fine-tuning Framework for Natural Language Generation
IJCAI 2020
Enhancing Dialog Coherence with Event Graph Grounded Content Planning
IJCAI 2020
Learning Adaptive Segmentation Policy for Simultaneous Translation
EMNLP 2020
DuSQL: A Large-Scale and Pragmatic Chinese Text-to-SQL Dataset
EMNLP 2020
Diversified Multiple Instance Learning for Document-Level Multi-Aspect Sentiment Classification
EMNLP 2020
Syntactic and Semantic-driven Learning for Open Information Extraction
EMNLP 2020
PLATO: Pre-trained Dialogue Generation Model with Discrete Latent Variable
ACL 2020
Towards Conversational Recommendation over Multi-Type Dialogs
ACL 2020
Conversational Graph Grounded Policy Learning for Open-Domain Conversation Generation
ACL 2020
Knowledge Graph Grounded Goal Planning for Open-Domain Conversation Generation
AAAI 2020
SKEP: Sentiment Knowledge Enhanced Pre-training for Sentiment Analysis
ACL 2020
Leveraging Graph to Improve Abstractive Multi-Document Summarization
ACL 2020
Exploring Contextual Word-level Style Relevance for Unsupervised Style Transfer
ACL 2020
Synchronous Speech Recognition and Speech-to-Text Translation with Interactive Decoding
AAAI 2020
ERNIE 2.0: A Continual Pre-Training Framework for Language Understanding
AAAI 2020
Enhancing Local Feature Extraction with Global Representation for Neural Text Classification
IJCNLP 2019
End-to-End Speech Translation with Knowledge Distillation
INTERSPEECH 2019
Multi-agent Learning for Neural Machine Translation
IJCNLP 2019
Baidu Neural Machine Translation Systems for WMT19
ACL 2019
Know More about Each Other: Evolving Dialogue Strategy via Compound Assessment
ACL 2019
Proactive Human-Machine Conversation with Explicit Conversation Goal
ACL 2019
STACL: Simultaneous Translation with Implicit Anticipation and Controllable Latency using Prefix-to-Prefix Framework
ACL 2019
Enhancing Pre-Trained Language Representations with Rich Knowledge for Machine Reading Comprehension
ACL 2019
ARNOR: Attention Regularization based Noise Reduction for Distant Supervision Relation Classification
ACL 2019
Enhancing Local Feature Extraction with Global Representation for Neural Text Classification
EMNLP 2019
Multi-agent Learning for Neural Machine Translation
EMNLP 2019
Addressing the Under-Translation Problem from the Entropy Perspective
AAAI 2019
Knowledge Aware Conversation Generation with Explainable Reasoning over Augmented Graphs
EMNLP 2019
Modeling Coherence for Discourse Neural Machine Translation
AAAI 2019
Knowledge Aware Conversation Generation with Explainable Reasoning over Augmented Graphs
IJCNLP 2019
D-NET: A Pre-Training and Fine-Tuning Framework for Improving the Generalization of Machine Reading Comprehension
EMNLP 2019
Generating Multiple Diverse Responses with Multi-Mapping and Posterior Mapping Selection
IJCAI 2019
Learning to Select Knowledge for Response Generation in Dialog Systems
IJCAI 2019
Multi-Passage Machine Reading Comprehension with Cross-Passage Answer Verification
ACL 2018
A New Method of Region Embedding for Text Classification
ICLR 2018
Addressing Troublesome Words in Neural Machine Translation
EMNLP 2018
Multi-Turn Response Selection for Chatbots with Deep Attention Matching Network
ACL 2018
DuReader: a Chinese Machine Reading Comprehension Dataset from Real-world Applications
ACL 2018
An End-to-End Model for Question Answering over Knowledge Base with Cross-Attention Combining Global Knowledge
ACL 2017
Multi-view Response Selection for Human-Computer Conversation
EMNLP 2016
Semi-Supervised Learning for Neural Machine Translation
ACL 2016
Minimum Risk Training for Neural Machine Translation
ACL 2016
Active Learning for Dependency Parsing with Partial Annotation
ACL 2016
Agreement-Based Joint Training for Bidirectional Attention-Based Neural Machine Translation
IJCAI 2016
Chinese Poetry Generation with Planning based Neural Network
COLING 2016
Latent Topic Embedding
COLING 2016
Multi-Task Learning for Multiple Language Translation
ACL 2015
Multi-Task Learning for Multiple Language Translation
IJCNLP 2015
Policy Learning for Domain Selection in an Extensible Multi-domain Spoken Dialogue System
EMNLP 2014
Improving Pivot-Based Statistical Machine Translation by Pivoting the Co-occurrence Count of Phrase Pairs
EMNLP 2014
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
ACL 2014
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
ACL 2014
Transformation from Discontinuous to Continuous Word Alignment Improves Translation Quality
EMNLP 2014
Improve Statistical Machine Translation with Context-Sensitive Bilingual Semantic Embedding Model
EMNLP 2014
Improving Pivot-Based Statistical Machine Translation Using Random Walk
EMNLP 2013
Translation Model Adaptation for Statistical Machine Translation with Monolingual Topic Information
ACL 2012
Improve SMT Quality with Automatically Extracted Paraphrase Rules
ACL 2012
Reordering with Source Language Collocations
ACL 2011
Improving Statistical Machine Translation with Monolingual Collocation
ACL 2010
Exploiting Heterogeneous Treebanks for Parsing
ACL 2009
Exploiting Heterogeneous Treebanks for Parsing
IJCNLP 2009
Revisiting Pivot Language Approach for Machine Translation
IJCNLP 2009
Collocation Extraction Using Monolingual Word Alignment Method
EMNLP 2009
Revisiting Pivot Language Approach for Machine Translation
ACL 2009
Domain Adaptation for Statistical Machine Translation with Domain Dictionary and Monolingual Corpora
COLING 2008
Using RBMT Systems to Produce Bilingual Corpus for SMT
EMNLP 2007
Pivot Language Approach for Phrase-Based Statistical Machine Translation
ACL 2007
Using RBMT Systems to Produce Bilingual Corpus for SMT
CONLL 2007
Word Alignment for Languages with Scarce Resources Using Bilingual Corpora of Other Language Pairs
ACL 2006
Word Alignment for Languages with Scarce Resources Using Bilingual Corpora of Other Language Pairs
COLING 2006
Boosting Statistical Word Alignment Using Labeled and Unlabeled Data
ACL 2006
Boosting Statistical Word Alignment Using Labeled and Unlabeled Data
COLING 2006
Alignment Model Adaptation for Domain-Specific Word Alignment
ACL 2005
Improving Statistical Word Alignment with Ensemble Methods
IJCNLP 2005
Improving Statistical Word Alignment with a Rule-Based Machine Translation System
COLING 2004
Improving Domain-Specific Word Alignment for Computer Assisted Translation
ACL 2004
Synonymous Collocation Extraction Using Translation Information
ACL 2003
Chinese Generation in a Spoken Dialogue Translation System
COLING 2000