Ming Zhou
241 papers · 2000–2025 · 19 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+20 more ↓ Show less ↑
πΊοΈ Taxonomy Completionist (28) π§ Keyword Pioneer π Interdisciplinary Bridge π Renaissance Researcher (6) π£ Hot Topic Early Bird
π
Renaissance Researcher
(6)
π
Interdisciplinary Bridge
π§
Keyword Pioneer
π
Keyword Trendsetter Combo
(11)
π
Conference Loyalist
(49)
π
Keyword Champion
(2)
π€
Dynamic Duo
(61)
π
Grand Slam
π
Triple Crown
π₯
Mega-Team
(32)
π±
Topic Pioneer
π¬
Deep Specialist
(20)
π§¬
Topic Evolution
ποΈ
Keyword Collector
(80)
π
Conference Pioneer
π
Century Club
(241)
π₯
Unstoppable
(24)
π
Trend Setter
β‘
Prolific Year
(19)
β
The Questioner
(2)
Conferences
ACL (87)
EMNLP (49)
COLING (33)
IJCNLP (18)
NAACL (8)
IJCAI (8)
AAAI (7)
CONLL (6)
ICML (4)
INTERSPEECH (4)
NIPS (4)
ICLR (3)
SEMEVAL (3)
EACL (2)
CVPR (1)
ECCV (1)
CORL (1)
AISTATS (1)
JMLR (1)
Top co-authors
Research topics
Keywords
neural machine translation
(15)
question answering
(12)
neural network
(10)
attention mechanism
(9)
language model
(8)
text generation
(8)
question generation
(7)
contrastive learning
(7)
pre-trained language model
(7)
machine translation
(6)
semantic parsing
(6)
extractive summarization
(6)
transfer learning
(6)
document summarization
(6)
sequence-to-sequence model
(5)
unsupervised learning
(5)
dialogue system
(5)
reinforcement learning
(5)
knowledge distillation
(5)
machine reading comprehension
(5)
Papers
Task-level Distributionally Robust Optimization for Large Language Model-based Dense Retrieval
AAAI 2025
Efficient Skill Discovery via Regret-Aware Optimization
ICML 2025
Breaking Language Barriers: Cross-Lingual Continual Pre-Training at Scale
EMNLP 2024
HORIZON: High-Resolution Semantically Controlled Panorama Synthesis
AAAI 2024
A Reinforcement Learning Approach to Improve Low-Resource Machine Translation Leveraging Domain Monolingual Data
COLING 2024
LLMaAA: Making Large Language Models as Active Annotators
EMNLP 2023
A Hybrid Detection and Generation Framework with Separate Encoders for Event Extraction
EACL 2023
MT2: Towards a Multi-Task Machine Translation Model with Translation-Specific In-Context Learning
EMNLP 2023
MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning
JMLR 2023
UniXcoder: Unified Cross-Modal Pre-training for Code Representation
ACL 2022
Recovering Gold from Black Sand: Multilingual Dense Passage Retrieval with Hard and False Negative Samples
EMNLP 2022
Analytical Reasoning of Text
NAACL 2022
ProQA: Structural Prompt-based Pre-training for Unified Question Answering
NAACL 2022
BlonDe: An Automatic Evaluation Metric for Document-level Machine Translation
NAACL 2022
Instance Regularization for Discriminative Language Model Pre-training
EMNLP 2022
Reasoning over Hybrid Chain for Table-and-Text Open Domain Question Answering
IJCAI 2022
Trace Controlled Text to Image Generation
ECCV 2022
Logic-Driven Context Extension and Data Augmentation for Logical Reasoning of Text
ACL 2022
CoSQA: 20,000+ Web Queries for Code Search and Question Answering
ACL 2021
Learning to Ask Conversational Questions by Optimizing Levenshtein Distance
ACL 2021
Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts
IJCAI 2021
SemFace: Pre-training Encoder and Decoder with a Semantic Interface for Neural Machine Translation
ACL 2021
Control Image Captioning Spatially and Temporally
ACL 2021
BANG: Bridging Autoregressive and Non-autoregressive Generation with Large Scale Pretraining
ICML 2021
Compare to The Knowledge: Graph Neural Fake News Detection with External Knowledge
ACL 2021
GraphCodeBERT: Pre-training Code Representations with Data Flow
ICLR 2021
Discovering Representation Sprachbund For Multilingual Pre-Training
EMNLP 2021
Jointly Learning to Repair Code and Generate Commit Message
EMNLP 2021
Smart-Start Decoding for Neural Machine Translation
NAACL 2021
InfoXLM: An Information-Theoretic Framework for Cross-Lingual Language Model Pre-Training
NAACL 2021
K-Adapter: Infusing Knowledge into Pre-Trained Models with Adapters
ACL 2021
Grammar-Based Patches Generation for Automated Program Repair
ACL 2021
K-Adapter: Infusing Knowledge into Pre-Trained Models with Adapters
IJCNLP 2021
Grammar-Based Patches Generation for Automated Program Repair
IJCNLP 2021
GLGE: A New General Language Generation Evaluation Benchmark
IJCNLP 2021
CoSQA: 20,000+ Web Queries for Code Search and Question Answering
IJCNLP 2021
Learning to Ask Conversational Questions by Optimizing Levenshtein Distance
IJCNLP 2021
SemFace: Pre-training Encoder and Decoder with a Semantic Interface for Neural Machine Translation
IJCNLP 2021
Control Image Captioning Spatially and Temporally
IJCNLP 2021
Compare to The Knowledge: Graph Neural Fake News Detection with External Knowledge
IJCNLP 2021
GLGE: A New General Language Generation Evaluation Benchmark
ACL 2021
XGLUE: A New Benchmark Dataset for Cross-lingual Pre-training, Understanding and Generation
EMNLP 2020
At Which Level Should We Extract? An Empirical Analysis on Extractive Document Summarization
COLING 2020
Unsupervised Fine-tuning for Text Clustering
COLING 2020
DocBank: A Benchmark Dataset for Document Layout Analysis
COLING 2020
Bridging the Gap between Pre-Training and Fine-Tuning for End-to-End Speech Translation
AAAI 2020
Alternating Language Modeling for Cross-Lingual Pre-Training
AAAI 2020
MoBoAligner: A Neural Alignment Model for Non-Autoregressive TTS with Monotonic Boundary Search
INTERSPEECH 2020
Low Latency End-to-End Streaming Speech Recognition with a Scout Network
INTERSPEECH 2020
Semantic Mask for Transformer Based End-to-End Speech Recognition
INTERSPEECH 2020
UniLMv2: Pseudo-Masked Language Models for Unified Language Model Pre-Training
ICML 2020
Self-Adversarial Learning with Comparative Discrimination for Text Generation
ICLR 2020
Multi-Agent Interactions Modeling with Correlated Policies
ICLR 2020
ProphetNet: Predicting Future N-gram for Sequence-to-SequencePre-training
EMNLP 2020
Scheduled DropHead: A Regularization Method for Transformer Models
EMNLP 2020
Unsupervised Extractive Summarization by Pre-training Hierarchical Transformers
EMNLP 2020
CodeBERT: A Pre-Trained Model for Programming and Natural Languages
EMNLP 2020
Improving Grammatical Error Correction with Machine Translation Pairs
EMNLP 2020
Machine Reasoning: Technology, Dilemma and Future
EMNLP 2020
BERT-of-Theseus: Compressing BERT by Progressive Module Replacing
EMNLP 2020
MiniLM: Deep Self-Attention Distillation for Task-Agnostic Compression of Pre-Trained Transformers
NIPS 2020
SMARTS: An Open-Source Scalable Multi-Agent RL Training School for Autonomous Driving
CORL 2020
MuTual: A Dataset for Multi-Turn Dialogue Reasoning
ACL 2020
A Graph-based Coarse-to-fine Method for Unsupervised Bilingual Lexicon Induction
ACL 2020
A Retrieve-and-Rewrite Initialization Method for Unsupervised Machine Translation
ACL 2020
A Simple and Effective Unified Encoder for Document-Level Machine Translation
ACL 2020
MIND: A Large-scale Dataset for News Recommendation
ACL 2020
Curriculum Pre-training for End-to-End Speech Translation
ACL 2020
Graph Neural News Recommendation with Unsupervised Preference Disentanglement
ACL 2020
Improving Neural Machine Translation with Soft Template Prediction
ACL 2020
LogicalFactChecker: Leveraging Logical Operations for Fact Checking with Graph Module Network
ACL 2020
Evidence-Aware Inferential Text Generation with Vector Quantised Variational AutoEncoder
ACL 2020
Reasoning Over Semantic-Level Graph for Fact Checking
ACL 2020
Document Modeling with Graph Attention Networks for Multi-grained Machine Reading Comprehension
ACL 2020
Improving the Efficiency of Grammatical Error Correction with Erroneous Span Detection and Correction
EMNLP 2020
Tell Me How to Ask Again: Question Data Augmentation with Controllable Rewriting in Continuous Space
EMNLP 2020
Leveraging Declarative Knowledge in Text and First-Order Logic for Fine-Grained Propaganda Detection
EMNLP 2020
Pre-training for Abstractive Document Summarization by Reinstating Source Text
EMNLP 2020
Neural Deepfake Detection with Factual Structure of Text
EMNLP 2020
Unified Language Model Pre-training for Natural Language Understanding and Generation
NIPS 2019
A Tensorized Transformer for Language Modeling
NIPS 2019
Unicoder: A Universal Language Encoder by Pre-training with Multiple Cross-lingual Tasks
EMNLP 2019
Response Generation by Context-Aware Prototype Editing
AAAI 2019
Unsupervised Neural Machine Translation with SMT as Posterior Regularization
AAAI 2019
Asking Clarification Questions in Knowledge-Based Question Answering
EMNLP 2019
Explicit Cross-lingual Pre-training for Unsupervised Machine Translation
IJCNLP 2019
Asking Clarification Questions in Knowledge-Based Question Answering
IJCNLP 2019
Unicoder: A Universal Language Encoder by Pre-training with Multiple Cross-lingual Tasks
IJCNLP 2019
Explicit Cross-lingual Pre-training for Unsupervised Machine Translation
EMNLP 2019
Regularizing Neural Machine Translation by Target-Bidirectional Agreement
AAAI 2019
Coupling Retrieval and Meta-Learning for Context-Dependent Semantic Parsing
ACL 2019
BERT-based Lexical Substitution
ACL 2019
HIBERT: Document Level Pre-training of Hierarchical Bidirectional Transformers for Document Summarization
ACL 2019
Automatic Grammatical Error Correction for Sequence-to-sequence Text Generation: An Empirical Study
ACL 2019
Dense Procedure Captioning in Narrated Instructional Videos
ACL 2019
Exponential convergence rates for Batch Normalization: The power of length-direction decoupling in non-convex optimization
AISTATS 2019
Inspecting Unification of Encoding and Matching with Transformer: A Case Study of Machine Reading Comprehension
EMNLP 2019
Visual Question Generation as Dual Task of Visual Question Answering
CVPR 2018
Attention-Guided Answer Distillation for Machine Reading Comprehension
EMNLP 2018
Question Generation from SQL Queries Improves Neural Semantic Parsing
EMNLP 2018
Neural Latent Extractive Document Summarization
EMNLP 2018
Bidirectional Generative Adversarial Networks for Neural Machine Translation
CONLL 2018
Fine-grained Coordinated Cross-lingual Text Stream Alignment for Endless Language Knowledge Acquisition
EMNLP 2018
Mean Field Multi-Agent Reinforcement Learning
ICML 2018
Reinforced Mnemonic Reader for Machine Reading Comprehension
IJCAI 2018
Multiway Attention Networks for Modeling Sentence Pairs
IJCAI 2018
WaveNet Vocoder with Limited Training Data for Voice Conversion
INTERSPEECH 2018
Learning to Collaborate for Question Answering and Asking
NAACL 2018
Dialog-to-Action: Conversational Question Answering Over a Large-Scale Knowledge Base
NIPS 2018
Generative Bridging Network for Neural Sequence Prediction
NAACL 2018
Triangular Architecture for Rare Language Translation
ACL 2018
Semantic Parsing with Syntax- and Table-Aware SQL Generation
ACL 2018
Neural Document Summarization by Jointly Learning to Score and Select Sentences
ACL 2018
Fluency Boost Learning and Inference for Neural Grammatical Error Correction
ACL 2018
Neural Open Information Extraction
ACL 2018
Learning Matching Models with Weak Supervision for Response Selection in Retrieval-based Chatbots
ACL 2018
Gated Self-Matching Networks for Reading Comprehension and Question Answering
ACL 2017
Sequential Matching Network: A New Architecture for Multi-turn Response Selection in Retrieval-Based Chatbots
ACL 2017
Sequence-to-Dependency Neural Machine Translation
ACL 2017
Selective Encoding for Abstractive Sentence Summarization
ACL 2017
Chunk-based Decoder for Neural Machine Translation
ACL 2017
SuperAgent: A Customer Service Chatbot for E-commerce Websites
ACL 2017
Learning to Generate Product Reviews from Attributes
EACL 2017
Entity Linking for Queries by Searching Wikipedia Sentences
EMNLP 2017
Question Generation for Question Answering
EMNLP 2017
Stack-based Multi-layer Attention for Transition-based Dependency Parsing
EMNLP 2017
Improved Neural Machine Translation with Source Syntax
IJCAI 2017
Beihang-MSRA at SemEval-2017 Task 3: A Ranking System with Neural Matching Features for Community Question Answering
SEMEVAL 2017
Unsupervised Word and Dependency Path Embeddings for Aspect Term Extraction
IJCAI 2016
Event Detection with Burst Information Networks
COLING 2016
Improving Attention Modeling with Implicit Distortion and Fertility for Machine Translation
COLING 2016
Constraint-Based Question Answering with Knowledge Graph
COLING 2016
Detecting Context Dependent Messages in a Conversational Environment
COLING 2016
A Redundancy-Aware Sentence Regression Framework for Extractive Summarization
COLING 2016
DocChat: An Information Retrieval Approach for Chatbot Engines Using Unstructured Documents
ACL 2016
Knowledge-Based Semantic Embedding for Machine Translation
ACL 2016
Solving and Generating Chinese Character Riddles
EMNLP 2016
News Stream Summarization using Burst Information Networks
EMNLP 2016
Learning Summary Prior Representation for Extractive Summarization
ACL 2015
Splusplus: A Feature-Rich Two-stage Classifier for Sentiment Analysis of Tweets
SEMEVAL 2015
Learning Summary Prior Representation for Extractive Summarization
IJCNLP 2015
A Dependency-Based Neural Network for Relation Classification
IJCNLP 2015
Efficient Disfluency Detection with Transition-based Parsing
IJCNLP 2015
Question Answering over Freebase with Multi-Column Convolutional Neural Networks
IJCNLP 2015
Hierarchical Recurrent Neural Network for Document Modeling
EMNLP 2015
A Hybrid Neural Model for Type Classification of Entity Mentions
IJCAI 2015
Question Answering over Freebase with Multi-Column Convolutional Neural Networks
ACL 2015
Efficient Disfluency Detection with Transition-based Parsing
ACL 2015
A Dependency-Based Neural Network for Relation Classification
ACL 2015
A Joint Segmentation and Classification Framework for Sentiment Analysis
EMNLP 2014
Building Large-Scale Twitter-Specific Sentiment Lexicon : A Representation Learning Approach
COLING 2014
Joint Relational Embeddings for Knowledge-based Question Answering
EMNLP 2014
Coooolll: A Deep Learning System for Twitter Sentiment Classification
SEMEVAL 2014
Adaptive Recursive Neural Network for Target-dependent Twitter Sentiment Classification
ACL 2014
Learning Sentiment-Specific Word Embedding for Twitter Sentiment Classification
ACL 2014
A Recursive Recurrent Neural Network for Statistical Machine Translation
ACL 2014
Knowledge-Based Question Answering as Machine Translation
ACL 2014
Learning Topic Representation for SMT with Neural Networks
ACL 2014
Bilingually-constrained Phrase Embeddings for Machine Translation
ACL 2014
A Lexicalized Reordering Model for Hierarchical Phrase-based Translation
COLING 2014
Soft Dependency Matching for Hierarchical Phrase-based Machine Translation
COLING 2014
Multi-Domain Adaptation for SMT Using Multi-Task Learning
EMNLP 2013
Answer Extraction from Passage Graph for Question Answering
IJCAI 2013
Word Alignment Modeling with Context Dependent Deep Neural Network
ACL 2013
Entity Linking for Tweets
ACL 2013
Machine Translation Detection from Monolingual Web-Text
ACL 2013
Learning Entity Representation for Entity Disambiguation
ACL 2013
Paraphrasing Adaptation for Web Search Ranking
ACL 2013
Bilingual Data Cleaning for SMT using Graph-based Random Walk
ACL 2013
Efficient Collective Entity Linking with Stacking
EMNLP 2013
Forced Derivation Tree based Model Training to Statistical Machine Translation
CONLL 2012
Re-training Monolingual Parser Bilingually for Syntactic SMT
EMNLP 2012
Forced Derivation Tree based Model Training to Statistical Machine Translation
EMNLP 2012
Graph-Based Multi-Tweet Summarization using Social Signals
COLING 2012
Lost in Translations? Building Sentiment Lexicons using Context Based Machine Translation
COLING 2012
Twitter Topic Summarization by Ranking Tweets using Social Influence and Content Quality
COLING 2012
QuickView: NLP-based Tweet Search
ACL 2012
Translation Model Size Reduction for Hierarchical Phrase-based Statistical Machine Translation
ACL 2012
Joint Learning of a Dual SMT System for Paraphrase Generation
ACL 2012
Cross-Lingual Mixture Model for Sentiment Classification
ACL 2012
Joint Inference of Named Entity Recognition and Normalization for Tweets
ACL 2012
Learning Translation Consensus with Structured Label Propagation
ACL 2012
Re-training Monolingual Parser Bilingually for Syntactic SMT
CONLL 2012
Target-dependent Twitter Sentiment Classification
ACL 2011
Engkoo: Mining the Web for Language Learning
ACL 2011
Recognizing Named Entities in Tweets
ACL 2011
Hypothesis Mixture Decoding for Statistical Machine Translation
ACL 2011
Translation Model Generalization using Probability Averaging for Machine Translation
COLING 2010
A Joint Rule Selection Model for Hierarchical Phrase-Based Translation
ACL 2010
Semantic Role Labeling for News Tweets
COLING 2010
Improved Discriminative ITG Alignment using Hierarchical Phrase Pairs and Semi-supervised Training
COLING 2010
Adaptive Development Data Selection for Log-linear Model in Statistical Machine Translation
COLING 2010
An Empirical Study on Learning to Rank of Tweets
COLING 2010
An Empirical Study on Web Mining of Parallel Data
COLING 2010
Mixture Model-based Minimum Bayes Risk Decoding using Multiple Machine Translation Systems
COLING 2010
Discriminative Pruning for Discriminative ITG Alignment
ACL 2010
Collective Semantic Role Labeling on Open News Corpus by Leveraging Redundancy
COLING 2010
SRL-Based Verb Selection for ESL
EMNLP 2010
Hybrid Decoding: Decoding with Partial Hypotheses Combination over Multiple SMT Systems
COLING 2010
Collaborative Decoding: Partial Hypothesis Re-ranking Using Translation Consensus between Decoders
IJCNLP 2009
The Feature Subspace Method for SMT System Combination
EMNLP 2009
Collaborative Decoding: Partial Hypothesis Re-ranking Using Translation Consensus between Decoders
ACL 2009
Exploiting Bilingual Information to Improve Web Search
IJCNLP 2009
Mining Bilingual Data from the Web with Adaptively Learnt Patterns
IJCNLP 2009
Exploiting Bilingual Information to Improve Web Search
ACL 2009
Mining Bilingual Data from the Web with Adaptively Learnt Patterns
ACL 2009
Better Synchronous Binarization for Machine Translation
EMNLP 2009
Generating Chinese Couplets using a Statistical MT Approach
COLING 2008
Diagnostic Evaluation of Machine Translation Systems Using Automatically Constructed Linguistic Check-Points
COLING 2008
Improved Sentence Alignment on Parallel Web Pages Using a Stochastic Tree Alignment Model
EMNLP 2008
Combining Multiple Resources to Improve SMT-based Paraphrasing Model
ACL 2008
Measure Word Generation for English-Chinese SMT Systems
ACL 2008
Phrase Reordering Model Integrating Syntactic Knowledge for SMT
EMNLP 2007
Low-Quality Product Review Detection in Opinion Summarization
EMNLP 2007
Improving Query Spelling Correction Using Web Search Results
EMNLP 2007
Improving Query Spelling Correction Using Web Search Results
CONLL 2007
Phrase Reordering Model Integrating Syntactic Knowledge for SMT
CONLL 2007
Low-Quality Product Review Detection in Opinion Summarization
CONLL 2007
A Probabilistic Approach to Syntax-based Reordering for Statistical Machine Translation
ACL 2007
Detecting Erroneous Sentences using Automatically Mined Sequential Patterns
ACL 2007
Detection of Non-Native Sentences Using Machine-Translated Training Data
NAACL 2007
Exploring Distributional Similarity Based Models for Query Spelling Correction
COLING 2006
Exploring Distributional Similarity Based Models for Query Spelling Correction
ACL 2006
A DOM Tree Alignment Model for Mining Parallel Data from the Web
COLING 2006
A DOM Tree Alignment Model for Mining Parallel Data from the Web
ACL 2006
Reranking Answers for Definitional QA Using Language Modeling
COLING 2006
Reranking Answers for Definitional QA Using Language Modeling
ACL 2006
Resume Information Extraction with Cascaded Hybrid Model
ACL 2005
Improving Word Alignment Models using Structured Monolingual Corpora
EMNLP 2004
Collocation Translation Acquisition Using Monolingual Corpora
ACL 2004
A New Approach for English-Chinese Named Entity Alignment
EMNLP 2004
Synonymous Collocation Extraction Using Translation Information
ACL 2003
Chinese Named Entity Identification Using Class-based Language Model
COLING 2002
Structure Alignment Using Bilingual Chunking
COLING 2002
An Automatic Evaluation Method for Localization Oriented Lexicalised EBMT System
COLING 2002
Self-Organizing Chinese and Japanese Semantic Maps
COLING 2002
Automatic Detecting/Correcting Errors in Chinese Text by an Approximate Word-Matching Algorithm
ACL 2000
A Unified Statistical Model for the Identification of English BaseNP
ACL 2000
PENS: A Machine-aided English Writing System for Chinese Users
ACL 2000
A Block-Based Robust Dependency Parser for Unrestricted Chinese Text
ACL 2000
Extraction of Chinese Compound Words - An Experimental Study on a Very Large Corpus
ACL 2000