Shuming Shi
152 papers · 2009–2026 · 12 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+14 more ↓ Show less ↑
π£ Hot Topic Early Bird π§ Keyword Pioneer πΊοΈ Taxonomy Completionist (13) π Interdisciplinary Bridge π Conference Polyglot (12)
πΊοΈ
Taxonomy Completionist
(13)
π§
Keyword Pioneer
π£
Hot Topic Early Bird
π
Conference Loyalist
(48)
π€
Dynamic Duo
(50)
π¬
Deep Specialist
(50)
π
Keyword Champion
(3)
π₯
Unstoppable
(12)
β‘
Prolific Year
(27)
β
The Questioner
(2)
ποΈ
Keyword Collector
(527)
π
Trend Setter
π
Century Club
(151)
π
Conference Pioneer
Conferences
EMNLP (52)
ACL (49)
IJCNLP (19)
AAAI (9)
NAACL (9)
ICLR (5)
COLING (2)
IJCAI (2)
NIPS (2)
AACL (1)
CONLL (1)
CVPR (1)
Top co-authors
Keywords
neural machine translation
(40)
large language model
(19)
machine translation
(17)
text generation
(15)
data augmentation
(8)
response generation
(7)
dialogue system
(6)
dialogue generation
(5)
attention mechanism
(5)
instruction tuning
(5)
domain adaptation
(5)
matching model
(5)
self-attention network
(5)
computer-aided translation
(5)
word embedding
(4)
text classification
(4)
language model
(4)
transfer learning
(4)
benchmark evaluation
(4)
named entity recognition
(4)
Papers
Mixture of Heterogeneous Grouped Experts for Language Modeling
ACL 2026
Learning-enabled Polynomial Lyapunov Function Synthesis via High-Accuracy Counterexample-Guided Framework
CVPR 2025
DAST: Difficulty-Adaptive Slow-Thinking for Large Reasoning Models
EMNLP 2025
Fuzzy Reasoning Chain (FRC): An Innovative Reasoning Framework from Fuzziness to Clarity
EMNLP 2025
Alleviating Hallucinations of Large Language Models through Induced Hallucinations
NAACL 2025
Benchmarking and Improving Long-Text Translation with Large Language Models
ACL 2024
TextBind: Multi-turn Interleaved Multimodal Instruction-following in the Wild
ACL 2024
Retrieval is Accurate Generation
ICLR 2024
The Reasonableness Behind Unreasonable Translation Capability of Large Language Model
ICLR 2024
MAGE: Machine-generated Text Detection in the Wild
ACL 2024
Advancement in Graph Understanding: A Multimodal Benchmark and Fine-Tuning of Vision-Language Models
ACL 2024
Disperse-Then-Merge: Pushing the Limits of Instruction Tuning via Alignment Tax Reduction
ACL 2024
Spotting AIβs Touch: Identifying LLM-Paraphrased Spans in Text
ACL 2024
Selection-p: Self-Supervised Task-Agnostic Prompt Compression for Faithfulness and Transferability
EMNLP 2024
Not All Preference Pairs Are Created Equal: A Recipe for Annotation-Efficient Iterative Preference Learning
EMNLP 2024
Encouraging Divergent Thinking in Large Language Models through Multi-Agent Debate
EMNLP 2024
Knowledge Verification to Nip Hallucination in the Bud
EMNLP 2024
Benchmarking LLMs via Uncertainty Quantification
NIPS 2024
A Frustratingly Simple Decoding Method for Neural Text Generation
COLING 2024
Reasons to Reject? Aligning Language Models with Judgments
ACL 2024
Addressing Entity Translation Problem via Translation Difficulty and Context Diversity
ACL 2024
StrategyLLM: Large Language Models as Strategy Generators, Executors, Optimizers, and Evaluators for Problem Solving
NIPS 2024
Improving Machine Translation with Human Feedback: An Exploration of Quality Estimation as a Reward Model
NAACL 2024
DoG-Instruct: Towards Premium Instruction-Tuning Data via Text-Grounded Instruction Wrapping
NAACL 2024
GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher
ICLR 2024
Knowledge Fusion of Large Language Models
ICLR 2024
IMTLab: An Open-Source Platform for Building, Evaluating, and Diagnosing Interactive Machine Translation Systems
EMNLP 2023
Explore-Instruct: Enhancing Domain-Specific Instruction Coverage through Active Exploration
EMNLP 2023
Unsupervised Keyphrase Extraction by Learning Neural Keyphrase Set Function
ACL 2023
Sen2Pro: A Probabilistic Perspective to Sentence Embedding from Pre-trained Language Model
ACL 2023
Rethinking Translation Memory Augmented Neural Machine Translation
ACL 2023
Making Better Use of Training Corpus: Retrieval-based Aspect Sentiment Triplet Extraction via Label Interpolation
ACL 2023
A Survey on Zero Pronoun Translation
ACL 2023
Enhancing Grammatical Error Correction Systems with Explanations
ACL 2023
SORTIE: Dependency-Aware Symbolic Reasoning for Logical Data-to-text Generation
ACL 2023
Exploring the Use of Large Language Models for Reference-Free Text Quality Evaluation: An Empirical Study
AACL 2023
Zero-Shot Rumor Detection with Propagation Structure via Prompt Learning
AAAI 2023
Improved Visual Story Generation with Adaptive Context Modeling
ACL 2023
Explicit Syntactic Guidance for Neural Text Generation
ACL 2023
Effidit: An Assistant for Improving Writing Efficiency
ACL 2023
Exploring the Use of Large Language Models for Reference-Free Text Quality Evaluation: An Empirical Study
IJCNLP 2023
Findings of the Word-Level AutoCompletion Shared Task in WMT 2023
EMNLP 2023
Findings of the WMT 2023 Shared Task on Discourse-Level Literary Translation: A Fresh Orb in the Cosmos of LLMs
EMNLP 2023
ParroT: Translating during Chat using Large Language Models tuned with Human Translation and Feedback
EMNLP 2023
Retrieval-Augmented Few-shot Text Classification
EMNLP 2023
RobustGEC: Robust Grammatical Error Correction Against Subtle Context Perturbation
EMNLP 2023
Document-Level Machine Translation with Large Language Models
EMNLP 2023
Rethinking Word-Level Auto-Completion in Computer-Aided Translation
EMNLP 2023
Towards Efficient Dialogue Pre-training with Transferable and Interpretable Latent Structure
EMNLP 2022
Exploring and Adapting Chinese GPT to Pinyin Input Method
ACL 2022
BiTIIMT: A Bilingual Text-infilling Method for Interactive Machine Translation
ACL 2022
Learning from Sibling Mentions with Scalable Graph Inference in Fine-Grained Entity Typing
ACL 2022
Redistributing Low-Frequency Words: Making the Most of Monolingual Data in Non-Autoregressive Translation
ACL 2022
Understanding and Improving Sequence-to-Sequence Pretraining for Neural Machine Translation
ACL 2022
Bridging the Data Gap between Training and Inference for Unsupervised Neural Machine Translation
ACL 2022
Rethinking Negative Sampling for Handling Missing Entity Annotations
ACL 2022
A Model-agnostic Data Manipulation Method for Persona-based Dialogue Generation
ACL 2022
Investigating Data Variance in Evaluations of Automatic Machine Translation Metrics
ACL 2022
On the Evaluation Metrics for Paraphrase Generation
EMNLP 2022
GuoFeng: A Benchmark for Zero Pronoun Recovery and Translation
EMNLP 2022
MCPG: A Flexible Multi-Level Controllable Framework for Unsupervised Paraphrase Generation
EMNLP 2022
Tencent AI Lab - Shanghai Jiao Tong University Low-Resource Translation System for the WMT22 Translation Task
EMNLP 2022
Findings of the Word-Level AutoCompletion Shared Task in WMT 2022
EMNLP 2022
Tencentβs Multilingual Machine Translation System for WMT22 Large-Scale African Languages
EMNLP 2022
On Synthetic Data for Back Translation
NAACL 2022
Dialogue Response Selection with Hierarchical Curriculum Learning
ACL 2021
Fine-grained Entity Typing without Knowledge Base
EMNLP 2021
Segmenting Natural Language Sentences via Lexical Unit Analysis
EMNLP 2021
On the Complementarity between Pre-Training and Back-Translation for Neural Machine Translation
EMNLP 2021
Tencent Translation System for the WMT21 News Translation Task
EMNLP 2021
Tencent AI Lab Machine Translation Systems for the WMT21 Biomedical Translation Task
EMNLP 2021
On the Copying Behaviors of Pre-Training for Neural Machine Translation
ACL 2021
TexSmart: A System for Enhanced Natural Language Understanding
ACL 2021
On the Language Coverage Bias for Neural Machine Translation
ACL 2021
Enhancing the Open-Domain Dialogue Evaluation in Latent Space
ACL 2021
Enhancing the Open-Domain Dialogue Evaluation in Latent Space
IJCNLP 2021
On the Language Coverage Bias for Neural Machine Translation
IJCNLP 2021
Tail-to-Tail Non-Autoregressive Sequence Prediction for Chinese Grammatical Error Correction
ACL 2021
GWLAN: General Word-Level AutocompletioN for Computer-Aided Translation
ACL 2021
Self-Training Sampling with Monolingual Data Uncertainty for Neural Machine Translation
ACL 2021
REAMβ―: An Enhancement Approach to Reference-based Evaluation Metrics for Open-domain Dialog Generation
ACL 2021
On the Copying Behaviors of Pre-Training for Neural Machine Translation
IJCNLP 2021
REAMβ―: An Enhancement Approach to Reference-based Evaluation Metrics for Open-domain Dialog Generation
IJCNLP 2021
TexSmart: A System for Enhanced Natural Language Understanding
IJCNLP 2021
Tail-to-Tail Non-Autoregressive Sequence Prediction for Chinese Grammatical Error Correction
IJCNLP 2021
Empirical Analysis of Unlabeled Entity Problem in Named Entity Recognition
ICLR 2021
GWLAN: General Word-Level AutocompletioN for Computer-Aided Translation
IJCNLP 2021
Self-Training Sampling with Monolingual Data Uncertainty for Neural Machine Translation
IJCNLP 2021
Dialogue Response Selection with Hierarchical Curriculum Learning
IJCNLP 2021
An Empirical Study on Multiple Information Sources for Zero-Shot Fine-Grained Entity Typing
EMNLP 2021
On the Inference Calibration of Neural Machine Translation
ACL 2020
Rigid Formats Controlled Text Generation
ACL 2020
Evaluating Explanation Methods for Neural Machine Translation
ACL 2020
Go From the General to the Particular: Multi-Domain Translation with Domain Transformation Networks
AAAI 2020
When Hearst Is not Enough: Improving Hypernymy Detection from Corpus with Distributional Models
EMNLP 2020
The World is Not Binary: Learning to Rank with Grayscale Data for Dialogue Response Selection
EMNLP 2020
On the Branching Bias of Syntax Extracted from Pre-trained Language Models
EMNLP 2020
Neuron Interaction Based Representation Composition for Neural Machine Translation
AAAI 2020
Tencent AI Lab Machine Translation Systems for the WMT20 Biomedical Translation Task
EMNLP 2020
CASE: Context-Aware Semantic Expansion
AAAI 2020
On the Sub-layer Functionalities of Transformer Decoder
EMNLP 2020
Tencent Neural Machine Translation Systems for the WMT20 News Translation Task
EMNLP 2020
Tencent AI Lab Machine Translation Systems for WMT20 Chat Translation Task
EMNLP 2020
Balancing Quality and Human Involvement: An Effective Approach to Interactive Neural Machine Translation
AAAI 2020
Semi-supervised Text Style Transfer: Cross Projection in Latent Space
IJCNLP 2019
Multi-Granularity Self-Attention for Neural Machine Translation
EMNLP 2019
One Model to Learn Both: Zero Pronoun Prediction and Translation
EMNLP 2019
Towards Understanding Neural Machine Translation with Word Importance
EMNLP 2019
Towards Better Modeling Hierarchical Structure for Self-Attention with Ordered Neurons
EMNLP 2019
Self-Attention with Structural Position Representations
EMNLP 2019
Retrieval-guided Dialogue Response Generation via a Matching-to-Generation Framework
EMNLP 2019
A Discrete CVAE for Response Generation on Short-Text Conversation
EMNLP 2019
Semi-supervised Text Style Transfer: Cross Projection in Latent Space
EMNLP 2019
Skeleton-to-Response: Dialogue Generation Guided by Retrieval Memory
NAACL 2019
Microblog Hashtag Generation via Encoding Conversation Contexts
NAACL 2019
Dynamic Layer Aggregation for Neural Machine Translation with Routing-by-Agreement
AAAI 2019
Graph Based Translation Memory for Neural Machine Translation
AAAI 2019
Neural Machine Translation with Adequacy-Oriented Learning
AAAI 2019
Generating Multiple Diverse Responses for Short-Text Conversation
AAAI 2019
Exploiting Sentential Context for Neural Machine Translation
ACL 2019
Fine-Grained Sentence Functions for Short-Text Conversation
ACL 2019
Topic-Aware Neural Keyphrase Generation for Social Media Language
ACL 2019
On the Word Alignment from Neural Machine Translation
ACL 2019
Multi-Granularity Self-Attention for Neural Machine Translation
IJCNLP 2019
One Model to Learn Both: Zero Pronoun Prediction and Translation
IJCNLP 2019
Towards Understanding Neural Machine Translation with Word Importance
IJCNLP 2019
Towards Better Modeling Hierarchical Structure for Self-Attention with Ordered Neurons
IJCNLP 2019
Self-Attention with Structural Position Representations
IJCNLP 2019
Retrieval-guided Dialogue Response Generation via a Matching-to-Generation Framework
IJCNLP 2019
A Discrete CVAE for Response Generation on Short-Text Conversation
IJCNLP 2019
Understanding and Improving Hidden Representations for Neural Machine Translation
NAACL 2019
Directional Skip-Gram: Explicitly Distinguishing Left and Right Context for Word Embeddings
NAACL 2018
Joint Learning Embeddings for Chinese Words and their Components via Ladder Structured Networks
IJCAI 2018
Complementary Learning of Word Embeddings
IJCAI 2018
QuaSE: Sequence Editing under Quantifiable Guidance
EMNLP 2018
Target Foresight Based Attention for Neural Machine Translation
NAACL 2018
Towards Less Generic Responses in Neural Conversation Models: A Statistical Re-weighting Method
EMNLP 2018
hyperdoc2vec: Distributed Representations of Hypertext Documents
ACL 2018
Automatic Article Commenting: the Task and Dataset
ACL 2018
Exploiting Deep Representations for Neural Machine Translation
EMNLP 2018
Generating Classical Chinese Poems via Conditional Variational Autoencoder and Adversarial Training
EMNLP 2018
Learning Fine-Grained Expressions to Solve Math Word Problems
EMNLP 2017
Deep Neural Solver for Math Word Problems
EMNLP 2017
How well do Computers Solve Math Word Problems? Large-Scale Dataset Construction and Evaluation
ACL 2016
Automatically Solving Number Word Problems by Semantic Parsing and Reasoning
EMNLP 2015
Unsupervised Template Mining for Semantic Category Understanding
EMNLP 2014
Ensemble Semantics for Large-scale Unsupervised Relation Extraction
CONLL 2012
Ensemble Semantics for Large-scale Unsupervised Relation Extraction
EMNLP 2012
Nonlinear Evidence Fusion and Propagation for Hyponymy Relation Mining
ACL 2011
Corpus-based Semantic Class Mining: Distributional vs. Pattern-Based Approaches
COLING 2010
Employing Topic Models for Pattern-based Semantic Class Discovery
ACL 2009
Employing Topic Models for Pattern-based Semantic Class Discovery
IJCNLP 2009