Fandong Meng
173 papers · 2011–2026 · 12 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+15 more ↓ Show less ↑
π Conference Polyglot (12) π§ Keyword Pioneer πΊοΈ Taxonomy Completionist (15) π Interdisciplinary Bridge π Academic Marathon (14)
πΊοΈ
Taxonomy Completionist
(15)
π§
Keyword Pioneer
π
Academic Marathon
(14)
π
Conference Loyalist
(48)
π€
Dynamic Duo
(149)
π
Grand Slam
π¬
Deep Specialist
(52)
π
Keyword Champion
(3)
π
Conference Pioneer
β‘
Prolific Year
(12)
β
The Questioner
(2)
ποΈ
Keyword Collector
(589)
π
Trend Setter
π
Century Club
(167)
π₯
Unstoppable
(8)
Conferences
ACL (69)
EMNLP (48)
IJCNLP (16)
AAAI (11)
COLING (9)
ICLR (5)
NAACL (5)
ICML (3)
CONLL (2)
IJCAI (2)
NIPS (2)
SEMEVAL (1)
Top co-authors
Keywords
neural machine translation
(39)
large language model
(22)
machine translation
(15)
knowledge distillation
(14)
text generation
(13)
model compression
(7)
cross-lingual summarization
(6)
reinforcement learning
(6)
language model
(6)
information retrieval
(6)
attention mechanism
(6)
contrastive learning
(6)
translation quality
(6)
zero-shot learning
(6)
context modeling
(5)
transformer architecture
(5)
text classification
(5)
transfer learning
(5)
text summarization
(5)
dialogue system
(5)
Papers
SED-SFT: Selectively Encouraging Diversity in Supervised Fine-Tuning
ACL 2026
Figure It Out: Improve the Frontier of Reasoning with Executable Visual States
ACL 2026
Think Natively: Unlocking Multilingual Reasoning with Consistency-Enhanced Reinforcement Learning
ACL 2026
GRAM-RΒ²: Self-Training Generative Foundation Reward Models for Reward Reasoning
AAAI 2026
Investigating Cross-Modal Skill Injection: Scenarios, Methods, and Hyperparameters
ACL 2026
APB-V: Accelerating Long-Video Understanding via Sequence-Parallelism-aware Approximate Attention
ACL 2026
A Self-Denoising Model for Robust Few-Shot Relation Extraction
ACL 2025
Advancing SMoE for Continuous Domain Adaptation of MLLMs: Adaptive Router and Domain-Specific Loss
ACL 2025
THOR-MoE: Hierarchical Task-Guided and Context-Responsive Routing for Neural Machine Translation
ACL 2025
Less, but Better: Efficient Multilingual Expansion for LLMs via Layer-wise Mixture-of-Experts
ACL 2025
An Empirical Study of Many-to-Many Summarization with Large Language Models
ACL 2025
PunchBench: Benchmarking MLLMs in Multimodal Punchline Comprehension
ACL 2025
MiniPLM: Knowledge Distillation for Pre-training Language Models
ICLR 2025
DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory
ICLR 2025
Multilingual Knowledge Editing with Language-Agnostic Factual Neurons
COLING 2025
CM-Align: Consistency-based Multilingual Alignment for Large Language Models
EMNLP 2025
TIU-Bench: A Benchmark for Evaluating Large Multimodal Models on Text-rich Image Understanding
EMNLP 2025
Dense Retrievers Can Fail on Simple Queries: Revealing The Granularity Dilemma of Embeddings
EMNLP 2025
LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning
EMNLP 2025
Retrieval-Augmented Machine Translation with Unstructured Knowledge
EMNLP 2025
ConCISE: Confidence-guided Compression in Step-by-step Efficient Reasoning
EMNLP 2025
Continuous Visual Autoregressive Generation via Score Maximization
ICML 2025
A Law Reasoning Benchmark for LLM with Tree-Organized Structures including Factum Probandum, Evidence and Experiences
ACL 2025
AVG-LLaVA: An Efficient Large Multimodal Model with Adaptive Visual Granularity
ACL 2025
Enhancing Cross-Tokenizer Knowledge Distillation with Contextual Dynamical Mapping
ACL 2025
Beyond Next Token Prediction: Patch-Level Training for Large Language Models
ICLR 2025
LongDPO: Unlock Better Long-form Generation Abilities for LLMs via Critique-augmented Stepwise Information
ACL 2025
DRT: Deep Reasoning Translation via Long Chain-of-Thought
ACL 2025
LCS: A Language Converter Strategy for Zero-Shot Neural Machine Translation
ACL 2024
XAL: EXplainable Active Learning Makes Classifiers Better Low-resource Learners
NAACL 2024
Generative Multi-Modal Knowledge Retrieval with Large Language Models
AAAI 2024
DC-MBR: Distributional Cooling for Minimum Bayesian Risk Decoding
COLING 2024
UMTIT: Unifying Recognition, Translation, and Generation for Multimodal Text Image Translation
COLING 2024
Teaching Large Language Models to Translate with Comparison
AAAI 2024
Tree-of-Reasoning Question Decomposition for Complex Question Answering with Large Language Models
AAAI 2024
On Large Language Modelsβ Hallucination with Regard to Known Facts
NAACL 2024
On Prompt-Driven Safeguarding for Large Language Models
ICML 2024
Language Generation with Strictly Proper Scoring Rules
ICML 2024
Large Language Models Are Not Robust Multiple Choice Selectors
ICLR 2024
Towards Codable Watermarking for Injecting Multi-Bits Information to LLMs
ICLR 2024
LexMatcher: Dictionary-centric Data Curation for LLM-based Machine Translation
EMNLP 2024
Enhancing Byzantine-Resistant Aggregations with Client Embedding
EMNLP 2024
On the token distance modeling ability of higher RoPE attention dimension
EMNLP 2024
Multi-Level Cross-Modal Alignment for Speech Relation Extraction
EMNLP 2024
C-LLM: Learn to Check Chinese Spelling Errors Character by Character
EMNLP 2024
Unsupervised Information Refinement Training of Large Language Models for Retrieval-Augmented Generation
ACL 2024
CSCD-NS: a Chinese Spelling Check Dataset for Native Speakers
ACL 2024
Understanding and Addressing the Under-Translation Problem from the Perspective of Decoding Objective
ACL 2024
TasTe: Teaching Large Language Models to Translate through Self-Reflection
ACL 2024
Continual Learning with Semi-supervised Contrastive Distillation for Incremental Neural Machine Translation
ACL 2024
Cross-Lingual Knowledge Editing in Large Language Models
ACL 2024
Exploring Conditional Variational Mechanism to Pinyin Input Method for Addressing One-to-Many Mappings in Low-Resource Scenarios
ACL 2024
Plot Retrieval as an Assessment of Abstract Semantic Association
ACL 2024
Translatotron-V(ison): An End-to-End Model for In-Image Machine Translation
ACL 2024
Comments as Natural Logic Pivots: Improve Code Generation via Comment Perspective
ACL 2024
Trust in Internal or External Knowledge? Generative Multi-Modal Entity Linking with Knowledge Retriever
ACL 2024
Outdated Issue Aware Decoding for Factual Knowledge Editing
ACL 2024
Instruction Position Matters in Sequence Generation with Large Language Models
ACL 2024
BranchNorm: Robustly Scaling Extremely Deep Transformers
ACL 2024
Towards Multiple References Era β Addressing Data Leakage and Limited Reference Diversity in Machine Translation Evaluation
ACL 2024
Improving Machine Translation with Large Language Models: A Preliminary Study with Cooperative Decoding
ACL 2024
Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning
EMNLP 2023
Diffusion Theory as a Scalpel: Detecting and Purifying Poisonous Dimensions in Pre-trained Language Models Caused by Backdoor or Bias
ACL 2023
Question-Interlocutor Scope Realized Graph Modeling over Key Utterances for Dialogue Reading Comprehension
ACL 2023
RC3: Regularized Contrastive Cross-lingual Cross-modal Pre-training
ACL 2023
Understanding Translationese in Cross-Lingual Summarization
EMNLP 2023
Is ChatGPT a Good NLG Evaluator? A Preliminary Study
EMNLP 2023
D2TV: Dual Knowledge Distillation and Target-oriented Vision Modeling for Many-to-Many Multimodal Summarization
EMNLP 2023
Rethinking the Word-level Quality Estimation for Machine Translation from Human Judgement
ACL 2023
Towards Unifying Multi-Lingual and Cross-Lingual Summarization
ACL 2023
Personality Understanding of Fictional Characters during Book Reading
ACL 2023
Soft Language Clustering for Multilingual Model Pre-training
ACL 2023
Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization
ACL 2023
Consistency Regularization Training for Compositional Generalization
ACL 2023
Fed-FA: Theoretically Modeling Client Data Divergence for Federated Language Backdoor Defense
NIPS 2023
Zero-Shot Cross-Lingual Summarization via Large Language Models
EMNLP 2023
Enhancing Argument Structure Extraction with Efficient Leverage of Contextual Information
EMNLP 2023
HyperNetwork-based Decoupling to Improve Model Generalization for Few-Shot Relation Extraction
EMNLP 2023
WeTS: A Benchmark for Translation Suggestion
EMNLP 2022
A Variational Hierarchical Model for Neural Cross-Lingual Summarization
ACL 2022
Conditional Bilingual Mutual Information Based Adaptive Training for Neural Machine Translation
ACL 2022
MSCTD: A Multimodal Sentiment Chat Translation Dataset
ACL 2022
Confidence Based Bidirectional Global Context Aware Training Framework for Neural Machine Translation
ACL 2022
Scheduled Multi-task Learning for Neural Chat Translation
ACL 2022
EAG: Extract and Generate Multi-way Aligned Corpus for Complete Multi-lingual Neural Machine Translation
ACL 2022
TAKE: Topic-shift Aware Knowledge sElection for Dialogue Generation
COLING 2022
Categorizing Semantic Representations for Neural Machine Translation
COLING 2022
TSAM: A Two-Stream Attention Model for Causal Emotion Entailment
COLING 2022
Cross-Align: Modeling Deep Cross-lingual Interactions for Word Alignment
EMNLP 2022
A Win-win Deal: Towards Sparse and Robust Pre-trained Language Models
NIPS 2022
Towards Robust k-Nearest-Neighbor Machine Translation
EMNLP 2022
ClidSum: A Benchmark Dataset for Cross-Lingual Dialogue Summarization
EMNLP 2022
Digging Errors in NMT: Evaluating and Understanding Model Errors from Partial Hypothesis Space
EMNLP 2022
Language Prior Is Not the Only Shortcut: A Benchmark for Shortcut Learning in VQA
EMNLP 2022
Empathetic Dialogue Generation via Sensitive Emotion Recognition and Sensible Knowledge Selection
EMNLP 2022
Towards Robust Visual Question Answering: Making the Most of Biased Samples via Contrastive Learning
EMNLP 2022
Findings of the WMT 2022 Shared Task on Translation Suggestion
EMNLP 2022
Summer: WeChat Neural Machine Translation Systems for the WMT22 Biomedical Translation Task
EMNLP 2022
BJTU-WeChatβs Systems for the WMT22 Chat Translation Task
EMNLP 2022
Neutral Utterances are Also Causes: Enhancing Conversational Causal Emotion Entailment with Social Commonsense Knowledge
IJCAI 2022
Generating Authentic Adversarial Examples beyond Meaning-preserving with Doubly Round-trip Translation
NAACL 2022
Learning to Win Lottery Tickets in BERT Transfer via Task-agnostic Mask Training
NAACL 2022
Confidence-Aware Scheduled Sampling for Neural Machine Translation
IJCNLP 2021
An Iterative Multi-Knowledge Transfer Network for Aspect-Based Sentiment Analysis
EMNLP 2021
Exploring Dynamic Selection of Branch Expansion Orders for Code Generation
IJCNLP 2021
GTM: A Generative Triple-wise Model for Conversational Question Generation
IJCNLP 2021
Prevent the Language Model from being Overconfident in Neural Machine Translation
IJCNLP 2021
Marginal Utility Diminishes: Exploring the Minimum Knowledge for BERT Knowledge Distillation
IJCNLP 2021
Context Tracking Network: Graph-based Context Modeling for Implicit Discourse Relation Recognition
NAACL 2021
Enhancing Visual Dialog Questioner with Entity-based Strategy Learning and Augmented Guesser
EMNLP 2021
Scheduled Sampling Based on Decoding Steps for Neural Machine Translation
EMNLP 2021
Improving Graph-based Sentence Ordering with Iteratively Predicted Pairwise Orderings
EMNLP 2021
Faster Depth-Adaptive Transformers
AAAI 2021
Towards Making the Most of Dialogue Characteristics for Neural Chat Translation
EMNLP 2021
Infusing Multi-Source Knowledge with Heterogeneous Graph Neural Network for Emotional Conversation Generation
AAAI 2021
Competence-based Curriculum Learning for Multilingual Machine Translation
EMNLP 2021
WeChat Neural Machine Translation Systems for WMT21
EMNLP 2021
Unsupervised Knowledge Selection for Dialogue Generation
IJCNLP 2021
Multimodal Incremental Transformer with Visual Grounding for Visual Dialogue Generation
IJCNLP 2021
GoG: Relation-aware Graph-over-Graph Network for Visual Dialog
IJCNLP 2021
Bilingual Mutual Information Based Adaptive Training for Neural Machine Translation
IJCNLP 2021
Selective Knowledge Distillation for Neural Machine Translation
IJCNLP 2021
Modeling Bilingual Conversational Characteristics for Neural Chat Translation
IJCNLP 2021
Confidence-Aware Scheduled Sampling for Neural Machine Translation
ACL 2021
Target-oriented Fine-tuning for Zero-Resource Named Entity Recognition
ACL 2021
Unsupervised Knowledge Selection for Dialogue Generation
ACL 2021
Multimodal Incremental Transformer with Visual Grounding for Visual Dialogue Generation
ACL 2021
GoG: Relation-aware Graph-over-Graph Network for Visual Dialog
ACL 2021
Bilingual Mutual Information Based Adaptive Training for Neural Machine Translation
ACL 2021
Selective Knowledge Distillation for Neural Machine Translation
ACL 2021
Modeling Bilingual Conversational Characteristics for Neural Chat Translation
ACL 2021
Exploring Dynamic Selection of Branch Expansion Orders for Code Generation
ACL 2021
GTM: A Generative Triple-wise Model for Conversational Question Generation
ACL 2021
Prevent the Language Model from being Overconfident in Neural Machine Translation
ACL 2021
Marginal Utility Diminishes: Exploring the Minimum Knowledge for BERT Knowledge Distillation
ACL 2021
Target-oriented Fine-tuning for Zero-Resource Named Entity Recognition
IJCNLP 2021
Unsupervised Paraphrasing by Simulated Annealing
ACL 2020
Minimizing the Bag-of-Ngrams Difference for Non-Autoregressive Neural Machine Translation
AAAI 2020
Enhancing Pointer Network for Sentence Ordering with Pairwise Ordering Predictions
AAAI 2020
DMRM: A Dual-Channel Multi-Hop Reasoning Model for Visual Dialog
AAAI 2020
[RETRACTED] A Contextual Hierarchical Attention Network with Adaptive Objective for Dialogue State Tracking
ACL 2020
A Novel Graph-based Multi-modal Fusion Encoder for Neural Machine Translation
ACL 2020
Multi-Zone Unit for Recurrent Neural Networks
AAAI 2020
Token-level Adaptive Training for Neural Machine Translation
EMNLP 2020
Multi-Unit Transformers for Neural Machine Translation
EMNLP 2020
Bridging the Gap between Prior and Posterior Knowledge Selection for Knowledge-Grounded Dialogue Generation
EMNLP 2020
A Sentiment-Controllable Topic-to-Essay Generator with Topic Knowledge Graph
EMNLP 2020
WeChat Neural Machine Translation Systems for WMT20
EMNLP 2020
A Novel Aspect-Guided Deep Transition Model for Aspect Based Sentiment Analysis
IJCNLP 2019
Incremental Transformer with Deliberation Decoder for Document Grounded Conversations
ACL 2019
GCDT: A Global Context Enhanced Deep Transition Architecture for Sequence Labeling
ACL 2019
Retrieving Sequential Information for Non-Autoregressive Neural Machine Translation
ACL 2019
Bridging the Gap between Training and Inference for Neural Machine Translation
ACL 2019
CM-Net: A Novel Collaborative Memory Network for Spoken Language Understanding
EMNLP 2019
Enhancing Context Modeling with a Query-Guided Capsule Network for Document-level Translation
EMNLP 2019
A Novel Aspect-Guided Deep Transition Model for Aspect Based Sentiment Analysis
EMNLP 2019
DTMT: A Novel Deep Transition Architecture for Neural Machine Translation
AAAI 2019
CM-Net: A Novel Collaborative Memory Network for Spoken Language Understanding
IJCNLP 2019
Enhancing Context Modeling with a Query-Guided Capsule Network for Document-level Translation
IJCNLP 2019
Neural Machine Translation with Key-Value Memory-Augmented Attention
IJCAI 2018
Modeling Localness for Self-Attention Networks
EMNLP 2018
Towards Robust Neural Machine Translation
ACL 2018
Interactive Attention for Neural Machine Translation
COLING 2016
Encoding Source Language with Convolutional Neural Network for Machine Translation
IJCNLP 2015
Encoding Source Language with Convolutional Neural Network for Machine Translation
ACL 2015
A Dependency Edge-based Transfer Model for Statistical Machine Translation
COLING 2014
Modeling Term Translation for Document-informed Machine Translation
EMNLP 2014
Translation with Source Constituency and Dependency Trees
EMNLP 2013
Discriminative Boosting from Dictionary and Raw Text β A Novel Approach to Build A Chinese Word Segmenter
COLING 2012
ICT: A Translation based Method for Cross-lingual Textual Entailment
SEMEVAL 2012
Iterative Annotation Transformation with Predict-Self Reestimation for Chinese Word Segmentation
CONLL 2012
Iterative Annotation Transformation with Predict-Self Reestimation for Chinese Word Segmentation
EMNLP 2012
ETS: An Error Tolerable System for Coreference Resolution
CONLL 2011