Yeyun Gong
83 papers · 2013–2026 · 10 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+14 more ↓ Show less ↑
🏃 Academic Marathon (12) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (10) 🐝 Cross-Pollinator (14)
🌈
Renaissance Researcher
(8)
🐣
Hot Topic Early Bird
🌍
Conference Polyglot
(10)
🏠
Conference Loyalist
(21)
🤝
Dynamic Duo
(51)
🏆
Grand Slam
👥
Mega-Team
(24)
🔬
Deep Specialist
(15)
🔥
Unstoppable
(7)
⚡
Prolific Year
(10)
📈
Trend Setter
💎
Century Club
(78)
🗃️
Keyword Collector
(274)
❓
The Questioner
Conferences
EMNLP (21)
ACL (16)
COLING (8)
NAACL (8)
ICLR (7)
ICML (7)
AAAI (5)
IJCNLP (5)
IJCAI (3)
NIPS (3)
Top co-authors
Research topics
Keywords
large language model
(16)
text generation
(9)
contrastive learning
(7)
pre-trained language model
(7)
language model
(5)
question generation
(5)
reinforcement learning
(4)
few-shot learning
(4)
chain-of-thought prompting
(4)
dense retrieval
(4)
attention mechanism
(4)
semantic parsing
(3)
domain adaptation
(3)
transfer learning
(3)
continual pre-training
(3)
question answering
(3)
text retrieval
(3)
code generation
(3)
sequence generation
(3)
knowledge distillation
(3)
Papers
Data Mixing Agent: Learning to Re-weight Domains for Continual Pre-training
ACL 2026
Gold-Medal-Level Olympiad Geometry Solving with Efficient Heuristic Auxiliary Constructions
ACL 2026
Training LLMs for Divide-and-Conquer Reasoning Elevates Test-Time Scalability
ACL 2026
Too Long, Do Re-weighting for Efficient LLM Reasoning Compression
ACL 2026
How Does Alignment Enhance LLMs’ Multilingual Capabilities? A Language Neurons Perspective
AAAI 2026
Enhancing Large Language Model Performance with Gradient-Based Parameter Selection
AAAI 2025
Adapting LLM Agents with Universal Communication Feedback
NAACL 2025
Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning
NAACL 2025
Velocitune: A Velocity-based Dynamic Domain Reweighting Method for Continual Pre-training
ACL 2025
Process-based Self-Rewarding Language Models
ACL 2025
Generative Prompt Internalization
NAACL 2025
Optimizing Large Language Model Training Using FP4 Quantization
ICML 2025
Overcoming Vocabulary Mismatch: Vocabulary-agnostic Teacher Guided Language Modeling
ICML 2025
Automated Proof Generation for Rust Code via Self-Evolution
ICLR 2025
Integrative Decoding: Improving Factuality via Implicit Self-consistency
ICLR 2025
Alchemy: Amplifying Theorem-Proving Capability Through Symbolic Mutation
ICLR 2025
Key-Point-Driven Data Synthesis with Its Enhancement on Mathematical Reasoning
AAAI 2025
Task Oriented In-Domain Data Augmentation
EMNLP 2024
AnnoLLM: Making Large Language Models to Be Better Crowdsourced Annotators
NAACL 2024
Ensuring Safe and High-Quality Outputs: A Guideline Library Approach for Language Models
NAACL 2024
Think-on-Graph: Deep and Responsible Reasoning of Large Language Model on Knowledge Graph
ICLR 2024
APOLLO: An Optimized Training Approach for Long-form Numerical Reasoning
COLING 2024
Knowledge Enhanced Pre-training for Cross-lingual Dense Retrieval
COLING 2024
Not All Tokens Are What You Need for Pretraining
NIPS 2024
PROM: A Phrase-level Copying Mechanism with Pre-training for Abstractive Summarization
COLING 2024
CMMLU: Measuring massive multitask language understanding in Chinese
ACL 2024
Competition-Level Problems are Effective LLM Evaluators
ACL 2024
CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing
ICLR 2024
ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving
ICLR 2024
Enhancing Chain-of-Thoughts Prompting with Iterative Bootstrapping in Large Language Models
NAACL 2024
Synthetic Prompting: Generating Chain-of-Thought Demonstrations for Large Language Models
ICML 2023
AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation
NIPS 2023
Query Rewriting in Retrieval-Augmented Large Language Models
EMNLP 2023
Noisy Pair Corrector for Dense Retrieval
EMNLP 2023
Enhancing Retrieval-Augmented Large Language Models with Iterative Retrieval-Generation Synergy
EMNLP 2023
Allies: Prompting Large Language Model with Beam Search
EMNLP 2023
Text Generation with Diffusion Language Models: A Pre-training Approach with Continuous Paragraph Denoise
ICML 2023
CAPSTONE: Curriculum Sampling for Dense Retrieval with Document Expansion
EMNLP 2023
On-the-Fly Adapting Code Summarization on Trainable Cost-Effective Language Models
NIPS 2023
Joint Generator-Ranker Learning for Natural Language Generation
ACL 2023
P3LM: Probabilistically Permuted Prophet Language Modeling for Generative Pre-Training
EMNLP 2022
DialogVED: A Pre-trained Latent Variable Encoder-Decoder Model for Dialog Response Generation
ACL 2022
Contextual Fine-to-Coarse Distillation for Coarse-grained Response Selection in Open-Domain Conversations
ACL 2022
Metric-guided Distillation: Distilling Knowledge from the Metric to Ranker and Retriever for Generative Commonsense Reasoning
EMNLP 2022
CodeRetriever: A Large Scale Contrastive Pre-Training Method for Code Search
EMNLP 2022
Sentiment-Aware Word and Sentence Level Pre-training for Sentiment Analysis
EMNLP 2022
SimANS: Simple Ambiguous Negatives Sampling for Dense Text Retrieval
EMNLP 2022
Soft-Labeled Contrastive Pre-Training for Function-Level Code Representation
EMNLP 2022
Adversarial Retriever-Ranker for Dense Text Retrieval
ICLR 2022
CULG: Commercial Universal Language Generation
NAACL 2022
BANG: Bridging Autoregressive and Non-autoregressive Generation with Large Scale Pretraining
ICML 2021
GLGE: A New General Language Generation Evaluation Benchmark
ACL 2021
Mask Attention Networks: Rethinking and Strengthen Transformer
NAACL 2021
EL-Attention: Memory Efficient Lossless Attention for Generation
ICML 2021
GLGE: A New General Language Generation Evaluation Benchmark
IJCNLP 2021
ProphetNet-X: Large-Scale Pre-training Models for English, Chinese, Multi-lingual, Dialog, and Code Generation
IJCNLP 2021
FastSeq: Make Sequence Generation Faster
IJCNLP 2021
Poolingformer: Long Document Modeling with Pooling Attention
ICML 2021
KFCNet: Knowledge Filtering and Contrastive Learning for Generative Commonsense Reasoning
EMNLP 2021
FastSeq: Make Sequence Generation Faster
ACL 2021
ProphetNet-X: Large-Scale Pre-training Models for English, Chinese, Multi-lingual, Dialog, and Code Generation
ACL 2021
ProphetNet: Predicting Future N-gram for Sequence-to-SequencePre-training
EMNLP 2020
Diverse, Controllable, and Keyphrase-Aware: A Corpus and Method for News Multi-Headline Generation
EMNLP 2020
XGLUE: A New Benchmark Dataset for Cross-lingual Pre-training, Understanding and Generation
EMNLP 2020
Tell Me How to Ask Again: Question Data Augmentation with Controllable Rewriting in Continuous Space
EMNLP 2020
Uncertainty-Aware Label Refinement for Sequence Labeling
EMNLP 2020
Multi-level Alignment Pretraining for Multi-lingual Semantic Parsing
COLING 2020
Leveraging Document-Level Label Consistency for Named Entity Recognition
IJCAI 2020
Graph-Based Transformer with Cross-Candidate Verification for Semantic Parsing
AAAI 2020
An Enhanced Knowledge Injection Model for Commonsense Generation
COLING 2020
RikiNet: Reading Wikipedia Pages for Natural Question Answering
ACL 2020
Neural Semantic Parsing in Low-Resource Settings with Back-Translation and Meta-Learning
AAAI 2020
Joint Type Inference on Entities and Relations via Graph Convolutional Networks
ACL 2019
Aggregating Bidirectional Encoder Representations Using MatchLSTM for Sequence Matching
IJCNLP 2019
Weakly Supervised Multi-task Learning for Semantic Parsing
IJCAI 2019
Aggregating Bidirectional Encoder Representations Using MatchLSTM for Sequence Matching
EMNLP 2019
Hashtag Recommendation for Multimodal Microblog Using Co-Attention Network
IJCAI 2017
Keyphrase Extraction Using Deep Recurrent Neural Networks on Twitter
EMNLP 2016
Hashtag Recommendation Using End-To-End Memory Networks with Hierarchical Attention
COLING 2016
Hashtag Recommendation Using Dirichlet Process Mixture Models Incorporating Types of Hashtags
EMNLP 2015
Time-aware Personalized Hashtag Recommendation on Social Media
COLING 2014
A Generative Model for Identifying Target Companies of Microblogs
COLING 2014
Detecting Spammers in Community Question Answering
IJCNLP 2013