conftrace_

Yeyun Gong

83 papers · 2013–2026 · 10 top CS/AI conferences

Achievements

🏃 Academic Marathon (12) · 🌉 Interdisciplinary Bridge · 🧭 Keyword Pioneer · 🌍 Conference Polyglot (10) · 🐝 Cross-Pollinator (14) · 🌈 Renaissance Researcher (8) · 🐣 Hot Topic Early Bird · 🏠 Conference Loyalist (21) · 🤝 Dynamic Duo (51) · 🏆 Grand Slam · 👥 Mega-Team (24) · 🔬 Deep Specialist (15) · 🔥 Unstoppable (7) · Prolific Year (10) · 📈 Trend Setter · 💎 Century Club (78) · 🗃️ Keyword Collector (274) · The Questioner

Conferences

EMNLP (21) ACL (16) COLING (8) NAACL (8) ICLR (7) ICML (7) AAAI (5) IJCNLP (5) IJCAI (3) NIPS (3)

Research topics

Papers

Data Mixing Agent: Learning to Re-weight Domains for Continual Pre-training · ACL 2026
Gold-Medal-Level Olympiad Geometry Solving with Efficient Heuristic Auxiliary Constructions · ACL 2026
Training LLMs for Divide-and-Conquer Reasoning Elevates Test-Time Scalability · ACL 2026
Too Long, Do Re-weighting for Efficient LLM Reasoning Compression · ACL 2026
How Does Alignment Enhance LLMs’ Multilingual Capabilities? A Language Neurons Perspective · AAAI 2026
Enhancing Large Language Model Performance with Gradient-Based Parameter Selection · AAAI 2025
Adapting LLM Agents with Universal Communication Feedback · NAACL 2025
Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning · NAACL 2025
Velocitune: A Velocity-based Dynamic Domain Reweighting Method for Continual Pre-training · ACL 2025
Process-based Self-Rewarding Language Models · ACL 2025
Generative Prompt Internalization · NAACL 2025
Optimizing Large Language Model Training Using FP4 Quantization · ICML 2025
Overcoming Vocabulary Mismatch: Vocabulary-agnostic Teacher Guided Language Modeling · ICML 2025
Automated Proof Generation for Rust Code via Self-Evolution · ICLR 2025
Integrative Decoding: Improving Factuality via Implicit Self-consistency · ICLR 2025
Alchemy: Amplifying Theorem-Proving Capability Through Symbolic Mutation · ICLR 2025
Key-Point-Driven Data Synthesis with Its Enhancement on Mathematical Reasoning · AAAI 2025
Task Oriented In-Domain Data Augmentation · EMNLP 2024
AnnoLLM: Making Large Language Models to Be Better Crowdsourced Annotators · NAACL 2024
Ensuring Safe and High-Quality Outputs: A Guideline Library Approach for Language Models · NAACL 2024
Think-on-Graph: Deep and Responsible Reasoning of Large Language Model on Knowledge Graph · ICLR 2024
APOLLO: An Optimized Training Approach for Long-form Numerical Reasoning · COLING 2024
Knowledge Enhanced Pre-training for Cross-lingual Dense Retrieval · COLING 2024
Not All Tokens Are What You Need for Pretraining · NIPS 2024
PROM: A Phrase-level Copying Mechanism with Pre-training for Abstractive Summarization · COLING 2024
CMMLU: Measuring massive multitask language understanding in Chinese · ACL 2024
Competition-Level Problems are Effective LLM Evaluators · ACL 2024
CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing · ICLR 2024
ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving · ICLR 2024
Enhancing Chain-of-Thoughts Prompting with Iterative Bootstrapping in Large Language Models · NAACL 2024
Synthetic Prompting: Generating Chain-of-Thought Demonstrations for Large Language Models · ICML 2023
AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation · NIPS 2023
Query Rewriting in Retrieval-Augmented Large Language Models · EMNLP 2023
Noisy Pair Corrector for Dense Retrieval · EMNLP 2023
Enhancing Retrieval-Augmented Large Language Models with Iterative Retrieval-Generation Synergy · EMNLP 2023
Allies: Prompting Large Language Model with Beam Search · EMNLP 2023
Text Generation with Diffusion Language Models: A Pre-training Approach with Continuous Paragraph Denoise · ICML 2023
CAPSTONE: Curriculum Sampling for Dense Retrieval with Document Expansion · EMNLP 2023
On-the-Fly Adapting Code Summarization on Trainable Cost-Effective Language Models · NIPS 2023
Joint Generator-Ranker Learning for Natural Language Generation · ACL 2023
P3LM: Probabilistically Permuted Prophet Language Modeling for Generative Pre-Training · EMNLP 2022
DialogVED: A Pre-trained Latent Variable Encoder-Decoder Model for Dialog Response Generation · ACL 2022
Contextual Fine-to-Coarse Distillation for Coarse-grained Response Selection in Open-Domain Conversations · ACL 2022
Metric-guided Distillation: Distilling Knowledge from the Metric to Ranker and Retriever for Generative Commonsense Reasoning · EMNLP 2022
CodeRetriever: A Large Scale Contrastive Pre-Training Method for Code Search · EMNLP 2022
Sentiment-Aware Word and Sentence Level Pre-training for Sentiment Analysis · EMNLP 2022
SimANS: Simple Ambiguous Negatives Sampling for Dense Text Retrieval · EMNLP 2022
Soft-Labeled Contrastive Pre-Training for Function-Level Code Representation · EMNLP 2022
Adversarial Retriever-Ranker for Dense Text Retrieval · ICLR 2022
CULG: Commercial Universal Language Generation · NAACL 2022
BANG: Bridging Autoregressive and Non-autoregressive Generation with Large Scale Pretraining · ICML 2021
GLGE: A New General Language Generation Evaluation Benchmark · ACL 2021
Mask Attention Networks: Rethinking and Strengthen Transformer · NAACL 2021
EL-Attention: Memory Efficient Lossless Attention for Generation · ICML 2021
GLGE: A New General Language Generation Evaluation Benchmark · IJCNLP 2021
ProphetNet-X: Large-Scale Pre-training Models for English, Chinese, Multi-lingual, Dialog, and Code Generation · IJCNLP 2021
FastSeq: Make Sequence Generation Faster · IJCNLP 2021
Poolingformer: Long Document Modeling with Pooling Attention · ICML 2021
KFCNet: Knowledge Filtering and Contrastive Learning for Generative Commonsense Reasoning · EMNLP 2021
FastSeq: Make Sequence Generation Faster · ACL 2021
ProphetNet-X: Large-Scale Pre-training Models for English, Chinese, Multi-lingual, Dialog, and Code Generation · ACL 2021
ProphetNet: Predicting Future N-gram for Sequence-to-Sequence Pre-training · EMNLP 2020
Diverse, Controllable, and Keyphrase-Aware: A Corpus and Method for News Multi-Headline Generation · EMNLP 2020
XGLUE: A New Benchmark Dataset for Cross-lingual Pre-training, Understanding and Generation · EMNLP 2020
Tell Me How to Ask Again: Question Data Augmentation with Controllable Rewriting in Continuous Space · EMNLP 2020
Uncertainty-Aware Label Refinement for Sequence Labeling · EMNLP 2020
Multi-level Alignment Pretraining for Multi-lingual Semantic Parsing · COLING 2020
Leveraging Document-Level Label Consistency for Named Entity Recognition · IJCAI 2020
Graph-Based Transformer with Cross-Candidate Verification for Semantic Parsing · AAAI 2020
An Enhanced Knowledge Injection Model for Commonsense Generation · COLING 2020
RikiNet: Reading Wikipedia Pages for Natural Question Answering · ACL 2020
Neural Semantic Parsing in Low-Resource Settings with Back-Translation and Meta-Learning · AAAI 2020
Joint Type Inference on Entities and Relations via Graph Convolutional Networks · ACL 2019
Aggregating Bidirectional Encoder Representations Using MatchLSTM for Sequence Matching · IJCNLP 2019
Weakly Supervised Multi-task Learning for Semantic Parsing · IJCAI 2019
Aggregating Bidirectional Encoder Representations Using MatchLSTM for Sequence Matching · EMNLP 2019
Hashtag Recommendation for Multimodal Microblog Using Co-Attention Network · IJCAI 2017
Keyphrase Extraction Using Deep Recurrent Neural Networks on Twitter · EMNLP 2016
Hashtag Recommendation Using End-To-End Memory Networks with Hierarchical Attention · COLING 2016
Hashtag Recommendation Using Dirichlet Process Mixture Models Incorporating Types of Hashtags · EMNLP 2015
Time-aware Personalized Hashtag Recommendation on Social Media · COLING 2014
A Generative Model for Identifying Target Companies of Microblogs · COLING 2014
Detecting Spammers in Community Question Answering · IJCNLP 2013