Wayne Xin Zhao
86 papers · 2011–2025 · 10 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+14 more ↓ Show less ↑
π Conference Polyglot (10) π§ Keyword Pioneer π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (10) π Academic Marathon (14)
π
Academic Marathon
(14)
π
Cross-Pollinator
(14)
π
Renaissance Researcher
(7)
π
Conference Loyalist
(26)
π
Keyword Champion
(4)
π§¬
Topic Evolution
π€
Dynamic Duo
(61)
π¬
Deep Specialist
(16)
β‘
Prolific Year
(6)
π
Century Club
(86)
π₯
Unstoppable
(10)
π
Trend Setter
ποΈ
Keyword Collector
(307)
β
The Questioner
Conferences
ACL (26)
EMNLP (26)
COLING (10)
IJCNLP (7)
IJCAI (6)
AAAI (4)
NAACL (3)
NIPS (2)
CONLL (1)
ECCV (1)
Top co-authors
Research topics
Keywords
large language model
(20)
pretrained language model
(8)
pre-trained language model
(8)
question answering
(7)
text generation
(7)
information retrieval
(6)
model compression
(6)
retrieval-augmented generation
(5)
language model
(5)
domain adaptation
(5)
transfer learning
(5)
attention mechanism
(5)
multimodal learning
(4)
graph neural network
(4)
knowledge distillation
(4)
web search
(4)
chain-of-thought reasoning
(3)
monte carlo tree search
(3)
parameter efficient fine-tuning
(3)
tensor decomposition
(3)
Papers
ViFT: Towards Visual Instruction-Free Fine-tuning for Large Vision-Language Models
EMNLP 2025
YuLan-Mini: Pushing the Limits of Open Data-efficient Language Model
ACL 2025
Towards Effective and Efficient Continual Pre-training of Large Language Models
ACL 2025
Unlocking General Long Chain-of-Thought Reasoning Capabilities of Large Language Models via Representation Engineering
ACL 2025
KG-Agent: An Efficient Autonomous Agent Framework for Complex Reasoning over Knowledge Graph
ACL 2025
LongReD: Mitigating Short-Text Degradation of Long-Context Large Language Models via Restoration Distillation
ACL 2025
Masks Can be Learned as an Alternative to Experts
ACL 2025
Think More, Hallucinate Less: Mitigating Hallucinations via Dual Process of Fast and Slow Thinking
ACL 2025
Socratic Style Chain-of-Thoughts Help LLMs to be a Better Reasoner
ACL 2025
CAFE: Retrieval Head-based Coarse-to-Fine Information Seeking to Enhance Multi-Document QA Capability
EMNLP 2025
Sticker-TTS: Learn to Utilize Historical Experience with a Sticker-driven Test-Time Scaling Framework
EMNLP 2025
ManuSearch: Democratizing Deep Search in Large Language Models with a Transparent and Open Multi-Agent Framework
EMNLP 2025
On Domain-Adaptive Post-Training for Multimodal Large Language Models
EMNLP 2025
Extracting and Combining Abilities For Building Multi-lingual Ability-enhanced Large Language Models
EMNLP 2025
Enhancing Chain-of-Thought Reasoning via Neuron Activation Differential Analysis
EMNLP 2025
What Makes for Good Visual Instructions? Synthesizing Complex Visual Reasoning Instructions for Visual Instruction Tuning
COLING 2025
RAG-Star: Enhancing Deliberative Reasoning with Retrieval Augmented Verification and Refinement
NAACL 2025
Unleashing the Potential of Large Language Models as Prompt Optimizers: Analogical Analysis with Gradient-based Model Optimizers
AAAI 2025
DAWN-ICL: Strategic Planning of Problem-solving Trajectories for Zero-Shot In-Context Learning
NAACL 2025
SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis
EMNLP 2025
Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation
COLING 2025
Smart-Searcher: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning
EMNLP 2025
MMATH: A Multilingual Benchmark for Mathematical Reasoning
EMNLP 2025
ChainLM: Empowering Large Language Models with Improved Chain-of-Thought Prompting
COLING 2024
Enhancing Parameter-efficient Fine-tuning with Simple Calibration Based on Stable Rank
COLING 2024
BASES: Large-scale Web Search User Simulation with Large Language Model based Agents
EMNLP 2024
Images are Achilles' Heel of Alignment: Exploiting Visual Vulnerabilities for Jailbreaking Multimodal Large Language Models
ECCV 2024
Not Everything is All You Need: Toward Low-Redundant Optimization for Large Language Model Alignment
EMNLP 2024
Small Agent Can Also Rock! Empowering Small Language Models as Hallucination Detector
EMNLP 2024
REAR: A Relevance-Aware Retrieval-Augmented Framework for Open-Domain Question Answering
EMNLP 2024
Exploring Context Window of Large Language Models via Decomposed Positional Vectors
NIPS 2024
JiuZhang3.0: Efficiently Improving Mathematical Reasoning by Training Small Data Synthesis Models
NIPS 2024
Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint
ACL 2024
DATA-CUBE: Data Curriculum for Instruction-based Sentence Representation Learning
ACL 2024
Unveiling the Flaws: Exploring Imperfections in Synthetic Data and Mitigation Strategies for Large Language Models
EMNLP 2024
AuriSRec: Adversarial User Intention Learning in Sequential Recommendation
EMNLP 2024
BAMBOO: A Comprehensive Benchmark for Evaluating Long Text Modeling Capacities of Large Language Models
COLING 2024
Do Emergent Abilities Exist in Quantized Large Language Models: An Empirical Study
COLING 2024
MVP: Multi-task Supervised Pre-training for Natural Language Generation
ACL 2023
PDFormer: Propagation Delay-Aware Dynamic Long-Range Transformer for Traffic Flow Prediction
AAAI 2023
Continuous Trajectory Generation Based on Two-Stage GAN
AAAI 2023
Small Pre-trained Language Models Can be Fine-tuned as Large Models via Over-Parameterization
ACL 2023
TOME: A Two-stage Approach for Model-based Retrieval
ACL 2023
Learning to Imagine: Visually-Augmented Natural Language Generation
ACL 2023
Visually-augmented pretrained language models for NLP tasks without images
ACL 2023
The Web Can Be Your Oyster for Improving Language Models
ACL 2023
Zero-shot Visual Question Answering with Language Model Feedback
ACL 2023
Diffusion Models for Non-autoregressive Text Generation: A Survey
IJCAI 2023
A Survey of Vision-Language Pre-Trained Models
IJCAI 2022
ELMER: A Non-Autoregressive Pre-trained Language Model for Efficient and Effective Text Generation
EMNLP 2022
TextBox 2.0: A Text Generation Library with Pre-trained Language Models
EMNLP 2022
SimANS: Simple Ambiguous Negatives Sampling for Dense Text Retrieval
EMNLP 2022
Context-Tuning: Learning Contextualized Prompts for Natural Language Generation
COLING 2022
Parameter-Efficient Mixture-of-Experts Architecture for Pre-trained Language Models
COLING 2022
A Survey on Complex Knowledge Base Question Answering: Methods, Challenges and Solutions
IJCAI 2021
Pretrained Language Model for Text Generation: A Survey
IJCAI 2021
Virtual Data Augmentation: A Robust and General Framework for Fine-tuning Pre-trained Models
EMNLP 2021
RocketQAv2: A Joint Training Method for Dense Passage Retrieval and Passage Re-ranking
EMNLP 2021
RocketQA: An Optimized Training Approach to Dense Passage Retrieval for Open-Domain Question Answering
NAACL 2021
PAIR: Leveraging Passage-Centric Similarity Relation for Improving Dense Passage Retrieval
IJCNLP 2021
Dual Sparse Attention Network For Session-based Recommendation
AAAI 2021
Few-shot Knowledge Graph-to-Text Generation with Pretrained Language Models
IJCNLP 2021
PAIR: Leveraging Passage-Centric Similarity Relation for Improving Dense Passage Retrieval
ACL 2021
Few-shot Knowledge Graph-to-Text Generation with Pretrained Language Models
ACL 2021
CRSLab: An Open-Source Toolkit for Building Conversational Recommender System
ACL 2021
TextBox: A Unified, Modularized, and Extensible Framework for Text Generation
ACL 2021
Enabling Lightweight Fine-tuning for Pre-trained Language Model Compression based on Matrix Product Operators
ACL 2021
A Pretraining Numerical Reasoning Model for Ordinal Constrained Question Answering on Knowledge Base
EMNLP 2021
CRSLab: An Open-Source Toolkit for Building Conversational Recommender System
IJCNLP 2021
TextBox: A Unified, Modularized, and Extensible Framework for Text Generation
IJCNLP 2021
Enabling Lightweight Fine-tuning for Pre-trained Language Model Compression based on Matrix Product Operators
IJCNLP 2021
Towards Topic-Guided Conversational Recommender System
COLING 2020
A Neural Citation Count Prediction Model based on Peer Review Text
EMNLP 2019
Domain Adaptation for Person-Job Fit with Transferable Deep Global Match Network
EMNLP 2019
Generating Long and Informative Reviews with Aspect-Aware Coarse-to-Fine Decoding
ACL 2019
Domain Adaptation for Person-Job Fit with Transferable Deep Global Match Network
IJCNLP 2019
A Neural Citation Count Prediction Model based on Peer Review Text
IJCNLP 2019
Multi-Turn Response Selection for Chatbots with Deep Attention Matching Network
ACL 2018
hyperdoc2vec: Distributed Representations of Hypertext Documents
ACL 2018
A Correlated Topic Model Using Word Embeddings
IJCAI 2017
Bayesian Probabilistic Multi-Topic Matrix Factorization for Rating Prediction
IJCAI 2016
Knowledge Sharing via Social Login: Exploiting Microblogging Service for Warming up Social Question Answering Websites
COLING 2014
Mining New Business Opportunities: Identifying Trend related Products by Leveraging Commercial Intents from Microblogs
EMNLP 2013
Joint Learning for Coreference Resolution with Markov Logic
EMNLP 2012
Joint Learning for Coreference Resolution with Markov Logic
CONLL 2012
Topical Keyphrase Extraction from Twitter
ACL 2011