Xipeng Qiu
222 papers · 2009–2026 · 14 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+19 more ↓ Show less ↑
πΊοΈ Taxonomy Completionist (32) π§ Keyword Pioneer π Interdisciplinary Bridge π Renaissance Researcher (5) π£ Hot Topic Early Bird
π
Academic Marathon
(16)
π
Renaissance Researcher
(5)
π
Interdisciplinary Bridge
π
Keyword Trendsetter Combo
(9)
π
Conference Loyalist
(68)
π€
Dynamic Duo
(96)
π
Triple Crown
π
Grand Slam
π₯
Mega-Team
(34)
π¬
Deep Specialist
(38)
π§¬
Topic Evolution
π
Keyword Champion
(3)
π₯
Unstoppable
(17)
β
The Questioner
(13)
π
Century Club
(216)
ποΈ
Keyword Collector
(104)
β‘
Prolific Year
(14)
π
Trend Setter
π
Conference Pioneer
Conferences
ACL (72)
EMNLP (63)
AAAI (13)
COLING (13)
NAACL (13)
IJCAI (12)
IJCNLP (11)
ICLR (7)
ICML (7)
NIPS (4)
CONLL (3)
AISTATS (2)
CVPR (1)
ICCV (1)
Top co-authors
Research topics
Keywords
large language model
(47)
neural network
(12)
multi-task learning
(12)
contrastive learning
(11)
text generation
(10)
reinforcement learning
(9)
in-context learning
(8)
text classification
(8)
transfer learning
(8)
named entity recognition
(8)
attention mechanism
(8)
language model
(8)
representation learning
(7)
graph neural network
(7)
sequence labeling
(6)
extractive summarization
(6)
few-shot learning
(6)
chinese word segmentation
(5)
task-oriented dialogue
(5)
instruction tuning
(5)
Papers
VRPO: Rethinking Value Modeling for Robust RL under Noisy Supervision in LLM Post-Training
ACL 2026
XY-Tokenizer: Mitigating the Semantic-Acoustic Conflict in Low-Bitrate Speech Codecs
ACL 2026
Efficient KL Divergence Estimation via Truncated Top-K Integration for Large Language Models
ACL 2026
HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding
ACL 2026
Sparse-dLLM: Accelerating Diffusion LLMs with Dynamic Cache Eviction
AAAI 2026
LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs
AAAI 2026
FiNE: Filtering and Improving Noisy Data Elaborately with Large Language Models
NAACL 2025
ReAttention: Training-Free Infinite Context with Finite Attention Scope
ICLR 2025
CHiP: Cross-modal Hierarchical Direct Preference Optimization for Multimodal LLMs
ICLR 2025
Domain2Vec: Vectorizing Datasets to Find the Optimal Data Mixture without Training
ICML 2025
BitStack: Any-Size Compression of Large Language Models in Variable Memory Environments
ICLR 2025
Perceive the Passage of Time: A Systematic Evaluation of Large Language Model in Temporal Relativity
COLING 2025
Case2Code: Scalable Synthetic Data for Code Generation
COLING 2025
VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks
ICCV 2025
Decoupled Proxy Alignment: Mitigating Language Prior Conflict for Multimodal Alignment in MLLMs
EMNLP 2025
R3-RAG: Learning Step-by-Step Reasoning and Retrieval for LLMs via Reinforcement Learning
EMNLP 2025
VehicleWorld: A Highly Integrated Multi-Device Environment for Intelligent Vehicle Interaction
EMNLP 2025
Error Classification of Large Language Models on Math Word Problems: A Dynamically Adaptive Framework
EMNLP 2025
UnifiedVisual: A Framework for Constructing Unified Vision-Language Datasets
EMNLP 2025
ProLongVid: A Simple but Strong Baseline for Long-context Video Instruction Tuning
EMNLP 2025
Multi-Programming Language Sandbox for LLMs
ACL 2025
ConvSearch-R1: Enhancing Query Reformulation for Conversational Search with Reasoning via Reinforcement Learning
EMNLP 2025
Dynamic and Generalizable Process Reward Modeling
ACL 2025
Revisiting the Test-Time Scaling of o1-like Models: Do they Truly Possess Test-Time Scaling Capabilities?
ACL 2025
How to Mitigate Overfitting in Weak-to-strong Generalization?
ACL 2025
CritiQ: Mining Data Quality Criteria from Human Preferences
ACL 2025
World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning
ACL 2025
VisuoThink: Empowering LVLM Reasoning with Multimodal Tree Search
ACL 2025
FastMCTS: A Simple Sampling Strategy for Data Synthesis
ACL 2025
AgentGym: Evaluating and Training Large Language Model-based Agents across Diverse Environments
ACL 2025
Towards Economical Inference: Enabling DeepSeekβs Multi-Head Latent Attention in Any Transformer-based LLMs
ACL 2025
Firewall Routing: Blocking Leads to Better Hybrid Inference for LLMs
EMNLP 2025
UnitCoder: Scalable Code Synthesis from Pre-training Corpora
EMNLP 2025
REARANK: Reasoning Re-ranking Agent via Reinforcement Learning
EMNLP 2025
Prior-Fitted Networks Scale to Larger Datasets When Treated as Weak Learners
AISTATS 2025
VideoRoPE: What Makes for Good Video Rotary Position Embedding?
ICML 2025
Are LLMs Rational Investors? A Study on the Financial Bias in LLMs
ACL 2025
LongSafety: Enhance Safety for Long-Context LLMs
ACL 2025
MetaAlign: Align Large Language Models with Diverse Preferences during Inference Time
NAACL 2025
Safe Inputs but Unsafe Output: Benchmarking Cross-modality Safety Alignment of Large Vision-Language Models
NAACL 2025
CAMIEval: Enhancing NLG Evaluation through Multidimensional Comparative Instruction-Following Analysis
NAACL 2025
Towards Universality: Studying Mechanistic Similarity Across Language Model Architectures
ICLR 2025
Data Mixing Laws: Optimizing Data Mixtures by Predicting Language Modeling Performance
ICLR 2025
Code Needs Comments: Enhancing Code LLMs with Comment Augmentation
ACL 2024
Scaling Laws for Fact Memorization of Large Language Models
EMNLP 2024
Inference-Time Decontamination: Reusing Leaked Benchmarks for Large Language Model Evaluation
EMNLP 2024
LongWanjuan: Towards Systematic Measurement for Long Text Quality
EMNLP 2024
Memorize Step by Step: Efficient Long-Context Prefilling with Incremental Memory and Decremental Chunk
EMNLP 2024
Explicit Memory Learning with Expectation Maximization
EMNLP 2024
Turn Waste into Worth: Rectifying Top-k Router of MoE
EMNLP 2024
InferAligner: Inference-Time Alignment for Harmlessness through Cross-Model Guidance
EMNLP 2024
Calibrating the Confidence of Large Language Models by Eliciting Fidelity
EMNLP 2024
Making Large Language Models Better Reasoners with Orchestrated Streaming Experiences
EMNLP 2024
Aggregation of Reasoning: A Hierarchical Framework for Enhancing Answer Selection in Large Language Models
COLING 2024
Benchmarking Hallucination in Large Language Models Based on Unanswerable Math Word Problem
COLING 2024
The Open-World Lottery Ticket Hypothesis for OOD Intent Classification
COLING 2024
Pixel-level Semantic Correspondence through Layout-aware Representation Learning and Multi-scale Matching Integration
CVPR 2024
Can AI Assistants Know What They Donβt Know?
ICML 2024
Training-Free Long-Context Scaling of Large Language Models
ICML 2024
SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models
ICLR 2024
Scaling Laws of RoPE-based Extrapolation
ICLR 2024
Unified Active Retrieval for Retrieval Augmented Generation
EMNLP 2024
R3-NL2GQL: A Model Coordination and Knowledge Graph Alignment Approach for NL2GQL
EMNLP 2024
SpeechAlign: Aligning Speech Generation to Human Preferences
NIPS 2024
Alignment for Honesty
NIPS 2024
Can Language Models Learn to Skip Steps?
NIPS 2024
DenoSent: A Denoising Objective for Self-Supervised Sentence Representation Learning
AAAI 2024
LLatrieval: LLM-Verified Retrieval for Verifiable Generation
NAACL 2024
Flames: Benchmarking Value Alignment of LLMs in Chinese
NAACL 2024
Reasoning in Flux: Enhancing Large Language Models Reasoning through Uncertainty-aware Adaptive Guidance
ACL 2024
Enhancing EEG-to-Text Decoding through Transferable Representations from Pre-trained Contrastive EEG-Text Masked Autoencoder
ACL 2024
Full Parameter Fine-tuning for Large Language Models with Limited Resources
ACL 2024
F-Eval: Asssessing Fundamental Abilities with Refined Evaluation Methods
ACL 2024
AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling
ACL 2024
L-Eval: Instituting Standardized Evaluation for Long Context Language Models
ACL 2024
LLM can Achieve Self-Regulation via Hyperparameter Aware Generation
ACL 2024
Identifying Semantic Induction Heads to Understand In-Context Learning
ACL 2024
GAOKAO-MM: A Chinese Human-Level Benchmark for Multimodal Models Evaluation
ACL 2024
AdaLomo: Low-memory Optimization with Adaptive Learning Rate
ACL 2024
Balanced Data Sampling for Language Model Training with Clustering
ACL 2024
Exchange-of-Thought: Enhancing Large Language Model Capabilities through Cross-Model Communication
EMNLP 2023
Mitigating Negative Style Transfer in Hybrid Dialogue System
AAAI 2023
Text Adversarial Purification as Defense against Adversarial Attacks
ACL 2023
A Probabilistic Framework for Discovering New Intents
ACL 2023
UTC-IE: A Unified Token-pair Classification Architecture for Information Extraction
ACL 2023
DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models
ACL 2023
Unified Demonstration Retriever for In-Context Learning
ACL 2023
Distributed Marker Representation for Ambiguous Discourse Markers and Entangled Relations
ACL 2023
Two Birds One Stone: Dynamic Ensemble for OOD Intent Classification
ACL 2023
Multitask Pre-training of Modular Prompt for Chinese Few-Shot Learning
ACL 2023
An AMR-based Link Prediction Approach for Document-level Event Argument Extraction
ACL 2023
Dual Cache for Long Document Neural Coreference Resolution
ACL 2023
CodeIE: Large Code Generation Models are Better Few-Shot Information Extractors
ACL 2023
An Embarrassingly Easy but Strong Baseline for Nested Named Entity Recognition
ACL 2023
Investigating Glyph-Phonetic Information for Chinese Spell Checking: What Works and Whatβs Next?
ACL 2023
Towards Open Environment Intent Prediction
ACL 2023
Do Large Language Models Know What They Donβt Know?
ACL 2023
Multijugate Dual Learning for Low-Resource Task-Oriented Dialogue System
ACL 2023
Improving Contrastive Learning of Sentence Embeddings from AI Feedback
ACL 2023
SeqXGPT: Sentence-Level AI-Generated Text Detection
EMNLP 2023
Plan, Verify and Switch: Integrated Reasoning with Diverse X-of-Thoughts
EMNLP 2023
MoT: Memory-of-Thought Enables ChatGPT to Self-Improve
EMNLP 2023
Character-LLM: A Trainable Agent for Role-Playing
EMNLP 2023
CoLLiE: Collaborative Training of Large Language Models in an Efficient Way
EMNLP 2023
Watermarking LLMs with Weight Quantization
EMNLP 2023
Finding Support Examples for In-Context Learning
EMNLP 2023
PerturbScore: Connecting Discrete and Continuous Perturbations in NLP
EMNLP 2023
SpeechGPT: Empowering Large Language Models with Intrinsic Cross-Modal Conversational Abilities
EMNLP 2023
From Hypergraph Energy Functions to Hypergraph Neural Networks
ICML 2023
Late Prompt Tuning: A Late Prompt Could Be Better Than Many Prompts
EMNLP 2022
DORE: Document Ordered Relation Extraction based on Generative Framework
EMNLP 2022
Black-Box Tuning for Language-Model-as-a-Service
ICML 2022
Contrast and Generation Make BART a Good Dialogue Emotion Recognizer
AAAI 2022
βIs Whole Word Masking Always Better for Chinese BERT?β: Probing on Chinese Grammatical Error Correction
ACL 2022
KNN-Contrastive Learning for Out-of-Domain Intent Classification
ACL 2022
CoNT: Contrastive Neural Text Generation
NIPS 2022
What Dense Graph Do You Need for Self-Attention?
ICML 2022
A Simple Hash-Based Early Exiting Approach For Language Understanding and Generation
ACL 2022
CodeRetriever: A Large Scale Contrastive Pre-Training Method for Code Search
EMNLP 2022
BERTScore is Unfair: On Social Bias in Language Model-Based Metrics for Text Generation
EMNLP 2022
BBTv2: Towards a Gradient-Free Future with Large Language Models
EMNLP 2022
Improving Abstractive Dialogue Summarization with Speaker-Aware Supervised Contrastive Learning
COLING 2022
CoLo: A Contrastive Learning Based Re-ranking Framework for One-Stage Summarization
COLING 2022
Coarse-to-Fine: Hierarchical Multi-task Learning for Natural Language Understanding
COLING 2022
RLET: A Reinforcement Learning Based Approach for Explainable QA with Entailment Trees
EMNLP 2022
Soft-Labeled Contrastive Pre-Training for Function-Level Code Representation
EMNLP 2022
Dialogue Meaning Representation for Task-Oriented Dialogue Systems
EMNLP 2022
Towards Efficient NLP: A Standard Evaluation and A Strong Baseline
NAACL 2022
Is MultiWOZ a Solved Task? An Interactive TOD Evaluation Framework with User Simulator
EMNLP 2022
Pre-training with Meta Learning for Chinese Word Segmentation
NAACL 2021
A Unified Generative Framework for Aspect-based Sentiment Analysis
ACL 2021
Fork or Fail: Cycle-Consistent Training with Many-to-One Mappings
AISTATS 2021
QMSum: A New Benchmark for Query-based Multi-domain Meeting Summarization
NAACL 2021
Contrastive Aligned Joint Learning for Multilingual Summarization
ACL 2021
TextFlint: Unified Multilingual Robustness Evaluation Toolkit for Natural Language Processing
ACL 2021
fastHan: A BERT-based Multi-Task Toolkit for Chinese NLP
ACL 2021
A Unified Generative Framework for Various NER Subtasks
ACL 2021
Accelerating BERT Inference for Sequence Labeling via Early-Exit
IJCNLP 2021
Contrastive Aligned Joint Learning for Multilingual Summarization
IJCNLP 2021
Does syntax matter? A strong baseline for Aspect-based Sentiment Analysis with RoBERTa
NAACL 2021
Backdoor Attacks on Pre-trained Models by Layerwise Weight Poisoning
EMNLP 2021
SpellBERT: A Lightweight Pretrained Model for Chinese Spelling Check
EMNLP 2021
Keyphrase Generation with Fine-Grained Evaluation-Guided Reinforcement Learning
EMNLP 2021
Are Factuality Checkers Reliable? Adversarial Meta-evaluation of Factuality in Summarization
EMNLP 2021
Finding Sparse Structures for Domain Specific Neural Machine Translation
AAAI 2021
Enhancing Scientific Papers Summarization with Citation Graph
AAAI 2021
Token-Aware Virtual Adversarial Training in Natural Language Understanding
AAAI 2021
TextFlint: Unified Multilingual Robustness Evaluation Toolkit for Natural Language Processing
IJCNLP 2021
fastHan: A BERT-based Multi-Task Toolkit for Chinese NLP
IJCNLP 2021
A Unified Generative Framework for Various NER Subtasks
IJCNLP 2021
Accelerating BERT Inference for Sequence Labeling via Early-Exit
ACL 2021
A Unified Generative Framework for Aspect-based Sentiment Analysis
IJCNLP 2021
CDEvalSumm: An Empirical Study of Cross-Dataset Evaluation for Neural Summarization Systems
EMNLP 2020
A Concise Model for Multi-Criteria Chinese Word Segmentation with Transformer Encoder
EMNLP 2020
BERT for Monolingual and Cross-Lingual Reverse Dictionary
EMNLP 2020
Learning Sparse Sharing Architectures for Multiple Tasks
AAAI 2020
Joint Parsing and Generation for Abstractive Summarization
AAAI 2020
Multi-Scale Self-Attention for Text Classification
AAAI 2020
CoLAKE: Contextualized Language and Knowledge Embedding
COLING 2020
GenWiki: A Dataset of 1.3 Million Content-Sharing Text and Graphs for Unsupervised Graph-to-Text Generation
COLING 2020
Improving Image Captioning with Better Use of Caption
ACL 2020
FLAT: Chinese NER Using Flat-Lattice Transformer
ACL 2020
Heterogeneous Graph Neural Networks for Extractive Document Summarization
ACL 2020
Extractive Summarization as Text Matching
ACL 2020
Pre-training Multilingual Neural Machine Translation by Leveraging Alignment Information
EMNLP 2020
BERT-ATTACK: Adversarial Attack Against BERT Using BERT
EMNLP 2020
Style Transformer: Unpaired Text Style Transfer without Disentangled Latent Representation
ACL 2019
Searching for Effective Neural Extractive Summarization: What Works and Whatβs Next
ACL 2019
GlossBERT: BERT for Word Sense Disambiguation with Gloss Knowledge
EMNLP 2019
Learning Multi-Task Communication with Message Passing for Sequence Learning
AAAI 2019
A Closer Look at Data Bias in Neural Extractive Summarization Models
EMNLP 2019
GlossBERT: BERT for Word Sense Disambiguation with Gloss Knowledge
IJCNLP 2019
VCWE: Visual Character-Enhanced Word Embeddings
NAACL 2019
Star-Transformer
NAACL 2019
Utilizing BERT for Aspect-Based Sentiment Analysis via Constructing Auxiliary Sentence
NAACL 2019
Switch-LSTMs for Multi-Criteria Chinese Word Segmentation
AAAI 2019
Information Aggregation via Dynamic Routing for Sequence Encoding
COLING 2018
Same Representation, Different Attentions: Shareable Sentence Representation Learning from Multiple Tasks
IJCAI 2018
Convolutional Interaction Network for Natural Language Inference
EMNLP 2018
A Simple yet Effective Joint Training Method for Cross-Lingual Universal Dependency Parsing
CONLL 2018
Reinforced Mnemonic Reader for Machine Reading Comprehension
IJCAI 2018
Toward Diverse Text Generation with Inverse Reinforcement Learning
IJCAI 2018
Adversarial Multi-task Learning for Text Classification
ACL 2017
Idiom-Aware Compositional Distributed Semantics
EMNLP 2017
A Feature-Enriched Neural Model for Joint Chinese Word Segmentation and Part-of-Speech Tagging
IJCAI 2017
Knowledge Graph Representation with Jointly Structural and Textual Encoding
IJCAI 2017
Dynamic Compositional Neural Networks over Tree Structure
IJCAI 2017
Adaptive Semantic Compositionality for Sentence Modelling
IJCAI 2017
Adversarial Multi-Criteria Learning for Chinese Word Segmentation
ACL 2017
Deep Multi-Task Learning with Shared Memory for Text Classification
EMNLP 2016
Cached Long Short-Term Memory Neural Networks for Document-Level Sentiment Classification
EMNLP 2016
Modelling Interaction of Sentence Pair with Coupled-LSTMs
EMNLP 2016
A New Psychometric-inspired Evaluation Metric for Chinese Word Segmentation
ACL 2016
Implicit Discourse Relation Detection via a Deep Architecture with Gated Relevance Network
ACL 2016
Investigating Language Universal and Specific Properties in Word Embeddings
ACL 2016
Deep Fusion LSTMs for Text Semantic Matching
ACL 2016
Bridging LSTM Architecture and the Neural Dynamics during Reading
IJCAI 2016
Recurrent Neural Network for Text Classification with Multi-Task Learning
IJCAI 2016
Analyzing Linguistic Knowledge in Sequential Model of Sentence
EMNLP 2016
A Re-ranking Model for Dependency Parser with Recursive Convolutional Neural Network
IJCNLP 2015
Convolutional Neural Tensor Network Architecture for Community-Based Question Answering
IJCAI 2015
Learning Context-Sensitive Word Embeddings with Neural Tensor Skip-Gram Model
IJCAI 2015
A Re-ranking Model for Dependency Parser with Recursive Convolutional Neural Network
ACL 2015
Gated Recursive Neural Network for Chinese Word Segmentation
ACL 2015
Sentence Modeling with Gated Recursive Neural Network
EMNLP 2015
Long Short-Term Memory Neural Networks for Chinese Word Segmentation
EMNLP 2015
Transition-based Dependency Parsing Using Two Heterogeneous Gated Recursive Neural Networks
EMNLP 2015
Multi-Timescale Long Short-Term Memory Neural Network for Modelling Sentences and Documents
EMNLP 2015
Gated Recursive Neural Network for Chinese Word Segmentation
IJCNLP 2015
Automatic Corpus Expansion for Chinese Word Segmentation by Exploiting the Redundancy of Web Information
COLING 2014
FudanNLP: A Toolkit for Chinese Natural Language Processing
ACL 2013
Learning Topical Translation Model for Microblog Hashtag Suggestion
IJCAI 2013
Latent Semantic Tensor Indexing for Community-based Question Answering
ACL 2013
Joint Chinese Word Segmentation and POS Tagging on Heterogeneous Annotated Corpora with Multiple Task Learning
EMNLP 2013
Joint Segmentation and Tagging with Coupled Sequences Labeling
COLING 2012
Part-of-Speech Tagging for Chinese-English Mixed Texts with Dynamic Features
CONLL 2012
Part-of-Speech Tagging for Chinese-English Mixed Texts with Dynamic Features
EMNLP 2012
Hierarchical Text Classification with Latent Concepts
ACL 2011
A Fast Accurate Two-stage Training Algorithm for L1-regularized CRFs with Heuristic Line Search Strategy
IJCNLP 2011
Detecting Hedge Cues and their Scopes with Average Perceptron
CONLL 2010
Hierarchical Multi-Label Text Categorization with Global Margin Maximization
ACL 2009
Hierarchical Multi-Label Text Categorization with Global Margin Maximization
IJCNLP 2009