Pengjun Xie
79 papers · 2017–2026 · 13 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+14 more ↓ Show less ↑
π Conference Polyglot (13) π§ Keyword Pioneer π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (12) π Academic Marathon (8)
π§
Keyword Pioneer
π£
Hot Topic Early Bird
π
Conference Polyglot
(13)
π
Conference Loyalist
(22)
π€
Dynamic Duo
(39)
π
Grand Slam
π¬
Deep Specialist
(18)
π§¬
Topic Evolution
π
Keyword Champion
(25)
β
The Questioner
ποΈ
Keyword Collector
(309)
π₯
Unstoppable
(9)
π
Century Club
(75)
β‘
Prolific Year
(10)
Conferences
ACL (25)
EMNLP (20)
NAACL (7)
AAAI (6)
IJCNLP (4)
SEMEVAL (4)
COLING (3)
ICLR (3)
EACL (2)
NIPS (2)
CVPR (1)
ICML (1)
IJCAI (1)
Top co-authors
Research topics
Keywords
named entity recognition
(25)
large language model
(15)
retrieval-augmented generation
(13)
information retrieval
(8)
domain adaptation
(7)
entity typing
(6)
few-shot learning
(5)
knowledge base
(5)
sequence labeling
(5)
pretrained language model
(4)
representation learning
(4)
supervised fine-tuning
(3)
multimodal retrieval
(3)
language model
(3)
dense retrieval
(3)
transfer learning
(3)
zero-shot learning
(3)
reinforcement learning
(3)
data augmentation
(3)
text representation
(3)
Papers
Rethinking Composed Image Retrieval Evaluation: A Fine-Grained Benchmark from Image Editing
ACL 2026
Nested Browser-Use Learning for Agentic Information Seeking
ACL 2026
ERank: Fusing Supervised Fine-Tuning and Reinforcement Learning for Effective and Efficient Text Reranking
AAAI 2026
Evidence-Augmented Policy Optimization with Reward Co-Evolution for Long-Context Reasoning
ACL 2026
SynWorld: Virtual Scenario Synthesis for Agentic Action Knowledge Refinement
ACL 2025
Language Models are Universal Embedders
ACL 2025
Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization
NAACL 2025
Supportiveness-based Knowledge Rewriting for Retrieval-augmented Language Modeling
NAACL 2025
LaRA: Benchmarking Retrieval-Augmented Generation and Long-Context LLMs β No Silver Bullet for LC or RAG Routing
ICML 2025
Benchmarking Agentic Workflow Generation
ICLR 2025
KBM: Delineating Knowledge Boundary for Adaptive Retrieval in Large Language Models
EMNLP 2025
ProductAgent: Benchmarking Conversational Product Search Agent with Asking Clarification Questions
EMNLP 2025
Detecting Knowledge Boundary of Vision Large Language Models by Sampling-Based Inference
EMNLP 2025
EvolveSearch: An Iterative Self-Evolving Search Agent
EMNLP 2025
ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents
EMNLP 2025
DecoupleSearch: Decouple Planning and Search via Hierarchical Reward Modeling
EMNLP 2025
OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking
EMNLP 2025
Bridging Modalities: Improving Universal Multimodal Retrieval by Multimodal Large Language Models
CVPR 2025
Let LLMs Take on the Latest Challenges! A Chinese Dynamic Question Answering Benchmark
COLING 2025
Towards Text-Image Interleaved Retrieval
ACL 2025
WebWalker: Benchmarking LLMs in Web Traversal
ACL 2025
Agentic Knowledgeable Self-awareness
ACL 2025
Chinese Sequence Labeling with Semi-Supervised Boundary-Aware Language Model Pre-training
COLING 2024
Agent Planning with World Knowledge Model
NIPS 2024
Three Heads Are Better than One: Improving Cross-Domain NER with Progressive Decomposed Network
AAAI 2024
EcomGPT: Instruction-Tuning Large Language Models with Chain-of-Task Tasks for E-commerce
AAAI 2024
SeqGPT: An Out-of-the-Box Large Language Model for Open Domain Sequence Understanding
AAAI 2024
Improving Retrieval Augmented Open-Domain Question-Answering with Vectorized Contexts
ACL 2024
A Two-Stage Adaptation of Large Language Models for Text Ranking
ACL 2024
WISE: Rethinking the Knowledge Memory for Lifelong Model Editing of Large Language Models
NIPS 2024
Geo-Encoder: A Chunk-Argument Bi-Encoder Framework for Chinese Geographic Re-Ranking
EACL 2024
Effective Demonstration Annotation for In-Context Learning via Language Model-Based Determinantal Point Process
EMNLP 2024
Retrieved In-Context Principles from Previous Mistakes
EMNLP 2024
mGTE: Generalized Long-Context Text Representation and Reranking Models for Multilingual Text Retrieval
EMNLP 2024
RaFe: Ranking Feedback Improves Query Rewriting for RAG
EMNLP 2024
Knowledge Mechanisms in Large Language Models: A Survey and Perspective
EMNLP 2024
Query Routing for Homogeneous Tools: An Instantiation in the RAG Scenario
EMNLP 2024
Exploring Key Point Analysis with Pairwise Generation and Graph Partitioning
NAACL 2024
Exploring Lottery Prompts for Pre-trained Language Models
ACL 2023
Recall, Expand, and Multi-Candidate Cross-Encode: Fast and Accurate Ultra-Fine Entity Typing
ACL 2023
MANNER: A Variational Memory-Augmented Model for Cross Domain Few-Shot Named Entity Recognition
ACL 2023
Do PLMs Know and Understand Ontological Knowledge?
ACL 2023
COMBO: A Complete Benchmark for Open KG Canonicalization
EACL 2023
Text Representation Distillation via Information Bottleneck Principle
EMNLP 2023
DAMO-NLP at SemEval-2023 Task 2: A Unified Retrieval-augmented System for Multilingual Named Entity Recognition
SEMEVAL 2023
Entity-to-Text based Data Augmentation for various Named Entity Recognition Tasks
ACL 2023
DAMO-NLP at SemEval-2023 Task 2: A Unified Retrieval-augmented System for Multilingual Named Entity Recognition
ACL 2023
Adversarial Self-Attention for Language Understanding
AAAI 2023
Few-shot Classification with Hypersphere Modeling of Prototypes
ACL 2023
Improving Low-resource Named Entity Recognition with Graph Propagated Data Augmentation
ACL 2023
DAMO-NLP at SemEval-2022 Task 11: A Knowledge-based System for Multilingual Named Entity Recognition
SEMEVAL 2022
Forging Multiple Training Objectives for Pre-trained Language Models via Meta-Learning
EMNLP 2022
Prompt-learning for Fine-grained Entity Typing
EMNLP 2022
Named Entity and Relation Extraction with Multi-Modal Retrieval
EMNLP 2022
Parallel Instance Query Network for Named Entity Recognition
ACL 2022
Robust Self-Augmentation for Named Entity Recognition with Meta Reweighting
NAACL 2022
DAMO-NLP at SemEval-2022 Task 11: A Knowledge-based System for Multilingual Named Entity Recognition
NAACL 2022
Domain-Specific NER via Retrieving Correlated Samples
COLING 2022
Unsupervised Boundary-Aware Language Model Pretraining for Chinese Sequence Labeling
EMNLP 2022
Modeling Label Correlations for Ultra-Fine Entity Typing with Neural Pairwise Conditional Random Field
EMNLP 2022
A Fine-Grained Domain Adaption Model for Joint Word Segmentation and POS Tagging
EMNLP 2021
Few-NERD: A Few-shot Named Entity Recognition Dataset
IJCNLP 2021
Counterfactual Inference for Text Classification Debiasing
IJCNLP 2021
Crowdsourcing Learning as Domain Adaptation: A Case Study on Named Entity Recognition
IJCNLP 2021
Counterfactual Inference for Text Classification Debiasing
ACL 2021
Crowdsourcing Learning as Domain Adaptation: A Case Study on Named Entity Recognition
ACL 2021
Probing BERT in Hyperbolic Spaces
ICLR 2021
Prototypical Representation Learning for Relation Extraction
ICLR 2021
Knowledge-aware Named Entity Recognition with Alleviating Heterogeneity
AAAI 2021
Few-NERD: A Few-shot Named Entity Recognition Dataset
ACL 2021
Learning with Noise: Improving Distantly-Supervised Fine-grained Entity Typing via Automatic Relabeling
IJCAI 2020
Coupling Distant Annotation and Adversarial Training for Cross-Domain Chinese Word Segmentation
ACL 2020
Hierarchy-Aware Global Model for Hierarchical Text Classification
ACL 2020
DM_NLP at SemEval-2018 Task 12: A Pipeline System for Toponym Resolution
SEMEVAL 2019
Neural Chinese Address Parsing
NAACL 2019
Better Modeling of Incomplete Annotations for Named Entity Recognition
NAACL 2019
A Neural Multi-digraph Model for Chinese NER with Gazetteers
ACL 2019
DM_NLP at SemEval-2018 Task 8: neural sequence labeling with linguistic features
SEMEVAL 2018
Alibaba at IJCNLP-2017 Task 1: Embedding Grammatical Features into LSTMs for Chinese Grammatical Error Diagnosis Task
IJCNLP 2017