Tao Gui
132 papers · 2017–2026 · 11 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+16 more ↓ Show less ↑
๐ Conference Polyglot (11) ๐งญ Keyword Pioneer ๐บ๏ธ Taxonomy Completionist (21) ๐ Interdisciplinary Bridge ๐ Academic Marathon (8)
๐บ๏ธ
Taxonomy Completionist
(21)
๐งญ
Keyword Pioneer
๐
Academic Marathon
(8)
๐
Conference Loyalist
(42)
๐ค
Dynamic Duo
(108)
๐ฅ
Mega-Team
(34)
๐ฌ
Deep Specialist
(22)
๐งฌ
Topic Evolution
๐
Keyword Champion
(2)
โก
Prolific Year
(8)
โ
The Questioner
(2)
๐๏ธ
Keyword Collector
(492)
๐
Century Club
(119)
๐ฅ
Unstoppable
(9)
๐
Trend Setter
๐
Conference Pioneer
Conferences
ACL (52)
EMNLP (37)
COLING (13)
AAAI (10)
IJCNLP (6)
IJCAI (4)
ICLR (3)
NAACL (3)
CVPR (2)
AACL (1)
ICML (1)
Top co-authors
Research topics
Keywords
large language model
(32)
named entity recognition
(16)
reinforcement learning
(12)
adversarial training
(9)
language model
(9)
transfer learning
(8)
relation extraction
(7)
representation learning
(7)
domain adaptation
(7)
reward model
(7)
reinforcement learning from human feedback
(6)
pre-trained language model
(6)
model compression
(6)
question answering
(5)
text classification
(5)
preference alignment
(4)
adversarial attack
(4)
text generation
(4)
few-shot learning
(4)
natural language processing
(4)
Papers
AgentGym2: Benchmarking Large Language Model Agents in De-Idealized Real-World Environments
ACL 2026
Enhancing LLM-based Search Agents via Contribution Weighted Group Relative Policy Optimization
ACL 2026
Beyond Scaling: Measuring and Predicting the Upper Bound of Knowledge Retention in Language Model Pre-Training
ACL 2026
Counteracting the Matthew Effect in Self-Improvement of LVLMs through Head-Tail Re-balancing
ACL 2026
VRPO: Rethinking Value Modeling for Robust RL under Noisy Supervision in LLM Post-Training
ACL 2026
DARM: Distribution-Aware Reward Modeling by Alleviating Biases from Low Preference-Context Dependency Data
ACL 2026
Outcome Accuracy is Not Enough: Aligning the Reasoning Process of Reward Models
ACL 2026
Which Reasoning Trajectories Teach Students to Reason Better? A Simple Metric of Informative Alignment
ACL 2026
OctoBench: Benchmarking Scaffold-Aware Instruction Following in Repository-Grounded Agentic Coding
ACL 2026
LLMEval-Fair: A Large-Scale Longitudinal Study on Robust and Fair Evaluation of Large Language Models
ACL 2026
What Makes a Good Speech Tokenizer for LLM-Centric Speech Generation? A Systematic Study
AAAI 2026
MHA2MLA-VLM: Enabling DeepSeekโs Economical Multi-Head Latent Attention Across Vision-Language Models
AAAI 2026
MetaAct-RL: Training Language Models for Reasoning Through Meta-Action-Based Reinforcement Learning
AAAI 2026
Mitigating Tail Narrowing in LLM Self-Improvement via Socratic-Guided Sampling
NAACL 2025
SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Models
CVPR 2025
Beyond Boundaries: Learning a Universal Entity Taxonomy across Datasets and Languages for Open Named Entity Recognition
COLING 2025
ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios
COLING 2025
Governance in Motion: Co-evolution of Constitutions and AI models for Scalable Safety
EMNLP 2025
Parrot: A Training Pipeline Enhances Both Program CoT and Natural Language CoT for Reasoning
EMNLP 2025
LoRACoE: Improving Large Language Model via Composition-based LoRA Expert
EMNLP 2025
Toward Optimal LLM Alignments Using Two-Player Games
EMNLP 2025
TL-Training: A Task-Feature-Based Framework for Training Large Language Models in Tool Use
EMNLP 2025
Mitigating Object Hallucinations in MLLMs via Multi-Frequency Perturbations
EMNLP 2025
Distill Visual Chart Reasoning Ability from LLMs to MLLMs
EMNLP 2025
Analyzing the Effects of Supervised Fine-Tuning on Model Knowledge from Token and Parameter Levels
EMNLP 2025
LLMEval-Med: A Real-world Clinical Benchmark for Medical LLMs with Physician Validation
EMNLP 2025
Have the VLMs Lost Confidence? A Study of Sycophancy in VLMs
ICLR 2025
ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use
ACL 2025
Lost in the Context: Insufficient and Distracted Attention to Contexts in Preference Modeling
ACL 2025
CritiQ: Mining Data Quality Criteria from Human Preferences
ACL 2025
Measuring Data Diversity for Instruction Tuning: A Systematic Analysis and A Reliable Metric
ACL 2025
AgentGym: Evaluating and Training Large Language Model-based Agents across Diverse Environments
ACL 2025
Towards Economical Inference: Enabling DeepSeekโs Multi-Head Latent Attention in Any Transformer-based LLMs
ACL 2025
Multi-Programming Language Sandbox for LLMs
ACL 2025
PFDial: A Structured Dialogue Instruction Fine-tuning Method Based on UML Flowcharts
ACL 2025
Better Process Supervision with Bi-directional Rewarding Signals
ACL 2025
Alleviating Shifted Distribution in Human Preference Alignment through Meta-Learning
AAAI 2025
RMB: Comprehensively benchmarking reward models in LLM alignment
ICLR 2025
Improving Discriminative Capability of Reward Models in RLHF Using Contrastive Learning
EMNLP 2024
Unveiling and Consulting Core Experts in Retrieval-Augmented MoE-based LLMs
EMNLP 2024
Reward Modeling Requires Automatic Adjustment Based on Data Quality
EMNLP 2024
LongHeads: Multi-Head Attention is Secretly a Long Context Processor
EMNLP 2024
PDF-to-Tree: Parsing PDF Text Blocks into a Tree
EMNLP 2024
Improving Generalization of Alignment with Human Preferences through Group Invariant Learning
ICLR 2024
LoRAMoE: Alleviating World Knowledge Forgetting in Large Language Models via MoE-Style Plugin
ACL 2024
ToolSword: Unveiling Safety Issues of Large Language Models in Tool Learning Across Three Stages
ACL 2024
Enhancing Contrastive Learning with Noise-Guided Attack: Towards Continual Relation Extraction in the Wild
ACL 2024
StepCoder: Improving Code Generation with Reinforcement Learning from Compiler Feedback
ACL 2024
Navigating the OverKill in Large Language Models
ACL 2024
Unveiling Linguistic Regions in Large Language Models
ACL 2024
AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling
ACL 2024
Rescue: Ranking LLM Responses with Partial Ordering to Improve Response Generation
ACL 2024
P4: Plug-and-Play Discrete Prompting for Large Language Models Personalization
ACL 2024
Making Harmful Behaviors Unlearnable for Large Language Models
ACL 2024
Length Generalization of Causal Transformers without Position Encoding
ACL 2024
Domain Generalization via Causal Adjustment for Cross-Domain Sentiment Analysis
COLING 2024
ORTicket: Let One Robust BERT Ticket Transfer across Different Tasks
COLING 2024
RoCoIns: Enhancing Robustness of Large Language Models through Code-Style Instructions
COLING 2024
Subspace Defense: Discarding Adversarial Perturbations by Learning a Subspace for Clean Signals
COLING 2024
Self-Demos: Eliciting Out-of-Demonstration Generalizability in Large Language Models
NAACL 2024
Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning
ICML 2024
LLMEval: A Preliminary Study on How to Evaluate Large Language Models
AAAI 2024
RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning
EMNLP 2024
Modeling Layout Reading Order as Ordering Relations for Visually-rich Document Understanding
EMNLP 2024
TransferTOD: A Generalizable Chinese Multi-Domain Task-Oriented Dialogue System with Transfer Capabilities
EMNLP 2024
LONGAGENT: Achieving Question Answering for 128k-Token-Long Documents through Multi-Agent Collaboration
EMNLP 2024
Farewell to Aimless Large-scale Pretraining: Influential Subset Selection for Language Model
ACL 2023
Modeling the Q-Diversity in a Min-max Play Game for Robust Optimization
ACL 2023
Towards Understanding Omission in Dialogue Summarization
ACL 2023
RealBehavior: A Framework for Faithfully Characterizing Foundation Modelsโ Human-like Behavior Mechanisms
EMNLP 2023
Orthogonal Subspace Learning for Language Model Continual Learning
EMNLP 2023
Open Set Relation Extraction via Unknown-Aware Training
ACL 2023
RE-Matching: A Fine-Grained Semantic Matching Method for Zero-Shot Relation Extraction
ACL 2023
Learning โOโ Helps for Learning More: Handling the Unlabeled Entity Problem for Class-incremental NER
ACL 2023
Actively Supervised Clustering for Open Relation Extraction
ACL 2023
Self-Polish: Enhance Reasoning in Large Language Models via Problem Refinement
EMNLP 2023
Reading Order Matters: Information Extraction from Visually-rich Documents by Token Path Prediction
EMNLP 2023
Connectivity Patterns are Task Embeddings
ACL 2023
Detecting Adversarial Samples through Sharpness of Loss Landscape
ACL 2023
RethinkingTMSC: An Empirical Study for Target-Oriented Multimodal Sentiment Classification
EMNLP 2023
Loose lips sink ships: Mitigating Length Bias in Reinforcement Learning from Human Feedback
EMNLP 2023
TextMixer: Mixing Multiple Inputs for Privacy-Preserving Inference
EMNLP 2023
Inductive Relation Inference of Knowledge Graph Enhanced by Ontology Information
EMNLP 2023
TextObfuscator: Making Pre-trained Language Model a Privacy Protector via Obfuscating Word Representations
ACL 2023
Coarse-to-fine Few-shot Learning for Named Entity Recognition
ACL 2023
Correspondence Transformers With Asymmetric Feature Learning and Matching Flow Super-Resolution
CVPR 2023
Characterizing the Impacts of Instances on Robustness
ACL 2023
A Confidence-based Partial Label Learning Model for Crowd-Annotated Named Entity Recognition
ACL 2023
Divide and Conquer: Text Semantic Matching with Disentangled Keywords and Intents
ACL 2022
Efficient and Robust Knowledge Graph Construction
AACL 2022
Robust Lottery Tickets for Pre-trained Language Models
ACL 2022
MINER: Improving Out-of-Vocabulary Named Entity Recognition from an Information Theoretic Perspective
ACL 2022
Flooding-X: Improving BERTโs Resistance to Adversarial Attacks via Loss-Restricted Fine-Tuning
ACL 2022
CQG: A Simple and Effective Controlled Generation Framework for Multi-hop Question Generation
ACL 2022
Less Is Better: Recovering Intended-Feature Subspace to Robustify NLU Models
COLING 2022
Read Extensively, Focus Smartly: A Cross-document Semantic Enhancement Method for Visual Documents NER
COLING 2022
PlugAT: A Plug and Play Module to Defend against Textual Adversarial Attack
COLING 2022
LFKQG: A Controlled Generation Framework with Local Fine-tuning for Question Generation over Knowledge Bases
COLING 2022
Causal Intervention Improves Implicit Sentiment Analysis
COLING 2022
Making Parameter-efficient Tuning More Efficient: A Unified Framework for Classification Tasks
COLING 2022
Cross-Linguistic Syntactic Difference in Multilingual BERT: How Good is It and How Does It Affect Transfer?
EMNLP 2022
Efficient Adversarial Training with Robust Early-Bird Tickets
EMNLP 2022
TextFusion: Privacy-Preserving Pre-trained Model Inference via Token Fusion
EMNLP 2022
ProofInfer: Generating Proof via Iterative Hierarchical Inference
EMNLP 2022
Searching for Optimal Subword Tokenization in Cross-domain NER
IJCAI 2022
Efficient and Robust Knowledge Graph Construction
IJCNLP 2022
Template-free Prompt Tuning for Few-shot NER
NAACL 2022
Heterogeneous Graph Neural Networks for Keyphrase Generation
EMNLP 2021
Low-Resource Dialogue Summarization with Domain-Agnostic Multi-Source Pretraining
EMNLP 2021
A Unified Generative Framework for Various NER Subtasks
IJCNLP 2021
A Relation-Oriented Clustering Method for Open Relation Extraction
EMNLP 2021
SENT: Sentence-level Distant Relation Extraction via Negative Training
IJCNLP 2021
TextFlint: Unified Multilingual Robustness Evaluation Toolkit for Natural Language Processing
ACL 2021
SENT: Sentence-level Distant Relation Extraction via Negative Training
ACL 2021
A Unified Generative Framework for Various NER Subtasks
ACL 2021
One2Set: Generating Diverse Keyphrases as a Set
ACL 2021
TextFlint: Unified Multilingual Robustness Evaluation Toolkit for Natural Language Processing
IJCNLP 2021
One2Set: Generating Diverse Keyphrases as a Set
IJCNLP 2021
Constructing Multiple Tasks for Augmentation: Improving Neural Image Classification with K-Means Features
AAAI 2020
Leveraging Document-Level Label Consistency for Named Entity Recognition
IJCAI 2020
Uncertainty-Aware Label Refinement for Sequence Labeling
EMNLP 2020
CNN-Based Chinese NER with Lexicon Rethinking
IJCAI 2019
Long Short-Term Memory with Dynamic Skip Connections
AAAI 2019
A Lexicon-Based Graph Neural Network for Chinese NER
IJCNLP 2019
Switch-LSTMs for Multi-Criteria Chinese Word Segmentation
AAAI 2019
Trainable Undersampling for Class-Imbalance Learning
AAAI 2019
A Lexicon-Based Graph Neural Network for Chinese NER
EMNLP 2019
Cooperative Multimodal Approach to Depression Detection in Twitter
AAAI 2019
Learning Task-Specific Representation for Novel Words in Sequence Labeling
IJCAI 2019
Transferring from Formal Newswire Domain with Hypernet for Twitter POS Tagging
EMNLP 2018
A Lexicon-Based Supervised Attention Model for Neural Sentiment Analysis
COLING 2018
Part-of-Speech Tagging for Twitter with Adversarial Neural Networks
EMNLP 2017