conftrace_

Tao Gui

132 papers · 2017–2026 · 11 conferences · across top CS/AI conferences

Achievements

🌍 Conference Polyglot (11) 🧭 Keyword Pioneer 🗺️ Taxonomy Completionist (21) 🌉 Interdisciplinary Bridge 🏃 Academic Marathon (8) 🏠 Conference Loyalist (42) 🤝 Dynamic Duo (108) 👥 Mega-Team (34) 🔬 Deep Specialist (22) 🧬 Topic Evolution 🏆 Keyword Champion (2) ⚡ Prolific Year (8) ❓ The Questioner (2) 🗃️ Keyword Collector (492) 💎 Century Club (119) 🔥 Unstoppable (9) 📈 Trend Setter 🚀 Conference Pioneer

Conferences

ACL (52) EMNLP (37) COLING (13) AAAI (10) IJCNLP (6) IJCAI (4) ICLR (3) NAACL (3) CVPR (2) AACL (1) ICML (1)

Top co-authors

Qi Zhang (119) Xuanjing Huang (100) Zhiheng Xi (40) Rui Zheng (33) Shihan Dou (24) Junjie Ye (19) Ruotian Ma (16) Xin Zhou (15) Xiao Wang (15) Jun Zhao (15)

Research topics

Privacy (2) Applications (1)

Keywords

large language model (32) named entity recognition (16) reinforcement learning (12) adversarial training (9) language model (9) transfer learning (8) relation extraction (7) representation learning (7) domain adaptation (7) reward model (7) reinforcement learning from human feedback (6) pre-trained language model (6) model compression (6) question answering (5) text classification (5) preference alignment (4) adversarial attack (4) text generation (4) few-shot learning (4) natural language processing (4)

Papers

AgentGym2: Benchmarking Large Language Model Agents in De-Idealized Real-World Environments ACL 2026
Enhancing LLM-based Search Agents via Contribution Weighted Group Relative Policy Optimization ACL 2026
Beyond Scaling: Measuring and Predicting the Upper Bound of Knowledge Retention in Language Model Pre-Training ACL 2026
Counteracting the Matthew Effect in Self-Improvement of LVLMs through Head-Tail Re-balancing ACL 2026
VRPO: Rethinking Value Modeling for Robust RL under Noisy Supervision in LLM Post-Training ACL 2026
DARM: Distribution-Aware Reward Modeling by Alleviating Biases from Low Preference-Context Dependency Data ACL 2026
Outcome Accuracy is Not Enough: Aligning the Reasoning Process of Reward Models ACL 2026
Which Reasoning Trajectories Teach Students to Reason Better? A Simple Metric of Informative Alignment ACL 2026
OctoBench: Benchmarking Scaffold-Aware Instruction Following in Repository-Grounded Agentic Coding ACL 2026
LLMEval-Fair: A Large-Scale Longitudinal Study on Robust and Fair Evaluation of Large Language Models ACL 2026
What Makes a Good Speech Tokenizer for LLM-Centric Speech Generation? A Systematic Study AAAI 2026
MHA2MLA-VLM: Enabling DeepSeek’s Economical Multi-Head Latent Attention Across Vision-Language Models AAAI 2026
MetaAct-RL: Training Language Models for Reasoning Through Meta-Action-Based Reinforcement Learning AAAI 2026
Mitigating Tail Narrowing in LLM Self-Improvement via Socratic-Guided Sampling NAACL 2025
SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Models CVPR 2025
Beyond Boundaries: Learning a Universal Entity Taxonomy across Datasets and Languages for Open Named Entity Recognition COLING 2025
ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios COLING 2025
Governance in Motion: Co-evolution of Constitutions and AI models for Scalable Safety EMNLP 2025
Parrot: A Training Pipeline Enhances Both Program CoT and Natural Language CoT for Reasoning EMNLP 2025
LoRACoE: Improving Large Language Model via Composition-based LoRA Expert EMNLP 2025
Toward Optimal LLM Alignments Using Two-Player Games EMNLP 2025
TL-Training: A Task-Feature-Based Framework for Training Large Language Models in Tool Use EMNLP 2025
Mitigating Object Hallucinations in MLLMs via Multi-Frequency Perturbations EMNLP 2025
Distill Visual Chart Reasoning Ability from LLMs to MLLMs EMNLP 2025
Analyzing the Effects of Supervised Fine-Tuning on Model Knowledge from Token and Parameter Levels EMNLP 2025
LLMEval-Med: A Real-world Clinical Benchmark for Medical LLMs with Physician Validation EMNLP 2025
Have the VLMs Lost Confidence? A Study of Sycophancy in VLMs ICLR 2025
ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use ACL 2025
Lost in the Context: Insufficient and Distracted Attention to Contexts in Preference Modeling ACL 2025
CritiQ: Mining Data Quality Criteria from Human Preferences ACL 2025
Measuring Data Diversity for Instruction Tuning: A Systematic Analysis and A Reliable Metric ACL 2025
AgentGym: Evaluating and Training Large Language Model-based Agents across Diverse Environments ACL 2025
Towards Economical Inference: Enabling DeepSeek’s Multi-Head Latent Attention in Any Transformer-based LLMs ACL 2025
Multi-Programming Language Sandbox for LLMs ACL 2025
PFDial: A Structured Dialogue Instruction Fine-tuning Method Based on UML Flowcharts ACL 2025
Better Process Supervision with Bi-directional Rewarding Signals ACL 2025
Alleviating Shifted Distribution in Human Preference Alignment through Meta-Learning AAAI 2025
RMB: Comprehensively benchmarking reward models in LLM alignment ICLR 2025
Improving Discriminative Capability of Reward Models in RLHF Using Contrastive Learning EMNLP 2024
Unveiling and Consulting Core Experts in Retrieval-Augmented MoE-based LLMs EMNLP 2024
Reward Modeling Requires Automatic Adjustment Based on Data Quality EMNLP 2024
LongHeads: Multi-Head Attention is Secretly a Long Context Processor EMNLP 2024
PDF-to-Tree: Parsing PDF Text Blocks into a Tree EMNLP 2024
Improving Generalization of Alignment with Human Preferences through Group Invariant Learning ICLR 2024
LoRAMoE: Alleviating World Knowledge Forgetting in Large Language Models via MoE-Style Plugin ACL 2024
ToolSword: Unveiling Safety Issues of Large Language Models in Tool Learning Across Three Stages ACL 2024
Enhancing Contrastive Learning with Noise-Guided Attack: Towards Continual Relation Extraction in the Wild ACL 2024
StepCoder: Improving Code Generation with Reinforcement Learning from Compiler Feedback ACL 2024
Navigating the OverKill in Large Language Models ACL 2024
Unveiling Linguistic Regions in Large Language Models ACL 2024
AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling ACL 2024
Rescue: Ranking LLM Responses with Partial Ordering to Improve Response Generation ACL 2024
P4: Plug-and-Play Discrete Prompting for Large Language Models Personalization ACL 2024
Making Harmful Behaviors Unlearnable for Large Language Models ACL 2024
Length Generalization of Causal Transformers without Position Encoding ACL 2024
Domain Generalization via Causal Adjustment for Cross-Domain Sentiment Analysis COLING 2024
ORTicket: Let One Robust BERT Ticket Transfer across Different Tasks COLING 2024
RoCoIns: Enhancing Robustness of Large Language Models through Code-Style Instructions COLING 2024
Subspace Defense: Discarding Adversarial Perturbations by Learning a Subspace for Clean Signals COLING 2024
Self-Demos: Eliciting Out-of-Demonstration Generalizability in Large Language Models NAACL 2024
Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning ICML 2024
LLMEval: A Preliminary Study on How to Evaluate Large Language Models AAAI 2024
RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning EMNLP 2024
Modeling Layout Reading Order as Ordering Relations for Visually-rich Document Understanding EMNLP 2024
TransferTOD: A Generalizable Chinese Multi-Domain Task-Oriented Dialogue System with Transfer Capabilities EMNLP 2024
LONGAGENT: Achieving Question Answering for 128k-Token-Long Documents through Multi-Agent Collaboration EMNLP 2024
Farewell to Aimless Large-scale Pretraining: Influential Subset Selection for Language Model ACL 2023
Modeling the Q-Diversity in a Min-max Play Game for Robust Optimization ACL 2023
Towards Understanding Omission in Dialogue Summarization ACL 2023
RealBehavior: A Framework for Faithfully Characterizing Foundation Models’ Human-like Behavior Mechanisms EMNLP 2023
Orthogonal Subspace Learning for Language Model Continual Learning EMNLP 2023
Open Set Relation Extraction via Unknown-Aware Training ACL 2023
RE-Matching: A Fine-Grained Semantic Matching Method for Zero-Shot Relation Extraction ACL 2023
Learning “O” Helps for Learning More: Handling the Unlabeled Entity Problem for Class-incremental NER ACL 2023
Actively Supervised Clustering for Open Relation Extraction ACL 2023
Self-Polish: Enhance Reasoning in Large Language Models via Problem Refinement EMNLP 2023
Reading Order Matters: Information Extraction from Visually-rich Documents by Token Path Prediction EMNLP 2023
Connectivity Patterns are Task Embeddings ACL 2023
Detecting Adversarial Samples through Sharpness of Loss Landscape ACL 2023
RethinkingTMSC: An Empirical Study for Target-Oriented Multimodal Sentiment Classification EMNLP 2023
Loose lips sink ships: Mitigating Length Bias in Reinforcement Learning from Human Feedback EMNLP 2023
TextMixer: Mixing Multiple Inputs for Privacy-Preserving Inference EMNLP 2023
Inductive Relation Inference of Knowledge Graph Enhanced by Ontology Information EMNLP 2023
TextObfuscator: Making Pre-trained Language Model a Privacy Protector via Obfuscating Word Representations ACL 2023
Coarse-to-fine Few-shot Learning for Named Entity Recognition ACL 2023
Correspondence Transformers With Asymmetric Feature Learning and Matching Flow Super-Resolution CVPR 2023
Characterizing the Impacts of Instances on Robustness ACL 2023
A Confidence-based Partial Label Learning Model for Crowd-Annotated Named Entity Recognition ACL 2023
Divide and Conquer: Text Semantic Matching with Disentangled Keywords and Intents ACL 2022
Efficient and Robust Knowledge Graph Construction AACL 2022
Robust Lottery Tickets for Pre-trained Language Models ACL 2022
MINER: Improving Out-of-Vocabulary Named Entity Recognition from an Information Theoretic Perspective ACL 2022
Flooding-X: Improving BERT’s Resistance to Adversarial Attacks via Loss-Restricted Fine-Tuning ACL 2022
CQG: A Simple and Effective Controlled Generation Framework for Multi-hop Question Generation ACL 2022
Less Is Better: Recovering Intended-Feature Subspace to Robustify NLU Models COLING 2022
Read Extensively, Focus Smartly: A Cross-document Semantic Enhancement Method for Visual Documents NER COLING 2022
PlugAT: A Plug and Play Module to Defend against Textual Adversarial Attack COLING 2022
LFKQG: A Controlled Generation Framework with Local Fine-tuning for Question Generation over Knowledge Bases COLING 2022
Causal Intervention Improves Implicit Sentiment Analysis COLING 2022
Making Parameter-efficient Tuning More Efficient: A Unified Framework for Classification Tasks COLING 2022
Cross-Linguistic Syntactic Difference in Multilingual BERT: How Good is It and How Does It Affect Transfer? EMNLP 2022
Efficient Adversarial Training with Robust Early-Bird Tickets EMNLP 2022
TextFusion: Privacy-Preserving Pre-trained Model Inference via Token Fusion EMNLP 2022
ProofInfer: Generating Proof via Iterative Hierarchical Inference EMNLP 2022
Searching for Optimal Subword Tokenization in Cross-domain NER IJCAI 2022
Efficient and Robust Knowledge Graph Construction IJCNLP 2022
Template-free Prompt Tuning for Few-shot NER NAACL 2022
Heterogeneous Graph Neural Networks for Keyphrase Generation EMNLP 2021
Low-Resource Dialogue Summarization with Domain-Agnostic Multi-Source Pretraining EMNLP 2021
A Unified Generative Framework for Various NER Subtasks IJCNLP 2021
A Relation-Oriented Clustering Method for Open Relation Extraction EMNLP 2021
SENT: Sentence-level Distant Relation Extraction via Negative Training IJCNLP 2021
TextFlint: Unified Multilingual Robustness Evaluation Toolkit for Natural Language Processing ACL 2021
SENT: Sentence-level Distant Relation Extraction via Negative Training ACL 2021
A Unified Generative Framework for Various NER Subtasks ACL 2021
One2Set: Generating Diverse Keyphrases as a Set ACL 2021
TextFlint: Unified Multilingual Robustness Evaluation Toolkit for Natural Language Processing IJCNLP 2021
One2Set: Generating Diverse Keyphrases as a Set IJCNLP 2021
Constructing Multiple Tasks for Augmentation: Improving Neural Image Classification with K-Means Features AAAI 2020
Leveraging Document-Level Label Consistency for Named Entity Recognition IJCAI 2020
Uncertainty-Aware Label Refinement for Sequence Labeling EMNLP 2020
CNN-Based Chinese NER with Lexicon Rethinking IJCAI 2019
Long Short-Term Memory with Dynamic Skip Connections AAAI 2019
A Lexicon-Based Graph Neural Network for Chinese NER IJCNLP 2019
Switch-LSTMs for Multi-Criteria Chinese Word Segmentation AAAI 2019
Trainable Undersampling for Class-Imbalance Learning AAAI 2019
A Lexicon-Based Graph Neural Network for Chinese NER EMNLP 2019
Cooperative Multimodal Approach to Depression Detection in Twitter AAAI 2019
Learning Task-Specific Representation for Novel Words in Sequence Labeling IJCAI 2019
Transferring from Formal Newswire Domain with Hypernet for Twitter POS Tagging EMNLP 2018
A Lexicon-Based Supervised Attention Model for Neural Sentiment Analysis COLING 2018
Part-of-Speech Tagging for Twitter with Adversarial Neural Networks EMNLP 2017