Haitao Mi

66 papers · 2008–2026 · 12 conferences · across top CS/AI conferences

Achievements

+13 more ↓

🌍 Conference Polyglot (12) 🐣 Hot Topic Early Bird 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🏃 Academic Marathon (17)

🐣 Hot Topic Early Bird 🐝 Cross-Pollinator (13) 🗺️ Taxonomy Completionist (61) 🤝 Dynamic Duo (22) 🏆 Grand Slam 🌱 Topic Pioneer 🧬 Topic Evolution ❓ The Questioner ⚡ Prolific Year (7) 📈 Trend Setter 🗃️ Keyword Collector (127) 🔥 Unstoppable (5) 💎 Century Club (59)

Conferences

ACL (24) EMNLP (16) COLING (8) IJCNLP (6) EACL (3) ICLR (3) AAAI (1) CONLL (1) ICML (1) IJCAI (1) NAACL (1) NIPS (1)

Top co-authors

Dong Yu (29) Linfeng Song (20) Qun Liu (15) Lifeng Jin (13) Baolin Peng (9) Liang Huang (9) Dian Yu (9) Yang Liu (8) Ye Tian (8) Zhiguo Wang (7)

Keywords

large language model (8) reinforcement learning (6) unsupervised parsing (3) web agent (3) differentiable tree (2) grammar induction (2) hierarchical language modeling (2) dialogue system (2) cky parsing (2) language model (2) response generation (2) mathematical reasoning (2) hallucination mitigation (2) monte carlo tree search (2) recursive transformer (2) self-supervised learning (1) direct preference optimization (1) temporal difference learning (1) embedding learning (1) semi-supervised learning (1)

Papers

Measure Twice, Click Once: Co-evolving Proposer and Visual Critic via Reinforcement Learning for GUI Grounding ACL 2026 Your Reasoning Model is Secretly a Reward Model - Optimization-Free Verification from Experience ACL 2026 Crossing the Reward Bridge: Expanding Reinforcement Learning with Verifiable Rewards Across Diverse Domains ACL 2026 WebRollback: Enhancing Web Agents with Explicit Rollback Mechanisms EACL 2026 EconProver: Towards More Economical Test-Time Scaling for Automated Theorem Proving ACL 2026 Too Correct to Learn: Reinforcement Learning on Saturated Reasoning Data ACL 2026 WebAggregator: Enhancing Compositional Reasoning Capabilities of Deep Research Agent Foundation Models ACL 2026 WebEvolver: Enhancing Web Agent Self-Improvement with Co-evolving World Model EMNLP 2025 Recall with Reasoning: Chain-of-Thought Distillation for Mamba’s Long-Context Memory and Extrapolation EMNLP 2025 LiteSearch: Efficient Tree Search with Dynamic Exploration Budget for Math Reasoning AAAI 2025 Low-Bit Quantization Favors Undertrained LLMs ACL 2025 Don’t Get Lost in the Trees: Streamlining LLM Reasoning by Overcoming Tree Search Exploration Pitfalls ACL 2025 Self-Tuning: Instructing LLMs to Effectively Acquire New Knowledge through Self-Teaching ACL 2025 Entropy Guided Extrapolative Decoding to Improve Factuality in Large Language Models COLING 2025 Do NOT Think That Much for 2+3=? On the Overthinking of Long Reasoning Models ICML 2025 Iterative Nash Policy Optimization: Aligning LLMs with General Preferences via No-Regret Learning ICLR 2025 DOTS: Learning to Reason Dynamically in LLMs via Optimal Reasoning Trajectories Search ICLR 2025 WebCoT: Enhancing Web Agent Reasoning by Reconstructing Chain-of-Thought in Reflection, Branching, and Rollback EMNLP 2025 Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing NIPS 2024 Self-Alignment for Factuality: Mitigating Hallucinations in LLMs via Self-Evaluation ACL 2024 Improving LLM Generations via Fine-Grained Self-Endorsement ACL 2024 A Knowledge Plug-and-Play Test Bed for Open-domain Dialogue Generation COLING 2024 Inconsistent dialogue responses and how to recover from them EACL 2024 Self-Consistency Boosts Calibration for Math Reasoning EMNLP 2024 The Trickle-down Impact of Reward Inconsistency on RLHF ICLR 2024 SafeConv: Explaining and Correcting Conversational Unsafe Behavior ACL 2023 Friend-training: Learning from Models of Different but Related Tasks EACL 2023 More Than Spoken Words: Nonverbal Message Extraction and Generation EMNLP 2023 Bi-level Finetuning with Task-dependent Similarity Structure for Low-resource Training ACL 2023 Cross-lingual Text-to-SQL Semantic Parsing with Representation Mixup EMNLP 2022 Fast-R2D2: A Pretrained Recursive Neural Network based on Pruned CKY for Grammar Induction and Text Representation EMNLP 2022 Learning a Grammar Inducer from Massive Uncurated Instructional Videos EMNLP 2022 A Dialogue-based Information Extraction System for Medical Insurance Assessment IJCNLP 2021 R2D2: Recursive Transformer based on Differentiable Tree for Interpretable Hierarchical Language Modeling IJCNLP 2021 R2D2: Recursive Transformer based on Differentiable Tree for Interpretable Hierarchical Language Modeling ACL 2021 A Dialogue-based Information Extraction System for Medical Insurance Assessment ACL 2021 IIAS: An Intelligent Insurance Assessment System through Online Real-time Conversation Analysis IJCAI 2021 Semi-supervised Clustering for Short Text via Deep Representation Learning CONLL 2016 Vocabulary Manipulation for Neural Machine Translation ACL 2016 Coverage Embedding Models for Neural Machine Translation EMNLP 2016 Supervised Attentions for Neural Machine Translation EMNLP 2016 Sentence Similarity Learning by Lexical Decomposition and Composition COLING 2016 Feature Optimization for Constituent Parsing via Neural Networks IJCNLP 2015 Shift-Reduce Constituency Parsing with Dynamic Programming and POS Tag Lattice NAACL 2015 Feature Optimization for Constituent Parsing via Neural Networks ACL 2015 A Structured Language Model for Incremental Tree-to-String Translation COLING 2014 Hierarchical MT Training using Max-Violation Perceptron ACL 2014 Max-Violation Perceptron and Forced Decoding for Scalable MT Training EMNLP 2013 Flexible and Efficient Hypergraph Interactions for Joint Hierarchical and Forest-to-String Decoding EMNLP 2013 Rule Markov Models for Fast Tree-to-String Translation ACL 2011 A novel dependency-to-string model for statistical machine translation EMNLP 2011 An Efficient Shift-Reduce Decoding Algorithm for Phrased-Based Machine Translation COLING 2010 Machine Translation with Lattices and Forests COLING 2010 Dependency-Based Bracketing Transduction Grammar for Statistical Machine Translation COLING 2010 Efficient Incremental Decoding for Tree-to-String Translation EMNLP 2010 Constituency to Dependency Translation with Forests ACL 2010 Learning Lexicalized Reordering Models from Reordering Graphs ACL 2010 Sub-Sentence Division for Tree-Based Machine Translation ACL 2009 Joint Decoding with Multiple Translation Models IJCNLP 2009 Joint Decoding with Multiple Translation Models ACL 2009 Lattice-based System Combination for Statistical Machine Translation EMNLP 2009 Sub-Sentence Division for Tree-Based Machine Translation IJCNLP 2009 Forest-based Translation Rule Extraction EMNLP 2008 Refinements in BTG-based Statistical Machine Translation IJCNLP 2008 Forest-Based Translation ACL 2008 Word Lattice Reranking for Chinese Word Segmentation and Part-of-Speech Tagging COLING 2008