Haitao Mi
66 papers · 2008–2026 · 12 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+13 more ↓ Show less ↑
π Conference Polyglot (12) π£ Hot Topic Early Bird π Interdisciplinary Bridge π§ Keyword Pioneer π Academic Marathon (17)
π£
Hot Topic Early Bird
π
Cross-Pollinator
(13)
πΊοΈ
Taxonomy Completionist
(61)
π€
Dynamic Duo
(22)
π
Grand Slam
π±
Topic Pioneer
π§¬
Topic Evolution
β
The Questioner
β‘
Prolific Year
(7)
π
Trend Setter
ποΈ
Keyword Collector
(127)
π₯
Unstoppable
(5)
π
Century Club
(59)
Conferences
ACL (24)
EMNLP (16)
COLING (8)
IJCNLP (6)
EACL (3)
ICLR (3)
AAAI (1)
CONLL (1)
ICML (1)
IJCAI (1)
NAACL (1)
NIPS (1)
Top co-authors
Keywords
large language model
(8)
reinforcement learning
(6)
unsupervised parsing
(3)
web agent
(3)
differentiable tree
(2)
grammar induction
(2)
hierarchical language modeling
(2)
dialogue system
(2)
cky parsing
(2)
language model
(2)
response generation
(2)
mathematical reasoning
(2)
hallucination mitigation
(2)
monte carlo tree search
(2)
recursive transformer
(2)
self-supervised learning
(1)
direct preference optimization
(1)
temporal difference learning
(1)
embedding learning
(1)
semi-supervised learning
(1)
Papers
Measure Twice, Click Once: Co-evolving Proposer and Visual Critic via Reinforcement Learning for GUI Grounding
ACL 2026
Your Reasoning Model is Secretly a Reward Model - Optimization-Free Verification from Experience
ACL 2026
Crossing the Reward Bridge: Expanding Reinforcement Learning with Verifiable Rewards Across Diverse Domains
ACL 2026
WebRollback: Enhancing Web Agents with Explicit Rollback Mechanisms
EACL 2026
EconProver: Towards More Economical Test-Time Scaling for Automated Theorem Proving
ACL 2026
Too Correct to Learn: Reinforcement Learning on Saturated Reasoning Data
ACL 2026
WebAggregator: Enhancing Compositional Reasoning Capabilities of Deep Research Agent Foundation Models
ACL 2026
WebEvolver: Enhancing Web Agent Self-Improvement with Co-evolving World Model
EMNLP 2025
Recall with Reasoning: Chain-of-Thought Distillation for Mambaβs Long-Context Memory and Extrapolation
EMNLP 2025
LiteSearch: Efficient Tree Search with Dynamic Exploration Budget for Math Reasoning
AAAI 2025
Low-Bit Quantization Favors Undertrained LLMs
ACL 2025
Donβt Get Lost in the Trees: Streamlining LLM Reasoning by Overcoming Tree Search Exploration Pitfalls
ACL 2025
Self-Tuning: Instructing LLMs to Effectively Acquire New Knowledge through Self-Teaching
ACL 2025
Entropy Guided Extrapolative Decoding to Improve Factuality in Large Language Models
COLING 2025
Do NOT Think That Much for 2+3=? On the Overthinking of Long Reasoning Models
ICML 2025
Iterative Nash Policy Optimization: Aligning LLMs with General Preferences via No-Regret Learning
ICLR 2025
DOTS: Learning to Reason Dynamically in LLMs via Optimal Reasoning Trajectories Search
ICLR 2025
WebCoT: Enhancing Web Agent Reasoning by Reconstructing Chain-of-Thought in Reflection, Branching, and Rollback
EMNLP 2025
Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing
NIPS 2024
Self-Alignment for Factuality: Mitigating Hallucinations in LLMs via Self-Evaluation
ACL 2024
Improving LLM Generations via Fine-Grained Self-Endorsement
ACL 2024
A Knowledge Plug-and-Play Test Bed for Open-domain Dialogue Generation
COLING 2024
Inconsistent dialogue responses and how to recover from them
EACL 2024
Self-Consistency Boosts Calibration for Math Reasoning
EMNLP 2024
The Trickle-down Impact of Reward Inconsistency on RLHF
ICLR 2024
SafeConv: Explaining and Correcting Conversational Unsafe Behavior
ACL 2023
Friend-training: Learning from Models of Different but Related Tasks
EACL 2023
More Than Spoken Words: Nonverbal Message Extraction and Generation
EMNLP 2023
Bi-level Finetuning with Task-dependent Similarity Structure for Low-resource Training
ACL 2023
Cross-lingual Text-to-SQL Semantic Parsing with Representation Mixup
EMNLP 2022
Fast-R2D2: A Pretrained Recursive Neural Network based on Pruned CKY for Grammar Induction and Text Representation
EMNLP 2022
Learning a Grammar Inducer from Massive Uncurated Instructional Videos
EMNLP 2022
A Dialogue-based Information Extraction System for Medical Insurance Assessment
IJCNLP 2021
R2D2: Recursive Transformer based on Differentiable Tree for Interpretable Hierarchical Language Modeling
IJCNLP 2021
R2D2: Recursive Transformer based on Differentiable Tree for Interpretable Hierarchical Language Modeling
ACL 2021
A Dialogue-based Information Extraction System for Medical Insurance Assessment
ACL 2021
IIAS: An Intelligent Insurance Assessment System through Online Real-time Conversation Analysis
IJCAI 2021
Semi-supervised Clustering for Short Text via Deep Representation Learning
CONLL 2016
Vocabulary Manipulation for Neural Machine Translation
ACL 2016
Coverage Embedding Models for Neural Machine Translation
EMNLP 2016
Supervised Attentions for Neural Machine Translation
EMNLP 2016
Sentence Similarity Learning by Lexical Decomposition and Composition
COLING 2016
Feature Optimization for Constituent Parsing via Neural Networks
IJCNLP 2015
Shift-Reduce Constituency Parsing with Dynamic Programming and POS Tag Lattice
NAACL 2015
Feature Optimization for Constituent Parsing via Neural Networks
ACL 2015
A Structured Language Model for Incremental Tree-to-String Translation
COLING 2014
Hierarchical MT Training using Max-Violation Perceptron
ACL 2014
Max-Violation Perceptron and Forced Decoding for Scalable MT Training
EMNLP 2013
Flexible and Efficient Hypergraph Interactions for Joint Hierarchical and Forest-to-String Decoding
EMNLP 2013
Rule Markov Models for Fast Tree-to-String Translation
ACL 2011
A novel dependency-to-string model for statistical machine translation
EMNLP 2011
An Efficient Shift-Reduce Decoding Algorithm for Phrased-Based Machine Translation
COLING 2010
Machine Translation with Lattices and Forests
COLING 2010
Dependency-Based Bracketing Transduction Grammar for Statistical Machine Translation
COLING 2010
Efficient Incremental Decoding for Tree-to-String Translation
EMNLP 2010
Constituency to Dependency Translation with Forests
ACL 2010
Learning Lexicalized Reordering Models from Reordering Graphs
ACL 2010
Sub-Sentence Division for Tree-Based Machine Translation
ACL 2009
Joint Decoding with Multiple Translation Models
IJCNLP 2009
Joint Decoding with Multiple Translation Models
ACL 2009
Lattice-based System Combination for Statistical Machine Translation
EMNLP 2009
Sub-Sentence Division for Tree-Based Machine Translation
IJCNLP 2009
Forest-based Translation Rule Extraction
EMNLP 2008
Refinements in BTG-based Statistical Machine Translation
IJCNLP 2008
Forest-Based Translation
ACL 2008
Word Lattice Reranking for Chinese Word Segmentation and Part-of-Speech Tagging
COLING 2008