Hao Tian
30 papers · 2013–2026 · 11 top CS/AI conferences
Achievements
Renaissance Researcher
(6)
Interdisciplinary Bridge
Taxonomy Completionist
(11)
Keyword Pioneer
Hot Topic Early Bird
Conference Polyglot
(11)
Dynamic Duo
(16)
Mega-Team
(38)
Topic Evolution
Century Club
(29)
Prolific Year
(5)
Unstoppable
(6)
Keyword Collector
(126)
Conferences
ACL (5)
EMNLP (5)
AAAI (4)
CVPR (4)
ICLR (3)
IJCAI (2)
IJCNLP (2)
NAACL (2)
CoRL (1)
ICCV (1)
WACV (1)
Keywords
question answering
(3)
document understanding
(3)
large language model
(3)
text-to-image generation
(3)
language modeling
(2)
object detection
(2)
long-document modeling
(2)
language model
(2)
masked language modeling
(2)
pre-trained model
(2)
transfer learning
(2)
knowledge distillation
(2)
code generation
(2)
diffusion model
(2)
style transfer
(1)
few-shot learning
(1)
graph classification
(1)
unsupervised object detection
(1)
multi-task learning
(1)
transformer architecture
(1)
Papers
DIAA: A Decoding-Efficient Inference Acceleration Approach for On-Device Large Language Models
AAAI 2026
PVC: Progressive Visual Token Compression for Unified Image and Video Processing in Large Vision-Language Models
CVPR 2025
PUMA: Empowering Unified MLLM with Multi-granular Visual Generation
ICCV 2025
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text
ICLR 2025
MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models
ICLR 2025
JumpCoder: Go Beyond Autoregressive Coder via Online Modification
ACL 2024
Tool-Augmented Reward Modeling
ICLR 2024
COMBHelper: A Neural Approach to Reduce Search Space for Graph Combinatorial Problems
AAAI 2024
Deep Hierarchical Graph Alignment Kernels
IJCAI 2024
ERNIE-Music: Text-to-Waveform Music Generation with Diffusion Models
IJCNLP 2023
ERNIE-ViLG 2.0: Improving Text-to-Image Diffusion Model With Knowledge-Enhanced Mixture-of-Denoising-Experts
CVPR 2023
BEVFormer v2: Adapting Modern Image Backbones to Bird's-Eye-View Recognition via Perspective Supervision
CVPR 2023
Arbitrary Style Guidance for Enhanced Diffusion-Based Text-to-Image Generation
WACV 2023
ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for Programming Languages
ACL 2023
Clip-Tuning: Towards Derivative-free Prompt Learning with a Mixture of Rewards
EMNLP 2022
ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document Understanding
EMNLP 2022
ERNIE-M: Enhanced Multilingual Representation by Aligning Cross-lingual Semantics with Monolingual Corpora
EMNLP 2021
ERNIE-ViL: Knowledge Enhanced Vision-Language Representations through Scene Graphs
AAAI 2021
ERNIE-Doc: A Retrospective Long-Document Modeling Transformer
ACL 2021
Unsupervised Object Detection With LIDAR Clues
CVPR 2021
ERNIE-Doc: A Retrospective Long-Document Modeling Transformer
IJCNLP 2021
ERNIE-Gram: Pre-Training with Explicitly N-Gram Masked Language Modeling for Natural Language Understanding
NAACL 2021
ERNIE 2.0: A Continual Pre-Training Framework for Language Understanding
AAAI 2020
ERNIE-GEN: An Enhanced Multi-Flow Pre-training and Fine-tuning Framework for Natural Language Generation
IJCAI 2020
SKEP: Sentiment Knowledge Enhanced Pre-training for Sentiment Analysis
ACL 2020
Intervention Aided Reinforcement Learning for Safe and Practical Policy Optimization in Navigation
CoRL 2018
Multi-view Response Selection for Human-Computer Conversation
EMNLP 2016
Policy Learning for Domain Selection in an Extensible Multi-domain Spoken Dialogue System
EMNLP 2014
Compound Embedding Features for Semi-supervised Learning
NAACL 2013
Cross-lingual Projections between Languages from Different Families
ACL 2013