Han Xiao
28 papers · 2010–2026 · 12 conferences · across top CS/AI conferences
Achievements
Conference Polyglot (11)
Renaissance Researcher (5)
Interdisciplinary Bridge
Keyword Pioneer
Academic Marathon (15)
Cross-Pollinator (10)
Taxonomy Completionist (65)
Topic Evolution
Mega-Team (23)
Conference Pioneer
Century Club (25)
Prolific Year (9)
Keyword Collector (147)
Unstoppable (5)
Conferences
ACL (5)
CVPR (4)
AAAI (3)
EMNLP (3)
IJCAI (3)
ECCV (2)
ICCV (2)
NIPS (2)
ACML (1)
JMLR (1)
MIDL (1)
NSDI (1)
Keywords
multimodal large language model (3)
transfer learning (2)
parameter-efficient fine-tuning (2)
model compression (2)
attention mechanism (2)
efficient inference (2)
dense retrieval (2)
information retrieval (2)
bayesian inference (1)
mathematical reasoning (1)
reinforcement learning (1)
visual question answering (1)
multilingual nlp (1)
data augmentation (1)
embedding learning (1)
few-shot learning (1)
multimodal learning (1)
image retrieval (1)
vision-language alignment (1)
chain-of-thought reasoning (1)
Papers
UI-R1: Enhancing Efficient Action Prediction of GUI Agents by Reinforcement Learning
AAAI 2026
HMformer: Unleashing Transformer's Potential for Time Series Forecasting via Hierarchical Multi-Scale Modeling
AAAI 2026
BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices
CVPR 2025
AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark
ACL 2025
jina-embeddings-v4: Universal Embeddings for Multimodal Multilingual Retrieval
EMNLP 2025
AMEX: Android Multi-annotation Expo Dataset for Mobile GUI Agents
ACL 2025
MathCoder-VL: Bridging Vision and Code for Enhanced Multimodal Mathematical Reasoning
ACL 2025
PCA-YOLO: A Small Liver Tumor Detection Model with Patch-Contrastive Attention
MIDL 2025
NuMDS: An Efficient Local Search Algorithm for Minimum Dominating Set Problem
IJCAI 2025
Adaptive Markup Language Generation for Contextually-Grounded Visual Document Understanding
CVPR 2025
SPHINX: A Mixer of Weights, Visual Embeddings and Image Scales for Multi-modal Large Language Models
ECCV 2024
Lumina-Next: Making Lumina-T2X Stronger and Faster with Next-DiT
NIPS 2024
G-Adapter: Towards Structure-Aware Parameter-Efficient Transfer Learning for Graph Transformer Networks
AAAI 2024
No Time to Train: Empowering Non-Parametric Networks for Few-shot 3D Scene Segmentation
CVPR 2024
SpatialFormer: Towards Generalizable Vision Transformers with Explicit Spatial Understanding
ECCV 2024
Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning
NIPS 2024
Jina-ColBERT-v2: A General-Purpose Multilingual Late Interaction Retriever
EMNLP 2024
Dual Enhancement in ODI Super-Resolution: Adapting Convolution and Upsampling to Projection Distortion
IJCAI 2024
POSEIDON: A Consolidated Virtual Network Controller that Manages Millions of Tenants via Config Tree
NSDI 2024
Jina Embeddings: A Novel Set of High-Performance Sentence Embedding Models
EMNLP 2023
Token-Label Alignment for Vision Transformers
ICCV 2023
HiFi: High-Information Attention Heads Hold for Parameter-Efficient Model Adaptation
ACL 2023
KoPA: Automated Kronecker Product Approximation
JMLR 2022
Shapley-NAS: Discovering Operation Contribution for Neural Architecture Search
CVPR 2022
Generalizable Mixed-Precision Quantization via Attribution Rank Preservation
ICCV 2021
TransG: A Generative Model for Knowledge Graph Embedding
ACL 2016
From One Point to a Manifold: Knowledge Graph Embedding for Precise Link Prediction
IJCAI 2016
Efficient Collapsed Gibbs Sampling for Latent Dirichlet Allocation
ACML 2010