Hong Liu
90 papers · 2016–2026 · 16 top CS/AI conferences
Achievements
Keyword Pioneer
Conference Polyglot (15)
Interdisciplinary Bridge
Renaissance Researcher (6)
Academic Marathon (9)
Cross-Pollinator (9)
Taxonomy Completionist (132)
Conference Loyalist (21)
Dynamic Duo (17)
Grand Slam
Triple Crown
Topic Pioneer
Deep Specialist (11)
Century Club (84)
Trend Setter
Unstoppable (10)
Prolific Year (14)
Conference Pioneer
Keyword Collector (353)
Conferences
AAAI (26)
CVPR (9)
ICCV (8)
IJCAI (7)
INTERSPEECH (7)
MICCAI (7)
ECCV (5)
ICLR (4)
ICML (4)
NIPS (4)
EMNLP (3)
NAACL (2)
ACL (1)
ACML (1)
COLING (1)
WACV (1)
Research topics
Keywords
person re-identification (5)
human pose estimation (5)
multimodal learning (5)
attention mechanism (5)
domain adaptation (4)
action recognition (4)
semantic segmentation (4)
unsupervised learning (4)
temporal modeling (4)
video understanding (3)
transfer learning (3)
federated learning (3)
self-supervised learning (3)
transformer architecture (3)
adversarial robustness (3)
representation learning (3)
domain generalization (3)
depth estimation (3)
3d vision (3)
3d human pose estimation (3)
Papers
QuMAB: Query-based Multi-annotator Behavior Pattern Learning
AAAI 2026
MoCa: Modality-aware Continual Pre-training Makes Better Bidirectional Multimodal Embeddings
ACL 2026
SimLabel: Similarity-Weighted Semi-supervision for Multi-annotator Learning with Missing Labels
AAAI 2026
Listening Between the Frames: Bridging Temporal Gaps in Large Audio-Language Models
AAAI 2026
Masked Clustering Prediction for Unsupervised Point Cloud Pre-training
AAAI 2026
Debiased Multiplex Tokenizer for Efficient Map-Free Visual Relocalization
AAAI 2026
GraphGPT: Generative Pre-trained Graph Eulerian Transformer
ICML 2025
UST-SSM: Unified Spatio-Temporal State Space Models for Point Cloud Video Modeling
ICCV 2025
Recognizing Actions from Robotic View for Natural Human-Robot Interaction
ICCV 2025
CPRM: A LLM-based Continual Pre-training Framework for Relevance Modeling in Commercial Search
NAACL 2025
TCPFormer: Learning Temporal Correlation with Implicit Pose Proxy for 3D Human Pose Estimation
AAAI 2025
PDDM: Pseudo Depth Diffusion Model for RGB-PD Semantic Segmentation Based in Complex Indoor Scenes
AAAI 2025
SVTformer: Spatial-View-Temporal Transformer for Multi-View 3D Human Pose Estimation
AAAI 2025
LiD-FL: Towards List-Decodable Federated Learning
AAAI 2025
ABQ-LLM: Arbitrary-Bit Quantized Inference Acceleration for Large Language Models
AAAI 2025
RRG-DPO: Direct Preference Optimization for Clinically Accurate Radiology Report Generation
MICCAI 2025
PathoPainter: Augmenting Histopathology Segmentation via Tumor-aware Inpainting
MICCAI 2025
Enhancing WSI-Based Survival Analysis with Report-Auxiliary Self-Distillation
MICCAI 2025
DSFC: Deformation-Aware Learning Strategy via Self-sustaining Feedback Cycle for Medical Vision Foundation Model Domain Adaptation
MICCAI 2025
Conservative-Radical Complementary Learning for Class-incremental Medical Image Analysis with Pre-trained Foundation Models
MICCAI 2025
Paladin: Understanding Video Intentions in Political Advertisement Videos
WACV 2025
Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose Estimation
CVPR 2024
Leveraging Language Model Capabilities for Sound Event Detection
INTERSPEECH 2024
AMG-AVSR: Adaptive Modality Guidance for Audio-Visual Speech Recognition via Progressive Feature Enhancement
ACML 2024
DiffusionFake: Enhancing Generalization in Deepfake Detection via Guided Stable Diffusion
NIPS 2024
Mitigating robust overfitting via self-residual-calibration regularization (Abstract Reprint)
IJCAI 2024
SCMIL: Sparse Context-aware Multiple Instance Learning for Predicting Cancer Survival Probability Distribution in Whole Slide Images
MICCAI 2024
Chain of Thought Empowers Transformers to Solve Inherently Serial Problems
ICLR 2024
Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training
ICLR 2024
Federated Modality-Specific Encoders and Multimodal Anchors for Personalized Brain Tumor Segmentation
AAAI 2024
Mask-Homo: Pseudo Plane Mask-Guided Unsupervised Multi-Homography Estimation
AAAI 2024
Near-Optimal Resilient Aggregation Rules for Distributed Learning Using 1-Center and 1-Mean Clustering with Outliers
AAAI 2024
Audio Generation with Multiple Conditional Diffusion Model
AAAI 2024
Learning to Segment Multiple Organs from Multimodal Partially Labeled Datasets
MICCAI 2024
Position: Towards Implicit Prompt For Text-To-Image Models
ICML 2024
Knowledge-Retrieval Task-Oriented Dialog Systems with Semi-Supervision
INTERSPEECH 2023
Inferential Knowledge-Enhanced Integrated Reasoning for Video Question Answering
AAAI 2023
Same Pre-training Loss, Better Downstream: Implicit Bias Matters for Language Models
ICML 2023
STAR Loss: Reducing Semantic Ambiguity in Facial Landmark Detection
CVPR 2023
FSAR: Federated Skeleton-based Action Recognition with Adaptive Topology Structure and Knowledge Distillation
ICCV 2023
Learning Concordant Attention via Target-aware Alignment for Visible-Infrared Person Re-identification
ICCV 2023
Co-Evolution of Pose and Mesh for 3D Human Body Estimation from Video
ICCV 2023
Improving Adversarial Robustness via Information Bottleneck Distillation
NIPS 2023
Exploring Energy-based Language Models with Different Architectures and Training Methods for Speech Recognition
INTERSPEECH 2023
M3AE: Multimodal Representation Learning for Brain Tumor Segmentation with Missing Modalities
AAAI 2023
Uniform Sequence Better: Time Interval Aware Data Augmentation for Sequential Recommendation
AAAI 2023
Multi-Modal Perception Attention Network with Self-Supervised Learning for Audio-Visual Speaker Tracking
AAAI 2022
Contrastive Learning from Extremely Augmented Skeleton Sequences for Self-Supervised Action Recognition
AAAI 2022
Pose-Guided Feature Disentangling for Occluded Person Re-identification Based on Transformer
AAAI 2022
Hierarchical Representation-based Dynamic Reasoning Network for Biomedical Question Answering
COLING 2022
MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation
CVPR 2022
Black-Box Dissector: Towards Erasing-Based Hard-Label Model Stealing Attack
ECCV 2022
An Information Theoretic Approach for Attention-Driven Face Forgery Detection
ECCV 2022
Explainable Question Answering based on Semantic Graph by Global Differentiable Learning and Dynamic Adaptive Reasoning
EMNLP 2022
Information Extraction and Human-Robot Dialogue towards Real-life Tasks: A Baseline Study with the MobileCS Dataset
EMNLP 2022
A Generative User Simulator with GPT-based Architecture and Goal State Tracking for Reinforced Multi-Domain Dialog Systems
EMNLP 2022
Self-supervised Learning is More Robust to Dataset Imbalance
ICLR 2022
Dynamic Multistep Reasoning based on Video Scene Graph for Video Question Answering
NAACL 2022
Multi-Scale Spatial Temporal Graph Convolutional Network for Skeleton-Based Action Recognition
AAAI 2021
Domain General Face Forgery Detection by Learning to Weight
AAAI 2021
Modality-aware Style Adaptation for RGB-Infrared Person Re-Identification
IJCAI 2021
Adversarial Feature Disentanglement for Long-Term Person Re-identification
IJCAI 2021
Towards Robustness Against Natural Language Word Substitutions
ICLR 2021
Learning to Attack Real-World Models for Person Re-identification via Virtual-Guided Meta-Learning
AAAI 2021
Cycle Self-Training for Domain Adaptation
NIPS 2021
Learning to Adapt to Evolving Domains
NIPS 2020
Anti-Bandit Neural Architecture Search for Model Defense
ECCV 2020
Projection & Probability-Driven Black-Box Attack
CVPR 2020
Spatial Pyramid Based Graph Reasoning for Semantic Segmentation
CVPR 2020
API-Net: Robust Generative Classifier via a Single Discriminator
ECCV 2020
Unsupervised Monocular Visual-inertial Odometry Network
IJCAI 2020
Lip Graph Assisted Audio-Visual Speech Recognition Using Bidirectional Synchronous Fusion
INTERSPEECH 2020
Transferable Adversarial Training: A General Approach to Adapting Deep Classifiers
ICML 2019
Expectation-Maximization Attention Networks for Semantic Segmentation
ICCV 2019
Universal Perturbation Attack Against Image Retrieval
ICCV 2019
Universal Adversarial Perturbation via Prior Driven Uncertainty Approximation
ICCV 2019
Towards Visual Feature Translation
CVPR 2019
Separate to Adapt: Open Set Domain Adaptation via Progressive Separation
CVPR 2019
Learning Neural Bag-of-Matrix-Summarization with Riemannian Network
AAAI 2019
Towards Optimal Discrete Online Hashing with Balanced Similarity
AAAI 2019
Unified Embedding Alignment with Missing Views Inferring for Incomplete Multi-View Clustering
AAAI 2019
Multiple Concurrent Sound Source Tracking Based on Observation-Guided Adaptive Particle Filter
INTERSPEECH 2018
Recurrent Squeeze-and-Excitation Context Aggregation Net for Single Image Deraining
ECCV 2018
Structured Attention Guided Convolutional Neural Fields for Monocular Depth Estimation
CVPR 2018
Cross-Modality Binary Code Learning via Fusion Similarity Hashing
CVPR 2017
Multiple Sound Source Counting and Localization Based on Spatial Principal Eigenvector
INTERSPEECH 2017
3D Action Recognition Using Multi-Temporal Depth Motion Maps and Fisher Vector
IJCAI 2016
A Novel Feature Matching Strategy for Large Scale Image Retrieval
IJCAI 2016
Supervised Matrix Factorization for Cross-Modality Hashing
IJCAI 2016
Multi-Channel Linear Prediction Based on Binaural Coherence for Speech Dereverberation
INTERSPEECH 2016