Hehe Fan
41 papers · 2017–2026 · 10 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+11 more ↓ Show less ↑
🐝 Cross-Pollinator (13) 🌍 Conference Polyglot (9) 🌉 Interdisciplinary Bridge 🏃 Academic Marathon (8) 🌈 Renaissance Researcher (8)
🐣
Hot Topic Early Bird
🌍
Conference Polyglot
(9)
🏃
Academic Marathon
(8)
🤝
Dynamic Duo
(25)
🏆
Grand Slam
🧬
Topic Evolution
⚡
Prolific Year
(8)
💎
Century Club
(39)
🚀
Conference Pioneer
🗃️
Keyword Collector
(165)
🔥
Unstoppable
(9)
Conferences
AAAI (9)
CVPR (8)
ICCV (8)
ICLR (4)
ECCV (3)
ICML (3)
IJCAI (3)
ACL (1)
EMNLP (1)
NIPS (1)
Top co-authors
Keywords
action recognition
(5)
point cloud
(4)
diffusion model
(4)
temporal consistency
(3)
representation learning
(3)
self-supervised learning
(3)
zero-shot learning
(3)
large language model
(3)
video understanding
(3)
video inpainting
(3)
reinforcement learning
(2)
contrastive learning
(2)
vision transformer
(2)
temporal modeling
(2)
transformer architecture
(2)
motion generation
(2)
deep learning
(2)
image restoration
(2)
domain adaptation
(2)
3d vision
(2)
Papers
DLVINet: Advancing Dual-Lens Video Inpainting Beyond Parallax Constraints
AAAI 2026
One Refiner to Unlock Them All: Inference-Time Reasoning Elicitation via Reinforcement Query Refinement
ACL 2026
DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization
ICML 2025
ZeroMamba: Exploring Visual State Space Model for Zero-Shot Learning
AAAI 2025
Prototypical Calibrating Ambiguous Samples for Micro-Action Recognition
AAAI 2025
Adapting Text-to-Image Generation with Feature Difference Instruction for Generic Image Restoration
CVPR 2025
Zero-1-to-A: Zero-Shot One Image to Animatable Head Avatars Using Video Diffusion
CVPR 2025
EnergyMoGen: Compositional Human Motion Generation with Energy-Based Diffusion Model in Latent Space
CVPR 2025
Dropping Experts, Recombining Neurons: Retraining-Free Pruning for Sparse Mixture-of-Experts LLMs
EMNLP 2025
InfiniDreamer: Arbitrarily Long Human Motion Generation via Segment Score Distillation
ICCV 2025
BVINet: Unlocking Blind Video Inpainting with Zero Annotations
ICCV 2025
MMAD: Multi-label Micro-Action Detection in Videos
ICCV 2025
OSDA Agent: Leveraging Large Language Models for De Novo Design of Organic Structure Directing Agents
ICLR 2025
VideoGrain: Modulating Space-Time Attention for Multi-Grained Video Editing
ICLR 2025
Reaction Graph: Towards Reaction-Level Modeling for Chemical Reactions with 3D Structures
ICML 2025
Prompt-Aware Controllable Shadow Removal
IJCAI 2025
Drafting and Revision: Advancing High-Fidelity Video Inpainting
IJCAI 2025
Uncovering What Why and How: A Comprehensive Benchmark for Causation Understanding of Video Anomaly
CVPR 2024
Improving Context Understanding in Multimodal Large Language Models via Multimodal Composition Learning
ICML 2024
Clustering for Protein Representation Learning
CVPR 2024
VividDreamer: Invariant Score Distillation for Hyper-Realistic Text-to-3D Generation
ECCV 2024
Hand-Centric Motion Refinement for 3D Hand-Object Interaction via Hierarchical Spatial-Temporal Modeling
AAAI 2024
DocMSU: A Comprehensive Benchmark for Document-Level Multimodal Sarcasm Understanding
AAAI 2024
TOPA: Extending Large Language Models for Video Understanding via Text-Only Pre-Alignment
NIPS 2024
HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting
ECCV 2024
PointListNet: Deep Learning on 3D Point Lists
CVPR 2023
Continuous-Discrete Convolution for Geometry-Sequence Modeling in Proteins
ICLR 2023
STPrivacy: Spatio-Temporal Privacy-Preserving Action Recognition
ICCV 2023
Masked Spatio-Temporal Structure Prediction for Self-supervised Learning on Point Cloud Videos
ICCV 2023
Point Contrastive Prediction with Semantic Clustering for Self-Supervised Learning on Point Cloud Videos
ICCV 2023
SEFormer: Structure Embedding Transformer for 3D Object Detection
AAAI 2023
Text to Point Cloud Localization with Relation-Enhanced Transformer
AAAI 2023
Point Cloud Domain Adaptation via Masked Local 3D Structure Prediction
ECCV 2022
Self-Supervised Global-Local Structure Modeling for Point Cloud Domain Adaptation With Reliable Voted Pseudo Labels
CVPR 2022
Point 4D Transformer Networks for Spatio-Temporal Modeling in Point Cloud Videos
CVPR 2021
PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences
ICLR 2021
Person Tube Retrieval via Language Description
AAAI 2020
Cubic LSTMs for Video Prediction
AAAI 2019
Attract or Distract: Exploit the Margin of Open Set
ICCV 2019
Watching a Small Portion could be as Good as Watching All: Towards Efficient Video Classification
IJCAI 2018
Complex Event Detection by Identifying Reliable Shots From Untrimmed Videos
ICCV 2017