Hehe Fan

41 papers · 2017–2026 · 10 top CS/AI conferences

Achievements

🐝 Cross-Pollinator (13) 🌍 Conference Polyglot (9) 🌉 Interdisciplinary Bridge 🏃 Academic Marathon (8) 🌈 Renaissance Researcher (8)

🐣 Hot Topic Early Bird 🤝 Dynamic Duo (25) 🏆 Grand Slam 🧬 Topic Evolution ⚡ Prolific Year (8) 💎 Century Club (39) 🚀 Conference Pioneer 🗃️ Keyword Collector (165) 🔥 Unstoppable (9)

Conferences

AAAI (9) CVPR (8) ICCV (8) ICLR (4) ECCV (3) ICML (3) IJCAI (3) ACL (1) EMNLP (1) NIPS (1)

Top co-authors

Yi Yang (27) Mohan Kankanhalli (8) Zhiliang Wu (8) Kun Li (7) Fan Ma (6) Linchao Zhu (4) Yu Cheng (3) Yixiao Zhou (3) Zhenglin Zhou (3) Wenjin Hou (2)

Keywords

action recognition (5) point cloud (4) diffusion model (4) temporal consistency (3) representation learning (3) self-supervised learning (3) zero-shot learning (3) large language model (3) video understanding (3) video inpainting (3) reinforcement learning (2) contrastive learning (2) vision transformer (2) temporal modeling (2) transformer architecture (2) motion generation (2) deep learning (2) image restoration (2) domain adaptation (2) 3d vision (2)

Papers

DLVINet: Advancing Dual-Lens Video Inpainting Beyond Parallax Constraints AAAI 2026
One Refiner to Unlock Them All: Inference-Time Reasoning Elicitation via Reinforcement Query Refinement ACL 2026
DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization ICML 2025
ZeroMamba: Exploring Visual State Space Model for Zero-Shot Learning AAAI 2025
Prototypical Calibrating Ambiguous Samples for Micro-Action Recognition AAAI 2025
Adapting Text-to-Image Generation with Feature Difference Instruction for Generic Image Restoration CVPR 2025
Zero-1-to-A: Zero-Shot One Image to Animatable Head Avatars Using Video Diffusion CVPR 2025
EnergyMoGen: Compositional Human Motion Generation with Energy-Based Diffusion Model in Latent Space CVPR 2025
Dropping Experts, Recombining Neurons: Retraining-Free Pruning for Sparse Mixture-of-Experts LLMs EMNLP 2025
InfiniDreamer: Arbitrarily Long Human Motion Generation via Segment Score Distillation ICCV 2025
BVINet: Unlocking Blind Video Inpainting with Zero Annotations ICCV 2025
MMAD: Multi-label Micro-Action Detection in Videos ICCV 2025
OSDA Agent: Leveraging Large Language Models for De Novo Design of Organic Structure Directing Agents ICLR 2025
VideoGrain: Modulating Space-Time Attention for Multi-Grained Video Editing ICLR 2025
Reaction Graph: Towards Reaction-Level Modeling for Chemical Reactions with 3D Structures ICML 2025
Prompt-Aware Controllable Shadow Removal IJCAI 2025
Drafting and Revision: Advancing High-Fidelity Video Inpainting IJCAI 2025
Uncovering What Why and How: A Comprehensive Benchmark for Causation Understanding of Video Anomaly CVPR 2024
Improving Context Understanding in Multimodal Large Language Models via Multimodal Composition Learning ICML 2024
Clustering for Protein Representation Learning CVPR 2024
VividDreamer: Invariant Score Distillation for Hyper-Realistic Text-to-3D Generation ECCV 2024
Hand-Centric Motion Refinement for 3D Hand-Object Interaction via Hierarchical Spatial-Temporal Modeling AAAI 2024
DocMSU: A Comprehensive Benchmark for Document-Level Multimodal Sarcasm Understanding AAAI 2024
TOPA: Extending Large Language Models for Video Understanding via Text-Only Pre-Alignment NIPS 2024
HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting ECCV 2024
PointListNet: Deep Learning on 3D Point Lists CVPR 2023
Continuous-Discrete Convolution for Geometry-Sequence Modeling in Proteins ICLR 2023
STPrivacy: Spatio-Temporal Privacy-Preserving Action Recognition ICCV 2023
Masked Spatio-Temporal Structure Prediction for Self-supervised Learning on Point Cloud Videos ICCV 2023
Point Contrastive Prediction with Semantic Clustering for Self-Supervised Learning on Point Cloud Videos ICCV 2023
SEFormer: Structure Embedding Transformer for 3D Object Detection AAAI 2023
Text to Point Cloud Localization with Relation-Enhanced Transformer AAAI 2023
Point Cloud Domain Adaptation via Masked Local 3D Structure Prediction ECCV 2022
Self-Supervised Global-Local Structure Modeling for Point Cloud Domain Adaptation With Reliable Voted Pseudo Labels CVPR 2022
Point 4D Transformer Networks for Spatio-Temporal Modeling in Point Cloud Videos CVPR 2021
PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences ICLR 2021
Person Tube Retrieval via Language Description AAAI 2020
Cubic LSTMs for Video Prediction AAAI 2019
Attract or Distract: Exploit the Margin of Open Set ICCV 2019
Watching a Small Portion could be as Good as Watching All: Towards Efficient Video Classification IJCAI 2018
Complex Event Detection by Identifying Reliable Shots From Untrimmed Videos ICCV 2017