Kunchang Li
22 papers · 2021–2025 · 8 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+8 more ↓ Show less ↑
πΊοΈ Taxonomy Completionist (33) π Renaissance Researcher (5) π Interdisciplinary Bridge π Conference Polyglot (8) π§ Keyword Pioneer
π§
Keyword Pioneer
π£
Hot Topic Early Bird
π
Grand Slam
π€
Dynamic Duo
(18)
β‘
Prolific Year
(6)
π
Century Club
(22)
π₯
Unstoppable
(5)
ποΈ
Keyword Collector
(68)
Conferences
CVPR (5)
ECCV (5)
ICLR (5)
ICCV (3)
AAAI (1)
ICML (1)
NIPS (1)
WACV (1)
Top co-authors
Keywords
zero-shot learning
(3)
multimodal learning
(2)
large language model
(2)
video understanding
(2)
few-shot learning
(2)
video generation
(2)
diffusion model
(2)
multi-agent system
(2)
knowledge distillation
(1)
video recognition
(1)
multi-task learning
(1)
self-supervised learning
(1)
style transfer
(1)
in-context learning
(1)
vision transformer
(1)
3d vision
(1)
point cloud
(1)
transfer learning
(1)
multi-modal learning
(1)
token selection
(1)
Papers
TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning
ICLR 2025
Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel
ICLR 2025
Make Your Training Flexible: Towards Deployment-Efficient Video Models
ICCV 2025
Muses: 3D-Controllable Image Generation via Multi-Modal Agent Collaboration
AAAI 2025
TimeStep Master: Asymmetrical Mixture of Timestep LoRA Experts for Versatile and Efficient Diffusion Models in Vision
ICML 2025
Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment
CVPR 2025
V-Stylist: Video Stylization via Collaboration and Reflection of MLLM Agents
CVPR 2025
TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration
NIPS 2024
MVBench: A Comprehensive Multi-modal Video Understanding Benchmark
CVPR 2024
Vlogger: Make Your Dream A Vlog
CVPR 2024
VideoMamba: State Space Model for Efficient Video Understanding
ECCV 2024
InternVideo2: Scaling Foundation Models for Multimodal Video Understanding
ECCV 2024
InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation
ICLR 2024
Unmasked Teacher: Towards Training-Efficient Video Foundation Models
ICCV 2023
UniFormerV2: Unlocking the Potential of Image ViTs for Video Understanding
ICCV 2023
MorphMLP: An Efficient MLP-Like Backbone for Spatial-Temporal Representation Learning
ECCV 2022
UniFormer: Unified Transformer for Efficient Spatial-Temporal Representation Learning
ICLR 2022
Pose-Guided Generative Adversarial Net for Novel View Action Synthesis
WACV 2022
Tip-Adapter: Training-Free Adaption of CLIP for Few-Shot Classification
ECCV 2022
Self-Slimmed Vision Transformer
ECCV 2022
PointCLIP: Point Cloud Understanding by CLIP
CVPR 2022
CT-Net: Channel Tensorization Network for Video Classification
ICLR 2021