Kunchang Li

22 papers · 2021–2025 · 8 conferences · across top CS/AI conferences

Achievements

+8 more ↓

🗺️ Taxonomy Completionist (33) 🌈 Renaissance Researcher (5) 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (8) 🧭 Keyword Pioneer

🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🏆 Grand Slam 🤝 Dynamic Duo (18) ⚡ Prolific Year (6) 💎 Century Club (22) 🔥 Unstoppable (5) 🗃️ Keyword Collector (68)

Conferences

CVPR (5) ECCV (5) ICLR (5) ICCV (3) AAAI (1) ICML (1) NIPS (1) WACV (1)

Top co-authors

Yali Wang (18) Yu Qiao (18) Yi Wang (10) Limin Wang (10) Yinan He (7) Shaobin Zhuang (5) Xinhao Li (5) Yizhuo Li (4) Guo Chen (3) Zun Wang (3)

Keywords

zero-shot learning (3) multimodal learning (2) large language model (2) video understanding (2) few-shot learning (2) video generation (2) diffusion model (2) multi-agent system (2) knowledge distillation (1) video recognition (1) multi-task learning (1) self-supervised learning (1) style transfer (1) in-context learning (1) vision transformer (1) 3d vision (1) point cloud (1) transfer learning (1) multi-modal learning (1) token selection (1)

Papers

TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning ICLR 2025 Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel ICLR 2025 Make Your Training Flexible: Towards Deployment-Efficient Video Models ICCV 2025 Muses: 3D-Controllable Image Generation via Multi-Modal Agent Collaboration AAAI 2025 TimeStep Master: Asymmetrical Mixture of Timestep LoRA Experts for Versatile and Efficient Diffusion Models in Vision ICML 2025 Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment CVPR 2025 V-Stylist: Video Stylization via Collaboration and Reflection of MLLM Agents CVPR 2025 TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration NIPS 2024 MVBench: A Comprehensive Multi-modal Video Understanding Benchmark CVPR 2024 Vlogger: Make Your Dream A Vlog CVPR 2024 VideoMamba: State Space Model for Efficient Video Understanding ECCV 2024 InternVideo2: Scaling Foundation Models for Multimodal Video Understanding ECCV 2024 InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation ICLR 2024 Unmasked Teacher: Towards Training-Efficient Video Foundation Models ICCV 2023 UniFormerV2: Unlocking the Potential of Image ViTs for Video Understanding ICCV 2023 MorphMLP: An Efficient MLP-Like Backbone for Spatial-Temporal Representation Learning ECCV 2022 UniFormer: Unified Transformer for Efficient Spatial-Temporal Representation Learning ICLR 2022 Pose-Guided Generative Adversarial Net for Novel View Action Synthesis WACV 2022 Tip-Adapter: Training-Free Adaption of CLIP for Few-Shot Classification ECCV 2022 Self-Slimmed Vision Transformer ECCV 2022 PointCLIP: Point Cloud Understanding by CLIP CVPR 2022 CT-Net: Channel Tensorization Network for Video Classification ICLR 2021