Chung-Ching Lin

26 papers · 2015–2026 · 9 top CS/AI conferences

Achievements


🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (6) 🌍 Conference Polyglot (8) 🏃 Academic Marathon (11) 🗺️ Taxonomy Completionist (45)

🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🌍 Conference Polyglot (8) 🤝 Dynamic Duo (18) 🧬 Topic Evolution ⚡ Prolific Year (7) 💎 Century Club (25) 🔥 Unstoppable (7) 🗃️ Keyword Collector (105)

Conferences

CVPR (10) ICLR (4) ECCV (3) ICCV (2) NIPS (2) WACV (2) ACL (1) EMNLP (1) ICML (1)

Top co-authors

Lijuan Wang (19) Kevin Lin (18) Linjie Li (16) Zicheng Liu (13) Zhengyuan Yang (12) Jianfeng Wang (7) Yuanhao Zhai (4) Rogerio Feris (4) Xiaofei Wang (3) Kate Saenko (3)

Keywords

multimodal learning (3) zero-shot learning (3) gaussian process (2) feature matching (2) video generation (2) video captioning (2) diffusion model (2) video understanding (2) large language model (2) transfer learning (2) generative model (2) vision-language model (2) in-context learning (1) few-shot learning (1) similarity learning (1) adversarial learning (1) pose estimation (1) cross-modal learning (1) metric learning (1) image generation (1)

Papers

Shanks: Simultaneous Hearing and Thinking for Spoken Language Models · ACL 2026
Zero-Shot Audio-Visual Editing via Cross-Modal Delta Denoising · WACV 2026
Scaling Inference-Time Search with Vision Value Model for Improved Visual Comprehension · ICCV 2025
Audio-Aware Large Language Models as Judges for Speaking Styles · EMNLP 2025
SlowFast-VGen: Slow-Fast Learning for Action-Driven Long Video Generation · ICLR 2025
GenXD: Generating Any 3D and 4D Scenes · ICLR 2025
IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation · ECCV 2024
Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation · NIPS 2024
MM-Narrator: Narrating Long-form Videos with Multimodal In-Context Learning · CVPR 2024
DisCo: Disentangled Control for Realistic Human Dance Generation · CVPR 2024
Idea2Img: Iterative Self-Refinement with GPT-4V for Automatic Image Design and Generation · ECCV 2024
Completing Visual Objects via Bridging Generation and Segmentation · ICML 2024
MPT: Mesh Pre-Training With Transformers for Human Pose and Mesh Reconstruction · WACV 2024
Equivariant Similarity for Vision-Language Foundation Models · ICCV 2023
LAVENDER: Unifying Video-Language Understanding As Masked Language Modeling · CVPR 2023
Adaptive Human Matting for Dynamic Videos · CVPR 2023
Neural Voting Field for Camera-Space 3D Hand Pose Estimation · CVPR 2023
PaintSeg: Painting Pixels for Training-free Segmentation · NIPS 2023
Cross-Modal Representation Learning for Zero-Shot Action Recognition · CVPR 2022
SwinBERT: End-to-End Transformers With Sparse Attention for Video Captioning · CVPR 2022
AdaFuse: Adaptive Temporal Fusion Network for Efficient Action Recognition · ICLR 2021
VA-RED$^2$: Video Adaptive Redundancy Reduction · ICLR 2021
Video Instance Segmentation Tracking With a Modified VAE Architecture · CVPR 2020
AR-Net: Adaptive Frame Resolution for Efficient Action Recognition · ECCV 2020
A Prior-Less Method for Multi-Face Tracking in Unconstrained Videos · CVPR 2018
Adaptive As-Natural-As-Possible Image Stitching · CVPR 2015