Chung-Ching Lin
26 papers · 2015–2026 · 9 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+9 more ↓ Show less ↑
π Interdisciplinary Bridge π Renaissance Researcher (6) π Conference Polyglot (8) π Academic Marathon (11) πΊοΈ Taxonomy Completionist (45)
π§
Keyword Pioneer
π£
Hot Topic Early Bird
π
Conference Polyglot
(8)
π€
Dynamic Duo
(18)
π§¬
Topic Evolution
β‘
Prolific Year
(7)
π
Century Club
(25)
π₯
Unstoppable
(7)
ποΈ
Keyword Collector
(105)
Conferences
CVPR (10)
ICLR (4)
ECCV (3)
ICCV (2)
NIPS (2)
WACV (2)
ACL (1)
EMNLP (1)
ICML (1)
Top co-authors
Keywords
multimodal learning
(3)
zero-shot learning
(3)
gaussian process
(2)
feature matching
(2)
video generation
(2)
video captioning
(2)
diffusion model
(2)
video understanding
(2)
large language model
(2)
transfer learning
(2)
generative model
(2)
vision-language model
(2)
in-context learning
(1)
few-shot learning
(1)
similarity learning
(1)
adversarial learning
(1)
pose estimation
(1)
cross-modal learning
(1)
metric learning
(1)
image generation
(1)
Papers
Shanks: Simultaneous Hearing and Thinking for Spoken Language Models
ACL 2026
Zero-Shot Audio-Visual Editing via Cross-Modal Delta Denoising
WACV 2026
Scaling Inference-Time Search with Vision Value Model for Improved Visual Comprehension
ICCV 2025
Audio-Aware Large Language Models as Judges for Speaking Styles
EMNLP 2025
SlowFast-VGen: Slow-Fast Learning for Action-Driven Long Video Generation
ICLR 2025
GenXD: Generating Any 3D and 4D Scenes
ICLR 2025
IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation
ECCV 2024
Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation
NIPS 2024
MM-Narrator: Narrating Long-form Videos with Multimodal In-Context Learning
CVPR 2024
DisCo: Disentangled Control for Realistic Human Dance Generation
CVPR 2024
Idea2Img: Iterative Self-Refinement with GPT-4V for Automatic Image Design and Generation
ECCV 2024
Completing Visual Objects via Bridging Generation and Segmentation
ICML 2024
MPT: Mesh Pre-Training With Transformers for Human Pose and Mesh Reconstruction
WACV 2024
Equivariant Similarity for Vision-Language Foundation Models
ICCV 2023
LAVENDER: Unifying Video-Language Understanding As Masked Language Modeling
CVPR 2023
Adaptive Human Matting for Dynamic Videos
CVPR 2023
Neural Voting Field for Camera-Space 3D Hand Pose Estimation
CVPR 2023
PaintSeg: Painting Pixels for Training-free Segmentation
NIPS 2023
Cross-Modal Representation Learning for Zero-Shot Action Recognition
CVPR 2022
SwinBERT: End-to-End Transformers With Sparse Attention for Video Captioning
CVPR 2022
AdaFuse: Adaptive Temporal Fusion Network for Efficient Action Recognition
ICLR 2021
VA-RED$^2$: Video Adaptive Redundancy Reduction
ICLR 2021
Video Instance Segmentation Tracking With a Modified VAE Architecture
CVPR 2020
AR-Net: Adaptive Frame Resolution for Efficient Action Recognition
ECCV 2020
A Prior-Less Method for Multi-Face Tracking in Unconstrained Videos
CVPR 2018
Adaptive As-Natural-As-Possible Image Stitching
CVPR 2015