Papers
CLIP2Protect: Protecting Facial Privacy Using Text-Guided Makeup via Adversarial Latent Search
Fahad Shamshad, Muzammal Naseer, Karthik Nandakumar
CLIP2Scene: Towards Label-Efficient 3D Scene Understanding by CLIP
Runnan Chen, Youquan Liu, Lingdong Kong et al.
CLIP for All Things Zero-Shot Sketch-Based Image Retrieval, Fine-Grained or Not
Aneeshan Sain, Ayan Kumar Bhunia, Pinaki Nath Chowdhury et al.
CLIP Is Also an Efficient Segmenter: A Text-Driven Approach for Weakly Supervised Semantic Segmentation
Yuqi Lin, Minghao Chen, Wenxiao Wang et al.
CLIPPING: Distilling CLIP-Based Models With a Student Base for Video-Language Retrieval
Renjing Pei, Jianzhuang Liu, Weimian Li et al.
CLIPPO: Image-and-Language Understanding From Pixels Only
Michael Tschannen, Basil Mustafa, Neil Houlsby
CLIP-S4: Language-Guided Self-Supervised Semantic Segmentation
Wenbin He, Suphanut Jamonnak, Liang Gou et al.
CLIP-Sculptor: Zero-Shot Generation of High-Fidelity and Diverse Shapes From Natural Language
Aditya Sanghi, Rao Fu, Vivian Liu et al.
CLIP the Gap: A Single Domain Generalization Approach for Object Detection
Vidit Vidit, Martin Engilberge, Mathieu Salzmann
CloSET: Modeling Clothed Humans on Continuous Surface With Explicit Template Decomposition
Hongwen Zhang, Siyou Lin, Ruizhi Shao et al.
CLOTH4D: A Dataset for Clothed Human Reconstruction
Xingxing Zou, Xintong Han, Waikeung Wong
Clothed Human Performance Capture With a Double-Layer Neural Radiance Fields
Kangkan Wang, Guofeng Zhang, Suxu Cong et al.
Cloud-Device Collaborative Adaptation to Continual Changing Environments in the Real-World
Yulu Gan, Mingjie Pan, Rongyu Zhang et al.
Clover: Towards a Unified Video-Language Alignment and Fusion Model
Jingjia Huang, Yinan Li, Jiashi Feng et al.
CNVid-3.5M: Build, Filter, and Pre-Train the Large-Scale Public Chinese Video-Text Dataset
Tian Gan, Qing Wang, Xingning Dong et al.
Coaching a Teachable Student
Jimuyang Zhang, Zanming Huang, Eshed Ohn-Bar
CODA-Prompt: COntinual Decomposed Attention-Based Prompting for Rehearsal-Free Continual Learning
James Seale Smith, Leonid Karlinsky, Vyshnavi Gutta et al.
CodeTalker: Speech-Driven 3D Facial Animation With Discrete Motion Prior
Jinbo Xing, Menghan Xia, Yuechen Zhang et al.
Collaboration Helps Camera Overtake LiDAR in 3D Detection
Yue Hu, Yifan Lu, Runsheng Xu et al.
Collaborative Diffusion for Multi-Modal Face Generation and Editing
Ziqi Huang, Kelvin C.K. Chan, Yuming Jiang et al.
Collaborative Noisy Label Cleaner: Learning Scene-Aware Trailers for Multi-Modal Highlight Detection in Movies
Bei Gan, Xiujun Shu, Ruizhi Qiao et al.
Collaborative Static and Dynamic Vision-Language Streams for Spatio-Temporal Video Grounding
Zihang Lin, Chaolei Tan, Jian-Fang Hu et al.
Collecting Cross-Modal Presence-Absence Evidence for Weakly-Supervised Audio-Visual Event Perception
Junyu Gao, Mengyuan Chen, Changsheng Xu
Color Backdoor: A Robust Poisoning Attack in Color Space
Wenbo Jiang, Hongwei Li, Guowen Xu et al.
Combining Implicit-Explicit View Correlation for Light Field Semantic Segmentation
Ruixuan Cong, Da Yang, Rongshan Chen et al.