Papers
Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval?
Wenhao Wu, Haipeng Luo, Bo Fang et al.
CapDet: Unifying Dense Captioning and Open-World Detection Pretraining
Yanxin Long, Youpeng Wen, Jianhua Han et al.
CAPE: Camera View Position Embedding for Multi-View 3D Object Detection
Kaixin Xiong, Shi Gong, Xiaoqing Ye et al.
CaPriDe Learning: Confidential and Private Decentralized Learning Based on Encryption-Friendly Distillation Loss
Nurbek Tastan, Karthik Nandakumar
CAP: Robust Point Cloud Classification via Semantic and Structural Modeling
Daizong Ding, Erling Jiang, Yuanmin Huang et al.
CAP-VSTNet: Content Affinity Preserved Versatile Style Transfer
Linfeng Wen, Chengying Gao, Changqing Zou
CARTO: Category and Joint Agnostic Reconstruction of ARTiculated Objects
Nick Heppert, Muhammad Zubair Irshad, Sergey Zakharov et al.
Cascaded Local Implicit Transformer for Arbitrary-Scale Super-Resolution
Hao-Wei Chen, Yu-Syuan Xu, Min-Fong Hong et al.
Cascade Evidential Learning for Open-World Weakly-Supervised Temporal Action Localization
Mengyuan Chen, Junyu Gao, Changsheng Xu
CASP-Net: Rethinking Video Saliency Prediction From an Audio-Visual Consistency Perceptual Perspective
Junwen Xiong, Ganglai Wang, Peng Zhang et al.
Castling-ViT: Compressing Self-Attention via Switching Towards Linear-Angular Attention at Vision Transformer Inference
Haoran You, Yunyang Xiong, Xiaoliang Dai et al.
Catch Missing Details: Image Reconstruction With Frequency Augmented Variational Autoencoder
Xinmiao Lin, Yikang Li, Jenhao Hsiao et al.
Category Query Learning for Human-Object Interaction Classification
Chi Xie, Fangao Zeng, Yue Hu et al.
CAT: LoCalization and IdentificAtion Cascade Detection Transformer for Open-World Object Detection
Shuailei Ma, Yuefeng Wang, Ying Wei et al.
Causally-Aware Intraoperative Imputation for Overall Survival Time Prediction
Xiang Li, Xuelin Qian, Litian Liang et al.
CCuantuMM: Cycle-Consistent Quantum-Hybrid Matching of Multiple Shapes
Harshil Bhatia, Edith Tretschk, Zorah Lähner et al.
CDDFuse: Correlation-Driven Dual-Branch Feature Decomposition for Multi-Modality Image Fusion
Zixiang Zhao, Haowen Bai, Jiangshe Zhang et al.
CelebV-Text: A Large-Scale Facial Text-Video Dataset
Jianhui Yu, Hao Zhu, Liming Jiang et al.
Center Focusing Network for Real-Time LiDAR Panoptic Segmentation
Xiaoyan Li, Gang Zhang, Boyue Wang et al.
CFA: Class-Wise Calibrated Fair Adversarial Training
Zeming Wei, Yifei Wang, Yiwen Guo et al.
CF-Font: Content Fusion for Few-Shot Font Generation
Chi Wang, Min Zhou, Tiezheng Ge et al.
Change-Aware Sampling and Contrastive Learning for Satellite Images
Utkarsh Mall, Bharath Hariharan, Kavita Bala
Chat2Map: Efficient Scene Mapping From Multi-Ego Conversations
Sagnik Majumder, Hao Jiang, Pierre Moulon et al.
CHMATCH: Contrastive Hierarchical Matching and Robust Adaptive Threshold Boosted Semi-Supervised Learning
Jianlong Wu, Haozhe Yang, Tian Gan et al.
CiaoSR: Continuous Implicit Attention-in-Attention Network for Arbitrary-Scale Image Super-Resolution
Jiezhang Cao, Qin Wang, Yongqin Xian et al.