Papers
BoxSnake: Polygonal Instance Segmentation with Box Supervision
Rui Yang, Lin Song, Yixiao Ge et al.
Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images
Nitzan Bitton-Guetta, Yonatan Bitton, Jack Hessel et al.
Breaking Temporal Consistency: Generating Video Universal Adversarial Perturbations Using Image Models
Hee-Seon Kim, Minji Son, Minbeom Kim et al.
Breaking The Limits of Text-conditioned 3D Motion Synthesis with Elaborative Descriptions
Yijun Qian, Jack Urbanek, Alexander G. Hauptmann et al.
Bridging Cross-task Protocol Inconsistency for Distillation in Dense Object Detection
Longrong Yang, Xianpan Zhou, Xuewei Li et al.
Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation
Zunnan Xu, Zhihong Chen, Yong Zhang et al.
Bring Clipart to Life
Nanxuan Zhao, Shengqi Dang, Hexun Lin et al.
BT^2: Backward-compatible Training with Basis Transformation
Yifei Zhou, Zilu Li, Abhinav Shrivastava et al.
Building3D: A Urban-Scale Dataset and Benchmarks for Learning Roof Structures from Point Clouds
Ruisheng Wang, Shangfeng Huang, Hongxin Yang
Building a Winning Team: Selecting Source Model Ensembles using a Submodular Transferability Estimation Approach
Vimal K B, Saketh Bachu, Tanmay Garg et al.
Building Bridge Across the Time: Disruption and Restoration of Murals In the Wild
Huiyang Shao, Qianqian Xu, Peisong Wen et al.
Building Vision Transformers with Hierarchy Aware Feature Aggregation
Yongjie Chen, Hongmin Liu, Haoran Yin et al.
BUS: Efficient and Effective Vision-Language Pre-Training with Bottom-Up Patch Summarization.
Chaoya Jiang, Haiyang Xu, Wei Ye et al.
C2F2NeUS: Cascade Cost Frustum Fusion for High Fidelity and Generalizable Neural Surface Reconstruction
Luoyuan Xu, Tao Guan, Yuesong Wang et al.
C2ST: Cross-Modal Contextualized Sequence Transduction for Continuous Sign Language Recognition
Huaiwen Zhang, Zihang Guo, Yang Yang et al.
CAD-Estate: Large-scale CAD Model Annotation in RGB Videos
Kevis-Kokitsi Maninis, Stefan Popov, Matthias Nießner et al.
CAFA: Class-Aware Feature Alignment for Test-Time Adaptation
Sanghun Jung, Jungsoo Lee, Nanhee Kim et al.
Calibrating Panoramic Depth Estimation for Practical Localization and Mapping
Junho Kim, Eun Sun Lee, Young Min Kim
Calibrating Uncertainty for Semi-Supervised Crowd Counting
Chen LI, Xiaoling Hu, Shahira Abousamra et al.
CAME: Contrastive Automated Model Evaluation
Ru Peng, Qiuyang Duan, Haobo Wang et al.
Camera-Driven Representation Learning for Unsupervised Domain Adaptive Person Re-identification
Geon Lee, Sanghoon Lee, Dohyung Kim et al.
CancerUniT: Towards a Single Unified Model for Effective Detection, Segmentation, and Diagnosis of Eight Major Cancers Using a Large Collection of CT Scans
Jieneng Chen, Yingda Xia, Jiawen Yao et al.
Candidate-aware Selective Disambiguation Based On Normalized Entropy for Instance-dependent Partial-label Learning
Shuo He, Guowu Yang, Lei Feng
Can Language Models Learn to Listen?
Evonne Ng, Sanjay Subramanian, Dan Klein et al.
Canonical Factors for Hybrid Neural Fields
Brent Yi, Weijia Zeng, Sam Buchanan et al.