Papers
4,428 papers found
Boosting Medical Vision-Language Pretraining via Momentum Self-Distillation under Limited Computing Resources
Phuc Pham, Nhu Pham, Ngoc Quoc Ly
Boosting Unsupervised Video Instance Segmentation with Automatic Quality-Guided Self-Training
Kaixuan Lu, Mehmet Onurcan Kaya, Dim P. Papadopoulos
BOP-Distrib: Revisiting 6D Pose Estimation Benchmarks for Better Evaluation under Visual Ambiguities
Boris Meden, Asma Brazi, Fabrice Mayran de Chamisso et al.
BoxSplitGen: A Generative Model for 3D Part Bounding Boxes in Varying Granularity
Juil Koo, Wei-Tung Lin, Chanho Park et al.
BrandFusion: Aligning Image Generation with Brand Styles
Parul Gupta, Varun Khurana, Yaman Kumar Singla et al.
brat: Aligned Multi-View Embeddings for Brain MRI Analysis
Maxime Kayser, Maksim Gridnev, Wanting Wang et al.
BREEN: Bridge Data-Efficient Encoder-Free Multimodal Learning with Learnable Queries
Tianle Li, Yongming Rao, Winston Hu et al.
Bridging the Domain Gap in Small Multimodal Models: A Dual-level Alignment Perspective
Aveen Dayal, Peketi Divya, Nidhi Tiwari et al.
BrightRate: Quality Assessment for User-Generated HDR Videos
Shreshth Saini, Bowen Chen, Yilin Wang et al.
Broadcast2Pitch: Game State Reconstruction from Unconstrained Soccer Videos
Yin May Oo, Yewon Hwang, Muhammad Amrulloh Robbani et al.
CAAC: Confidence-Aware Attention Calibration to Reduce Hallucinations in Large Vision-Language Models
Mehrdad Fazli, Bowen Wei, Ahmet Sari et al.
CADE: Continual Weakly-supervised Video Anomaly Detection with Ensembles
Satoshi Hashimoto, Tatsuya Konishi, Tomoya Kaichi et al.
CaFlow: Enhancing Long-Term Action Quality Assessment with Causal Counterfactual Flow
Ruisheng Han, Kanglei Zhou, Shuang Chen et al.
CalibBEV: LiDAR-Camera Calibration via BEV Alignment
Filippo D'Addeo, Lorenzo Cipelli, Adriano Cardace et al.
CAMP-VQA: Caption-Embedded Multimodal Perception for No-Reference Quality Assessment of Compressed Video
Xinyi Wang, Angeliki Katsenou, Junxiao Shen et al.
Can Image Splicing and Copy-Move Forgery Be Detected by the Same Model? Forensim: An Attention-Based State-Space Approach
Soumyaroop Nandi, Prem Natarajan
CanKD: Cross-Attention-based Non-local Operation for Feature-based Knowledge Distillation
Shizhe Sun, Wataru Ohyama
Can We Challenge Open-Vocabulary Object Detectors with Generated Content in Street Scenes?
Annika Mütze, Sadia Ilyas, Christian Dörpelkus et al.
CAPE: A CLIP-Aware Pointing Ensemble of Complementary Heatmap Cues for Embodied Reference Understanding
Fevziye Irem Eyiokur, Dogucan Yaman, Hazım Kemal Ekenel et al.
CaRS: A Causal Intervention Segmentation Framework and Benchmark Dataset for Autonomous Driving under Transitional Weather Conditions
Kondapally Madhavi, K Naveen Kumar, C Krishna Mohan et al.
CAST: Evaluating Multi-Object Trackers with Context-Aware Switch and Transfer Scores
Jin Bai, Gregory D. Hager
CasTex: Cascaded Text-to-Texture Synthesis via Explicit Texture Maps and Physically-Based Shading
Mishan Aliev, Dmitry Baranchuk, Kirill Struminsky
Causality-Driven Audits of Model Robustness
Nathan Drenkow, William Paul, Chris Ribaudo et al.
Chain-of-Look Spatial Reasoning for Dense Surgical Instrument Counting
Rishikesh Bhyri, Brian R Quaranto, Junsong Yuan et al.
ChameleonTuner: Automatic ISP Color Tuning in Subjective Scenarios
Zijie Tan, Yuxin Yue, Bahador Rashidi