Papers
AttTrack: Online Deep Attention Transfer for Multi-Object Tracking
Keivan Nalaie, Rong Zheng
AudioViewer: Learning To Visualize Sounds
Chunjin Song, Yuchi Zhang, Willis Peng et al.
Audio-Visual Efficient Conformer for Robust Speech Recognition
Maxime Burchi, Radu Timofte
Audio-Visual Face Reenactment
Madhav Agarwal, Rudrabha Mukhopadhyay, Vinay P. Namboodiri et al.
Augmentation by Counterfactual Explanation - Fixing an Overconfident Classifier
Sumedha Singla, Nihal Murali, Forough Arabshahi et al.
Autoencoder-Based Background Reconstruction and Foreground Segmentation With Background Noise Estimation
Bruno Sauvalle, Arnaud de La Fortelle
Automated Detection of Label Errors in Semantic Segmentation Datasets via Deep Learning and Uncertainty Quantification
Matthias Rottmann, Marco Reese
Automated Line Labelling: Dataset for Contour Detection and 3D Reconstruction
Hari Santhanam, Nehal Doiphode, Jianbo Shi
Automatically Annotating Indoor Images With CAD Models via RGB-D Scans
Stefan Ainetter, Sinisa Stekovic, Friedrich Fraundorfer et al.
Auxiliary Task-Guided CycleGAN for Black-Box Model Domain Adaptation
Michael Essich, Markus Rehmann, Cristóbal Curio
AVE-CLIP: AudioCLIP-Based Multi-Window Temporal Transformer for Audio Visual Event Localization
Tanvir Mahmud, Diana Marculescu
Backprop Induced Feature Weighting for Adversarial Domain Adaptation With Iterative Label Distribution Alignment
Thomas Westfechtel, Hao-Wei Yeh, Qier Meng et al.
Back to MLP: A Simple Baseline for Human Motion Prediction
Wen Guo, Yuming Du, Xi Shen et al.
Barlow Constrained Optimization for Visual Question Answering
Abhishek Jha, Badri Patro, Luc Van Gool et al.
Benchmarking Visual Localization for Autonomous Navigation
Lauri Suomela, Jussi Kalliola, Atakan Dag et al.
Bent & Broken Bicycles: Leveraging Synthetic Data for Damaged Object Re-Identification
Luca Piano, Filippo Gabriele Pratticò, Alessandro Sebastian Russo et al.
BEVSegFormer: Bird's Eye View Semantic Segmentation From Arbitrary Camera Rigs
Lang Peng, Zhirong Chen, Zhangjie Fu et al.
Beyond RGB: Scene-Property Synthesis With Neural Radiance Fields
Mingtong Zhang, Shuhong Zheng, Zhipeng Bao et al.
Bi-Directional Frame Interpolation for Unsupervised Video Anomaly Detection
Hanqiu Deng, Zhaoxiang Zhang, Shihao Zou et al.
BirdSoundsDenoising: Deep Visual Audio Denoising for Bird Sounds
Youshan Zhang, Jialu Li
Body Part-Based Representation Learning for Occluded Person Re-Identification
Vladimir Somers, Christophe De Vleeschouwer, Alexandre Alahi
Boosting Neural Video Codecs by Exploiting Hierarchical Redundancy
Reza Pourreza, Hoang Le, Amir Said et al.
Boosting Vision Transformers for Image Retrieval
Chull Hwan Song, Jooyoung Yoon, Shunghyun Choi et al.
Bootstrapping the Relationship Between Images and Their Clean and Noisy Labels
Brandon Smart, Gustavo Carneiro