Papers
The Benefit of Distraction: Denoising Camera-Based Physiological Measurements Using Inverse Attention
Ewa M. Nowara, Daniel McDuff, Ashok Veeraraghavan
The Center of Attention: Center-Keypoint Grouping via Attention for Multi-Person Pose Estimation
Guillem Brasó, Nikita Kister, Laura Leal-Taixé
The Devil Is in the Task: Exploiting Reciprocal Appearance-Localization Features for Monocular 3D Object Detection
Zhikang Zou, Xiaoqing Ye, Liang Du et al.
The Functional Correspondence Problem
Zihang Lai, Senthil Purushwalkam, Abhinav Gupta
The Many Faces of Robustness: A Critical Analysis of Out-of-Distribution Generalization
Dan Hendrycks, Steven Basart, Norman Mu et al.
The Power of Points for Modeling Humans in Clothing
Qianli Ma, Jinlong Yang, Siyu Tang et al.
The Pursuit of Knowledge: Discovering and Localizing Novel Categories Using Dual Memory
Sai Saketh Rambhatla, Rama Chellappa, Abhinav Shrivastava
The Right To Talk: An Audio-Visual Transformer Approach
Thanh-Dat Truong, Chi Nhan Duong, The De Vu et al.
The Road To Know-Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation
Yuankai Qi, Zizheng Pan, Yicong Hong et al.
The Spatio-Temporal Poisson Point Process: A Simple Model for the Alignment of Event Camera Data
Cheng Gu, Erik Learned-Miller, Daniel Sheldon et al.
The Surprising Effectiveness of Visual Odometry Techniques for Embodied PointGoal Navigation
Xiaoming Zhao, Harsh Agrawal, Dhruv Batra et al.
The Surprising Impact of Mask-Head Architecture on Novel Class Segmentation
Vighnesh Birodkar, Zhichao Lu, Siyang Li et al.
Three Steps to Multimodal Trajectory Prediction: Modality Clustering, Classification and Synthesis
Jianhua Sun, Yuxuan Li, Hao-Shu Fang et al.
THUNDR: Transformer-Based 3D Human Reconstruction With Markers
Mihai Zanfir, Andrei Zanfir, Eduard Gabriel Bazavan et al.
Time-Equivariant Contrastive Video Representation Learning
Simon Jenni, Hailin Jin
Time-Multiplexed Coded Aperture Imaging: Learned Coded Aperture and Pixel Exposures for Compressive Imaging Systems
Edwin Vargas, Julien N. P. Martel, Gordon Wetzstein et al.
TkML-AP: Adversarial Attacks to Top-k Multi-Label Learning
Shu Hu, Lipeng Ke, Xin Wang et al.
TMCOSS: Thresholded Multi-Criteria Online Subset Selection for Data-Efficient Autonomous Driving
Soumi Das, Harikrishna Patibandla, Suparna Bhattacharya et al.
T-Net: Effective Permutation-Equivariant Network for Two-View Correspondence Learning
Zhen Zhong, Guobao Xiao, Linxin Zheng et al.
TokenPose: Learning Keypoint Tokens for Human Pose Estimation
Yanjie Li, Shoukui Zhang, Zhicheng Wang et al.
Tokens-to-Token ViT: Training Vision Transformers From Scratch on ImageNet
Li Yuan, Yunpeng Chen, Tao Wang et al.
TOOD: Task-Aligned One-Stage Object Detection
Chengjian Feng, Yujie Zhong, Yu Gao et al.
Topic Scene Graph Generation by Attention Distillation From Caption
Wenbin Wang, Ruiping Wang, Xilin Chen
Topologically Consistent Multi-View Face Inference Using Volumetric Sampling
Tianye Li, Shichen Liu, Timo Bolkart et al.