Papers
A Light Weight Model for Active Speaker Detection
Junhua Liao, Haihan Duan, Kanghui Feng et al.
Align and Attend: Multimodal Summarization With Dual Contrastive Losses
Bo He, Jun Wang, Jielin Qiu et al.
AligNeRF: High-Fidelity Neural Radiance Fields via Alignment-Aware Training
Yifan Jiang, Peter Hedman, Ben Mildenhall et al.
Aligning Bag of Regions for Open-Vocabulary Object Detection
Size Wu, Wenwei Zhang, Sheng Jin et al.
Aligning Step-by-Step Instructional Diagrams to Video Demonstrations
Jiahao Zhang, Anoop Cherian, Yanbin Liu et al.
Align Your Latents: High-Resolution Video Synthesis With Latent Diffusion Models
Andreas Blattmann, Robin Rombach, Huan Ling et al.
All Are Worth Words: A ViT Backbone for Diffusion Models
Fan Bao, Shen Nie, Kaiwen Xue et al.
All-in-Focus Imaging From Event Focal Stack
Hanyue Lou, Minggui Teng, Yixin Yang et al.
All in One: Exploring Unified Video-Language Pre-Training
Jinpeng Wang, Yixiao Ge, Rui Yan et al.
All-in-One Image Restoration for Unknown Degradations Using Adaptive Discriminative Filters for Specific Degradations
Dongwon Park, Byung Hyun Lee, Se Young Chun
ALOFT: A Lightweight MLP-Like Architecture With Dynamic Low-Frequency Transform for Domain Generalization
Jintao Guo, Na Wang, Lei Qi et al.
A Loopback Network for Explainable Microvascular Invasion Classification
Shengxuming Zhang, Tianqi Shi, Yang Jiang et al.
ALSO: Automotive Lidar Self-Supervision by Occupancy Estimation
Alexandre Boulch, Corentin Sautier, Björn Michele et al.
AltFreezing for More General Video Face Forgery Detection
Zhendong Wang, Jianmin Bao, Wengang Zhou et al.
ALTO: Alternating Latent Topologies for Implicit 3D Reconstruction
Zhen Wang, Shijie Zhou, Jeong Joon Park et al.
Ambiguity-Resistant Semi-Supervised Learning for Dense Object Detection
Chang Liu, Weiming Zhang, Xiangru Lin et al.
Ambiguous Medical Image Segmentation Using Diffusion Models
Aimon Rahman, Jeya Maria Jose Valanarasu, Ilker Hacihaliloglu et al.
A Meta-Learning Approach to Predicting Performance and Data Requirements
Achin Jain, Gurumurthy Swaminathan, Paolo Favaro et al.
AMT: All-Pairs Multi-Field Transforms for Efficient Frame Interpolation
Zhen Li, Zuo-Liang Zhu, Ling-Hao Han et al.
An Actor-Centric Causality Graph for Asynchronous Temporal Inference in Group Activity
Zhao Xie, Tian Gao, Kewei Wu et al.
Analyzing and Diagnosing Pose Estimation With Attributions
Qiyuan He, Linlin Yang, Kerui Gu et al.
Analyzing Physical Impacts Using Transient Surface Wave Imaging
Tianyuan Zhang, Mark Sheinin, Dorian Chan et al.
Anchor3DLane: Learning To Regress 3D Anchors for Monocular 3D Lane Detection
Shaofei Huang, Zhenwei Shen, Zehao Huang et al.
AnchorFormer: Point Cloud Completion From Discriminative Nodes
Zhikai Chen, Fuchen Long, Zhaofan Qiu et al.
An Empirical Study of End-to-End Video-Language Transformers With Masked Visual Modeling
Tsu-Jui Fu, Linjie Li, Zhe Gan et al.