Papers
Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis
Jiahe Li, Jiawei Zhang, Xiao Bai et al.
EfficientTrain: Exploring Generalized Curriculum Learning for Training Visual Backbones
Yulin Wang, Yang Yue, Rui Lu et al.
Efficient Transformer-based 3D Object Detection with Dynamic Token Halting
Mao Ye, Gregory P. Meyer, Yuning Chai et al.
Efficient Unified Demosaicing for Bayer and Non-Bayer Patterned Image Sensors
Haechang Lee, Dongwon Park, Wongi Jeong et al.
Efficient Video Action Detection with Token Dropout and Context Refinement
Lei Chen, Zhan Tong, Yibing Song et al.
Efficient Video Prediction via Sparsely Conditioned Flow Matching
Aram Davtyan, Sepehr Sameni, Paolo Favaro
Efficient View Synthesis with Neural Radiance Distribution Field
Yushuang Wu, Xiao Li, Jinglu Wang et al.
EfficientViT: Lightweight Multi-Scale Attention for High-Resolution Dense Prediction
Han Cai, Junyan Li, Muyan Hu et al.
Efficient-VQGAN: Towards High-Resolution Image Generation with Efficient Vision Transformers
Shiyue Cao, Yueqin Yin, Lianghua Huang et al.
EGC: Image Generation and Classification via a Diffusion Energy-Based Model
Qiushan Guo, Chuofan Ma, Yi Jiang et al.
EGformer: Equirectangular Geometry-biased Transformer for 360 Depth Estimation
Ilwi Yun, Chanyong Shin, Hyunku Lee et al.
Ego-Humans: An Ego-Centric 3D Multi-Human Benchmark
Rawal Khirodkar, Aayush Bansal, Lingni Ma et al.
EgoLoc: Revisiting 3D Object Localization from Egocentric Videos with Visual Queries
Jinjie Mai, Abdullah Hamdi, Silvio Giancola et al.
EgoObjects: A Large-Scale Egocentric Dataset for Fine-Grained Object Understanding
Chenchen Zhu, Fanyi Xiao, Andres Alvarado et al.
Ego-Only: Egocentric Action Detection without Exocentric Transferring
Huiyu Wang, Mitesh Kumar Singh, Lorenzo Torresani
EgoPCA: A New Framework for Egocentric Hand-Object Interaction Understanding
Yue Xu, Yong-Lu Li, Zhemin Huang et al.
EgoTV: Egocentric Task Verification from Natural Language Task Descriptions
Rishi Hazra, Brian Chen, Akshara Rai et al.
EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone
Shraman Pramanick, Yale Song, Sayan Nag et al.
EigenPlaces: Training Viewpoint Robust Models for Visual Place Recognition
Gabriele Berton, Gabriele Trivigno, Barbara Caputo et al.
EigenTrajectory: Low-Rank Descriptors for Multi-Modal Trajectory Forecasting
Inhwan Bae, Jean Oh, Hae-Gon Jeon
ElasticViT: Conflict-aware Supernet Training for Deploying Fast Vision Transformer on Diverse Mobile Devices
Chen Tang, Li Lyna Zhang, Huiqiang Jiang et al.
ELFNet: Evidential Local-global Fusion for Stereo Matching
Jieming Lou, Weide Liu, Zhuo Chen et al.
ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation
Yuxiang Wei, Yabo Zhang, Zhilong Ji et al.
EMDB: The Electromagnetic Database of Global 3D Human Pose and Shape in the Wild
Manuel Kaufmann, Jie Song, Chen Guo et al.
EMMN: Emotional Motion Memory Network for Audio-driven Emotional Talking Face Generation
Shuai Tan, Bin Ji, Ye Pan