Papers
Efficient Two-Stage Detection of Human-Object Interactions With a Novel Unary-Pairwise Transformer
Frederic Z. Zhang, Dylan Campbell, Stephen Gould
Efficient Video Instance Segmentation via Tracklet Query and Proposal
Jialian Wu, Sudhir Yarram, Hui Liang et al.
Ego4D: Around the World in 3,000 Hours of Egocentric Video
Kristen Grauman, Andrew Westbury, Eugene Byrne et al.
Egocentric Deep Multi-Channel Audio-Visual Active Speaker Localization
Hao Jiang, Calvin Murdock, Vamsi Krishna Ithapu
Egocentric Prediction of Action Target in 3D
Yiming Li, Ziang Cao, Andrew Liang et al.
Egocentric Scene Understanding via Multimodal Spatial Rectifier
Tien Do, Khiem Vuong, Hyun Soo Park
EI-CLIP: Entity-Aware Interventional Contrastive Learning for E-Commerce Cross-Modal Retrieval
Haoyu Ma, Handong Zhao, Zhe Lin et al.
Eigencontours: Novel Contour Descriptors Based on Low-Rank Approximation
Wonhui Park, Dongkwon Jin, Chang-Su Kim
Eigenlanes: Data-Driven Lane Descriptors for Structurally Diverse Lanes
Dongkwon Jin, Wonhui Park, Seong-Gyun Jeong et al.
ElePose: Unsupervised 3D Human Pose Estimation by Predicting Camera Elevation and Learning Normalizing Flows on 2D Poses
Bastian Wandt, James J. Little, Helge Rhodin
ELIC: Efficient Learned Image Compression With Unevenly Grouped Space-Channel Contextual Adaptive Coding
Dailan He, Ziming Yang, Weikun Peng et al.
ELSR: Efficient Line Segment Reconstruction With Planes and Points Guidance
Dong Wei, Yi Wan, Yongjun Zhang et al.
Embracing Single Stride 3D Object Detector With Sparse Transformer
Lue Fan, Ziqi Pang, Tianyuan Zhang et al.
EMOCA: Emotion Driven Monocular Face Capture and Animation
Radek Daněček, Michael J. Black, Timo Bolkart
EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching
Yaya Shi, Xu Yang, Haiyang Xu et al.
Enabling Equivariance for Arbitrary Lie Groups
Lachlan E. MacDonald, Sameera Ramasinghe, Simon Lucey
En-Compactness: Self-Distillation Embedding & Contrastive Generation for Generalized Zero-Shot Learning
Xia Kong, Zuodong Gao, Xiaofan Li et al.
End-to-End Compressed Video Representation Learning for Generic Event Boundary Detection
Congcong Li, Xinyao Wang, Longyin Wen et al.
End-to-End Generative Pretraining for Multimodal Video Captioning
Paul Hongsuck Seo, Arsha Nagrani, Anurag Arnab et al.
End-to-End Human-Gaze-Target Detection With Transformers
Danyang Tu, Xiongkuo Min, Huiyu Duan et al.
End-to-End Multi-Person Pose Estimation With Transformers
Dahu Shi, Xing Wei, Liangqi Li et al.
End-to-End Reconstruction-Classification Learning for Face Forgery Detection
Junyi Cao, Chao Ma, Taiping Yao et al.
End-to-End Referring Video Object Segmentation With Multimodal Transformers
Adam Botach, Evgenii Zheltonozhskii, Chaim Baskin
End-to-End Semi-Supervised Learning for Video Action Detection
Akash Kumar, Yogesh Singh Rawat
End-to-End Trajectory Distribution Prediction Based on Occupancy Grid Maps
Ke Guo, Wenxi Liu, Jia Pan