Papers
ProphNet: Efficient Agent-Centric Motion Forecasting With Anchor-Informed Proposals
Xishun Wang, Tong Su, Fang Da et al.
Proposal-Based Multiple Instance Learning for Weakly-Supervised Temporal Action Localization
Huan Ren, Wenfei Yang, Tianzhu Zhang et al.
ProTeGe: Untrimmed Pretraining for Video Temporal Grounding by Video Temporal Grounding
Lan Wang, Gaurav Mittal, Sandra Sajeev et al.
ProtoCon: Pseudo-Label Refinement via Online Clustering and Prototypical Consistency for Efficient Semi-Supervised Learning
Islam Nassar, Munawar Hayat, Ehsan Abbasnejad et al.
Prototype-Based Embedding Network for Scene Graph Generation
Chaofan Zheng, Xinyu Lyu, Lianli Gao et al.
Prototypical Residual Networks for Anomaly Detection and Localization
Hui Zhang, Zuxuan Wu, Zheng Wang et al.
Proximal Splitting Adversarial Attack for Semantic Segmentation
Jérôme Rony, Jean-Christophe Pesquet, Ismail Ben Ayed
ProxyFormer: Proxy Alignment Assisted Point Cloud Completion With Missing Part Sensitive Transformer
Shanshan Li, Pan Gao, Xiaoyang Tan et al.
Pruning Parameterization With Bi-Level Optimization for Efficient Semantic Segmentation on the Edge
Changdi Yang, Pu Zhao, Yanyu Li et al.
Pseudo-Label Guided Contrastive Learning for Semi-Supervised Medical Image Segmentation
Hritam Basak, Zhaozheng Yin
PSVT: End-to-End Multi-Person 3D Pose and Shape Estimation With Progressive Video Transformers
Zhongwei Qiu, Qiansheng Yang, Jian Wang et al.
Putting People in Their Place: Affordance-Aware Human Insertion Into Scenes
Sumith Kulal, Tim Brooks, Alex Aiken et al.
PVO: Panoptic Visual Odometry
Weicai Ye, Xinyue Lan, Shuo Chen et al.
PVT-SSD: Single-Stage 3D Object Detector With Point-Voxel Transformer
Honghui Yang, Wenxiao Wang, Minghao Chen et al.
PyPose: A Library for Robot Learning With Physics-Based Optimization
Chen Wang, Dasong Gao, Kuan Xu et al.
PyramidFlow: High-Resolution Defect Contrastive Localization Using Pyramid Normalizing Flow
Jiarui Lei, Xiaobo Hu, Yue Wang et al.
Q-DETR: An Efficient Low-Bit Quantized Detection Transformer
Sheng Xu, Yanjing Li, Mingbao Lin et al.
Q: How To Specialize Large Vision-Language Models to Data-Scarce VQA Tasks? A: Self-Train on Unlabeled Images!
Zaid Khan, Vijay Kumar BG, Samuel Schulter et al.
QPGesture: Quantization-Based and Phase-Guided Motion Matching for Natural Speech-Driven Gesture Generation
Sicheng Yang, Zhiyong Wu, Minglei Li et al.
Quality-Aware Pre-Trained Models for Blind Image Quality Assessment
Kai Zhao, Kun Yuan, Ming Sun et al.
QuantArt: Quantizing Image Style Transfer Towards High Visual Fidelity
Siyu Huang, Jie An, Donglai Wei et al.
Quantitative Manipulation of Custom Attributes on 3D-Aware Image Synthesis
Hoseok Do, EunKyung Yoo, Taehyeong Kim et al.
Quantum-Inspired Spectral-Spatial Pyramid Network for Hyperspectral Image Classification
Jie Zhang, Yongshan Zhang, Yicong Zhou
Quantum Multi-Model Fitting
Matteo Farina, Luca Magri, Willi Menapace et al.
Query-Centric Trajectory Prediction
Zikang Zhou, Jianping Wang, Yung-Hui Li et al.