Papers
OT-VP: Optimal Transport-Guided Visual Prompting for Test-Time Adaptation
Yunbei Zhang, Akshay Mehra, Jihun Hamm
PACA: Prespective-Aware Cross-Attention Representation for Zero-Shot Scene Rearrangement
Shutong Jin, Ruiyu Wang, Kuangyi Chen et al.
Paladin: Understanding Video Intentions in Political Advertisement Videos
Hong Liu, Yuta Nakashima, Noboru Babaguchi
PALO: A Polyglot Large Multimodal Model for 5B People
Hanoona Rasheed, Muhammad Maaz, Abdelrahman Shaker et al.
Partial Texture VAE: Color and Texture Encoder for Rock Particle Images
Tetsushi Yamada, Simone Di Santo
PatchFinder: Leveraging Visual Language Models for Accurate Information Retrieval using Model Uncertainty
Roman Colman, Minh Vu, Manish Bhattarai et al.
Patch Ranking: Token Pruning as Ranking Prediction for Efficient CLIP
Cheng-En Wu, Jinhong Lin, Yu Hen Hu et al.
Pay Attention to Your Neighbours: Training-Free Open-Vocabulary Semantic Segmentation
Sina Hajimiri, Ismail Ben Ayed, Jose Dolz
PC-GZSL: Prior Correction for Generalized Zero Shot Learning
S Divakar Bhat, Amit More, Mudit Soni et al.
Perceive Query & Reason: Enhancing Video QA with Question-Guided Temporal Queries
Roberto Amoroso, Gengyuan Zhang, Rajat Koner et al.
Per-Pixel Solution of Multispectral Photometric Stereo
Shin Ishihara, Imari Sato
Personalized Mixture of Experts for Multi-Site Medical Image Segmentation
Md Motiur Rahman, Mohamed Trabelsi, Huseyin Uzunalioglu et al.
PETALface: Parameter Efficient Transfer Learning for Low-Resolution Face Recognition
Kartik Narayan, Nithin Gopalakrishnan Nair, Jennifer Xu et al.
PGRID: Power Grid Reconstruction in Informal Developments using High-Resolution Aerial Imagery
Simone Fobi Nsutezo, Amrita Gupta, Duncan Kebut et al.
Phaseformer: Phase-Based Attention Mechanism for Underwater Image Restoration and Beyond
Raqib Khan, Anshul Negi, Ashutosh Kulkarni et al.
Physiology-Aware PolySnake for Coronary Vessel Segmentation
Yizhe Ruan, Lin Gu, Yusuke Kurose et al.
PICASSO: A Feed-Forward Framework for Parametric Inference of CAD Sketches via Rendering Self-Supervision
Ahmet Serdar Karadeniz, Dimitrios Mallis, Nesryne Mejri et al.
PivotAlign: Improve Semi-Supervised Learning by Learning Intra-Class Heterogeneity and Aligning with Pivots
Lingjie Yi, Tao Sun, Yikai Zhang et al.
Pix2Poly: A Sequence Prediction Method for End-to-End Polygonal Building Footprint Extraction from Remote Sensing Imagery
Yeshwanth Kumar Adimoolam, Charalambos Poullis, Melinos Averkiou
Pixel-Wise Shuffling with Collaborative Sparsity for Melanoma Hyperspectral Image Classification
Favour Ekong, Jun Zhou, Kwabena Sarpong et al.
PixSwap: High-Resolution Face Swapping for Effective Reflection of Identity via Pixel-Level Supervision with Synthetic Paired Dataset
Taewoo Kim, Geonsu Lee, Hyukgi Lee et al.
PK-YOLO: Pretrained Knowledge Guided YOLO for Brain Tumor Detection in Multiplanar MRI Slices
Ming Kang, Fung Fung Ting, Raphael C.-W. Phan et al.
Planar Gaussian Splatting
Farhad G. Zanjani, Hong Cai, Hanno Ackermann et al.
PLReMix: Combating Noisy Labels with Pseudo-Label Relaxed Contrastive Representation Learning
Xiaoyu Liu, Beitong Zhou, Zuogong Yue et al.