Papers
PhyEduVideo: A Benchmark for Evaluating Text-to-Video Models for Physics Education
Megha Mariam K.M, Aditya Arun, Zakaria Laskar et al.
PHYSPLAT: a Framework for Photorealistic Hybrid Simulation of Real and Synthetic Elements using 3D Gaussian Splatting
Mario Alfonso-Arsuaga, Henar Dominguez-Elvira, Jorge Casas-Guerrero et al.
PiSA: A Self-Augmented Data Engine and Training Strategy for 3D Understanding with Large Models
Zilu Guo, Hongbin Lin, Zhihao Yuan et al.
Point2Pose: A Generative Framework for 3D Human Pose Estimation with Multi-View Point Cloud Dataset
Hyunsoo Lee, Daeum Jeon, Hyeokjae Oh
Pointmap-Conditioned Diffusion for Consistent Novel View Synthesis
Thang-Anh-Quan Nguyen, Laurent Caraffa, Jean-Philippe Tarel et al.
PointNet4D: A Lightweight 4D Point Cloud Video Backbone for Online and Offline Perception in Robotic Applications
Yunze Liu, Zifan Wang, Peiran Wu et al.
PointSt3R: Point Tracking through 3D Ground Correspondence
Rhodri Guerrier, Adam W. Harley, Dima Damen
Polymorph: Energy-Efficient Multi-Label Classification for Video Streams on Embedded Devices
Saeid Ghafouri, Mohsen Fayyaz, Xiangchen Li et al.
PoseAdapt: Sustainable Human Pose Estimation via Continual Learning Benchmarks and Toolkit
Muhammad Saif Ullah Khan, Didier Stricker
Pose-Diverse Multi-View Virtual Try-on from a Single Frontal Image via Diffusion Transformer
Seonghee Han, Minchang Chung, Gyeongsu Cho et al.
PoseGaussian: Pose-Driven Novel View Synthesis for Robust 3D Human Reconstruction
Ju Shen, Chen Chen, Tam V. Nguyen et al.
Power of Boundary and Reflection: Semantic Transparent Object Segmentation using Pyramid Vision Transformer with Transparent Cues
Tuan-Anh Vu, Hai Nguyen-Truong, Ziqiang Zheng et al.
Predicting Task fMRI Contrasts from Resting-State fMRI Using Sparse 3D Convolutions
Ivan Sviridov, Maria Boyko, Maksim Sharaev
PredMapNet: Future and Historical Reasoning for Consistent Online HD Vectorized Map Construction
Bo Lang, Nirav Savaliya, Zhihao Zheng et al.
Pretraining Helps When Capacity Allows: Evidence from Ultra-Small ConvNets
Srikanth Muralidharan, Heitor R. Medeiros, Masih Aminbeidokhti et al.
PrevMatch: Revisiting and Maximizing Temporal Knowledge in Semi-Supervised Semantic Segmentation
Wooseok Shin, Hyun Joon Park, Jin Sob Kim et al.
PRISM-CAFO: Prior-conditioned Remote-sensing Infrastructure Segmentation and Mapping for CAFOs
Oishee Bintey Hoque, Nibir Chandra Mandal, Kyle Luong et al.
Procedure Learning via Regularized Gromov-Wasserstein Optimal Transport
Syed Ahmed Mahmood, Ali Shah Ali, Umer Ahmed et al.
PromptGAR: Flexible Promptive Group Activity Recognition
Zhangyu Jin, Andrew Feng, Ankur Chemburkar et al.
Prompt-OT: An Optimal Transport Regularization Paradigm for Knowledge Preservation in Vision-Language Model Adaptation
Xiwen Chen, Wenhui Zhu, Peijie Qiu et al.
ProSkill: Segment-Level Skill Assessment in Procedural Videos
Michele Mazzamuto, Daniele Di Mauro, Gianpiero Francesca et al.
ProtoGMVAE: A Variational Auto-Encoder with True Gaussian Mixture Prior for Prototypical-based Self-Explainability
Martin Blanchard, Christophe Ducottet, Damien Muselet et al.
PS3: Part Level Instance Segmentation in 3D
Hong-Xuan Yen, Chiamin Chen, Yanqing Wang et al.
PSA-MIL: A Probabilistic Spatial Attention-Based Multiple Instance Learning for Whole Slide Image Classification
Sharon Peled, Yosef E. Maruvka, Moti Freiman
PSDiffusion: Harmonized Multi-Layer Image Generation via Layout and Appearance Alignment
Dingbang Huang, Wenbo Li, Yifei Zhao et al.