Peter Vajda

29 papers · 2018–2025 · 5 top CS/AI conferences

Achievements

🗺️ Taxonomy Completionist (49) 🌉 Interdisciplinary Bridge 🏃 Academic Marathon (7) 🌍 Conference Polyglot (5) 🧭 Keyword Pioneer 🤝 Dynamic Duo (18) 🧬 Topic Evolution ❓ The Questioner 💎 Century Club (29) 🚀 Conference Pioneer 🗃️ Keyword Collector (107) ⚡ Prolific Year (5) 🔥 Unstoppable (8)

Conferences

CVPR (16) ECCV (7) ICCV (3) ICLR (2) ICML (1)

Top co-authors

Bichen Wu (18) Peizhao Zhang (15) Xiaoliang Dai (12) Zijian He (10) Jialiang Wang (7) CHIH-YAO MA (6) Kurt Keutzer (5) Sam Tsai (5) Ji Hou (5) Chenfeng Xu (4)

Keywords

diffusion model (5) efficient computing (5) neural architecture search (3) model compression (3) neural network (3) video generation (3) differentiable neural architecture search (2) video editing (2) semantic segmentation (2) video-to-video synthesis (2) temporal consistency (2) hardware-aware design (2) video synthesis (1) vision transformer (1) object detection (1) hyperparameter optimization (1) adversarial learning (1) 3d reconstruction (1) image restoration (1) image synthesis (1)

Papers

Learnings from Scaling Visual Tokenizers for Reconstruction and Generation · ICML 2025
Movie Weaver: Tuning-Free Multi-Concept Video Personalization with Anchored Prompts · CVPR 2025
LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity · CVPR 2025
ControlRoom3D: Room Generation using Semantic Proxy Rooms · CVPR 2024
Cache Me if You Can: Accelerating Diffusion Models through Block Caching · CVPR 2024
AVID: Any-Length Video Inpainting with Diffusion Model · CVPR 2024
FlowVid: Taming Imperfect Optical Flows for Consistent Video-to-Video Synthesis · CVPR 2024
Fairy: Fast Parallelized Instruction-Guided Video-to-Video Synthesis · CVPR 2024
Castling-ViT: Compressing Self-Attention via Switching Towards Linear-Angular Attention at Vision Transformer Inference · CVPR 2023
NeRF-Det: Learning Geometry-Aware Volumetric Representation for Multi-View 3D Object Detection · ICCV 2023
Open-Vocabulary Semantic Segmentation With Mask-Adapted CLIP · CVPR 2023
A Practical Stereo Depth System for Smart Glasses · CVPR 2023
Data Efficient Language-Supervised Zero-Shot Recognition with Optimal Transport Distillation · ICLR 2022
Open-Set Semi-Supervised Object Detection · ECCV 2022
Cross-Domain Adaptive Teacher for Object Detection · CVPR 2022
Image2Point: 3D Point-Cloud Understanding with 2D Image Pretrained Models · ECCV 2022
Visual Transformers: Where Do Transformers Really Belong in Vision Models? · ICCV 2021
Tackling the Ill-Posedness of Super-Resolution Through Adaptive Target Generation · CVPR 2021
Unbiased Teacher for Semi-Supervised Object Detection · ICLR 2021
FBNetV3: Joint Architecture-Recipe Search Using Predictor Pretraining · CVPR 2021
FBNetV2: Differentiable Neural Architecture Search for Spatial and Channel Dimensions · CVPR 2020
Deep Space-Time Video Upsampling Networks · ECCV 2020
Geometric Correspondence Fields: Learned Differentiable Rendering for 3D Pose Refinement in the Wild · ECCV 2020
Learning to Generate Grounded Visual Captions without Localization Supervision · ECCV 2020
SqueezeSegV3: Spatially-Adaptive Convolution for Efficient Point-Cloud Segmentation · ECCV 2020
Efficient Segmentation: Learning Downsampling Near Semantic Boundaries · ICCV 2019
ChamNet: Towards Efficient Network Design Through Platform-Aware Model Adaptation · CVPR 2019
FBNet: Hardware-Aware Efficient ConvNet Design via Differentiable Neural Architecture Search · CVPR 2019
Value-aware Quantization for Training and Inference of Neural Networks · ECCV 2018