Peter Vajda
29 papers · 2018–2025 · 5 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+10 more ↓ Show less ↑
πΊοΈ Taxonomy Completionist (49) π Interdisciplinary Bridge π Academic Marathon (7) π Conference Polyglot (5) π§ Keyword Pioneer
π
Conference Polyglot
(5)
π
Academic Marathon
(7)
π€
Dynamic Duo
(18)
π§¬
Topic Evolution
β
The Questioner
π
Century Club
(29)
π
Conference Pioneer
ποΈ
Keyword Collector
(107)
β‘
Prolific Year
(5)
π₯
Unstoppable
(8)
Conferences
CVPR (16)
ECCV (7)
ICCV (3)
ICLR (2)
ICML (1)
Top co-authors
Keywords
diffusion model
(5)
efficient computing
(5)
neural architecture search
(3)
model compression
(3)
neural network
(3)
video generation
(3)
differentiable neural architecture search
(2)
video editing
(2)
semantic segmentation
(2)
video-to-video synthesis
(2)
temporal consistency
(2)
hardware-aware design
(2)
video synthesis
(1)
vision transformer
(1)
object detection
(1)
hyperparameter optimization
(1)
adversarial learning
(1)
3d reconstruction
(1)
image restoration
(1)
image synthesis
(1)
Papers
Learnings from Scaling Visual Tokenizers for Reconstruction and Generation
ICML 2025
Movie Weaver: Tuning-Free Multi-Concept Video Personalization with Anchored Prompts
CVPR 2025
LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity
CVPR 2025
ControlRoom3D: Room Generation using Semantic Proxy Rooms
CVPR 2024
Cache Me if You Can: Accelerating Diffusion Models through Block Caching
CVPR 2024
AVID: Any-Length Video Inpainting with Diffusion Model
CVPR 2024
FlowVid: Taming Imperfect Optical Flows for Consistent Video-to-Video Synthesis
CVPR 2024
Fairy: Fast Parallelized Instruction-Guided Video-to-Video Synthesis
CVPR 2024
Castling-ViT: Compressing Self-Attention via Switching Towards Linear-Angular Attention at Vision Transformer Inference
CVPR 2023
NeRF-Det: Learning Geometry-Aware Volumetric Representation for Multi-View 3D Object Detection
ICCV 2023
Open-Vocabulary Semantic Segmentation With Mask-Adapted CLIP
CVPR 2023
A Practical Stereo Depth System for Smart Glasses
CVPR 2023
Data Efficient Language-Supervised Zero-Shot Recognition with Optimal Transport Distillation
ICLR 2022
Open-Set Semi-Supervised Object Detection
ECCV 2022
Cross-Domain Adaptive Teacher for Object Detection
CVPR 2022
Image2Point: 3D Point-Cloud Understanding with 2D Image Pretrained Models
ECCV 2022
Visual Transformers: Where Do Transformers Really Belong in Vision Models?
ICCV 2021
Tackling the Ill-Posedness of Super-Resolution Through Adaptive Target Generation
CVPR 2021
Unbiased Teacher for Semi-Supervised Object Detection
ICLR 2021
FBNetV3: Joint Architecture-Recipe Search Using Predictor Pretraining
CVPR 2021
FBNetV2: Differentiable Neural Architecture Search for Spatial and Channel Dimensions
CVPR 2020
Deep Space-Time Video Upsampling Networks
ECCV 2020
Geometric Correspondence Fields: Learned Differentiable Rendering for 3D Pose Refinement in the Wild
ECCV 2020
Learning to Generate Grounded Visual Captions without Localization Supervision
ECCV 2020
SqueezeSegV3: Spatially-Adaptive Convolution for Efficient Point-Cloud Segmentation
ECCV 2020
Efficient Segmentation: Learning Downsampling Near Semantic Boundaries
ICCV 2019
ChamNet: Towards Efficient Network Design Through Platform-Aware Model Adaptation
CVPR 2019
FBNet: Hardware-Aware Efficient ConvNet Design via Differentiable Neural Architecture Search
CVPR 2019
Value-aware Quantization for Training and Inference of Neural Networks
ECCV 2018