Federico Tombari

114 papers · 2013–2026 · 8 conferences · across top CS/AI conferences

Achievements

+15 more ↓

🏃 Academic Marathon (13) 🌍 Conference Polyglot (8) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🐝 Cross-Pollinator (9)

🐝 Cross-Pollinator (9) 🌈 Renaissance Researcher (9) 🗺️ Taxonomy Completionist (114) 🏠 Conference Loyalist (27) 🤝 Dynamic Duo (39) 🏆 Grand Slam 🏆 Keyword Champion (8) 👑 Triple Crown 🔬 Deep Specialist (39) ⚡ Prolific Year (15) 🚀 Conference Pioneer 🗃️ Keyword Collector (368) 🔥 Unstoppable (10) 📈 Trend Setter 💎 Century Club (113)

Conferences

CVPR (40) ECCV (27) ICCV (26) ICLR (6) NIPS (6) WACV (5) AAAI (2) ICML (2)

Top co-authors

Nassir Navab (39) Fabian Manhardt (23) Yan Di (13) Luc Van Gool (13) Yongqin Xian (12) Xiangyang Ji (11) Muhammad Ferjad Naeem (10) Benjamin Busam (9) Ruida Zhang (8) David Joseph Tan (8)

Keywords

3d reconstruction (13) object detection (10) scene graph (8) semantic segmentation (8) point cloud (8) diffusion model (8) pose estimation (7) 3d vision (6) vision-language model (6) zero-shot learning (5) neural radiance field (5) domain adaptation (4) 6d pose estimation (4) 3d scene understanding (4) convolutional neural network (4) multimodal learning (3) instance segmentation (3) text-to-image generation (3) representation learning (3) scene understanding (3)

Papers

OracleGS: Grounding Generative Priors for Sparse-View Gaussian Splatting WACV 2026 RiemanLine: Riemannian Manifold Representation of 3D Lines for Factor Graph Optimization AAAI 2026 Mixed Diffusion for 3D Indoor Scene Synthesis WACV 2026 Learning to Prompt with Text Only Supervision for Vision-Language Models AAAI 2025 Towards Real-Time Open-Vocabulary Video Instance Segmentation WACV 2025 LIME: Localized Image Editing via Attention Regularization in Diffusion Models WACV 2025 TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters ICLR 2025 CubeDiff: Repurposing Diffusion-Based Image Models for Panorama Generation ICLR 2025 MOBIUS: Big-to-Mobile Universal Instance Segmentation via Multi-modal Bottleneck Fusion and Calibrated Decoder Pruning ICCV 2025 UIP2P: Unsupervised Instruction-based Image Editing via Edit Reversibility Constraint ICCV 2025 Contrastive Test-Time Composition of Multiple LoRA Models for Image Generation ICCV 2025 4D Gaussian Splatting SLAM ICCV 2025 Hierarchical 3D Scene Graphs Construction Outdoors ICCV 2025 Prior2Former - Evidential Modeling of Mask Transformers for Assumption-Free Open-World Panoptic Segmentation ICCV 2025 RelationField: Relate Anything in Radiance Fields CVPR 2025 Semantic Library Adaptation: LoRA Retrieval and Fusion for Open-Vocabulary Semantic Segmentation CVPR 2025 One2Any: One-Reference 6D Pose Estimation for Any Object CVPR 2025 Test-Time Visual In-Context Tuning CVPR 2025 LayoutVLM: Differentiable Optimization of 3D Layout via Vision-Language Models CVPR 2025 UNOPose: Unseen Object Pose Estimation with an Unposed RGB-D Reference Image CVPR 2025 Omnia de EgoTempo: Benchmarking Temporal Understanding of Multi-Modal LLMs in Egocentric Videos CVPR 2025 LoRACLR: Contrastive Adaptation for Customization of Diffusion Models CVPR 2025 Active Data Curation Effectively Distills Large-Scale Multimodal Models CVPR 2025 ESCAPE: Equivariant Shape Completion via Anchor Point Encoding CVPR 2025 SemiVL: Semi-Supervised Semantic Segmentation with Vision-Language Guidance ECCV 2024 UniSDF: Unifying Neural Representations for High-Fidelity 3D Reconstruction of Complex Scenes with Reflections NIPS 2024 Stylebreeder: Exploring and Democratizing Artistic Styles through Text-to-Image Models NIPS 2024 SceneFun3D: Fine-Grained Functionality and Affordance Understanding in 3D Scenes CVPR 2024 Know Your Neighbors: Improving Single-View Reconstruction via Spatial Vision-Language Reasoning CVPR 2024 MOHO: Learning Single-view Hand-held Object Reconstruction with Multi-view Occlusion-Aware Supervision CVPR 2024 SecondPose: SE(3)-Consistent Dual-Stream Feature Fusion for Category-Level Pose Estimation CVPR 2024 CONFORM: Contrast is All You Need for High-Fidelity Text-to-Image Diffusion Models CVPR 2024 KP-RED: Exploiting Semantic Keypoints for Joint 3D Shape Retrieval and Deformation CVPR 2024 HyperSDFusion: Bridging Hierarchical Structures in Language and Geometry for Enhanced 3D Text2Shape Generation CVPR 2024 Diffusion Bridges for 3D Point Cloud Denoising ECCV 2024 SceneGraphLoc: Cross-Modal Coarse Visual Localization on 3D Scene Graphs ECCV 2024 BRAVE: Broadening the visual encoding of vision-language models ECCV 2024 SILC: Improving Vision Language Pretraining with Self-Distillation ECCV 2024 EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion ECCV 2024 D-SCo: Dual-Stream Conditional Diffusion for Monocular Hand-Held Object Reconstruction ECCV 2024 Segment3D: Learning Fine-Grained Class-Agnostic 3D Segmentation without Manual Labels ECCV 2024 GeoGaussian: Geometry-aware Gaussian Splatting for Scene Rendering ECCV 2024 PhysAvatar: Learning the Physics of Dressed 3D Avatars from Visual Observations ECCV 2024 Self-supervised Shape Completion via Involution and Implicit Correspondences ECCV 2024 Text-Conditioned Resampler For Long Form Video Understanding ECCV 2024 Denoising Diffusion via Image-Based Rendering ICLR 2024 OpenNeRF: Open Set 3D Neural Scene Segmentation with Pixel-Wise Features and Rendered Novel Views ICLR 2024 Extracting Training Data From Document-Based VQA Models ICML 2024 U-RED: Unsupervised 3D Shape Retrieval and Deformation for Partial Point Clouds ICCV 2023 Dynamic Hyperbolic Attention Network for Fine Hand-object Reconstruction ICCV 2023 SparseFusion: Fusing Multi-Modal Sparse Representations for Multi-Sensor 3D Object Detection ICCV 2023 Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language ICLR 2023 OpenMask3D: Open-Vocabulary 3D Instance Segmentation NIPS 2023 DDF-HO: Hand-Held Object Reconstruction via Conditional Directed Distance Field NIPS 2023 CommonScenes: Generating Commonsense 3D Indoor Scenes with Scene Graph Diffusion NIPS 2023 Shape, Pose, and Appearance From a Single Image via Bootstrapped Radiance Field Inversion CVPR 2023 Incremental 3D Semantic Scene Graph Prediction From RGB Sequences CVPR 2023 Robust Monocular Depth Estimation under Challenging Conditions ICCV 2023 SPARF: Neural Radiance Fields From Sparse and Noisy Poses CVPR 2023 I2MVFormer: Large Language Model Generated Multi-View Document Supervision for Zero-Shot Image Classification CVPR 2023 IPCC-TP: Utilizing Incremental Pearson Correlation Coefficient for Joint Multi-Agent Trajectory Prediction CVPR 2023 Segmenting Known Objects and Unseen Unknowns without Prior Knowledge ICCV 2023 Introducing Language Guidance in Prompt-based Continual Learning ICCV 2023 Bending Graphs: Hierarchical Shape Matching Using Gated Optimal Transport CVPR 2022 3D-VField: Adversarial Augmentation of Point Clouds for Domain Generalization in 3D Object Detection CVPR 2022 RBP-Pose: Residual Bounding Box Projection for Category-Level Pose Estimation ECCV 2022 E-Graph: Minimal Solution for Rigid Rotation with Extensibility Graphs ECCV 2022 Implicit Neural Representations for Image Compression ECCV 2022 3D Compositional Zero-Shot Learning with DeCompositional Consensus ECCV 2022 GOCA: Guided Online Cluster Assignment for Self-Supervised Video Representation Learning ECCV 2022 GPV-Pose: Category-Level Object Pose Estimation via Geometry-Guided Point-Wise Voting CVPR 2022 Learning Local Displacements for Point Cloud Completion CVPR 2022 On the Practicality of Deterministic Epistemic Uncertainty ICML 2022 SHIFT: A Synthetic Driving Dataset for Continuous Multi-Task Domain Adaptation CVPR 2022 I2DFormer: Learning Image to Document Attention for Zero-Shot Image Classification NIPS 2022 ZebraPose: Coarse To Fine Surface Encoding for 6DoF Object Pose Estimation CVPR 2022 GDR-Net: Geometry-Guided Direct Regression Network for Monocular 6D Object Pose Estimation CVPR 2021 Learning Graph Embeddings for Compositional Zero-Shot Learning CVPR 2021 Variational Transformer Networks for Layout Generation CVPR 2021 SO-Pose: Exploiting Self-Occlusion for Direct 6D Pose Estimation ICCV 2021 Graph-to-3D: End-to-End Generation and Manipulation of 3D Scenes Using Scene Graphs ICCV 2021 Unconditional Scene Graph Generation ICCV 2021 DB-GAN: Boosting Object Recognition Under Strong Lighting Conditions WACV 2021 SceneGraphFusion: Incremental 3D Scene Graph Prediction From RGB-D Sequences CVPR 2021 Deep Positional and Relational Feature Learning for Rotation-Invariant Point Cloud Analysis ECCV 2020 Quaternion Equivariant Capsule Networks for 3D Point Clouds ECCV 2020 Self6D: Self-Supervised Monocular 6D Object Pose Estimation ECCV 2020 SoftPoolNet: Shape Descriptor for Point Cloud Completion and Classification ECCV 2020 Beyond Controlled Environments: 3D Camera Re-Localization in Changing Indoor Scenes ECCV 2020 Restricting the Flow: Information Bottlenecks for Attribution ICLR 2020 Semantic Image Manipulation Using Scene Graphs CVPR 2020 Learning 3D Semantic Scene Graphs From 3D Indoor Reconstructions CVPR 2020 Explaining the Ambiguity of Object Detection and 6D Pose From Visual Data ICCV 2019 Object-Driven Multi-Layer Scene Decomposition From a Single Image ICCV 2019 Sampling-Free Epistemic Uncertainty Estimation Using Approximated Variance Propagation ICCV 2019 GFrames: Gradient-Based Local Reference Frame for 3D Shape Matching CVPR 2019 3D Point Capsule Networks CVPR 2019 Query-Guided End-To-End Person Search CVPR 2019 ForkNet: Multi-Branch Volumetric Semantic Completion From a Single Depth Image ICCV 2019 RIO: 3D Object Instance Re-Localization in Changing Indoor Environments ICCV 2019 Human Motion Analysis with Deep Metric Learning ECCV 2018 Distortion-Aware Convolutional Filters for Dense Prediction in Panoramic Images ECCV 2018 Deep Model-Based 6D Pose Refinement in RGB ECCV 2018 Guide Me: Interacting With Deep Networks CVPR 2018 Fully-Convolutional Point Networks for Large-Scale Point Clouds ECCV 2018 BOP: Benchmark for 6D Object Pose Estimation ECCV 2018 Real-Time 3D Model Tracking in Color and Depth on a Single CPU Core CVPR 2017 SSD-6D: Making RGB-Based 3D Detection and 6D Pose Estimation Great Again ICCV 2017 Long Short-Term Memory Kalman Filters: Recurrent Neural Estimators for Pose Regularization ICCV 2017 Learning in an Uncertain World: Representing Ambiguity Through Multiple Hypotheses ICCV 2017 CNN-SLAM: Real-Time Dense Monocular SLAM With Learned Depth Prediction CVPR 2017 Learning a Descriptor-Specific 3D Keypoint Detector ICCV 2015 A Versatile Learning-Based 3D Temporal Tracker: Scalable, Robust, Online ICCV 2015 BOLD Features to Detect Texture-less Objects ICCV 2013