Federico Tombari
114 papers · 2013–2026 · 8 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+15 more ↓ Show less ↑
π Academic Marathon (13) π Conference Polyglot (8) π§ Keyword Pioneer π Interdisciplinary Bridge π Cross-Pollinator (9)
π
Cross-Pollinator
(9)
π
Renaissance Researcher
(9)
πΊοΈ
Taxonomy Completionist
(114)
π
Conference Loyalist
(27)
π€
Dynamic Duo
(39)
π
Grand Slam
π
Keyword Champion
(8)
π
Triple Crown
π¬
Deep Specialist
(39)
β‘
Prolific Year
(15)
π
Conference Pioneer
ποΈ
Keyword Collector
(368)
π₯
Unstoppable
(10)
π
Trend Setter
π
Century Club
(113)
Conferences
CVPR (40)
ECCV (27)
ICCV (26)
ICLR (6)
NIPS (6)
WACV (5)
AAAI (2)
ICML (2)
Top co-authors
Keywords
3d reconstruction
(13)
object detection
(10)
scene graph
(8)
semantic segmentation
(8)
point cloud
(8)
diffusion model
(8)
pose estimation
(7)
3d vision
(6)
vision-language model
(6)
zero-shot learning
(5)
neural radiance field
(5)
domain adaptation
(4)
6d pose estimation
(4)
3d scene understanding
(4)
convolutional neural network
(4)
multimodal learning
(3)
instance segmentation
(3)
text-to-image generation
(3)
representation learning
(3)
scene understanding
(3)
Papers
OracleGS: Grounding Generative Priors for Sparse-View Gaussian Splatting
WACV 2026
RiemanLine: Riemannian Manifold Representation of 3D Lines for Factor Graph Optimization
AAAI 2026
Mixed Diffusion for 3D Indoor Scene Synthesis
WACV 2026
Learning to Prompt with Text Only Supervision for Vision-Language Models
AAAI 2025
Towards Real-Time Open-Vocabulary Video Instance Segmentation
WACV 2025
LIME: Localized Image Editing via Attention Regularization in Diffusion Models
WACV 2025
TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters
ICLR 2025
CubeDiff: Repurposing Diffusion-Based Image Models for Panorama Generation
ICLR 2025
MOBIUS: Big-to-Mobile Universal Instance Segmentation via Multi-modal Bottleneck Fusion and Calibrated Decoder Pruning
ICCV 2025
UIP2P: Unsupervised Instruction-based Image Editing via Edit Reversibility Constraint
ICCV 2025
Contrastive Test-Time Composition of Multiple LoRA Models for Image Generation
ICCV 2025
4D Gaussian Splatting SLAM
ICCV 2025
Hierarchical 3D Scene Graphs Construction Outdoors
ICCV 2025
Prior2Former - Evidential Modeling of Mask Transformers for Assumption-Free Open-World Panoptic Segmentation
ICCV 2025
RelationField: Relate Anything in Radiance Fields
CVPR 2025
Semantic Library Adaptation: LoRA Retrieval and Fusion for Open-Vocabulary Semantic Segmentation
CVPR 2025
One2Any: One-Reference 6D Pose Estimation for Any Object
CVPR 2025
Test-Time Visual In-Context Tuning
CVPR 2025
LayoutVLM: Differentiable Optimization of 3D Layout via Vision-Language Models
CVPR 2025
UNOPose: Unseen Object Pose Estimation with an Unposed RGB-D Reference Image
CVPR 2025
Omnia de EgoTempo: Benchmarking Temporal Understanding of Multi-Modal LLMs in Egocentric Videos
CVPR 2025
LoRACLR: Contrastive Adaptation for Customization of Diffusion Models
CVPR 2025
Active Data Curation Effectively Distills Large-Scale Multimodal Models
CVPR 2025
ESCAPE: Equivariant Shape Completion via Anchor Point Encoding
CVPR 2025
SemiVL: Semi-Supervised Semantic Segmentation with Vision-Language Guidance
ECCV 2024
UniSDF: Unifying Neural Representations for High-Fidelity 3D Reconstruction of Complex Scenes with Reflections
NIPS 2024
Stylebreeder: Exploring and Democratizing Artistic Styles through Text-to-Image Models
NIPS 2024
SceneFun3D: Fine-Grained Functionality and Affordance Understanding in 3D Scenes
CVPR 2024
Know Your Neighbors: Improving Single-View Reconstruction via Spatial Vision-Language Reasoning
CVPR 2024
MOHO: Learning Single-view Hand-held Object Reconstruction with Multi-view Occlusion-Aware Supervision
CVPR 2024
SecondPose: SE(3)-Consistent Dual-Stream Feature Fusion for Category-Level Pose Estimation
CVPR 2024
CONFORM: Contrast is All You Need for High-Fidelity Text-to-Image Diffusion Models
CVPR 2024
KP-RED: Exploiting Semantic Keypoints for Joint 3D Shape Retrieval and Deformation
CVPR 2024
HyperSDFusion: Bridging Hierarchical Structures in Language and Geometry for Enhanced 3D Text2Shape Generation
CVPR 2024
Diffusion Bridges for 3D Point Cloud Denoising
ECCV 2024
SceneGraphLoc: Cross-Modal Coarse Visual Localization on 3D Scene Graphs
ECCV 2024
BRAVE: Broadening the visual encoding of vision-language models
ECCV 2024
SILC: Improving Vision Language Pretraining with Self-Distillation
ECCV 2024
EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion
ECCV 2024
D-SCo: Dual-Stream Conditional Diffusion for Monocular Hand-Held Object Reconstruction
ECCV 2024
Segment3D: Learning Fine-Grained Class-Agnostic 3D Segmentation without Manual Labels
ECCV 2024
GeoGaussian: Geometry-aware Gaussian Splatting for Scene Rendering
ECCV 2024
PhysAvatar: Learning the Physics of Dressed 3D Avatars from Visual Observations
ECCV 2024
Self-supervised Shape Completion via Involution and Implicit Correspondences
ECCV 2024
Text-Conditioned Resampler For Long Form Video Understanding
ECCV 2024
Denoising Diffusion via Image-Based Rendering
ICLR 2024
OpenNeRF: Open Set 3D Neural Scene Segmentation with Pixel-Wise Features and Rendered Novel Views
ICLR 2024
Extracting Training Data From Document-Based VQA Models
ICML 2024
U-RED: Unsupervised 3D Shape Retrieval and Deformation for Partial Point Clouds
ICCV 2023
Dynamic Hyperbolic Attention Network for Fine Hand-object Reconstruction
ICCV 2023
SparseFusion: Fusing Multi-Modal Sparse Representations for Multi-Sensor 3D Object Detection
ICCV 2023
Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language
ICLR 2023
OpenMask3D: Open-Vocabulary 3D Instance Segmentation
NIPS 2023
DDF-HO: Hand-Held Object Reconstruction via Conditional Directed Distance Field
NIPS 2023
CommonScenes: Generating Commonsense 3D Indoor Scenes with Scene Graph Diffusion
NIPS 2023
Shape, Pose, and Appearance From a Single Image via Bootstrapped Radiance Field Inversion
CVPR 2023
Incremental 3D Semantic Scene Graph Prediction From RGB Sequences
CVPR 2023
Robust Monocular Depth Estimation under Challenging Conditions
ICCV 2023
SPARF: Neural Radiance Fields From Sparse and Noisy Poses
CVPR 2023
I2MVFormer: Large Language Model Generated Multi-View Document Supervision for Zero-Shot Image Classification
CVPR 2023
IPCC-TP: Utilizing Incremental Pearson Correlation Coefficient for Joint Multi-Agent Trajectory Prediction
CVPR 2023
Segmenting Known Objects and Unseen Unknowns without Prior Knowledge
ICCV 2023
Introducing Language Guidance in Prompt-based Continual Learning
ICCV 2023
Bending Graphs: Hierarchical Shape Matching Using Gated Optimal Transport
CVPR 2022
3D-VField: Adversarial Augmentation of Point Clouds for Domain Generalization in 3D Object Detection
CVPR 2022
RBP-Pose: Residual Bounding Box Projection for Category-Level Pose Estimation
ECCV 2022
E-Graph: Minimal Solution for Rigid Rotation with Extensibility Graphs
ECCV 2022
Implicit Neural Representations for Image Compression
ECCV 2022
3D Compositional Zero-Shot Learning with DeCompositional Consensus
ECCV 2022
GOCA: Guided Online Cluster Assignment for Self-Supervised Video Representation Learning
ECCV 2022
GPV-Pose: Category-Level Object Pose Estimation via Geometry-Guided Point-Wise Voting
CVPR 2022
Learning Local Displacements for Point Cloud Completion
CVPR 2022
On the Practicality of Deterministic Epistemic Uncertainty
ICML 2022
SHIFT: A Synthetic Driving Dataset for Continuous Multi-Task Domain Adaptation
CVPR 2022
I2DFormer: Learning Image to Document Attention for Zero-Shot Image Classification
NIPS 2022
ZebraPose: Coarse To Fine Surface Encoding for 6DoF Object Pose Estimation
CVPR 2022
GDR-Net: Geometry-Guided Direct Regression Network for Monocular 6D Object Pose Estimation
CVPR 2021
Learning Graph Embeddings for Compositional Zero-Shot Learning
CVPR 2021
Variational Transformer Networks for Layout Generation
CVPR 2021
SO-Pose: Exploiting Self-Occlusion for Direct 6D Pose Estimation
ICCV 2021
Graph-to-3D: End-to-End Generation and Manipulation of 3D Scenes Using Scene Graphs
ICCV 2021
Unconditional Scene Graph Generation
ICCV 2021
DB-GAN: Boosting Object Recognition Under Strong Lighting Conditions
WACV 2021
SceneGraphFusion: Incremental 3D Scene Graph Prediction From RGB-D Sequences
CVPR 2021
Deep Positional and Relational Feature Learning for Rotation-Invariant Point Cloud Analysis
ECCV 2020
Quaternion Equivariant Capsule Networks for 3D Point Clouds
ECCV 2020
Self6D: Self-Supervised Monocular 6D Object Pose Estimation
ECCV 2020
SoftPoolNet: Shape Descriptor for Point Cloud Completion and Classification
ECCV 2020
Beyond Controlled Environments: 3D Camera Re-Localization in Changing Indoor Scenes
ECCV 2020
Restricting the Flow: Information Bottlenecks for Attribution
ICLR 2020
Semantic Image Manipulation Using Scene Graphs
CVPR 2020
Learning 3D Semantic Scene Graphs From 3D Indoor Reconstructions
CVPR 2020
Explaining the Ambiguity of Object Detection and 6D Pose From Visual Data
ICCV 2019
Object-Driven Multi-Layer Scene Decomposition From a Single Image
ICCV 2019
Sampling-Free Epistemic Uncertainty Estimation Using Approximated Variance Propagation
ICCV 2019
GFrames: Gradient-Based Local Reference Frame for 3D Shape Matching
CVPR 2019
3D Point Capsule Networks
CVPR 2019
Query-Guided End-To-End Person Search
CVPR 2019
ForkNet: Multi-Branch Volumetric Semantic Completion From a Single Depth Image
ICCV 2019
RIO: 3D Object Instance Re-Localization in Changing Indoor Environments
ICCV 2019
Human Motion Analysis with Deep Metric Learning
ECCV 2018
Distortion-Aware Convolutional Filters for Dense Prediction in Panoramic Images
ECCV 2018
Deep Model-Based 6D Pose Refinement in RGB
ECCV 2018
Guide Me: Interacting With Deep Networks
CVPR 2018
Fully-Convolutional Point Networks for Large-Scale Point Clouds
ECCV 2018
BOP: Benchmark for 6D Object Pose Estimation
ECCV 2018
Real-Time 3D Model Tracking in Color and Depth on a Single CPU Core
CVPR 2017
SSD-6D: Making RGB-Based 3D Detection and 6D Pose Estimation Great Again
ICCV 2017
Long Short-Term Memory Kalman Filters: Recurrent Neural Estimators for Pose Regularization
ICCV 2017
Learning in an Uncertain World: Representing Ambiguity Through Multiple Hypotheses
ICCV 2017
CNN-SLAM: Real-Time Dense Monocular SLAM With Learned Depth Prediction
CVPR 2017
Learning a Descriptor-Specific 3D Keypoint Detector
ICCV 2015
A Versatile Learning-Based 3D Temporal Tracker: Scalable, Robust, Online
ICCV 2015
BOLD Features to Detect Texture-less Objects
ICCV 2013