Roozbeh Mottaghi
56 papers · 2013–2025 · 11 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+15 more ↓ Show less ↑
🌍 Conference Polyglot (11) 🏃 Academic Marathon (12) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🐝 Cross-Pollinator (11)
🐝
Cross-Pollinator
(11)
🧭
Keyword Pioneer
🏃
Academic Marathon
(12)
🏠
Conference Loyalist
(23)
🤝
Dynamic Duo
(22)
👥
Mega-Team
(23)
🔬
Deep Specialist
(13)
🧬
Topic Evolution
🏆
Keyword Champion
(8)
❓
The Questioner
(3)
🗃️
Keyword Collector
(193)
🔥
Unstoppable
(13)
⚡
Prolific Year
(7)
💎
Century Club
(56)
🚀
Conference Pioneer
Conferences
CVPR (23)
ICLR (8)
ICCV (7)
ECCV (6)
NIPS (6)
AAAI (1)
ACL (1)
CORL (1)
IJCNLP (1)
JMLR (1)
RSS (1)
Top co-authors
Research topics
Keywords
embodied ai
(9)
scene understanding
(8)
visual navigation
(8)
reinforcement learning
(7)
object detection
(6)
neural network
(6)
semantic segmentation
(5)
self-supervised learning
(4)
zero-shot learning
(4)
agent navigation
(3)
mobile manipulation
(3)
embodied navigation
(3)
motion planning
(3)
embodied agent
(3)
robot navigation
(3)
representation learning
(3)
agent system
(3)
transfer learning
(3)
robotic manipulation
(2)
imitation learning
(2)
Papers
PARTNR: A Benchmark for Planning and Reasoning in Embodied Multi-agent Tasks
ICLR 2025
Controllable Human-Object Interaction Synthesis
ECCV 2024
GOAT-Bench: A Benchmark for Multi-Modal Lifelong Navigation
CVPR 2024
Track2Act: Predicting Point Tracks from Internet Videos enables Generalizable Robot Manipulation
ECCV 2024
From an Image to a Scene: Learning to Imagine the World from a Million 360° Videos
NIPS 2024
GOAT: GO to Any Thing
RSS 2024
Habitat 3.0: A Co-Habitat for Humans, Avatars, and Robots
ICLR 2024
Situated Instruction Following
ECCV 2024
Galactic: Scaling End-to-End Reinforcement Learning for Rearrangement at 100k Steps-per-Second
CVPR 2023
Navigating to Objects Specified by Images
ICCV 2023
UNIFIED-IO: A Unified Model for Vision, Language, and Multi-modal Tasks
ICLR 2023
Moving Forward by Moving Backward: Embedding Action Impact over Action Semantics
ICLR 2023
Neural Priming for Sample-Efficient Adaptation
NIPS 2023
Neural Radiance Field Codebooks
ICLR 2023
HomeRobot: Open-Vocabulary Mobile Manipulation
CORL 2023
ENTL: Embodied Navigation Trajectory Learner
ICCV 2023
Simple but Effective: CLIP Embeddings for Embodied AI
CVPR 2022
🏘️ ProcTHOR: Large-Scale Embodied AI Using Procedural Generation
NIPS 2022
Ask4Help: Learning to Leverage an Expert for Embodied Tasks
NIPS 2022
Multi-Modal Answer Validation for Knowledge-Based VQA
AAAI 2022
What Do Navigation Agents Learn About Their Environment?
CVPR 2022
Interactron: Embodied Adaptive Object Detection
CVPR 2022
Continuous Scene Representations for Embodied AI
CVPR 2022
A-OKVQA: A Benchmark for Visual Question Answering Using World Knowledge
ECCV 2022
Object Manipulation via Visual Target Localization
ECCV 2022
Visual Room Rearrangement
CVPR 2021
PIGLeT: Language Grounding Through Neuro-Symbolic Interaction in a 3D World
ACL 2021
PIGLeT: Language Grounding Through Neuro-Symbolic Interaction in a 3D World
IJCNLP 2021
Container: Context Aggregation Networks
NIPS 2021
Contrasting Contrastive Self-Supervised Representation Learning Pipelines
ICCV 2021
RobustNav: Towards Benchmarking Robustness in Embodied Navigation
ICCV 2021
Factorizing Perception and Policy for Interactive Instruction Following
ICCV 2021
What Can You Learn From Your Muscles? Learning Visual Representation from Human Interactions
ICLR 2021
Learning Generalizable Visual Representations via Interactive Gameplay
ICLR 2021
ManipulaTHOR: A Framework for Visual Object Manipulation
CVPR 2021
Pushing It Out of the Way: Interactive Visual Navigation
CVPR 2021
ALFRED: A Benchmark for Interpreting Grounded Instructions for Everyday Tasks
CVPR 2020
VisualCOMET: Reasoning about the Dynamic Context of a Still Image
ECCV 2020
Visual Reaction: Learning to Play Catch With Your Drone
CVPR 2020
Learning About Objects by Learning to Interact with Them
NIPS 2020
RoboTHOR: An Open Simulation-to-Real Embodied AI Platform
CVPR 2020
OK-VQA: A Visual Question Answering Benchmark Requiring External Knowledge
CVPR 2019
Learning to Learn How to Learn: Self-Adaptive Visual Navigation Using Meta-Learning
CVPR 2019
Visual Semantic Navigation using Scene Priors
ICLR 2019
Who Let the Dogs Out? Modeling Dog Behavior From Visual Data
CVPR 2018
SeGAN: Segmenting and Generating the Invisible
CVPR 2018
Visual Semantic Planning Using Deep Successor Representations
ICCV 2017
See the Glass Half Full: Reasoning About Liquid Containers, Their Volume and Content
ICCV 2017
A Task-Oriented Approach for Cost-Sensitive Recognition
CVPR 2016
Complexity of Representation and Inference in Compositional Models with Part Sharing
JMLR 2016
Newtonian Scene Understanding: Unfolding the Dynamics of Objects in Static Images
CVPR 2016
A Coarse-to-Fine Model for 3D Pose Estimation and Sub-Category Recognition
CVPR 2015
The Role of Context for Object Detection and Semantic Segmentation in the Wild
CVPR 2014
Detect What You Can: Detecting and Representing Objects using Holistic Models and Body Parts
CVPR 2014
Analyzing Semantic Segmentation Using Hybrid Human-Machine CRFs
CVPR 2013
Bottom-Up Segmentation for Top-Down Detection
CVPR 2013