Alexander Toshev
23 papers · 2013–2025 · 6 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+12 more ↓ Show less ↑
π Interdisciplinary Bridge π£ Hot Topic Early Bird π§ Keyword Pioneer π Conference Polyglot (6) π Academic Marathon (12)
π
Cross-Pollinator
(12)
π
Renaissance Researcher
(8)
πΊοΈ
Taxonomy Completionist
(45)
π
Keyword Trendsetter Combo
(3)
π±
Topic Pioneer
π
Keyword Champion
π§¬
Topic Evolution
π₯
Mega-Team
(60)
π₯
Unstoppable
(8)
ποΈ
Keyword Collector
(112)
π
Trend Setter
π
Century Club
(23)
Conferences
CVPR (10)
ICCV (4)
NIPS (4)
CORL (2)
ECCV (2)
EMNLP (1)
Top co-authors
Keywords
deep neural network
(4)
reinforcement learning
(3)
deep learning
(3)
contrastive learning
(3)
object detection
(2)
semantic segmentation
(2)
long-horizon task
(2)
convolutional neural network
(2)
image captioning
(2)
embodied agent
(2)
visual grounding
(2)
multimodal large language model
(2)
natural language generation
(1)
attention mechanism
(1)
video generation
(1)
image generation
(1)
sparse representation
(1)
human pose estimation
(1)
image retrieval
(1)
zero-shot learning
(1)
Papers
From Multimodal LLMs to Generalist Embodied Agents: Methods and Lessons
CVPR 2025
UINavBench: A Framework for Comprehensive Evaluation of Interactive Digital Agents
ICCV 2025
Multimodal Autoregressive Pre-training of Large Vision Encoders
CVPR 2025
World-consistent Video Diffusion with Explicit 3D Modeling
CVPR 2025
Grounding Multimodal Large Language Models in Actions
NIPS 2024
"MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"
ECCV 2024
DataComp-LM: In search of the next generation of training sets for language models
NIPS 2024
STAIR: Learning Sparse Text and Image Representation in Grounded Tokens
EMNLP 2023
Perceptual Grouping in Contrastive Vision-Language Models
ICCV 2023
GAUDI: A Neural Architect for Immersive 3D Scene Generation
NIPS 2022
Modeling Long-horizon Tasks as Sequential Interaction Landscapes
CORL 2020
Adversarial Generative Grammars for Human Activity Prediction
ECCV 2020
Learning Object-conditioned Exploration using Distributed Soft Actor Critic
CORL 2020
Evolving Space-Time Neural Architectures for Videos
ICCV 2019
Scene Memory Transformer for Embodied Agents in Long-Horizon Tasks
CVPR 2019
Sim2Real Viewpoint Invariant Visual Servoing by Recurrent Control
CVPR 2018
No Fuss Distance Metric Learning Using Proxies
ICCV 2017
Towards Accurate Multi-Person Pose Estimation in the Wild
CVPR 2017
Generation and Comprehension of Unambiguous Object Descriptions
CVPR 2016
Show and Tell: A Neural Image Caption Generator
CVPR 2015
Scalable Object Detection using Deep Neural Networks
CVPR 2014
DeepPose: Human Pose Estimation via Deep Neural Networks
CVPR 2014
Deep Neural Networks for Object Detection
NIPS 2013