Botian Shi
30 papers · 2019–2026 · 10 conferences · across top CS/AI conferences
Achievements
🏃 Academic Marathon (6)
🌉 Interdisciplinary Bridge
🧭 Keyword Pioneer
🌍 Conference Polyglot (10)
🐝 Cross-Pollinator (10)
🌈 Renaissance Researcher (8)
🤝 Dynamic Duo (14)
👥 Mega-Team (38)
🧬 Topic Evolution
⚡ Prolific Year (6)
💎 Century Club (29)
🗃️ Keyword Collector (125)
🔥 Unstoppable (7)
Conferences
CVPR (6)
NIPS (5)
ICCV (4)
ICLR (4)
AAAI (3)
ACL (3)
ECCV (2)
EMNLP (1)
IJCAI (1)
IJCNLP (1)
Research topics
Keywords
autonomous driving (7)
point cloud (6)
3d object detection (6)
3d vision (3)
retrieval-augmented generation (3)
domain adaptation (2)
video understanding (2)
document understanding (2)
multimodal learning (2)
semi-supervised learning (1)
object detection (1)
scene understanding (1)
video generation (1)
video captioning (1)
benchmark evaluation (1)
representation learning (1)
question answering (1)
domain generalization (1)
zero-shot learning (1)
knowledge distillation (1)
Papers
LeanRAG: Knowledge-Graph-Based Generation with Semantic Aggregation and Hierarchical Retrieval
AAAI 2026
Image Over Text: Transforming Formula Recognition Evaluation with Character Detection Matching
CVPR 2025
Dolphin: Moving Towards Closed-loop Auto-research through Thinking, Practice, and Feedback
ACL 2025
Docopilot: Improving Multimodal Models for Document-Level Understanding
CVPR 2025
OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations
CVPR 2025
Chimera: Improving Generalist Model with Domain-Specific Experts
ICCV 2025
DriveArena: A Closed-loop Generative Simulation Platform for Autonomous Driving
ICCV 2025
Aligning Vision to Language: Annotation-Free Multimodal Knowledge Graph Construction for Enhanced LLMs Reasoning
ICCV 2025
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text
ICLR 2025
GeoX: Geometric Problem Solving Through Unified Formalized Vision-Language Pre-training
ICLR 2025
Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous Driving
NIPS 2024
DiLu: A Knowledge-Driven Approach to Autonomous Driving with Large Language Models
ICLR 2024
ReSimAD: Zero-Shot 3D Domain Transfer for Autonomous Driving with Source Reconstruction and Target Simulation
ICLR 2024
Better Regression Makes Better Test-time Adaptive 3D Object Detection
ECCV 2024
Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy
NIPS 2024
ZOPP: A Framework of Zero-shot Offboard Panoptic Perception for Autonomous Driving
NIPS 2024
AD-PT: Autonomous Driving Pre-Training with Large-scale Point Cloud Dataset
NIPS 2023
RangePerception: Taming LiDAR Range View for Efficient and Accurate 3D Object Detection
NIPS 2023
LWSIS: LiDAR-Guided Weakly Supervised Instance Segmentation for Autonomous Driving
AAAI 2023
Uni3D: A Unified Baseline for Multi-Dataset 3D Object Detection
CVPR 2023
Bi3D: Bi-Domain Active Learning for Cross-Domain 3D Object Detection
CVPR 2023
DetZero: Rethinking Offboard 3D Object Detection with Long-term Sequential Point Clouds
ICCV 2023
LoGoNet: Towards Accurate 3D Object Detection With Local-to-Global Cross-Modal Fusion
CVPR 2023
Homogeneous Multi-modal Feature Fusion and Interaction for 3D Object Detection
ECCV 2022
Hashing based Efficient Inference for Image-Text Matching
IJCNLP 2021
Hashing based Efficient Inference for Image-Text Matching
ACL 2021
A Benchmark for Structured Procedural Knowledge Extraction from Cooking Videos
EMNLP 2020
Functionality Discovery and Prediction of Physical Objects
AAAI 2020
Dense Procedure Captioning in Narrated Instructional Videos
ACL 2019
Knowledge Aware Semantic Concept Expansion for Image-Text Matching
IJCAI 2019