Botian Shi

30 papers · 2019–2026 · 10 top CS/AI conferences

Achievements

🏃 Academic Marathon (6) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (10) 🐝 Cross-Pollinator (10) 🌈 Renaissance Researcher (8) 🤝 Dynamic Duo (14) 👥 Mega-Team (38) 🧬 Topic Evolution ⚡ Prolific Year (6) 💎 Century Club (29) 🗃️ Keyword Collector (125) 🔥 Unstoppable (7)

Conferences

CVPR (6) NIPS (5) ICCV (4) ICLR (4) AAAI (3) ACL (3) ECCV (2) EMNLP (1) IJCAI (1) IJCNLP (1)

Top co-authors

Bo Zhang (14) Yu Qiao (13) Jiakang Yuan (9) Tao Chen (9) Yikang LI (8) Xiangchao Yan (7) Pinlong Cai (6) Liang He (6) Xin Li (6) Min Dou (6)

Research topics

Robotics (1)

Keywords

autonomous driving (7) point cloud (6) 3d object detection (6) 3d vision (3) retrieval-augmented generation (3) domain adaptation (2) video understanding (2) document understanding (2) multimodal learning (2) semi-supervised learning (1) object detection (1) scene understanding (1) video generation (1) video captioning (1) benchmark evaluation (1) representation learning (1) question answering (1) domain generalization (1) zero-shot learning (1) knowledge distillation (1)

Papers

LeanRAG: Knowledge-Graph-Based Generation with Semantic Aggregation and Hierarchical Retrieval · AAAI 2026
Image Over Text: Transforming Formula Recognition Evaluation with Character Detection Matching · CVPR 2025
Dolphin: Moving Towards Closed-loop Auto-research through Thinking, Practice, and Feedback · ACL 2025
Docopilot: Improving Multimodal Models for Document-Level Understanding · CVPR 2025
OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations · CVPR 2025
Chimera: Improving Generalist Model with Domain-Specific Experts · ICCV 2025
DriveArena: A Closed-loop Generative Simulation Platform for Autonomous Driving · ICCV 2025
Aligning Vision to Language: Annotation-Free Multimodal Knowledge Graph Construction for Enhanced LLMs Reasoning · ICCV 2025
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text · ICLR 2025
GeoX: Geometric Problem Solving Through Unified Formalized Vision-Language Pre-training · ICLR 2025
Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous Driving · NIPS 2024
DiLu: A Knowledge-Driven Approach to Autonomous Driving with Large Language Models · ICLR 2024
ReSimAD: Zero-Shot 3D Domain Transfer for Autonomous Driving with Source Reconstruction and Target Simulation · ICLR 2024
Better Regression Makes Better Test-time Adaptive 3D Object Detection · ECCV 2024
Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy · NIPS 2024
ZOPP: A Framework of Zero-shot Offboard Panoptic Perception for Autonomous Driving · NIPS 2024
AD-PT: Autonomous Driving Pre-Training with Large-scale Point Cloud Dataset · NIPS 2023
RangePerception: Taming LiDAR Range View for Efficient and Accurate 3D Object Detection · NIPS 2023
LWSIS: LiDAR-Guided Weakly Supervised Instance Segmentation for Autonomous Driving · AAAI 2023
Uni3D: A Unified Baseline for Multi-Dataset 3D Object Detection · CVPR 2023
Bi3D: Bi-Domain Active Learning for Cross-Domain 3D Object Detection · CVPR 2023
DetZero: Rethinking Offboard 3D Object Detection with Long-term Sequential Point Clouds · ICCV 2023
LoGoNet: Towards Accurate 3D Object Detection With Local-to-Global Cross-Modal Fusion · CVPR 2023
Homogeneous Multi-modal Feature Fusion and Interaction for 3D Object Detection · ECCV 2022
Hashing based Efficient Inference for Image-Text Matching · IJCNLP 2021
Hashing based Efficient Inference for Image-Text Matching · ACL 2021
A Benchmark for Structured Procedural Knowledge Extraction from Cooking Videos · EMNLP 2020
Functionality Discovery and Prediction of Physical Objects · AAAI 2020
Dense Procedure Captioning in Narrated Instructional Videos · ACL 2019
Knowledge Aware Semantic Concept Expansion for Image-Text Matching · IJCAI 2019