Bohan Li

30 papers · 2019–2026 · 15 conferences across top CS/AI venues

Achievements

🏃 Academic Marathon (6) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (14) 🐝 Cross-Pollinator (5)

🧬 Topic Evolution 🗃️ Keyword Collector (119) ⚡ Prolific Year (6) 💎 Century Club (28) 🔥 Unstoppable (7) ❓ The Questioner

Conferences

AAAI (4) ECCV (4) COLING (3) ICCV (3) INTERSPEECH (3) EMNLP (2) ICLR (2) IJCAI (2) ACL (1) CORL (1) CVPR (1) EACL (1) IJCNLP (1) MICCAI (1) NIPS (1)

Top co-authors

Xin Jin (7) Wenjun Zeng (7) Wanxiang Che (5) Xu Tan (4) Sheng Zhao (4) Junxian He (3) Xiao Xu (3) Yutai Hou (3) Tie-yan Liu (3) Yunlong Feng (3)

Keywords

diffusion model (3) few-shot learning (2) speech synthesis (2) evidence lower bound (2) self-supervised learning (2) 3d vision (2) language model (2) variational autoencoder (2) scene generation (2) semantic scene completion (2) posterior collapse (2) representation learning (2) latent representation (2) prompt engineering (2) data augmentation (1) machine translation (1) transfer learning (1) attention mechanism (1) image generation (1) video generation (1)

Papers

BSCodec: A Band-Split Neural Codec for High-Quality Universal Audio Reconstruction · EACL 2026
AHAMask: Reliable Task Specification for Large Audio Language Models Without Instructions · AAAI 2026
Hybrid-grained Feature Aggregation with Coarse-to-fine Language Guidance for Self-supervised Monocular Depth Estimation · ICCV 2025
Multi-Modal Progressive Fusion for ASD Screening Using Smartphone Video · MICCAI 2025
DiST-4D: Disentangled Spatiotemporal Diffusion with Metric Depth for 4D Driving Scene Generation · ICCV 2025
Can Large Language Models Understand You Better? An MBTI Personality Detection Dataset Aligned with Population Traits · COLING 2025
One View, Many Worlds: Single-Image to 3D object Meets Generative Domain Randomization for One-Shot 6D Pose Estimation · CORL 2025
UniScene: Unified Occupancy-centric Driving Scene Generation · CVPR 2025
TAPTRv2: Attention-based Position Update Improves Tracking Any Point · NIPS 2024
Semantic-Guided Generative Image Augmentation Method with Diffusion Models for Image Classification · AAAI 2024
One at a Time: Progressive Multi-Step Volumetric Probability Learning for Reliable 3D Scene Perception · AAAI 2024
A Two-Stage Framework with Self-Supervised Distillation for Cross-Domain Text Classification · COLING 2024
Hierarchical Temporal Context Learning for Camera-based Semantic Scene Completion · ECCV 2024
Closed-Loop Unsupervised Representation Disentanglement with $\beta$-VAE Distillation and Diffusion Probabilistic Feedback · ECCV 2024
Textual Grounding for Open-vocabulary Visual Information Extraction in Layout-diversified Documents · ECCV 2024
Bridging Stereo Geometry and BEV Representation with Reliable Mutual Interaction for Semantic Scene Completion · IJCAI 2024
On the Effectiveness of Acoustic BPE in Decoder-Only TTS · INTERSPEECH 2024
Revisit Finetuning strategy for Few-Shot Learning to Transfer the Emdeddings · ICLR 2023
NaviNeRF: NeRF-based 3D Representation Disentanglement by Latent Semantic Navigation · ICCV 2023
VideoDubber: Machine Translation with Speech-Aware Length Control for Video Dubbing · AAAI 2023
MetaPrompting: Learning to Learn Better Prompts · COLING 2022
Inverse is Better! Fast and Accurate Prompt for Few-shot Slot Tagging · ACL 2022
When Counting Meets HMER: Counting-Aware Network for Handwritten Mathematical Expression Recognition · ECCV 2022
AdaSpeech 4: Adaptive Text to Speech in Zero-Shot Scenarios · INTERSPEECH 2022
Adaptive Text to Speech for Spontaneous Style · INTERSPEECH 2021
AdaSpeech: Adaptive Text to Speech for Custom Voice · ICLR 2021
NuCDS: An Efficient Local Search Algorithm for Minimum Connected Dominating Set · IJCAI 2020
On the Sentence Embeddings from Pre-trained Language Models · EMNLP 2020
A Surprisingly Effective Fix for Deep Latent Variable Modeling of Text · IJCNLP 2019
A Surprisingly Effective Fix for Deep Latent Variable Modeling of Text · EMNLP 2019