Bohan Li
30 papers · 2019–2026 · 15 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+8 more ↓ Show less ↑
🏃 Academic Marathon (6) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (14) 🐝 Cross-Pollinator (5)
🌍
Conference Polyglot
(14)
🏃
Academic Marathon
(6)
🧬
Topic Evolution
🗃️
Keyword Collector
(119)
⚡
Prolific Year
(6)
💎
Century Club
(28)
🔥
Unstoppable
(7)
❓
The Questioner
Conferences
AAAI (4)
ECCV (4)
COLING (3)
ICCV (3)
INTERSPEECH (3)
EMNLP (2)
ICLR (2)
IJCAI (2)
ACL (1)
CORL (1)
CVPR (1)
EACL (1)
IJCNLP (1)
MICCAI (1)
NIPS (1)
Top co-authors
Keywords
diffusion model
(3)
few-shot learning
(2)
speech synthesis
(2)
evidence lower bound
(2)
self-supervised learning
(2)
3d vision
(2)
language model
(2)
variational autoencoder
(2)
scene generation
(2)
semantic scene completion
(2)
posterior collapse
(2)
representation learning
(2)
latent representation
(2)
prompt engineering
(2)
data augmentation
(1)
machine translation
(1)
transfer learning
(1)
attention mechanism
(1)
image generation
(1)
video generation
(1)
Papers
BSCodec: A Band-Split Neural Codec for High-Quality Universal Audio Reconstruction
EACL 2026
AHAMask: Reliable Task Specification for Large Audio Language Models Without Instructions
AAAI 2026
Hybrid-grained Feature Aggregation with Coarse-to-fine Language Guidance for Self-supervised Monocular Depth Estimation
ICCV 2025
Multi-Modal Progressive Fusion for ASD Screening Using Smartphone Video
MICCAI 2025
DiST-4D: Disentangled Spatiotemporal Diffusion with Metric Depth for 4D Driving Scene Generation
ICCV 2025
Can Large Language Models Understand You Better? An MBTI Personality Detection Dataset Aligned with Population Traits
COLING 2025
One View, Many Worlds: Single-Image to 3D object Meets Generative Domain Randomization for One-Shot 6D Pose Estimation
CORL 2025
UniScene: Unified Occupancy-centric Driving Scene Generation
CVPR 2025
TAPTRv2: Attention-based Position Update Improves Tracking Any Point
NIPS 2024
Semantic-Guided Generative Image Augmentation Method with Diffusion Models for Image Classification
AAAI 2024
One at a Time: Progressive Multi-Step Volumetric Probability Learning for Reliable 3D Scene Perception
AAAI 2024
A Two-Stage Framework with Self-Supervised Distillation for Cross-Domain Text Classification
COLING 2024
Hierarchical Temporal Context Learning for Camera-based Semantic Scene Completion
ECCV 2024
Closed-Loop Unsupervised Representation Disentanglement with $\\beta$-VAE Distillation and Diffusion Probabilistic Feedback
ECCV 2024
Textual Grounding for Open-vocabulary Visual Information Extraction in Layout-diversified Documents
ECCV 2024
Bridging Stereo Geometry and BEV Representation with Reliable Mutual Interaction for Semantic Scene Completion
IJCAI 2024
On the Effectiveness of Acoustic BPE in Decoder-Only TTS
INTERSPEECH 2024
Revisit Finetuning strategy for Few-Shot Learning to Transfer the Emdeddings
ICLR 2023
NaviNeRF: NeRF-based 3D Representation Disentanglement by Latent Semantic Navigation
ICCV 2023
VideoDubber: Machine Translation with Speech-Aware Length Control for Video Dubbing
AAAI 2023
MetaPrompting: Learning to Learn Better Prompts
COLING 2022
Inverse is Better! Fast and Accurate Prompt for Few-shot Slot Tagging
ACL 2022
When Counting Meets HMER: Counting-Aware Network for Handwritten Mathematical Expression Recognition
ECCV 2022
AdaSpeech 4: Adaptive Text to Speech in Zero-Shot Scenarios
INTERSPEECH 2022
Adaptive Text to Speech for Spontaneous Style
INTERSPEECH 2021
AdaSpeech: Adaptive Text to Speech for Custom Voice
ICLR 2021
NuCDS: An Efficient Local Search Algorithm for Minimum Connected Dominating Set
IJCAI 2020
On the Sentence Embeddings from Pre-trained Language Models
EMNLP 2020
A Surprisingly Effective Fix for Deep Latent Variable Modeling of Text
IJCNLP 2019
A Surprisingly Effective Fix for Deep Latent Variable Modeling of Text
EMNLP 2019