Bo Dai
191 papers · 2010–2026 · 15 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+18 more ↓ Show less ↑
πΊοΈ Taxonomy Completionist (24) π§ Keyword Pioneer π Interdisciplinary Bridge π Renaissance Researcher (6) π£ Hot Topic Early Bird
π
Renaissance Researcher
(6)
π
Interdisciplinary Bridge
πΊοΈ
Taxonomy Completionist
(24)
π
Conference Loyalist
(48)
π€
Dynamic Duo
(39)
π
Triple Crown
π
Grand Slam
π±
Topic Pioneer
π¬
Deep Specialist
(23)
π§¬
Topic Evolution
π
Keyword Champion
(2)
π₯
Unstoppable
(11)
β
The Questioner
π
Century Club
(190)
ποΈ
Keyword Collector
(81)
π
Trend Setter
π
Conference Pioneer
β‘
Prolific Year
(30)
Conferences
NIPS (48)
CVPR (35)
ICML (25)
ICLR (19)
ICCV (17)
ECCV (16)
AISTATS (14)
EMNLP (5)
UAI (3)
AAAI (2)
IJCAI (2)
WACV (2)
ACML (1)
JMLR (1)
L4DC (1)
Top co-authors
Research topics
Keywords
generative adversarial network
(12)
diffusion model
(12)
representation learning
(10)
neural network
(10)
neural rendering
(9)
energy-based model
(8)
generative model
(8)
neural radiance field
(7)
reinforcement learning
(7)
3d reconstruction
(6)
image generation
(6)
variational inference
(6)
3d gaussian splatting
(6)
sample complexity
(6)
action recognition
(5)
graph neural network
(5)
off-policy evaluation
(5)
image synthesis
(5)
novel view synthesis
(5)
unsupervised learning
(4)
Papers
PSDiffusion: Harmonized Multi-Layer Image Generation via Layout and Appearance Alignment
WACV 2026
Reasoning with Exploration: An Entropy Perspective
AAAI 2026
Primal-Dual Spectral Representation for Off-policy Evaluation
AISTATS 2025
Faster WIND: Accelerating Iterative Best-of-$N$ Distillation for LLM Alignment
AISTATS 2025
Scalable spectral representations for multiagent reinforcement learning in network MDPs
AISTATS 2025
Spectral Representation for Causal Estimation with Hidden Confounders
AISTATS 2025
GaussianAnything: Interactive Point Cloud Flow Matching for 3D Generation
ICLR 2025
CameraCtrl: Enabling Camera Control for Video Diffusion Models
ICLR 2025
Inference-Aware Fine-Tuning for Best-of-N Sampling in Large Language Models
ICLR 2025
EC-DIT: Scaling Diffusion Transformers with Adaptive Expert-Choice Routing
ICLR 2025
Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF
ICLR 2025
ObjectGS: Object-aware Scene Reconstruction and Scene Understanding via Gaussian Splatting
ICCV 2025
GausSim: Foreseeing Reality by Gaussian Simulator for Elastic Objects
ICCV 2025
GAS: Generative Avatar Synthesis from a Single Image
ICCV 2025
Multi-identity Human Image Animation with Structural Video Diffusion
ICCV 2025
On Domain-Adaptive Post-Training for Multimodal Large Language Models
EMNLP 2025
EdgeTAM: On-Device Track Anything Model
CVPR 2025
DF$^2$: Distribution-Free Decision-Focused Learning
UAI 2025
Efficient Duple Perturbation Robustness in Low-rank MDPs
L4DC 2025
Incentivize without Bonus: Provably Efficient Model-based Online Multi-agent RL for Markov Games
ICML 2025
Keyframe-Guided Creative Video Inpainting
CVPR 2025
ScaMo: Exploring the Scaling Law in Autoregressive Motion Generation Model
CVPR 2025
TokenHSI: Unified Synthesis of Physical Human-Scene Interactions through Task Tokenization
CVPR 2025
DRiVE: Diffusion-based Rigging Empowers Generation of Versatile and Expressive Characters
CVPR 2025
Horizon-GS: Unified 3D Gaussian Splatting for Large-Scale Aerial-to-Ground Scenes
CVPR 2025
FlashGS: Efficient 3D Gaussian Splatting for Large-scale and High-resolution Rendering
CVPR 2025
Efficient Online Reinforcement Learning for Diffusion Policy
ICML 2025
Text to Layer-wise 3D Clothed Human Generation
ECCV 2024
MotionLCM: Real-time Controllable Motion Generation via Latent Consistency Model
ECCV 2024
LN3Diff: Scalable Latent Neural Fields Diffusion for Speedy 3D Generation
ECCV 2024
SemGrasp: Semantic Grasp Generation via Language Aligned Discretization
ECCV 2024
Scaffold-GS: Structured 3D Gaussians for View-Adaptive Rendering
CVPR 2024
Generalized Predictive Model for Autonomous Driving
CVPR 2024
Make-It-Vivid: Dressing Your Animatable Biped Cartoon Characters from Text
CVPR 2024
Task-Oriented Human-Object Interactions Generation With Implicit Neural Representations
WACV 2024
Point Cloud Pre-training with Diffusion Models
CVPR 2024
EpiDiff: Enhancing Multi-View Synthesis via Localized Epipolar-Constrained Diffusion
CVPR 2024
DiffMorpher: Unleashing the Capability of Diffusion Models for Image Morphing
CVPR 2024
HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation
NIPS 2024
PACER+: On-Demand Pedestrian Animation Controller in Driving Scenarios
CVPR 2024
Cinematic Behavior Transfer via NeRF-based Differentiable Filming
CVPR 2024
BerfScene: Bev-conditioned Equivariant Radiance Fields for Infinite 3D Scene Generation
CVPR 2024
BBox-Adapter: Lightweight Adapting for Black-Box Large Language Models
ICML 2024
Target Networks and Over-parameterization Stabilize Off-policy Bootstrapping with Function Approximation
ICML 2024
PhyRecon: Physically Plausible Neural Scene Reconstruction
NIPS 2024
UQE: A Query Engine for Unstructured Databases
NIPS 2024
Learning 3D Garment Animation from Trajectories of A Piece of Cloth
NIPS 2024
Small steps no more: Global convergence of stochastic gradient bandits for arbitrary learning rates
NIPS 2024
Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text
NIPS 2024
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
ICLR 2024
Unified Human-Scene Interaction via Prompted Chain-of-Contacts
ICLR 2024
Probabilistic Adaptation of Black-Box Text-to-Video Models
ICLR 2024
HYDRA: Model Factorization Framework for Black-Box LLM Personalization
NIPS 2024
InterControl: Zero-shot Human Interaction Generation by Controlling Every Joint
NIPS 2024
Diffusion Spectral Representation for Reinforcement Learning
NIPS 2024
GSDF: 3DGS Meets SDF for Improved Neural Rendering and Reconstruction
NIPS 2024
Provable Representation with Efficient Planning for Partially Observable Reinforcement Learning
ICML 2024
RoomTex: Texturing Compositional Indoor Scenes via Iterative Inpainting
ECCV 2024
DiffBIR: Toward Blind Image Restoration with Generative Diffusion Prior
ECCV 2024
SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models
ECCV 2024
Score-based Continuous-time Discrete Diffusion Models
ICLR 2023
Learning Modulated Transformation in GANs
NIPS 2023
RenderMe-360: A Large Digital Asset Library and Benchmarks Towards High-fidelity Head Avatars
NIPS 2023
Learning Universal Policies via Text-Guided Video Generation
NIPS 2023
Revisiting the Evaluation of Image Synthesis with GANs
NIPS 2023
Ordering-based Conditions for Global Convergence of Policy Gradient Methods
NIPS 2023
AdaPlanner: Adaptive Planning from Feedback with Language Models
NIPS 2023
Discrete Langevin Samplers via Wasserstein Gradient Flow
AISTATS 2023
Learning to Optimize with Stochastic Dominance Constraints
AISTATS 2023
Controllable Mesh Generation Through Sparse Latent Point Diffusion Models
CVPR 2023
Generative Diffusion Prior for Unified Image Restoration and Enhancement
CVPR 2023
Prototype-Based Embedding Network for Scene Graph Generation
CVPR 2023
Grid-Guided Neural Radiance Fields for Large Urban Scenes
CVPR 2023
Self-Supervised Geometry-Aware Encoder for Style-Based 3D GAN Inversion
CVPR 2023
On Task-personalized Multimodal Few-shot Learning for Visually-rich Document Entity Retrieval
EMNLP 2023
MatrixCity: A Large-scale City Dataset for City-scale Neural Rendering and Beyond
ICCV 2023
LinkGAN: Linking GAN Latents to Pixels for Controllable Image Synthesis
ICCV 2023
SynBody: Synthetic Dataset with Layered Human Models for 3D Human Perception and Modeling
ICCV 2023
DNA-Rendering: A Diverse Neural Actor Repository for High-Fidelity Human-Centric Rendering
ICCV 2023
3DHumanGAN: 3D-Aware Human Image Generation with 3D Pose Mapping
ICCV 2023
AssetField: Assets Mining and Reconfiguration in Ground Feature Plane Representation
ICCV 2023
Towards Multi-Layered 3D Garments Animation
ICCV 2023
OrthoPlanes: A Novel Representation for Better 3D-Awareness of GANs
ICCV 2023
X-VoE: Measuring eXplanatory Violation of Expectation in Physical Events
ICCV 2023
Any-scale Balanced Samplers for Discrete Space
ICLR 2023
Latent Variable Representation for Reinforcement Learning
ICLR 2023
Spectral Decomposition Representation for Reinforcement Learning
ICLR 2023
Stochastic Gradient Succeeds for Bandits
ICML 2023
HireVAE: An Online and Adaptive Factor Model Based on Hierarchical and Regime-Switch VAE
IJCAI 2023
Energy-based Predictive Representations for Partially Observed Reinforcement Learning
UAI 2023
Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation
CVPR 2022
Marginal Distribution Adaptation for Discrete Sets via Module-Oriented Divergence Minimization
ICML 2022
SMARTAVE: Structured Multimodal Transformer for Product Attribute Value Extraction
EMNLP 2022
A free lunch from the noise: Provable and practical exploration for representation learning
UAI 2022
Cross-Model Pseudo-Labeling for Semi-Supervised Action Recognition
CVPR 2022
Neural Stochastic Dual Dynamic Programming
ICLR 2022
Understanding and Leveraging Overparameterization in Recursive Value Estimation
ICLR 2022
Self-Adaptive Imitation Learning: Learning Tasks with Delayed Rewards from Sub-optimal Demonstrations
AAAI 2022
Oracle Inequalities for Model Selection in Offline Reinforcement Learning
NIPS 2022
TransEditor: Transformer-Based Dual-Space GAN for Highly Controllable Facial Editing
CVPR 2022
Revisiting Skeleton-Based Action Recognition
CVPR 2022
Towards Diverse and Natural Scene-Aware 3D Human Motion Synthesis
CVPR 2022
Monocular 3D Object Reconstruction with GAN Inversion
ECCV 2022
DeciWatch: A Simple Baseline for 10Γ Efficient 2D and 3D Pose Estimation
ECCV 2022
BRACE: The Breakdancing Competition Dataset for Dance Motion Synthesis
ECCV 2022
Transformer with Implicit Edges for Particle-Based Physics Simulation
ECCV 2022
Extract Free Dense Labels from CLIP
ECCV 2022
BungeeNeRF: Progressive Neural Radiance Field for Extreme Multi-Scale Scene Rendering
ECCV 2022
The Role of Baselines in Policy Gradient Optimization
NIPS 2022
Improving GANs with A Dynamic Discriminator
NIPS 2022
On the Global Convergence Rates of Decentralized Softmax Gradient Play in Markov Potential Games
NIPS 2022
Offline Policy Selection under Uncertainty
AISTATS 2022
The Curse of Passive Data Collection in Batch Reinforcement Learning
AISTATS 2022
Making Linear MDPs Practical via Contrastive Representation Learning
ICML 2022
Model Selection in Batch Policy Optimization
ICML 2022
A Shading-Guided Generative Implicit Model for Shape-Accurate 3D-Aware Image Synthesis
NIPS 2021
Combiner: Full Attention Transformer with Sparse Computation Cost
NIPS 2021
Deceive D: Adaptive Pseudo Augmentation for GAN Training with Limited Data
NIPS 2021
Generative Occupancy Fields for 3D Surface-Aware Image Synthesis
NIPS 2021
Learning to Defend by Learning to Attack
AISTATS 2021
Towards Automatic Evaluation of Dialog Systems: A Model-Free Off-Policy Evaluation Approach
EMNLP 2021
Understanding the Effect of Stochasticity in Policy Optimization
NIPS 2021
Focal Frequency Loss for Image Reconstruction and Synthesis
ICCV 2021
BlockPlanner: City Block Generation With Vectorized Graph Representation
ICCV 2021
Nearly Horizon-Free Offline Reinforcement Learning
NIPS 2021
Towards understanding retrosynthesis by energy-based models
NIPS 2021
Do 2D GANs Know 3D Shape? Unsupervised 3D Shape Reconstruction from 2D Image GANs
ICLR 2021
Overcoming Catastrophic Forgetting by Bayesian Generative Regularization
ICML 2021
Leveraging Non-uniformity in First-order Non-convex Optimization
ICML 2021
Visually Informed Binaural Audio Generation without Binaural Audios
CVPR 2021
Scene-Aware Generative Network for Human Motion Synthesis
CVPR 2021
Unsupervised 3D Shape Completion Through GAN Inversion
CVPR 2021
LEGO: Latent Execution-Guided Reasoning for Multi-Hop Question Answering on Knowledge Graphs
ICML 2021
On the Optimality of Batch Policy Optimization Algorithms
ICML 2021
Self-Supervised Scene De-Occlusion
CVPR 2020
Energy-Based Processes for Exchangeable Data
ICML 2020
Batch Stationary Distribution Estimation
ICML 2020
Differentiable Top-k with Optimal Transport
NIPS 2020
Off-Policy Imitation Learning from Observations
NIPS 2020
Learning Discrete Energy-based Models via Auxiliary-variable Local Exploration
NIPS 2020
CoinDICE: Off-Policy Confidence Interval Estimation
NIPS 2020
Provably Efficient Neural Estimation of Structural Equation Models: An Adversarial Approach
NIPS 2020
Off-Policy Evaluation via the Regularized Lagrangian
NIPS 2020
FineGym: A Hierarchical Video Dataset for Fine-Grained Action Understanding
CVPR 2020
Temporal Pyramid Network for Action Recognition
CVPR 2020
Intra- and Inter-Action Understanding via Temporal Action Parsing
CVPR 2020
Learning to Plan in High Dimensions via Neural Exploration-Exploitation Trees
ICLR 2020
Scalable Deep Generative Modeling for Sparse Graphs
ICML 2020
Exploiting Deep Generative Prior for Versatile Image Restoration and Manipulation
ECCV 2020
Named Entity Recognition for Social Media Texts with Semantic Augmentation
EMNLP 2020
Escaping the Gravitational Pull of Softmax
NIPS 2020
Feature Intertwiner for Object Detection
ICLR 2019
Exponential Family Estimation via Adversarial Dynamics Embedding
NIPS 2019
Meta Architecture Search
NIPS 2019
Retrosynthesis Prediction with Conditional Graph Logic Network
NIPS 2019
Recursive Visual Sound Separation Using Minus-Plus Net
ICCV 2019
Energy-Inspired Models: Learning with Sampler-Induced Distributions
NIPS 2019
Kernel Exponential Family Estimation via Doubly Dual Embedding
AISTATS 2019
DualDICE: Behavior-Agnostic Estimation of Discounted Stationary Distribution Corrections
NIPS 2019
Move Forward and Tell: A Progressive Generator of Video Descriptions
ECCV 2018
Syntax-Directed Variational Autoencoder for Structured Data
ICLR 2018
Boosting the Actor with Dual Critic
ICLR 2018
Cooperative neural networks (CoNN): Exploiting prior independence structure for improved classification
NIPS 2018
Learning Steady-States of Iterative Algorithms over Graphs
ICML 2018
SBEED: Convergent Reinforcement Learning with Nonlinear Function Approximation
ICML 2018
Towards Black-box Iterative Machine Teaching
ICML 2018
Decoupled Networks
CVPR 2018
Rethinking the Form of Latent States in Image Captioning
ECCV 2018
Multi-scale Nystrom Method
AISTATS 2018
Predictive Approximate Bayesian Computation via Saddle Points
NIPS 2018
A Neural Compositional Paradigm for Image Captioning
NIPS 2018
Coupled Variational Bayes via Optimization Embedding
NIPS 2018
Structured Inference for Recurrent Hidden Semi-markov Model
IJCAI 2018
Learning towards Minimum Hyperspherical Energy
NIPS 2018
Detecting Visual Relationships With Deep Relational Networks
CVPR 2017
Deep Hyperspherical Learning
NIPS 2017
Contrastive Learning for Image Captioning
NIPS 2017
Towards Diverse and Natural Image Descriptions via a Conditional GAN
ICCV 2017
Learning from Conditional Distributions via Dual Embeddings
AISTATS 2017
Iterative Machine Teaching
ICML 2017
Stochastic Generative Hashing
ICML 2017
Discriminative Embeddings of Latent Variable Models for Structured Data
ICML 2016
Provable Bayesian Inference via Particle Mirror Descent
AISTATS 2016
Nonparametric Estimation of Multi-View Latent Variable Models
ICML 2014
Scalable Kernel Methods via Doubly Stochastic Gradients
NIPS 2014
Transductive Learning with Multi-class Volume Approximation
ICML 2014
Squared-loss Mutual Information Regularization: A Novel Information-theoretic Approach to Semi-supervised Learning
ICML 2013
Robust Low Rank Kernel Embeddings of Multivariate Distributions
NIPS 2013
Maximum Volume Clustering: A New Discriminative Clustering Approach
JMLR 2013
Maximum Volume Clustering
AISTATS 2011
Minimum Conditional Entropy Clustering: A Discriminative Framework for Clustering
ACML 2010