Bin Zhao
51 papers · 2011–2026 · 10 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+14 more ↓ Show less ↑
π Academic Marathon (14) π§ Keyword Pioneer π Interdisciplinary Bridge π Conference Polyglot (10) π Cross-Pollinator (11)
π
Cross-Pollinator
(11)
π
Renaissance Researcher
(11)
πΊοΈ
Taxonomy Completionist
(102)
π±
Topic Pioneer
π§¬
Topic Evolution
π
Keyword Champion
π€
Dynamic Duo
(30)
π¬
Deep Specialist
(10)
ποΈ
Keyword Collector
(276)
π
Century Club
(48)
π₯
Unstoppable
(9)
β‘
Prolific Year
(14)
π
Conference Pioneer
π
Trend Setter
Conferences
CVPR (15)
AAAI (7)
ICCV (7)
NIPS (6)
INTERSPEECH (4)
CORL (3)
IJCAI (3)
ECCV (2)
ICML (2)
RSS (2)
Top co-authors
Keywords
point cloud
(4)
object detection
(3)
image restoration
(3)
diffusion model
(3)
depth estimation
(3)
neural radiance field
(3)
video summarization
(2)
contrastive learning
(2)
recurrent neural network
(2)
3d reconstruction
(2)
object tracking
(2)
image categorization
(2)
semantic segmentation
(2)
convolutional neural network
(2)
robotic grasping
(2)
self-supervised learning
(2)
video captioning
(2)
event camera
(2)
cross-modal learning
(2)
parallel computing
(2)
Papers
FreeGaussian: Annotation-free Control of Articulated Objects via 3D Gaussian Splats with Flow Derivatives
AAAI 2026
MindSight: A Bio-Inspired Neural Architecture for Visual Restoration via Cortical Electrical Stimulation
AAAI 2026
CLUHCS:Dual-View Contrastive Learning Enabled Unsupervised Heterogeneous Community Search with Meta-Path Behavior Modeling
AAAI 2026
Think Small, Act Big: Primitive Prompt Learning for Lifelong Robot Manipulation
CVPR 2025
FastUMI: A Scalable and Hardware-Independent Universal Manipulation Interface with Dataset
CORL 2025
AerialVG: A Challenging Benchmark for Aerial Visual Grounding by Exploring Positional Relations
ICCV 2025
Efficient Diffusion as Low Light Enhancer
CVPR 2025
Learning 2D Invariant Affordance Knowledge for 3D Affordance Grounding
AAAI 2025
MoMa-Kitchen: A 100K+ Benchmark for Affordance-Grounded Last-Mile Navigation in Mobile Manipulation
ICCV 2025
SpatialVLA: Exploring Spatial Representations for Visual-Language-Action Models
RSS 2025
Open-Vocabulary Octree-Graph for 3D Scene Understanding
ICCV 2025
Implicit Event-RGBD Neural SLAM
CVPR 2024
Cyclic Learning for Binaural Audio Generation and Localization
CVPR 2024
GS-SLAM: Dense Visual SLAM with 3D Gaussian Splatting
CVPR 2024
HPL-ESS: Hybrid Pseudo-Labeling for Unsupervised Event-based Semantic Segmentation
CVPR 2024
Any2Point: Empowering Any-modality Transformers for Efficient 3D Understanding
ECCV 2024
KOI: Accelerating Online Imitation Learning via Hybrid Key-state Guidance
CORL 2024
Color Event Enhanced Single-Exposure HDR Imaging
AAAI 2024
X4D-SceneFormer: Enhanced Scene Understanding on 4D Point Cloud Videos through Cross-Modal Knowledge Transfer
AAAI 2024
Point-PEFT: Parameter-Efficient Fine-Tuning for 3D Pre-trained Models
AAAI 2024
Learning Manipulation by Predicting Interaction
RSS 2024
Decoding Human Language Acquisition: EEG Evidence for Predictive Probabilistic Statistics in Word Segmentation
INTERSPEECH 2024
SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation
ICML 2024
LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Control and Rendering
NIPS 2024
Learning an Actionable Discrete Diffusion Policy via Large-Scale Actionless Video Pre-Training
NIPS 2024
Towards Nonlinear-Motion-Aware and Occlusion-Robust Rolling Shutter Correction
ICCV 2023
Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning
NIPS 2023
Cross-Domain Policy Adaptation via Value-Guided Data Filtering
NIPS 2023
Affordance-Driven Next-Best-View Planning for Robotic Grasping
CORL 2023
Fully Self-Supervised Depth Estimation From Defocus Clue
CVPR 2023
One-Shot High-Fidelity Talking-Head Synthesis With Deformable Neural Radiance Field
CVPR 2023
Propagate and Calibrate: Real-Time Passive Non-Line-of-Sight Tracking
CVPR 2023
ViewRefer: Grasp the Multi-view Knowledge for 3D Visual Grounding
ICCV 2023
Not All Features Matter: Enhancing Few-shot CLIP with Adaptive Prior Refinement
ICCV 2023
Behavior Contrastive Learning for Unsupervised Skill Discovery
ICML 2023
Point-M2AE: Multi-scale Masked Autoencoders for Hierarchical Point Cloud Pre-training
NIPS 2022
RCLane: Relay Chain Prediction for Lane Detection
ECCV 2022
PSRR-MaxpoolNMS: Pyramid Shifted MaxpoolNMS With Relationship Recovery
CVPR 2021
Generating Masks From Boxes by Mining Spatio-Temporal Consistencies in Videos
ICCV 2021
Cortical Oscillatory Hierarchy for Natural Sentence Processing
INTERSPEECH 2020
MaxpoolNMS: Getting Rid of NMS Bottlenecks in Two-Stage Object Detectors
CVPR 2019
Travel Time Estimation without Road Networks: An Urban Morphological Layout Representation Approach
IJCAI 2019
Revealing Spatiotemporal Brain Dynamics of Speech Production Based on EEG and Eye Movement
INTERSPEECH 2018
Video Captioning with Tube Features
IJCAI 2018
HSA-RNN: Hierarchical Structure-Adaptive RNN for Video Summarization
CVPR 2018
MAM-RNN: Multi-level Attention Model Based RNN for Video Captioning
IJCAI 2017
A Neuro-Experimental Evidence for the Motor Theory of Speech Perception
INTERSPEECH 2017
Quasi Real-Time Summarization for Consumer Videos
CVPR 2014
Hierarchical Feature Hashing for Fast Dimensionality Reduction
CVPR 2014
Sparse Output Coding for Large-Scale Visual Recognition
CVPR 2013
Large-Scale Category Structure Aware Image Categorization
NIPS 2011