Hui Zhang
85 papers · 2009–2026 · 16 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+13 more ↓ Show less ↑
πΊοΈ Taxonomy Completionist (11) π§ Keyword Pioneer π Renaissance Researcher (5) π Interdisciplinary Bridge π Conference Polyglot (15)
π
Academic Marathon
(16)
πΊοΈ
Taxonomy Completionist
(11)
π§
Keyword Pioneer
π¬
Deep Specialist
(12)
π§¬
Topic Evolution
π
Keyword Champion
(2)
ποΈ
Keyword Collector
(314)
β‘
Prolific Year
(13)
π
Conference Pioneer
π
Trend Setter
π
Century Club
(73)
π₯
Unstoppable
(14)
β
The Questioner
(2)
Conferences
AAAI (18)
CVPR (13)
ICCV (10)
INTERSPEECH (10)
ACL (7)
ECCV (6)
MICCAI (5)
EMNLP (3)
NSDI (3)
COLING (2)
IJCAI (2)
NAACL (2)
CORL (1)
IJCNLP (1)
MIDL (1)
NIPS (1)
Top co-authors
Research topics
Keywords
semantic segmentation
(7)
autonomous driving
(5)
diffusion model
(5)
speech separation
(4)
point cloud
(4)
multimodal learning
(4)
image generation
(3)
neural network
(3)
deep neural network
(3)
multimodal large language model
(3)
instance segmentation
(3)
collaborative perception
(3)
signal-to-noise ratio
(2)
3d object detection
(2)
image segmentation
(2)
remote sensing
(2)
3d vision
(2)
depth estimation
(2)
object tracking
(2)
domain adaptation
(2)
Papers
HiPro-CT: A Hierarchical Probabilistic Framework for 3D Medical Vision-Language Alignment
MIDL 2026
Primary Visual Cortex Inspired Point Cloud Analysis Framework
AAAI 2026
EverMemOS: A Self-Organizing Memory Operating System for Structured Long-Horizon Reasoning
ACL 2026
Magnol.AI Copilot: Multimodal LLMs for Conversational Insight Generation
AAAI 2026
Towards High-Resolution 3D Anomaly Detection: A Scalable Dataset and Real-Time Framework for Subtle Industrial Defects
AAAI 2026
LLaVA-MS-PIT: Multi-Modal Schema-Guided Progressive Instruction Tuning for Multi-Modal Event Extraction
AAAI 2026
FedSDWC: Federated Synergistic Dual-Representation Weak Causal Learning for OOD
AAAI 2026
Proxy Zero-Shot Hashing with Multimodal Fusion via Stable Diffusion
AAAI 2026
VGGS: VGGT-guided Gaussian Splatting for Efficient and Faithful Sparse-View Surface Reconstruction
AAAI 2026
Anomagic: Crossmodal Prompt-driven Zero-shot Anomaly Generation
AAAI 2026
From Discriminative to Generative: A Diffusion-Based Paradigm for Multi-Agent Collaborative Perception
AAAI 2026
Remember Me: Bridging the Long-Range Gap in LVLMs with Three-Step Inference-Only Decay Resilience Strategies
AAAI 2026
DATA: Domain-And-Time Alignment for High-Quality Feature Fusion in Collaborative Perception
ICCV 2025
Robust Dexterous Grasping of General Objects
CORL 2025
Unlocking Constraints: Source-Free Occlusion-Aware Seamless Segmentation
ICCV 2025
All-in-One Medical Image Restoration with Latent Diffusion-Enhanced Vector-Quantized Codebook Prior
MICCAI 2025
HGSFusion: Radar-Camera Fusion with Hybrid Generation and Synchronization for 3D Object Detection
AAAI 2025
Cross-Modal Interactive Perception Network with Mamba for Lung Tumor Segmentation in PET-CT Images
CVPR 2025
BlockDance: Reuse Structurally Similar Spatio-Temporal Features to Accelerate Diffusion Transformers
CVPR 2025
Forget the Token and Pixel: Rethinking Gradient Ascent for Concept Unlearning in Multimodal Generative Models
ACL 2025
FEAT: Full-Dimensional Efficient Attention Transformer for Medical Video Generation
MICCAI 2025
CoDTS: Enhancing Sparsely Supervised Collaborative Perception with a Dual Teacher-Student Framework
AAAI 2025
AdaDiff: Adaptive Step Selection for Fast Diffusion Models
AAAI 2025
Enpowering Your Pansharpening Models with Generalizability: Unified Distribution is All You Need
ICCV 2025
MagicMotion: Controllable Video Generation with Dense-to-Sparse Trajectory Guidance
ICCV 2025
CreatiLayout: Siamese Multimodal Diffusion Transformer for Creative Layout-to-Image Generation
ICCV 2025
IBCA: An Intelligent Platform for Social Insurance Benefit Qualification Status Assessment
AAAI 2024
Region Attention Transformer for Medical Image Restoration
MICCAI 2024
Eddeep: Fast eddy-current distortion correction for diffusion MRI with deep learning
MICCAI 2024
All-In-One Medical Image Restoration via Task-Adaptive Routing
MICCAI 2024
UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding
AAAI 2024
Innovative Directional Encoding in Speech Processing: Leveraging Spherical Harmonics Injection for Multi-Channel Speech Enhancement
IJCAI 2024
HIMap: HybrId Representation Learning for End-to-end Vectorized HD Map Construction
CVPR 2024
Validating Privacy-Preserving Face Recognition under a Minimum Assumption
CVPR 2024
MapDistill: Boosting Efficient Camera-based HD Map Construction via Camera-LiDAR Fusion Model Distillation
ECCV 2024
Segmentation-guided Layer-wise Image Vectorization with Gradient Fills
ECCV 2024
Occlusion-Aware Seamless Segmentation
ECCV 2024
GraspXL: Generating Grasping Motions for Diverse Objects at Scale
ECCV 2024
Is Your HD Map Constructor Reliable under Sensor Corruptions?
NIPS 2024
Focused and Collaborative Feedback Integration for Interactive Image Segmentation
CVPR 2023
Linking Garment With Person via Semantically Associated Landmarks for Virtual Try-On
CVPR 2023
Prototypical Residual Networks for Anomaly Detection and Localization
CVPR 2023
Physically Realizable Natural-Looking Clothing Textures Evade Person Detectors via 3D Modeling
CVPR 2023
AlphaRoute: Large-Scale Coordinated Route Planning via Monte Carlo Tree Search
AAAI 2023
Retro-FPN: Retrospective Feature Pyramid Network for Point Cloud Semantic Segmentation
ICCV 2023
PaddleSpeech: An Easy-to-Use All-in-One Speech Toolkit
NAACL 2022
Speaker recognition-assisted robust audio deepfake detection
INTERSPEECH 2022
Thin-Plate Spline Motion Model for Image Animation
CVPR 2022
Slot-VPS: Object-Centric Representation Learning for Video Panoptic Segmentation
CVPR 2022
CSL: A Large-scale Chinese Scientific Literature Dataset
COLING 2022
Camera Auto-Calibration from the Steiner Conic of the Fundamental Matrix
ECCV 2022
Learning Frequency-Aware Dynamic Network for Efficient Super-Resolution
ICCV 2021
Order Regularization on Ordinal Loss for Head Pose, Age and Gaze Estimation
AAAI 2021
Interaction via Bi-Directional Graph of Semantic Region Affinity for Scene Parsing
ICCV 2021
Free-Form Description Guided 3D Visual Graph Network for Object Grounding in Point Cloud
ICCV 2021
Prototypical Matching and Open Set Rejection for Zero-Shot Semantic Segmentation
ICCV 2021
Robust Speaker Extraction Network Based on Iterative Refined Adaptation
INTERSPEECH 2021
AutoSTR: Efficient Backbone Search for Scene Text Recognition
ECCV 2020
Polishing the Classical Likelihood Ratio Test by Supervised Learning for Voice Activity Detection
INTERSPEECH 2020
All You Need Is Boundary: Toward Arbitrary-Shaped Text Spotting
AAAI 2020
UNetGAN: A Robust Speech Enhancement Approach in Time Domain for Extremely Low Signal-to-Noise Ratio Condition
INTERSPEECH 2019
Learning Alignment for Multimodal Emotion Recognition from Speech
INTERSPEECH 2019
Investigation of Cost Function for Supervised Monaural Speech Separation
INTERSPEECH 2019
Using Shifted Real Spectrum Mask as Training Target for Supervised Speech Separation
INTERSPEECH 2018
Improving Mongolian Phrase Break Prediction by Using Syllable and Morphological Embeddings with BiLSTM Model
INTERSPEECH 2018
A LSTM Approach with Sub-Word Embeddings for Mongolian Phrase Break Prediction
COLING 2018
Pytheas: Enabling Data-Driven Quality of Experience Optimization Using Group-Based Exploration-Exploitation
NSDI 2017
Multi-Target Ensemble Learning for Monaural Speech Separation
INTERSPEECH 2017
DRLnet: Deep Difference Representation Learning Network and An Unsupervised Optimization Framework
IJCAI 2017
Efficient 3D Room Shape Recovery From a Single Panorama
CVPR 2016
CFA: A Practical Prediction System for Video QoE Optimization
NSDI 2016
Jointly Optimizing Activation Coefficients of Convolutive NMF Using DNN for Speech Separation
INTERSPEECH 2016
Homography Estimation From the Common Self-Polar Triangle of Separate Ellipses
CVPR 2016
The Common Self-Polar Triangle of Concentric Circles and Its Application to Camera Calibration
CVPR 2015
C3: Internet-Scale Control Plane for Video Quality Optimization
NSDI 2015
Kneser-Ney Smoothing on Expected Counts
ACL 2014
Observational Initialization of Type-Supervised Taggers
ACL 2014
Beyond Left-to-Right: Multiple Decomposition Structures for SMT
NAACL 2013
An Exploration of Forest-to-String Translation: Does Translation Help or Hurt Parsing?
ACL 2012
Convolution Kernel over Packed Parse Forest
ACL 2010
Non-Isomorphic Forest Pair Translation
EMNLP 2010
K-Best Combination of Syntactic Parsers
EMNLP 2009
Forest-based Tree Sequence to String Translation Model
ACL 2009
Forest-based Tree Sequence to String Translation Model
IJCNLP 2009
Fast Translation Rule Matching for Syntax-based Statistical Machine Translation
EMNLP 2009