Hui Zhang

85 papers · 2009–2026 · 16 conferences · across top CS/AI conferences

Achievements

+13 more ↓

🗺️ Taxonomy Completionist (11) 🧭 Keyword Pioneer 🌈 Renaissance Researcher (5) 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (15)

🏃 Academic Marathon (16) 🗺️ Taxonomy Completionist (11) 🧭 Keyword Pioneer 🔬 Deep Specialist (12) 🧬 Topic Evolution 🏆 Keyword Champion (2) 🗃️ Keyword Collector (314) ⚡ Prolific Year (13) 🚀 Conference Pioneer 📈 Trend Setter 💎 Century Club (73) 🔥 Unstoppable (14) ❓ The Questioner (2)

Conferences

AAAI (18) CVPR (13) ICCV (10) INTERSPEECH (10) ACL (7) ECCV (6) MICCAI (5) EMNLP (3) NSDI (3) COLING (2) IJCAI (2) NAACL (2) CORL (1) IJCNLP (1) MIDL (1) NIPS (1)

Top co-authors

Xueliang Zhang (7) Min Zhang (6) Haizhou Li (6) Zuxuan Wu (5) ByungIn Yoo (4) Zhiwen Yang (4) Chew Lim Tan (4) Bingzheng Wei (4) Yan Xu (4) Junchen Jiang (3)

Research topics

Applications (1) Digital Humanities (1)

Keywords

semantic segmentation (7) autonomous driving (5) diffusion model (5) speech separation (4) point cloud (4) multimodal learning (4) image generation (3) neural network (3) deep neural network (3) multimodal large language model (3) instance segmentation (3) collaborative perception (3) signal-to-noise ratio (2) 3d object detection (2) image segmentation (2) remote sensing (2) 3d vision (2) depth estimation (2) object tracking (2) domain adaptation (2)

Papers

HiPro-CT: A Hierarchical Probabilistic Framework for 3D Medical Vision-Language Alignment MIDL 2026 Primary Visual Cortex Inspired Point Cloud Analysis Framework AAAI 2026 EverMemOS: A Self-Organizing Memory Operating System for Structured Long-Horizon Reasoning ACL 2026 Magnol.AI Copilot: Multimodal LLMs for Conversational Insight Generation AAAI 2026 Towards High-Resolution 3D Anomaly Detection: A Scalable Dataset and Real-Time Framework for Subtle Industrial Defects AAAI 2026 LLaVA-MS-PIT: Multi-Modal Schema-Guided Progressive Instruction Tuning for Multi-Modal Event Extraction AAAI 2026 FedSDWC: Federated Synergistic Dual-Representation Weak Causal Learning for OOD AAAI 2026 Proxy Zero-Shot Hashing with Multimodal Fusion via Stable Diffusion AAAI 2026 VGGS: VGGT-guided Gaussian Splatting for Efficient and Faithful Sparse-View Surface Reconstruction AAAI 2026 Anomagic: Crossmodal Prompt-driven Zero-shot Anomaly Generation AAAI 2026 From Discriminative to Generative: A Diffusion-Based Paradigm for Multi-Agent Collaborative Perception AAAI 2026 Remember Me: Bridging the Long-Range Gap in LVLMs with Three-Step Inference-Only Decay Resilience Strategies AAAI 2026 DATA: Domain-And-Time Alignment for High-Quality Feature Fusion in Collaborative Perception ICCV 2025 Robust Dexterous Grasping of General Objects CORL 2025 Unlocking Constraints: Source-Free Occlusion-Aware Seamless Segmentation ICCV 2025 All-in-One Medical Image Restoration with Latent Diffusion-Enhanced Vector-Quantized Codebook Prior MICCAI 2025 HGSFusion: Radar-Camera Fusion with Hybrid Generation and Synchronization for 3D Object Detection AAAI 2025 Cross-Modal Interactive Perception Network with Mamba for Lung Tumor Segmentation in PET-CT Images CVPR 2025 BlockDance: Reuse Structurally Similar Spatio-Temporal Features to Accelerate Diffusion Transformers CVPR 2025 Forget the Token and Pixel: Rethinking Gradient Ascent for Concept Unlearning in Multimodal Generative Models ACL 2025 FEAT: Full-Dimensional Efficient Attention Transformer for Medical Video Generation MICCAI 2025 CoDTS: Enhancing Sparsely Supervised Collaborative Perception with a Dual Teacher-Student Framework AAAI 2025 AdaDiff: Adaptive Step Selection for Fast Diffusion Models AAAI 2025 Enpowering Your Pansharpening Models with Generalizability: Unified Distribution is All You Need ICCV 2025 MagicMotion: Controllable Video Generation with Dense-to-Sparse Trajectory Guidance ICCV 2025 CreatiLayout: Siamese Multimodal Diffusion Transformer for Creative Layout-to-Image Generation ICCV 2025 IBCA: An Intelligent Platform for Social Insurance Benefit Qualification Status Assessment AAAI 2024 Region Attention Transformer for Medical Image Restoration MICCAI 2024 Eddeep: Fast eddy-current distortion correction for diffusion MRI with deep learning MICCAI 2024 All-In-One Medical Image Restoration via Task-Adaptive Routing MICCAI 2024 UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding AAAI 2024 Innovative Directional Encoding in Speech Processing: Leveraging Spherical Harmonics Injection for Multi-Channel Speech Enhancement IJCAI 2024 HIMap: HybrId Representation Learning for End-to-end Vectorized HD Map Construction CVPR 2024 Validating Privacy-Preserving Face Recognition under a Minimum Assumption CVPR 2024 MapDistill: Boosting Efficient Camera-based HD Map Construction via Camera-LiDAR Fusion Model Distillation ECCV 2024 Segmentation-guided Layer-wise Image Vectorization with Gradient Fills ECCV 2024 Occlusion-Aware Seamless Segmentation ECCV 2024 GraspXL: Generating Grasping Motions for Diverse Objects at Scale ECCV 2024 Is Your HD Map Constructor Reliable under Sensor Corruptions? NIPS 2024 Focused and Collaborative Feedback Integration for Interactive Image Segmentation CVPR 2023 Linking Garment With Person via Semantically Associated Landmarks for Virtual Try-On CVPR 2023 Prototypical Residual Networks for Anomaly Detection and Localization CVPR 2023 Physically Realizable Natural-Looking Clothing Textures Evade Person Detectors via 3D Modeling CVPR 2023 AlphaRoute: Large-Scale Coordinated Route Planning via Monte Carlo Tree Search AAAI 2023 Retro-FPN: Retrospective Feature Pyramid Network for Point Cloud Semantic Segmentation ICCV 2023 PaddleSpeech: An Easy-to-Use All-in-One Speech Toolkit NAACL 2022 Speaker recognition-assisted robust audio deepfake detection INTERSPEECH 2022 Thin-Plate Spline Motion Model for Image Animation CVPR 2022 Slot-VPS: Object-Centric Representation Learning for Video Panoptic Segmentation CVPR 2022 CSL: A Large-scale Chinese Scientific Literature Dataset COLING 2022 Camera Auto-Calibration from the Steiner Conic of the Fundamental Matrix ECCV 2022 Learning Frequency-Aware Dynamic Network for Efficient Super-Resolution ICCV 2021 Order Regularization on Ordinal Loss for Head Pose, Age and Gaze Estimation AAAI 2021 Interaction via Bi-Directional Graph of Semantic Region Affinity for Scene Parsing ICCV 2021 Free-Form Description Guided 3D Visual Graph Network for Object Grounding in Point Cloud ICCV 2021 Prototypical Matching and Open Set Rejection for Zero-Shot Semantic Segmentation ICCV 2021 Robust Speaker Extraction Network Based on Iterative Refined Adaptation INTERSPEECH 2021 AutoSTR: Efficient Backbone Search for Scene Text Recognition ECCV 2020 Polishing the Classical Likelihood Ratio Test by Supervised Learning for Voice Activity Detection INTERSPEECH 2020 All You Need Is Boundary: Toward Arbitrary-Shaped Text Spotting AAAI 2020 UNetGAN: A Robust Speech Enhancement Approach in Time Domain for Extremely Low Signal-to-Noise Ratio Condition INTERSPEECH 2019 Learning Alignment for Multimodal Emotion Recognition from Speech INTERSPEECH 2019 Investigation of Cost Function for Supervised Monaural Speech Separation INTERSPEECH 2019 Using Shifted Real Spectrum Mask as Training Target for Supervised Speech Separation INTERSPEECH 2018 Improving Mongolian Phrase Break Prediction by Using Syllable and Morphological Embeddings with BiLSTM Model INTERSPEECH 2018 A LSTM Approach with Sub-Word Embeddings for Mongolian Phrase Break Prediction COLING 2018 Pytheas: Enabling Data-Driven Quality of Experience Optimization Using Group-Based Exploration-Exploitation NSDI 2017 Multi-Target Ensemble Learning for Monaural Speech Separation INTERSPEECH 2017 DRLnet: Deep Difference Representation Learning Network and An Unsupervised Optimization Framework IJCAI 2017 Efficient 3D Room Shape Recovery From a Single Panorama CVPR 2016 CFA: A Practical Prediction System for Video QoE Optimization NSDI 2016 Jointly Optimizing Activation Coefficients of Convolutive NMF Using DNN for Speech Separation INTERSPEECH 2016 Homography Estimation From the Common Self-Polar Triangle of Separate Ellipses CVPR 2016 The Common Self-Polar Triangle of Concentric Circles and Its Application to Camera Calibration CVPR 2015 C3: Internet-Scale Control Plane for Video Quality Optimization NSDI 2015 Kneser-Ney Smoothing on Expected Counts ACL 2014 Observational Initialization of Type-Supervised Taggers ACL 2014 Beyond Left-to-Right: Multiple Decomposition Structures for SMT NAACL 2013 An Exploration of Forest-to-String Translation: Does Translation Help or Hurt Parsing? ACL 2012 Convolution Kernel over Packed Parse Forest ACL 2010 Non-Isomorphic Forest Pair Translation EMNLP 2010 K-Best Combination of Syntactic Parsers EMNLP 2009 Forest-based Tree Sequence to String Translation Model ACL 2009 Forest-based Tree Sequence to String Translation Model IJCNLP 2009 Fast Translation Rule Matching for Syntax-based Statistical Machine Translation EMNLP 2009