Shu Zhang

19 papers · 2010–2025 · 9 conferences · across top CS/AI conferences

Achievements

+7 more ↓

🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (9) 🏃 Academic Marathon (15) 🌈 Renaissance Researcher (6) 🗺️ Taxonomy Completionist (43)

🌉 Interdisciplinary Bridge 🐣 Hot Topic Early Bird 🗺️ Taxonomy Completionist (43) 🧬 Topic Evolution 🗃️ Keyword Collector (73) 🚀 Conference Pioneer 💎 Century Club (19)

Conferences

CVPR (6) MICCAI (3) COLING (2) ICCV (2) NIPS (2) AAAI (1) CONLL (1) EMNLP (1) IJCNLP (1)

Top co-authors

Ran Xu (5) Caiming Xiong (5) Ning Yu (4) Enze Shi (3) Can Qin (3) Silvio Savarese (3) Stefano Ermon (3) Kui Zhao (2) Feng Ji (2) Huan Wang (2)

Keywords

large language model (2) diffusion model (2) representation learning (2) contrastive learning (1) few-shot learning (1) image generation (1) feature learning (1) motion estimation (1) in-context learning (1) temporal reasoning (1) prompt engineering (1) preference optimization (1) supervised learning (1) multi-label classification (1) multi-modal learning (1) point cloud (1) video understanding (1) person re-identification (1) visual reasoning (1) computer vision (1)

Papers

MDD-5k: A New Diagnostic Conversation Dataset for Mental Disorders Synthesized via Neuro-Symbolic LLM Agents AAAI 2025 Improving Motor Imagery EEG Signal Quality with Dynamic Visual Cues: An Innovative Paradigm and Dataset MICCAI 2025 BrainAlign: EEG-Vision Alignment via Frequency-Aware Temporal Encoder and Differentiable Cluster Assigner MICCAI 2025 HIVE: Harnessing Human Feedback for Instructional Visual Editing CVPR 2024 ULIP-2: Towards Scalable Multimodal Pre-training for 3D Understanding CVPR 2024 DTCA: Dual-Branch Transformer with Cross-Attention for EEG and Eye Movement Data Fusion MICCAI 2024 UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild NIPS 2023 Fairness-guided Few-shot Prompting for Large Language Models NIPS 2023 GlueGen: Plug and Play Multi-modal Encoders for X-to-image Generation ICCV 2023 Use All the Labels: A Hierarchical Multi-Label Contrastive Learning Framework CVPR 2022 Deep Homography Estimation for Dynamic Scenes CVPR 2020 Towards Rich Feature Discovery With Class Activation Maps Augmentation for Person Re-Identification CVPR 2019 Heterogeneous Memory Enhanced Multimodal Attention Model for Video Question Answering CVPR 2019 Beyond Face Rotation: Global and Local Perception GAN for Photorealistic and Identity Preserving Frontal View Synthesis ICCV 2017 A Distribution-based Model to Learn Bilingual Word Embeddings COLING 2016 Semi-supervised Classification of Twitter Messages for Organization Name Disambiguation IJCNLP 2013 Part-of-Speech Tagging for Chinese-English Mixed Texts with Dynamic Features CONLL 2012 Part-of-Speech Tagging for Chinese-English Mixed Texts with Dynamic Features EMNLP 2012 Structure-Aware Review Mining and Summarization COLING 2010