Shu Zhang
19 papers · 2010–2025 · 9 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+7 more ↓ Show less ↑
π Interdisciplinary Bridge π Conference Polyglot (9) π Academic Marathon (15) π Renaissance Researcher (6) πΊοΈ Taxonomy Completionist (43)
π
Interdisciplinary Bridge
π£
Hot Topic Early Bird
πΊοΈ
Taxonomy Completionist
(43)
π§¬
Topic Evolution
ποΈ
Keyword Collector
(73)
π
Conference Pioneer
π
Century Club
(19)
Conferences
CVPR (6)
MICCAI (3)
COLING (2)
ICCV (2)
NIPS (2)
AAAI (1)
CONLL (1)
EMNLP (1)
IJCNLP (1)
Top co-authors
Keywords
large language model
(2)
diffusion model
(2)
representation learning
(2)
contrastive learning
(1)
few-shot learning
(1)
image generation
(1)
feature learning
(1)
motion estimation
(1)
in-context learning
(1)
temporal reasoning
(1)
prompt engineering
(1)
preference optimization
(1)
supervised learning
(1)
multi-label classification
(1)
multi-modal learning
(1)
point cloud
(1)
video understanding
(1)
person re-identification
(1)
visual reasoning
(1)
computer vision
(1)
Papers
MDD-5k: A New Diagnostic Conversation Dataset for Mental Disorders Synthesized via Neuro-Symbolic LLM Agents
AAAI 2025
Improving Motor Imagery EEG Signal Quality with Dynamic Visual Cues: An Innovative Paradigm and Dataset
MICCAI 2025
BrainAlign: EEG-Vision Alignment via Frequency-Aware Temporal Encoder and Differentiable Cluster Assigner
MICCAI 2025
HIVE: Harnessing Human Feedback for Instructional Visual Editing
CVPR 2024
ULIP-2: Towards Scalable Multimodal Pre-training for 3D Understanding
CVPR 2024
DTCA: Dual-Branch Transformer with Cross-Attention for EEG and Eye Movement Data Fusion
MICCAI 2024
UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild
NIPS 2023
Fairness-guided Few-shot Prompting for Large Language Models
NIPS 2023
GlueGen: Plug and Play Multi-modal Encoders for X-to-image Generation
ICCV 2023
Use All the Labels: A Hierarchical Multi-Label Contrastive Learning Framework
CVPR 2022
Deep Homography Estimation for Dynamic Scenes
CVPR 2020
Towards Rich Feature Discovery With Class Activation Maps Augmentation for Person Re-Identification
CVPR 2019
Heterogeneous Memory Enhanced Multimodal Attention Model for Video Question Answering
CVPR 2019
Beyond Face Rotation: Global and Local Perception GAN for Photorealistic and Identity Preserving Frontal View Synthesis
ICCV 2017
A Distribution-based Model to Learn Bilingual Word Embeddings
COLING 2016
Semi-supervised Classification of Twitter Messages for Organization Name Disambiguation
IJCNLP 2013
Part-of-Speech Tagging for Chinese-English Mixed Texts with Dynamic Features
CONLL 2012
Part-of-Speech Tagging for Chinese-English Mixed Texts with Dynamic Features
EMNLP 2012
Structure-Aware Review Mining and Summarization
COLING 2010