Zeyu Wang
43 papers · 2020–2026 · 14 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+12 more ↓ Show less ↑
π Conference Polyglot (14) π Renaissance Researcher (6) π Interdisciplinary Bridge π§ Keyword Pioneer π Academic Marathon (5)
π
Academic Marathon
(5)
π
Cross-Pollinator
(11)
πΊοΈ
Taxonomy Completionist
(92)
π§¬
Topic Evolution
π
Grand Slam
π
Keyword Champion
(2)
π₯
Unstoppable
(5)
π
Trend Setter
π
Century Club
(36)
β
The Questioner
(3)
ποΈ
Keyword Collector
(191)
β‘
Prolific Year
(5)
Conferences
AAAI (13)
CVPR (6)
ICCV (6)
NIPS (5)
ICML (3)
ECCV (2)
AACL (1)
ACL (1)
COLING (1)
EACL (1)
EMNLP (1)
ICLR (1)
IJCNLP (1)
MICCAI (1)
Top co-authors
Research topics
Keywords
diffusion model
(4)
model compression
(4)
multimodal learning
(4)
gaussian splatting
(3)
image fusion
(3)
foundation model
(2)
3d reconstruction
(2)
edge deployment
(2)
vision transformer
(2)
post-training quantization
(2)
adversarial training
(2)
low-resource language
(2)
image generation
(2)
contrastive learning
(2)
knowledge distillation
(2)
self-supervised learning
(2)
point cloud
(2)
multilingual nlp
(2)
vector quantization
(2)
diffusion transformer
(2)
Papers
Breaking Task Boundaries: A Unified Model for 3D Medical Image Fusion and Segmentation Guided by Manifold Perspective
AAAI 2026
EmoVid: A Multimodal Emotion Video Dataset for Emotion-Centric Video Understanding and Generation
AAAI 2026
Breaking the Passive Learning Trap: An Active Perception Strategy for Human Motion Prediction
AAAI 2026
UniMGS: Unifying Mesh and 3D Gaussian Splatting with Single-Pass Rasterization and Proxy-Based Deformation
AAAI 2026
ControlFuse: Instruction-guided Multi-Granularity Controllable Image Fusion
AAAI 2026
SigFusion: Unified Signal-Level Self-Supervised Learning Paradigm for Image Fusion
AAAI 2026
EvDiff3D: Event-Aware Diffusion Repair for High-Fidelity Event-Based 3D Reconstruction
AAAI 2026
ViM-VQ: Efficient Post-Training Vector Quantization for Visual Mamba
ICCV 2025
MagicColor: Multi-Instance Sketch Colorization
ICCV 2025
VQ4DiT: Efficient Post-Training Vector Quantization for Diffusion Transformers
AAAI 2025
Spiking Point Transformer for Point Cloud Classification
AAAI 2025
Conditional Semantic Textual Similarity via Conditional Contrastive Learning
COLING 2025
DiT4Edit: Diffusion Transformer for Image Editing
AAAI 2025
RemDet: Rethinking Efficient Model Design for UAV Object Detection
AAAI 2025
Mamba YOLO: A Simple Baseline for Object Detection with State Space Model
AAAI 2025
TRACE: Temporally Reliable Anatomically-Conditioned 3D CT Generation with Enhanced Efficiency
MICCAI 2025
Learning Fused State Representations for Control from Multi-View Observations
ICML 2025
What If We Recaption Billions of Web Images with LLaMA-3?
ICML 2025
Text2VDM: Text to Vector Displacement Maps for Expressive and Interactive 3D Sculpting
ICCV 2025
GS-ID: Illumination Decomposition on Gaussian Splatting via Adaptive Light Aggregation and Diffusion-Guided Material Priors
ICCV 2025
Highlight What You Want: Weakly-Supervised Instance-Level Controllable Infrared-Visible Image Fusion
ICCV 2025
Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training
CVPR 2024
Voila-A: Aligning Vision-Language Models with User's Gaze Attention
NIPS 2024
LaSe-E2V: Towards Language-guided Semantic-aware Event-to-Video Reconstruction
NIPS 2024
DreamCatcher: A Wearer-aware Multi-modal Sleep Event Dataset Based on Earables in Non-restrictive Environments
NIPS 2024
CausalBench: A Comprehensive Benchmark for Evaluating Causal Reasoning Capabilities of Large Language Models
ACL 2024
Revisiting Adversarial Training at Scale
CVPR 2024
SplattingAvatar: Realistic Real-Time Human Avatars with Mesh-Embedded Gaussian Splatting
CVPR 2024
MemoNav: Working Memory Model for Visual Navigation
CVPR 2024
HiGen: Hierarchy-Aware Sequence Generation for Hierarchical Text Classification
EACL 2024
Rejuvenating image-GPT as Strong Visual Representation Learners
ICML 2024
Can CNNs Be More Robust Than Transformers?
ICLR 2023
DistillBEV: Boosting Multi-Camera 3D Object Detection with Cross-Modal Knowledge Distillation
ICCV 2023
An Inverse Scaling Law for CLIP Training
NIPS 2023
Masked Autoencoders Enable Efficient Knowledge Distillers
CVPR 2023
Scenario Diffusion: Controllable Driving Scenario Generation With Diffusion
NIPS 2023
Whose Language Counts as High Quality? Measuring Language Ideologies in Text Data Selection
EMNLP 2022
Detecting Urgency in Multilingual Medical SMS in Kenya
IJCNLP 2022
Multi-Query Video Retrieval
ECCV 2022
Detecting Urgency in Multilingual Medical SMS in Kenya
AACL 2022
Towards Fairness in Visual Recognition: Effective Strategies for Bias Mitigation
CVPR 2020
SiamFC++: Towards Robust and Accurate Visual Tracking with Target Estimation Guidelines
AAAI 2020
Towards Unique and Informative Captioning of Images
ECCV 2020