Ya Zhang
89 papers · 2008–2026 · 13 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+15 more ↓ Show less ↑
π Academic Marathon (17) π Conference Polyglot (13) π§ Keyword Pioneer π Interdisciplinary Bridge π Cross-Pollinator (14)
π
Cross-Pollinator
(14)
π
Renaissance Researcher
(9)
πΊοΈ
Taxonomy Completionist
(112)
π
Conference Loyalist
(22)
π€
Dynamic Duo
(52)
π
Grand Slam
π
Keyword Champion
(2)
π
Triple Crown
π±
Topic Pioneer
β‘
Prolific Year
(26)
π
Conference Pioneer
π
Trend Setter
π₯
Unstoppable
(11)
ποΈ
Keyword Collector
(306)
π
Century Club
(86)
Conferences
CVPR (22)
ICCV (11)
NIPS (11)
ECCV (9)
ICML (8)
ICLR (7)
AAAI (6)
EMNLP (4)
ACL (3)
MICCAI (3)
WACV (3)
IJCAI (1)
IJCNLP (1)
Top co-authors
Keywords
representation learning
(8)
large language model
(7)
multimodal learning
(7)
knowledge distillation
(6)
semantic segmentation
(5)
data augmentation
(5)
medical imaging
(5)
diffusion model
(5)
self-supervised learning
(4)
medical diagnosis
(4)
image generation
(4)
contrastive learning
(4)
weakly supervised learning
(3)
convolutional neural network
(3)
image restoration
(3)
collaborative learning
(3)
feature extraction
(3)
domain adaptation
(3)
instruction tuning
(3)
long-tailed distribution
(3)
Papers
Miner: Mining Intrinsic Mastery for Data-Efficient RL in Large Reasoning Models
ACL 2026
Versatile Vision-Language Model for 3D Computed Tomography
AAAI 2026
MedSΒ³: Towards Medical Slow Thinking with Self-Evolved Soft Dual-sided Process Supervision
AAAI 2026
Towards Universal Soccer Video Understanding
CVPR 2025
ConText: Driving In-context Learning for Text Removal and Segmentation
ICML 2025
MoMa: Modulating Mamba for Adapting Image Foundation Models to Video Recognition
ICML 2025
Combatting Dimensional Collapse in LLM Pre-Training Data via Submodular File Selection
ICLR 2025
DSVD: Dynamic Self-Verify Decoding for Faithful Generation in Large Language Models
EMNLP 2025
Fine-tuning with Reserved Majority for Noise Reduction
ICLR 2025
MegaFusion: Extend Diffusion Models towards Higher-Resolution Image Generation without Further Tuning
WACV 2025
MRGen: Segmentation Data Engine For Underrepresented MRI Modalities
ICCV 2025
Differential-informed Sample Selection Accelerates Multimodal Contrastive Learning
ICCV 2025
AutoMedEval: Harnessing Language Models for Automatic Medical Capability Evaluation
ACL 2025
Multi-modal Medical Diagnosis via Large-small Model Collaboration
CVPR 2025
RadIR: A Scalable Framework for Multi-Grained Medical Image Retrieval via Radiology Report Mining
MICCAI 2025
Brain-Heart-Gut Guided Multi-Constraint Knowledge Distillation for Early Alzheimerβs Disease Diagnosis
MICCAI 2025
On Harmonizing Implicit Subpopulations
ICLR 2024
Probabilistic Conformal Distillation for Enhancing Missing Modality Robustness
NIPS 2024
Revive Re-weighting in Imbalanced Learning by Density Ratio Estimation
NIPS 2024
TAIA: Large Language Models are Out-of-Distribution Data Learners
NIPS 2024
Annotation-Free Audio-Visual Segmentation
WACV 2024
MedBench: A Large-Scale Chinese Benchmark for Evaluating Medical Large Language Models
AAAI 2024
DictLLM: Harnessing Key-Value Data Structures with Large Language Models for Enhanced Medical Diagnostics
ACL 2024
Reprogramming Distillation for Medical Foundation Models
MICCAI 2024
Exploring Training on Heterogeneous Data with Mixture of Low-rank Adapters
ICML 2024
HarmoDT: Harmony Multi-Task Decision Transformer for Offline Reinforcement Learning
ICML 2024
Mitigating Noisy Correspondence by Geometrical Structure Consistency Learning
CVPR 2024
Audio-Visual Segmentation via Unlabeled Frame Exploitation
CVPR 2024
Low-Rank Knowledge Decomposition for Medical Foundation Models
CVPR 2024
Q-value Regularized Transformer for Offline Reinforcement Learning
ICML 2024
Diversified Batch Selection for Training Acceleration
ICML 2024
Locally Estimated Global Perturbations are Better than Local Perturbations for Federated Sharpness-aware Minimization
ICML 2024
ReMamber: Referring Image Segmentation with Mamba Twister
ECCV 2024
Knowledge-enhanced Visual-Language Pretraining for Computational Pathology
ECCV 2024
Multi-Sentence Grounding for Long-term Instructional Video
ECCV 2024
CliMedBench: A Large-Scale Chinese Benchmark for Evaluating Medical Large Language Models in Clinical Scenarios
EMNLP 2024
RaTEScore: A Metric for Radiology Report Generation
EMNLP 2024
HSDreport: Heart Sound Diagnosis with Echocardiography Reports
EMNLP 2024
Domain-Inspired Sharpness-Aware Minimization Under Domain Shifts
ICLR 2024
Long-tailed Diffusion Models with Oriented Calibration
ICLR 2024
Learning Multi-Agent Communication from Graph Modeling Perspective
ICLR 2024
Adapting Visual-Language Models for Generalizable Anomaly Detection in Medical Images
CVPR 2024
AttrSeg: Open-Vocabulary Semantic Segmentation via Attribute Decomposition-Aggregation
NIPS 2023
Distilling Vision-Language Pre-Training To Collaborate With Weakly-Supervised Temporal Action Localization
CVPR 2023
Controllable Mesh Generation Through Sparse Latent Point Diffusion Models
CVPR 2023
DR2: Diffusion-Based Robust Degradation Remover for Blind Face Restoration
CVPR 2023
Federated Domain Generalization With Generalization Adjustment
CVPR 2023
Class-Balancing Diffusion Models
CVPR 2023
Enhanced Multimodal Representation Learning With Cross-Modal KD
CVPR 2023
Long-Tailed Partial Label Learning via Dynamic Rebalancing
ICLR 2023
Combating Representation Learning Disparity with Geometric Harmonization
NIPS 2023
Asynchrony-Robust Collaborative Perception via Bird's Eye View Flow
NIPS 2023
Open-vocabulary Object Segmentation with Diffusion Models
ICCV 2023
Uncovering Prototypical Knowledge for Weakly Open-Vocabulary Semantic Segmentation
NIPS 2023
Federated Learning with Bilateral Curation for Partially Class-Disjoint Data
NIPS 2023
Joint-Relation Transformer for Multi-Person Motion Prediction
ICCV 2023
MedKLIP: Medical Knowledge Enhanced Language-Image Pre-Training for X-ray Diagnosis
ICCV 2023
Prompting Visual-Language Models for Efficient Video Understanding
ECCV 2022
LAR-SR: A Local Autoregressive Model for Image Super-Resolution
CVPR 2022
GroupNet: Multiscale Hypergraph Neural Networks for Trajectory Prediction With Relational Reasoning
CVPR 2022
Task Decoupled Framework for Reference-Based Super-Resolution
CVPR 2022
Skeleton-Parted Graph Scattering Networks for 3D Human Motion Prediction
ECCV 2022
Registration Based Few-Shot Anomaly Detection
ECCV 2022
Contrastive Learning with Boosted Memorization
ICML 2022
CaT: Weakly Supervised Object Detection With Category Transfer
ICCV 2021
A Fourier-Based Framework for Domain Generalization
CVPR 2021
Handwritten Chinese Font Generation With Collaborative Stroke Refinement
WACV 2021
Invariant Teacher and Equivariant Student for Unsupervised 3D Human Pose Estimation
AAAI 2021
Collaborative Uncertainty in Multi-Agent Trajectory Forecasting
NIPS 2021
Divide and Conquer for Single-Frame Temporal Action Localization
ICCV 2021
Graph Cross Networks with Vertex Infomax Pooling
NIPS 2020
FTL: A universal framework for training low-bit DNNs via Feature Transfer
ECCV 2020
Collaborative Motion Prediction via Neural Motion Message Passing
CVPR 2020
Iteratively-Refined Interactive 3D Medical Image Segmentation With Multi-Agent Reinforcement Learning
CVPR 2020
Dynamic Multiscale Graph Neural Networks for 3D Skeleton Based Human Motion Prediction
CVPR 2020
Bottom-Up Temporal Action Localization with Mutual Regularization
ECCV 2020
Accelerate CNN via Recursive Bayesian Pruning
ICCV 2019
Safeguarded Dynamic Label Regression for Noisy Supervision
AAAI 2019
Understanding VAEs in Fisher-Shannon Plane
AAAI 2019
Actional-Structural Graph Convolutional Networks for Skeleton-Based Action Recognition
CVPR 2019
Masking: A New Perspective of Noisy Supervision
NIPS 2018
Multi-Scale Spatially-Asymmetric Recalibration for Image Classification
ECCV 2018
Collaborative Learning for Weakly Supervised Object Detection
IJCAI 2018
Separating Style and Content for Generalized Style Transfer
CVPR 2018
SORT: Second-Order Response Transform for Visual Recognition
ICCV 2017
Part-Stacked CNN for Fine-Grained Visual Categorization
CVPR 2016
Augmenting Strong Supervision Using Web Data for Fine-Grained Categorization
ICCV 2015
Joint Optimization for Consistent Multiple Graph Matching
ICCV 2013
A Two-Stage Approach to Chinese Part-of-Speech Tagging
IJCNLP 2008