Yanfeng Wang
109 papers · 2018–2026 · 14 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+17 more ↓ Show less ↑
π§ Keyword Pioneer π Conference Polyglot (14) πΊοΈ Taxonomy Completionist (19) π Interdisciplinary Bridge π Academic Marathon (7)
π
Academic Marathon
(7)
π
Cross-Pollinator
(14)
π
Renaissance Researcher
(12)
π
Conference Loyalist
(21)
π
Grand Slam
π§¬
Topic Evolution
π
Keyword Champion
π₯
Mega-Team
(23)
π
Triple Crown
π€
Dynamic Duo
(52)
π¬
Deep Specialist
(16)
β‘
Prolific Year
(43)
π
Conference Pioneer
π
Trend Setter
π
Century Club
(102)
ποΈ
Keyword Collector
(412)
π₯
Unstoppable
(8)
Conferences
CVPR (21)
ACL (16)
EMNLP (15)
ICML (10)
NIPS (10)
ICCV (9)
ICLR (9)
AAAI (6)
COLING (3)
ECCV (3)
INTERSPEECH (2)
MICCAI (2)
WACV (2)
NSDI (1)
Top co-authors
Research topics
Keywords
large language model
(18)
multimodal learning
(11)
federated learning
(9)
knowledge distillation
(8)
instruction tuning
(7)
transfer learning
(5)
video understanding
(5)
vision-language model
(5)
medical imaging
(5)
semantic segmentation
(4)
data augmentation
(4)
contrastive learning
(4)
diffusion model
(4)
domain adaptation
(4)
representation learning
(3)
zero-shot learning
(3)
foundation model
(3)
self-supervised learning
(3)
multi-modal learning
(3)
domain generalization
(3)
Papers
SLoRA: Balancing Plasticity and Forgetting in Large Language Models for Continual Learning
ACL 2026
MedSΒ³: Towards Medical Slow Thinking with Self-Evolved Soft Dual-sided Process Supervision
AAAI 2026
MCP-Flow: Facilitating LLM Agents to Master Real-World, Diverse and Scaling MCP Tools
ACL 2026
Versatile Vision-Language Model for 3D Computed Tomography
AAAI 2026
Miner: Mining Intrinsic Mastery for Data-Efficient RL in Large Reasoning Models
ACL 2026
When Seeing Is not Enough: Revealing the Limits of Active Reasoning in MLLMs
ACL 2026
Cross-Modal Coreference Alignment: Enabling Reliable Information Transfer in Omni-LLMs
ACL 2026
AutoMedEval: Harnessing Language Models for Automatic Medical Capability Evaluation
ACL 2025
ReflecTool: Towards Reflection-Aware Tool-Augmented Clinical Agents
ACL 2025
4DGC: Rate-Aware 4D Gaussian Compression for Efficient Streamable Free-Viewpoint Video
CVPR 2025
Advancing Myopia To Holism: Fully Contrastive Language-Image Pre-training
CVPR 2025
LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant
CVPR 2025
Towards Universal Soccer Video Understanding
CVPR 2025
Bridging the Dynamic Perception Gap: Training-Free Draft Chain-of-Thought for Dynamic Multimodal Spatial Reasoning
EMNLP 2025
FedMABench: Benchmarking Mobile GUI Agents on Decentralized Heterogeneous User Data
EMNLP 2025
Towards Omni-RAG: Comprehensive Retrieval-Augmented Generation for Large Language Models in Medical Applications
ACL 2025
DSVD: Dynamic Self-Verify Decoding for Faithful Generation in Large Language Models
EMNLP 2025
VocalNet: Speech LLMs with Multi-Token Prediction for Faster and High-Quality Generation
EMNLP 2025
EvolveBench: A Comprehensive Benchmark for Assessing Temporal Awareness in LLMs on Evolving Knowledge
ACL 2025
Synthesizing Post-Training Data for LLMs through Multi-Agent Simulation
ACL 2025
FedDQC: Data Quality Control in Federated Instruction-tuning of Large Language Models
ACL 2025
VRVVC: Variable-Rate NeRF-Based Volumetric Video Compression
AAAI 2025
DICE: Structured Reasoning in LLMs through SLM-Guided Chain-of-Thought Correction
EMNLP 2025
MegaFusion: Extend Diffusion Models towards Higher-Resolution Image Generation without Further Tuning
WACV 2025
RadIR: A Scalable Framework for Multi-Grained Medical Image Retrieval via Radiology Report Mining
MICCAI 2025
MoMa: Modulating Mamba for Adapting Image Foundation Models to Video Recognition
ICML 2025
ConText: Driving In-context Learning for Text Removal and Segmentation
ICML 2025
Fine-tuning with Reserved Majority for Noise Reduction
ICLR 2025
Combatting Dimensional Collapse in LLM Pre-Training Data via Submodular File Selection
ICLR 2025
Emerging Safety Attack and Defense in Federated Instruction Tuning of Large Language Models
ICLR 2025
Differential-informed Sample Selection Accelerates Multimodal Contrastive Learning
ICCV 2025
MRGen: Segmentation Data Engine For Underrepresented MRI Modalities
ICCV 2025
MobileA3gent: Training Mobile GUI Agents Using Decentralized Self-Sourced Data from Diverse Users
EMNLP 2025
Audio-Visual Segmentation via Unlabeled Frame Exploitation
CVPR 2024
Probabilistic Conformal Distillation for Enhancing Missing Modality Robustness
NIPS 2024
WebUOT-1M: Advancing Deep Underwater Object Tracking with A Million-Scale Benchmark
NIPS 2024
Language-Driven Interactive Traffic Trajectory Generation
NIPS 2024
Revive Re-weighting in Imbalanced Learning by Density Ratio Estimation
NIPS 2024
TAIA: Large Language Models are Out-of-Distribution Data Learners
NIPS 2024
FedLLM-Bench: Realistic Benchmarks for Federated Learning of Large Language Models
NIPS 2024
MedBench: A Large-Scale Chinese Benchmark for Evaluating Medical Large Language Models
AAAI 2024
M3AV: A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset
ACL 2024
MM-SAP: A Comprehensive Benchmark for Assessing Self-Awareness of Multimodal Large Language Models in Perception
ACL 2024
SDA: Semantic Discrepancy Alignment for Text-conditioned Image Retrieval
ACL 2024
DictLLM: Harnessing Key-Value Data Structures with Large Language Models for Enhanced Medical Diagnostics
ACL 2024
CF-TCIR: A Compositor-Free Framework for Hierarchical Text-Conditioned Image Retrieval
ACL 2024
CE-VDG: Counterfactual Entropy-based Bias Reduction for Video-grounded Dialogue Generation
COLING 2024
Post-decoder Biasing for End-to-End Speech Recognition of Multi-turn Medical Interview
COLING 2024
Pruning before Fine-tuning: A Retraining-free Compression Framework for Pre-trained Language Models
COLING 2024
Adapting Visual-Language Models for Generalizable Anomaly Detection in Medical Images
CVPR 2024
Mitigating Noisy Correspondence by Geometrical Structure Consistency Learning
CVPR 2024
Low-Rank Knowledge Decomposition for Medical Foundation Models
CVPR 2024
Intelligent Grimm - Open-ended Visual Storytelling via Latent Diffusion Models
CVPR 2024
Editable Scene Simulation for Autonomous Driving via Collaborative LLM-Agents
CVPR 2024
ReMamber: Referring Image Segmentation with Mamba Twister
ECCV 2024
MatchTime: Towards Automatic Soccer Game Commentary Generation
EMNLP 2024
KnowledgeSG: Privacy-Preserving Synthetic Text Generation with Knowledge Distillation from Server
EMNLP 2024
CliMedBench: A Large-Scale Chinese Benchmark for Evaluating Medical Large Language Models in Clinical Scenarios
EMNLP 2024
RA2FD: Distilling Faithfulness into Efficient Dialogue Systems
EMNLP 2024
RaTEScore: A Metric for Radiology Report Generation
EMNLP 2024
MedCare: Advancing Medical LLMs through Decoupling Clinical Alignment and Knowledge Aggregation
EMNLP 2024
HSDreport: Heart Sound Diagnosis with Echocardiography Reports
EMNLP 2024
Fake It Till Make It: Federated Learning with Consensus-Oriented Generation
ICLR 2024
An Extensible Framework for Open Heterogeneous Collaborative Perception
ICLR 2024
On Harmonizing Implicit Subpopulations
ICLR 2024
Long-tailed Diffusion Models with Oriented Calibration
ICLR 2024
Domain-Inspired Sharpness-Aware Minimization Under Domain Shifts
ICLR 2024
Locally Estimated Global Perturbations are Better than Local Perturbations for Federated Sharpness-aware Minimization
ICML 2024
Diversified Batch Selection for Training Acceleration
ICML 2024
Q-value Regularized Transformer for Offline Reinforcement Learning
ICML 2024
HarmoDT: Harmony Multi-Task Decision Transformer for Offline Reinforcement Learning
ICML 2024
Self-Alignment of Large Language Models via Monopolylogue-based Social Scene Simulation
ICML 2024
Exploring Training on Heterogeneous Data with Mixture of Low-rank Adapters
ICML 2024
Towards an End-to-End Framework for Invasive Brain Signal Decoding with Large Language Models
INTERSPEECH 2024
Reprogramming Distillation for Medical Foundation Models
MICCAI 2024
POSEIDON: A Consolidated Virtual Network Controller that Manages Millions of Tenants via Config Tree
NSDI 2024
Long-Tailed Partial Label Learning via Dynamic Rebalancing
ICLR 2023
Combating Representation Learning Disparity with Geometric Harmonization
NIPS 2023
EqMotion: Equivariant Multi-Agent Motion Prediction With Invariant Interaction Reasoning
CVPR 2023
Collaboration Helps Camera Overtake LiDAR in 3D Detection
CVPR 2023
Leapfrog Diffusion Model for Stochastic Trajectory Prediction
CVPR 2023
Personalized Federated Learning with Inferred Collaboration Graphs
ICML 2023
FedDisco: Federated Learning with Discrepancy-Aware Collaboration
ICML 2023
Distilling Vision-Language Pre-Training To Collaborate With Weakly-Supervised Temporal Action Localization
CVPR 2023
Self-Improvement of Non-autoregressive Model via Sequence-Level Distillation
EMNLP 2023
AttrSeg: Open-Vocabulary Semantic Segmentation via Attribute Decomposition-Aggregation
NIPS 2023
MedKLIP: Medical Knowledge Enhanced Language-Image Pre-Training for X-ray Diagnosis
ICCV 2023
Joint-Relation Transformer for Multi-Person Motion Prediction
ICCV 2023
Open-vocabulary Object Segmentation with Diffusion Models
ICCV 2023
Auxiliary Tasks Benefit 3D Skeleton-based Human Motion Prediction
ICCV 2023
Federated Domain Generalization With Generalization Adjustment
CVPR 2023
DR2: Diffusion-Based Robust Degradation Remover for Blind Face Restoration
CVPR 2023
Uncovering Prototypical Knowledge for Weakly Open-Vocabulary Semantic Segmentation
NIPS 2023
Federated Learning with Bilateral Curation for Partially Class-Disjoint Data
NIPS 2023
Multi-level Fusion of Wav2vec 2.0 and BERT for Multimodal Emotion Recognition
INTERSPEECH 2022
Handwritten Mathematical Expression Recognition via Attention Aggregation Based Bi-directional Mutual Learning
AAAI 2022
Divide and Conquer for Single-Frame Temporal Action Localization
ICCV 2021
H2O: A Benchmark for Visual Human-Human Object Handover Analysis
ICCV 2021
Inferring Emotion from Large-scale Internet Voice Data: A Semi-supervised Curriculum Augmentation based Deep Learning Approach
AAAI 2021
Handwritten Chinese Font Generation With Collaborative Stroke Refinement
WACV 2021
A Fourier-Based Framework for Domain Generalization
CVPR 2021
Iteratively-Refined Interactive 3D Medical Image Segmentation With Multi-Agent Reinforcement Learning
CVPR 2020
Dynamic Multiscale Graph Neural Networks for 3D Skeleton Based Human Motion Prediction
CVPR 2020
Bottom-Up Temporal Action Localization with Mutual Regularization
ECCV 2020
Accelerate CNN via Recursive Bayesian Pruning
ICCV 2019
Actional-Structural Graph Convolutional Networks for Skeleton-Based Action Recognition
CVPR 2019
Transferable Interactiveness Knowledge for Human-Object Interaction Detection
CVPR 2019
Joint 3D Face Reconstruction and Dense Alignment with Position Map Regression Network
ECCV 2018
The Sogou-TIIC Speech Translation System for IWSLT 2018
EMNLP 2018