Yuan Wang

56 papers · 2016–2026 · 14 conferences · across top CS/AI conferences

Achievements

+14 more ↓

🌍 Conference Polyglot (14) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (5) 🏃 Academic Marathon (9)

🏃 Academic Marathon (9) 🐝 Cross-Pollinator (8) 🗺️ Taxonomy Completionist (90) 🧬 Topic Evolution 👥 Mega-Team (22) 🤝 Dynamic Duo (12) 🏆 Grand Slam 🚀 Conference Pioneer 🗃️ Keyword Collector (253) 📈 Trend Setter 💎 Century Club (49) 🔥 Unstoppable (7) ❓ The Questioner ⚡ Prolific Year (18)

Conferences

AAAI (14) CVPR (12) ICCV (6) IJCAI (5) ACL (4) EMNLP (4) ECCV (2) MICCAI (2) NIPS (2) COLING (1) ICLR (1) ICML (1) MIDL (1) NAACL (1)

Top co-authors

Tianzhu Zhang (12) Rui Sun (8) Shengjin Wang (5) Wangkai Li (5) Huayu Mai (4) Zhaoyang Li (4) Zuozhu Liu (4) Huazhu Fu (3) Qingsong Wei (3) Gang Chen (3)

Research topics

Linguistics (1) Education (1)

Keywords

semantic segmentation (8) large language model (6) multimodal learning (4) vision-language model (4) few-shot learning (3) prototype learning (3) few-shot segmentation (3) representation learning (2) spiking neural network (2) spatial reasoning (2) image classification (2) state space model (2) point cloud (2) foundation model (2) feature learning (2) question answering (2) domain adaptation (2) knowledge distillation (2) federated learning (2) information retrieval (2)

Papers

Act as you think: Reinforcing Consistent Reasoning in Medical Visual Question Answering ACL 2026 Beyond N-grams: A Hierarchical Reward Learning Framework for Clinically-Aware Medical Report Generation AAAI 2026 VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation AAAI 2026 IPFormer: Instance Prompt-guided Transformer for Multi-modal Multi-shot Video Understanding AAAI 2026 TCoT: Trajectory Chain-of-Thoughts for Robotic Manipulation with Failure Recovery in Vision-Language-Action Model AAAI 2026 Unreal-MAP: Unreal-Engine-Based General Platform for Multi-agent Reinforcement Learning AAAI 2026 Data Efficient RLVR via Off-Policy Influence Guidance ACL 2026 Generalized Few-Shot Point Cloud Segmentation via LLM-Assisted Hyper-Relation Matching ICCV 2025 Mamba-3VL: Taming State Space Model for 3D Vision Language Learning ICCV 2025 Two Losses, One Goal: Balancing Conflict Gradients for Semi-supervised Semantic Segmentation ICCV 2025 U-ViLAR: Uncertainty-Aware Visual Localization for Autonomous Driving via Differentiable Association and Registration ICCV 2025 ComRAG: Retrieval-Augmented Generation with Dynamic Vector Stores for Real-time Community Question Answering in Industry ACL 2025 V2T-CoT: From Vision to Text Chain-of-Thought for Medical Reasoning and Diagnosis MICCAI 2025 LIBA: Language Instructed Multi-granularity Bridge Assistant for 3D Visual Grounding AAAI 2025 Precise, Fast, and Low-cost Concept Erasure in Value Space: Orthogonal Complement Matters CVPR 2025 HSI-GPT: A General-Purpose Large Scene-Motion-Language Model for Human Scene Interaction CVPR 2025 Dual-Agent Optimization framework for Cross-Domain Few-Shot Segmentation CVPR 2025 Golden Cudgel Network for Real-Time Semantic Segmentation CVPR 2025 Towards Effective and Sparse Adversarial Attack on Spiking Neural Networks via Breaking Invisible Surrogate Gradients CVPR 2025 A Survey of Optimization Modeling Meets LLMs: Progress and Future Directions IJCAI 2025 Look Back for More: Harnessing Historical Sequential Updates for Personalized Federated Adapter Tuning AAAI 2025 Human-Centric Foundation Models: Perception, Generation and Agentic Modeling IJCAI 2025 Beyond Confidence: Exploiting Homogeneous Pattern for Semi-Supervised Semantic Segmentation ICML 2025 Exploring the Better Multimodal Synergy Strategy for Vision-Language Models AAAI 2025 Evaluating Fairness in Large Vision-Language Models Across Diverse Demographic Attributes and Prompts EMNLP 2025 Do Large Language Models Rank Fairly? An Empirical Study on the Fairness of LLMs as Rankers NAACL 2024 Enhancing LLM Reasoning via Vision-Augmented Prompting NIPS 2024 Frequency Shuffling and Enhancement for Open Set Recognition AAAI 2024 Pay Attention to Target: Relation-Aware Temporal Consistency for Domain Adaptive Video Semantic Segmentation AAAI 2024 Near-Optimal Resilient Aggregation Rules for Distributed Learning Using 1-Center and 1-Mean Clustering with Outliers AAAI 2024 An Aggregation-Free Federated Learning for Tackling Data Heterogeneity CVPR 2024 Exploring Pose-Aware Human-Object Interaction via Hybrid Learning CVPR 2024 Image-to-Image Matching via Foundation Models: A New Perspective for Open-Vocabulary Semantic Segmentation CVPR 2024 G^3-LQ: Marrying Hyperbolic Alignment with Explicit Semantic-Geometric Modeling for 3D Visual Grounding CVPR 2024 Localization and Expansion: A Decoupled Framework for Point Cloud Few-shot Semantic Segmentation ECCV 2024 MedCoT: Medical Chain of Thought via Hierarchical Expert EMNLP 2024 Aggregation and Purification: Dual Enhancement Network for Point Cloud Few-shot Segmentation IJCAI 2024 MedSynth: Leveraging Generative Model for Healthcare Data Sharing MICCAI 2024 A New ANN-SNN Conversion Method with High Accuracy, Low Latency and Good Robustness IJCAI 2023 Neural TSP Solver with Progressive Distillation AAAI 2023 Rethinking the Correlation in Few-Shot Segmentation: A Buoys View CVPR 2023 Alignment Before Aggregation: Trajectory Memory Retrieval Network for Video Object Segmentation ICCV 2023 Focus on Query: Adversarial Mining Transformer for Few-Shot Segmentation NIPS 2023 Dynamic Graph Learning With Content-Guided Spatial-Frequency Relation Reasoning for Deepfake Detection CVPR 2023 Adaptive Agent Transformer for Few-Shot Segmentation ECCV 2022 Exploring Dual Encoder Architectures for Question Answering EMNLP 2022 Estimation and Comparison of Linear Regions for ReLU Networks IJCAI 2022 Learning to Detect 3D Facial Landmarks via Heatmap Regression with Graph Convolutional Network AAAI 2022 Improving Adversarially Robust Few-Shot Image Classification With Generalizable Representations CVPR 2022 Memory-efficient Segmentation of High-resolution Volumetric MicroCT Images MIDL 2022 AdaFit: Rethinking Learning-Based Normal Estimation on Point Clouds ICCV 2021 Anchor & Transform: Learning Sparse Embeddings for Large Vocabularies ICLR 2021 Neural Dynamics and Gamma Oscillation on a Hybrid Excitatory-Inhibitory Complex Network (Student Abstract) AAAI 2020 Toward Automated Content Feedback Generation for Non-native Spontaneous Speech ACL 2019 Improving Users’ Demographic Prediction via the Videos They Talk about EMNLP 2016 Predicting Restaurant Consumption Level through Social Media Footprints COLING 2016