Yizhou Wang

110 papers · 2013–2026 · 16 conferences · across top CS/AI conferences

Achievements

+17 more ↓

🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🗺️ Taxonomy Completionist (13) 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (16)

🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (13) 🧭 Keyword Pioneer 🏠 Conference Loyalist (29) 🤝 Dynamic Duo (17) 👑 Triple Crown 🏆 Grand Slam 🌱 Topic Pioneer 🔬 Deep Specialist (14) 🏆 Keyword Champion ⚡ Prolific Year (14) 📈 Trend Setter ❓ The Questioner (2) 🗃️ Keyword Collector (409) 💎 Century Club (106) 🚀 Conference Pioneer 🔥 Unstoppable (13)

Conferences

CVPR (29) ICLR (14) ICCV (13) ICML (11) NIPS (10) ECCV (8) AAAI (6) ACL (5) IJCAI (4) WACV (3) EMNLP (2) AISTATS (1) COLING (1) CORL (1) INTERSPEECH (1) MICCAI (1)

Top co-authors

Fangwei Zhong (19) Wentao Zhu (13) CHUNYU WANG (13) Xinwei Sun (13) Xiaoxuan Ma (12) SHIXIANG TANG (11) Wanli Ouyang (11) Yun Fu (10) Hai Ci (8) Yizhou Yu (8)

Research topics

Computer Vision (1) Applications (1)

Keywords

human pose estimation (8) pose estimation (8) object detection (6) transfer learning (6) large language model (5) 3d reconstruction (5) contrastive learning (4) medical imaging (4) few-shot learning (4) multi-agent system (4) domain adaptation (4) attention mechanism (4) action recognition (4) video understanding (4) 3d pose estimation (4) reinforcement learning (3) person re-identification (3) multi-agent reinforcement learning (3) representation learning (3) causal inference (3)

Papers

Communication-Efficient Desire Alignment for Proactive Embodied Human–Agent Interaction ACL 2026 From Words to Pixels: A Comprehensive Survey on Large Language Models in Visual Segmentation ACL 2026 Revealing the Seen, Imagining the Beyond: A Survey of Image-Grounded Chain-of-Thought Reasoning in Multimodal LLMs ACL 2026 How do Role Models Shape Collective Morality? Exemplar-Driven Moral Learning in Multi-Agent Simulation ACL 2026 Towards Zero-Shot 3D Anomaly Localization WACV 2025 Human-Centric Foundation Models: Perception, Generation and Agentic Modeling IJCAI 2025 Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal Large Language Models ICML 2025 Bayesian Active Learning for Bivariate Causal Discovery ICML 2025 Behavior-agnostic Task Inference for Robust Offline In-context Reinforcement Learning ICML 2025 Simulating Human-like Daily Activities with Desire-driven Autonomy ICLR 2025 AdaManip: Adaptive Articulated Object Manipulation Environments and Policy Learning ICLR 2025 Learning Causal Alignment for Reliable Disease Diagnosis ICLR 2025 Autoregressive Sequence Modeling for 3D Medical Image Representation AAAI 2025 Cautious Next Token Prediction ACL 2025 A Differential Inclusion Approach for Learning Heterogeneous Sparsity in Neuroimaging Analysis AISTATS 2025 Exploring Fine-Grained Human Motion Video Captioning COLING 2025 Aligning Human Motion Generation with Human Perceptions ICLR 2025 DyWA: Dynamics-adaptive World Action Model for Generalizable Non-prehensile Manipulation ICCV 2025 UnrealZoo: Enriching Photo-realistic Virtual Worlds for Embodied AI ICCV 2025 Embodied Representation Alignment with Mirror Neurons ICCV 2025 CMT: A Cascade MAR with Topology Predictor for Multimodal Conditional CAD Generation ICCV 2025 EgoAgent: A Joint Predictive Agent Model in Egocentric Worlds ICCV 2025 Representation Potentials of Foundation Models for Multimodal Alignment: A Survey EMNLP 2025 FreeCloth: Free-form Generation Enhances Challenging Clothed Human Modeling CVPR 2025 D-CoDe: Scaling Image-Pretrained VLMs to Video via Dynamic Compression and Question Decomposition EMNLP 2025 SAT-HMR: Real-Time Multi-Person 3D Mesh Estimation via Scale-Adaptive Tokens CVPR 2025 InteractAnything: Zero-shot Human Object Interaction Synthesis via LLM Feedback and Object Affordance Parsing CVPR 2025 Shift Equivariant Pose Network WACV 2025 DetToolChain: A New Prompting Paradigm to Unleash Detection Ability of MLLM ECCV 2024 Richelieu: Self-Evolving LLM-Based Agents for AI Diplomacy NIPS 2024 ScissorBot: Learning Generalizable Scissor Skill for Paper Cutting via Simulation, Imitation, and Sim2Real CORL 2024 Rewrite the Stars CVPR 2024 Instruct-ReID: A Multi-purpose Person Re-identification Task with Instructions CVPR 2024 ScoreHypo: Probabilistic Human Mesh Estimation with Hypothesis Scoring CVPR 2024 Real-time Holistic Robot Pose Estimation with Unknown States ECCV 2024 Safe RLHF: Safe Reinforcement Learning from Human Feedback ICLR 2024 Bongard-OpenWorld: Few-Shot Reasoning for Free-form Visual Concepts in the Real World ICLR 2024 Don't Judge by the Look: Towards Motion Coherent Video Representation ICLR 2024 Causal Discovery via Conditional Independence Testing with Proxy Variables ICML 2024 Fast Peer Adaptation with Context-aware Exploration ICML 2024 Language Models Represent Beliefs of Self and Others ICML 2024 Cross-Dimensional Medical Self-Supervised Representation Learning Based on a Pseudo-3D Transformation MICCAI 2024 Learning Domain-Agnostic Representation for Disease Diagnosis ICLR 2023 Proactive Multi-Camera Collaboration for 3D Human Pose Estimation ICLR 2023 UniHCP: A Unified Model for Human-Centric Perceptions CVPR 2023 3D Human Mesh Estimation From Virtual Markers CVPR 2023 GFPose: Learning 3D Human Pose Prior With Gradient Fields CVPR 2023 HumanBench: Towards General Human-Centric Perception With Projector Assisted Pretraining CVPR 2023 Social Motion Prediction with Cognitive Hierarchies NIPS 2023 Which Invariance Should We Transfer? A Causal Minimax Learning Approach ICML 2023 Causal Discovery from Subsampled Time Series with Proxy Variables NIPS 2023 MotionBERT: A Unified Perspective on Learning Human Motion Representations ICCV 2023 RSPT: Reconstruct Surroundings and Predict Trajectory for Generalizable Active Object Tracking AAAI 2023 BeaverTails: Towards Improved Safety Alignment of LLM via a Human-Preference Dataset NIPS 2023 ChimpACT: A Longitudinal Dataset for Understanding Chimpanzee Behaviors NIPS 2023 Cycle-consistent Masked AutoEncoder for Unsupervised Domain Generalization ICLR 2023 Intrinsic Image Decomposition by Pursuing Reflectance Image IJCAI 2022 MemREIN: Rein the Domain Shift for Cross-Domain Few-Shot Learning IJCAI 2022 LUNA: Localizing Unfamiliarity Near Acquaintance for Open-Set Long-Tailed Recognition AAAI 2022 Unsupervised Object Detection Pretraining with Joint Object Priors Generation and Detector Learning NIPS 2022 ToM2C: Target-oriented Multi-agent Communication and Cooperation with Theory of Mind ICLR 2022 Native phonotactic interference in L2 vowel processing: Mouse-tracking reveals cognitive conflicts during identification INTERSPEECH 2022 MATE: Benchmarking Multi-Agent Reinforcement Learning in Distributed Target Coverage Control NIPS 2022 MoCaNet: Motion Retargeting In-the-Wild via Canonicalization Networks AAAI 2022 Disentangling Disease-related Representation from Obscure for Disease Prediction ICML 2022 Revisiting the Transferability of Supervised Pretraining: An MLP Perspective CVPR 2022 Adaptive Trajectory Prediction via Transferable GNN CVPR 2022 VirtualPose: Learning Generalizable 3D Human Pose Models from Virtual Data ECCV 2022 Faster VoxelPose: Real-Time 3D Human Pose Estimation by Orthographic Projection ECCV 2022 One-Shot Medical Landmark Localization by Edge-Guided Transform and Noisy Landmark Refinement ECCV 2022 Domain Invariant Masked Autoencoders for Self-Supervised Learning from Multi-Domains ECCV 2022 Causal Intervention for Subject-Deconfounded Facial Action Unit Recognition AAAI 2022 Causal Hidden Markov Model for Time Series Disease Forecasting CVPR 2021 An Empirical Study of the Collapsing Problem in Semi-Supervised 2D Human Pose Estimation ICCV 2021 ACE: Ally Complementary Experts for Solving Long-Tailed Recognition in One-Shot ICCV 2021 RODNet: Radar Object Detection Using Cross-Modal Supervision WACV 2021 Towards Distraction-Robust Active Visual Tracking ICML 2021 Context Modeling in 3D Human Pose Estimation: A Unified Perspective CVPR 2021 Towards Unified Surgical Skill Assessment CVPR 2021 Forecasting Irreversible Disease via Progression Learning CVPR 2021 TCGM: An Information-Theoretic Framework for Semi-Supervised Multi-Modality Learning ECCV 2020 Learning Multi-Agent Coordination for Enhancing Target Coverage in Directional Sensor Networks NIPS 2020 Pose-Assisted Multi-Camera Collaboration for Active Object Tracking AAAI 2020 Cross-View Correspondence Reasoning Based on Bipartite Graph Convolutional Network for Mammogram Mass Detection CVPR 2020 MetaFuse: A Pre-trained Fusion Model for Human Pose Estimation CVPR 2020 On Computation and Generalization of Generative Adversarial Imitation Learning ICLR 2020 Align, Attend and Locate: Chest X-Ray Diagnosis via Contrast Induced Attention Network With Limited Supervision ICCV 2019 Learning With Unsure Data for Medical Image Diagnosis ICCV 2019 Optimizing Network Structure for 3D Human Pose Estimation ICCV 2019 Max-MIG: an Information Theoretic Approach for Joint Learning from Crowds ICLR 2019 Completeness Modeling and Context Separation for Weakly Supervised Temporal Action Localization CVPR 2019 L_DMI: A Novel Information-theoretic Loss Function for Training Deep Nets Robust to Label Noise NIPS 2019 AD-VAT: An Asymmetric Dueling mechanism for learning Visual Active Tracking ICLR 2019 CRAVES: Controlling Robotic Arm With a Vision-Based Economic System CVPR 2019 Multi-Agent Tensor Fusion for Contextual Trajectory Prediction CVPR 2019 Cascaded Generative and Discriminative Learning for Microcalcification Detection in Breast Mammograms CVPR 2019 MSplit LBI: Realizing Feature Selection and Dense Estimation Simultaneously in Few-shot and Zero-shot Learning ICML 2018 End-to-end Active Object Tracking via Reinforcement Learning ICML 2018 Video Object Segmentation by Learning Location-Sensitive Embeddings ECCV 2018 Collaborative Deep Reinforcement Learning for Joint Object Search CVPR 2017 Mining 3D Key-Pose-Motifs for Action Recognition CVPR 2016 Maximal Sparsity with Deep Networks? NIPS 2016 Quantized Correlation Hashing for Fast Cross-Modal Search IJCAI 2015 Background Subtraction via Generalized Fused Lasso Foreground Modeling CVPR 2015 Exploiting Object Similarity in 3D Reconstruction ICCV 2015 Robust Estimation of 3D Human Poses from a Single Image CVPR 2014 What Object Motion Reveals about Shape with Unknown BRDF and Lighting CVPR 2013 A Method of Perceptual-Based Shape Decomposition ICCV 2013 An Approach to Pose-Based Action Recognition CVPR 2013 Weakly Supervised Learning for Attribute Localization in Outdoor Scenes CVPR 2013