Mengmeng Wang

39 papers · 2017–2026 · 10 conferences · across top CS/AI conferences

Achievements

🧭 Keyword Pioneer 🗺️ Taxonomy Completionist (13) 🌈 Renaissance Researcher (6) 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (10)

🏃 Academic Marathon (8) 🏆 Grand Slam 🤝 Dynamic Duo (23) 🧬 Topic Evolution 🔥 Unstoppable (7) ⚡ Prolific Year (10) ❓ The Questioner (2) 🗃️ Keyword Collector (187) 💎 Century Club (38)

Conferences

AAAI (11) ICCV (7) CVPR (5) IJCAI (4) ACL (3) NIPS (3) ECCV (2) ICLR (2) ICML (1) JMLR (1)

Top co-authors

Yong Liu (23) Guang Dai (15) Jingdong Wang (11) Yi Yuan (5) Haonan Lin (5) Liang Liu (4) Jun Chen (4) Jiahao Wang (4) Jiangning Zhang (4) Guojiang Shen (4)

Keywords

diffusion model (7) convolutional neural network (5) depth estimation (4) attention mechanism (4) image generation (4) feature fusion (3) face reenactment (3) self-supervised learning (3) video action recognition (2) temporal modeling (2) few-shot action recognition (2) text-to-image generation (2) text-to-image diffusion (2) spatiotemporal feature (2) object tracking (2) multimodal learning (2) pose estimation (2) zero-shot learning (2) image editing (2) generative adversarial network (2)

Papers

Improving Region Representation Learning from Urban Imagery with Noisy Long-Caption Supervision AAAI 2026
Action Detail Matters: Refining Video Recognition with Local Action Queries CVPR 2025
SpotActor: Training-Free Layout-Controlled Consistent Image Generation AAAI 2025
Are the Values of LLMs Structurally Aligned with Humans? A Causal Perspective ACL 2025
Low-Biased General Annotated Dataset Generation CVPR 2025
TrackAny3D: Transferring Pretrained 3D Models for Category-unified 3D Point Cloud Tracking ICCV 2025
Manifold Constraint Reduces Exposure Bias in Accelerated Diffusion Sampling ICLR 2025
DynaMind: Reasoning over Abstract Video Dynamics for Embodied Decision-Making ICML 2025
Instructing Text-to-Image Diffusion Models via Classifier-Guided Semantic Optimization IJCAI 2025
VidEvo: Evolving Video Editing through Exhaustive Temporal Modeling IJCAI 2025
LLM-TPF: Multiscale Temporal Periodicity-Semantic Fusion LLMs for Time Series Forecasting IJCAI 2025
EchoGPT: An Interactive Cardiac Function Assessment Model for Echocardiogram Videos IJCAI 2025
Flipped Classroom: Aligning Teacher Attention with Student in Generalized Category Discovery NIPS 2024
LooGLE: Can Long-Context Language Models Understand Long Contexts? ACL 2024
LangSuit·E: Planning, Controlling and Interacting with Large Language Models in Embodied Text Environments ACL 2024
Decentralized Riemannian Conjugate Gradient Method on the Stiefel Manifold ICLR 2024
SDSTrack: Self-Distillation Symmetric Adapter Learning for Multi-Modal Visual Object Tracking CVPR 2024
OneActor: Consistent Subject Generation via Cluster-Conditioned Guidance NIPS 2024
Learning Discretized Neural Networks under Ricci Flow JMLR 2024
Schedule Your Edit: A Simple yet Effective Diffusion Noise Schedule for Image Editing NIPS 2024
SSMG: Spatial-Semantic Map Guided Diffusion Model for Free-Form Layout-to-Image Generation AAAI 2024
A Multimodal, Multi-Task Adapting Framework for Video Action Recognition AAAI 2024
Synchronize Feature Extracting and Matching: A Single Branch Framework for 3D Object Tracking ICCV 2023
Revisiting the Spatial and Temporal Modeling for Few-Shot Action Recognition AAAI 2023
RICO: Regularizing the Unobservable for Indoor Compositional Reconstruction ICCV 2023
Boosting Few-shot Action Recognition with Graph-guided Hybrid Matching ICCV 2023
E-NeRV: Expedite Neural Video Representation with Disentangled Spatial-Temporal Context ECCV 2022
One-shot Face Reenactment Using Appearance Adaptive Normalization AAAI 2021
FCFR-Net: Feature Fusion based Coarse-to-Fine Residual Learning for Depth Completion AAAI 2021
Self-Supervised Monocular Depth Estimation for All Day Images Using Domain Separation ICCV 2021
HR-Depth: High Resolution Self-Supervised Monocular Depth Estimation AAAI 2021
Structure-aware Person Image Generation with Pose Decomposition and Semantic Correlation AAAI 2021
RFNet: Recurrent Forward Network for Dense Point Cloud Completion ICCV 2021
DTVNet: Dynamic Time-lapse Video Generation via Single Still Image ECCV 2020
Realistic Face Reenactment via Self-Supervised Disentangling of Identity and Pose AAAI 2020
FDN: Feature Decoupling Network for Head Pose Estimation AAAI 2020
FReeNet: Multi-Identity Face Reenactment CVPR 2020
STM: SpatioTemporal and Motion Encoding for Action Recognition ICCV 2019
Large Margin Object Tracking With Circulant Feature Maps CVPR 2017