Mengmeng Wang
39 papers · 2017–2026 · 10 conferences · across top CS/AI conferences
Achievements
Keyword Pioneer
Taxonomy Completionist (13)
Renaissance Researcher (6)
Interdisciplinary Bridge
Conference Polyglot (10)
Academic Marathon (8)
Grand Slam
Dynamic Duo (23)
Topic Evolution
Unstoppable (7)
Prolific Year (10)
The Questioner (2)
Keyword Collector (187)
Century Club (38)
Conferences
AAAI (11)
ICCV (7)
CVPR (5)
IJCAI (4)
ACL (3)
NIPS (3)
ECCV (2)
ICLR (2)
ICML (1)
JMLR (1)
Keywords
diffusion model (7)
convolutional neural network (5)
depth estimation (4)
attention mechanism (4)
image generation (4)
feature fusion (3)
face reenactment (3)
self-supervised learning (3)
video action recognition (2)
temporal modeling (2)
few-shot action recognition (2)
text-to-image generation (2)
text-to-image diffusion (2)
spatiotemporal feature (2)
object tracking (2)
multimodal learning (2)
pose estimation (2)
zero-shot learning (2)
image editing (2)
generative adversarial network (2)
Papers
Improving Region Representation Learning from Urban Imagery with Noisy Long-Caption Supervision
AAAI 2026
Action Detail Matters: Refining Video Recognition with Local Action Queries
CVPR 2025
SpotActor: Training-Free Layout-Controlled Consistent Image Generation
AAAI 2025
Are the Values of LLMs Structurally Aligned with Humans? A Causal Perspective
ACL 2025
Low-Biased General Annotated Dataset Generation
CVPR 2025
TrackAny3D: Transferring Pretrained 3D Models for Category-unified 3D Point Cloud Tracking
ICCV 2025
Manifold Constraint Reduces Exposure Bias in Accelerated Diffusion Sampling
ICLR 2025
DynaMind: Reasoning over Abstract Video Dynamics for Embodied Decision-Making
ICML 2025
Instructing Text-to-Image Diffusion Models via Classifier-Guided Semantic Optimization
IJCAI 2025
VidEvo: Evolving Video Editing through Exhaustive Temporal Modeling
IJCAI 2025
LLM-TPF: Multiscale Temporal Periodicity-Semantic Fusion LLMs for Time Series Forecasting
IJCAI 2025
EchoGPT: An Interactive Cardiac Function Assessment Model for Echocardiogram Videos
IJCAI 2025
Flipped Classroom: Aligning Teacher Attention with Student in Generalized Category Discovery
NIPS 2024
LooGLE: Can Long-Context Language Models Understand Long Contexts?
ACL 2024
LangSuit·E: Planning, Controlling and Interacting with Large Language Models in Embodied Text Environments
ACL 2024
Decentralized Riemannian Conjugate Gradient Method on the Stiefel Manifold
ICLR 2024
SDSTrack: Self-Distillation Symmetric Adapter Learning for Multi-Modal Visual Object Tracking
CVPR 2024
OneActor: Consistent Subject Generation via Cluster-Conditioned Guidance
NIPS 2024
Learning Discretized Neural Networks under Ricci Flow
JMLR 2024
Schedule Your Edit: A Simple yet Effective Diffusion Noise Schedule for Image Editing
NIPS 2024
SSMG: Spatial-Semantic Map Guided Diffusion Model for Free-Form Layout-to-Image Generation
AAAI 2024
A Multimodal, Multi-Task Adapting Framework for Video Action Recognition
AAAI 2024
Synchronize Feature Extracting and Matching: A Single Branch Framework for 3D Object Tracking
ICCV 2023
Revisiting the Spatial and Temporal Modeling for Few-Shot Action Recognition
AAAI 2023
RICO: Regularizing the Unobservable for Indoor Compositional Reconstruction
ICCV 2023
Boosting Few-shot Action Recognition with Graph-guided Hybrid Matching
ICCV 2023
E-NeRV: Expedite Neural Video Representation with Disentangled Spatial-Temporal Context
ECCV 2022
One-shot Face Reenactment Using Appearance Adaptive Normalization
AAAI 2021
FCFR-Net: Feature Fusion based Coarse-to-Fine Residual Learning for Depth Completion
AAAI 2021
Self-Supervised Monocular Depth Estimation for All Day Images Using Domain Separation
ICCV 2021
HR-Depth: High Resolution Self-Supervised Monocular Depth Estimation
AAAI 2021
Structure-aware Person Image Generation with Pose Decomposition and Semantic Correlation
AAAI 2021
RFNet: Recurrent Forward Network for Dense Point Cloud Completion
ICCV 2021
DTVNet: Dynamic Time-lapse Video Generation via Single Still Image
ECCV 2020
Realistic Face Reenactment via Self-Supervised Disentangling of Identity and Pose
AAAI 2020
FDN: Feature Decoupling Network for Head Pose Estimation
AAAI 2020
FReeNet: Multi-Identity Face Reenactment
CVPR 2020
STM: SpatioTemporal and Motion Encoding for Action Recognition
ICCV 2019
Large Margin Object Tracking With Circulant Feature Maps
CVPR 2017