Meng Wang
146 papers · 2009–2026 · 16 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+18 more ↓ Show less ↑
πΊοΈ Taxonomy Completionist (21) π§ Keyword Pioneer π Interdisciplinary Bridge π Renaissance Researcher (6) π£ Hot Topic Early Bird
π
Renaissance Researcher
(6)
π
Interdisciplinary Bridge
π
Conference Polyglot
(15)
π
Conference Loyalist
(25)
π
Keyword Trendsetter Combo
(3)
π€
Dynamic Duo
(21)
π
Triple Crown
π
Grand Slam
π¬
Deep Specialist
(18)
π§¬
Topic Evolution
π
Keyword Champion
(2)
π₯
Unstoppable
(8)
β‘
Prolific Year
(19)
β
The Questioner
(5)
π
Century Club
(136)
ποΈ
Keyword Collector
(544)
π
Trend Setter
π
Conference Pioneer
Conferences
AAAI (32)
CVPR (31)
IJCAI (20)
ICML (13)
ICCV (10)
ICLR (9)
ECCV (8)
NIPS (6)
ACL (5)
EMNLP (3)
MICCAI (3)
INTERSPEECH (2)
COLING (1)
IJCNLP (1)
MIDL (1)
WACV (1)
Top co-authors
Research topics
Keywords
video understanding
(9)
representation learning
(8)
knowledge distillation
(8)
model compression
(8)
multimodal learning
(7)
domain adaptation
(7)
large language model
(6)
graph neural network
(6)
multi-modal learning
(6)
contrastive learning
(6)
sample complexity
(5)
action recognition
(4)
convolutional neural network
(4)
image restoration
(4)
recommender system
(4)
image classification
(3)
domain generalization
(3)
image captioning
(3)
visual question answering
(3)
embedding learning
(3)
Papers
Benchmarking Trustworthiness in Multimodal LLMs for Video Understanding
AAAI 2026
RecCocktail: A Generalizable and Efficient Framework for LLM-Based Recommendation
AAAI 2026
FilmWeaver: Weaving Consistent Multi-Shot Videos with Cache-Guided Autoregressive Diffusion
AAAI 2026
AdaptCLIP: Adapting CLIP for Universal Visual Anomaly Detection
AAAI 2026
Towards Non-Stationary Time Series Forecasting with Temporal Stabilization and Frequency Differencing
AAAI 2026
Sparse-Scale Transformer with Bidirectional Awareness for Time Series Forecasting
AAAI 2026
ODL-TempLLM: Ontology-Guided and Description Logic-Reasoned Temporal Reasoning with LLMs
ACL 2026
A-ADAPT: Adaptive Intracranial Artery Segmentation with Morphology-Guided Prompts and Difficulty-Aware Learning
MIDL 2026
Psyche-R1: Towards Reliable Psychological LLMs through Unified Empathy, Expertise, and Reasoning
ACL 2026
Semantic Alignment of Malicious Question Based on Contrastive Semantic Networks and Data Augmentation (Abstract Reprint)
AAAI 2026
Fairness-Aware vCDR-Controlled Generation for Glaucoma Diagnosis
MICCAI 2025
From End-to-end to Step-by-step: Learning to Abstract via Abductive Reinforcement Learning
IJCAI 2025
Towards Micro-Action Recognition with Limited Annotations: An Asynchronous Pseudo Labeling and Training Approach
IJCAI 2025
Knowledge Swapping via Learning and Unlearning
ICML 2025
Navigating Semantic Drift in Task-Agnostic Class-Incremental Learning
ICML 2025
MOL-Mamba: Enhancing Molecular Representation with Structural & Electronic Insights
AAAI 2025
Prototypical Calibrating Ambiguous Samples for Micro-Action Recognition
AAAI 2025
PhysDiff: Physiology-based Dynamicity Disentangled Diffusion Model for Remote Physiological Measurement
AAAI 2025
FakeDiffer: Distributional Disparity Learning on Differentiated Reconstruction for Face Forgery Detection
AAAI 2025
VLScene: Vision-Language Guidance Distillation for Camera-Based 3D Semantic Scene Completion
AAAI 2025
Thinking in Granularity: Dynamic Quantization for Image Super-Resolution by Intriguing Multi-Granularity Clues
AAAI 2025
Cognitive Bias and Reassignment: Who Can Contribute High Quality LLM Data
AAAI 2025
Bi-perspective Splitting Defense: Achieving Clean-Seed-Free Backdoor Security
ICML 2025
TASAR: Transfer-based Attack on Skeletal Action Recognition
ICLR 2025
Training Nonlinear Transformers for Chain-of-Thought Inference: A Theoretical Generalization Analysis
ICLR 2025
Precise Localization of Memories: A Fine-grained Neuron-level Knowledge Editing Technique for LLMs
ICLR 2025
When is Task Vector Provably Effective for Model Editing? A Generalization Analysis of Nonlinear Transformers
ICLR 2025
Feedback Favors the Generalization of Neural ODEs
ICLR 2025
Boosting Adversarial Transferability via Residual Perturbation Attack
ICCV 2025
An Information-Theoretic Regularizer for Lossy Neural Image Compression
ICCV 2025
DistillDrive: End-to-End Multi-Mode Autonomous Driving Distillation by Isomorphic Hetero-Source Planning Model
ICCV 2025
MMAD: Multi-label Micro-Action Detection in Videos
ICCV 2025
GT-Mean Loss: A Simple Yet Effective Solution for Brightness Mismatch in Low-Light Image Enhancement
ICCV 2025
SMoLoRA: Exploring and Defying Dual Catastrophic Forgetting in Continual Visual Instruction Tuning
ICCV 2025
Towards Efficient General Feature Prediction in Masked Skeleton Modeling
ICCV 2025
Visual-Oriented Fine-Grained Knowledge Editing for MultiModal Large Language Models
ICCV 2025
Towards Open-Vocabulary Audio-Visual Event Localization
CVPR 2025
EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering
CVPR 2025
ASAP: Advancing Semantic Alignment Promotes Multi-Modal Manipulation Detecting and Grounding
CVPR 2025
Revisiting Audio-Visual Segmentation with Vision-Centric Transformer
CVPR 2025
Vision-Language Model IP Protection via Prompt-based Learning
CVPR 2025
RIFNet: Bridging Modalities for Accurate and Detailed Ocular Disease Analysis
MICCAI 2025
Parameterized Diffusion Optimization enabled Autoregressive Ordinal Regression for Diabetic Retinopathy Grading
MICCAI 2025
Training A Small Emotional Vision Language Model for Visual Art Comprehension
ECCV 2024
FasMe: Fast and Sample-efficient Meta Estimator for Precision Matrix Learning in Small Sample Settings
NIPS 2024
Temporal Sentence Grounding with Relevance Feedback in Videos
NIPS 2024
Object-Aware Adaptive-Positivity Learning for Audio-Visual Question Answering
AAAI 2024
KPA-Tracker: Towards Robust and Real-Time Category-Level Articulated Object 6D Pose Tracking
AAAI 2024
EulerMormer: Robust Eulerian Motion Magnification via Dynamic Filtering within Transformer
AAAI 2024
A Dual-Way Enhanced Framework from Text Matching Point of View for Multimodal Entity Linking
AAAI 2024
Finding and Editing Multi-Modal Neurons in Pre-Trained Transformers
ACL 2024
Data-Free Quantization via Pseudo-label Filtering
CVPR 2024
Frequency Decoupling for Motion Magnification via Multi-Level Isomorphic Architecture
CVPR 2024
Learning Dynamic Tetrahedra for High-Quality Talking Head Synthesis
CVPR 2024
Structure Matters: Tackling the Semantic Discrepancy in Diffusion Models for Image Inpainting
CVPR 2024
Label-anticipated Event Disentanglement for Audio-Visual Video Parsing
ECCV 2024
StyleTokenizer: Defining Image Style by a Single Instance for Controlling Diffusion Models
ECCV 2024
Knowledge-augmented Financial Market Analysis and Report Generation
EMNLP 2024
A Provably Effective Method for Pruning Experts in Fine-tuned Sparse Mixture-of-Experts
ICML 2024
How Do Nonlinear Transformers Learn and Generalize in In-Context Learning?
ICML 2024
What Improves the Generalization of Graph Transformers? A Theoretical Dive into the Self-attention and Positional Encoding
ICML 2024
Revisiting the Power of Prompt for Visual Tuning
ICML 2024
Adaptive Group Personalization for Federated Mutual Transfer Learning
ICML 2024
SF-DQN: Provable Knowledge Transfer using Successor Feature for Deep Reinforcement Learning
ICML 2024
OSIC: A New One-Stage Image Captioner Coined
IJCAI 2024
Towards Proactive Interactions for In-Vehicle Conversational Assistants Utilizing Large Language Models
IJCAI 2024
Disentangling Cognitive Diagnosis with Limited Exercise Labels
NIPS 2023
Prompting Large Language Models With Answer Heuristics for Knowledge-Based Visual Question Answering
CVPR 2023
MCL: Multi-Granularity Contrastive Learning Framework for Chinese NER
AAAI 2023
Rethinking Data-Free Quantization as a Zero-Sum Game
AAAI 2023
Model Barrier: A Compact Un-Transferable Isolation Domain for Model Intellectual Property Protection
CVPR 2023
Multi-mode Neural Speech Coding Based on Deep Generative Networks
INTERSPEECH 2023
DC-Former: Diverse and Compact Transformer for Person Re-identification
AAAI 2023
Fair Representation Learning for Recommendation: A Mutual Information Perspective
AAAI 2023
On the Convergence and Sample Complexity Analysis of Deep Q-Networks with $\epsilon$-Greedy Exploration
NIPS 2023
Towards Efficient Pre-Trained Language Model via Feature Correlation Distillation
NIPS 2023
Harnessing Unrecognizable Faces for Improving Face Recognition
WACV 2023
A Theoretical Understanding of Shallow Vision Transformers: Learning, Generalization, and Sample Complexity
ICLR 2023
Joint Edge-Model Sparse Learning is Provably Efficient for Graph Neural Networks
ICLR 2023
EmotionNAS: Two-stream Neural Architecture Search for Speech Emotion Recognition
INTERSPEECH 2023
LP-DIF: Learning Local Pattern-Specific Deep Implicit Function for 3D Objects and Scenes
CVPR 2023
Adaptive Data-Free Quantization
CVPR 2023
Fine-Grained Audible Video Description
CVPR 2023
Patch-level Routing in Mixture-of-Experts is Provably Sample-efficient for Convolutional Neural Networks
ICML 2023
Domain Generalized Stereo Matching via Hierarchical Visual Transformation
CVPR 2023
How unlabeled data improve generalization in self-training? A one-hidden-layer theoretical analysis
ICLR 2022
Generalization Guarantee of Training Graph Convolutional Networks with Graph Topology Sampling
ICML 2022
Switchable Online Knowledge Distillation
ECCV 2022
AudioβVisual Segmentation
ECCV 2022
Multi-modal Contrastive Representation Learning for Entity Alignment
COLING 2022
A Difference Standardization Method for Mutual Transfer Learning
ICML 2022
Deep Color Consistent Network for Low-Light Image Enhancement
CVPR 2022
Width & Depth Pruning for Vision Transformers
AAAI 2022
Motion Prediction Using Trajectory Cues
ICCV 2021
Leveraging Table Content for Zero-shot Text-to-SQL with Meta-Learning
AAAI 2021
Discrimination-Aware Mechanism for Fine-Grained Representation Learning
CVPR 2021
Making the Relation Matters: Relation of Relation Learning Network for Sentence Semantic Matching
AAAI 2021
Partial-Label and Structure-constrained Deep Coupled Factorization Network
AAAI 2021
On Fast Adversarial Robustness Adaptation in Model-Agnostic Meta-Learning
ICLR 2021
Proposal-Free Video Grounding with Contextual Pyramid Network
AAAI 2021
Why Lottery Ticket Wins? A Theoretical Perspective of Sample Complexity on Sparse Neural Networks
NIPS 2021
Positive Sample Propagation Along the Audio-Visual Event Line
CVPR 2021
Positive-Congruent Training: Towards Regression-Free Model Updates
CVPR 2021
Reward-Constrained Behavior Cloning
IJCAI 2021
Single View Physical Distance Estimation Using Human Pose
ICCV 2021
Large-Scale Few-Shot Learning via Multi-Modal Knowledge Discovery
ECCV 2020
Practical Detection of Trojan Neural Networks: Data-Limited and Data-Free Cases
ECCV 2020
Feature Pyramid Transformer
ECCV 2020
Iterative Context-Aware Graph Inference for Visual Dialog
CVPR 2020
Enhanced Blind Face Restoration With Multi-Exemplar Images and Adaptive Spatial Feature Fusion
CVPR 2020
More Grounded Image Captioning by Distilling Image-Text Matching Model
CVPR 2020
Fast Learning of Graph Neural Networks with Guaranteed Generalizability: One-hidden-layer Case
ICML 2020
Generative Adversarial Imitation Learning from Failed Experiences (Student Abstract)
AAAI 2020
One-Shot Learning for Long-Tail Visual Relation Detection
AAAI 2020
Learning to Match on Graph for Fashion Compatibility Modeling
AAAI 2020
Revisiting Graph Based Collaborative Filtering: A Linear Residual Graph Convolutional Network Approach
AAAI 2020
Learning to Discretely Compose Reasoning Module Networks for Video Captioning
IJCAI 2020
Multi-Scale Spatial-Temporal Integration Convolutional Tube for Human Action Recognition
IJCAI 2020
Unsupervised Vehicle Re-identification with Progressive Adaptation
IJCAI 2020
Recurrent Relational Memory Network for Unsupervised Image Captioning
IJCAI 2020
Quadratic Sparse Gaussian Graphical Model Estimation Method for Massive Variables
IJCAI 2020
Detail-recovery Image Deraining via Context Aggregation Networks
CVPR 2020
Approximate Optimal Transport for Continuous Densities with Copulas
IJCAI 2019
Dual Visual Attention Network for Visual Dialog
IJCAI 2019
Connectionist Temporal Modeling of Video and Language: a Joint Model for Translation and Sign Labeling
IJCAI 2019
Dense Temporal Convolution Network for Sign Language Translation
IJCAI 2019
Adaptive Transfer Network for Cross-Domain Person Re-Identification
CVPR 2019
ClusterNet: Deep Hierarchical Cluster Network With Rigorously Rotation-Invariant Representation for Point Cloud Analysis
CVPR 2019
TransNFCM: Translation-Based Neural Fashion Compatibility Modeling
AAAI 2019
Graphonomy: Universal Human Parsing via Graph Transfer Learning
CVPR 2019
Personalized Multimedia Item and Key Frame Recommendation
IJCAI 2019
Multi-Cue Correlation Filters for Robust Visual Tracking
CVPR 2018
Fine-grained Image Classification by Visual-Semantic Embedding
IJCAI 2018
Empirical Risk Minimization for Metric Learning Using Privileged Information
IJCAI 2016
A Relaxed Ranking-Based Factor Model for Recommender System from Implicit Feedback
IJCAI 2016
DisturbLabel: Regularizing CNN on the Loss Layer
CVPR 2016
Learned Binary Spectral Shape Descriptor for 3D Shape Correspondence
CVPR 2016
Saliency Detection with a Deeper Investigation of Light Field
IJCAI 2015
3D Deep Shape Descriptor
CVPR 2015
Interaction Part Mining: A Mid-Level Approach for Fine-Grained Action Recognition
CVPR 2015
Online Group Feature Selection
IJCAI 2013
Aspect Ranking: Identifying Important Product Aspects from Online Consumer Reviews
ACL 2011
Domain-Assisted Product Aspect Hierarchy Generation: Towards Hierarchical Organization of Unstructured Consumer Reviews
EMNLP 2011
Prediction of Thematic Rank for Structured Semantic Role Labeling
ACL 2009
Prediction of Thematic Rank for Structured Semantic Role Labeling
IJCNLP 2009
Chinese Semantic Role Labeling with Shallow Parsing
EMNLP 2009