Ming Cheng
22 papers · 2019–2026 · 9 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+6 more ↓ Show less ↑
🏃 Academic Marathon (6) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (8) 🐝 Cross-Pollinator (13)
🏃
Academic Marathon
(6)
🐝
Cross-Pollinator
(13)
🗃️
Keyword Collector
(132)
💎
Century Club
(20)
🔥
Unstoppable
(5)
⚡
Prolific Year
(10)
Conferences
CVPR (5)
EMNLP (4)
IJCAI (3)
NAACL (3)
AAAI (2)
ACL (2)
ICLR (1)
INTERSPEECH (1)
WACV (1)
Top co-authors
Keywords
point cloud
(5)
multimodal learning
(4)
3d vision
(3)
domain adaptation
(3)
music performance
(2)
attention mechanism
(2)
diffusion model
(2)
autonomous driving
(2)
pose estimation
(2)
representation learning
(2)
zero-shot learning
(2)
question answering
(2)
convolutional neural network
(2)
visual question answering
(1)
video generation
(1)
temporal modeling
(1)
sparse learning
(1)
text classification
(1)
adversarial learning
(1)
contrastive learning
(1)
Papers
Physically-Based LiDAR Smoke Simulation for Robust 3D Object Detection
AAAI 2026
Walking Further: Semantic-Aware Multimodal Gait Recognition Under Long-Range Conditions
AAAI 2026
GS-CPR: Efficient Camera Pose Refinement via 3D Gaussian Splatting
ICLR 2025
Sci-LoRA: Mixture of Scientific LoRAs for Cross-Domain Lay Paraphrasing
ACL 2025
Point-Cache: Test-time Dynamic and Hierarchical Cache for Robust and Generalizable Point Cloud Analysis
CVPR 2025
ProtoVQA: An Adaptable Prototypical Framework for Explainable Fine-Grained Visual Question Answering
EMNLP 2025
A Generalizable Rhetorical Strategy Annotation Model Using LLM-based Debate Simulation and Labelling
EMNLP 2025
Learning Sparsity for Effective and Efficient Music Performance Question Answering
ACL 2025
VTechAGP: An Academic-to-General-Audience Text Paraphrase Dataset and Benchmark Models
NAACL 2025
Visual Zero-Shot E-Commerce Product Attribute Value Extraction
NAACL 2025
Temporal Working Memory: Query-Guided Segment Refinement for Enhanced Multimodal Understanding
NAACL 2025
FT2TF: First-Person Statement Text-To-Talking Face Generation
WACV 2025
Density-guided Translator Boosts Synthetic-to-Real Unsupervised Domain Adaptive Segmentation of 3D Point Clouds
CVPR 2024
DiffLoc: Diffusion Model for Outdoor LiDAR Localization
CVPR 2024
Bridging LiDAR Gaps: A Multi-LiDARs Domain Adaptation Dataset for 3D Semantic Segmentation
IJCAI 2024
Learning Musical Representations for Music Performance Question Answering
EMNLP 2024
VoxBlink2: A 100K+ Speaker Recognition Corpus and the Open-Set Speaker-Identification Benchmark
INTERSPEECH 2024
DiffuseStyleGesture: Stylized Audio-Driven Co-Speech Gesture Generation with Diffusion Models
IJCAI 2023
Multi-Graph Fusion Networks for Urban Region Embedding
IJCAI 2022
RF-Net: An End-To-End Image Matching Network Based on Receptive Field
CVPR 2019
Bacteria Biotope Relation Extraction via Lexical Chains and Dependency Graphs
EMNLP 2019
LO-Net: Deep Real-Time Lidar Odometry
CVPR 2019