Ming Ding
33 papers · 2019–2026 · 11 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+10 more ↓ Show less ↑
🏃 Academic Marathon (6) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (10) 🐣 Hot Topic Early Bird
🌉
Interdisciplinary Bridge
🌍
Conference Polyglot
(10)
🏃
Academic Marathon
(6)
🧬
Topic Evolution
👥
Mega-Team
(28)
🤝
Dynamic Duo
(25)
🗃️
Keyword Collector
(123)
⚡
Prolific Year
(7)
💎
Century Club
(32)
🔥
Unstoppable
(7)
Conferences
NIPS (7)
ICLR (6)
ACL (5)
ECCV (3)
EMNLP (3)
IJCAI (3)
WACV (2)
AAAI (1)
CVPR (1)
ICCV (1)
IJCNLP (1)
Top co-authors
Keywords
natural language understanding
(3)
graph neural network
(3)
text-to-image generation
(2)
visual question answering
(2)
end-to-end learning
(2)
question answering
(2)
recommender system
(2)
reward model
(2)
visual language model
(2)
dialog generation
(2)
multimodal learning
(2)
text classification
(1)
temporal reasoning
(1)
model evaluation
(1)
preference alignment
(1)
matrix factorization
(1)
semi-supervised learning
(1)
direct preference optimization
(1)
image captioning
(1)
image generation
(1)
Papers
VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation
AAAI 2026
AlignMMBench: Evaluating Chinese Multimodal Alignment in Large Vision-Language Models
ACL 2025
VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents
ICLR 2025
CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer
ICLR 2025
BFS-Prover: Scalable Best-First Tree Search for LLM-based Automatic Theorem Proving
ACL 2025
CodeContests+: High-Quality Test Case Generation for Competitive Programming
EMNLP 2025
CogCoM: A Visual Language Model with Chain-of-Manipulations Reasoning
ICLR 2025
LVBench: An Extreme Long Video Understanding Benchmark
ICCV 2025
When Fairness Meets Privacy: Exploring Privacy Threats in Fair Binary Classifiers via Membership Inference Attacks
IJCAI 2024
CogVLM: Visual Expert for Pretrained Language Models
NIPS 2024
CogAgent: A Visual Language Model for GUI Agents
CVPR 2024
"Refine, Discriminate and Align: Stealing Encoders via Sample-Wise Prototypes and Multi-Relational Extraction"
ECCV 2024
CogView3: Finer and Faster Text-to-Image Generation via Relay Diffusion
ECCV 2024
Inf-DiT: Upsampling any-resolution image with memory-efficient diffusion transformer.
ECCV 2024
Relay Diffusion: Unifying diffusion process across resolutions for image synthesis
ICLR 2024
TI2Net: Temporal Identity Inconsistency Network for Deepfake Detection
WACV 2023
Proactive Deepfake Defence via Identity Watermarking
WACV 2023
ImageReward: Learning and Evaluating Human Preferences for Text-to-Image Generation
NIPS 2023
CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers
ICLR 2023
GLM-130B: An Open Bilingual Pre-trained Model
ICLR 2023
CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers
NIPS 2022
FewNLU: Benchmarking State-of-the-Art Methods for Few-Shot Natural Language Understanding
ACL 2022
GLM: General Language Model Pretraining with Autoregressive Blank Infilling
ACL 2022
Parameter-Efficient Tuning Makes a Good Classification Head
EMNLP 2022
Rethinking the Setting of Semi-supervised Learning on Graphs
IJCAI 2022
CogView: Mastering Text-to-Image Generation via Transformers
NIPS 2021
UFC-BERT: Unifying Multi-Modal Controls for Conditional Image Synthesis
NIPS 2021
Adaptive Diffusion in Graph Neural Networks
NIPS 2021
CogLTX: Applying BERT to Long Texts
NIPS 2020
Towards Knowledge-Based Recommender Dialog System
IJCNLP 2019
ProNE: Fast and Scalable Network Representation Learning
IJCAI 2019
Towards Knowledge-Based Recommender Dialog System
EMNLP 2019
Cognitive Graph for Multi-Hop Reading Comprehension at Scale
ACL 2019