Jingdong Chen
35 papers · 2007–2026 · 9 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+15 more ↓ Show less ↑
🌍 Conference Polyglot (9) 🌈 Renaissance Researcher (5) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🏃 Academic Marathon (18)
🧭
Keyword Pioneer
🐝
Cross-Pollinator
(7)
🏃
Academic Marathon
(18)
🌟
Keyword Trendsetter Combo
(5)
🤝
Dynamic Duo
(14)
🏆
Grand Slam
👥
Mega-Team
(69)
🌱
Topic Pioneer
🧬
Topic Evolution
🏆
Keyword Champion
📈
Trend Setter
🗃️
Keyword Collector
(162)
🔥
Unstoppable
(6)
💎
Century Club
(32)
⚡
Prolific Year
(6)
Conferences
CVPR (11)
AAAI (5)
ECCV (5)
ICCV (4)
INTERSPEECH (4)
ICLR (2)
NIPS (2)
ICML (1)
IJCAI (1)
Top co-authors
Research topics
Keywords
vision-language model
(5)
diffusion model
(4)
multimodal learning
(4)
remote sensing
(3)
self-supervised learning
(3)
neural network
(2)
contrastive learning
(2)
semi-dense matching
(2)
semantic segmentation
(2)
image generation
(2)
domain generalization
(2)
transfer learning
(2)
multimodal large language model
(2)
feature pyramid
(2)
foundation model
(2)
multi-modal learning
(2)
feature matching
(2)
speech recognition
(2)
generative model
(2)
computer vision
(1)
Papers
SCAN: Self-Calibrated AutoregressioN for High-Quality Visual Generation
AAAI 2026
UniAlignment: Semantic Alignment for Unified Image Generation, Understanding, Manipulation and Perception
AAAI 2026
HumanSense: From Multimodal Perception to Empathetic Context-Aware Responses Through Reasoning MLLMs
AAAI 2026
SkySense V2: A Unified Foundation Model for Multi-modal Remote Sensing
ICCV 2025
When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning
ICCV 2025
Animate-X: Universal Character Image Animation with Enhanced Motion Representation
ICLR 2025
CasP: Improving Semi-Dense Feature Matching Pipeline Leveraging Cascaded Correspondence Priors for Guidance
ICCV 2025
HomoMatcher: Achieving Dense Feature Matching with Semi-Dense Efficiency by Homography Estimation
AAAI 2025
Mimir: Improving Video Diffusion Models for Precise Text Understanding
CVPR 2025
MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation
CVPR 2025
Reversing Flow for Image Restoration
CVPR 2025
SkySense-O: Towards Open-World Remote Sensing Interpretation with Vision-Centric Visual-Language Modeling
CVPR 2025
POA: Pre-training Once for Models of All Sizes
ECCV 2024
Accelerating Pre-training of Multimodal LLMs via Chain-of-Sight
NIPS 2024
Towards Better Vision-Inspired Vision-Language Models
CVPR 2024
Learning Dynamic Tetrahedra for High-Quality Talking Head Synthesis
CVPR 2024
SkySense: A Multi-Modal Remote Sensing Foundation Model Towards Universal Interpretation for Earth Observation Imagery
CVPR 2024
StyleTokenizer: Defining Image Style by a Single Instance for Controlling Diffusion Models
ECCV 2024
EcoMatcher: Efficient Clustering Oriented Matcher for Detector-free Image Matching
ECCV 2024
LogicMP: A Neuro-symbolic Approach for Encoding First-order Logic Constraints
ICLR 2024
Uncertainty-guided Learning for Improving Image Manipulation Detection
ICCV 2023
Simultaneously Short- and Long-Term Temporal Modeling for Semi-Supervised Video Semantic Segmentation
CVPR 2023
CMUA-Watermark: A Cross-Model Universal Adversarial Watermark for Combating Deepfakes
AAAI 2022
Training Object Detectors From Scratch: An Empirical Study in the Era of Vision Transformer
CVPR 2022
SimAN: Exploring Self-Supervised Representation Learning of Scene Text via Similarity-Aware Normalization
CVPR 2022
Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis
INTERSPEECH 2022
Hierarchical Memory Learning for Fine-Grained Scene Graph Generation
ECCV 2022
Audio-Visual Wake Word Spotting in MISP2021 Challenge: Dataset Release and Deep Analysis
INTERSPEECH 2022
LPSNet: A Lightweight Solution for Fast Panoptic Segmentation
CVPR 2021
MatchVIE: Exploiting Match Relevancy between Entities for Visual Information Extraction
IJCAI 2021
AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario
INTERSPEECH 2021
Variational Connectionist Temporal Classification
ECCV 2020
Cosine Metric Learning for Speaker Verification in the I-vector Space
INTERSPEECH 2018
Deep Speech 2 : End-to-End Speech Recognition in English and Mandarin
ICML 2016
Blind channel identification for speech dereverberation using l1-norm sparse learning
NIPS 2007