Jingdong Chen

35 papers · 2007–2026 · 9 conferences · across top CS/AI conferences

Achievements

+15 more ↓

🌍 Conference Polyglot (9) 🌈 Renaissance Researcher (5) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🏃 Academic Marathon (18)

🧭 Keyword Pioneer 🐝 Cross-Pollinator (7) 🏃 Academic Marathon (18) 🌟 Keyword Trendsetter Combo (5) 🤝 Dynamic Duo (14) 🏆 Grand Slam 👥 Mega-Team (69) 🌱 Topic Pioneer 🧬 Topic Evolution 🏆 Keyword Champion 📈 Trend Setter 🗃️ Keyword Collector (162) 🔥 Unstoppable (6) 💎 Century Club (32) ⚡ Prolific Year (6)

Conferences

CVPR (11) AAAI (5) ECCV (5) ICCV (4) INTERSPEECH (4) ICLR (2) NIPS (2) ICML (1) IJCAI (1)

Top co-authors

Ming Yang (15) Jian Wang (10) Yingying Zhang (8) Wei Chu (6) Lixiang Ru (6) Lei Yu (6) Jiangwei Lao (6) Biao Gong (5) DanDan Zheng (5) Ruobing Zheng (4)

Research topics

Architectures (1)

Keywords

vision-language model (5) diffusion model (4) multimodal learning (4) remote sensing (3) self-supervised learning (3) neural network (2) contrastive learning (2) semi-dense matching (2) semantic segmentation (2) image generation (2) domain generalization (2) transfer learning (2) multimodal large language model (2) feature pyramid (2) foundation model (2) multi-modal learning (2) feature matching (2) speech recognition (2) generative model (2) computer vision (1)

Papers

SCAN: Self-Calibrated AutoregressioN for High-Quality Visual Generation AAAI 2026 UniAlignment: Semantic Alignment for Unified Image Generation, Understanding, Manipulation and Perception AAAI 2026 HumanSense: From Multimodal Perception to Empathetic Context-Aware Responses Through Reasoning MLLMs AAAI 2026 SkySense V2: A Unified Foundation Model for Multi-modal Remote Sensing ICCV 2025 When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning ICCV 2025 Animate-X: Universal Character Image Animation with Enhanced Motion Representation ICLR 2025 CasP: Improving Semi-Dense Feature Matching Pipeline Leveraging Cascaded Correspondence Priors for Guidance ICCV 2025 HomoMatcher: Achieving Dense Feature Matching with Semi-Dense Efficiency by Homography Estimation AAAI 2025 Mimir: Improving Video Diffusion Models for Precise Text Understanding CVPR 2025 MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation CVPR 2025 Reversing Flow for Image Restoration CVPR 2025 SkySense-O: Towards Open-World Remote Sensing Interpretation with Vision-Centric Visual-Language Modeling CVPR 2025 POA: Pre-training Once for Models of All Sizes ECCV 2024 Accelerating Pre-training of Multimodal LLMs via Chain-of-Sight NIPS 2024 Towards Better Vision-Inspired Vision-Language Models CVPR 2024 Learning Dynamic Tetrahedra for High-Quality Talking Head Synthesis CVPR 2024 SkySense: A Multi-Modal Remote Sensing Foundation Model Towards Universal Interpretation for Earth Observation Imagery CVPR 2024 StyleTokenizer: Defining Image Style by a Single Instance for Controlling Diffusion Models ECCV 2024 EcoMatcher: Efficient Clustering Oriented Matcher for Detector-free Image Matching ECCV 2024 LogicMP: A Neuro-symbolic Approach for Encoding First-order Logic Constraints ICLR 2024 Uncertainty-guided Learning for Improving Image Manipulation Detection ICCV 2023 Simultaneously Short- and Long-Term Temporal Modeling for Semi-Supervised Video Semantic Segmentation CVPR 2023 CMUA-Watermark: A Cross-Model Universal Adversarial Watermark for Combating Deepfakes AAAI 2022 Training Object Detectors From Scratch: An Empirical Study in the Era of Vision Transformer CVPR 2022 SimAN: Exploring Self-Supervised Representation Learning of Scene Text via Similarity-Aware Normalization CVPR 2022 Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis INTERSPEECH 2022 Hierarchical Memory Learning for Fine-Grained Scene Graph Generation ECCV 2022 Audio-Visual Wake Word Spotting in MISP2021 Challenge: Dataset Release and Deep Analysis INTERSPEECH 2022 LPSNet: A Lightweight Solution for Fast Panoptic Segmentation CVPR 2021 MatchVIE: Exploiting Match Relevancy between Entities for Visual Information Extraction IJCAI 2021 AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario INTERSPEECH 2021 Variational Connectionist Temporal Classification ECCV 2020 Cosine Metric Learning for Speaker Verification in the I-vector Space INTERSPEECH 2018 Deep Speech 2 : End-to-End Speech Recognition in English and Mandarin ICML 2016 Blind channel identification for speech dereverberation using l1-norm sparse learning NIPS 2007