conftrace_

Jingdong Chen

35 papers · 2007–2026 · 9 conferences · across top CS/AI conferences

Achievements

🌍 Conference Polyglot (9) · 🌈 Renaissance Researcher (5) · 🌉 Interdisciplinary Bridge · 🧭 Keyword Pioneer · 🏃 Academic Marathon (18) · 🐝 Cross-Pollinator (7) · 🌟 Keyword Trendsetter Combo (5) · 🤝 Dynamic Duo (14) · 🏆 Grand Slam · 👥 Mega-Team (69) · 🌱 Topic Pioneer · 🧬 Topic Evolution · 🏆 Keyword Champion · 📈 Trend Setter · 🗃️ Keyword Collector (162) · 🔥 Unstoppable (6) · 💎 Century Club (32) · Prolific Year (6)

Conferences

CVPR (11) AAAI (5) ECCV (5) ICCV (4) INTERSPEECH (4) ICLR (2) NIPS (2) ICML (1) IJCAI (1)

Papers

SCAN: Self-Calibrated AutoregressioN for High-Quality Visual Generation · AAAI 2026
UniAlignment: Semantic Alignment for Unified Image Generation, Understanding, Manipulation and Perception · AAAI 2026
HumanSense: From Multimodal Perception to Empathetic Context-Aware Responses Through Reasoning MLLMs · AAAI 2026
SkySense V2: A Unified Foundation Model for Multi-modal Remote Sensing · ICCV 2025
When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning · ICCV 2025
Animate-X: Universal Character Image Animation with Enhanced Motion Representation · ICLR 2025
CasP: Improving Semi-Dense Feature Matching Pipeline Leveraging Cascaded Correspondence Priors for Guidance · ICCV 2025
HomoMatcher: Achieving Dense Feature Matching with Semi-Dense Efficiency by Homography Estimation · AAAI 2025
Mimir: Improving Video Diffusion Models for Precise Text Understanding · CVPR 2025
MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation · CVPR 2025
Reversing Flow for Image Restoration · CVPR 2025
SkySense-O: Towards Open-World Remote Sensing Interpretation with Vision-Centric Visual-Language Modeling · CVPR 2025
POA: Pre-training Once for Models of All Sizes · ECCV 2024
Accelerating Pre-training of Multimodal LLMs via Chain-of-Sight · NIPS 2024
Towards Better Vision-Inspired Vision-Language Models · CVPR 2024
Learning Dynamic Tetrahedra for High-Quality Talking Head Synthesis · CVPR 2024
SkySense: A Multi-Modal Remote Sensing Foundation Model Towards Universal Interpretation for Earth Observation Imagery · CVPR 2024
StyleTokenizer: Defining Image Style by a Single Instance for Controlling Diffusion Models · ECCV 2024
EcoMatcher: Efficient Clustering Oriented Matcher for Detector-free Image Matching · ECCV 2024
LogicMP: A Neuro-symbolic Approach for Encoding First-order Logic Constraints · ICLR 2024
Uncertainty-guided Learning for Improving Image Manipulation Detection · ICCV 2023
Simultaneously Short- and Long-Term Temporal Modeling for Semi-Supervised Video Semantic Segmentation · CVPR 2023
CMUA-Watermark: A Cross-Model Universal Adversarial Watermark for Combating Deepfakes · AAAI 2022
Training Object Detectors From Scratch: An Empirical Study in the Era of Vision Transformer · CVPR 2022
SimAN: Exploring Self-Supervised Representation Learning of Scene Text via Similarity-Aware Normalization · CVPR 2022
Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis · INTERSPEECH 2022
Hierarchical Memory Learning for Fine-Grained Scene Graph Generation · ECCV 2022
Audio-Visual Wake Word Spotting in MISP2021 Challenge: Dataset Release and Deep Analysis · INTERSPEECH 2022
LPSNet: A Lightweight Solution for Fast Panoptic Segmentation · CVPR 2021
MatchVIE: Exploiting Match Relevancy between Entities for Visual Information Extraction · IJCAI 2021
AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario · INTERSPEECH 2021
Variational Connectionist Temporal Classification · ECCV 2020
Cosine Metric Learning for Speaker Verification in the I-vector Space · INTERSPEECH 2018
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin · ICML 2016
Blind channel identification for speech dereverberation using l1-norm sparse learning · NIPS 2007