Xirong Li

20 papers · 2018–2025 · 9 conferences · across top CS/AI conferences

Achievements

+8 more ↓

🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (9) 🏃 Academic Marathon (7) 🌈 Renaissance Researcher (7) 🗺️ Taxonomy Completionist (56)

🐣 Hot Topic Early Bird 🌍 Conference Polyglot (9) 🏃 Academic Marathon (7) 🧬 Topic Evolution 💎 Century Club (20) ⚡ Prolific Year (7) 🔥 Unstoppable (5) 🗃️ Keyword Collector (104)

Conferences

ICCV (6) AAAI (3) CVPR (3) ACL (2) ECCV (2) COLING (1) IJCAI (1) IJCNLP (1) MICCAI (1)

Top co-authors

Juan Cao (6) Jingyu Liu (3) Qiang Sheng (3) Ruixiang Zhao (3) Jiazhen Liu (3) Yuhan Fu (3) Bangxiang Lan (3) Ruobing Xie (3) Zhanhui Kang (3) Zijie Xin (3)

Keywords

cross-modal matching (3) cross-modal retrieval (3) image classification (2) multimodal learning (2) multimodal large language model (2) video understanding (2) image manipulation detection (2) clip model (2) text-to-video retrieval (2) document reranking (2) claim detection (2) visual question answering (1) image generation (1) preference learning (1) object detection (1) text classification (1) feature learning (1) computer vision (1) feature extraction (1) semantic segmentation (1)

Papers

Multi-Object Sketch Animation by Scene Decomposition and Motion Planning ICCV 2025 Music Grounding by Short Video ICCV 2025 PhD: A ChatGPT-Prompted Visual Hallucination Evaluation Dataset CVPR 2025 D&M: Enriching E-commerce Videos with Sound Effects by Key Moment Detection and SFX Matching AAAI 2025 FunBench: Benchmarking Fundus Reading Skills of MLLMs MICCAI 2025 Hybrid-Tower: Fine-grained Pseudo-query Interaction and Generation for Text-to-Video Retrieval ICCV 2025 Mitigating Hallucination in Multimodal Large Language Model via Hallucination-targeted Direct Preference Optimization ACL 2025 Tackling Long Code Search with Splitting, Encoding, and Aggregating COLING 2024 Holistic Features are almost Sufficient for Text-to-Video Retrieval CVPR 2024 Geometrized Transformer for Self-Supervised Homography Estimation ICCV 2023 SAFL-Net: Semantic-Agnostic Feature Learning Network with Auxiliary Plugins for Image Manipulation Detection ICCV 2023 Semi-Supervised Keypoint Detector and Descriptor for Retinal Image Matching ECCV 2022 DRAG: Dynamic Region-Aware GCN for Privacy-Leaking Image Detection AAAI 2022 Lightweight Attentional Feature Fusion: A New Baseline for Text-to-Video Retrieval ECCV 2022 Deepfake Network Architecture Attribution AAAI 2022 Image Manipulation Detection by Multi-View Multi-Scale Supervision ICCV 2021 Article Reranking by Memory-Enhanced Key Sentence Matching for Detecting Previously Fact-Checked Claims IJCNLP 2021 Article Reranking by Memory-Enhanced Key Sentence Matching for Detecting Previously Fact-Checked Claims ACL 2021 Dual Encoding for Zero-Example Video Retrieval CVPR 2019 Deep Text Classification Can be Fooled IJCAI 2018