Xirong Li
20 papers · 2018–2025 · 9 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+8 more ↓ Show less ↑
π Interdisciplinary Bridge π Conference Polyglot (9) π Academic Marathon (7) π Renaissance Researcher (7) πΊοΈ Taxonomy Completionist (56)
π£
Hot Topic Early Bird
π
Conference Polyglot
(9)
π
Academic Marathon
(7)
π§¬
Topic Evolution
π
Century Club
(20)
β‘
Prolific Year
(7)
π₯
Unstoppable
(5)
ποΈ
Keyword Collector
(104)
Conferences
ICCV (6)
AAAI (3)
CVPR (3)
ACL (2)
ECCV (2)
COLING (1)
IJCAI (1)
IJCNLP (1)
MICCAI (1)
Top co-authors
Keywords
cross-modal matching
(3)
cross-modal retrieval
(3)
image classification
(2)
multimodal learning
(2)
multimodal large language model
(2)
video understanding
(2)
image manipulation detection
(2)
clip model
(2)
text-to-video retrieval
(2)
document reranking
(2)
claim detection
(2)
visual question answering
(1)
image generation
(1)
preference learning
(1)
object detection
(1)
text classification
(1)
feature learning
(1)
computer vision
(1)
feature extraction
(1)
semantic segmentation
(1)
Papers
Multi-Object Sketch Animation by Scene Decomposition and Motion Planning
ICCV 2025
Music Grounding by Short Video
ICCV 2025
PhD: A ChatGPT-Prompted Visual Hallucination Evaluation Dataset
CVPR 2025
D&M: Enriching E-commerce Videos with Sound Effects by Key Moment Detection and SFX Matching
AAAI 2025
FunBench: Benchmarking Fundus Reading Skills of MLLMs
MICCAI 2025
Hybrid-Tower: Fine-grained Pseudo-query Interaction and Generation for Text-to-Video Retrieval
ICCV 2025
Mitigating Hallucination in Multimodal Large Language Model via Hallucination-targeted Direct Preference Optimization
ACL 2025
Tackling Long Code Search with Splitting, Encoding, and Aggregating
COLING 2024
Holistic Features are almost Sufficient for Text-to-Video Retrieval
CVPR 2024
Geometrized Transformer for Self-Supervised Homography Estimation
ICCV 2023
SAFL-Net: Semantic-Agnostic Feature Learning Network with Auxiliary Plugins for Image Manipulation Detection
ICCV 2023
Semi-Supervised Keypoint Detector and Descriptor for Retinal Image Matching
ECCV 2022
DRAG: Dynamic Region-Aware GCN for Privacy-Leaking Image Detection
AAAI 2022
Lightweight Attentional Feature Fusion: A New Baseline for Text-to-Video Retrieval
ECCV 2022
Deepfake Network Architecture Attribution
AAAI 2022
Image Manipulation Detection by Multi-View Multi-Scale Supervision
ICCV 2021
Article Reranking by Memory-Enhanced Key Sentence Matching for Detecting Previously Fact-Checked Claims
IJCNLP 2021
Article Reranking by Memory-Enhanced Key Sentence Matching for Detecting Previously Fact-Checked Claims
ACL 2021
Dual Encoding for Zero-Example Video Retrieval
CVPR 2019
Deep Text Classification Can be Fooled
IJCAI 2018