Jinfa Huang

16 papers · 2020–2026 · 9 top CS/AI conferences

Achievements

🏃 Academic Marathon (5) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (9) 🐝 Cross-Pollinator (5) 🔬 Deep Specialist (11) 🧬 Topic Evolution 🔥 Unstoppable (5) 💎 Century Club (15) ⚡ Prolific Year (6) 🗃️ Keyword Collector (60)

Conferences

ICLR (3) AAAI (2) COLING (2) CVPR (2) EMNLP (2) NIPS (2) ACL (1) IJCAI (1) SEMEVAL (1)

Top co-authors

Peng Jin (6) Li Yuan (6) Jiebo Luo (5) Jie Chen (3) Shaofeng Zhang (3) Yingmei Guo (2) Ge Li (2) Mingxing Xu (2) Zhongwei Wan (2) Haoran Tang (2)

Keywords

text-video retrieval (3) vision-language model (2) text-to-video generation (2) contrastive learning (2) attention mechanism (1) prompt engineering (1) ensemble learning (1) expectation maximization (1) sentiment analysis (1) cross-modal learning (1) video understanding (1) efficient computing (1) cross-modal representation (1) task mapping (1) large multimodal model (1) contextual reasoning (1) ensemble method (1) cross-modal alignment (1) parameter-efficient fine-tuning (1) disentangled representation (1)

Papers

QuoTA: Query-oriented Token Assignment via CoT Query Decouple for Long Video Comprehension · AAAI 2026
CR2PQ: Continuous Relative Rotary Positional Query for Dense Visual Representation Learning · ICLR 2025
TACO: Enhancing Multimodal In-context Learning via Task Mapping-Guided Sequence Configuration · EMNLP 2025
Identity-Preserving Text-to-Video Generation by Frequency Decomposition · CVPR 2025
Evolver: Chain-of-Evolution Prompting to Boost Large Multimodal Models for Hateful Meme Detection · COLING 2025
Reti-Diff: Illumination Degradation Image Restoration with Retinex-based Latent Diffusion Model · ICLR 2025
MUSE: Mamba Is Efficient Multi-scale Learner for Text-video Retrieval · AAAI 2025
LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inference · EMNLP 2024
ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation · NIPS 2024
RAP: Efficient Text-Video Retrieval with Sparse-and-Correlated Adapter · ACL 2024
Continuous-Multiple Image Outpainting in One-Step via Positional Query and A Diffusion-based Approach · ICLR 2024
Video-Text As Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning · CVPR 2023
Text-Video Retrieval with Disentangled Conceptualization and Set-to-Set Alignment · IJCAI 2023
Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations · NIPS 2022
Guoym at SemEval-2020 Task 8: Ensemble-based Classification of Visuo-Lingual Metaphor in Memes · SEMEVAL 2020
Guoym at SemEval-2020 Task 8: Ensemble-based Classification of Visuo-Lingual Metaphor in Memes · COLING 2020