Jinfa Huang
16 papers · 2020–2026 · 9 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+9 more ↓ Show less ↑
🏃 Academic Marathon (5) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (9) 🐝 Cross-Pollinator (5)
🏃
Academic Marathon
(5)
🐝
Cross-Pollinator
(5)
🧭
Keyword Pioneer
🔬
Deep Specialist
(11)
🧬
Topic Evolution
🔥
Unstoppable
(5)
💎
Century Club
(15)
⚡
Prolific Year
(6)
🗃️
Keyword Collector
(60)
Conferences
ICLR (3)
AAAI (2)
COLING (2)
CVPR (2)
EMNLP (2)
NIPS (2)
ACL (1)
IJCAI (1)
SEMEVAL (1)
Top co-authors
Keywords
text-video retrieval
(3)
vision-language model
(2)
text-to-video generation
(2)
contrastive learning
(2)
attention mechanism
(1)
prompt engineering
(1)
ensemble learning
(1)
expectation maximization
(1)
sentiment analysis
(1)
cross-modal learning
(1)
video understanding
(1)
efficient computing
(1)
cross-modal representation
(1)
task mapping
(1)
large multimodal model
(1)
contextual reasoning
(1)
ensemble method
(1)
cross-modal alignment
(1)
parameter-efficient fine-tuning
(1)
disentangled representation
(1)
Papers
QuoTA: Query-oriented Token Assignment via CoT Query Decouple for Long Video Comprehension
AAAI 2026
CR2PQ: Continuous Relative Rotary Positional Query for Dense Visual Representation Learning
ICLR 2025
TACO: Enhancing Multimodal In-context Learning via Task Mapping-Guided Sequence Configuration
EMNLP 2025
Identity-Preserving Text-to-Video Generation by Frequency Decomposition
CVPR 2025
Evolver: Chain-of-Evolution Prompting to Boost Large Multimodal Models for Hateful Meme Detection
COLING 2025
Reti-Diff: Illumination Degradation Image Restoration with Retinex-based Latent Diffusion Model
ICLR 2025
MUSE: Mamba Is Efficient Multi-scale Learner for Text-video Retrieval
AAAI 2025
LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inference
EMNLP 2024
ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation
NIPS 2024
RAP: Efficient Text-Video Retrieval with Sparse-and-Correlated Adapter
ACL 2024
Continuous-Multiple Image Outpainting in One-Step via Positional Query and A Diffusion-based Approach
ICLR 2024
Video-Text As Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning
CVPR 2023
Text-Video Retrieval with Disentangled Conceptualization and Set-to-Set Alignment
IJCAI 2023
Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations
NIPS 2022
Guoym at SemEval-2020 Task 8: Ensemble-based Classification of Visuo-Lingual Metaphor in Memes
SEMEVAL 2020
Guoym at SemEval-2020 Task 8: Ensemble-based Classification of Visuo-Lingual Metaphor in Memes
COLING 2020