Chi Chen
21 papers · 2021–2026 · 6 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+12 more ↓ Show less ↑
π Interdisciplinary Bridge π Academic Marathon (5) π Renaissance Researcher (6) π Conference Polyglot (5) πΊοΈ Taxonomy Completionist (35)
πΊοΈ
Taxonomy Completionist
(35)
π§
Keyword Pioneer
π£
Hot Topic Early Bird
π€
Dynamic Duo
(11)
π¬
Deep Specialist
(15)
π₯
Mega-Team
(25)
π§¬
Topic Evolution
π₯
Unstoppable
(5)
π
Century Club
(17)
β
The Questioner
(2)
β‘
Prolific Year
(9)
ποΈ
Keyword Collector
(84)
Conferences
ACL (12)
EMNLP (4)
ICCV (2)
AAAI (1)
CVPR (1)
IJCNLP (1)
Top co-authors
Keywords
multimodal large language model
(13)
multimodal learning
(5)
benchmark evaluation
(4)
large language model
(3)
visual reasoning
(3)
attention mechanism
(2)
word alignment
(2)
self-supervised learning
(2)
chart understanding
(2)
neural machine translation
(2)
reinforcement learning
(2)
code generation
(2)
visual question answering
(2)
active perception
(1)
video understanding
(1)
instruction following
(1)
masked language model
(1)
cross-modal retrieval
(1)
temporal grounding
(1)
context understanding
(1)
Papers
You Can Have a Second Chance: Unbiased and Multi-bit Watermarking for Diffusion Language Models with Regret-based Remasking
ACL 2026
MUSEG: Reinforcing Video Temporal Understanding via Timestamp-Aware Multi-Segment Grounding
ACL 2026
RSMeM: Knowledge-Enhanced Memory Evolution for Remote Sensing Agents with Systematic Evaluation
ACL 2026
LLaVA-UHD v2: Exploiting Hierarchical Vision Granularity in MLLMs via Inverse Semantic Pyramid
AAAI 2026
AgentCPM-GUI: Building Mobile-Use Agents with Reinforcement Fine-Tuning
EMNLP 2025
ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code Generation
ACL 2025
ActiView: Evaluating Active Perception Ability for Multimodal Large Language Models
ACL 2025
ChartEdit: How Far Are MLLMs From Automating Chart Analysis? Evaluating MLLMsβ Capability via Chart Editing
ACL 2025
Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models
ACL 2025
AdaMMS: Model Merging for Heterogeneous Multimodal Large Language Models with Unsupervised Coefficient Optimization
CVPR 2025
Jailbreaking Multimodal Large Language Models via Shuffle Inconsistency
ICCV 2025
How Do Multimodal Large Language Models Handle Complex Multimodal Reasoning? Placing Them in An Extensible Escape Game
ICCV 2025
Think in Safety: Unveiling and Mitigating Safety Alignment Collapse in Multimodal Large Reasoning Model
EMNLP 2025
Model Composition for Multimodal Large Language Models
ACL 2024
CODIS: Benchmarking Context-dependent Visual Comprehension for Multimodal Large Language Models
ACL 2024
Browse and Concentrate: Comprehending Multimodal Content via Prior-LLM Context Fusion
ACL 2024
Filling the Image Information Gap for VQA: Prompting Large Language Models to Proactively Ask Questions
EMNLP 2023
Weakly Supervised Vision-and-Language Pre-training with Relative Representations
ACL 2023
End-to-End Unsupervised Vision-and-Language Pre-training with Referring Expression Matching
EMNLP 2022
Mask-Align: Self-Supervised Neural Word Alignment
IJCNLP 2021
Mask-Align: Self-Supervised Neural Word Alignment
ACL 2021