Brian Chen

15 papers · 2019–2025 · 9 top CS/AI conferences

Achievements

🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (6) 🌍 Conference Polyglot (9) 🏃 Academic Marathon (6) 🗺️ Taxonomy Completionist (28)

🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 👥 Mega-Team (26) 🧬 Topic Evolution 💎 Century Club (15) 📈 Trend Setter 🔥 Unstoppable (7) ❓ The Questioner 🗃️ Keyword Collector (83) ⚡ Prolific Year (6)

Conferences

CVPR (3) EMNLP (2) ICCV (2) INTERSPEECH (2) WACV (2) AAAI (1) ACL (1) MLHC (1) NAACL (1)

Top co-authors

Shih-fu Chang (9) Hilde Kuehne (5) James Glass (5) Andrew Rouditchenko (5) Samuel Thomas (5) Brian Kingsbury (4) Rogerio Feris (4) Heng Ji (4) David Harwath (4) Angie Boggust (3)

Keywords

self-supervised learning (7) multimodal learning (5) video retrieval (4) weakly supervised learning (3) contrastive learning (2) coreference resolution (2) instructional video (2) video understanding (2) zero-shot retrieval (2) zero-shot learning (2) video representation (2) knowledge extraction (1) cross-lingual transfer (1) audio-visual learning (1) information extraction (1) image retrieval (1) relation extraction (1) object tracking (1) action recognition (1) visual grounding (1)

Papers

User-in-the-Loop Evaluation of Multimodal LLMs for Activity Assistance · WACV 2025
What When and Where? Self-Supervised Spatio-Temporal Grounding in Untrimmed Multi-Action Videos from Narrated Instructions · CVPR 2024
EgoTV: Egocentric Task Verification from Natural Language Task Descriptions · ICCV 2023
PreViTS: Contrastive Pretraining With Video Tracking Supervision · WACV 2023
Weakly-Supervised Temporal Article Grounding · EMNLP 2022
Everything at Once - Multi-Modal Fusion Transformer for Video Retrieval · CVPR 2022
Multimodal Clustering Networks for Self-Supervised Learning From Unlabeled Videos · ICCV 2021
Joint Multimedia Event Extraction from Video and Article · EMNLP 2021
Cascaded Multilingual Audio-Visual Learning from Videos · INTERSPEECH 2021
Detecting Atrial Fibrillation in ICU Telemetry data with Weak Labels · MLHC 2021
RESIN: A Dockerized Schema-Guided Cross-document Cross-lingual Cross-media Information Extraction and Event Tracking System · NAACL 2021
AVLnet: Learning Audio-Visual Language Representations from Instructional Videos · INTERSPEECH 2021
General Partial Label Learning via Dual Bipartite Graph Autoencoder · AAAI 2020
GAIA: A Fine-grained Multimedia Knowledge Extraction System · ACL 2020
Multi-Level Multimodal Common Semantic Space for Image-Phrase Grounding · CVPR 2019