Brian Chen
15 papers · 2019–2025 · 9 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+11 more ↓ Show less ↑
π Interdisciplinary Bridge π Renaissance Researcher (6) π Conference Polyglot (9) π Academic Marathon (6) πΊοΈ Taxonomy Completionist (28)
πΊοΈ
Taxonomy Completionist
(28)
π§
Keyword Pioneer
π£
Hot Topic Early Bird
π₯
Mega-Team
(26)
π§¬
Topic Evolution
π
Century Club
(15)
π
Trend Setter
π₯
Unstoppable
(7)
β
The Questioner
ποΈ
Keyword Collector
(83)
β‘
Prolific Year
(6)
Conferences
CVPR (3)
EMNLP (2)
ICCV (2)
INTERSPEECH (2)
WACV (2)
AAAI (1)
ACL (1)
MLHC (1)
NAACL (1)
Top co-authors
Keywords
self-supervised learning
(7)
multimodal learning
(5)
video retrieval
(4)
weakly supervised learning
(3)
contrastive learning
(2)
coreference resolution
(2)
instructional video
(2)
video understanding
(2)
zero-shot retrieval
(2)
zero-shot learning
(2)
video representation
(2)
knowledge extraction
(1)
cross-lingual transfer
(1)
audio-visual learning
(1)
information extraction
(1)
image retrieval
(1)
relation extraction
(1)
object tracking
(1)
action recognition
(1)
visual grounding
(1)
Papers
User-in-the-Loop Evaluation of Multimodal LLMs for Activity Assistance
WACV 2025
What When and Where? Self-Supervised Spatio-Temporal Grounding in Untrimmed Multi-Action Videos from Narrated Instructions
CVPR 2024
EgoTV: Egocentric Task Verification from Natural Language Task Descriptions
ICCV 2023
PreViTS: Contrastive Pretraining With Video Tracking Supervision
WACV 2023
Weakly-Supervised Temporal Article Grounding
EMNLP 2022
Everything at Once - Multi-Modal Fusion Transformer for Video Retrieval
CVPR 2022
Multimodal Clustering Networks for Self-Supervised Learning From Unlabeled Videos
ICCV 2021
Joint Multimedia Event Extraction from Video and Article
EMNLP 2021
Cascaded Multilingual Audio-Visual Learning from Videos
INTERSPEECH 2021
Detecting Atrial Fibrillation in ICU Telemetry data with Weak Labels
MLHC 2021
RESIN: A Dockerized Schema-Guided Cross-document Cross-lingual Cross-media Information Extraction and Event Tracking System
NAACL 2021
AVLnet: Learning Audio-Visual Language Representations from Instructional Videos
INTERSPEECH 2021
General Partial Label Learning via Dual Bipartite Graph Autoencoder
AAAI 2020
GAIA: A Fine-grained Multimedia Knowledge Extraction System
ACL 2020
Multi-Level Multimodal Common Semantic Space for Image-Phrase Grounding
CVPR 2019