Mayu Otani

21 papers · 2018–2026 · 7 top CS/AI conferences

Achievements


🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (7) 🏃 Academic Marathon (8) 🌍 Conference Polyglot (7) 🗺️ Taxonomy Completionist (42)

🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🧬 Topic Evolution 🤝 Dynamic Duo (13) ⚡ Prolific Year (6) 🚀 Conference Pioneer 💎 Century Club (21) 📈 Trend Setter 🗃️ Keyword Collector (89) 🔥 Unstoppable (7) ❓ The Questioner (2)

Conferences

CVPR (8) WACV (7) ECCV (2) AAAI (1) ACL (1) COLING (1) IJCNLP (1)

Top co-authors

Yuta Nakashima (13) Chenhui Chu (6) Noa Garcia (6) Esa Rahtu (4) Janne Heikkila (4) Riku Togashi (4) Naoto Inoue (4) Edgar Simo-Serra (3) Kota Yamaguchi (3) Kotaro Kikuchi (3)

Keywords

multimodal learning (6) visual question answering (3) generative model (3) semantic segmentation (2) attention supervision (2) feature extraction (2) video summarization (2) visual grounding (2) vector graphics (2) object detection (2) evaluation metric (2) video question answering (2) transfer learning (2) representation learning (2) attention mechanism (1) video classification (1) optimal transport (1) multi-task learning (1) cross-modal learning (1) image retrieval (1)

Papers

Robust Multimodal Emotion Recognition from Incomplete Modalities via Query-Based Unimodal and Cross-Modal Learning WACV 2026
Would Deep Generative Models Amplify Bias in Future Models? CVPR 2024
LayoutFlow: Flow Matching for Layout Generation ECCV 2024
Robust Nearest Neighbors for Source-Free Domain Adaptation under Class Distribution Shift ECCV 2024
Revisiting Pixel-Level Contrastive Pre-Training on Scene Images WACV 2024
Generative Colorization of Structured Mobile Web Pages WACV 2023
Contrastive Losses Are Natural Criteria for Unsupervised Video Summarization WACV 2023
LayoutDM: Discrete Diffusion Model for Controllable Layout Generation CVPR 2023
Toward Verifiable and Reproducible Human Evaluation for Text-to-Image Generation CVPR 2023
Towards Flexible Multi-Modal Document Models CVPR 2023
Color Recommendation for Vector Graphic Documents Based on Multi-Palette Representation WACV 2023
Does Robustness on ImageNet Transfer to Downstream Tasks? CVPR 2022
AxIoU: An Axiomatically Justified Measure for Video Moment Retrieval CVPR 2022
Optimal Correction Cost for Object Detection Evaluation CVPR 2022
The Laughing Machine: Predicting Humor in Video WACV 2021
Attending Self-Attention: A Case Study of Visually Grounded Supervision in Vision-and-Language Transformers IJCNLP 2021
Attending Self-Attention: A Case Study of Visually Grounded Supervision in Vision-and-Language Transformers ACL 2021
KnowIT VQA: Answering Knowledge-Based Questions about Videos AAAI 2020
BERT representations for Video Question Answering WACV 2020
Rethinking the Evaluation of Video Summaries CVPR 2019
iParaphrasing: Extracting Visually Grounded Paraphrases via an Image COLING 2018