Mayu Otani
21 papers · 2018–2026 · 7 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+12 more ↓ Show less ↑
π Interdisciplinary Bridge π Renaissance Researcher (7) π Academic Marathon (8) π Conference Polyglot (7) πΊοΈ Taxonomy Completionist (42)
πΊοΈ
Taxonomy Completionist
(42)
π§
Keyword Pioneer
π£
Hot Topic Early Bird
π§¬
Topic Evolution
π€
Dynamic Duo
(13)
β‘
Prolific Year
(6)
π
Conference Pioneer
π
Century Club
(21)
π
Trend Setter
ποΈ
Keyword Collector
(89)
π₯
Unstoppable
(7)
β
The Questioner
(2)
Conferences
CVPR (8)
WACV (7)
ECCV (2)
AAAI (1)
ACL (1)
COLING (1)
IJCNLP (1)
Top co-authors
Keywords
multimodal learning
(6)
visual question answering
(3)
generative model
(3)
semantic segmentation
(2)
attention supervision
(2)
feature extraction
(2)
video summarization
(2)
visual grounding
(2)
vector graphics
(2)
object detection
(2)
evaluation metric
(2)
video question answering
(2)
transfer learning
(2)
representation learning
(2)
attention mechanism
(1)
video classification
(1)
optimal transport
(1)
multi-task learning
(1)
cross-modal learning
(1)
image retrieval
(1)
Papers
Robust Multimodal Emotion Recognition from Incomplete Modalities via Query-Based Unimodal and Cross-Modal Learning
WACV 2026
Would Deep Generative Models Amplify Bias in Future Models?
CVPR 2024
LayoutFlow: Flow Matching for Layout Generation
ECCV 2024
Robust Nearest Neighbors for Source-Free Domain Adaptation under Class Distribution Shift
ECCV 2024
Revisiting Pixel-Level Contrastive Pre-Training on Scene Images
WACV 2024
Generative Colorization of Structured Mobile Web Pages
WACV 2023
Contrastive Losses Are Natural Criteria for Unsupervised Video Summarization
WACV 2023
LayoutDM: Discrete Diffusion Model for Controllable Layout Generation
CVPR 2023
Toward Verifiable and Reproducible Human Evaluation for Text-to-Image Generation
CVPR 2023
Towards Flexible Multi-Modal Document Models
CVPR 2023
Color Recommendation for Vector Graphic Documents Based on Multi-Palette Representation
WACV 2023
Does Robustness on ImageNet Transfer to Downstream Tasks?
CVPR 2022
AxIoU: An Axiomatically Justified Measure for Video Moment Retrieval
CVPR 2022
Optimal Correction Cost for Object Detection Evaluation
CVPR 2022
The Laughing Machine: Predicting Humor in Video
WACV 2021
Attending Self-Attention: A Case Study of Visually Grounded Supervision in Vision-and-Language Transformers
IJCNLP 2021
Attending Self-Attention: A Case Study of Visually Grounded Supervision in Vision-and-Language Transformers
ACL 2021
KnowIT VQA: Answering Knowledge-Based Questions about Videos
AAAI 2020
BERT representations for Video Question Answering
WACV 2020
Rethinking the Evaluation of Video Summaries
CVPR 2019
iParaphrasing: Extracting Visually Grounded Paraphrases via an Image
COLING 2018