Yuta Nakashima

42 papers · 2018–2026 · 14 conferences · across top CS/AI conferences

Achievements

+14 more ↓

🏃 Academic Marathon (7) 🌍 Conference Polyglot (14) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🐣 Hot Topic Early Bird

🐣 Hot Topic Early Bird 🧭 Keyword Pioneer 🏃 Academic Marathon (7) 🤝 Dynamic Duo (14) 🔬 Deep Specialist (10) 🧬 Topic Evolution 🏆 Keyword Champion (2) 🚀 Conference Pioneer ⚡ Prolific Year (5) ❓ The Questioner (2) 🗃️ Keyword Collector (167) 💎 Century Club (40) 🔥 Unstoppable (8) 📈 Trend Setter

Conferences

CVPR (9) WACV (8) ICCV (5) AAAI (4) EMNLP (3) ACL (2) COLING (2) IJCNLP (2) NAACL (2) AACL (1) ECCV (1) ICLR (1) MIDL (1) NIPS (1)

Top co-authors

Noa Garcia (14) Hajime Nagahara (13) Mayu Otani (13) Yusuke Hirota (9) Chenhui Chu (8) Bowen Wang (7) Tomoyuki Kajiwara (5) Liangzhi Li (5) Zhouqiang Jiang (4) Ryo Hachiuma (4)

Keywords

image captioning (8) multimodal learning (7) gender bia (4) societal bia (4) vision-language model (4) knowledge graph (3) sentiment analysis (3) bias mitigation (3) visual question answering (3) semantic segmentation (3) object detection (3) representation learning (3) multi-annotator learning (2) benchmark evaluation (2) explainable ai (2) image classification (2) visual grounding (2) video summarization (2) video classification (2) attention mechanism (2)

Papers

SimLabel: Similarity-Weighted Semi-supervision for Multi-annotator Learning with Missing Labels AAAI 2026 QuMAB: Query-based Multi-annotator Behavior Pattern Learning AAAI 2026 Paladin: Understanding Video Intentions in Political Advertisement Videos WACV 2025 LOTUS: A Leaderboard for Detailed Image Captioning from Quality to Societal Bias and User Preferences ACL 2025 No Annotations for Object Detection in Art through Stable Diffusion WACV 2025 ReLayout: Towards Real-World Document Understanding via Layout-enhanced Pre-training COLING 2025 Text Normalization for Sentiment Analysis in Japanese Social Media NAACL 2025 SANER: Annotation-free Societal Attribute Neutralizer for Debiasing CLIP ICLR 2025 Bias in Gender Bias Benchmarks: How Spurious Features Distort Evaluation ICCV 2025 Processing and acquisition traces in visual encoders: What does CLIP know about your camera? ICCV 2025 Taming the Untamed: Graph-Based Knowledge Retrieval and Reasoning for MLLMs to Conquer the Unknown ICCV 2025 Putting People in LLMs’ Shoes: Generating Better Answers via Question Rewriter AAAI 2025 Would Deep Generative Models Amplify Bias in Future Models? CVPR 2024 DiReCT: Diagnostic Reasoning for Clinical Notes via Large Language Models NIPS 2024 Resampled Datasets Are Not Enough: Mitigating Societal Bias Beyond Single Attributes EMNLP 2024 From Descriptive Richness to Bias: Unveiling the Dark Side of Generative Image Caption Enrichment EMNLP 2024 Instruct Me More! Random Prompting for Visual In-Context Learning WACV 2024 Revisiting Pixel-Level Contrastive Pre-Training on Scene Images WACV 2024 Learning Bottleneck Concepts in Image Classification CVPR 2023 Model-Agnostic Gender Debiased Image Captioning CVPR 2023 Toward Verifiable and Reproducible Human Evaluation for Text-to-Image Generation CVPR 2023 Uncurated Image-Text Datasets: Shedding Light on Demographic Bias CVPR 2023 Contrastive Losses Are Natural Criteria for Unsupervised Video Summarization WACV 2023 Emotional Intensity Estimation based on Writer’s Personality AACL 2022 AxIoU: An Axiomatically Justified Measure for Video Moment Retrieval CVPR 2022 Optimal Correction Cost for Object Detection Evaluation CVPR 2022 Quantifying Societal Bias Amplification in Image Captioning CVPR 2022 Emotional Intensity Estimation based on Writer’s Personality IJCNLP 2022 Explain Me the Painting: Multi-Topic Knowledgeable Art Description Generation ICCV 2021 SCOUTER: Slot Attention-Based Classifier for Explainable Image Recognition ICCV 2021 Attending Self-Attention: A Case Study of Visually Grounded Supervision in Vision-and-Language Transformers IJCNLP 2021 WRIME: A New Dataset for Emotional Intensity Estimation with Subjective and Objective Annotations NAACL 2021 Attending Self-Attention: A Case Study of Visually Grounded Supervision in Vision-and-Language Transformers ACL 2021 The Laughing Machine: Predicting Humor in Video WACV 2021 Knowledge-Based Video Question Answering with Unsupervised Scene Descriptions ECCV 2020 Joint Learning of Vessel Segmentation and Artery/Vein Classification with Post-processing MIDL 2020 BERT representations for Video Question Answering WACV 2020 IterNet: Retinal Image Segmentation Utilizing Structural Redundancy in Vessel Networks WACV 2020 KnowIT VQA: Answering Knowledge-Based Questions about Videos AAAI 2020 IDSOU at WNUT-2020 Task 2: Identification of Informative COVID-19 English Tweets EMNLP 2020 Rethinking the Evaluation of Video Summaries CVPR 2019 iParaphrasing: Extracting Visually Grounded Paraphrases via an Image COLING 2018