Yuta Nakashima
42 papers · 2018–2026 · 14 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+14 more ↓ Show less ↑
π Academic Marathon (7) π Conference Polyglot (14) π§ Keyword Pioneer π Interdisciplinary Bridge π£ Hot Topic Early Bird
π£
Hot Topic Early Bird
π§
Keyword Pioneer
π
Academic Marathon
(7)
π€
Dynamic Duo
(14)
π¬
Deep Specialist
(10)
π§¬
Topic Evolution
π
Keyword Champion
(2)
π
Conference Pioneer
β‘
Prolific Year
(5)
β
The Questioner
(2)
ποΈ
Keyword Collector
(167)
π
Century Club
(40)
π₯
Unstoppable
(8)
π
Trend Setter
Conferences
CVPR (9)
WACV (8)
ICCV (5)
AAAI (4)
EMNLP (3)
ACL (2)
COLING (2)
IJCNLP (2)
NAACL (2)
AACL (1)
ECCV (1)
ICLR (1)
MIDL (1)
NIPS (1)
Top co-authors
Keywords
image captioning
(8)
multimodal learning
(7)
gender bia
(4)
societal bia
(4)
vision-language model
(4)
knowledge graph
(3)
sentiment analysis
(3)
bias mitigation
(3)
visual question answering
(3)
semantic segmentation
(3)
object detection
(3)
representation learning
(3)
multi-annotator learning
(2)
benchmark evaluation
(2)
explainable ai
(2)
image classification
(2)
visual grounding
(2)
video summarization
(2)
video classification
(2)
attention mechanism
(2)
Papers
SimLabel: Similarity-Weighted Semi-supervision for Multi-annotator Learning with Missing Labels
AAAI 2026
QuMAB: Query-based Multi-annotator Behavior Pattern Learning
AAAI 2026
Paladin: Understanding Video Intentions in Political Advertisement Videos
WACV 2025
LOTUS: A Leaderboard for Detailed Image Captioning from Quality to Societal Bias and User Preferences
ACL 2025
No Annotations for Object Detection in Art through Stable Diffusion
WACV 2025
ReLayout: Towards Real-World Document Understanding via Layout-enhanced Pre-training
COLING 2025
Text Normalization for Sentiment Analysis in Japanese Social Media
NAACL 2025
SANER: Annotation-free Societal Attribute Neutralizer for Debiasing CLIP
ICLR 2025
Bias in Gender Bias Benchmarks: How Spurious Features Distort Evaluation
ICCV 2025
Processing and acquisition traces in visual encoders: What does CLIP know about your camera?
ICCV 2025
Taming the Untamed: Graph-Based Knowledge Retrieval and Reasoning for MLLMs to Conquer the Unknown
ICCV 2025
Putting People in LLMsβ Shoes: Generating Better Answers via Question Rewriter
AAAI 2025
Would Deep Generative Models Amplify Bias in Future Models?
CVPR 2024
DiReCT: Diagnostic Reasoning for Clinical Notes via Large Language Models
NIPS 2024
Resampled Datasets Are Not Enough: Mitigating Societal Bias Beyond Single Attributes
EMNLP 2024
From Descriptive Richness to Bias: Unveiling the Dark Side of Generative Image Caption Enrichment
EMNLP 2024
Instruct Me More! Random Prompting for Visual In-Context Learning
WACV 2024
Revisiting Pixel-Level Contrastive Pre-Training on Scene Images
WACV 2024
Learning Bottleneck Concepts in Image Classification
CVPR 2023
Model-Agnostic Gender Debiased Image Captioning
CVPR 2023
Toward Verifiable and Reproducible Human Evaluation for Text-to-Image Generation
CVPR 2023
Uncurated Image-Text Datasets: Shedding Light on Demographic Bias
CVPR 2023
Contrastive Losses Are Natural Criteria for Unsupervised Video Summarization
WACV 2023
Emotional Intensity Estimation based on Writerβs Personality
AACL 2022
AxIoU: An Axiomatically Justified Measure for Video Moment Retrieval
CVPR 2022
Optimal Correction Cost for Object Detection Evaluation
CVPR 2022
Quantifying Societal Bias Amplification in Image Captioning
CVPR 2022
Emotional Intensity Estimation based on Writerβs Personality
IJCNLP 2022
Explain Me the Painting: Multi-Topic Knowledgeable Art Description Generation
ICCV 2021
SCOUTER: Slot Attention-Based Classifier for Explainable Image Recognition
ICCV 2021
Attending Self-Attention: A Case Study of Visually Grounded Supervision in Vision-and-Language Transformers
IJCNLP 2021
WRIME: A New Dataset for Emotional Intensity Estimation with Subjective and Objective Annotations
NAACL 2021
Attending Self-Attention: A Case Study of Visually Grounded Supervision in Vision-and-Language Transformers
ACL 2021
The Laughing Machine: Predicting Humor in Video
WACV 2021
Knowledge-Based Video Question Answering with Unsupervised Scene Descriptions
ECCV 2020
Joint Learning of Vessel Segmentation and Artery/Vein Classification with Post-processing
MIDL 2020
BERT representations for Video Question Answering
WACV 2020
IterNet: Retinal Image Segmentation Utilizing Structural Redundancy in Vessel Networks
WACV 2020
KnowIT VQA: Answering Knowledge-Based Questions about Videos
AAAI 2020
IDSOU at WNUT-2020 Task 2: Identification of Informative COVID-19 English Tweets
EMNLP 2020
Rethinking the Evaluation of Video Summaries
CVPR 2019
iParaphrasing: Extracting Visually Grounded Paraphrases via an Image
COLING 2018