Taro Watanabe
164 papers · 2002–2026 · 10 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+16 more ↓ Show less ↑
π Conference Polyglot (10) π§ Keyword Pioneer πΊοΈ Taxonomy Completionist (17) π Interdisciplinary Bridge π Academic Marathon (23)
π
Academic Marathon
(23)
πΊοΈ
Taxonomy Completionist
(17)
π§
Keyword Pioneer
π
Keyword Trendsetter Combo
(3)
π
Conference Loyalist
(37)
π
Keyword Champion
π₯
Mega-Team
(51)
π¬
Deep Specialist
(25)
π€
Dynamic Duo
(45)
π
Century Club
(151)
ποΈ
Keyword Collector
(466)
β
The Questioner
(6)
β‘
Prolific Year
(10)
π
Trend Setter
π₯
Unstoppable
(8)
π
Conference Pioneer
Conferences
ACL (51)
EMNLP (37)
COLING (22)
NAACL (16)
IJCNLP (15)
EACL (13)
AACL (4)
CONLL (4)
NIPS (1)
WACV (1)
Top co-authors
Research topics
Keywords
large language model
(24)
neural machine translation
(13)
machine translation
(13)
named entity recognition
(10)
language model
(9)
low-resource language
(7)
japanese language
(6)
multilingual nlp
(6)
grammatical error correction
(6)
text generation
(5)
multimodal learning
(5)
vision-language model
(5)
cross-lingual transfer
(5)
information extraction
(4)
neural network
(4)
translation quality
(4)
attention mechanism
(4)
vision language model
(4)
minimum bayes risk decoding
(4)
entity linking
(3)
Papers
entity-linkings: A Unified Library for Entity Linking
EACL 2026
Beyond Sampling: Self-Sorting for Long-Context Ranking
EACL 2026
Revisiting Non-Verbatim Memorization in Large Language Models: The Role of Entity Surface Forms
ACL 2026
HalluCitation Matters: Revealing the Impact of Hallucinated References with 300 Hallucinated Papers in ACL Conferences
ACL 2026
TableMBR: Minimum Bayes Risk Table Generation Based on Structural Consistency
ACL 2026
Diagnosing Vision Language Modelsβ Perception by Leveraging Human Methods for Color Vision Deficiencies
EACL 2026
βYuki Gets Sushi, David Gets Steak?β: Uncovering Gender and Racial Biases in LLM-Based Meal Recommendations
EACL 2026
Cosine Similarity as Logits?: A Scalable Knowledge Probe Using Embedding Vectors from Generative Language Models
EACL 2026
Measuring Linguistic Competence of LLMs on Indigenous Languages of the Americas
EACL 2026
Completely Modular Fine-tuning for Dynamic Language Adaptation
EACL 2026
Toward Automatic Delegation Extraction in Japanese Law
EACL 2026
Towards Singable Lyrics Translation Using Large Language Models
EACL 2026
Improving Language Transfer Capability of Decoder-only Architecture in Multilingual Neural Machine Translation
EMNLP 2025
Revisiting Compositional Generalization Capability of Large Language Models Considering Instruction Following Ability
ACL 2025
BQA: Body Language Question Answering Dataset for Video Large Language Models
ACL 2025
Rethinking Evaluation Metrics for Grammatical Error Correction: Why Use a Different Evaluation Process than Human?
ACL 2025
gec-metrics: A Unified Library for Grammatical Error Correction Evaluation
ACL 2025
Translating Movie Subtitles by Large Language Models using Movie-meta Information
ACL 2025
Improving Explainability of Sentence-level Metrics via Edit-level Attribution for Grammatical Error Correction
ACL 2025
IMPARA-GED: Grammatical Error Detection is Boosting Reference-free Grammatical Error Quality Estimator
ACL 2025
Dictionaries to the Rescue: Cross-Lingual Vocabulary Transfer for Low-Resource Languages Using Bilingual Dictionaries
ACL 2025
Understanding the Impact of Confidence in Retrieval Augmented Generation: A Case Study in the Medical Domain
ACL 2025
Superfluous Instruction: Vulnerabilities Stemming from Task-Specific Superficial Expressions in Instruction Templates
ACL 2025
AdTEC: A Unified Benchmark for Evaluating Text Quality in Search Engine Advertising
NAACL 2025
How to Make the Most of LLMsβ Grammatical Knowledge for Acceptability Judgments
NAACL 2025
WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines
NAACL 2025
Investigating Omission as a Latency Reduction Strategy in Simultaneous Speech Translation
IJCNLP 2025
Agreement-Constrained Probabilistic Minimum Bayes Risk Decoding
IJCNLP 2025
Knowledge Editing Induces Underconfidence in Language Models
EMNLP 2025
Reliability Crisis of Reference-free Metrics for Grammatical Error Correction
EMNLP 2025
BannerBench: Benchmarking Vision Language Models for Multi-Ad Selection with Human Preferences
EMNLP 2025
Decoding Uncertainty: The Impact of Decoding Strategies for Uncertainty Estimation in Large Language Models
EMNLP 2025
HLU: Human Vs LLM Generated Text Detection Dataset for Urdu at Multiple Granularities
COLING 2025
Measuring the Robustness of Reference-Free Dialogue Evaluation Systems
COLING 2025
A Text Embedding Model with Contrastive Example Mining for Point-of-Interest Geocoding
COLING 2025
Beyond Film Subtitles: Is YouTube the Best Approximation of Spoken Vocabulary?
COLING 2025
IRR: Image Review Ranking Framework for Evaluating Vision-Language Models
COLING 2025
JaCorpTrack: Corporate History Event Extraction for Tracking Organizational Changes
EMNLP 2025
LoCt-Instruct: An Automatic Pipeline for Constructing Datasets of Logical Continuous Instructions
EMNLP 2025
SinhalaMMLU: A Comprehensive Benchmark for Evaluating Multitask Language Understanding in Sinhala
EMNLP 2025
Multilingual Dialogue Generation and Localization with Dialogue Act Scripting
EMNLP 2025
Speaker Identification and Dataset Construction Using LLMs: A Case Study on Japanese Narratives
NAACL 2025
Leveraging Dictionaries and Grammar Rules for the Creation of Educational Materials for Indigenous Languages
NAACL 2025
Long-Tail Crisis in Nearest Neighbor Language Models
NAACL 2025
Efficient Nearest Neighbor based Uncertainty Estimation for Natural Language Processing Tasks
NAACL 2025
Towards Cross-Lingual Explanation of Artwork in Large-scale Vision Language Models
NAACL 2025
Reliability of Distribution Predictions by LLMs: Insights from Counterintuitive Pseudo-Distributions
NAACL 2025
Tonguescape: Exploring Language Models Understanding of Vowel Articulation
NAACL 2025
Agreement-Constrained Probabilistic Minimum Bayes Risk Decoding
AACL 2025
Investigating Omission as a Latency Reduction Strategy in Simultaneous Speech Translation
AACL 2025
Graph-Structured Trajectory Extraction from Travelogues
ACL 2025
Registering Source Tokens to Target Language Spaces in Multilingual Neural Machine Translation
ACL 2025
CoAM: Corpus of All-Type Multiword Expressions
ACL 2025
Diversity Explains Inference Scaling Laws: Through a Case Study of Minimum Bayes Risk Decoding
ACL 2025
Are Data Augmentation Methods in Named Entity Recognition Applicable for Uncertainty Estimation?
EMNLP 2024
Can Language Models Induce Grammatical Knowledge from Indirect Evidence?
EMNLP 2024
Exploring Intrinsic Language-specific Subspaces in Fine-tuning Multilingual Neural Machine Translation
EMNLP 2024
Attention Score is not All You Need for Token Importance Indicator in KV Cache Reduction: Value Also Matters
EMNLP 2024
Constructing Indonesian-English Travelogue Dataset
COLING 2024
Disentangling Pretrained Representation to Leverage Low-Resource Languages in Multilingual Machine Translation
COLING 2024
Japanese Rule-based Grapheme-to-phoneme Conversion System and Multilingual Named Entity Dataset with International Phonetic Alphabet
NAACL 2024
Applying Linguistic Expertise to LLMs for Educational Material Development in Indigenous Languages
NAACL 2024
Does Pre-trained Language Model Actually Infer Unseen Links in Knowledge Graph Completion?
NAACL 2024
JDocQA: Japanese Document Question Answering Dataset for Generative Language Models
COLING 2024
Monolingual Paraphrase Detection Corpus for Low Resource Pashto Language at Sentence Level
COLING 2024
Arukikata Travelogue Dataset with Geographic Entity Mention, Coreference, and Link Annotation
EACL 2024
Simul-MuST-C: Simultaneous Multilingual Speech Translation Corpus Using Large Language Model
EMNLP 2024
Simultaneous Interpretation Corpus Construction by Large Language Models in Distant Language Pair
EMNLP 2024
mbrs: A Library for Minimum Bayes Risk Decoding
EMNLP 2024
Toward the Evaluation of Large Language Models Considering Score Variance across Instruction Templates
EMNLP 2024
Cross-lingual Contextualized Phrase Retrieval
EMNLP 2024
Towards Artwork Explanation in Large-scale Vision Language Models
ACL 2024
Alignment-Based Decoding Policy for Low-Latency and Anticipation-Free Neural Japanese Input Method Editors
ACL 2024
TextBind: Multi-turn Interleaved Multimodal Instruction-following in the Wild
ACL 2024
Centroid-Based Efficient Minimum Bayes Risk Decoding
ACL 2024
mCSQA: Multilingual Commonsense Reasoning Dataset with Unified Creation Strategy by Language Models and Humans
ACL 2024
Modeling Overregularization in Children with Small Language Models
ACL 2024
Unified Interpretation of Smoothing Methods for Negative Sampling Loss Functions in Knowledge Graph Embedding
ACL 2024
Synthetic Context with LLM for Entity Linking from Scientific Tables
ACL 2024
Difficult for Whom? A Study of Japanese Lexical Complexity
EMNLP 2024
Evaluating Language Models in Location Referring Expression Extraction from Early Modern and Contemporary Japanese Texts
EMNLP 2024
Generating Diverse Translation with Perturbed kNN-MT
EACL 2024
Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective
NIPS 2023
24-bit Languages
AACL 2023
Model-based Subsampling for Knowledge Graph Completion
AACL 2023
Subset Retrieval Nearest Neighbor Machine Translation
ACL 2023
Table and Image Generation for Investigating Knowledge of Entities in Pre-trained Vision and Language Models
ACL 2023
Second Language Acquisition of Neural Language Models
ACL 2023
A Closer Look at k-Nearest Neighbors Grammatical Error Correction
ACL 2023
Japanese Lexical Complexity for Non-Native Readers: A New Dataset
ACL 2023
NAISTeacher: A Prompt and Rerank Approach to Generating Teacher Utterances in Educational Dialogues
ACL 2023
NAIST-NICT WMTβ23 General MT Task Submission
EMNLP 2023
Findings of the Word-Level AutoCompletion Shared Task in WMT 2023
EMNLP 2023
24-bit Languages
IJCNLP 2023
Model-based Subsampling for Knowledge Graph Completion
IJCNLP 2023
Switching to Discriminative Image Captioning by Relieving a Bottleneck of Reinforcement Learning
WACV 2023
NAIST-NICT-TIT WMT22 General MT Task Submission
EMNLP 2022
Residual Learning of Neural Text Generation with n-gram Language Model
EMNLP 2022
Visualizing the Relationship Between Encoded Linguistic Information and Task Performance
ACL 2022
What Works and Doesnβt Work, A Deep Decoder for Neural Machine Translation
ACL 2022
Adapting to Non-Centered Languages for Zero-shot Multilingual Translation
COLING 2022
Sharing Parameter by Conjugation for Knowledge Graph Embeddings in Complex Space
COLING 2022
Findings of the Word-Level AutoCompletion Shared Task in WMT 2022
EMNLP 2022
Improving Discriminative Learning for Zero-Shot Relation Extraction
ACL 2022
JADES: New Text Simplification Dataset in Japanese Targeted at Non-Native Speakers
EMNLP 2022
Transliteration for Low-Resource Code-Switching Texts: Building an Automatic Cyrillic-to-Latin Converter for Tatar
NAACL 2021
Neural Machine Translation with Synchronous Latent Phrase Structure
ACL 2021
Zero Pronouns Identification based on Span prediction
ACL 2021
Structured Refinement for Sequential Labeling
ACL 2021
Dependency Patterns of Complex Sentences and Semantic Disambiguation for Abstract Meaning Representation Parsing
ACL 2021
A Text Editing Approach to Joint Japanese Word Segmentation, POS Tagging, and Lexical Normalization
EMNLP 2021
Removing Word-Level Spurious Alignment between Images and Pseudo-Captions in Unsupervised Image Captioning
EACL 2021
Nested Named Entity Recognition via Explicitly Excluding the Influence of the Best Path
IJCNLP 2021
Neural Machine Translation with Synchronous Latent Phrase Structure
IJCNLP 2021
Zero Pronouns Identification based on Span prediction
IJCNLP 2021
Structured Refinement for Sequential Labeling
IJCNLP 2021
Dependency Patterns of Complex Sentences and Semantic Disambiguation for Abstract Meaning Representation Parsing
IJCNLP 2021
Nested Named Entity Recognition via Explicitly Excluding the Influence of the Best Path
ACL 2021
User-Generated Text Corpus for Evaluating Japanese Morphological Analysis and Lexical Normalization
NAACL 2021
Coordination Boundary Identification without Labeled Data for Compound Terms Disambiguation
COLING 2020
Denoising Neural Machine Translation Training with Trusted Data and Online Data Selection
EMNLP 2018
Proceedings of the Eighth International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
IJCNLP 2017
Proceedings of the Eighth International Joint Conference on Natural Language Processing (Volume 2: Short Papers)
IJCNLP 2017
Phrase-based Machine Translation using Multiple Preordering Candidates
COLING 2016
Transition-based Neural Constituent Parsing
IJCNLP 2015
Transition-based Neural Constituent Parsing
ACL 2015
Leave-one-out Word Alignment without Garbage Collector Effects
EMNLP 2015
Hierarchical Back-off Modeling of Hiero Grammar based on Non-parametric Bayesian Model
EMNLP 2015
Recurrent Neural Network-based Tuple Sequence Model for Machine Translation
COLING 2014
Recurrent Neural Networks for Word Alignment Model
ACL 2014
Unsupervised Word Alignment Using Frequency Constraint in Posterior Regularized EM
EMNLP 2014
Syntax-Augmented Machine Translation using Syntax-Label Clustering
EMNLP 2014
Additive Neural Networks for Statistical Machine Translation
ACL 2013
Tuning SMT with a Large Number of Features via Online Feature Grouping
IJCNLP 2013
Part-of-Speech Induction in Dependency Trees for Statistical Machine Translation
ACL 2013
Hierarchical Phrase Table Combination for Machine Translation
ACL 2013
Bilingual Lexicon Extraction from Comparable Corpora Using Label Propagation
CONLL 2012
Inducing a Discriminative Parser to Optimize Machine Translation Reordering
EMNLP 2012
Locally Training the Log-Linear Model for SMT
EMNLP 2012
Bilingual Lexicon Extraction from Comparable Corpora Using Label Propagation
EMNLP 2012
Inducing a Discriminative Parser to Optimize Machine Translation Reordering
CONLL 2012
Locally Training the Log-Linear Model for SMT
CONLL 2012
Expected Error Minimization with Ultraconservative Update for SMT
COLING 2012
Optimized Online Rank Learning for Machine Translation
NAACL 2012
Head-driven Transition-based Parsing with Top-down Prediction
ACL 2012
Machine Translation without Words through Substring Alignment
ACL 2012
Third-order Variational Reranking on Packed-Shared Dependency Forests
EMNLP 2011
Machine Translation System Combination by Confusion Forest
ACL 2011
An Unsupervised Model for Joint Phrase Alignment and Extraction
ACL 2011
A Succinct N-gram Language Model
ACL 2009
A Succinct N-gram Language Model
IJCNLP 2009
Online Large-Margin Training for Statistical Machine Translation
CONLL 2007
Online Large-Margin Training for Statistical Machine Translation
EMNLP 2007
Left-to-Right Target Generation for Hierarchical Phrase-Based Translation
COLING 2006
Left-to-Right Target Generation for Hierarchical Phrase-Based Translation
ACL 2006
Empirical Study of Utilizing Morph-Syntactic Information in SMT
IJCNLP 2005
A Unified Approach in Speech-to-Speech Translation: Integrating Features of Speech recognition and Machine Translation
COLING 2004
Reordering Constraints for Phrase-Based Statistical Machine Translation
COLING 2004
Example-based Machine Translation Based on Syntactic Transfer with Statistical Models
COLING 2004
A corpus-centered approach to spoken language translation
EACL 2003
Chunk-Based Statistical Translation
ACL 2003
Using Language and Translation Models to Select the Best among Outputs from Multiple MT Systems
COLING 2002
Bidirectional Decoding for Statistical Machine Translation
COLING 2002
Language Model Adaptation with Additional Text Generated by Machine Translation
COLING 2002