Saumitra Yadav
10 papers · 2020–2025 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+6 more ↓ Show less ↑
π Interdisciplinary Bridge π Academic Marathon (5) π Conference Polyglot (3) π Renaissance Researcher (5) πΊοΈ Taxonomy Completionist (15)
π
Academic Marathon
(5)
π
Renaissance Researcher
(5)
π€
Dynamic Duo
(10)
β‘
Prolific Year
(6)
π
Century Club
(10)
β
The Questioner
Conferences
EMNLP (5)
AACL (2)
IJCNLP (2)
COLING (1)
Top co-authors
Keywords
low-resource language
(7)
machine translation
(5)
statistical machine translation
(4)
byte pair encoding
(3)
synthetic data generation
(2)
language model
(1)
synthetic datum
(1)
evaluation metric
(1)
subword tokenization
(1)
neural metric
(1)
sequence-to-sequence model
(1)
translation quality
(1)
phrase-based translation
(1)
low-resource translation
(1)
back translation
(1)
controlled generation
(1)
byte-pair encoding
(1)
subword segmentation
(1)
text segmentation
(1)
transformer model
(1)
Papers
Segmentation Beyond Defaults: Asymmetrical Byte Pair Encoding for Optimal Machine Translation Performance
AACL 2025
A3-108 at BHASHA Task1: Asymmetric BPE configuration for Grammar Error Correction
IJCNLP 2025
Segmentation Beyond Defaults: Asymmetrical Byte Pair Encoding for Optimal Machine Translation Performance
IJCNLP 2025
A3-108 at BHASHA Task1: Asymmetric BPE configuration for Grammar Error Correction
AACL 2025
Why should only High-Resource-Languages have all the fun? Pivot Based Evaluation in Low Resource Setting
COLING 2025
A Preliminary Exploration of Phrase-Based SMT and Multi-BPE Segmentations through Concatenated Tokenised Corpora for Low-Resource Indian Languages
EMNLP 2025
CoST of breaking the LLMs
EMNLP 2024
A3-108 Controlling Token Generation in Low Resource Machine Translation Systems
EMNLP 2024
A3-108 Machine Translation System for Similar Language Translation Shared Task 2021
EMNLP 2021
A3-108 Machine Translation System for Similar Language Translation Shared Task 2020
EMNLP 2020