Raj Dabre
87 papers · 2012–2026 · 12 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+15 more ↓ Show less ↑
🐣 Hot Topic Early Bird 🧭 Keyword Pioneer 🗺️ Taxonomy Completionist (15) 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (11)
🗺️
Taxonomy Completionist
(15)
🧭
Keyword Pioneer
🐣
Hot Topic Early Bird
🏠
Conference Loyalist
(25)
🤝
Dynamic Duo
(24)
👥
Mega-Team
(76)
🔬
Deep Specialist
(57)
🏆
Keyword Champion
(2)
📈
Trend Setter
⚡
Prolific Year
(19)
❓
The Questioner
(6)
🗃️
Keyword Collector
(256)
💎
Century Club
(83)
🚀
Conference Pioneer
🔥
Unstoppable
(7)
Conferences
ACL (25)
EMNLP (19)
IJCNLP (11)
AACL (10)
COLING (10)
NAACL (5)
EACL (2)
AAAI (1)
CONLL (1)
ICCV (1)
INTERSPEECH (1)
NIPS (1)
Top co-authors
Keywords
neural machine translation
(28)
machine translation
(27)
low-resource language
(22)
large language model
(12)
transfer learning
(11)
indic language
(8)
cross-lingual transfer
(8)
knowledge distillation
(7)
multilingual nlp
(7)
multilingual model
(7)
asian language
(6)
multilingual translation
(6)
indian language
(5)
domain adaptation
(5)
shared task
(5)
model compression
(5)
human evaluation
(4)
multilingual language model
(4)
automatic evaluation
(4)
pre-trained language model
(3)
Papers
The Reasoning Lingua Franca: A Double-Edged Sword for Multilingual AI
EACL 2026
RiddleBench: A New Generative Reasoning Benchmark for LLMs
EACL 2026
CycleDistill: Bootstrapping Machine Translation using LLMs with Cyclical Distillation
AACL 2025
Multilingual Iterative Model Pruning: What Matters?
AACL 2025
PRALEKHA: Cross-Lingual Document Alignment for Indic Languages
AACL 2025
Cross-Lingual Auto Evaluation for Assessing Multilingual LLMs
ACL 2025
Towards Building Large Scale Datasets and State-of-the-Art Automatic Speech Translation Systems for 14 Indian Languages
ACL 2025
Limited-Resource Adapters Are Regularizers, Not Linguists
ACL 2025
RomanLens: The Role Of Latent Romanization In Multilinguality In LLMs
ACL 2025
Findings of the IWSLT 2025 Evaluation Campaign
ACL 2025
PrahokBART: A Pre-trained Sequence-to-Sequence Model for Khmer Natural Language Generation
COLING 2025
Exploiting Word Sense Disambiguation in Large Language Models for Machine Translation
COLING 2025
Data and Model Centric Approaches for Expansion of Large Language Models to New languages
EMNLP 2025
CaMMT: Benchmarking Culturally Aware Multimodal Machine Translation
EMNLP 2025
Findings of the First Shared Task for Creole Language Machine Translation at WMT25
EMNLP 2025
TikZero: Zero-Shot Text-Guided Graphics Program Synthesis
ICCV 2025
Multilingual Iterative Model Pruning: What Matters?
IJCNLP 2025
PRALEKHA: Cross-Lingual Document Alignment for Indic Languages
IJCNLP 2025
Mark My Words: A Robust Multilingual Model for Punctuation in Text and Speech Transcripts
IJCNLP 2025
CycleDistill: Bootstrapping Machine Translation using LLMs with Cyclical Distillation
IJCNLP 2025
WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines
NAACL 2025
Are Language Models Agnostic to Linguistically Grounded Perturbations? A Case Study of Indic Languages
NAACL 2025
Mark My Words: A Robust Multilingual Model for Punctuation in Text and Speech Transcripts
AACL 2025
A Morphology-Based Investigation of Positional Encodings
EMNLP 2024
PUB: A Pragmatics Understanding Benchmark for Assessing LLMs’ Pragmatics Capabilities
ACL 2024
NICT’s Cascaded and End-To-End Speech Translation Systems using Whisper and IndicTrans2 for the Indic Task
ACL 2024
Kreyòl-MT: Building MT for Latin American, Caribbean and Colonial African Creole Languages
NAACL 2024
IndicLLMSuite: A Blueprint for Creating Pre-training and Fine-Tuning Datasets for Indian Languages
ACL 2024
How Good is Zero-Shot MT Evaluation for Low Resource Indian Languages?
ACL 2024
An Empirical Study of In-context Learning in LLMs for Machine Translation
ACL 2024
NGLUEni: Benchmarking and Adapting Pretrained Language Models for Nguni Languages
COLING 2024
CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark
NIPS 2024
An Empirical Comparison of Vocabulary Expansion and Initialization Approaches For Language Models
CONLL 2024
Findings of WMT 2024’s MultiIndic22MT Shared Task for Machine Translation of 22 Indian Languages
EMNLP 2024
Machine Translation Of Marathi Dialects: A Case Study Of Kadodi
EMNLP 2024
Leveraging Adapters for Improved Cross-lingual Transfer for Low-Resource Creole MT
EMNLP 2024
An Empirical Comparison of Vocabulary Expansion and Initialization Approaches For Language Models
EMNLP 2024
Pretraining Language Models Using Translationese
EMNLP 2024
RomanSetu: Efficiently unlocking multilingual capabilities of Large Language Models via Romanization
ACL 2024
DecoMT: Decomposed Prompting for Machine Translation Between Related Languages using Large Language Models
EMNLP 2023
CTQScorer: Combining Multiple Features for In-context Example Selection for Machine Translation
EMNLP 2023
NICT-AI4B’s Submission to the Indic MT Shared Task in WMT 2023
EMNLP 2023
IndicMT Eval: A Dataset to Meta-Evaluate Machine Translation Metrics for Indian Languages
ACL 2023
Robustness of Multi-Source MT to Transcription Errors
ACL 2023
Developing State-Of-The-Art Massively Multilingual Machine Translation Systems for Related Languages
AACL 2023
Developing State-Of-The-Art Massively Multilingual Machine Translation Systems for Related Languages
IJCNLP 2023
Turning Whisper into Real-Time Transcription System
IJCNLP 2023
MT Metrics Correlate with Human Ratings of Simultaneous Speech Translation
ACL 2023
YANMTT: Yet Another Neural Machine Translation Toolkit
ACL 2023
Exploring the Impact of Layer Normalization for Zero-shot Neural Machine Translation
ACL 2023
A Multilingual Multiway Evaluation Data Set for Structured Document Translation of Asian Languages
AACL 2022
BERTSeg: BERT Based Unsupervised Subword Segmentation for Neural Machine Translation
IJCNLP 2022
FeatureBART: Feature Based Sequence-to-Sequence Pre-Training for Low-Resource NMT
COLING 2022
Overview of the 9th Workshop on Asian Translation
COLING 2022
NICT’s Submission to the WAT 2022 Structured Document Translation Task
COLING 2022
IndicNLG Benchmark: Multilingual Datasets for Diverse NLG Tasks in Indic Languages
EMNLP 2022
NICT at MixMT 2022: Synthetic Code-Mixed Pre-training and Multi-way Fine-tuning for Hinglish–English Translation
EMNLP 2022
Fusion of Self-supervised Learned Models for MOS Prediction
INTERSPEECH 2022
When do Contrastive Word Alignments Improve Many-to-many Neural Machine Translation?
NAACL 2022
BERTSeg: BERT Based Unsupervised Subword Segmentation for Neural Machine Translation
AACL 2022
KreolMorisienMT: A Dataset for Mauritian Creole Machine Translation
AACL 2022
IndicBART: A Pre-trained Model for Indic Natural Language Generation
ACL 2022
Overview of the 8th Workshop on Asian Translation
ACL 2021
NICT-5’s Submission To WAT 2021: MBART Pre-training And In-Domain Fine Tuning For Indic Languages
ACL 2021
NICT-5’s Submission To WAT 2021: MBART Pre-training And In-Domain Fine Tuning For Indic Languages
IJCNLP 2021
Overview of the 8th Workshop on Asian Translation
IJCNLP 2021
Pre-training via Leveraging Assisting Languages for Neural Machine Translation
ACL 2020
Balancing Cost and Benefit with Tied-Multi Transformers
ACL 2020
Harnessing Cross-lingual Features to Improve Cognate Detection for Low-resource Languages
COLING 2020
Multilingual Neural Machine Translation
COLING 2020
Overview of the 7th Workshop on Asian Translation
AACL 2020
Improving Low-Resource NMT through Relevance Based Linguistic Features Incorporation
COLING 2020
Combining Sequence Distillation and Transfer Learning for Efficient Low-Resource Neural Machine Translation Models
EMNLP 2020
NICT‘s Submission To WAT 2020: How Effective Are Simple Many-To-Many Neural Machine Translation Models?
AACL 2020
Exploiting Multilingualism through Multistage Fine-Tuning for Low-Resource Neural Machine Translation
EMNLP 2019
Recurrent Stacking of Layers for Compact Neural Machine Translation Models
AAAI 2019
Exploiting Multilingualism through Multistage Fine-Tuning for Low-Resource Neural Machine Translation
IJCNLP 2019
NICT’s Supervised Neural Machine Translation Systems for the WMT19 News Translation Task
ACL 2019
NICT’s Supervised Neural Machine Translation Systems for the WMT19 Translation Robustness Task
ACL 2019
NICT’s Machine Translation Systems for the WMT19 Similar Language Translation Task
ACL 2019
NICT’s participation to WAT 2019: Multilingualism and Multi-step Fine-Tuning for Low Resource NMT
EMNLP 2019
Overview of the 6th Workshop on Asian Translation
EMNLP 2019
Proceedings of the 6th Workshop on Asian Translation
EMNLP 2019
An Empirical Comparison of Domain Adaptation Methods for Neural Machine Translation
ACL 2017
Neural Machine Translation: Basics, Practical Aspects and Recent Trends
IJCNLP 2017
Leveraging Small Multilingual Corpora for SMT Using Many Pivot Languages
NAACL 2015
Morphological Analyzer for Affix Stacking Languages: A Case Study of Marathi
COLING 2012