Raj Dabre

87 papers · 2012–2026 · 12 top CS/AI conferences

Achievements

🐣 Hot Topic Early Bird 🧭 Keyword Pioneer 🗺️ Taxonomy Completionist (15) 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (11) 🏠 Conference Loyalist (25) 🤝 Dynamic Duo (24) 👥 Mega-Team (76) 🔬 Deep Specialist (57) 🏆 Keyword Champion (2) 📈 Trend Setter ⚡ Prolific Year (19) ❓ The Questioner (6) 🗃️ Keyword Collector (256) 💎 Century Club (83) 🚀 Conference Pioneer 🔥 Unstoppable (7)

Conferences

ACL (25) EMNLP (19) IJCNLP (11) AACL (10) COLING (10) NAACL (5) EACL (2) AAAI (1) CONLL (1) ICCV (1) INTERSPEECH (1) NIPS (1)

Top co-authors

Anoop Kunchukuttan (26) Haiyue Song (15) Chenhui Chu (12) Sadao Kurohashi (12) Ratish Puduppully (11) Ondřej Bojar (10) Hideki Tanaka (8) Masao Utiyama (8) Thanmay Jayakumar (8) Eiichiro Sumita (7)

Keywords

neural machine translation (28) machine translation (27) low-resource language (22) large language model (12) transfer learning (11) indic language (8) cross-lingual transfer (8) knowledge distillation (7) multilingual nlp (7) multilingual model (7) asian language (6) multilingual translation (6) indian language (5) domain adaptation (5) shared task (5) model compression (5) human evaluation (4) multilingual language model (4) automatic evaluation (4) pre-trained language model (3)

Papers

The Reasoning Lingua Franca: A Double-Edged Sword for Multilingual AI EACL 2026
RiddleBench: A New Generative Reasoning Benchmark for LLMs EACL 2026
CycleDistill: Bootstrapping Machine Translation using LLMs with Cyclical Distillation AACL 2025
Multilingual Iterative Model Pruning: What Matters? AACL 2025
PRALEKHA: Cross-Lingual Document Alignment for Indic Languages AACL 2025
Cross-Lingual Auto Evaluation for Assessing Multilingual LLMs ACL 2025
Towards Building Large Scale Datasets and State-of-the-Art Automatic Speech Translation Systems for 14 Indian Languages ACL 2025
Limited-Resource Adapters Are Regularizers, Not Linguists ACL 2025
RomanLens: The Role Of Latent Romanization In Multilinguality In LLMs ACL 2025
Findings of the IWSLT 2025 Evaluation Campaign ACL 2025
PrahokBART: A Pre-trained Sequence-to-Sequence Model for Khmer Natural Language Generation COLING 2025
Exploiting Word Sense Disambiguation in Large Language Models for Machine Translation COLING 2025
Data and Model Centric Approaches for Expansion of Large Language Models to New languages EMNLP 2025
CaMMT: Benchmarking Culturally Aware Multimodal Machine Translation EMNLP 2025
Findings of the First Shared Task for Creole Language Machine Translation at WMT25 EMNLP 2025
TikZero: Zero-Shot Text-Guided Graphics Program Synthesis ICCV 2025
Multilingual Iterative Model Pruning: What Matters? IJCNLP 2025
PRALEKHA: Cross-Lingual Document Alignment for Indic Languages IJCNLP 2025
Mark My Words: A Robust Multilingual Model for Punctuation in Text and Speech Transcripts IJCNLP 2025
CycleDistill: Bootstrapping Machine Translation using LLMs with Cyclical Distillation IJCNLP 2025
WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines NAACL 2025
Are Language Models Agnostic to Linguistically Grounded Perturbations? A Case Study of Indic Languages NAACL 2025
Mark My Words: A Robust Multilingual Model for Punctuation in Text and Speech Transcripts AACL 2025
A Morphology-Based Investigation of Positional Encodings EMNLP 2024
PUB: A Pragmatics Understanding Benchmark for Assessing LLMs’ Pragmatics Capabilities ACL 2024
NICT’s Cascaded and End-To-End Speech Translation Systems using Whisper and IndicTrans2 for the Indic Task ACL 2024
Kreyòl-MT: Building MT for Latin American, Caribbean and Colonial African Creole Languages NAACL 2024
IndicLLMSuite: A Blueprint for Creating Pre-training and Fine-Tuning Datasets for Indian Languages ACL 2024
How Good is Zero-Shot MT Evaluation for Low Resource Indian Languages? ACL 2024
An Empirical Study of In-context Learning in LLMs for Machine Translation ACL 2024
NGLUEni: Benchmarking and Adapting Pretrained Language Models for Nguni Languages COLING 2024
CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark NIPS 2024
An Empirical Comparison of Vocabulary Expansion and Initialization Approaches For Language Models CONLL 2024
Findings of WMT 2024’s MultiIndic22MT Shared Task for Machine Translation of 22 Indian Languages EMNLP 2024
Machine Translation Of Marathi Dialects: A Case Study Of Kadodi EMNLP 2024
Leveraging Adapters for Improved Cross-lingual Transfer for Low-Resource Creole MT EMNLP 2024
An Empirical Comparison of Vocabulary Expansion and Initialization Approaches For Language Models EMNLP 2024
Pretraining Language Models Using Translationese EMNLP 2024
RomanSetu: Efficiently unlocking multilingual capabilities of Large Language Models via Romanization ACL 2024
DecoMT: Decomposed Prompting for Machine Translation Between Related Languages using Large Language Models EMNLP 2023
CTQScorer: Combining Multiple Features for In-context Example Selection for Machine Translation EMNLP 2023
NICT-AI4B’s Submission to the Indic MT Shared Task in WMT 2023 EMNLP 2023
IndicMT Eval: A Dataset to Meta-Evaluate Machine Translation Metrics for Indian Languages ACL 2023
Robustness of Multi-Source MT to Transcription Errors ACL 2023
Developing State-Of-The-Art Massively Multilingual Machine Translation Systems for Related Languages AACL 2023
Developing State-Of-The-Art Massively Multilingual Machine Translation Systems for Related Languages IJCNLP 2023
Turning Whisper into Real-Time Transcription System IJCNLP 2023
MT Metrics Correlate with Human Ratings of Simultaneous Speech Translation ACL 2023
YANMTT: Yet Another Neural Machine Translation Toolkit ACL 2023
Exploring the Impact of Layer Normalization for Zero-shot Neural Machine Translation ACL 2023
A Multilingual Multiway Evaluation Data Set for Structured Document Translation of Asian Languages AACL 2022
BERTSeg: BERT Based Unsupervised Subword Segmentation for Neural Machine Translation IJCNLP 2022
FeatureBART: Feature Based Sequence-to-Sequence Pre-Training for Low-Resource NMT COLING 2022
Overview of the 9th Workshop on Asian Translation COLING 2022
NICT’s Submission to the WAT 2022 Structured Document Translation Task COLING 2022
IndicNLG Benchmark: Multilingual Datasets for Diverse NLG Tasks in Indic Languages EMNLP 2022
NICT at MixMT 2022: Synthetic Code-Mixed Pre-training and Multi-way Fine-tuning for Hinglish–English Translation EMNLP 2022
Fusion of Self-supervised Learned Models for MOS Prediction INTERSPEECH 2022
When do Contrastive Word Alignments Improve Many-to-many Neural Machine Translation? NAACL 2022
BERTSeg: BERT Based Unsupervised Subword Segmentation for Neural Machine Translation AACL 2022
KreolMorisienMT: A Dataset for Mauritian Creole Machine Translation AACL 2022
IndicBART: A Pre-trained Model for Indic Natural Language Generation ACL 2022
Overview of the 8th Workshop on Asian Translation ACL 2021
NICT-5’s Submission To WAT 2021: MBART Pre-training And In-Domain Fine Tuning For Indic Languages ACL 2021
NICT-5’s Submission To WAT 2021: MBART Pre-training And In-Domain Fine Tuning For Indic Languages IJCNLP 2021
Overview of the 8th Workshop on Asian Translation IJCNLP 2021
Pre-training via Leveraging Assisting Languages for Neural Machine Translation ACL 2020
Balancing Cost and Benefit with Tied-Multi Transformers ACL 2020
Harnessing Cross-lingual Features to Improve Cognate Detection for Low-resource Languages COLING 2020
Multilingual Neural Machine Translation COLING 2020
Overview of the 7th Workshop on Asian Translation AACL 2020
Improving Low-Resource NMT through Relevance Based Linguistic Features Incorporation COLING 2020
Combining Sequence Distillation and Transfer Learning for Efficient Low-Resource Neural Machine Translation Models EMNLP 2020
NICT’s Submission To WAT 2020: How Effective Are Simple Many-To-Many Neural Machine Translation Models? AACL 2020
Exploiting Multilingualism through Multistage Fine-Tuning for Low-Resource Neural Machine Translation EMNLP 2019
Recurrent Stacking of Layers for Compact Neural Machine Translation Models AAAI 2019
Exploiting Multilingualism through Multistage Fine-Tuning for Low-Resource Neural Machine Translation IJCNLP 2019
NICT’s Supervised Neural Machine Translation Systems for the WMT19 News Translation Task ACL 2019
NICT’s Supervised Neural Machine Translation Systems for the WMT19 Translation Robustness Task ACL 2019
NICT’s Machine Translation Systems for the WMT19 Similar Language Translation Task ACL 2019
NICT’s participation to WAT 2019: Multilingualism and Multi-step Fine-Tuning for Low Resource NMT EMNLP 2019
Overview of the 6th Workshop on Asian Translation EMNLP 2019
Proceedings of the 6th Workshop on Asian Translation EMNLP 2019
An Empirical Comparison of Domain Adaptation Methods for Neural Machine Translation ACL 2017
Neural Machine Translation: Basics, Practical Aspects and Recent Trends IJCNLP 2017
Leveraging Small Multilingual Corpora for SMT Using Many Pivot Languages NAACL 2015
Morphological Analyzer for Affix Stacking Languages: A Case Study of Marathi COLING 2012