Jan Christian Blaise Cruz
20 papers · 2021–2025 · 7 conferences · across top CS/AI conferences
Achievements
🐝 Cross-Pollinator (14)
🌍 Conference Polyglot (7)
🌉 Interdisciplinary Bridge
🧭 Keyword Pioneer
🌈 Renaissance Researcher (6)
🤝 Dynamic Duo (11)
👥 Mega-Team (92)
🔥 Unstoppable (5)
💎 Century Club (20)
⚡ Prolific Year (7)
🗃️ Keyword Collector (81)
❓ The Questioner (2)
Conferences
EMNLP (11)
AACL (2)
IJCNLP (2)
NAACL (2)
ACL (1)
COLING (1)
NeurIPS (1)
Research topics
Keywords
machine translation (6)
multilingual nlp (4)
low-resource language (3)
vision language model (3)
multilingual translation (2)
neural machine translation (2)
noisy channel reranking (2)
zero-shot prompting (2)
multilingual large language model (2)
large language model (2)
multimodal learning (2)
low-resource translation (2)
visual question answering (2)
text generation (1)
knowledge distillation (1)
prompt engineering (1)
dataset creation (1)
benchmark evaluation (1)
word sense disambiguation (1)
social intelligence (1)
Papers
FilBench: Can LLMs Understand and Generate Filipino?
EMNLP 2025
Thank You, Stingray: Multilingual Large Language Models Can Not (Yet) Disambiguate Cross-Lingual Word Senses
NAACL 2025
WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines
NAACL 2025
Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia
ACL 2025
Extracting General-use Transformers for Low-resource Languages via Knowledge Distillation
COLING 2025
MoMentS: A Comprehensive Multimodal Benchmark for Theory of Mind
EMNLP 2025
CaMMT: Benchmarking Culturally Aware Multimodal Machine Translation
EMNLP 2025
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages
EMNLP 2024
CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark
NeurIPS 2024
Samsung R&D Institute Philippines @ WMT 2024 Indic MT Task
EMNLP 2024
Samsung R&D Institute Philippines @ WMT 2024 Low-resource Languages of Spain Shared Task
EMNLP 2024
Towards Automatic Construction of Filipino WordNet: Word Sense Induction and Synset Induction Using Sentence Embeddings
AACL 2023
Current Status of NLP in South East Asia with Insights from Multilingualism and Language Diversity
AACL 2023
Multilingual Large Language Models Are Not (Yet) Code-Switchers
EMNLP 2023
Samsung R&D Institute Philippines at WMT 2023
EMNLP 2023
Towards Automatic Construction of Filipino WordNet: Word Sense Induction and Synset Induction Using Sentence Embeddings
IJCNLP 2023
Current Status of NLP in South East Asia with Insights from Multilingualism and Language Diversity
IJCNLP 2023
Prompting Multilingual Large Language Models to Generate Code-Mixed Texts: The Case of South East Asian Languages
EMNLP 2023
Samsung Research Philippines - Datasaur AI’s Submission for the WMT22 Large Scale Multilingual Translation Task
EMNLP 2022
Data Processing Matters: SRPH-Konvergen AI’s Machine Translation System for WMT’21
EMNLP 2021