Guijin Son
14 papers · 2024–2025 · 5 top CS/AI conferences
Achievements
Cross-Pollinator (14)
Interdisciplinary Bridge
Keyword Pioneer
Conference Polyglot (5)
Taxonomy Completionist (21)
Mega-Team (32)
Deep Specialist (10)
Prolific Year (10)
The Questioner (2)
Century Club (14)
Conferences
ACL (4)
COLING (3)
EMNLP (3)
NAACL (3)
ICML (1)
Keywords
large language model (9)
korean language (4)
benchmark evaluation (3)
instruction following (2)
multilingual benchmark (2)
in-context learning (2)
question answering (2)
low-resource language (2)
chain-of-thought reasoning (2)
cross-lingual transfer (1)
preference optimization (1)
text generation (1)
language model evaluation (1)
text classification (1)
named entity recognition (1)
knowledge benchmark (1)
multilingual nlp (1)
mathematical problem solving (1)
inference efficiency (1)
instruction tuning (1)
Papers
Multi-Step Reasoning in Korean and the Emergent Mirage
NAACL 2025
Linguistic Generalizability of Test-Time Scaling in Mathematical Reasoning
ACL 2025
Controlling Language Confusion in Multilingual LLMs
ACL 2025
FINKRX: Establishing Best Practices for Korean Financial NLP
ACL 2025
Understand, Solve and Translate: Bridging the Multilingual Mathematical Reasoning Gap
EMNLP 2025
On the Robustness of Reward Models for Language Model Alignment
ICML 2025
KMMLU: Measuring Massive Multitask Language Understanding in Korean
NAACL 2025
The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models
NAACL 2025
Multi-LMentry: Can Multilingual LLMs Solve Elementary Tasks Across Languages?
EMNLP 2025
From KMMLU-Redux to Pro: A Professional Korean Benchmark Suite for LLM Evaluation
EMNLP 2025
HAE-RAE Bench: Evaluation of Korean Knowledge in Language Models
COLING 2024
KRX Bench: Automating Financial Benchmark Creation via Large Language Models
COLING 2024
ESG Classification by Implicit Rule Learning via GPT-4
COLING 2024
Multi-Task Inference: Can Large Language Models Follow Multiple Instructions at Once?
ACL 2024