conftrace_

Guijin Son

14 papers · 2024–2025 · 5 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+6 more ↓

🐝 Cross-Pollinator (14) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (5) 🗺️ Taxonomy Completionist (21)

🗺️ Taxonomy Completionist (21) 👥 Mega-Team (32) 🔬 Deep Specialist (10) ⚡ Prolific Year (10) ❓ The Questioner (2) 💎 Century Club (14)

Conferences

ACL (4) COLING (3) EMNLP (3) NAACL (3) ICML (1)

Top co-authors

Hyunwoo Ko (5) Seungone Kim (3) Dasol Choi (2) Hanwool Lee (2) James Thorne (2) Jiwoo Hong (2) Chami Hwang (2) Noah Lee (2) Hanearl Jung (2) Luca Moroni (1)

Keywords

large language model (9) korean language (4) benchmark evaluation (3) instruction following (2) multilingual benchmark (2) in-context learning (2) question answering (2) low-resource language (2) chain-of-thought reasoning (2) cross-lingual transfer (1) preference optimization (1) text generation (1) language model evaluation (1) text classification (1) named entity recognition (1) knowledge benchmark (1) multilingual nlp (1) mathematical problem solving (1) inference efficiency (1) instruction tuning (1)

Papers

Multi-Step Reasoning in Korean and the Emergent Mirage NAACL 2025 Linguistic Generalizability of Test-Time Scaling in Mathematical Reasoning ACL 2025 Controlling Language Confusion in Multilingual LLMs ACL 2025 FINKRX: Establishing Best Practices for Korean Financial NLP ACL 2025 Understand, Solve and Translate: Bridging the Multilingual Mathematical Reasoning Gap EMNLP 2025 On the Robustness of Reward Models for Language Model Alignment ICML 2025 KMMLU: Measuring Massive Multitask Language Understanding in Korean NAACL 2025 The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models NAACL 2025 Multi-LMentry: Can Multilingual LLMs Solve Elementary Tasks Across Languages? EMNLP 2025 From KMMLU-Redux to Pro: A Professional Korean Benchmark Suite for LLM Evaluation EMNLP 2025 HAE-RAE Bench: Evaluation of Korean Knowledge in Language Models COLING 2024 KRX Bench: Automating Financial Benchmark Creation via Large Language Models COLING 2024 ESG Classification by Implicit Rule Learning via GPT-4 COLING 2024 Multi-Task Inference: Can Large Language Models Follow Multiple Instructions at Once? ACL 2024