Yungi Kim
10 papers · 2024–2025 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+5 more ↓ Show less ↑
π£ Hot Topic Early Bird π Conference Polyglot (4) π Cross-Pollinator (12) π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (21)
π§
Keyword Pioneer
π€
Dynamic Duo
(10)
β‘
Prolific Year
(6)
π
Century Club
(10)
ποΈ
Keyword Collector
(52)
Conferences
EMNLP (3)
NAACL (3)
ACL (2)
COLING (2)
Top co-authors
Keywords
large language model
(9)
data quality
(2)
data pipeline
(2)
data filtering
(2)
korean language
(2)
model evaluation
(1)
chain-of-thought reasoning
(1)
ensemble learning
(1)
continued pretraining
(1)
web corpus
(1)
instruction tuning
(1)
evaluation framework
(1)
sequential learning
(1)
language model
(1)
direct preference optimization
(1)
model scaling
(1)
language model evaluation
(1)
evaluation benchmark
(1)
large language model evaluation
(1)
model ensemble
(1)
Papers
LP Data Pipeline: Lightweight, Purpose-driven Data Pipeline for Large Language Models
EMNLP 2025
sDPO: Donβt Use Your Data All at Once
COLING 2025
Open Ko-LLM Leaderboard2: Bridging Foundational and Practical Evaluation for Korean LLMs
NAACL 2025
Dataverse: Open-Source ETL (Extract, Transform, Load) Pipeline for Large Language Models
NAACL 2025
Rethinking KenLM: Good and Bad Model Ensembles for Efficient Text Quality Filtering in Large Web Corpora
ACL 2025
Representing the Under-Represented: Cultural and Core Capability Benchmarks for Developing Thai Large Language Models
COLING 2025
SAAS: Solving Ability Amplification Strategy for Enhanced Mathematical Reasoning in Large Language Models
EMNLP 2024
Evalverse: Unified and Accessible Library for Large Language Model Evaluation
EMNLP 2024
Open Ko-LLM Leaderboard: Evaluating Large Language Models in Korean with Ko-H5 Benchmark
ACL 2024
SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling
NAACL 2024