Yekun Chai
24 papers · 2020–2025 · 9 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+10 more ↓ Show less ↑
🏃 Academic Marathon (5) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (9) 🐣 Hot Topic Early Bird
🌉
Interdisciplinary Bridge
🌍
Conference Polyglot
(9)
🏃
Academic Marathon
(5)
🧬
Topic Evolution
👥
Mega-Team
(41)
🤝
Dynamic Duo
(10)
🗃️
Keyword Collector
(111)
⚡
Prolific Year
(8)
💎
Century Club
(24)
🔥
Unstoppable
(6)
Conferences
EMNLP (9)
ACL (4)
COLING (3)
ICLR (2)
NAACL (2)
ICML (1)
IJCNLP (1)
NIPS (1)
SEMEVAL (1)
Top co-authors
Keywords
large language model
(7)
language model
(4)
multilingual nlp
(3)
adversarial training
(2)
pre-trained language model
(2)
text classification
(2)
programming language
(2)
multilingual model
(2)
cross-lingual transfer
(2)
transfer learning
(2)
few-shot learning
(2)
multimodal learning
(2)
multilingual language model
(2)
code generation
(2)
transformer model
(2)
sarcasm detection
(1)
information retrieval
(1)
natural language processing
(1)
catastrophic forgetting
(1)
feature attribution
(1)
Papers
CodeMixBench: Evaluating Code-Mixing Capabilities of LLMs Across 18 Languages
EMNLP 2025
Debiasing Multilingual LLMs in Cross-lingual Latent Space
EMNLP 2025
MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions
ICLR 2025
Curiosity-Driven Reinforcement Learning from Human Feedback
ACL 2025
Graph-Augmented Open-Domain Multi-Document Summarization
COLING 2025
Aurora-M: Open Source Continual Pre-training for Multilingual Language and Code
COLING 2025
EvolKV: Evolutionary KV Cache Compression for LLM Inference
EMNLP 2025
Understanding Subword Compositionality of Large Language Models
EMNLP 2025
Tool-Augmented Reward Modeling
ICLR 2024
GiLOT: Interpreting Generative Language Models via Optimal Transport
ICML 2024
Tokenization Falling Short: On Subword Robustness in Large Language Models
EMNLP 2024
HumanEval-XL: A Multilingual Code Generation Benchmark for Cross-lingual Natural Language Generalization
COLING 2024
Autoregressive Pre-Training on Pixels and Texts
EMNLP 2024
On Training Data Influence of GPT Models
EMNLP 2024
$\mathcal{M}^4$: A Unified XAI Benchmark for Faithfulness Evaluation of Feature Attribution Methods across Metrics, Modalities and Models
NIPS 2023
ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for Programming Languages
ACL 2023
ERNIE-Music: Text-to-Waveform Music Generation with Diffusion Models
IJCNLP 2023
X-PuDu at SemEval-2022 Task 6: Multilingual Learning for English and Arabic Sarcasm Detection
SEMEVAL 2022
Predicate-Argument Based Bi-Encoder for Paraphrase Identification
ACL 2022
X-PuDu at SemEval-2022 Task 6: Multilingual Learning for English and Arabic Sarcasm Detection
NAACL 2022
Clip-Tuning: Towards Derivative-free Prompt Learning with a Mixture of Rewards
EMNLP 2022
COIN: Conversational Interactive Networks for Emotion Recognition in Conversation
NAACL 2021
Counter-Contrastive Learning for Language GANs
EMNLP 2021
Highway Transformer: Self-Gating Enhanced Self-Attentive Networks
ACL 2020