Yekun Chai

24 papers · 2020–2025 · 9 top CS/AI conferences

Achievements

🏃 Academic Marathon (5) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (9) 🐣 Hot Topic Early Bird 🧬 Topic Evolution 👥 Mega-Team (41) 🤝 Dynamic Duo (10) 🗃️ Keyword Collector (111) ⚡ Prolific Year (8) 💎 Century Club (24) 🔥 Unstoppable (6)

Conferences

EMNLP (9) ACL (4) COLING (3) ICLR (2) NAACL (2) ICML (1) IJCNLP (1) NIPS (1) SEMEVAL (1)

Top co-authors

Yu Sun (10) Shuohuan Wang (10) Hua Wu (8) Qiwei Peng (6) Hao Tian (4) Xuhong Li (4) Haidong Zhang (2) Haifeng Wang (2) Guanghao Chen (2) Yitong Xu (2)

Keywords

large language model (7) language model (4) multilingual nlp (3) adversarial training (2) pre-trained language model (2) text classification (2) programming language (2) multilingual model (2) cross-lingual transfer (2) transfer learning (2) few-shot learning (2) multimodal learning (2) multilingual language model (2) code generation (2) transformer model (2) sarcasm detection (1) information retrieval (1) natural language processing (1) catastrophic forgetting (1) feature attribution (1)

Papers

CodeMixBench: Evaluating Code-Mixing Capabilities of LLMs Across 18 Languages · EMNLP 2025
Debiasing Multilingual LLMs in Cross-lingual Latent Space · EMNLP 2025
MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions · ICLR 2025
Curiosity-Driven Reinforcement Learning from Human Feedback · ACL 2025
Graph-Augmented Open-Domain Multi-Document Summarization · COLING 2025
Aurora-M: Open Source Continual Pre-training for Multilingual Language and Code · COLING 2025
EvolKV: Evolutionary KV Cache Compression for LLM Inference · EMNLP 2025
Understanding Subword Compositionality of Large Language Models · EMNLP 2025
Tool-Augmented Reward Modeling · ICLR 2024
GiLOT: Interpreting Generative Language Models via Optimal Transport · ICML 2024
Tokenization Falling Short: On Subword Robustness in Large Language Models · EMNLP 2024
HumanEval-XL: A Multilingual Code Generation Benchmark for Cross-lingual Natural Language Generalization · COLING 2024
Autoregressive Pre-Training on Pixels and Texts · EMNLP 2024
On Training Data Influence of GPT Models · EMNLP 2024
$\mathcal{M}^4$: A Unified XAI Benchmark for Faithfulness Evaluation of Feature Attribution Methods across Metrics, Modalities and Models · NIPS 2023
ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for Programming Languages · ACL 2023
ERNIE-Music: Text-to-Waveform Music Generation with Diffusion Models · IJCNLP 2023
X-PuDu at SemEval-2022 Task 6: Multilingual Learning for English and Arabic Sarcasm Detection · SEMEVAL 2022
Predicate-Argument Based Bi-Encoder for Paraphrase Identification · ACL 2022
X-PuDu at SemEval-2022 Task 6: Multilingual Learning for English and Arabic Sarcasm Detection · NAACL 2022
Clip-Tuning: Towards Derivative-free Prompt Learning with a Mixture of Rewards · EMNLP 2022
COIN: Conversational Interactive Networks for Emotion Recognition in Conversation · NAACL 2021
Counter-Contrastive Learning for Language GANs · EMNLP 2021
Highway Transformer: Self-Gating Enhanced Self-Attentive Networks · ACL 2020