Papers
Adaptive LLM Routing under Budget Constraints
Pranoy Panda, Raghav Magazine, Chaitanya Devaguptapu et al.
Can Federated Learning Safeguard Private Data in LLM Training? Vulnerabilities, Attacks, and Defense Evaluation
Wenkai Guo, Xuefeng Liu, Haolin Wang et al.
Efficient Latent Semantic Clustering for Scaling Test-Time Computation of LLMs
Sungjae Lee, Hoyoung Kim, Jeongyeon Hwang et al.
Under the Shadow of Babel: How Language Shapes Reasoning in LLMs
Chenxi Wang, Yixuan Zhang, Lang Gao et al.
Exploring Context Strategies in LLMs for Discourse-Aware Machine Translation
Ritvik Choudhary, Rem Hida, Masaki Hamada et al.
Improving Preference Alignment of LLM with Inference-Free Self-Refinement
Fukun Ma, Kaibin Tian, Jieting Xue et al.
Do Before You Judge: Self-Reference as a Pathway to Better LLM Evaluation
Wei-Hsiang Lin, Sheng-Lun Wei, Hen-Hsen Huang et al.
FLAMES: Improving LLM Math Reasoning via a Fine-Grained Analysis of the Data Synthesis Pipeline
Parker Seegmiller, Kartik Mehta, Soumya Saha et al.
DIPLomA: Efficient Adaptation of Instructed LLMs to Low-Resource Languages via Post-Training Delta Merging
Ixak Sarasua, Ander Corral, Xabier Saralegi
Are LLMs Empathetic to All? Investigating the Influence of Multi-Demographic Personas on a Model’s Empathy
Ananya Malik, Nazanin Sabri, Melissa M. Karnaze et al.
Uncertainty-Aware Answer Selection for Improved Reasoning in Multi-LLM Systems
Aakriti Agrawal, Rohith Aralikatti, Anirudh Satheesh et al.
RAC: Efficient LLM Factuality Correction with Retrieval Augmentation
Changmao Li, Jeffrey Flanigan
UIPE: Enhancing LLM Unlearning by Removing Knowledge Related to Forgetting Targets
Wenyu Wang, Mengqi Zhang, Xiaotian Ye et al.
SparsePO: Controlling Preference Alignment of LLMs via Sparse Token Masks
Fenia Christopoulou, Ronald Cardenas, Gerasimos Lampouras et al.
Low-Resource Languages LLM Disinformation is Within Reach: The Case of Walliserdeutsch
Andrei Kucharavy, Sherine Seppey, Cyril Vallez et al.
Can We Edit LLMs for Long-Tail Biomedical Knowledge?
Xinhao Yi, Jake Lever, Kevin Bryson et al.
Cache Saver: A Modular Framework for Efficient, Affordable, and Reproducible LLM Inference
Nearchos Potamitis, Lars Henning Klein, Bardia Mohammadi et al.
Evaluating Cultural Knowledge and Reasoning in LLMs Through Persian Allusions
Melika Nobakhtian, Yadollah Yaghoobzadeh, Mohammad Taher Pilehvar
Saudi-Alignment Benchmark: Assessing LLMs Alignment with Cultural Norms and Domain Knowledge in the Saudi Context
Manal Alhassoun, Imaan Mohammed Alkhanen, Nouf Alshalawi et al.
AraHalluEval: A Fine-grained Hallucination Evaluation Framework for Arabic LLMs
Aisha Alansari, Hamzah Luqman
WojoodOntology: Ontology-Driven LLM Prompting for Unified Information Extraction Tasks
Alaa Aljabari, Nagham Hamad, Mohammed Khalilia et al.
Can LLMs Directly Retrieve Passages for Answering Questions from Qur’an?
Sohaila Eltanbouly, Salam Albatarni, Shaimaa Hassanein et al.
Zero-Shot and Fine-Tuned Evaluation of Generative LLMs for Arabic Word Sense Disambiguation
Yossra Noureldien, Abdelrazig Mohamed, Farah Attallah
Bridging Dialectal Gaps in Arabic Medical LLMs through Model Merging
Ahmed Ibrahim, Abdullah Hosseini, Hoda Helmy et al.
Tool Calling for Arabic LLMs: Data Strategies and Instruction Tuning
Asım Ersoy, Enes Altinisik, Kareem Mohamed Darwish et al.