Papers
FINEST: Improving LLM Responses to Sensitive Topics Through Fine-Grained Evaluation
Juhyun Oh, Nayeon Lee, Chani Jung et al.
Turn-PPO: Turn-Level Advantage Estimation with PPO for Improved Multi-Turn RL in Agentic LLMs
Junbo Li, Peng Zhou, Rui Meng et al.
Decoding Time Series with LLMs: A Multi-Agent Framework for Cross-Domain Annotation
Minhua Lin, Zhengzhang Chen, Yanchi Liu et al.
Learning to Judge: LLMs Designing and Applying Evaluation Rubrics
Clemencia Siro, Pourya Aliannejadi, Mohammad Aliannejadi
Visual–Linguistic Abductive Reasoning with LLMs for Knowledge-based Visual Question Answering
Jieun Kim, Yujin Jeong, Sung-Bae Cho
MapCoder-Lite: Distilling Multi-Agent Coding into a Single Small LLM
Woongkyu Lee, Junhee Cho, Jungwook Choi
CrisiText: A dataset of warning messages for LLM training in emergency communication
Giacomo Gonella, Gian Maria Campedelli, Stefano Menini et al.
Cards Against Contamination: TCG-Bench for Difficulty-Scalable Multilingual LLM Reasoning
Sultan AlRashed, Jianghui Wang, Francesco Orabona
Arabic Dialect Translation with Small LLMs: Enhancing through Reasoning-Oriented Reinforcement Learning
Sohaila Abdulsattar, Keith Ross
Enhancing Urdu Sentiment Classification through Instruction-Tuned LLMs and Cross-Lingual Transfer
Hasan Faraz Khan, Noor Fatima, Irfan Ahmad
Current state of LLMs for Arabic dialectal machine translation
Josef Jon, Rawan Bondok, Ondřej Bojar
Reasoning Beyond Labels: Measuring LLM Sentiment in Low-Resource, Culturally Nuanced Contexts
Millicent Ochieng, Anja Thieme, Ignatius Ezeani et al.
Synthetic Data Generation Pipeline for Low-Resource Swahili Sentiment Analysis: Multi-LLM Judging with Human Validation
Samuel Gyamfi, Alfred Malengo Kondoro, Yankı Öztürk et al.
Building a Conversational AI Assistant for African Travel Services with LLMs and RAG
Grace Kevine Ngoufo, Shamsuddeen Hassan Muhammad, Kevin Jeff Fogang Fokoa
Hybrid Neural-LLM Pipeline for Morphological Glossing in Endangered Language Documentation: A Case Study of Jungar Tuvan
Siyu Liang, Talant Mawkanuli, Gina-Anne Levow
Do Mixed-Vendor Multi-Agent LLMs Improve Clinical Diagnosis?
Grace Chang Yuan, Xiaoman Zhang, Sung Eun Kim et al.
Graph-Enhanced LLM Analysis of Multimodal Health Communities: A Computational Framework for Patient Discourse Understanding on TikTok
Tawakalit Agboola, Oluwaseun Ajao
Normalizing Health Concepts with Biomedical Embedding and LLMs
Iram Azam, Keyuan Jiang, Gordon Bernard
LLM Plug-ins Are Not a Free Lunch for Clinical Time-Series Prediction
Juhwan Choi, Kwanhyung Lee, Sangchul Hahn et al.
Why Are We Lonely? Leveraging LLMs to Measure and Understand Loneliness in Caregivers and Non-caregivers
Michelle Damin Kim, Ellie S. Paek, Yufen Lin et al.
Studying Expert-ese: Profiling and Classification of Domain-Specific Language Variation in Architecture with Traditional Machine Learning and LLMs
Carmen Schacht, Renate Delucchi Danhier
LLMs Got Rhyme? Hybrid Phonological Filtering for Greek Poetry Rhyme Detection and Generation
Stergios Chatzikyriakidis, Anastasia Natsina
Measuring Social Integration Through Participation: Categorizing Organizations and Leisure Activities in the Displaced Karelians Interview Archive using LLMs
Joonatan Laato, Veera Schroderus, Jenna Kanerva et al.