Papers
1,125 papers found
Simulating Training Data Leakage in Multiple-Choice Benchmarks for LLM Evaluation
Naila Shafirni Hidayat, Muhammad Dehan Al Kautsar, Alfan Farizki Wicaksono et al.
SLM-SQL: An Exploration of Small Language Models for Text-to-SQL
Lei Sheng, Xu Shuai Shuai
Small Changes, Large Consequences: Analyzing the Allocational Fairness of LLMs in Hiring Contexts
Preethi Seshadri, Hongyu Chen, Sameer Singh et al.
Smruti: Grammatical Error Correction for Gujarati using LLMs with Non-Parametric Memory
Vrund Dobariya, Jatayu Baxi, Bhavika Gambhava et al.
SmurfCat at SHROOM-CAP: Factual but Awkward? Fluent but Wrong? Tackling Both in LLM Scientific QA
Timur Ionov, Evgenii Nikolaev, Artem Vazhentsev et al.
Social Bias in Popular Question-Answering Benchmarks
Angelie Kraft, Judith Simon, Sonja Schimmler
SOMAJGYAAN: A Dataset for Evaluating LLMs on Bangla Culture, Social Knowledge, and Low-Resource Language Adaptation
Fariha Anjum Shifa, Muhtasim Ibteda Shochcho, Abdullah Ibne Hanif Arean et al.
Source Attribution for Large Language Models
Vipula Rawte, Koustava Goswami, Puneet Mathur et al.
Spatial-Aware Visual Program Guided Reasoning for Answering Complex Visual Questions
Haoran Wang, Kai Shu
Speaking the Right Language: The Impact of Expertise (Mis)Alignment in User-AI Interactions
Shramay Palta, Nirupama Chandrasekaran, Rachel Rudinger et al.
Speak & Spell: LLM-Driven Controllable Phonetic Error Augmentation for Robust Dialogue State Tracking
Jihyun Lee, Solee Im, Wonjun Lee et al.
Speech-to-Speech Machine Translation for Dialectal Variations of Hindi
Sanmay Sood, Siddharth Rajput, Md Shad Akhtar
SPORTSQL: An Interactive System for Real-Time Sports Reasoning and Visualization
Sebastian Martinez, Naman Ahuja, Fenil Bardoliya et al.
Stacked LoRA: Isolated Low-Rank Adaptation for Lifelong Knowledge Management
Heramb Vivek Patil, Vaishnavee Sanam, Minakshi Pradeep Atre
StanceMining: An open-source stance detection library supporting time-series and visualization
Benjamin Steel, Derek Ruths
Standardizing Heterogeneous Corpora with DUUR: A Dual Data- and Process-Oriented Approach to Enhancing NLP Pipeline Integration
Leon Lukas Hammerla, Alexander Mehler, Giuseppe Abrami
Standardizing the Measurement of Text Diversity: A Tool and Comparative Analysis
Chantal Shaib, Venkata S Govindarajan, Joe Barrow et al.
STAR: Self-Automated Back-Querying for Production Data Generation
Kellen Tan Cheng, Anna Lisa Gentile, Chad DeLuca et al.
Still Not There: Can LLMs Outperform Smaller Task-Specific Seq2Seq Models on the Poetry-to-Prose Conversion Task?
Kunal Kingkar Das, Manoj Balaji Jagadeeshan, Nallani Chakravartula Sahith et al.
Structure-Aware Chunking for Abstractive Summarization of Long Legal Documents
Himadri Sonowal, Saisab Sadhu
Structured Document Translation via Format Reinforcement Learning
Haiyue Song, Johannes Eschbach-Dymanus, Hour Kaing et al.
Structured Outputs in Prompt Engineering: Enhancing LLM Adaptability on Counterintuitive Instructions
Jingjing Ye, Song Bai, Zhenyang Li et al.
StuD: A Multimodal Approach for Stuttering Detection with RAG and Fusion Strategies
Pragya Khanna, Priyanka Kommagouni, Vamshi Raghu Simha Narasinga et al.
Supporting Plain Language Summarization of Psychological Meta-Analyses with Large Language Models
Yarik Menchaca Resendiz, Martin Kerwer, Anita Chasiotis et al.
Surprisal Dynamics for the Detection of Multi-Word Expressions in English
Diego Alves, Sergei Bagdasarov, Elke Teich