Papers
Non-Determinism of “Deterministic” LLM System Settings in Hosted Environments
Berk Atıl, Sarp Aykent, Alexa Chittams et al.
Test Set Quality in Multilingual LLM Evaluation
Chalamalasetti Kranti, Gabriel Bernier-Colborne, Yvan Gauthier et al.
LLM Driven Legal Text Analytics: A Case Study For Food Safety Violation Cases
Suyog Joshi, Soumyajit Basu, Lipika Dey et al.
MEDEQUALQA: Evaluating Biases in LLMs with Counterfactual Reasoning
Rajarshi Ghosh, Abhay Gupta, Hudson McBride et al.
Reasoning-Enhanced Retrieval for Misconception Prediction: A RAG-Inspired Approach with LLMs
Chaudhary Divya, Chang Xue, Shaorui Sun
A benchmark for end-to-end zero-shot biomedical relation extraction with LLMs: experiments with OpenAI models
Aviv Brokman, Xuguang Ai, Yuhang Jiang et al.
Bridging the Gap: Instruction-Tuned LLMs for Scientific Named Entity Recognition
Necva Bölücü, Maciej Rybinski, Stephen Wan
A Hybrid LLM and Supervised Model Pipeline for Polymer Property Extraction from Tables in Scientific Literature
Van-Thuy Phi, Dinh-Truong Do, Hoang-An Trieu et al.
Structured Outputs in Prompt Engineering: Enhancing LLM Adaptability on Counterintuitive Instructions
Jingjing Ye, Song Bai, Zhenyang Li et al.
Citation Drift: Measuring Reference Stability in Multi-Turn LLM Conversations
Gokul Srinath Seetha Ram
CycleDistill: Bootstrapping Machine Translation using LLMs with Cyclical Distillation
Deepon Halder, Thanmay Jayakumar, Raj Dabre
Disentangling Mathematical Reasoning in LLMs: A Methodological Investigation of Internal Mechanisms
Tanja Baeumel, Josef van Genabith, Simon Ostermann
AriGraph: Learning Knowledge Graph World Models with Episodic Memory for LLM Agents
Petr Anokhin, Nikita Semenov, Artyom Sorokin et al.