Papers
Are LLMs Truly Graph-Savvy? A Comprehensive Evaluation of Graph Generation
Ege Demirci, Rithwik Kerur, Ambuj Singh
Foundations of PEERS: Assessing LLM Role Performance in Educational Simulations
Jasper Meynard Arana, Kristine Ann M. Carandang, Ethan Robert Casin et al.
Building Japanese Creativity Benchmarks and Applying them to Enhance LLM Creativity
So Fukuda, Hayato Ogawa, Kaito Horio et al.
Controlling Language Confusion in Multilingual LLMs
Nahyun Lee, Yeongseo Woo, Hyunwoo Ko et al.
Unstructured Minds, Predictable Machines: A Comparative Study of Narrative Cohesion in Human and LLM Stream-of-Consciousness Writing
Nellia Dzhubaeva, Katharina Trinley, Laura Pissani
Exploiting contextual information to improve stance detection in informal political discourse with LLMs
Arman Engin Sucu, Yixiang Zhou, Mario A. Nascimento et al.
GenDLN: Evolutionary Algorithm-Based Stacked LLM Framework for Joint Prompt Optimization
Pia Chouayfati, Niklas Herbster, Ábel Domonkos Sáfrán et al.
Speculative Reward Model Boosts Decision Making Ability of LLMs Cost-Effectively
Jiawei Gu, Shangsong Liang
CoAlign: Uncertainty Calibration of LLM for Geospatial Repartition
Zejun Xie, Zhiqing Hong, Wenjun Lyu et al.
Efficient Out-of-Scope Detection in Dialogue Systems via Uncertainty-Driven LLM Routing
Álvaro Zaera, Diana Nicoleta Popa, Ivan Sekulic et al.
A Perspective on LLM Data Generation with Few-shot Examples: from Intent to Kubernetes Manifest
Antonino Angi, Liubov Nedoshivina, Alessio Sacco et al.
Enriching children’s stories with LLMs: Delivering multilingual data enrichment for children’s books at scale and across markets
Zarah Weiss, Christof Meyer, Mikael Andersson
Semantic Outlier Removal with Embedding Models and LLMs
Eren Akbiyik, João F. M. De Almeida, Rik Melis et al.
LEAP & LEAN: Look-ahead Planning and Agile Navigation for LLM Agents
Nikhil Verma, Manasa Bharadwaj
SQLGenie: A Practical LLM based System for Reliable and Efficient SQL Generation
Pushpendu Ghosh, Aryan Jain, Promod Yenigalla
Domain Adaptation of Foundation LLMs for e-Commerce
Christian Herold, Michael Kozielski, Tala Bazazo et al.
One Missing Piece for Open-Source Reasoning Models: A Dataset to Mitigate Cold-Starting Short CoT LLMs in RL
Hyungjoo Chae, Dongjin Kang, Jihyuk Kim et al.
A Parallelized Framework for Simulating Large-Scale LLM Agents with Realistic Environments and Interactions
Jun Zhang, Yuwei Yan, Junbo Yan et al.
ENGinius: A Bilingual LLM Optimized for Plant Construction Engineering
Wooseong Lee, Minseo Kim, Taeil Hur et al.
Are LLMs reliable? An exploration of the reliability of large language models in clinical note generation
Kristine Ann M. Carandang, Jasper Meynard Arana, Ethan Robert Casin et al.
EcoDoc: A Cost-Efficient Multimodal Document Processing System for Enterprises Using LLMs
Ravi K. Rajendran, Biplob Debnath, Murugan Sankaradass et al.
How Numerical Precision Affects Arithmetical Reasoning Capabilities of LLMs
Guhao Feng, Kai Yang, Yuntian Gu et al.
BayesKD: Bayesian Knowledge Distillation for Compact LLMs in Constrained Fine-tuning Scenarios
Wei Li, Lujun Li, Mark G. Lee et al.
Detecting and Mitigating Challenges in Zero-Shot Video Summarization with Video LLMs
Luca Cagliero, Lorenzo Vaiani, Eliana Pastor et al.
Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs
Runchu Tian, Yanghao Li, Yuepeng Fu et al.