Papers
16,749 papers found
AppTek’s Automatic Speech Translation: Generating Accurate and Well-Readable Subtitles
Frithjof Petrick, Patrick Wilken, Evgeny Matusov et al.
A Practical Approach for Building Production-Grade Conversational Agents with Workflow Graphs
Chiwan Park, Wonjun Jang, Daeryong Kim et al.
A Practical Tool to Help Automate Interlinear Glossing: a Study on Mukrī Kurdish
Hiwa Asadpour, Shu Okabe, Alexander Fraser
APT: Improving Specialist LLM Performance with Weakness Case Acquisition and Iterative Preference Training
Jun Rao, Zepeng Lin, Xuebo Liu et al.
AQuAECHR: Attributed Question Answering for European Court of Human Rights
Korbinian Q. Weidinger, Santosh T.Y.S.S, Oana Ichim et al.
A Query-Response Framework for Whole-Page Complex-Layout Document Image Translation with Relevant Regional Concentration
Zhiyang Zhang, Yaping Zhang, Yupu Liang et al.
Arbiters of Ambivalence: Challenges of using LLMs in No-Consensus tasks
Bhaktipriya Radharapu, Manon Revel, Megan Ung et al.
ARC ‘Challenge’ Is Not That Challenging
Łukasz Borchmann
Archaeology at BEA 2025 Shared Task: Are Simple Baselines Good Enough?
Ana Roșu, Jany-Gabriel Ispas, Sergiu Nisioi
ArchiDocGen: Multi-Agent Framework for Expository Document Generation in the Architectural Industry
Junjie Jiang, Haodong Wu, Yongqi Zhang et al.
Arctic-TILT. Business Document Understanding at Sub-Billion Scale
Łukasz Borchmann, Michał Pietruszka, Wojciech Jaśkowski et al.
A Reality Check on Context Utilisation for Retrieval-Augmented Generation
Lovisa Hagström, Sara Vera Marjanovic, Haeun Yu et al.
Are Any-to-Any Models More Consistent Across Modality Transfers Than Specialists?
Jiwan Chung, Janghan Yoon, Junhyeong Park et al.
Are Bias Evaluation Methods Biased?
Lina Berrayana, Sean Rooney, Luis Garcés-Erice et al.
A rebuttal of two common deflationary stances against LLM cognition
Zak Hussain, Rui Mata, Dirk U. Wulff
Are Dialects Better Prompters? A Case Study on Arabic Subjective Text Classification
Leila Moudjari, Farah Benamara
A Reinforcement Learning Framework for Cross-Lingual Stance Detection Using Chain-of-Thought Alignment
Binghui Li, Minghui Zou, Xiaowang Zhang et al.
Are Large Language Models for Education Reliable Across Languages?
Vansh Gupta, Sankalan Pal Chowdhury, Vilém Zouhar et al.
Are LLMs effective psychological assessors? Leveraging adaptive RAG for interpretable mental health screening through psychometric practice
Federico Ravenda, Seyed Ali Bahrainian, Andrea Raballo et al.
Are LLMs Rational Investors? A Study on the Financial Bias in LLMs
Yuhang Zhou, Yuchen Ni, Zhiheng Xi et al.
Are LLMs (Really) Ideological? An IRT-based Analysis and Alignment Tool for Perceived Socio-Economic Bias in LLMs
Jasmin Wachter, Michael Radloff, Maja Smolej et al.
Are LLMs reliable? An exploration of the reliability of large language models in clinical note generation
Kristine Ann M. Carandang, Jasper Meynard Arana, Ethan Robert Casin et al.
Are LLMs Truly Graph-Savvy? A Comprehensive Evaluation of Graph Generation
Ege Demirci, Rithwik Kerur, Ambuj Singh
Are Multimodal Large Language Models Pragmatically Competent Listeners in Simple Reference Resolution Tasks?
Simeon Junker, Manar Ali, Larissa Koch et al.
Are Optimal Algorithms Still Optimal? Rethinking Sorting in LLM-Based Pairwise Ranking with Batching and Caching
Juan Wisznia, Cecilia Bolaños, Juan Tollo et al.