Papers
2,781 papers found
SparQLe: Speech Queries to Text Translation Through LLMs
Amirbek Djanibekov, Hanan Aldarmaki
Prompting LLMs: Length Control for Isometric Machine Translation
Dávid Javorský, Ondřej Bojar, François Yvon
Can LLMs Recognize Their Own Analogical Hallucinations? Evaluating Uncertainty Estimation for Analogical Reasoning
Zheng Chen, Zhaoxin Feng, Jianfei Ma et al.
Reasoning or Memorization? Investigating LLMs’ Capability in Restoring Chinese Internet Homophones
Jianfei Ma, Zhaoxin Feng, Huacheng Song et al.
Understanding Verbatim Memorization in LLMs Through Circuit Discovery
Ilya Lasy, Peter Knees, Stefan Woltran
Memorization is Language-Sensitive: Analyzing Memorization and Inference Risks of LLMs in a Multilingual Setting
Ali Satvaty, Anna Visman, Dan Seidel et al.
Better Aligned with Survey Respondents or Training Data? Unveiling Political Leanings of LLMs on U.S. Supreme Court Cases
Shanshan Xu, Santosh T.y.s.s, Yanai Elazar et al.
LongSafety: Enhance Safety for Long-Context LLMs
Mianqiu Huang, Xiaoran Liu, Shaojun Zhou et al.
ArithmAttack: Evaluating Robustness of LLMs to Noisy Context in Math Problem Solving
Zain Ul Abedin, Shahzeb Qamar, Lucie Flek et al.
Guardians of Trust: Risks and Opportunities for LLMs in Mental Health
Miguel Baidal, Erik Derner, Nuria Oliver
What Counts Underlying LLMs’ Moral Dilemma Judgments?
Wenya Wu, Weihong Deng
Safe in Isolation, Dangerous Together: Agent-Driven Multi-Turn Decomposition Jailbreaks on LLMs
Devansh Srivastav, Xiao Zhang
FrontierScience Bench: Evaluating AI Research Capabilities in LLMs
Matthew Li, Santiago Torres-Garcia, Shayan Halder et al.
TeXpert: A Multi-Level Benchmark for Evaluating LaTeX Code Generation by LLMs
Sahil Kale, Vijaykant Nadadur
Predicting The Scholarly Impact of Research Papers Using Retrieval-Augmented LLMs
Tamjid Azad, Ibrahim Al Azher, Sagnik Ray Choudhury et al.
Inductive Learning on Heterogeneous Graphs Enhanced by LLMs for Software Mention Detection
Gabriel Silva, Mário Rodrigues, António Teixeira et al.
Comparing LLMs and BERT-based Classifiers for Resource-Sensitive Claim Verification in Social Media
Max Upravitelev, Nicolau Duran-Silva, Christian Woerle et al.
ClimateCheck2025: Multi-Stage Retrieval Meets LLMs for Automated Scientific Fact-Checking
Anna Kiepura, Jessica Lam
A.M.P at SciHal2025: Automated Hallucination Detection in Scientific Content via LLMs and Prompt Engineering
Le Nguyen Anh Khoa, Thìn Đặng Văn
UoR-NCL at SemEval-2025 Task 1: Using Generative LLMs and CLIP Models for Multilingual Multimodal Idiomaticity Representation
Thanet Markchom, Tong Wu, Liting Huang et al.
NlpUned at SemEval-2025 Task 10: Beyond Training: A Taxonomy-Guided Approach to Role Classification Using LLMs
Alberto Caballero, Alvaro Rodrigo, Roberto Centeno
CSIRO-LT at SemEval-2025 Task 11: Adapting LLMs for Emotion Recognition for Multiple Languages
Jiyu Chen, Necva Bölücü, Sarvnaz Karimi et al.
LATE-GIL-nlp at SemEval-2025 Task 10: Exploring LLMs and transformers for Characterization and extraction of narratives from online news
Ivan Diaz, Fredin Vázquez, Christian Luna et al.