Papers
Reasoning about Uncertainty: Do Reasoning Models Know When They Don’t Know?
Zhiting Mei, Christina Zhang, Tenny Yin et al.
Reasoning Beyond Labels: Measuring LLM Sentiment in Low-Resource, Culturally Nuanced Contexts
Millicent Ochieng, Anja Thieme, Ignatius Ezeani et al.
Reasoning Beyond Literal: Cross-style Multimodal Reasoning for Figurative Language Understanding
Seyyed Saeid Cheshmi, Hahnemann Ortiz, James Mooney et al.
Reasoning or Knowledge: Stratified Evaluation of Biomedical LLMs
Rahul Thapa, Qingyang Wu, Kevin Wu et al.
Reasoning’s Razor: Reasoning Improves Accuracy but Hurts Recall at Critical Operating Points in Safety and Hallucination Detection
Atoosa Chegini, Hamid Kazemi, Garrett Souza et al.
Reassessing Active Learning Adoption in Contemporary NLP: A Community Survey
Julia Romberg, Christopher Schröder, Julius Gonsior et al.
ReAttn: Improving Attention-based Re-ranking via Attention Re-weighting
Yuxing Tian, Fengran Mo, Weixu Zhang et al.
ReBPE: Iteratively Improving the Internal Structure of a Structured Tokeniser by Mining its Internal Structure
Thomas Bauwens, Miryam de Lhoneux
RECAP: REwriting Conversations for Intent Understanding in Agentic Planning
Kushan Mitra, Dan Zhang, Hannah Kim et al.
ReciFine: Finely Annotated Recipe Dataset for Controllable Recipe Generation
Nuhu Ibrahim, Rishi Ravikumar, Robert Stevens et al.
RECIPE-TKG: From Sparse History to Structured Reasoning for LLM-based Temporal Knowledge Graph Completion
Ömer Faruk Akgül, Feiyu Zhu, Yuxin Yang et al.
Recursive numeral systems are highly regular and easy to process
Ponrawee Prasertsom, Andrea Silvi, Jennifer Culbertson et al.
Redefining Retrieval Evaluation in the Era of LLMs
Giovanni Trappolini, Florin Cuconasu, Simone Filice et al.
Reducing Hallucinations in Language Model-based SPARQL Query Generation Using Post-Generation Memory Retrieval
Aditya Sharma, Christopher Pal, Amal Zouaq
ReFACT: A Benchmark for Scientific Confabulation Detection with Positional Error Annotations
Yindong Wang, Martin Preiß, Margarita Bugueño et al.
ReflectiveRAG: Rethinking Adaptivity in Retrieval-Augmented Generation
Akshay Verma, Swapnil Gupta, Siddharth Pillai et al.
Reflect, Rewrite, Repeat: How Simple Arithmetic Enables Advanced Reasoning in Small Language Models
Mengdie Flora Wang, Haochen Xie, Mun Young Kim et al.
RefusalBench: Generative Evaluation of Selective Refusal in Grounded Language Models
Aashiq Muhamed, Leonardo F. R. Ribeiro, Markus Dreyer et al.
Regional Variation in the Performance of ASR Models on Croatian and Serbian
Tanja Samardžić, Peter Rupnik, Nikola Ljubešić
REGLAT at AbjadGenEval: Multi-Model Ensemble Approach for Arabic AI-Generated Text Detection
Mariam Labib Francies, Nsrin Ashraf, Ahmed Megahed Fetouh et al.
REGLAT at AbjadMed: Handling Imbalanced Arabic Medical Text Classification via Hierarchical KNN-MLP Architecture
Ahmed Megahed Fetouh, Mohammed Rahmath, Omer Dawood et al.
RegNLI: Detecting Online Product Misbranding through Legal and Linguistic Alignment
Diya Saha, Abhishek Bharadwaj Varanasi, Tirthankar Dasgupta et al.
REIGNITE at AbjadMed: Imbalance-Aware Fine-Tuning of Pretrained Arabic Transformers for Arabic Medical Text Classification Task
Nahid Montasir Rifat, Foyez Ahmed Dewan
ReMedQA: Are We Done With Medical Multiple-Choice Benchmarks?
Alessio Cocchieri, Luca Ragazzi, Giuseppe Tagliavini et al.
Repairing Regex Vulnerabilities via Localization-Guided Instructions
Sicheol Sung, Joonghyuk Hahn, Yo-Sub Han