Papers - Conftrace

RelEdit: Evaluating Conceptual Knowledge Editing in Language Models via Relational Reasoning

Yifan Niu, Miao Peng, Nuo Chen et al.

2025 ACL

Relevance Scores Calibration for Ranked List Truncation via TMP Adapter

Pavel Posokhov, Sergei Masliukhin, Skrylnikov Stepan et al.

2025 ACL

Relevant or Random: Can LLMs Truly Perform Analogical Reasoning?

Chengwei Qin, Wenhan Xia, Tan Wang et al.

2025 ACL

Reliably Bounding False Positives: A Zero-Shot Machine-Generated Text Detection Framework via Multiscaled Conformal Prediction

Xiaowei Zhu, Yubing Ren, Yanan Cao et al.

2025 ACL

RemoteRAG: A Privacy-Preserving LLM Cloud RAG Service

Yihang Cheng, Lan Zhang, Junyang Wang et al.

2025 ACL

Removal of Hallucination on Hallucination: Debate-Augmented RAG

Wentao Hu, Wengyu Zhang, Yiyang Jiang et al.

2025 ACL

Removing Prompt-template Bias in Reinforcement Learning from Human Feedback

Chaojie Wang, Haonan Shi, Long Tian et al.

2025 ACL

RePanda: Pandas-powered Tabular Verification and Reasoning

Atoosa Chegini, Keivan Rezaei, Hamid Eghbalzadeh et al.

2025 ACL

REPA: Russian Error Types Annotation for Evaluating Text Generation and Judgment Capabilities

Alexander Pugachev, Alena Fenogenova, Vladislav Mikhailov et al.

2025 ACL

Representation Bending for Large Language Model Safety

Ashkan Yousefpour, Taeheon Kim, Ryan Sungmo Kwon et al.

2025 ACL

Representations of Fact, Fiction and Forecast in Large Language Models: Epistemics and Attitudes

Meng Li, Michael Vrazitulis, David Schlangen

2025 ACL

REPRO-Bench: Can Agentic AI Systems Assess the Reproducibility of Social Science Research?

Chuxuan Hu, Liyun Zhang, Yeji Lim et al.

2025 ACL

Reproducing the Argument Quality Prediction of Project Debater

Ines Zelch, Matthias Hagen, Benno Stein et al.

2025 ACL

ReproHum #0031-01: Reproducing the Human Evaluation of Readability from “It is AI’s Turn to Ask Humans a Question”

Daniel Braun

2025 ACL

ReproHum #0033-05: Human Evaluation of Factuality from A Multidisciplinary Perspective

Andra-Maria Florescu, Marius Micluța-Câmpeanu, Stefana Arina Tabusca et al.

2025 ACL

ReproHum #0067-01: A Reproduction of the Evaluation of Cross-Lingual Summarization

Supryadi, Chuang Liu, Deyi Xiong

2025 ACL

ReproHum #0669-08: Reproducing Sentiment Transfer Evaluation

Kristýna Onderková, Mateusz Lango, Patrícia Schmidtová et al.

2025 ACL

ReproHum #0729-04: Human Evaluation Reproduction Report for “MemSum: Extractive Summarization of Long Documents Using Multi-Step Episodic Markov Decision Processes”

Simeon Junker

2025 ACL

ReproHum #0729-04: Partial reproduction of the human evaluation of the MemSum and NeuSum summarisation systems

Simon Mille, Michela Lorandi

2025 ACL

ReproHum #0744-02: A Reproduction of the Human Evaluation of Meaning Preservation in “Factorising Meaning and Form for Intent-Preserving Paraphrasing”

Julius Steen, Katja Markert

2025 ACL

ReproHum: #0744-02: Investigating the Reproducibility of Semantic Preservation Human Evaluations

Mohammad Arvan, Natalie Parde

2025 ACL

Reranking-based Generation for Unbiased Perspective Summarization

Narutatsu Ri, Nicholas Deas, Kathleen McKeown

2025 ACL

Re-ranking Using Large Language Models for Mitigating Exposure to Harmful Content on Social Media Platforms

Rajvardhan Oak, Muhammad Haroon, Claire Wonjeong Jo et al.

2025 ACL

ReSCORE: Label-free Iterative Retriever Training for Multi-hop Question Answering with Relevance-Consistency Supervision

Dosung Lee, Wonjun Oh, Boyoung Kim et al.

2025 ACL

Research Borderlands: Analysing Writing Across Research Cultures

Shaily Bhatt, Tal August, Maria Antoniak

2025 ACL