Papers
5,479 papers found
Marking Code Without Breaking It: Code Watermarking for Detecting LLM-Generated Code
Jungin Kim, Shinwoo Park, Yo-Sub Han
Linguistic Cues for LLM-based Implicit Discourse Relation Classification
Yi Fan, Michael Strube, Wei Liu
LARA: LLM-based Agile Power Distribution Network Restoration from Disastrous Events
Jishnu Warrier, Heqing Huang, Yuzhang Lin et al.
Training-Free Text Emotion Tagging via LLM-Based Best-Worst Scaling
Lukas Christ, Shahin Amiriparian
LLM-to-Speech: A Synthetic Data Pipeline for Training Dialectal Text-to-Speech Models
Ahmed Khamis, Hesham Ali Ahmed
Who Judges the Judge? Evaluating LLM-as-a-Judge for French Medical open-ended QA
Ikram Belmadani, Oumaima El Khettari, Pacôme Constant dit Beaufils et al.
Measuring the Symbolic Power of Languages with LLM-based Multilingual Persuasion Simulation
Yin Jou Huang, Fei Cheng
LLM-as-a-Judge for Low-Resource Languages: Adapting Ragas and Comparative Ranking for Romanian
Claudiu Creanga, Liviu P Dinu
Anchoring the Judge: Curriculum-Based Adaptation and Reference-Anchored MQM for LLM-Based Machine Translation of an Unseen Low-Resource Language - A Case of Nupe
Umar Baba Umar, Sulaimon Adebayo Bashir, Abdulmalik Danlami Mohammed
Comparing LLM-Based Translation Approaches for Extremely Low-Resource Languages
Jared Coleman, Ruben Rosales, Kira Toal et al.
LLM-as-a-qualitative-judge: automating error analysis in natural language generation
Nadezhda Chirkova, Tunde Oluwaseyi Ajayi, Seth Aycock et al.
Whom to Trust? Analyzing the Divergence Between User Satisfaction and LLM-as-a-Judge in E-Commerce RAG Systems
Arif Türkmen, Kaan Efe Keleş
Read Between the Tracks: Exploring LLM-driven Intent-based Music Recommendations
Anna Hausberger, Petra Jósár, Markus Schedl
FinRpt: Dataset, Evaluation System and LLM-based Multi-agent Framework for Equity Research Report Generation
Song Jin, Shuqi Li, Shukun Zhang et al.
Navigating the Alpha Jungle: An LLM-Powered MCTS Framework for Formulaic Alpha Factor Mining
Yu Shi, Yitong Duan, Jian Li
FIXME: Towards End-to-End Benchmarking of LLM-Aided Design Verification
Gwok-Waa Wan, SamZaak Wong, Shengchu Su et al.
SoMe: A Realistic Benchmark for LLM-based Social Media Agents
Dizhan Xue, Jing Cui, Shengsheng Qian et al.
An LLM-based Quantitative Framework for Evaluating High-Stealthy Backdoor Risks in OSS Supply Chains
Zihe Yan, Kai Luo, Haoyu Yang et al.
MicLog: Towards Accurate and Efficient LLM-based Log Parsing via Progressive Meta In-Context Learning
Jianbo Yu, Yixuan Li, Hai Xu et al.
A Theory of Adaptive Scaffolding for LLM-Based Pedagogical Agents
Clayton Cohn, Surya Rayala, Namrata Srivastava et al.
Mind the Gap: The Divergence Between Human and LLM-Generated Tasks
Yi-Long Lu, Jiajun Song, Chunhui Zhang et al.
Ψ-Arena: Interactive Assessment and Optimization of LLM-based Psychological Counselors with Tripartite Feedback
Shijing Zhu, Zhuang Chen, Guanqun Bi et al.