Papers
WorkForceAgent-R1: Incentivizing Reasoning Capability in LLM-based Web Agents via Reinforcement Learning
Yuchen Zhuang, Di Jin, Jiaao Chen et al.
Unsupervised Detection of LLM-Generated Text in Korean Using Syntactic and Semantic Cues
Heejeong Jeon, MinSu Park, YunSeok Choi et al.
SEAM: Bridging the Temporal-Semantic Granularity Gap for LLM-based Speech Recognition
Junseok Oh, Ji-Hwan Kim
DeepSieve: Information Sieving via LLM-as-a-Knowledge-Router
Minghao Guo, Qingcheng Zeng, Xujiang Zhao et al.
Marking Code Without Breaking It: Code Watermarking for Detecting LLM-Generated Code
Jungin Kim, Shinwoo Park, Yo-Sub Han
Linguistic Cues for LLM-based Implicit Discourse Relation Classification
Yi Fan, Michael Strube, Wei Liu
LARA: LLM-based Agile Power Distribution Network Restoration from Disastrous Events
Jishnu Warrier, Heqing Huang, Yuzhang Lin et al.
Training-Free Text Emotion Tagging via LLM-Based Best-Worst Scaling
Lukas Christ, Shahin Amiriparian
LLM-to-Speech: A Synthetic Data Pipeline for Training Dialectal Text-to-Speech Models
Ahmed Khamis, Hesham Ali Ahmed
Who Judges the Judge? Evaluating LLM-as-a-Judge for French Medical open-ended QA
Ikram Belmadani, Oumaima El Khettari, Pacôme Constant dit Beaufils et al.
Measuring the Symbolic Power of Languages with LLM-based Multilingual Persuasion Simulation
Yin Jou Huang, Fei Cheng
LLM-as-a-Judge for Low-Resource Languages: Adapting Ragas and Comparative Ranking for Romanian
Claudiu Creanga, Liviu P. Dinu
Anchoring the Judge: Curriculum-Based Adaptation and Reference-Anchored MQM for LLM-Based Machine Translation of an Unseen Low-Resource Language - A Case of Nupe
Umar Baba Umar, Sulaimon Adebayo Bashir, Abdulmalik Danlami Mohammed
Comparing LLM-Based Translation Approaches for Extremely Low-Resource Languages
Jared Coleman, Ruben Rosales, Kira Toal et al.
LLM-as-a-qualitative-judge: automating error analysis in natural language generation
Nadezhda Chirkova, Tunde Oluwaseyi Ajayi, Seth Aycock et al.
Whom to Trust? Analyzing the Divergence Between User Satisfaction and LLM-as-a-Judge in E-Commerce RAG Systems
Arif Türkmen, Kaan Efe Keleş
Read Between the Tracks: Exploring LLM-driven Intent-based Music Recommendations
Anna Hausberger, Petra Jósár, Markus Schedl
FinRpt: Dataset, Evaluation System and LLM-based Multi-agent Framework for Equity Research Report Generation
Song Jin, Shuqi Li, Shukun Zhang et al.
Navigating the Alpha Jungle: An LLM-Powered MCTS Framework for Formulaic Alpha Factor Mining
Yu Shi, Yitong Duan, Jian Li
FIXME: Towards End-to-End Benchmarking of LLM-Aided Design Verification
Gwok-Waa Wan, SamZaak Wong, Shengchu Su et al.
SoMe: A Realistic Benchmark for LLM-based Social Media Agents
Dizhan Xue, Jing Cui, Shengsheng Qian et al.
An LLM-based Quantitative Framework for Evaluating High-Stealthy Backdoor Risks in OSS Supply Chains
Zihe Yan, Kai Luo, Haoyu Yang et al.