Papers
LLM-Independent Adaptive RAG: Let the Question Speak for Itself
Maria Marina, Nikolay Ivanov, Sergey Pletenev et al.
LLM-OREF: An Open Relation Extraction Framework Based on Large Language Models
Hongyao Tu, Liang Zhang, Yujie Lin et al.
Can LLMs Help You at Work? A Sandbox for Evaluating LLM Agents in Enterprise Environments
Harsh Vishwakarma, Ankush Agarwal, Ojas Patil et al.
On LLM-Based Scientific Inductive Reasoning Beyond Equations
Brian S. Lin, Jiaxin Yuan, Zihan Zhou et al.
Doubling Your Data in Minutes: Ultra-fast Tabular Data Generation via LLM-Induced Dependency Graphs
Shuo Yang, Zheyu Zhang, Bardh Prenkaj et al.
Cache-Efficient Posterior Sampling for Reinforcement Learning with LLM-Derived Priors Across Discrete and Continuous Domains
Ibne Farabi Shihab, Sanjeda Akter, Anuj Sharma
Analyzing Uncertainty of LLM-as-a-Judge: Interval Evaluations with Conformal Prediction
Huanxin Sheng, Xinyi Liu, Hangfeng He et al.
Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge
Tianhao Wu, Weizhe Yuan, Olga Golovneva et al.
Improving Clustering with Positive Pairs Generated from LLM-Driven Labels
Xiaotong Zhang, Ying Li
Understanding the Information Propagation Effects of Communication Topologies in LLM-based Multi-Agent Systems
Xu Shen, Yixin Liu, Yiwei Dai et al.
Evaluating LLM-Generated Diagrams as Graphs
Chumeng Liang, Jiaxuan You
Improve LLM-as-a-Judge Ability as a General Ability
Jiachen Yu, Shaoning Sun, Xiaohui Hu et al.
ToolSafety: A Comprehensive Dataset for Enhancing Safety in LLM-Based Agent Tool Invocations
Yuejin Xie, Youliang Yuan, Wenxuan Wang et al.
SSA: Semantic Contamination of LLM-Driven Fake News Detection
Cheng Xu, Nan Yan, Shuhao Guan et al.
Context-Aware Hierarchical Taxonomy Generation for Scientific Papers via LLM-Guided Multi-Aspect Clustering
Kun Zhu, Lizi Liao, Yuxuan Gu et al.
TreeReview: A Dynamic Tree of Questions Framework for Deep and Efficient LLM-based Scientific Peer Review
Yuan Chang, Ziyue Li, Hengyuan Zhang et al.
CheckEval: A reliable LLM-as-a-Judge framework for evaluating text generation using checklists
Yukyung Lee, JoongHoon Kim, Jaehee Kim et al.
ReSo: A Reward-driven Self-organizing LLM-based Multi-Agent System for Reasoning Tasks
Heng Zhou, Hejia Geng, Xiangyuan Xue et al.
DynamicNER: A Dynamic, Multilingual, and Fine-Grained Dataset for LLM-based Named Entity Recognition
Hanjun Luo, Yingbin Jin, Yiran Wang et al.
Prototypical Human-AI Collaboration Behaviors from LLM-Assisted Writing in the Wild
Sheshera Mysore, Debarati Das, Hancheng Cao et al.
Igniting Creative Writing in Small Language Models: LLM-as-a-Judge versus Multi-Agent Refined Rewards
Xiaolong Wei, Bo Lu, Xingyu Zhang et al.
Media Source Matters More Than Content: Unveiling Political Bias in LLM-Generated Citations
Sunhao Dai, Zhanshuo Cao, Wenjie Wang et al.
Evaluating the Effectiveness and Scalability of LLM-Based Data Augmentation for Retrieval
Pranjal A Chitale, Bishal Santra, Yashoteja Prabhu et al.
DivScore: Zero-Shot Detection of LLM-Generated Text in Specialized Domains
Zhihui Chen, Kai He, Yucheng Huang et al.
LLM-Driven Completeness and Consistency Evaluation for Cultural Heritage Data Augmentation in Cross-Modal Retrieval
Jian Zhang, Junyi Guo, Junyi Yuan et al.