Papers
Scaling Laws Are Unreliable for Downstream Tasks: A Reality Check
Nicholas Lourie, Michael Y. Hu, Kyunghyun Cho
Scaling Low-Resource MT via Synthetic Data Generation with LLMs
Ona de Gibert, Joseph Attieh, Teemu Vahtola et al.
Scaling Rich Style-Prompted Text-to-Speech Datasets
Anuj Diwan, Zhisheng Zheng, David Harwath et al.
Scaling, Simplification, and Adaptation: Lessons from Pretraining on Machine-Translated Text
Dan John Velasco, Matthew Theodore Roque
Scaling Up Temporal Domain Generalization via Temporal Experts Averaging
Aoming Liu, Kevin Miller, Venkatesh Saligrama et al.
SCDTour: Embedding Axis Ordering and Merging for Interpretable Semantic Change Detection
Taichi Aida, Danushka Bollegala
SCE: Semantic Consistency Enhanced Reinforcement Learning for Multi-Hop Knowledge Graph Reasoning
Yanwen Huang, Yao Liu, Qiao Liu et al.
Schema Generation for Large Knowledge Graphs Using Large Language Models
Bohui Zhang, Yuan He, Lydia Pintscher et al.
ScholarBench: A Bilingual Benchmark for Abstraction, Comprehension, and Reasoning Evaluation in Academic Contexts
Dongwon Noh, Donghyeok Koh, Junghun Yuk et al.
SciClaims: An End-to-End Generative System for Biomedical Claim Analysis
Raúl Ortega, Jose Manuel Gomez-Perez
SciCompanion: Graph-Grounded Reasoning for Structured Evaluation of Scientific Arguments
Joshua Alan Flashner, Adithya Kulkarni, Dawei Zhou
Scientific Paper Retrieval with LLM-Guided Semantic-Based Ranking
Yunyi Zhang, Ruozhen Yang, Siqi Jiao et al.
SciEvent: Benchmarking Multi-domain Scientific Event Extraction
Bofu Dong, Pritesh Shah, Sumedh Sonawane et al.
SciNLP: A Domain-Specific Benchmark for Full-Text Scientific Entity and Relation Extraction in NLP
Decheng Duan, Jitong Peng, Yingyi Zhang et al.
SciRIFF: A Resource to Enhance Language Model Instruction-Following over Scientific Literature
David Wadden, Kejian Shi, Jacob Morrison et al.
SciSketch: An Open-source Framework for Automated Schematic Diagram Generation in Scientific Papers
Zihang Wang, Yilun Zhao, Kaiyan Zhang et al.
SCoder: Progressive Self-Distillation for Bootstrapping Small-Scale Data Synthesizers to Empower Code LLMs
Xinyu Zhang, Changzhi Zhou, Linmei Hu et al.
SCRIBE: Structured Chain Reasoning for Interactive Behaviour Explanations using Tool Calling
Fares Fawzi, Vinitra Swamy, Dominik Glandorf et al.
SDGO: Self-Discrimination-Guided Optimization for Consistent Safety in Large Language Models
Peng Ding, Wen Sun, Dailin Li et al.
SEAL: Structure and Element Aware Learning Improves Long Structured Document Retrieval
Xinhao Huang, Zhibo Ren, Yipeng Yu et al.
SeaPO: Strategic Error Amplification for Robust Preference Optimization of Large Language Models
Jun Rao, Yunjie Liao, Xuebo Liu et al.
SEARA: An Automated Approach for Obtaining Optimal Retrievers
Zou Yuheng, Wang Yiran Yiran, Tian Yuzhu et al.
Searching for the Most Human-like Emergent Language
Brendon Boldt, David R. Mortensen
Search-o1: Agentic Search-Enhanced Large Reasoning Models
Xiaoxi Li, Guanting Dong, Jiajie Jin et al.
Search Wisely: Mitigating Sub-optimal Agentic Searches By Reducing Uncertainty
Peilin Wu, Mian Zhang, Xinlu Zhang et al.