Papers
Surprise Calibration for Better In-Context Learning
Zhihang Tan, Jingrui Hou, Ping Wang et al.
SurveyGen: Quality-Aware Scientific Survey Generation with Large Language Models
Tong Bao, Mir Tafseer Nayeem, Davood Rafiei et al.
SVeritas: Benchmark for Robust Speaker Verification under Diverse Conditions
Massa Baali, Sarthak Bisht, Francisco Teixeira et al.
SWAM: Adaptive Sliding Window and Memory-Augmented Attention Model for Rumor Detection
Mei Guo, Chen Chen, Chunyan Hou et al.
SWAN: An Efficient and Scalable Approach for Long-Context Language Modeling
Krishna C Puvvada, Faisal Ladhak, Santiago Akle Serano et al.
SwarmAgentic: Towards Fully Automated Agentic System Generation via Swarm Intelligence
Yao Zhang, Chenyang Lin, Shijie Tang et al.
SWE-MERA: A Dynamic Benchmark for Agenticly Evaluating Large Language Models on Software Engineering Tasks
Adamenko Pavel, Ivanov Mikhail, Aidar Valeev et al.
SwiftKV: Fast Prefill-Optimized Inference with Knowledge-Preserving Model Transformation
Aurick Qiao, Zhewei Yao, Samyam Rajbhandari et al.
SwiftPrune: Hessian-Free Weight Pruning for Large Language Models
Yuhan Kang, Yang Shi, Mei Wen et al.
Sycophancy Mitigation Through Reinforcement Learning with Uncertainty-Aware Adaptive Reasoning Trajectories
Mohammad Beigi, Ying Shen, Parshin Shojaee et al.
SYNC: A Synthetic Long-Context Understanding Benchmark for Controlled Comparisons of Model Capabilities
Shuyang Cao, Kaijian Zou, Lu Wang
SynC-LLM: Generation of Large-Scale Synthetic Circuit Code with Hierarchical Language Models
Shang Liu, Yao Lu, Wenji Fang et al.
Synergizing Multimodal Temporal Knowledge Graphs and Large Language Models for Social Relation Recognition
Haorui Wang, Zheng Wang, Yuxuan Zhang et al.
Syntactic Blind Spots: How Misalignment Leads to LLMs’ Mathematical Errors
Dane A Williamson, Yangfeng Ji, Matthew B. Dwyer
<SYNTACT>: Structuring Your Natural Language SOPs into Tailored Ambiguity-Resolved Code Templates
Sachin Kumar Giroh, Pushpendu Ghosh, Aryan Jain et al.
Syntax-Aware Retrieval Augmentation for Neural Symbolic Regression
Canmiao Zhou, Han Huang
Synthetic Data for Evaluation: Supporting LLM-as-a-Judge Workflows with EvalAssist
Martín Santillán Cooper, Zahra Ashktorab, Hyo Jin Do et al.
Synthetic Proofs with Tool-Integrated Reasoning: Contrastive Alignment for LLM Mathematics with Lean
Mark Obozov, Michael Diskin, Aleksandr Beznosikov et al.
Synthetic Socratic Debates: Examining Persona Effects on Moral Decision and Persuasion Dynamics
Jiarui Liu, Yueqi Song, Yunze Xiao et al.
Synth-SBDH: A Synthetic Dataset of Social and Behavioral Determinants of Health for Clinical Text
Avijit Mitra, Zhichao Yang, Emily Druhl et al.
SynthTextEval: Synthetic Text Data Generation and Evaluation for High-Stakes Domains
Krithika Ramesh, Daniel Smolyak, Zihao Zhao et al.
SYSTRAN @ WMT 2025 General Translation Task
Dakun Zhang, Yara Khater, Ramzi Rahli et al.
T2: An Adaptive Test-Time Scaling Strategy for Contextual Question Answering
Zhengyi Zhao, Shubo Zhang, Zezhong Wang et al.
T2R-BENCH: A Benchmark for Real World Table-to-Report Task
Jie Zhang, Changzai Pan, Sishi Xiong et al.
TABARD: A Novel Benchmark for Tabular Anomaly Analysis, Reasoning and Detection
Manan Roy Choudhury, Anirudh Iyengar Kaniyar Narayana Iyengar, Shikhhar Siingh et al.