Papers
Shy-hunyuan-MT at WMT25 General Machine Translation Shared Task
Mao Zheng, Zheng Li, Yang Du et al.
Side Effects of Erasing Concepts from Diffusion Models
Shaswati Saha, Sourajit Saha, Manas Gaur et al.
SIFT: Grounding LLM Reasoning in Contexts via Stickers
Zihao Zeng, Xuyao Huang, Boxiu Li et al.
SilVar: Speech-Driven Multimodal Model for Reasoning Visual Question Answering and Object Localization
Tan-Hanh Pham, Le Hoang Nam, Phu-Vinh Nguyen et al.
SimBA: Simplifying Benchmark Analysis Using Performance Matrices Alone
Nishant Subramani, Alfredo Gomez, Mona T. Diab
SIMBA UQ: Similarity-Based Aggregation for Uncertainty Quantification in Large Language Models
Debarun Bhattacharjya, Balaji Ganesan, Junkyu Lee et al.
Similarity = Value? Consultation Value-Assessment and Alignment for Personalized Search
Weicong Qin, Yi Xu, Weijie Yu et al.
SimMark: A Robust Sentence-Level Similarity-Based Watermarking Algorithm for Large Language Models
Amirhossein Dabiriaghdam, Lele Wang
SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis
Shuang Sun, Huatong Song, Yuhao Wang et al.
SimpleDoc: Multi‐Modal Document Understanding with Dual‐Cue Page Retrieval and Iterative Refinement
Chelsi Jain, Yiran Wu, Yifan Zeng et al.
Simple Factuality Probes Detect Hallucinations in Long-Form Natural Language Generation
Jiatong Han, Neil Band, Muhammed Razzak et al.
Simple Yet Effective: An Information-Theoretic Approach to Multi-LLM Uncertainty Quantification
Maya Kruse, Majid Afshar, Saksham Khatwani et al.
Simulating Identity, Propagating Bias: Abstraction and Stereotypes in LLM-Generated Text
Pia Sommerauer, Giulia Rambelli, Tommaso Caselli
SimulatorArena: Are User Simulators Reliable Proxies for Multi-Turn Evaluation of AI Assistants?
Yao Dou, Michel Galley, Baolin Peng et al.
SimVBG: Simulating Individual Values by Backstory Generation
Bangde Du, Ziyi Ye, Zhijing Wu et al.
Sindbad at AraHealthQA Track 1: Leveraging Large Language Models for Mental Health Q&A
AbdulRahman A. Morsy, Saad Mankarious, Ayah Zirikly
Single layer tiny Co4 outpaces GPT-2 and GPT-BERT
Noor Ul Zain, Mohsin Raza Naseem, Ahsan Adeel
Single LLM, Multiple Roles: A Unified Retrieval-Augmented Generation Framework Using Role-Specific Token Optimization
Yutao Zhu, Jiajie Jin, Hongjin Qian et al.
SinhalaMMLU: A Comprehensive Benchmark for Evaluating Multitask Language Understanding in Sinhala
Ashmari Pramodya, Nirasha Nelki, Heshan Shalinda et al.
SKA-Bench: A Fine-Grained Benchmark for Evaluating Structured Knowledge Understanding of LLMs
Zhiqiang Liu, Enpei Niu, Yin Hua et al.
Skeleton-Guided-Translation: A Benchmarking Framework for Code Repository Translation with Fine-Grained Quality Evaluation
Xing Zhang, Jiaheng Wen, Fangkai Yang et al.
Skeletons Matter: Dynamic Data Augmentation for Text-to-Query
Yuchen Ji, Bo Xu, Jie Shi et al.
Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching
Simon A. Aytes, Jinheon Baek, Sung Ju Hwang
SkewRoute: Training-Free LLM Routing for Knowledge Graph Retrieval-Augmented Generation via Score Skewness of Retrieved Context
Hairu Wang, Yuan Feng, Yukun Cao et al.