Papers
SafeR-CLIP: Mitigating NSFW Content in Vision-Language Models While Preserving Pre-Trained Knowledge
Adeel Yousaf, Joseph Fioresi, James Beetham et al.
SafeSearch: Do Not Trade Safety for Utility in LLM Search Agents
Qiusi Zhan, Angeline Budiman-Chan, Abdelrahman Zayed et al.
SAFE: Semantic- and Frequency-Enhanced Curriculum for Cross-Domain Deepfake Detection
Yulin Yao, Kangfeng Zheng, Bin Wu et al.
SafeSieve: From Heuristics to Experience in Progressive Pruning for LLM-based Multi-Agent Communication
Ruijia Zhang, Xinyan Zhao, Ruixiang Wang et al.
Safety Alignment of Large Language Models via Contrasting Safe and Harmful Distributions
Xiaoyun Zhang, Zhengyue Zhao, Wenxuan Shi et al.
SafetyMem: Adaptive Jailbreak Defense via Dual-Component Safety Memory
Hao Wang, Ziyi Ni, Huacan Wang et al.
Safety of Large Language Models Beyond English: A Systematic Literature Review of Risks, Biases, and Safeguards
Aleksandra Krasnodębska, Katarzyna Dziewulska, Karolina Seweryn et al.
SafetyReminder: Reviving Delayed Safety Awareness of Vision-Language Models to Defend Against Jailbreak Attacks
Peiyuan Tang, Haojie Xin, Xiaodong Zhang et al.
Safety-Utility Conflicts Are Not Global: Surgical Alignment via Head-Level Diagnosis
Wang Cai, Yilin Wen, Jinchang Hou et al.
Safe-Unsafe Concept Separation Emerges from a Single Direction in Language Models Activation Space
Andrea Ermellino, Lorenzo Malandri, Fabio Mercorio et al.
Safe Vision-Language Models via Unsafe Weights Manipulation
Moreno D'incà, Elia Peruzzo, Xingqian Xu et al.
SAFO: Stable Adaptive Fairness Optimization for LLM-Based Social Survey Simulation
Chenxi Lin, Zhuoren Jiang, Kaisong Song et al.
SAGA: Learning Signal-Aligned Distributions for Improved Text-to-Image Generation
Paul Grimal, Michael Soumm, Hervé Le Borgne et al.
SAGE: A Compositional Multi-Agent LLM Framework with Pedagogical Reasoning for Structured Collaborative Problem Solving
Van-Khanh Tran, Van-Khai Dang, Duc-Huy Nguyen
SAGE: An Agentic Explainer Framework for Interpreting SAE Features in Language Models
Jiaojiao Han, Wujiang Xu, Mingyu Jin et al.
SAGE: A Search-AuGmented Evaluation of Large Language Models on Free-Form QA
Sher Badshah, Ali Emami, Hassan Sajjad
SAGE : A Top-Down Bottom-Up Knowledge-Grounded User Simulator for Multi-turn Agent Evaluation
Ryan Shea, Yunan Lu, Liang Qiu et al.
SageLM: A Multi-aspect and Explainable Large Language Model for Speech Judgement
Yuan Ge, Junxiang Zhang, Xiaoqian Liu et al.
SAGE: Sparse Adaptive Guidance for Dependency-Aware Tabular Data Generation
Shuo Yang, Zheyu Zhang, Bardh Prenkaj et al.
SAGE: Spuriousness-Aware Guided Prompt Exploration for Mitigating Multimodal Bias
Wenqian Ye, Di Wang, Guangtao Zheng et al.
SAGE: Steerable Agentic Data Generation for Deep Search with Execution Feedback
Fangyuan Xu, Rujun Han, Yanfei Chen et al.
SAGE: Structured Attribute-Guided Enhancement for GZSL
Zao Zhang, Liguo Sun, Pin Lyu
SAGE: Synergistic Adaptive Gating of Experts for Hateful Video Detection
Jie Huang, Xin Liao, Junjie Wang et al.
Sahara Tokenizers at PARSEME 2.0 Subtask 1: Combining Contextual Embeddings with Structural Decoding for Multi-Word Expression Detection
Yunus Karatepe, Mert Sülük, Zeynep Tuğçe Kırımlı et al.