Papers
How Jailbreak Defenses Work and Ensemble? A Mechanistic Investigation
Zhuohan Long, Siyuan Wang, Shujun Liu et al.
How Much Do Large Language Models Know about Human Motion? A Case Study in 3D Avatar Control
Kunhang Li, Jason Naradowsky, Yansong Feng et al.
How Much Do LLMs Hallucinate across Languages? On Realistic Multilingual Estimation of LLM Hallucination
Saad Obaid Ul Islam, Anne Lauscher, Goran Glavaš
How Persuasive Is Your Context?
Tu Nguyen, Kevin Du, Alexander Miserlis Hoyle et al.
How Private are Language Models in Abstractive Summarization?
Anthony Hughes, Nikolaos Aletras, Ning Ma
How Real Are Synthetic Therapy Conversations? Evaluating Fidelity in Prolonged Exposure Dialogues
Suhas Bn, Dominik O. Mattioli, Andrew M. Sherrill et al.
How Reliable is Multilingual LLM-as-a-Judge?
Xiyan Fu, Wei Liu
How Sampling Affects the Detectability of Machine-written texts: A Comprehensive Study
Matthieu Dubois, François Yvon, Pablo Piantanida
How Sememic Components Can Benefit Link Prediction for Lexico-Semantic Knowledge Graphs?
Hansi Wang, Yue Wang, Qiliang Liang et al.
How to Fine-Tune Safely on a Budget: Model Adaptation Using Minimal Resources
Anh C. Pham, Mihir Thalanki, Michael Sun et al.
How to Generalize the Detection of AI-Generated Text: Confounding Neurons
Claudio Borile, Carlo Abrate
How to inject knowledge efficiently? Knowledge Infusion Scaling Law for Pre-training Large Language Models
Kangtao Lv, Haibin Chen, Yujin Yuan et al.
How to Make Large Language Models Generate 100% Valid Molecules?
Wen Tao, Jing Tang, Alvin Chan et al.
How to Protect Yourself from 5G Radiation? Investigating LLM Responses to Implicit Misinformation
Ruohao Guo, Wei Xu, Alan Ritter
How Well Can AI Models Generate Human Eye Movements During Reading?
Ivan Stebakov, Ilya Pershin
How Well Can Reasoning Models Identify and Recover from Unhelpful Thoughts?
Sohee Yang, Sang-Woo Lee, Nora Kassner et al.
HSGM: Hierarchical Segment-Graph Memory for Scalable Long-Text Semantics
Dong Liu, Yanxuan Yu
HS-STaR: Hierarchical Sampling for Self-Taught Reasoners via Difficulty Estimation and Budget Reallocation
Feng Xiong, Hongling Xu, Yifei Wang et al.
HULAT-UC3M at TSAR 2025 Shared Task A Prompt-Based Approach using Lightweight Language Models for Readability-Controlled Text Simplification
Jesus M. Sanchez-Gomez, Lourdes Moreno, Paloma Martínez et al.
Human-AI Moral Judgment Congruence on Real-World Scenarios: A Cross-Lingual Analysis
Nan Li, Bo Kang, Tijl De Bie
Human and LLM-based Assessment of Teaching Acts in Expert-led Explanatory Dialogues
Aliki Anagnostopoulou, Nils Feldhus, Yi-Sheng Hsu et al.
Human-Inspired Obfuscation for Model Unlearning: Local and Global Strategies with Hyperbolic Representations
Zekun Wang, Jingjie Zeng, Yingxu Li et al.
Humanity’s Last Code Exam: Can Advanced LLMs Conquer Human’s Hardest Code Competition?
Xiangyang Li, Xiaopeng Li, Kuicai Dong et al.
Humanizing Machines: Rethinking LLM Anthropomorphism Through a Multi-Level Framework of Design
Yunze Xiao, Lynnette Hui Xian Ng, Jiarui Liu et al.
Humans Hallucinate Too: Language Models Identify and Correct Subjective Annotation Errors With Label-in-a-Haystack Prompts
Georgios Chochlakis, Peter Wu, Tikka Arjun Singh Bedi et al.