Papers
How Much Do Large Language Model Cheat on Evaluation? Benchmarking Overestimation Under the One-Time-Pad-Based Framework
Zi Liang, Liantong Yu, Zhang Shiyu et al.
How Much Pretraining Does Structured Data Need?
Daniel Fadlon, Kfir Bar
How multilingual are multilingual LLMs? A case study in Northern Sámi-Finnish Translation
Jonne Sälevä, Constantine Lignos
How Quantization Shapes Bias in Large Language Models
Federico Marcuzzi, Xuefei Ning, Roy Schwartz et al.
How Reasoning Influences Intersectional Biases in Vision–Language Models (Student Abstract)
Adit Desai, Sudipta Roy, Mohna Chakraborty
How Reliable are Confidence Estimators for Large Reasoning Models? A Systematic Benchmark on High-Stakes Domains
Reza Khanmohammadi, Erfan Miahi, Simerjot Kaur et al.
How Robust Are Router-LLMs? Analysis of the Fragility of LLM Routing Capabilities
Aly M. Kassem, Bernhard Schölkopf, Zhijing Jin
How Should We Model the Probability of a Language?
Rasul Dent, Pedro Ortiz Suarez, Thibault Clérice et al.
How to Contextualize Empirical Data for Risk Analysis with LLMs: A Case Study of Power Outages
Haiyun Huang, Yukun Li, Marco A Pretell et al.
How to Design and Train Your Implicit Neural Representation for Video Compression
Matthew Gwilliam, Roy Zhang, Namitha Padmanabhan et al.
How to Efficiently Explore Noisy Historical Data? Leveraging Corpus Pre-Targeting to Enhance Graph-based RAG
Donghan Bian, Marie Puren, Florian Cafiero
How to Make LMs Strong Node Classifiers?
Zhe Xu, Kaveh Hassani, Si Zhang et al.
How Wide and How Deep? Mitigating Over-squashing of GNNs via Channel Capacity Constrained Estimation
Zinuo You, Jin Zheng, John Cartlidge
HPSU: A Benchmark for Human-Level Perception in Real-World Spoken Speech Understanding
Chen Li, Peiji Yang, Yicheng Zhong et al.
HQ-SVC: Towards High-Quality Zero-Shot Singing Voice Conversion in Low-Resource Scenarios
Bingsong Bai, Yizhong Geng, Fengping Wang et al.
H-RDT: Human Manipulation Enhanced Bimanual Robotic Manipulation
Hongzhe Bi, Lingxuan Wu, Tianwei Lin et al.
HSA-Net: Hierarchical and Structure-Aware Framework for Efficient and Scalable Molecular Language Modeling
Zihang Shao, Wentao Lei, Lei Wang et al.
HSKBenchmark: Modeling and Benchmarking Chinese Second Language Acquisition in Large Language Models Through Curriculum Tuning
Qihao Yang, Xuelin Wang, Jiale Chen et al.
HTG-GCL: Leveraging Hierarchical Topological Granularity from Cellular Complexes for Graph Contrastive Learning
Qirui Ji, Bin Qin, Yifan Jin et al.
HTN Plan Verification by Qualitative Temporal Reasoning
Tobias Schwartz, Diedrich Wolter
HTTrack: Learning to Perceive Targets via Historical Trajectories in Satellite Video Tracking
Jiahao Wang, Fang Liu, Licheng Jiao et al.
HuiduRep: A Robust Self-Supervised Framework for Learning Neural Representations from Extracellular Recordings
Feng Cao, Zishuo Feng, Jicong Zhang et al.
Human2Robot: Learning Robot Actions from Paired Human-Robot Videos
Sicheng Xie, Haidong Cao, Zejia Weng et al.
HumanBench: Two Heads, No Legs, But Mostly Human, the State of Generative Capabilities in T2I Models
Anubhooti Jain, Mayank Vatsa, Richa Singh
Human-Centric Open-Future Task Discovery: Formulation, Benchmark, and Scalable Tree-Based Search
Zijian Song, Xiaoxin Lin, Tao Pu et al.