Papers
Gumbel Reranking: Differentiable End-to-End Reranker Optimization
Siyuan Huang, Zhiyuan Ma, Jintao Du et al.
GUM-SAGE: A Novel Dataset and Approach for Graded Entity Salience Prediction
Jessica Lin, Amir Zeldes
Habib University at SemEval-2025 Task 11: Bridging the Gap in Text-Based Emotion Detection
Owais Waheed, Hammad Sajid, Kushal Chandani et al.
Habib University at SemEval-2025 Task 9: Using Ensemble Models for Food Hazard Detection
Rabia Shahab, Iqra Azfar, Hammad Sajid et al.
HACo-Det: A Study Towards Fine-Grained Machine-Generated Text Detection under Human-AI Coauthoring
Zhixiong Su, Yichen Wang, Herun Wan et al.
HAF-RM: A Hybrid Alignment Framework for Reward Model Training
Shujun Liu, Xiaoyu Shen, Yuhang Lai et al.
HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language Models
Xiao Wang, Jingyun Hua, Weihong Lin et al.
Hallucination Detectives at SemEval-2025 Task 3: Span-Level Hallucination Detection for LLM-Generated Answers
Passant Elchafei, Mervat Abu-Elkheir
Hallucination Detox: Sensitivity Dropout (SenD) for Large Language Model Training
Shahrad Mohammadzadeh, Juan David Guerra, Marco Bonizzato et al.
HalluLens: LLM Hallucination Benchmark
Yejin Bang, Ziwei Ji, Alan Schelten et al.
HalluRAG-RUG at SemEval-2025 Task 3: Using Retrieval-Augmented Generation for Hallucination Detection in Model Outputs
Silvana Abdi, Mahrokh Hassani, Rosalien Kinds et al.
HalluSearch at SemEval-2025 Task 3: A Search-Enhanced RAG Pipeline for Hallucination Detection
Mohamed Abdallah, Samhaa El-Beltagy
HALoGEN: Fantastic LLM Hallucinations and Where to Find Them
Abhilasha Ravichander, Shrusti Ghela, David Wadden et al.
HammerBench: Fine-Grained Function-Calling Evaluation in Real Mobile Assistant Scenarios
Jun Wang, Jiamu Zhou, Xihuai Wang et al.
Hanging in the Balance: Pivotal Moments in Crisis Counseling Conversations
Vivian Nguyen, Lillian Lee, Cristian Danescu-Niculescu-Mizil
Hard Negative Mining for Domain-Specific Retrieval in Enterprise Systems
Hansa Meghwani, Amit Agarwal, Priyaranjan Pattnayak et al.
Harmonizing Divergent Lemmatization and Part-of-Speech Tagging Practices for Latin Participles through the LiLa Knowledge Base
Marco Passarotti, Federica Iurescia, Paolo Ruffolo
Harnessing Large Language Models for Disaster Management: A Survey
Zhenyu Lei, Yushun Dong, Weiyu Li et al.
Harnessing PDF Data for Improving Japanese Large Multimodal Models
Jeonghun Baek, Akiko Aizawa, Kiyoharu Aizawa
Harnessing Whisper for Prosodic Stress Analysis
Samuel S. Sohn, Sten Knutsen, Karin Stromswold
HASH-RAG: Bridging Deep Hashing with Retriever for Efficient, Fine Retrieval and Augmented Generation
Jinyu Guo, Xunlei Chen, Qiyang Xia et al.
Has Machine Translation Evaluation Achieved Human Parity? The Human Reference and the Limits of Progress
Lorenzo Proietti, Stefano Perrella, Roberto Navigli
HATA: Trainable and Hardware-Efficient Hash-Aware Top-k Attention for Scalable Large Model Inference
Ping Gong, Jiawei Yi, Shengnan Wang et al.
HateDay: Insights from a Global Hate Speech Dataset Representative of a Day on Twitter
Manuel Tonneau, Diyi Liu, Niyati Malhotra et al.
Hate Explained: Evaluating NER-Enriched Text in Human and Machine Moderation of Hate Speech
Andres Carvallo, Marcelo Mendoza, Miguel Fernandez et al.