Papers
HILL: Hierarchy-aware Information Lossless Contrastive Learning for Hierarchical Text Classification
He Zhu, Junran Wu, Ruomei Liu et al.
hinoki at SemEval-2024 Task 7: Numeral-Aware Headline Generation (English)
Hinoki Crum, Steven Bethard
HIT-MI&T Lab at SemEval-2024 Task 6: DeBERTa-based Entailment Model is a Reliable Hallucination Detector
Wei Liu, Wanyao Shi, Zijian Zhang et al.
Holistic Evaluation of Large Language Models: Assessing Robustness, Accuracy, and Toxicity for Real-World Applications
David Cecchini, Arshaan Nazir, Kalyan Chakravarthy et al.
How are Prompts Different in Terms of Sensitivity?
Sheng Lu, Hendrik Schuff, Iryna Gurevych
How did we get here? Summarizing conversation dynamics
Yilun Hua, Nicholas Chernogor, Yuzhe Gu et al.
How does Multi-Task Training Affect Transformer In-Context Capabilities? Investigations with Function Classes
Harmon Bhasin, Timothy Ossowski, Yiqiao Zhong et al.
How Does Stereotype Content Differ across Data Sources?
Kathleen Fraser, Svetlana Kiritchenko, Isar Nejadgholi
How Good are Modern LLMs in Generating Relevant and High-Quality Questions at Different Bloom’s Skill Levels for Indian High School Social Science Curriculum?
Nicy Scaria, Suma Dharani Chenna, Deepak Subramani
How Interpretable are Reasoning Explanations from Prompting Large Language Models?
Yeo Wei Jie, Ranjan Satapathy, Rick Goh et al.
How Lexical is Bilingual Lexicon Induction?
Harsh Kohli, Helian Feng, Nicholas Dronen et al.
How Much Annotation is Needed to Compare Summarization Models?
Chantal Shaib, Joe Barrow, Alexa Siu et al.
How Trustworthy are Open-Source LLMs? An Assessment under Malicious Demonstrations Shows their Vulnerabilities
Lingbo Mo, Boshi Wang, Muhao Chen et al.
How Well Can a Genetic Algorithm Fine-tune Transformer Encoders? A First Approach
Vicente Ivan Sanchez Carmona, Shanshan Jiang, Bin Dong
How Well Do Large Language Models Truly Ground?
Hyunji Lee, Se June Joo, Chaeeun Kim et al.
How Well Do Tweets Represent Sub-Dialects of Egyptian Arabic?
Mai Mohamed Eida, Mayar Nassar, Jonathan Dunn
HPipe: Large Language Model Pipeline Parallelism for Long Context on Heterogeneous Cost-effective Devices
Ruilong Ma, Xiang Yang, Jingyu Wang et al.
HSE NLP Team at MEDIQA-CORR 2024 Task: In-Prompt Ensemble with Entities and Knowledge Graph for Medical Error Correction
Airat Valiev, Elena Tutubalina
HTCCN: Temporal Causal Convolutional Networks with Hawkes Process for Extrapolation Reasoning in Temporal Knowledge Graphs
Tingxuan Chen, Jun Long, Liu Yang et al.
HU at SemEval-2024 Task 8A: Can Contrastive Learning Learn Embeddings to Detect Machine-Generated Text?
Shubhashis Roy Dipta, Sadat Shahriar
Human-AI Interaction in the Age of LLMs
Diyi Yang, Sherry Tongshuang Wu, Marti A. Hearst
HumanRankEval: Automatic Evaluation of LMs as Conversational Assistants
Milan Gritta, Gerasimos Lampouras, Ignacio Iacobacci
HW-TSC 2024 Submission for the SemEval-2024 Task 1: Semantic Textual Relatedness (STR)
Mengyao Piao, Su Chang, Yuang Li et al.
HW-TSC at SemEval-2024 Task 5: Self-Eval? A Confident LLM System for Auto Prediction and Evaluation for the Legal Argument Reasoning Task
Xiaofeng Zhao, Xiaosong Qiao, Kaiwen Ou et al.
HW-TSC at SemEval-2024 Task 9: Exploring Prompt Engineering Strategies for Brain Teaser Puzzles Through LLMs
Yinglu Li, Zhao Yanqing, Min Zhang et al.