Papers
Decoder-Only LLMs can be Masked Auto-Encoders
Dan Qiao, Yuan Gao, Zheming Yang et al.
Mitigating Posterior Salience Attenuation in Long-Context LLMs with Positional Contrastive Decoding
Zikai Xiao, Ziyang Wang, Wen Ma et al.
Sparse-to-Dense: A Free Lunch for Lossless Acceleration of Video Understanding in LLMs
Xuan Zhang, Cunxiao Du, Sicheng Yu et al.
LLMs syntactically adapt their language use to their conversational partner
Florian Kandra, Vera Demberg, Alexander Koller
Revisiting LLMs as Zero-Shot Time Series Forecasters: Small Noise Can Break Large Models
Junwoo Park, Hyuck Lee, Dohyun Lee et al.
Leveraging Self-Attention for Input-Dependent Soft Prompting in LLMs
Ananth Muppidi, Abhilash Nandy, Sambaran Bandyopadhyay
Rethinking Semantic Parsing for Large Language Models: Enhancing LLM Performance with Semantic Hints
Kaikai An, Shuzheng Si, Helan Hu et al.
Can LLMs Generate High-Quality Test Cases for Algorithm Problems? TestCase-Eval: A Systematic Evaluation of Fault Coverage and Exposure
Zheyuan Yang, Zexi Kuang, Xue Xia et al.
Multi-Programming Language Sandbox for LLMs
Shihan Dou, Jiazheng Zhang, Jianxiang Zang et al.
AutoAlign: Get Your LLM Aligned with Minimal Annotations
Xinyu Lu, Dong Xu, Chunkang Zhang et al.
ZeroSumEval: An Extensible Framework For Scaling LLM Evaluation with Inter-Model Competition
Hisham Abdullah Alyahya, Haidar Khan, Yazeed Alnumay et al.
CodeArena: A Collective Evaluation Platform for LLM Code Generation
Mingzhe Du, Anh Tuan Luu, Bin Ji et al.
Value Compass Benchmarks: A Comprehensive, Generative and Self-Evolving Platform for LLMs’ Value Evaluation
Jing Yao, Xiaoyuan Yi, Shitong Duan et al.
HYPEROFA: Expanding LLM Vocabulary to New Languages via Hypernetwork-Based Embedding Initialization
Enes Özeren, Yihong Liu, Hinrich Schuetze
Pun2Pun: Benchmarking LLMs on Textual-Visual Chinese-English Pun Translation via Pragmatics Model and Linguistic Reasoning
Yiran Rex Ma, Shan Huang, Yuting Xu et al.
Quantifying the Influence of Irrelevant Contexts on Political Opinions Produced by LLMs
Samuele D’Avenia, Valerio Basile
Making Sense of Korean Sentences: A Comprehensive Evaluation of LLMs through KoSEnd Dataset
Seunguk Yu, Kyeonghyun Kim, JungMin Yun et al.
A Dual-Layered Evaluation of Geopolitical and Cultural Bias in LLMs
Sean Kim, Hyuhng Joon Kim
Evaluating Credibility and Political Bias in LLMs for News Outlets in Bangladesh
Tabia Tanzin Prama, Md. Saiful Islam
Mitigating Hallucination by Integrating Knowledge Graphs into LLM Inference – a Systematic Literature Review
Robin Wagner, Emanuel Kitzelmann, Ingo Boersch
Learning and Enforcing Context-Sensitive Control for LLMs
Mohammad Albinhassan, Pranava Madhyastha, Mark Law et al.
When Will the Tokens End? Graph-Based Forecasting for LLMs Output Length
Grzegorz Piotrowski, Mateusz Bystroński, Mikołaj Hołysz et al.
Only for the Unseen Languages, Say the Llamas: On the Efficacy of Language Adapters for Cross-lingual Transfer in English-centric LLMs
Julian Schlenker, Jenny Kunz, Tatiana Anikina et al.