Papers
Is This LLM Library Learning? Evaluation Must Account For Compute and Behaviour
Ian Berlot-Attwell, Tobias Sesterhenn, Frank Rudzicz et al.
Is Word Sense Disambiguation Dead in the LLM Era?
Roberto Navigli
Is Your (Reasoning) Multimodal Language Model Vulnerable Toward Distractions?
Ming Liu, Hao Chen, Jindong Wang et al.
iTAG: Inverse Design for Natural Text Generation with Accurate Causal Graph Annotations
Wenshuo Wang, Boyu Cao, Nan Zhuang et al.
Iterative Dual-Model Alignment for Story Evaluation
Bruce Qin, Dan Goldwasser
Iterative Multi-Granular RAG with Contextual Hierarchical Graph
Yanli Hu, Teng Liu, Zhuangyi Zhou et al.
Iterative Structured Pruning for Large Language Models with Multi-Domain Calibration
Guangxin Wu, Hao Zhang, Zhang Zhibin et al.
IterCOMP: Reasoning-aware Adaptive Prompt Compression for Multi-hop Question Answering
JungMin Yun, YoungBin Kim
ITPP: Learning Disentangled Event Dynamics in Marked Temporal Point Processes
Wang-Tao Zhou, Zhao Kang, Ke Yan et al.
It’s All About the Confidence: An Unsupervised Approach for Multilingual Historical Entity Linking using Large Language Models
Cristian Santini, Marieke van Erp, Mehwish Alam
ITSELF: Attention Guided Fine-Grained Alignment for Vision-Language Retrieval
Tien-Huy Nguyen, Huu-Loc Tran, Thanh Duc Ngo
It’s High Time: A Survey of Temporal Question Answering
Bhawna Piryani, Abdelrahman Abdallah, Jamshid Mozafari et al.
It’s Not What You Say, It’s How You Say It: Evaluating LLM Responses to Expressions of Belief
Kevin Du, Clara Kümpel, Michelle Wastl et al.
ITUNLP2 at MWE-2026 AdMIRe 2: Modular Zero-Shot Pipelines for Multimodal Idiom Grounding and Ranking
Özge Umut, Bora Şenceylan
ITUNLP at MWE-2026 AdMIRe 2: A Zero-Shot LLM Pipeline for Multimodal Idiom Understanding and Ranking
Atakan Site, Oğuz Ali Arslan, Gülşen Eryiğit
IUQ: Interrogative Uncertainty Quantification for Long-Form Large Language Model Generation
Haozhi Fan, Jinhao Duan, Kaidi Xu
IYKYK: Using language models to decode extremist cryptolects
Christine de Kock, Arij Riabi, Zeerak Talat et al.
J4R: Learning to Judge with Equivalent Initial State Group Relative Policy Optimization
Austin Xu, Yilun Zhou, Xuan-Phi Nguyen et al.
Jailbreaking Multimodal Large Language Models using Multi-Clip Video
Choongwon Kang, Seungjong Sun, Hyunmin Jun et al.
Jailbreaking Safeguarded Text-to-Image Models via Large Language Models
Zhengyuan Jiang, Yuepeng Hu, Yuchen Yang et al.
Jailbreaks as Inference-Time Alignment: A Framework for Understanding Safety Failures in LLMs
James Beetham, Souradip Chakraborty, Mengdi Wang et al.
Jailbreak-Zero: A Path to Pareto Optimal Red Teaming for Large Language Models
Kai Hu, Abhinav Aggarwal, Mehran Khodabandeh et al.
Jakiro: Boosting Speculative Decoding via Decoupled MoE
Haiduo Huang, Fuwei Yang, Zhenhua Liu et al.
JanusMM: A Benchmark for Self-Deprecation Understanding in Real-World Multimodal Conversations
Xinyi Xu, Bingguang Hao, Yongyi Xiong et al.
JARVIS or Ultron? A Survey on the Safety and Security Threats of Computer-Using Agents
Ada Chen, Yongjiang Wu, Junyuan Zhang et al.