Papers
TPR: A Training Procedure Representation to Augment XR Simulations with LLMs
Michael Guevarra, Christabel Wayllace, Srijita Das et al.
GPTKB v1.5: A Massive Knowledge Base for Exploring Factual LLM Knowledge
Yujia Hu, Tuan-Phong Nguyen, Shrestha Ghosh et al.
RefLens: End-to-End Evidence-Grounded Citation Verification with LLM Agents
SeungHoo Lee, JuneHyoung Kwon, Jooweon Choi et al.
Chatsparent: An Interactive System for Detecting and Mitigating Cognitive Fatigue in LLMs
Riju Marwah, Vishal Pallagani, Ritvik Garimella et al.
KnowThyself: An Agentic Assistant for LLM Interpretability
Suraj Prasai, Mengnan Du, Ying Zhang et al.
AuditAgent: LLM Agent for Risks Auditing in Recommender Systems
Du Su, Zhenxing Chen, Shilong Zhao et al.
Magnol.AI Copilot: Multimodal LLMs for Conversational Insight Generation
Hui Zhang, Guangchen Ruan, Hui Xiao
Beyond the Black Box: Demystifying Multi-Turn LLM Reasoning with VISTA
Yiran Zhang, Mingyang Lin, Mark Dras et al.
Testing Simulation Theory in LLMs’ Theory of Mind
Koshiro Aoki, Daisuke Kawahara
Adaptive Coopetition: Leveraging Coarse Verifier Signals for Resilient Multi-Agent LLM Reasoning
Wendy Yaqiao Liu, Rui Jerry Huang, Anastasia Miin et al.
Two Step Automatic Post Editing of Patent Machine Translation based on Pre-trained Encoder Models and LLMs
Kosei Buma, Takehito Utsuro, Masaaki Nagata
Are LLMs Good for Semantic Role Labeling via Question Answering?: A Preliminary Analysis
Ritwik Raghav, Abhik Jana
Visualizing and Benchmarking LLM Factual Hallucination Tendencies via Internal State Analysis and Clustering
Nathan Mao, Varun Kaushik, Shreya Shivkumar et al.
VariantBench: A Framework for Evaluating LLMs on Justifications for Genetic Variant Interpretation
Humair Basharat, Simon Plotkin, Charlotte Le et al.
Tutorial on Trustworthy Legal Text Processing with LLMs: Retrieval, Rhetorical Roles, Summarization, and Trustworthy Generation
Anand Kumar M, Sangeetha S, Manikandan R et al.
Swallowing the Poison Pills: Insights from Vulnerability Disparity Among LLMs
Peng Yifeng, Zhizheng Wu, Chen Chen
LLMs as Architects and Critics for Multi-Source Opinion Summarization
Anuj Attri, Arnav Attri, Suman Banerjee et al.
Atomic Calibration of LLMs in Long-Form Generations
Caiqi Zhang, Ruihan Yang, Zhisong Zhang et al.
Estimating Causal Effects of Text Interventions Leveraging LLMs
Siyi Guo, Myrl G Marmarelis, Fred Morstatter et al.
HalluCounter: Reference-free LLM Hallucination Detection in the Wild!
Ashok Urlana, Gopichand Kanumolu, Charaka Vinayak Kumar et al.
Smruti: Grammatical Error Correction for Gujarati using LLMs with Non-Parametric Memory
Vrund Dobariya, Jatayu Baxi, Bhavika Gambhava et al.
LLM in the Loop: Creating the ParaDeHate Dataset for Hate Speech Detoxification
Shuzhou Yuan, Ercong Nie, Lukas Kouba et al.
Emotion-Aware Dysarthric Speech Reconstruction: LLMs and Multimodal Evaluation with MCDS
Kaushal Attaluri, Radhika Mamidi, Sireesha Chittepu et al.
Illusions of Relevance: Arbitrary Content Injection Attacks Deceive Retrievers, Rerankers, and LLM Judges
Manveer Singh Tamber, Jimmy Lin
Learning from Hallucinations: Mitigating Hallucinations in LLMs via Internal Representation Intervention
Sora Kadotani, Kosuke Nishida, Kyosuke Nishida