Papers
Reasoning-Enhanced Retrieval for Misconception Prediction: A RAG-Inspired Approach with LLMs
Chaudhary Divya, Chang Xue, Shaorui Sun
A benchmark for end-to-end zero-shot biomedical relation extraction with LLMs: experiments with OpenAI models
Aviv Brokman, Xuguang Ai, Yuhang Jiang et al.
Bridging the Gap: Instruction-Tuned LLMs for Scientific Named Entity Recognition
Necva Bölücü, Maciej Rybinski, Stephen Wan
A Hybrid LLM and Supervised Model Pipeline for Polymer Property Extraction from Tables in Scientific Literature
Van-Thuy Phi, Dinh-Truong Do, Hoang-An Trieu et al.
Structured Outputs in Prompt Engineering: Enhancing LLM Adaptability on Counterintuitive Instructions
Jingjing Ye, Song Bai, Zhenyang Li et al.
Citation Drift: Measuring Reference Stability in Multi-Turn LLM Conversations
Gokul Srinath Seetha Ram
CycleDistill: Bootstrapping Machine Translation using LLMs with Cyclical Distillation
Deepon Halder, Thanmay Jayakumar, Raj Dabre
Speech Prefix-Tuning with RNNT Loss for Improving LLM Predictions
Murali Karthick Baskar, Andrew Rosenberg, Bhuvana Ramabhadran et al.
Comparing Discrete and Continuous Space LLMs for Speech Recognition
Yaoxun Xu, Shi-Xiong Zhang, Jianwei Yu et al.
From Text to Emotion: Unveiling the Emotion Annotation Capabilities of LLMs
Minxue Niu, Mimansa Jaiswal, Emily Mower Provost
Synthesizing Long-Form Speech merely from Sentence-Level Corpus with Content Extrapolation and LLM Contextual Enrichment
Shijie Lai, Minglu He, Zijing Zhao et al.
SALSA: Speedy ASR-LLM Synchronous Aggregation
Ashish Mittal, Darshan Prabhu, Sunita Sarawagi et al.
Non-Linear Inference Time Intervention: Improving LLM Truthfulness
Jakub Hoscilowicz, Adam Wiacek, Jan Chojnacki et al.
Enhancing Multimodal Emotion Recognition through ASR Error Compensation and LLM Fine-Tuning
Jehyun Kyung, Serin Heo, Joon-Hyuk Chang
Can LLMs’ Tuning Methods Work in Medical Multimodal Domain?
Jiawei Chen, Yue Jiang, Dingkang Yang et al.
HiA: Towards Chinese Multimodal LLMs for Comparative High-Resolution Joint Diagnosis
Xinpeng Ding, Yongqiang Chu, Renjie Pi et al.
Insight: A Multi-Modal Diagnostic Pipeline using LLMs for Ocular Surface Disease Diagnosis
Chun-Hsiao Yeh, Jiayun Wang, Andrew D. Graham et al.
PitVQA: Image-grounded Text Embedding LLM for Visual Question Answering in Pituitary Surgery
Runlong He, Mengya Xu, Adrito Das et al.
Confidence Calibration for Multimodal LLMs: An Empirical Study through Medical VQA
Yuetian Du, Yucheng Wang, Ming Kong et al.
DentEval: Fine-tuning-Free Expert-Aligned Assessment in Dental Education via LLM Agents
Xinyu Deng, Vesna Miletic, Elvis Trinh et al.
MCA-RG: Enhancing LLMs with Medical Concept Alignment for Radiology Report Generation
Qilong Xing, Zikai Song, Youjia Zhang et al.
More performant and scalable: Rethinking contrastive vision-language pre-training of radiology in the LLM era
Yingtai Li, Haoran Lai, Xiaoqian Zhou et al.
Pre-Trained LLM is a Semantic-Aware and Generalizable Segmentation Booster
Fenghe Tang, Wenxin Ma, Zhiyang He et al.
Unleashing the Power of LLMs for Medical Video Answer Localization
Junbin Xiao, Qingyun Li, Yusen Yang et al.