Papers
5,479 papers found
PHPFND: Detecting Fake News via Post-Hoc Processing of LLMs Hallucination
Jinke Ma, Jiachen Ma, Wei Zhang et al.
CTX-Coder: Cross-Attention Architectures Empower LLMs for Long-Context Vulnerability Detection
Jujie Wang, Kangfeng Zheng, Bin Wu et al.
Bot Meets Shortcut: How Can LLMs Aid in Handling Unknown Invariance OOD Scenarios?
Shiyan Zheng, Herun Wan, Minnan Luo et al.
Is Symbolic Music a Specific Language? Exploring Inspiration-to-Structure Machine Composition via LLMs
Zhejing Hu, Yan Liu, Zhi Zhang et al.
Surgical AI Copilot: Energy-Based Fourier Gradient Low-Rank Adaptation for Surgical LLM Agent Reasoning and Planning
Jiayuan Huang, Runlong He, Danyal Zaman Khan et al.
AgentSense: Virtual Sensor Data Generation Using LLM Agents in Simulated Home Environments
Zikang Leng, Megha Thukral, Yaqi Liu et al.
ARCHE: A Novel Task to Evaluate LLMs on Latent Reasoning Chain Extraction
Pengze Li, Jiaqi Liu, Junchi Yu et al.
Emotion-Coherent Reasoning for Multimodal LLMs via Emotional Rationale Verifier
Hyeongseop Rha, Jeong Hun Yeo, Yeonju Kim et al.
Exploiting Synergistic Cognitive Biases to Bypass Safety in LLMs
Xikang Yang, Biyu Zhou, Xuehai Tang et al.
Investigating Prosocial Behavior Theory in LLM Agents Under Policy-Induced Inequities
Yujia Zhou, Hexi Wang, Qingyao Ai et al.
UQ-Bench: A Benchmark for Evaluating Multimodal LLMs on Underwater Image Quality Assessment
Jingchao Cao, Guo An, Feng Gao et al.
Disentangling Adversarial Prompts: A Semantic-Graph Defense for Robust LLM Security
Xiang Fang, Wanlong Fang
LLM2CLIP: Powerful Language Model Unlocks Richer Cross-Modality Representation
Weiquan Huang, Aoqi Wu, Yifan Yang et al.
AccKV: Towards Efficient Audio-Video LLMs Inference via Adaptive-Focusing and Cross-Calibration KV Cache Optimization
Zhonghua Jiang, Kui Chen, Kunxi Li et al.
R-AVST: Empowering Video-LLMs with Fine-Grained Spatio-Temporal Reasoning in Complex Audio-Visual Scenarios
Zhu Lu, Tiantian Geng, Yangye Chen et al.
Benchmarking Visual LLMs Resilience to Unanswerable Questions on Visually Rich Documents
Davide Napolitano, Luca Cagliero, Fabrizio Battiloro
KTV: Keyframes and Key Tokens Selection for Efficient Training-Free Video LLMs
Baiyang Song, Jun Peng, Yuxin Zhang et al.
SmartSight: Mitigating Hallucination in Video-LLMs Without Compromising Video Understanding via Temporal Attention Collapse
Yiming Sun, Mi Zhang, Feifei Li et al.
TIME: Temporal-Sensitive Multi-Dimensional Instruction Tuning and Robust Benchmarking for Video-LLMs
Yunxiao Wang, Meng Liu, Wenqi Liu et al.
MAGIC: Mastering Physical Adversarial Generation in Context Through Collaborative LLM Agents
Yun Xing, Nhat Chung, Jie Zhang et al.
HouseTune: Two-Stage Floorplan Generation with LLM Assistance
Ziyang Zong, Guanying Chen, Zhaohuan Zhan et al.
HISE-KT: Synergizing Heterogeneous Information Networks and LLMs for Explainable Knowledge Tracing with Meta-Path Optimization
Zhiyi Duan, Zixing Shi, Hongyu Yuan et al.
Knowledge-Enhanced Image Captioning with Adaptive Graph-based Multimodal Alignment and LLM
Guoyi Li, Die Hu, Haozhe Li et al.
DGP: A Dual-Granularity Prompting Framework for Fraud Detection with Graph-Enhanced LLMs
Yuan Li, Jun Hu, Bryan Hooi et al.
AD-FM: Multimodal LLMs for Anomaly Detection via Multi-Stage Reasoning and Fine-Grained Reward Optimization
Jingyi Liao, Yongyi Su, Rong-Cheng Tu et al.