Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Models
Deep Learning
›
Models
›
Large Language Models
2678 directly classified papers
Papers per year
2014: 1
2017: 2
2018: 1
2019: 13
2020: 17
2021: 26
2022: 105
2023: 314
2024: 931
2025: 1268
Papers
The Devil is in the Prompts: Retrieval-Augmented Prompt Optimization for Text-to-Video Generation
CVPR 2025
Debiasing Multimodal Large Language Models via Noise-Aware Preference Optimization
CVPR 2025
SynTab-LLaVA: Enhancing Multimodal Table Understanding with Decoupled Synthesis
CVPR 2025
DrVideo: Document Retrieval Based Long Video Understanding
CVPR 2025
DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding
CVPR 2025
LLM-driven Multimodal and Multi-Identity Listening Head Generation
CVPR 2025
ODE: Open-Set Evaluation of Hallucinations in Multimodal Large Language Models
CVPR 2025
MLLM-as-a-Judge for Image Safety without Human Labeling
CVPR 2025
VidHalluc: Evaluating Temporal Hallucinations in Multimodal Large Language Models for Video Understanding
CVPR 2025
CAD-Llama: Leveraging Large Language Models for Computer-Aided Design Parametric 3D Model Generation
CVPR 2025
Teaching Large Language Models to Regress Accurate Image Quality Scores Using Score Distribution
CVPR 2025
Unveiling Visual Perception in Language Models: An Attention Head Analysis Approach
CVPR 2025
Coarse Correspondences Boost Spatial-Temporal Reasoning in Multimodal Language Model
CVPR 2025
Dispider: Enabling Video LLMs with Active Real-Time Interaction via Disentangled Perception, Decision, and Reaction
CVPR 2025
Can Machines Understand Composition? Dataset and Benchmark for Photographic Image Composition Embedding and Understanding
CVPR 2025
Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering
CVPR 2025
The Devil is in Temporal Token: High Quality Video Reasoning Segmentation
CVPR 2025
PhyT2V: LLM-Guided Iterative Self-Refinement for Physics-Grounded Text-to-Video Generation
CVPR 2025
Distilled Prompt Learning for Incomplete Multimodal Survival Prediction
CVPR 2025
Self-Improvement in Multimodal Large Language Models: A Survey
EMNLP 2025
Confusion is the Final Barrier: Rethinking Jailbreak Evaluation and Investigating the Real Misuse Threat of LLMs
EMNLP 2025
Attention Consistency for LLMs Explanation
EMNLP 2025
A Survey on Sparse Autoencoders: Interpreting the Internal Mechanisms of Large Language Models
EMNLP 2025
NLoRA: Nyström-Initiated Low-Rank Adaptation for Large Language Models
EMNLP 2025
Multilingual Verbalisation of Knowledge Graphs
EMNLP 2025
<
1
…
44
45
46
…
108
>