Transformers

9294 directly classified papers

Papers

LoRA Soups: Merging LoRAs for Practical Skill Composition Tasks COLING 2025

Typed-RAG: Type-Aware Decomposition of Non-Factoid Questions for Retrieval-Augmented Generation ACL 2025

Neural Document Segmentation Using Weighted Sliding Windows with Transformer Encoders COLING 2025

Segment-Based Attention Masking for GPTs ACL 2025

Dll5143A@NLU of Devanagari Script Languages 2025: Detection of Hate Speech and Targets Using Hierarchical Attention Network COLING 2025

AceMath: Advancing Frontier Math Reasoning with Post-Training and Reward Modeling ACL 2025

byteSizedLLM@DravidianLangTech 2025: Detecting AI-Generated Product Reviews in Dravidian Languages Using XLM-RoBERTa and Attention-BiLSTM NAACL 2025

Transformer Architectures for Vocabulary Test Item Difficulty Prediction ACL 2025

byteSizedLLM@DravidianLangTech 2025: Fake News Detection in Dravidian Languages Using Transliteration-Aware XLM-RoBERTa and Transformer Encoder-Decoder NAACL 2025

Seamlessly Integrating Tree-Based Positional Embeddings into Transformer Models for Source Code Representation ACL 2025

IITR-CIOL@NLU of Devanagari Script Languages 2025: Multilingual Hate Speech Detection and Target Identification in Devanagari-Scripted Languages COLING 2025

Unique Hard Attention: A Tale of Two Sides ACL 2025

MUST: The First Dataset and Unified Framework for Multispectral UAV Single Object Tracking CVPR 2025

InfiniSST: Simultaneous Translation of Unbounded Speech with Large Language Model ACL 2025

SphereUFormer: A U-Shaped Transformer for Spherical 360 Perception CVPR 2025

Language Repository for Long Video Understanding ACL 2025

BWFormer: Building Wireframe Reconstruction from Airborne LiDAR Point Cloud with Transformer CVPR 2025

Smarter, Not Harder: Training-Free Adaptive Computation for Transformers ACL 2025

Semantic and Sequential Alignment for Referring Video Object Segmentation CVPR 2025

From Specific-MLLMs to Omni-MLLMs: A Survey on MLLMs Aligned with Multi-modalities ACL 2025

byteSizedLLM@DravidianLangTech 2025: Abusive Tamil and Malayalam Text targeting Women on Social Media Using XLM-RoBERTa and Attention-BiLSTM NAACL 2025

VP-MEL: Visual Prompts Guided Multimodal Entity Linking ACL 2025

UniSTD: Towards Unified Spatio-Temporal Learning across Diverse Disciplines CVPR 2025

NBF at SemEval-2025 Task 5: Light-Burst Attention Enhanced System for Multilingual Subject Recommendation ACL 2025

FocusLLM: Precise Understanding of Long Context by Dynamic Condensing ACL 2025