Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Architectures
Deep Learning
›
Architectures
›
Transformers
9294 directly classified papers
Papers per year
2011: 1
2014: 2
2015: 6
2016: 17
2017: 67
2018: 156
2019: 404
2020: 769
2021: 1217
2022: 1446
2023: 1628
2024: 1574
2025: 1647
2026: 360
Papers
LoRA Soups: Merging LoRAs for Practical Skill Composition Tasks
COLING 2025
Typed-RAG: Type-Aware Decomposition of Non-Factoid Questions for Retrieval-Augmented Generation
ACL 2025
Neural Document Segmentation Using Weighted Sliding Windows with Transformer Encoders
COLING 2025
Segment-Based Attention Masking for GPTs
ACL 2025
Dll5143A@NLU of Devanagari Script Languages 2025: Detection of Hate Speech and Targets Using Hierarchical Attention Network
COLING 2025
AceMath: Advancing Frontier Math Reasoning with Post-Training and Reward Modeling
ACL 2025
byteSizedLLM@DravidianLangTech 2025: Detecting AI-Generated Product Reviews in Dravidian Languages Using XLM-RoBERTa and Attention-BiLSTM
NAACL 2025
Transformer Architectures for Vocabulary Test Item Difficulty Prediction
ACL 2025
byteSizedLLM@DravidianLangTech 2025: Fake News Detection in Dravidian Languages Using Transliteration-Aware XLM-RoBERTa and Transformer Encoder-Decoder
NAACL 2025
Seamlessly Integrating Tree-Based Positional Embeddings into Transformer Models for Source Code Representation
ACL 2025
IITR-CIOL@NLU of Devanagari Script Languages 2025: Multilingual Hate Speech Detection and Target Identification in Devanagari-Scripted Languages
COLING 2025
Unique Hard Attention: A Tale of Two Sides
ACL 2025
MUST: The First Dataset and Unified Framework for Multispectral UAV Single Object Tracking
CVPR 2025
InfiniSST: Simultaneous Translation of Unbounded Speech with Large Language Model
ACL 2025
SphereUFormer: A U-Shaped Transformer for Spherical 360 Perception
CVPR 2025
Language Repository for Long Video Understanding
ACL 2025
BWFormer: Building Wireframe Reconstruction from Airborne LiDAR Point Cloud with Transformer
CVPR 2025
Smarter, Not Harder: Training-Free Adaptive Computation for Transformers
ACL 2025
Semantic and Sequential Alignment for Referring Video Object Segmentation
CVPR 2025
From Specific-MLLMs to Omni-MLLMs: A Survey on MLLMs Aligned with Multi-modalities
ACL 2025
byteSizedLLM@DravidianLangTech 2025: Abusive Tamil and Malayalam Text targeting Women on Social Media Using XLM-RoBERTa and Attention-BiLSTM
NAACL 2025
VP-MEL: Visual Prompts Guided Multimodal Entity Linking
ACL 2025
UniSTD: Towards Unified Spatio-Temporal Learning across Diverse Disciplines
CVPR 2025
NBF at SemEval-2025 Task 5: Light-Burst Attention Enhanced System for Multilingual Subject Recommendation
ACL 2025
FocusLLM: Precise Understanding of Long Context by Dynamic Condensing
ACL 2025
<
1
…
19
20
21
…
372
>