Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Architectures
Deep Learning
›
Architectures
›
Transformers
9294 directly classified papers
Papers per year
2011: 1
2014: 2
2015: 6
2016: 17
2017: 67
2018: 156
2019: 404
2020: 769
2021: 1217
2022: 1446
2023: 1628
2024: 1574
2025: 1647
2026: 360
Papers
QJL: 1-Bit Quantized JL Transform for KV Cache Quantization with Zero Overhead
AAAI 2025
TRANSFORMER EXPLAINER: Interactive Learning of Text-Generative Models
AAAI 2025
EchoDiffusion: Waveform Conditioned Diffusion Models for Echo-Based Depth Estimation
AAAI 2025
Harnessing Event Sensory Data for Error Pattern Prediction in Vehicles: A Language Model Approach
AAAI 2025
VCRMNER: Visual Cue Refinement in Multimodal NER using CLIP Prompts
COLING 2025
Enhancing Masked Time-Series Modeling via Dropping Patches
AAAI 2025
CUNI at WMT25 General Translation Task
EMNLP 2025
Emergence of symbolic abstraction heads for in-context learning in large language models
COLING 2025
TabGLM: Tabular Graph Language Model for Learning Transferable Representations Through Multi-Modal Consistency Minimization
AAAI 2025
A Scalable and Effective Alternative to Graph Transformers
AAAI 2025
IndoNLP 2025 Shared Task: Romanized Sinhala to Sinhala Reverse Transliteration Using BERT
COLING 2025
Light3R-SfM: Towards Feed-forward Structure-from-Motion
CVPR 2025
From Scarcity to Capability: Empowering Fake News Detection in Low-Resource Languages with LLMs
COLING 2025
MSVIT: Improving Spiking Vision Transformer Using Multi-scale Attention Fusion
IJCAI 2025
Sinhala Transliteration: A Comparative Analysis Between Rule-based and Seq2Seq Approaches
COLING 2025
Split Adaptation for Pre-trained Vision Transformers
CVPR 2025
Adversarial Attention Perturbations for Large Object Detection Transformers
ICCV 2025
Low-Resource Interlinear Translation: Morphology-Enhanced Neural Models for Ancient Greek
COLING 2025
Identifying Aggression and Offensive Language in Code-Mixed Tweets: A Multi-Task Transfer Learning Approach
COLING 2025
Multi-Omics Analysis for Cancer Subtype Inference via Unrolling Graph Smoothness Priors
IJCAI 2025
Bias Detection in Media: Traditional Models vs. Transformers in Analyzing Social Media Coverage of the Israeli-Gaza Conflict
COLING 2025
Multilingual Propaganda Detection: Exploring Transformer-Based Models mBERT, XLM-RoBERTa, and mT5
COLING 2025
BBPOS: BERT-based Part-of-Speech Tagging for Uzbek
COLING 2025
QuantileFormer: Probabilistic Time Series Forecasting with a Pattern-Mixture Decomposed VAE Transformer
IJCAI 2025
Vision-Language Embodiment for Monocular Depth Estimation
CVPR 2025
<
1
…
27
28
29
…
372
>