conftrace
_
Papers
Trends
Conferences
Explore
Authors
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
← Architectures
Deep Learning
›
Architectures
›
Transformers
9,294 papers
Papers per year
2011: 1
2014: 2
2015: 6
2016: 17
2017: 67
2018: 156
2019: 404
2020: 769
2021: 1217
2022: 1446
2023: 1628
2024: 1574
2025: 1647
2026: 360
Papers
Joint Visual Semantic Reasoning: Multi-Stage Decoder for Text Recognition
ICCV 2021
Exploring Relational Context for Multi-Task Dense Prediction
ICCV 2021
Going Deeper With Image Transformers
ICCV 2021
UltraPose: Synthesizing Dense Pose With 1 Billion Points by Human-Body Decoupling 3D Model
ICCV 2021
Multiscale Vision Transformers
ICCV 2021
GroupFormer: Group Activity Recognition With Clustered Spatial-Temporal Transformer
ICCV 2021
Conditional DETR for Fast Training Convergence
ICCV 2021
MDETR - Modulated Detection for End-to-End Multi-Modal Understanding
ICCV 2021
CR-Fill: Generative Image Inpainting With Auxiliary Contextual Reconstruction
ICCV 2021
STVGBert: A Visual-Linguistic Transformer Based Framework for Spatio-Temporal Video Grounding
ICCV 2021
Voxel Transformer for 3D Object Detection
ICCV 2021
Toward Spatially Unbiased Generative Models
ICCV 2021
Rethinking Spatial Dimensions of Vision Transformers
ICCV 2021
Context-Aware Scene Graph Generation With Seq2Seq Transformers
ICCV 2021
Learning Multi-Scene Absolute Pose Regression With Transformers
ICCV 2021
An Empirical Study of Training Self-Supervised Vision Transformers
ICCV 2021
On the Robustness of Vision Transformers to Adversarial Examples
ICCV 2021
HiFT: Hierarchical Feature Transformer for Aerial Tracking
ICCV 2021
Tokens-to-Token ViT: Training Vision Transformers From Scratch on ImageNet
ICCV 2021
SOTR: Segmenting Objects With Transformers
ICCV 2021
Frequency-Aware Spatiotemporal Transformers for Video Inpainting Detection
ICCV 2021
Rethinking and Improving Relative Position Encoding for Vision Transformer
ICCV 2021
Action-Conditioned 3D Human Motion Synthesis With Transformer VAE
ICCV 2021
Detecting Human-Object Relationships in Videos
ICCV 2021
Episodic Transformer for Vision-and-Language Navigation
ICCV 2021
<
1
…
296
297
298
…
372
>