Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Techniques
Deep Learning
›
Techniques
›
Model Architecture
4351 directly classified papers
Papers per year
2006: 1
2008: 1
2009: 1
2010: 1
2013: 5
2014: 5
2015: 22
2016: 57
2017: 124
2018: 231
2019: 415
2020: 405
2021: 617
2022: 503
2023: 619
2024: 584
2025: 505
2026: 255
Papers
TableKV: KV Cache Compression for In-Context Table Processing
ACL 2025
AutoLUT: LUT-Based Image Super-Resolution with Automatic Sampling and Adaptive Residual Learning
CVPR 2025
Deconstructing Attention: Investigating Design Principles for Effective Language Modeling
IJCNLP 2025
Beyond Skip Connection: Pooling and Unpooling Design for Elimination Singularities
AAAI 2025
MMUnlearner: Reformulating Multimodal Machine Unlearning in the Era of Multimodal Large Language Models
ACL 2025
A Layer Selection Approach to Test Time Adaptation
AAAI 2025
Interpreting the Effects of Quantization on LLMs
IJCNLP 2025
PHLoRA: data-free Post-hoc Low-Rank Adapter extraction from full-rank checkpoint
IJCNLP 2025
Interpreting the Effects of Quantization on LLMs
AACL 2025
Mitigate Position Bias in LLMs via Scaling a Single Hidden States Channel
ACL 2025
FAST: Efficient Action Tokenization for Vision-Language-Action Models
RSS 2025
PipeThreader: Software-Defined Pipelining for Efficient DNN Execution
OSDI 2025
Structural Deep Encoding for Table Question Answering
ACL 2025
Deconstructing Attention: Investigating Design Principles for Effective Language Modeling
AACL 2025
EfficientViM: Efficient Vision Mamba with Hidden State Mixer based State Space Duality
CVPR 2025
SparX: A Sparse Cross-Layer Connection Mechanism for Hierarchical Vision Mamba and Transformer Networks
AAAI 2025
NBF at SemEval-2025 Task 5: Light-Burst Attention Enhanced System for Multilingual Subject Recommendation
SEMEVAL 2025
SyntaxMind at BLP-2025 Task 1: Leveraging Attention Fusion of CNN and GRU for Hate Speech Detection
IJCNLP 2025
Enhancing Long-range Dependency with State Space Model and Kolmogorov-Arnold Networks for Aspect-based Sentiment Analysis
COLING 2025
KVFKT: A New Horizon in Knowledge Tracing with Attention-Based Embedding and Forgetting Curve Integration
COLING 2025
DMQ: Dissecting Outliers of Diffusion Models for Post-Training Quantization
ICCV 2025
Make Your Training Flexible: Towards Deployment-Efficient Video Models
ICCV 2025
CACA: Context-Aware Cross-Attention Network for Extractive Aspect Sentiment Quad Prediction
COLING 2025
Disentangle to Decay: Linear Attention with Trainable Decay Factor
COLING 2025
HUNet: Homotopy Unfolding Network for Image Compressive Sensing
CVPR 2025
<
1
…
16
17
18
…
175
>