Deep Learning › Techniques ›

Model Architecture

4351 directly classified papers

Papers per year

Papers

TableKV: KV Cache Compression for In-Context Table Processing ACL 2025

AutoLUT: LUT-Based Image Super-Resolution with Automatic Sampling and Adaptive Residual Learning CVPR 2025

Deconstructing Attention: Investigating Design Principles for Effective Language Modeling IJCNLP 2025

Beyond Skip Connection: Pooling and Unpooling Design for Elimination Singularities AAAI 2025

MMUnlearner: Reformulating Multimodal Machine Unlearning in the Era of Multimodal Large Language Models ACL 2025

A Layer Selection Approach to Test Time Adaptation AAAI 2025

Interpreting the Effects of Quantization on LLMs IJCNLP 2025

PHLoRA: data-free Post-hoc Low-Rank Adapter extraction from full-rank checkpoint IJCNLP 2025

Interpreting the Effects of Quantization on LLMs AACL 2025

Mitigate Position Bias in LLMs via Scaling a Single Hidden States Channel ACL 2025

FAST: Efficient Action Tokenization for Vision-Language-Action Models RSS 2025

PipeThreader: Software-Defined Pipelining for Efficient DNN Execution OSDI 2025

Structural Deep Encoding for Table Question Answering ACL 2025

Deconstructing Attention: Investigating Design Principles for Effective Language Modeling AACL 2025

EfficientViM: Efficient Vision Mamba with Hidden State Mixer based State Space Duality CVPR 2025

SparX: A Sparse Cross-Layer Connection Mechanism for Hierarchical Vision Mamba and Transformer Networks AAAI 2025

NBF at SemEval-2025 Task 5: Light-Burst Attention Enhanced System for Multilingual Subject Recommendation SEMEVAL 2025

SyntaxMind at BLP-2025 Task 1: Leveraging Attention Fusion of CNN and GRU for Hate Speech Detection IJCNLP 2025

Enhancing Long-range Dependency with State Space Model and Kolmogorov-Arnold Networks for Aspect-based Sentiment Analysis COLING 2025

KVFKT: A New Horizon in Knowledge Tracing with Attention-Based Embedding and Forgetting Curve Integration COLING 2025

DMQ: Dissecting Outliers of Diffusion Models for Post-Training Quantization ICCV 2025

Make Your Training Flexible: Towards Deployment-Efficient Video Models ICCV 2025

CACA: Context-Aware Cross-Attention Network for Extractive Aspect Sentiment Quad Prediction COLING 2025

Disentangle to Decay: Linear Attention with Trainable Decay Factor COLING 2025

HUNet: Homotopy Unfolding Network for Image Compressive Sensing CVPR 2025