Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Optimization & Theory
Deep Learning
›
Optimization & Theory
›
Efficient Computing
1253 directly classified papers
Papers per year
2008: 1
2009: 1
2012: 2
2013: 2
2014: 10
2015: 6
2016: 14
2017: 19
2018: 59
2019: 71
2020: 113
2021: 128
2022: 162
2023: 159
2024: 225
2025: 281
Papers
Joint Pre-Encoding Representation and Structure Embedding for Efficient and Low-Resource Knowledge Graph Completion
EMNLP 2024
InfiniPot: Infinite Context Processing on Memory-Constrained LLMs
EMNLP 2024
Scalable Efficient Training of Large Language Models with Low-dimensional Projected Attention
EMNLP 2024
Scaling Laws for Linear Complexity Language Models
EMNLP 2024
Make Some Noise: Unlocking Language Model Parallel Inference Capability through Noisy Training
EMNLP 2024
TroL: Traversal of Layers for Large Language and Vision Models
EMNLP 2024
Turn Waste into Worth: Rectifying Top-k Router of MoE
EMNLP 2024
Ouroboros: Generating Longer Drafts Phrase by Phrase for Faster Speculative Decoding
EMNLP 2024
FFN-SkipLLM: A Hidden Gem for Autoregressive Decoding with Adaptive Feed Forward Skipping
EMNLP 2024
FastAdaSP: Multitask-Adapted Efficient Inference for Large Speech Language Model
EMNLP 2024
Towards Fast Multilingual LLM Inference: Speculative Decoding and Specialized Drafters
EMNLP 2024
TensorOpera Router: A Multi-Model Router for Efficient LLM Inference
EMNLP 2024
AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising
NIPS 2024
VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models
EMNLP 2024
Optical Diffusion Models for Image Generation
NIPS 2024
MOSEL: Inference Serving Using Dynamic Modality Selection
EMNLP 2024
Point Transformer V3: Simpler Faster Stronger
CVPR 2024
UniPTS: A Unified Framework for Proficient Post-Training Sparsity
CVPR 2024
You Only Need Less Attention at Each Stage in Vision Transformers
CVPR 2024
Salience DETR: Enhancing Detection Transformer with Hierarchical Salience Filtering Refinement
CVPR 2024
Look-Up Table Compression for Efficient Image Restoration
CVPR 2024
ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized Transformers
EMNLP 2024
Query-OPT: Optimizing Inference of Large Language Models via Multi-Query Instructions in Meeting Summarization
EMNLP 2024
Dataset Decomposition: Faster LLM Training with Variable Sequence Length Curriculum
NIPS 2024
SGLang: Efficient Execution of Structured Language Model Programs
NIPS 2024
<
1
…
18
19
20
…
51
>