← Optimization & Theory

Deep Learning › Optimization & Theory ›

Efficient Computing

1253 directly classified papers

Papers per year

Papers

Joint Pre-Encoding Representation and Structure Embedding for Efficient and Low-Resource Knowledge Graph Completion EMNLP 2024

InfiniPot: Infinite Context Processing on Memory-Constrained LLMs EMNLP 2024

Scalable Efficient Training of Large Language Models with Low-dimensional Projected Attention EMNLP 2024

Scaling Laws for Linear Complexity Language Models EMNLP 2024

Make Some Noise: Unlocking Language Model Parallel Inference Capability through Noisy Training EMNLP 2024

TroL: Traversal of Layers for Large Language and Vision Models EMNLP 2024

Turn Waste into Worth: Rectifying Top-k Router of MoE EMNLP 2024

Ouroboros: Generating Longer Drafts Phrase by Phrase for Faster Speculative Decoding EMNLP 2024

FFN-SkipLLM: A Hidden Gem for Autoregressive Decoding with Adaptive Feed Forward Skipping EMNLP 2024

FastAdaSP: Multitask-Adapted Efficient Inference for Large Speech Language Model EMNLP 2024

Towards Fast Multilingual LLM Inference: Speculative Decoding and Specialized Drafters EMNLP 2024

TensorOpera Router: A Multi-Model Router for Efficient LLM Inference EMNLP 2024

AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising NIPS 2024

VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models EMNLP 2024

Optical Diffusion Models for Image Generation NIPS 2024

MOSEL: Inference Serving Using Dynamic Modality Selection EMNLP 2024

Point Transformer V3: Simpler Faster Stronger CVPR 2024

UniPTS: A Unified Framework for Proficient Post-Training Sparsity CVPR 2024

You Only Need Less Attention at Each Stage in Vision Transformers CVPR 2024

Salience DETR: Enhancing Detection Transformer with Hierarchical Salience Filtering Refinement CVPR 2024

Look-Up Table Compression for Efficient Image Restoration CVPR 2024

ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized Transformers EMNLP 2024

Query-OPT: Optimizing Inference of Large Language Models via Multi-Query Instructions in Meeting Summarization EMNLP 2024

Dataset Decomposition: Faster LLM Training with Variable Sequence Length Curriculum NIPS 2024

SGLang: Efficient Execution of Structured Language Model Programs NIPS 2024