← Optimization & Theory

Deep Learning › Optimization & Theory ›

Efficient Computing

1253 directly classified papers

Papers per year

Papers

DocMamba: Efficient Document Pre-training with State Space Model AAAI 2025

C3oT: Generating Shorter Chain-of-Thought Without Compromising Effectiveness AAAI 2025

Scaffold-BPE: Enhancing Byte Pair Encoding for Large Language Models with Simple and Effective Scaffold Token Removal AAAI 2025

Prompt Compression with Context-Aware Sentence Encoding for Fast and Improved LLM Inference AAAI 2025

3D-RPE: Enhancing Long-Context Modeling Through 3D Rotary Position Encoding AAAI 2025

Dynamic-Width Speculative Beam Decoding for LLM Inference AAAI 2025

PointBeV: A Sparse Approach for BeV Predictions CVPR 2024

ScaLearn: Simple and Highly Parameter-Efficient Task Transfer by Learning to Scale ACL 2024

Enhanced Language Model Truthfulness with Learnable Intervention and Uncertainty Expression ACL 2024

XMoE: Sparse Models with Fine-grained and Adaptive Expert Selection ACL 2024

USHER: Holistic Interference Avoidance for Resource Optimized ML Inference OSDI 2024

PocketLLM: Enabling On-Device Fine-Tuning for Personalized LLMs ACL 2024

MobileSpeech: A Fast and High-Fidelity Framework for Mobile Zero-Shot Text-to-Speech ACL 2024

Accelerating Multilingual Language Model for Excessively Tokenized Languages ACL 2024

Graph-Structured Speculative Decoding ACL 2024

An Empirical Study of Distributed Deep Learning Training on Edge (Student Abstract) AAAI 2024

MapLE: Matching Molecular Analogues Promptly with Low Computational Resources by Multi-Metrics Evaluation (Student Abstract) AAAI 2024

LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding ACL 2024

The Inhibitor: ReLU and Addition-Based Attention for Efficient Transformers (Student Abstract) AAAI 2024

A Model for Estimating the Economic Costs of Computer Vision Systems That Use Deep Learning AAAI 2024

Retaining Key Information under High Compression Ratios: Query-Guided Compressor for LLMs ACL 2024

FlexiBO: A Decoupled Cost-Aware Multi-objective Optimization Approach for Deep Neural Networks (Abstract Reprint) AAAI 2024

DeepBern-Nets: Taming the Complexity of Certifying Neural Networks Using Bernstein Polynomial Activations and Precise Bound Propagation AAAI 2024

Efficient k-Nearest-Neighbor Machine Translation with Dynamic Retrieval ACL 2024

ConsistentEE: A Consistent and Hardness-Guided Early Exiting Method for Accelerating Language Models Inference AAAI 2024