Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Optimization & Theory
Deep Learning
›
Optimization & Theory
›
Efficient Computing
1253 directly classified papers
Papers per year
2008: 1
2009: 1
2012: 2
2013: 2
2014: 10
2015: 6
2016: 14
2017: 19
2018: 59
2019: 71
2020: 113
2021: 128
2022: 162
2023: 159
2024: 225
2025: 281
Papers
DocMamba: Efficient Document Pre-training with State Space Model
AAAI 2025
C3oT: Generating Shorter Chain-of-Thought Without Compromising Effectiveness
AAAI 2025
Scaffold-BPE: Enhancing Byte Pair Encoding for Large Language Models with Simple and Effective Scaffold Token Removal
AAAI 2025
Prompt Compression with Context-Aware Sentence Encoding for Fast and Improved LLM Inference
AAAI 2025
3D-RPE: Enhancing Long-Context Modeling Through 3D Rotary Position Encoding
AAAI 2025
Dynamic-Width Speculative Beam Decoding for LLM Inference
AAAI 2025
PointBeV: A Sparse Approach for BeV Predictions
CVPR 2024
ScaLearn: Simple and Highly Parameter-Efficient Task Transfer by Learning to Scale
ACL 2024
Enhanced Language Model Truthfulness with Learnable Intervention and Uncertainty Expression
ACL 2024
XMoE: Sparse Models with Fine-grained and Adaptive Expert Selection
ACL 2024
USHER: Holistic Interference Avoidance for Resource Optimized ML Inference
OSDI 2024
PocketLLM: Enabling On-Device Fine-Tuning for Personalized LLMs
ACL 2024
MobileSpeech: A Fast and High-Fidelity Framework for Mobile Zero-Shot Text-to-Speech
ACL 2024
Accelerating Multilingual Language Model for Excessively Tokenized Languages
ACL 2024
Graph-Structured Speculative Decoding
ACL 2024
An Empirical Study of Distributed Deep Learning Training on Edge (Student Abstract)
AAAI 2024
MapLE: Matching Molecular Analogues Promptly with Low Computational Resources by Multi-Metrics Evaluation (Student Abstract)
AAAI 2024
LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding
ACL 2024
The Inhibitor: ReLU and Addition-Based Attention for Efficient Transformers (Student Abstract)
AAAI 2024
A Model for Estimating the Economic Costs of Computer Vision Systems That Use Deep Learning
AAAI 2024
Retaining Key Information under High Compression Ratios: Query-Guided Compressor for LLMs
ACL 2024
FlexiBO: A Decoupled Cost-Aware Multi-objective Optimization Approach for Deep Neural Networks (Abstract Reprint)
AAAI 2024
DeepBern-Nets: Taming the Complexity of Certifying Neural Networks Using Bernstein Polynomial Activations and Precise Bound Propagation
AAAI 2024
Efficient k-Nearest-Neighbor Machine Translation with Dynamic Retrieval
ACL 2024
ConsistentEE: A Consistent and Hardness-Guided Early Exiting Method for Accelerating Language Models Inference
AAAI 2024
<
1
…
11
12
13
…
51
>