Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Application Areas
Machine Learning
›
Application Areas
›
Efficient Computing
6876 directly classified papers
Papers per year
2003: 3
2004: 2
2005: 3
2006: 3
2007: 8
2008: 10
2009: 7
2010: 11
2011: 12
2012: 15
2013: 53
2014: 48
2015: 55
2016: 97
2017: 135
2018: 233
2019: 369
2020: 502
2021: 664
2022: 741
2023: 1039
2024: 1063
2025: 1395
2026: 408
Papers
Accelerating LLM Inference Throughput via Asynchronous KV Cache Prefetching
AAAI 2026
CMedBench: A Comprehensive Benchmark for Efficient Medical Large Language Models
AAAI 2026
Talon: Breaking the Synchronization Barrier in Speculative Decoding with Hybrid Model-based and Retrieve-based Drafting
AAAI 2026
DiffBench Meets DiffAgent: End-to-End LLM-Driven Diffusion Acceleration Code Generation
AAAI 2026
HALO: Hardware-Aware Quantization with Low Critical-Path-Delay Weights for LLM Acceleration
AAAI 2026
LatentLLM: Activation-Aware Transform to Multi-Head Latent Attention
AAAI 2026
Learnable Permutation for Structured Sparsity on Transformer Models
AAAI 2026
CasMoE: A Cascaded Framework for Efficient MoE Inference on Resource-constrained Devices
AAAI 2026
MemeBQ:Memory Efficient Binary Quantization of LLMs
AAAI 2026
Efficient Plug-and-Play Weight Refinement for Sparse Large Models
AAAI 2026
AMS-KV: Adaptive KV Caching in Multi-Scale Visual Autoregressive Transformers
AAAI 2026
AdaSpec: Adaptive Multilingual Speculative Decoding with Self-Synthesized Language-Aware Training and Vocabulary Simplification
AAAI 2026
Mnemosyne: Accelerating Multi-Hop Question Answering via Cache Hit Order Fitting
AAAI 2026
Towards Better Correctness and Efficiency in Code Generation
AAAI 2026
MoSE: Hierarchical Self-Distillation Enhances Early Layer Embeddings
AAAI 2026
Confidence-Guided Stepwise Model Routing for Cost-Efficient Reasoning
AAAI 2026
Res-Bench: Benchmarking the Robustness of Multimodal Large Language Models to Dynamic Resolution Input
AAAI 2026
AdaFuse: Accelerating Dynamic Adapter Inference via Token-Level Pre-Gating and Fused Kernel Optimization
AAAI 2026
OSVBench: Benchmarking LLMs on Specification Generation Tasks for Operating System Verification
AAAI 2026
ZipLJP: Zipped Information Processor for Legal Judgment Prediction
AAAI 2026
Fine-Tuned LLMs Know They Don’t Know: A Parameter-Efficient Approach to Recovering Honesty
AAAI 2026
Sparse-dLLM: Accelerating Diffusion LLMs with Dynamic Cache Eviction
AAAI 2026
PRGB Benchmark: A Robust Placeholder-Assisted Algorithm for Benchmarking Retrieval-Augmented Generation
AAAI 2026
Prune4Web: DOM Tree Pruning Programming for Web Agent
AAAI 2026
QuEPT: Quantized Elastic Precision Transformers with One-Shot Calibration for Multi-Bit Switching
AAAI 2026
<
1
…
15
16
17
…
276
>