Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Core AI
Artificial Intelligence
›
Core AI
›
Efficient Computing
596 directly classified papers
Papers per year
2007: 2
2009: 1
2011: 1
2014: 2
2016: 1
2017: 4
2018: 7
2019: 20
2020: 47
2021: 53
2022: 70
2023: 60
2024: 140
2025: 183
2026: 5
Papers
JumpCoder: Go Beyond Autoregressive Coder via Online Modification
ACL 2024
Carbon Footprint Reduction for Sustainable Data Centers in Real-Time
AAAI 2024
CQIL: Inference Latency Optimization with Concurrent Computation of Quasi-Independent Layers
ACL 2024
LLM in a flash: Efficient Large Language Model Inference with Limited Memory
ACL 2024
Attention-Driven Training-Free Efficiency Enhancement of Diffusion Models
CVPR 2024
AdaSwitch: Adaptive Switching between Small and Large Agents for Effective Cloud-Local Collaborative Learning
EMNLP 2024
QKFormer: Hierarchical Spiking Transformer using Q-K Attention
NIPS 2024
SnapKV: LLM Knows What You are Looking for Before Generation
NIPS 2024
SIRIUS : Contexual Sparisty with Correction for Efficient LLMs
NIPS 2024
Provable Tempered Overfitting of Minimal Nets and Typical Nets
NIPS 2024
FasterDiT: Towards Faster Diffusion Transformers Training without Architecture Modification
NIPS 2024
LORS: Low-rank Residual Structure for Parameter-Efficient Network Stacking
CVPR 2024
BOLD: Boolean Logic Deep Learning
NIPS 2024
Twists, Humps, and Pebbles: Multilingual Speech Recognition Models Exhibit Gender Performance Gaps
EMNLP 2024
Towards Fast Multilingual LLM Inference: Speculative Decoding and Specialized Drafters
EMNLP 2024
Towards Accurate Post-training Quantization for Diffusion Models
CVPR 2024
TC-LIF: A Two-Compartment Spiking Neuron Model for Long-Term Sequential Modelling
AAAI 2024
Colour Passing Revisited: Lifted Model Construction with Commutative Factors
AAAI 2024
EAGLE-2: Faster Inference of Language Models with Dynamic Draft Trees
EMNLP 2024
MoPE-CLIP: Structured Pruning for Efficient Vision-Language Models with Module-wise Pruning Error Metric
CVPR 2024
Optimized Speculative Sampling for GPU Hardware Accelerators
EMNLP 2024
Ladder: Enabling Efficient Low-Precision Deep Learning Computing through Hardware-aware Tensor Transformation
OSDI 2024
LeMeViT: Efficient Vision Transformer with Learnable Meta Tokens for Remote Sensing Image Interpretation
IJCAI 2024
Predicting Carpark Availability in Singapore with Cross-Domain Data: A New Dataset and A Data-Driven Approach
IJCAI 2024
Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models
ACL 2024
<
1
…
9
10
11
…
24
>