Efficient Computing

596 directly classified papers

Papers

JumpCoder: Go Beyond Autoregressive Coder via Online Modification ACL 2024

Carbon Footprint Reduction for Sustainable Data Centers in Real-Time AAAI 2024

CQIL: Inference Latency Optimization with Concurrent Computation of Quasi-Independent Layers ACL 2024

LLM in a flash: Efficient Large Language Model Inference with Limited Memory ACL 2024

Attention-Driven Training-Free Efficiency Enhancement of Diffusion Models CVPR 2024

AdaSwitch: Adaptive Switching between Small and Large Agents for Effective Cloud-Local Collaborative Learning EMNLP 2024

QKFormer: Hierarchical Spiking Transformer using Q-K Attention NIPS 2024

SnapKV: LLM Knows What You are Looking for Before Generation NIPS 2024

SIRIUS: Contextual Sparsity with Correction for Efficient LLMs NIPS 2024

Provable Tempered Overfitting of Minimal Nets and Typical Nets NIPS 2024

FasterDiT: Towards Faster Diffusion Transformers Training without Architecture Modification NIPS 2024

LORS: Low-rank Residual Structure for Parameter-Efficient Network Stacking CVPR 2024

BOLD: Boolean Logic Deep Learning NIPS 2024

Twists, Humps, and Pebbles: Multilingual Speech Recognition Models Exhibit Gender Performance Gaps EMNLP 2024

Towards Fast Multilingual LLM Inference: Speculative Decoding and Specialized Drafters EMNLP 2024

Towards Accurate Post-training Quantization for Diffusion Models CVPR 2024

TC-LIF: A Two-Compartment Spiking Neuron Model for Long-Term Sequential Modelling AAAI 2024

Colour Passing Revisited: Lifted Model Construction with Commutative Factors AAAI 2024

EAGLE-2: Faster Inference of Language Models with Dynamic Draft Trees EMNLP 2024

MoPE-CLIP: Structured Pruning for Efficient Vision-Language Models with Module-wise Pruning Error Metric CVPR 2024

Optimized Speculative Sampling for GPU Hardware Accelerators EMNLP 2024

Ladder: Enabling Efficient Low-Precision Deep Learning Computing through Hardware-aware Tensor Transformation OSDI 2024

LeMeViT: Efficient Vision Transformer with Learnable Meta Tokens for Remote Sensing Image Interpretation IJCAI 2024

Predicting Carpark Availability in Singapore with Cross-Domain Data: A New Dataset and A Data-Driven Approach IJCAI 2024

Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models ACL 2024