Co-occurring keywords
Papers
xCOMET-lite: Bridging the Gap Between Efficiency and Quality in Learned MT Evaluation Metrics
EMNLP 2024
TinyAgent: Function Calling at the Edge
EMNLP 2024
LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit
EMNLP 2024
The Inhibitor: ReLU and Addition-Based Attention for Efficient Transformers (Student Abstract)
AAAI 2024
Nearest is Not Dearest: Towards Practical Defense against Quantization-conditioned Backdoor Attacks
CVPR 2024
ATQ: Activation Transformation forWeight-Activation Quantization of Large Language Models
EMNLP 2024
Exploiting LLM Quantization
NIPS 2024
Accuracy is Not All You Need
NIPS 2024