Papers
Sign Value Constraint Decomposition for Efficient 1-Bit Quantization of Speech Translation Tasks
INTERSPEECH 2024
Prefixing Attention Sinks can Mitigate Activation Outliers for Large Language Model Quantization
EMNLP 2024
xCOMET-lite: Bridging the Gap Between Efficiency and Quality in Learned MT Evaluation Metrics
EMNLP 2024
The Inhibitor: ReLU and Addition-Based Attention for Efficient Transformers (Student Abstract)
AAAI 2024
One-pass Multiple Conformer and Foundation Speech Systems Compression and Quantization Using An All-in-one Neural Model
INTERSPEECH 2024
Fed-QSSL: A Framework for Personalized Federated Learning under Bitwidth and Data Heterogeneity
AAAI 2024
TinyAgent: Function Calling at the Edge
EMNLP 2024