Building Cost-Efficient Enterprise RAG applications with Intel Gaudi 2 and Intel Xeon • May 9, 2024
An Efficient Sparse Inference Software Accelerator for Transformer-based Language Models on CPUs • Paper • 2306.16601 • Published Jun 28, 2023