CPU-only LLM Inference

In this article, we’ll put our second-hand AMD Threadripper 1950X through some inference tests 🔥 - can you already smell overheated plastic? No? That’s because "be quiet!" is at the heart of our QuietBee 🐝😎
