Guides
Explore best practices and step-by-step tutorials for specific LLM use cases. Dive into the following guides to learn how to use LLMs to build your own applications.
📄️ Why is Sparsity Important for LLMs?
Large Language Models (LLMs) are often so large that they pose challenges for computational efficiency and memory usage. Weight sparsity is a technique that can significantly alleviate these issues, enhancing the practicality and scalability of LLMs. Here we outline the key benefits of weight sparsity in LLMs, focusing on three main aspects.
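As a quick, self-contained illustration of why sparsity helps (a toy NumPy/SciPy sketch, not DeepSparse's actual sparsity-aware kernels): once most weights in a layer are pruned to zero, the layer can be stored in a compressed format and a matrix-vector product only needs multiply-adds for the weights that survive.

```python
# Toy illustration of weight sparsity (NumPy/SciPy, not DeepSparse kernels):
# prune a dense layer to 75% sparsity, then compare storage and work.
import numpy as np
from scipy import sparse

rng = np.random.default_rng(0)
dense = rng.standard_normal((4096, 4096)).astype(np.float32)

# Magnitude pruning: zero out the 75% of weights with the smallest magnitude.
threshold = np.quantile(np.abs(dense), 0.75)
pruned = np.where(np.abs(dense) >= threshold, dense, 0.0)

# Compressed (CSR) storage keeps only the nonzero values plus their indices.
csr = sparse.csr_matrix(pruned)
sparse_bytes = csr.data.nbytes + csr.indices.nbytes + csr.indptr.nbytes
print(f"dense storage:  {dense.nbytes / 1e6:.1f} MB")
print(f"sparse storage: {sparse_bytes / 1e6:.1f} MB")

# A matrix-vector product against the pruned layer only multiplies the
# ~25% of weights that survived pruning.
x = rng.standard_normal(4096).astype(np.float32)
y = csr @ x
print(f"multiply-adds needed: {csr.nnz} of {dense.size}")
```

Real deployments go further than this sketch: dedicated formats avoid most of the per-element index overhead, and inference engines such as DeepSparse use sparsity-aware kernels rather than generic sparse-matrix libraries.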
📄️ Convert LLMs From Hugging Face
This guide is for people interested in exporting their Hugging Face-compatible LLMs to work in DeepSparse.
📄️ Compress LLMs With SparseGPT
This page describes how to perform one-shot quantization of large language models using SparseML. This workflow requires a GPU with at least 16GB of VRAM and 64GB of system RAM.
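To make the term concrete before diving into the guide, the sketch below shows the arithmetic behind post-training ("one-shot") INT8 weight quantization on a single tiny matrix. It is only a conceptual illustration, not the SparseML/SparseGPT API: SparseGPT-style methods additionally use calibration data and error correction to decide how to round, which is what the recipe in the guide handles.

```python
# Conceptual sketch of one-shot (post-training) INT8 weight quantization.
# Not the SparseML API; it only shows the per-channel scale/round/dequantize
# arithmetic that such workflows apply to each weight matrix.
import numpy as np

rng = np.random.default_rng(0)
w = rng.standard_normal((8, 16)).astype(np.float32)  # one tiny weight matrix

# Symmetric per-output-channel scales map float weights onto [-127, 127].
scales = np.abs(w).max(axis=1, keepdims=True) / 127.0
q = np.clip(np.round(w / scales), -127, 127).astype(np.int8)

# INT8 kernels use q and scales directly; dequantizing shows the error introduced.
w_hat = q.astype(np.float32) * scales
print("max abs quantization error:", np.abs(w - w_hat).max())
```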
📄️ Sparse Fine-Tuning LLMs on GSM8k
A guide to sparse fine-tuning the Llama2 7B model on the GSM8K dataset, including the steps, commands, and recipes used for optimization.
📄️ LLM Serving on Windows
This guide covers running a large language model (LLM) for text generation on Windows using Windows Subsystem for Linux (WSL) and DeepSparse Server.
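For orientation, here is a minimal sketch of what running a sparse LLM from inside a WSL shell can look like. It uses DeepSparse's Python text-generation pipeline rather than the HTTP server the guide sets up; the `TextGeneration` class, its arguments, and the output fields are assumptions based on recent DeepSparse releases, and the model stub is a placeholder to replace with a real SparseZoo stub or local deployment directory. DeepSparse itself requires Linux, which is why the guide routes through WSL on Windows.

```python
# Minimal sketch (assumed API) of text generation with DeepSparse inside WSL.
# The guide itself launches DeepSparse Server and sends HTTP requests instead.
from deepsparse import TextGeneration  # assumed import path

# Placeholder: substitute a real SparseZoo stub or a local deployment directory.
MODEL = "zoo:<sparse-llm-stub>"

pipeline = TextGeneration(model=MODEL)  # argument name assumed
output = pipeline(prompt="Write a haiku about sparsity.", max_new_tokens=64)
print(output.generations[0].text)  # output field names assumed
```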