Version: 1.7.0

LLMs - Causal Language Modeling

This section provides comprehensive guidance and resources to help you excel in various Large Language Model (LLM) tasks using Neural Magic's powerful suite of tools.

Deployment

Learn how to seamlessly deploy sparsified LLMs for accelerated text generation and reduced resource demands.

📄️ Serving LLMs

DeepSparse is a CPU inference runtime that takes advantage of sparsity to accelerate neural network inference. Coupled with SparseML, our optimization library for pruning and quantizing your models, DeepSparse delivers exceptional inference performance on CPU hardware.

Optimizing LLMs

Discover state-of-the-art techniques to significantly reduce the footprint and increase the inference speed of your LLMs.

Optimizing LLMs with SparseML

Discover how to optimize LLMs using SparseML, Neural Magic's open-source library for model optimization.

Data Formats

Explore the most common data formats for LLMs, including text, JSONL, and more.

SparseML Data Formats

Learn about the most common data formats for LLMs, including text, JSONL, and more.

Presparsified Models

Discover a selection of presparsified LLMs to help you get started with your text generation tasks.

📄️ Sparse LLMs

Discover and utilize optimized LLM models from SparseZoo and Hugging Face Hub for efficient DeepSparse deployment.

Guides

Explore best practices and step-by-step tutorials for specific LLM use cases.

📄️ Guides

Explore best practices and step-by-step tutorials for specific LLM use cases.

LLMs - Causal Language Modeling

Deployment

📄️ Serving LLMs

Optimizing LLMs

Optimizing LLMs with SparseML

Data Formats

SparseML Data Formats

Presparsified Models

📄️ Sparse LLMs

Guides

📄️ Guides

Content

Actions

Support

Issues

LLMs - Causal Language Modeling

Deployment​

📄️ Serving LLMs

Optimizing LLMs​

Optimizing LLMs with SparseML

Data Formats​

SparseML Data Formats

Presparsified Models​

📄️ Sparse LLMs

Guides​

📄️ Guides

Content

Actions

Support

Issues

Deployment

Optimizing LLMs

Data Formats

Presparsified Models

Guides