Skip to main content

4 docs tagged with "sparsification"

View All Tags

Installing SparseML

Install SparseML, Neural Magic's toolkit for optimizing deep learning models through state-of-the-art sparsification techniques.

Optimizing LLMs

Optimize large language models (LLMs) for efficient inference using one-shot pruning and quantization. Learn how to improve model performance and reduce costs without sacrificing accuracy.

What is Neural Magic?

Neural Magic empowers you to optimize and deploy deep learning models on CPUs with GPU-class performance. Unlock efficiency, accessibility, and cost savings with our software solutions.