4 docs tagged with "sparsification"

Installing SparseML

Install SparseML, Neural Magic's toolkit for optimizing deep learning models through state-of-the-art sparsification techniques.

Optimizing LLMs

Optimize large language models (LLMs) for efficient inference using one-shot pruning and quantization. Learn how to improve model performance and reduce costs without sacrificing accuracy.

Sparsification: Compressing Neural Networks

A comprehensive overview of sparsification techniques used to create smaller, faster, and more energy-efficient neural networks while maintaining accuracy.

What is Neural Magic?

Neural Magic empowers you to optimize and deploy deep learning models on CPUs with GPU-class performance. Unlock efficiency, accessibility, and cost savings with our software solutions.