DeepSparse

Sparsity-aware neural network inference engine for GPU-class performance on CPUs

DeepSparse is a CPU runtime that takes advantage of sparsity within neural networks to reduce compute. Read more about sparsification.
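
To make the idea of "sparsity reduces compute" concrete, here is a minimal sketch in plain Python. It is not DeepSparse's implementation; the function names (`dense_matvec`, `to_csr`, `sparse_matvec`) and the toy weight matrix are hypothetical and chosen only for illustration. It compares a dense matrix-vector product with one that stores and multiplies only the nonzero weights (compressed sparse row layout), counting the multiplications each performs.

```python
from typing import List, Tuple

Matrix = List[List[float]]
CSR = Tuple[List[float], List[int], List[int]]


def dense_matvec(weights: Matrix, x: List[float]) -> Tuple[List[float], int]:
    """Multiply every weight by its input, zero or not; return (y, multiply count)."""
    y, mults = [], 0
    for row in weights:
        acc = 0.0
        for w, xi in zip(row, x):
            acc += w * xi
            mults += 1
        y.append(acc)
    return y, mults


def to_csr(weights: Matrix) -> CSR:
    """Keep only nonzero weights: values, their column indices, and row offsets."""
    values, col_idx, row_ptr = [], [], [0]
    for row in weights:
        for j, w in enumerate(row):
            if w != 0.0:
                values.append(w)
                col_idx.append(j)
        row_ptr.append(len(values))
    return values, col_idx, row_ptr


def sparse_matvec(csr: CSR, x: List[float]) -> Tuple[List[float], int]:
    """Multiply only the stored nonzero weights; return (y, multiply count)."""
    values, col_idx, row_ptr = csr
    y, mults = [], 0
    for r in range(len(row_ptr) - 1):
        acc = 0.0
        for k in range(row_ptr[r], row_ptr[r + 1]):
            acc += values[k] * x[col_idx[k]]
            mults += 1
        y.append(acc)
    return y, mults


# Toy layer (illustrative only): 9 of the 12 weights are zero, i.e. 75% sparse.
weights = [
    [0.0, 2.0, 0.0, 0.0],
    [0.0, 0.0, 0.0, -1.0],
    [3.0, 0.0, 0.0, 0.0],
]
x = [1.0, 2.0, 3.0, 4.0]

dense_y, dense_mults = dense_matvec(weights, x)
sparse_y, sparse_mults = sparse_matvec(to_csr(weights), x)

print(dense_y, dense_mults)    # same output vector, 12 multiplications
print(sparse_y, sparse_mults)  # same output vector, 3 multiplications
```

Both paths produce the same result, but the sparse path performs one multiplication per nonzero weight rather than one per weight. A production runtime relies on far more than this (vectorized kernels, cache-aware scheduling), so treat the sketch as an illustration of the principle only.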

Neural Magic's DeepSparse integrates with popular deep learning libraries (e.g., Hugging Face, Ultralytics), so you can use it to load and deploy sparse models with ONNX. ONNX lets you serve your model in a framework-agnostic environment. Support includes PyTorch, TensorFlow, Keras, and many other frameworks.

Editions

DeepSparse is available in two editions:

DeepSparse Community