2 docs tagged with "deepsparse"

Deploying LLMs

Deploy large language models (LLMs) for text generation using Neural Magic's DeepSparse. This doc includes code examples, performance benchmarking, and server setup.

Installing DeepSparse

Install DeepSparse, Neural Magic's high-performance inference engine, for optimized deep learning model deployment on CPUs.