DeepSpeed Inference
inference_framework
Overview
Developed byMicrosoft
Founded2020
LicenseApache License 2.0
Open source✓ Open Source
Use casehigh-performance inference for large language models
Integrates with
Also see
Alternative to
Knowledge graph stats
Claims14
Avg confidence91%
Avg freshness100%
Last updatedUpdated 2 days ago
Trust distribution
100% unverified
DeepSpeed Inference
product
Microsoft's optimized inference engine for large-scale language models with memory and speed optimizations
Compare with...pricing model
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| free | ○Unverified | High | Fresh | 1 |
integrates with
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| PyTorch | ○Unverified | High | Fresh | 1 |
| Transformers | ○Unverified | High | Fresh | 1 |
open source
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| true | ○Unverified | High | Fresh | 1 |
developed by
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| Microsoft | ○Unverified | High | Fresh | 1 |
maintained by
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| Microsoft | ○Unverified | High | Fresh | 1 |
requires
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| Python | ○Unverified | High | Fresh | 1 |
license type
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| Apache License 2.0 | ○Unverified | High | Fresh | 1 |
primary use case
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| high-performance inference for large language models | ○Unverified | High | Fresh | 1 |
supports model
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| GPT models | ○Unverified | High | Fresh | 1 |
| BERT models | ○Unverified | Moderate | Fresh | 1 |
founded year
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| 2020 | ○Unverified | Moderate | Fresh | 1 |
alternative to
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| TensorRT | ○Unverified | Moderate | Fresh | 1 |
| ONNX Runtime | ○Unverified | Moderate | Fresh | 1 |