DeepSpeed-MII
LLM Inference Framework
Overview
Developed byMicrosoft
LicenseApache 2.0
Open source✓ Open Source
Use caseLLM inference optimization and serving
Technical
Protocols
Integrates with
Also see
Based onDeepSpeed
Knowledge graph stats
Claims16
Avg confidence92%
Avg freshness100%
Last updatedUpdated yesterday
Trust distribution
100% unverified
Governance
Not assessed
DeepSpeed-MII
product
Microsoft's model inference library providing high-throughput and low-latency serving for transformer models.
Compare with...open source
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| true | ○Unverified | High | Fresh | 1 |
pricing model
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| free | ○Unverified | High | Fresh | 1 |
maintained by
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| Microsoft | ○Unverified | High | Fresh | 1 |
developed by
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| Microsoft | ○Unverified | High | Fresh | 1 |
based on
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| DeepSpeed | ○Unverified | High | Fresh | 1 |
primary use case
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| LLM inference optimization and serving | ○Unverified | High | Fresh | 1 |
| High-performance inference for large language models | ○Unverified | High | Fresh | 1 |
license type
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| Apache 2.0 | ○Unverified | High | Fresh | 1 |
| Apache License 2.0 | ○Unverified | High | Fresh | 1 |
requires
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| PyTorch | ○Unverified | High | Fresh | 1 |
integrates with
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| Hugging Face Transformers | ○Unverified | Moderate | Fresh | 1 |
supports model
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| GPT models | ○Unverified | Moderate | Fresh | 1 |
| BERT models | ○Unverified | Moderate | Fresh | 1 |
supports protocol
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| gRPC | ○Unverified | Moderate | Fresh | 1 |