TensorRT-LLM
llm_inference
Overview
Developed by: NVIDIA
License: Apache 2.0
Open source: Yes
Use case: High-performance inference for Large Language Models
Integrates with: Triton Inference Server, Hugging Face Transformers
Knowledge graph stats
Claims: 24
Avg confidence: 93%
Avg freshness: 99%
Last updated: 4 days ago
Trust distribution
100% unverified
Governance
Not assessed
TensorRT-LLM
product
Apache 2.0-licensed library from NVIDIA for optimized LLM inference on NVIDIA GPUs
primary use case
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| High-performance inference for Large Language Models | ○Unverified | High | Fresh | 1 |
| Large Language Model inference optimization | ○Unverified | High | Fresh | 1 |
| High-performance LLM inference optimization | ○Unverified | High | Fresh | 1 |
| optimized inference for large language models | ○Unverified | High | Fresh | 1 |
developed by
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| NVIDIA | ○Unverified | High | Fresh | 1 |
based on
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| TensorRT | ○Unverified | High | Fresh | 1 |
open source
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| true | ○Unverified | High | Fresh | 1 |
requires
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| CUDA | ○Unverified | High | Fresh | 1 |
| NVIDIA GPU | ○Unverified | High | Fresh | 1 |
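The two requirements listed above can be roughly checked before installation. The sketch below is a heuristic using only the Python standard library: it assumes that `nvidia-smi` on the PATH indicates an installed NVIDIA driver and that `nvcc` indicates a CUDA toolkit. The function name `check_prerequisites` is hypothetical and not part of TensorRT-LLM. A missing `nvcc` does not prove that CUDA runtime libraries are absent, since they may be bundled with Python packages.

```python
import importlib.util
import shutil


def check_prerequisites() -> dict:
    """Report which TensorRT-LLM prerequisites appear to be present (heuristic)."""
    return {
        # nvidia-smi ships with the NVIDIA driver, so finding it suggests a usable GPU driver.
        "nvidia_driver": shutil.which("nvidia-smi") is not None,
        # nvcc is the CUDA toolkit compiler; its absence is only a hint, not proof.
        "cuda_toolkit": shutil.which("nvcc") is not None,
        # find_spec returns None when the top-level package is not installed.
        "tensorrt_llm": importlib.util.find_spec("tensorrt_llm") is not None,
    }


if __name__ == "__main__":
    for name, found in check_prerequisites().items():
        print(f"{name}: {'found' if found else 'not found'}")
```

Running the script prints one line per prerequisite, which is enough to decide whether a machine is a plausible target before attempting an install.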
supports protocol
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| Python API | ○Unverified | High | Fresh | 1 |
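As an illustration of the Python API claim, the sketch below wraps the high-level `LLM` and `SamplingParams` classes that the `tensorrt_llm` package exposes. It assumes a recent release that includes this LLM API, plus a supported NVIDIA GPU. The wrapper name `generate_text`, the default model identifier, and the sampling values are placeholders chosen for illustration, not recommendations from the source.

```python
def generate_text(prompts, model="TinyLlama/TinyLlama-1.1B-Chat-v1.0", max_tokens=32):
    """Generate one completion per prompt with the TensorRT-LLM LLM API (sketch)."""
    # Imported lazily so this module still loads on machines without tensorrt_llm.
    from tensorrt_llm import LLM, SamplingParams

    # Assumption: `model` is a Hugging Face model ID or a local checkpoint path.
    llm = LLM(model=model)
    params = SamplingParams(temperature=0.8, top_p=0.95, max_tokens=max_tokens)

    # generate() returns one result per prompt; each holds a list of candidate outputs.
    results = llm.generate(prompts, params)
    return [result.outputs[0].text for result in results]
```

A call such as `generate_text(["Hello, my name is"])` would return a list with one generated string, provided the package, a GPU, and the model weights are available.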
maintained by
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| NVIDIA | ○Unverified | High | Fresh | 1 |
pricing model
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| free | ○Unverified | High | Fresh | 1 |
license type
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| Apache 2.0 | ○Unverified | High | Fresh | 1 |
| Apache License 2.0 | ○Unverified | High | Fresh | 1 |
integrates with
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| Triton Inference Server | ○Unverified | High | Fresh | 1 |
| Hugging Face Transformers | ○Unverified | High | Fresh | 1 |
supports model
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| BLOOM | ○Unverified | High | Fresh | 1 |
| GPT | ○Unverified | Moderate | Fresh | 1 |
| LLaMA | ○Unverified | Moderate | Fresh | 1 |
| BERT | ○Unverified | Moderate | Fresh | 1 |
api compatible with
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| Hugging Face Transformers | ○Unverified | High | Fresh | 1 |
alternative to
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| Text Generation Inference | ○Unverified | Moderate | Fresh | 1 |
| vLLM | ○Unverified | Moderate | Fresh | 1 |
competes with
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| vLLM | ○Unverified | Moderate | Fresh | 1 |