Deployments
Inference endpoints and training jobs across all clusters.
| Deployment | ID | Model | Cluster | GPUs | Status | Req / hr | Cost / hr |
|---|---|---|---|---|---|---|---|
| Llama 3 70B Inference | dep_01 | Llama 3 70B Instruct | prod-us-east-4 | A100 SXM x8 | Running | 4,820 | $13.20 |
| Stable Diffusion XL | dep_02 | Stable Diffusion XL | prod-eu-west-1 | A100 PCIe x4 | Running | 1,240 | $4.80 |
| Code Llama 34B Fine-tune | dep_03 | Code Llama 34B | train-us-west-2 | H100 SXM x16 | Scheduled | 0 | $54.40 |
| Llama 3 8B (Internal) | dep_04 | Llama 3 8B Instruct | dev-us-east-1 | RTX 4090 x2 | Running | 920 | $1.10 |
| Embedding Service | dep_05 | BGE Large EN v1.5 | dev-us-east-1 | RTX 4090 x1 | Stopped | 0 | $0.55 |