Deployments

Inference endpoints and training jobs across all clusters.

addNew deployment
P
Parthexpand_more
search
DeploymentModelClusterStatusReq / hrCost / hr

Llama 3 70B Inference

dep_01

Llama 3 70B Instruct

prod-us-east-4

A100 SXM x8

Running4,820$13.20View

Stable Diffusion XL

dep_02

Stable Diffusion XL

prod-eu-west-1

A100 PCIe x4

Running1,240$4.80View

Code Llama 34B Fine-tune

dep_03

Code Llama 34B

train-us-west-2

H100 SXM x16

Scheduled0$54.40View

Llama 3 8B (Internal)

dep_04

Llama 3 8B Instruct

dev-us-east-1

RTX 4090 x2

Running920$1.10View

Embedding Service

dep_05

BGE Large EN v1.5

dev-us-east-1

RTX 4090 x1

Stopped0$0.55View