Dedicated GPU instances with one-click JupyterLab, ComfyUI, vLLM serving, and web terminal templates. Billed per second, only while the instance runs, at rates from $0.65/hr.
Click a GPU to open its deploy page.
Billing is per second at the listed hourly rate, and only while the instance is running. The rate is locked in when you deploy, and stopping or destroying the instance stops the charge.
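As a rough illustration of per-second billing, the charge is the hourly rate multiplied by the seconds the instance ran, divided by 3600. The sketch below assumes the $0.65/hr entry rate and rounds to four decimal places; the rounding rule and the `instance_cost` helper are illustrative assumptions, not the platform's documented behavior.

```python
from decimal import Decimal, ROUND_HALF_UP


def instance_cost(hourly_rate: str, seconds_running: int) -> Decimal:
    """Estimate the charge for an instance billed per second.

    hourly_rate is the rate locked in at deploy time, in dollars per hour.
    Rounding to 4 decimal places is an assumption made for this sketch.
    """
    total = Decimal(hourly_rate) * seconds_running / Decimal(3600)
    return total.quantize(Decimal("0.0001"), rounding=ROUND_HALF_UP)


# 90 minutes at $0.65/hr
print(instance_cost("0.65", 90 * 60))  # prints 0.9750
```

A stopped or destroyed instance accrues no further seconds, so only the time it actually ran counts toward the estimate.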
One-click templates cover JupyterLab notebooks, ComfyUI, vLLM model serving (bring a Hugging Face model id), and a browser web terminal. You connect through the authenticated EmpirioLabs connect endpoint or call the workload through /v1/gpu/connect/{instance_id}/{path} on the API.
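A minimal sketch of building a call to a workload through the connect route might look like the following. Only the `/v1/gpu/connect/{instance_id}/{path}` route and the `api.empiriolabs.ai` host come from this page; the `https` scheme, the Bearer-token header, the instance id `inst_123`, and the helper names are assumptions for illustration, so check the GPU Cloud docs for the actual authentication scheme.

```python
import json
import urllib.request
from urllib.parse import quote

API_BASE = "https://api.empiriolabs.ai"  # host from this page; https is assumed


def connect_url(instance_id: str, path: str) -> str:
    """Build the /v1/gpu/connect/{instance_id}/{path} URL for a running instance."""
    safe_id = quote(instance_id, safe="")
    safe_path = quote(path.lstrip("/"))  # keeps "/" separators inside the path
    return f"{API_BASE}/v1/gpu/connect/{safe_id}/{safe_path}"


def build_request(instance_id: str, path: str, api_key: str, payload=None):
    """Prepare (but do not send) a request to the workload.

    The Authorization header format is an assumption, not a documented scheme.
    """
    data = json.dumps(payload).encode("utf-8") if payload is not None else None
    headers = {"Authorization": f"Bearer {api_key}"}
    if data is not None:
        headers["Content-Type"] = "application/json"
    method = "POST" if data is not None else "GET"
    return urllib.request.Request(
        connect_url(instance_id, path), data=data, headers=headers, method=method
    )


# Hypothetical instance id; "v1/models" is the model-listing route that vLLM's
# OpenAI-compatible server exposes, assumed here to be reachable via connect.
req = build_request("inst_123", "v1/models", api_key="YOUR_API_KEY")
print(req.full_url)
```

Passing the prepared request to `urllib.request.urlopen(req)` would send it; the sketch stops short of that so it runs without a live instance.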
The API covers everything the dashboard does: deploy, stop, and destroy instances under /v1/gpu on api.empiriolabs.ai, and reach the running workload through the connect endpoint. The full reference is in the GPU Cloud docs.
Runtime storage targets range from 100 to 300 GB with a 150 GB default, bundled into the displayed hourly price.
Create an EmpirioLabs account, open GPU Cloud in the dashboard, pick a GPU and template, and deploy. Usage is charged against pay-as-you-go credits.
Check out our pricing or reach out if you want your own model deployed on our stack.