Rent cloud GPUs by the hour

Dedicated GPU instances with one-click JupyterLab, ComfyUI, vLLM serving, and web terminal templates. Billed per second, only while the instance runs, at rates from $0.65/hr.

Cloud GPU pricing per hour

Billing runs per second at the listed hourly rate, only while the instance is running. Click a GPU to open its deploy page.

GPU
Specs
Availability
Price
24 GB VRAM
0 available
$0.65/hr
32 GB VRAM
0 available
$0.90/hr
48 GB VRAM · 1, 2, 4x configs
38 available
$0.65/hr
48 GB VRAM · 1, 2, 4x configs
25 available
$1.50/hr
96 GB VRAM · 1, 2, 4, 8x configs
19 available
$4.00/hr
24 GB VRAM
0 available
$0.69/hr
48 GB VRAM · 1, 2, 4, 8x configs
64 available
$1.50/hr
48 GB VRAM · 1, 2, 4x configs
10 available
$1.60/hr
48 GB VRAM
0 available
$1.49/hr
40 GB VRAM
0 available
$1.99/hr
80 GB VRAM · 1, 2, 4, 8x configs
29 available
$1.70/hr
80 GB VRAM · 1, 2, 4, 8x configs
64 available
$2.90/hr
94 GB VRAM · 1, 2, 4x configs
7 available
$5.60/hr
141 GB VRAM · 1, 2x configs
3 available
$5.50/hr
180 GB VRAM
0 available
$6.99/hr
Deploy a GPU instance

How GPU Cloud works

1. Pick a GPU
Choose a card and a runtime storage target from the live catalog.
2. Pick a template
JupyterLab, ComfyUI, a vLLM model server, or a browser web terminal, ready in minutes.
3. Connect
Open the workload in the browser or call it through the authenticated EmpirioLabs connect endpoint.

GPU Cloud: common questions

How is GPU Cloud billed?

Billing is per second at the listed hourly rate, and only while the instance is running. The rate is locked in when you deploy, and stopping or destroying the instance stops the charge.

What can I run on a GPU instance?

One-click templates cover JupyterLab notebooks, ComfyUI, vLLM model serving (bring a Hugging Face model id), and a browser web terminal. You connect through the authenticated EmpirioLabs connect endpoint or call the workload through /v1/gpu/connect/{instance_id}/{path} on the API.

Can I manage GPU Cloud through the API?

Yes. Everything the dashboard does is also available through the API: deploy, stop, and destroy instances under /v1/gpu on api.empiriolabs.ai, and reach the running workload through the connect endpoint. The full reference is in the GPU Cloud docs.

How much storage do instances include?

Runtime storage targets range from 100 to 300 GB with a 150 GB default, bundled into the displayed hourly price.

How do I get started?

Create an EmpirioLabs account, open GPU Cloud in the dashboard, pick a GPU and template, and deploy. Billing is pay-as-you-go credits.

Ready to use better endpoints?

Check out our pricing or reach out if you want your own model deployed on our stack.