GPU Cloud: Rent Cloud GPUs by the Hour from $0.65/hr

Cloud GPU pricing per hour

Billing runs per second at the listed hourly rate, only while the instance is running. Click a GPU to open its deploy page.

GPU

Specs

Availability

Price

RTX 4090

24 GB VRAM

0 available

$0.65/hr

RTX 5090

32 GB VRAM

0 available

$0.90/hr

RTX A6000

48 GB VRAM · 1, 2, 4x configs

38 available

$0.65/hr

RTX 6000 Ada

48 GB VRAM · 1, 2, 4x configs

25 available

$1.50/hr

RTX PRO 6000 Blackwell

96 GB VRAM · 1, 2, 4, 8x configs

19 available

$4.00/hr

24 GB VRAM

0 available

$0.69/hr

L40

48 GB VRAM · 1, 2, 4, 8x configs

64 available

$1.50/hr

L40S

48 GB VRAM · 1, 2, 4x configs

10 available

$1.60/hr

A40

48 GB VRAM

0 available

$1.49/hr

A100 40GB

40 GB VRAM

0 available

$1.99/hr

A100 80GB

80 GB VRAM · 1, 2, 4, 8x configs

29 available

$1.70/hr

H100 80GB

80 GB VRAM · 1, 2, 4, 8x configs

64 available

$2.90/hr

H100 NVL

94 GB VRAM · 1, 2, 4x configs

7 available

$5.60/hr

H200

141 GB VRAM · 1, 2x configs

3 available

$5.50/hr

B200

180 GB VRAM

0 available

$6.99/hr

Deploy a GPU instance

How GPU Cloud works

1. Pick a GPU: Choose a card and a runtime storage target from the live catalog.
2. Pick a template: JupyterLab, ComfyUI, a vLLM model server, or a browser web terminal, ready in minutes.
3. Connect: Open the workload in the browser or call it through the authenticated EmpirioLabs connect endpoint.

GPU Cloud: common questions

How is GPU Cloud billed?

Billing is per second at the listed hourly rate, and only while the instance is running. The rate is locked in when you deploy, and stopping or destroying the instance stops the charge.

What can I run on a GPU instance?

One-click templates cover JupyterLab notebooks, ComfyUI, vLLM model serving (bring a Hugging Face model id), and a browser web terminal. You connect through the authenticated EmpirioLabs connect endpoint or call the workload through /v1/gpu/connect/{instance_id}/{path} on the API.

Can I manage GPU Cloud through the API?

Yes. Everything the dashboard does is also available through the API: deploy, stop, and destroy instances under /v1/gpu on api.empiriolabs.ai, and reach the running workload through the connect endpoint. The full reference is in the GPU Cloud docs.

How much storage do instances include?

Runtime storage targets range from 100 to 300 GB with a 150 GB default, bundled into the displayed hourly price.

How do I get started?

Create an EmpirioLabs account, open GPU Cloud in the dashboard, pick a GPU and template, and deploy. Billing is pay-as-you-go credits.

Rent cloud GPUs by the hour

Cloud GPU pricing per hour

How GPU Cloud works

GPU Cloud: common questions

How is GPU Cloud billed?

What can I run on a GPU instance?

Can I manage GPU Cloud through the API?

How much storage do instances include?

How do I get started?

Ready to use better endpoints?

Rent cloud GPUs by the hour

Cloud GPU pricing per hour

How GPU Cloud works

GPU Cloud: common questions

How is GPU Cloud billed?

What can I run on a GPU instance?

Can I manage GPU Cloud through the API?

How much storage do instances include?

How do I get started?

More from EmpirioLabs

Model APIs

Hosted Agents

Pricing

Ready to use better endpoints?