What can I request through Lilac Dedicated?

Lilac Dedicated supports requests for dedicated GPU virtual machines, bare-metal nodes, and multi-node clusters. Bare-metal and cluster requests may start from an eight-GPU node where appropriate, with the final configuration qualified by the Lilac team.

How does Lilac Serverless work?

Lilac Serverless provides OpenAI-compatible, per-token inference for supported models without requiring a GPU reservation.

How does Lilac Flex work?

Eligible customers enable Lilac Flex once on a GPU reservation. After opt-in, Lilac automatically makes eligible idle windows available to approved spot workloads, with no manual listings or day-to-day management. When demand is matched, the customer can earn bill credits under the applicable agreement. Matching and credits are not guaranteed.

What is the Capacity Exchange?

The Capacity Exchange lets customers relist eligible Lilac reservations or take over an approved Lilac commitment from another verified Lilac customer. Any transfer is subject to demand, market pricing, customer verification, credit approval, provider approval, and applicable terms.

Backed by Y Combinator

The AI cloud built for
fast-moving startups.

Reserve GPUs, exchange Lilac contracts, or run inference.

View GPU pricing Get started

Trusted by

Z.ai Y Combinator Osmosis Understudy Labs Saturn Cloud Eden AI NanoGPT Linum Infron Coral Bricks Floot

Why Lilac

Dedicated GPUs. Flexibility built in.

BuyCapacity from Lilac

Reserve GPU VMs, dedicated nodes, or multi-node clusters from Lilac. We provide the capacity, access, and support.

MonetizeUnused reservation windows

Enable Lilac Flex once and Lilac automatically matches eligible idle windows to approved spot workloads. When demand is matched, the resulting credits reduce your GPU bill.

ResellEligible Lilac commitments

Relist an eligible Lilac reservation to another verified Lilac customer. Transfers depend on demand, pricing, and approval.

Products

One GPU Network.
Four ways to use Compute.

Dedicated clusters

Dedicated GPU infrastructure, directly from Lilac

Compare indicative pricing across GPU VMs, bare-metal nodes, and multi-node clusters. Lilac handles capacity, access, and support.

H100 to B300Explore dedicated GPUs

Reserve H100s

Indicative pricing. Final rates depend on configuration and term.

*Estimate pricing

Serverless

Frontier models, simple per-token pricing

Use supported open models through an OpenAI-compatible API. No GPU reservation, no minimums, and no infrastructure to manage.

Live catalogExplore models

Loading the live model catalog…

Subscriptions

Monthly inference credits that stretch further

Choose a flat monthly credit pool for supported models. Lower-cost capacity lets a predictable subscription cover more product usage.

Up to 12x valueGet started

Basic$10/mo$80 value Pro$30/mo$300 value Max$100/mo$1,200 value

Batch

Run GPU jobs to completion

Submit a container image and a command. Lilac runs it on available GPU capacity with simple per-second pricing and no cluster to manage.

Private betaJoin private beta

Batch jobsPrivate beta

eval-suite-nightlyrunning

H100 $1.00/hr · 2.82 hrs · ~$2.82

train-baselinerunning

H200 $1.50/hr · 1.14 hrs · ~$1.71

data-processdone

H100 $1.00/hr · 0.46 hrs · ~$0.46

Commitment lifecycle

Commit without getting trapped.

Reserve what you need now. Enable Lilac Flex to automatically monetize eligible idle windows with spot workloads, or transfer an approved Lilac commitment as plans change.

Reserve

Choose the capacity you need now.

Select the GPU, region, scale, and term. The final quote names the site, network, delivery date, and commercial terms.

Lilac Flex

Automatically monetize eligible downtime.

Enable Flex once and Lilac automatically matches eligible idle windows to approved spot workloads. When demand is matched, bill credits lower your effective GPU cost. No manual listings are needed.

Explore Lilac Flex

Transfer