Skip to main content

Documentation Index

Fetch the complete documentation index at: https://runcrate.ai/docs/llms.txt

Use this file to discover all available pages before exploring further.

Runcrate dedicated clusters give you bare-metal GPU capacity reserved for your organization alone — single-tenant infrastructure with the latest NVIDIA GPUs, InfiniBand networking, and predictable monthly pricing.

No shared tenancy

Your cluster is yours alone. Dedicated hardware with full root access and no noisy neighbors.

No spot interruptions

Reserved capacity means your workloads run uninterrupted, 24/7, for the duration of your contract.

Latest NVIDIA GPUs

H100, H200, B200, and B300 GPUs with NVLink and InfiniBand interconnect.

Flexible scale

From 16-node clusters for focused training to 128+ node deployments for frontier-scale workloads.

Who are dedicated clusters for?

  • ML teams running large-scale distributed training jobs
  • AI companies that need guaranteed GPU capacity for production inference
  • Enterprises with compliance or data residency requirements
  • Research labs working on frontier models or large experiments
  • Inference Engine users who need a dedicated deployment of a model or workload
  • GPU Instance users who have outgrown on-demand and need reserved capacity

How it compares

On-demand instancesDedicated clusters
InfrastructureShared multi-tenantDedicated single-tenant
GPUsH100, A100, L40S, RTX 4090H100, H200, B200, B300
NetworkingStandardInfiniBand / NVLink
Scale1–8 GPUs16–128+ nodes
CommitmentNone12–24 months
PricingHourly on-demandFixed monthly rate
Setup60 seconds1–2 weeks
AccessSelf-serveContact sales

Get started

Contact sales

Tell us about your GPU requirements and we’ll put together a proposal within 24 hours.
Or email us directly at support@runcrate.ai with:
  • Number of GPUs / nodes needed
  • GPU type preference (H100, H200, B200, B300)
  • Preferred region or data center location
  • Contract duration (12 or 24 months)
  • Target start date