Run ComfyUI on a Cloud GPU

Run ComfyUI — the node-based workflow editor for Stable Diffusion and FLUX — on a dedicated cloud GPU. Generate images faster than your local machine, run batch jobs 24/7, and access high-VRAM GPUs for large models like FLUX.1 Pro and SDXL.

What you’ll build

A remote ComfyUI instance accessible from your browser, running on a dedicated GPU with all models and custom nodes pre-installed. You can connect from anywhere and generate images without taxing your local hardware.

GPU sizing

Model	Min VRAM	Recommended GPU
SDXL	12 GB	RTX 4090
FLUX.1 Schnell	16 GB	RTX 4090
FLUX.1 Dev / Pro	24 GB	RTX 4090, L40S
FLUX.2 Pro (4MP)	32 GB+	L40S, A100
Multiple models loaded	48 GB+	L40S, A100

Setup (CLI)

1. Deploy a GPU instance

# RTX 4090 for SDXL/FLUX.1, L40S for FLUX.2 or multiple models
runcrate instances create --name comfyui --gpu RTX4090

2. Install ComfyUI

runcrate ssh comfyui -- "
  cd /root &&
  git clone https://github.com/comfyanonymous/ComfyUI.git &&
  cd ComfyUI &&
  pip install -r requirements.txt
"

3. Download models

# FLUX.1 Schnell (fast, good quality)
runcrate ssh comfyui -- "
  cd /root/ComfyUI/models/checkpoints &&
  wget -q 'https://huggingface.co/black-forest-labs/FLUX.1-schnell/resolve/main/flux1-schnell.safetensors'
"

# SDXL base
runcrate ssh comfyui -- "
  cd /root/ComfyUI/models/checkpoints &&
  wget -q 'https://huggingface.co/stabilityai/stable-diffusion-xl-base-1.0/resolve/main/sd_xl_base_1.0.safetensors'
"

# CLIP and VAE for FLUX
runcrate ssh comfyui -- "
  cd /root/ComfyUI/models/clip &&
  wget -q 'https://huggingface.co/comfyanonymous/flux_text_encoders/resolve/main/t5xxl_fp8_e4m3fn.safetensors' &&
  wget -q 'https://huggingface.co/comfyanonymous/flux_text_encoders/resolve/main/clip_l.safetensors'
"

runcrate ssh comfyui -- "
  cd /root/ComfyUI/models/vae &&
  wget -q 'https://huggingface.co/black-forest-labs/FLUX.1-schnell/resolve/main/ae.safetensors'
"

4. Install popular custom nodes

runcrate ssh comfyui -- "
  cd /root/ComfyUI/custom_nodes &&
  git clone https://github.com/ltdrdata/ComfyUI-Manager.git &&
  git clone https://github.com/Fannovel16/comfyui_controlnet_aux.git &&
  git clone https://github.com/cubiq/ComfyUI_IPAdapter_plus.git &&
  cd comfyui_controlnet_aux && pip install -r requirements.txt
"

5. Start ComfyUI

runcrate ssh comfyui -- "
  cd /root/ComfyUI &&
  nohup python main.py --listen 0.0.0.0 --port 8188 > /root/comfyui.log 2>&1 &
"

6. Connect

# Get the IP
runcrate instances info comfyui

Open http://<INSTANCE_IP>:8188 in your browser. ComfyUI is now running on your cloud GPU.

Persistent setup with storage

Downloading models every time you deploy a new instance is slow. Use a storage volume to persist your models:

# Create a storage volume for models
runcrate volumes create --name comfyui-models --size 200 --region us-east-1

# Deploy instance with storage attached
runcrate instances create --name comfyui --gpu RTX4090 --storage comfyui-models

# Symlink model directories to the persistent volume
runcrate ssh comfyui -- "
  mkdir -p /workspace/checkpoints /workspace/clip /workspace/vae /workspace/controlnet &&
  ln -sf /workspace/checkpoints /root/ComfyUI/models/checkpoints &&
  ln -sf /workspace/clip /root/ComfyUI/models/clip &&
  ln -sf /workspace/vae /root/ComfyUI/models/vae &&
  ln -sf /workspace/controlnet /root/ComfyUI/models/controlnet
"
# Download models to /workspace/checkpoints/ (they persist across instances)

Next time you deploy with the same volume, the models are already there — just symlink and start ComfyUI.

Using MCP

“Deploy an RTX 4090 instance called comfyui. Clone the ComfyUI repo, install requirements, download the FLUX.1 Schnell checkpoint, and start the server on port 8188. Give me the URL when it’s ready.”

The agent runs through the full setup and returns the URL to open in your browser.

Tips

Save workflows as JSON — export from ComfyUI and store in your repo for reproducible generation
Use FP8 quantized models for FLUX.2 — 40% less VRAM, similar quality
Kill the instance when done — runcrate instances delete comfyui stops billing immediately
Use the API mode — ComfyUI has a REST API at /prompt for programmatic batch generation

​What you’ll build

​GPU sizing

​Setup (CLI)

​1. Deploy a GPU instance

​2. Install ComfyUI

​3. Download models

​4. Install popular custom nodes

​5. Start ComfyUI

​6. Connect

​Persistent setup with storage

​Using MCP

​Tips