Walk through a full training workflow without leaving your editor. Your AI agent handles GPU provisioning, file transfers, training execution, and cleanup.Documentation Index
Fetch the complete documentation index at: https://runcrate.ai/docs/llms.txt
Use this file to discover all available pages before exploring further.
”Deploy an A100, upload my dataset, and start training.”
The agent orchestrates the entire setup:list_gpu_types— confirms A100 80GB availability and pricing ($1.60/hr)create_instance— deploys an instance namedfine-tune-runwith an A100instance_status— polls every 10 seconds until the instance isrunningfile_upload— uploadstrain.pyto/workspace/train.pyfile_upload— uploadsdataset.jsonlto/workspace/dataset.jsonlssh_execute— runspip install torch transformers datasets acceleratessh_execute— runsnohup python /workspace/train.py > /root/train.log 2>&1 &
Deployed fine-tune-run (1x A100 80GB, $1.60/hr). Uploadedtrain.pyanddataset.jsonl. Training started — check progress with “how’s the training going?"
"How’s the training going? Show me GPU usage and the last few log lines.”
The agent checks two things in parallel:ssh_execute— runsnvidia-smi --query-gpu=utilization.gpu,memory.used,memory.total --format=csv,noheaderssh_execute— runstail -20 /root/train.log
GPU utilization: 97% | Memory: 68,420 / 81,920 MiB (83%)Loss is decreasing steadily. At the current rate, training will finish in ~2.5 hours.
”Training is done. Download the model checkpoint and tear down the instance.”
The agent wraps up the workflow:ssh_execute— runsls -lh /workspace/output/to confirm checkpoint filesfile_download— downloads/workspace/output/adapter_model.safetensorsfile_download— downloads/workspace/output/adapter_config.jsonfile_download— downloads/workspace/output/training_args.jsonbilling_usage— checks total spend for this sessiondelete_instance— terminatesfine-tune-run
Downloaded 3 files (adapter weights, config, training args). Total cost for this training run: $6.40 (4 hours on A100). Instance fine-tune-run terminated. Billing stopped.
Tools used in this workflow
| Tool | Purpose |
|---|---|
list_gpu_types | Check GPU availability and pricing |
create_instance | Provision the training machine |
instance_status | Wait for deployment to complete |
file_upload | Transfer training code and data |
ssh_execute | Install dependencies, start training, check logs |
file_download | Retrieve trained model artifacts |
billing_usage | Verify session cost |
delete_instance | Clean up and stop billing |