nvidia/Llama-4-Scout-17B-16E-Instruct-FP8

Model OptimizerModel Optimizersafetensorsllama4nvidiamodeloptquantizedother
260.4K

No model card available.

DEPLOY IN 60 SECONDS

Run Llama-4-Scout-17B-16E-Instruct-FP8 on Runcrate

Deploy on H100, A100, or RTX GPUs. Pay only for what you use. No setup required.