TheBloke/Llama-2-70B-Chat-AWQ

text generationtransformersentransformerssafetensorsllamatext-generationfacebookmetallama2
vLLMRunnable with vLLM
86.5K
DEPLOY IN 60 SECONDS

Run Llama-2-70B-Chat-AWQ on Runcrate

Deploy on H100, A100, or RTX GPUs. Pay only for what you use. No setup required.