cyankiwi/GLM-4.7-Flash-AWQ-4bit

text generationtransformersenzhtransformerssafetensorsglm4_moe_litetext-generationconversationalenmit
vLLMRunnable with vLLM
261.8K
DEPLOY IN 60 SECONDS

Run GLM-4.7-Flash-AWQ-4bit on Runcrate

Deploy on H100, A100, or RTX GPUs. Pay only for what you use. No setup required.