zai-org/GLM-4.7-Flash

text generation, transformers, safetensors, glm4_moe_lite, conversational, en, zh, mit
Runnable with vLLM
1.7M
DEPLOY IN 60 SECONDS

Run GLM-4.7-Flash on Runcrate

Deploy on H100, A100, or RTX GPUs. Pay only for what you use. No setup required.