QuantTrio/MiniMax-M2-AWQ

text generation, transformers, safetensors, mixtral, text-generation, vLLM, AWQ, apache-2.0
Runnable with vLLM
400.5K
DEPLOY IN 60 SECONDS

Run MiniMax-M2-AWQ on Runcrate

Deploy on H100, A100, or RTX GPUs. Pay only for what you use. No setup required.