Mixture-of-Experts at frontier scale
397B parameters split across specialized experts, with a router that activates just 17B per token. You get the capacity of a massive dense model at a fraction of the compute cost.
An open, MIT-licensed mixture-of-experts model from Rio AI Labs. 397 billion parameters, 17 billion active per token, and a one-million-token context window built for long-context reasoning, coding, and multilingual work.
MIT LICENSE · MIXTURE-OF-EXPERTS · 1M CONTEXT
Each token activates a sparse subset of experts. Only 17B of 397B parameters fire per token, so inference stays light while capacity stays large.
WHY RIO-3.5-OPEN
397B parameters split across specialized experts, with a router that activates just 17B per token. You get the capacity of a massive dense model at a fraction of the compute cost.
A 1,010,000-token window holds entire codebases, document sets, and long conversations in memory, so the model reasons over the whole problem instead of a fragment of it.
Built on the Qwen3.5-397B-A17B base with SwiReasoning, the model switches reasoning depth on demand to stay accurate on hard problems without wasting tokens on easy ones.
Sparse activation and adaptive reasoning keep latency and cost low. Run long-context workloads in production without paying for parameters every token doesn't need.
MODEL CARD
CAPABILITIES
TRAINED BY RIO AI LABS
Rio AI Labs is the applied AI group of the Prefeitura do Rio de Janeiro. Rio-3.5-Open-397B is its open release: a frontier-scale mixture-of-experts model trained for efficient, long-context reasoning and shipped under a permissive MIT license so anyone can run, fine-tune, and build on it.
Weights, the model card, and usage details live on Hugging Face. MIT licensed and ready to deploy.
Open the repositoryhuggingface.co/prefeitura-rio/Rio-3.5-Open-397B