Currently, structured output (SO) via constrained decoding is supported only on large models (>80B), and those run at under 50 tokens/s. That is far too slow for agentic architectures built on SGR, and inference keeps getting slower. Please provide at least one fast, large, high-quality model with SO support.
In Review
💡 Feature request
19 days ago

Ivan Matveev