I asked this before, but maybe I chose the wrong tag, and maybe I overcomplicated the request, but here I am presenting it in a much simpler way. This would be the last I ever mention this, so rest assured! :)
I’m asking for the feature where if the last message in the API payload is of type assistant, it should be continued as if it’s mid-generation; no different assistant reply, no newlines before generation, nothing in-between at all! Just natural completions of an incomplete assistant message because it was the last in the payload. That’s it.
Continuation of last message is a very important AI Chat Completions feature that’s present in almost all most major API providers, and we need it in Nebius too. It’s not a “nice-to-have” but could be the deciding factor for whether Nebius is appropriate for a product or not and not having it forces developers to ‘hack’ around it in ways that don’t take advantage of Nebius’s context caching.
Thank you.
Reference:
https://docs.anthropic.com/en/docs/build-with-claude/prompt-engineering/prefill-claudes-response
Please authenticate to join the conversation.
In Review
🖋️ Nebius AI Studio
11 months ago

L
Get notified by email when there are changes.
In Review
🖋️ Nebius AI Studio
11 months ago

L
Get notified by email when there are changes.