If last API message is assistant, it should be continued.

I asked this before, but maybe I chose the wrong tag, and maybe I overcomplicated the request, but here I am presenting it in a much simpler way. This would be the last I ever mention this, so rest assured! :)


I’m asking for the feature where if the last message in the API payload is of type assistant, it should be continued as if it’s mid-generation; no different assistant reply, no newlines before generation, nothing in-between at all! Just natural completions of an incomplete assistant message because it was the last in the payload. That’s it.


Continuation of last message is a very important AI Chat Completions feature that’s present in almost all most major API providers, and we need it in Nebius too. It’s not a “nice-to-have” but could be the deciding factor for whether Nebius is appropriate for a product or not and not having it forces developers to ‘hack’ around it in ways that don’t take advantage of Nebius’s context caching.

Thank you.

Reference:

https://docs.anthropic.com/en/docs/build-with-claude/prompt-engineering/prefill-claudes-response

Please authenticate to join the conversation.

Upvoters
Status

In Review

Board

🖋️ Nebius AI Studio

Date

11 months ago

Author

L

Subscribe to post

Get notified by email when there are changes.