How to simply route requests with the same request id to the same model instance? #7861

fighterhit · 2024-12-07T13:02:05Z

fighterhit
Dec 7, 2024

How can I simply route all inference requests with the same request id to the same model instance, and then execute inference using dynamic_batching? It sounds like this can be achieved by using a stateful model to change the request id to a sequence id, but it feels too complicated and requires additional control input, because I only need the ability to route to the same instance and dynamic batches. Is there an easy solution? Thanks!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

How to simply route requests with the same request id to the same model instance? #7861

Uh oh!

{{title}}

Uh oh!

Replies: 0 comments

Select a reply

Uh oh!

How to simply route requests with the same request id to the same model instance? #7861

Uh oh!

fighterhit Dec 7, 2024

Replies: 0 comments

fighterhit
Dec 7, 2024