Triton Inference Server Feature Requests

Timeout threshold in dynamic and sequence batcher

Add a per-model timeout threshold to the dynamic and/or sequence batcher. A request would be held in the queue until its wait time reaches the threshold, and would then be rejected.
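
To make the requested behavior concrete, here is a minimal Python sketch of a queue that rejects requests once they have waited as long as the threshold. It is only an illustration of the idea, not Triton code: the class `TimeoutQueue`, its methods, and the `timeout_s` parameter are hypothetical names made up for this example.

```python
import time
from collections import deque


class TimeoutQueue:
    """Hypothetical per-model request queue with a timeout threshold."""

    def __init__(self, timeout_s, clock=time.monotonic):
        self.timeout_s = timeout_s      # assumed per-model threshold, in seconds
        self._clock = clock             # injectable clock, handy for testing
        self._pending = deque()         # entries are (enqueue_time, request)

    def enqueue(self, request):
        # Record when the request entered the queue.
        self._pending.append((self._clock(), request))

    def take_batch(self, max_batch_size):
        """Return (batch, rejected) from the front of the queue.

        Requests whose wait time has reached the threshold are rejected
        instead of batched. Scanning stops once the batch is full, so
        later entries are only checked on the next call.
        """
        now = self._clock()
        batch, rejected = [], []
        while self._pending and len(batch) < max_batch_size:
            enqueued_at, request = self._pending.popleft()
            if now - enqueued_at >= self.timeout_s:
                rejected.append(request)
            else:
                batch.append(request)
        return batch, rejected
```

With a 1 second threshold, a request that has waited 1.2 seconds ends up in `rejected`, while one that has waited 0.6 seconds is still batched.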

  • Guest
  • Feb 4 2020
  • Shipped