RunPod
•Created by Rayboy on 8/12/2024 in #⚡|serverless
Using the vLLM RunPod worker image and the OpenAI endpoints, how can I get the executionTime?
The standard endpoint returns executionTime along with a job ID that I can pass to /status.
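As a rough illustration of the standard-endpoint behavior described here, the sketch below pulls the timing fields out of a job response and builds the matching /status URL. It assumes the response carries `id`, `status`, `delayTime`, and `executionTime` (in milliseconds, as far as I know); the helper names `status_url` and `extract_timing` and all sample values are hypothetical, not taken from the original post.

```python
from typing import Any, Dict, Optional

# Base URL of the RunPod serverless API (assumed; adjust if yours differs).
RUNPOD_API_BASE = "https://api.runpod.ai/v2"


def status_url(endpoint_id: str, job_id: str) -> str:
    """Build the /status URL for a job on a given endpoint."""
    return f"{RUNPOD_API_BASE}/{endpoint_id}/status/{job_id}"


def extract_timing(job: Dict[str, Any]) -> Dict[str, Optional[int]]:
    """Return the timing fields from a job response, or None where absent."""
    return {
        "delay_ms": job.get("delayTime"),
        "execution_ms": job.get("executionTime"),
    }


# Made-up sample shaped like a completed standard-endpoint job.
sample_job = {
    "id": "sync-0000-example",
    "status": "COMPLETED",
    "delayTime": 120,
    "executionTime": 2297,
    "output": [],
}

print(status_url("my-endpoint-id", sample_job["id"]))
print(extract_timing(sample_job))
```

A response without these fields, such as an OpenAI-style completion, simply yields `None` for both values.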
The OpenAI API endpoints unfortunately do not provide this. They return only token usage and a "chat-" ID that I might be able to do something with, but I cannot find any documentation on it.
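One possible stopgap, sketched here under assumptions, is to time the request on the client side. This measures the full round trip (network, queue delay, and execution), so it is not equivalent to the worker's executionTime. `timed_call` and `fake_chat_completion` are hypothetical names; the latter only stands in for a real call to the OpenAI-compatible endpoint.

```python
import time
from typing import Any, Callable, Tuple


def timed_call(fn: Callable[..., Any], *args: Any, **kwargs: Any) -> Tuple[Any, float]:
    """Call fn and return (result, elapsed wall-clock time in milliseconds)."""
    start = time.perf_counter()
    result = fn(*args, **kwargs)
    elapsed_ms = (time.perf_counter() - start) * 1000.0
    return result, elapsed_ms


def fake_chat_completion(prompt: str) -> dict:
    """Hypothetical stand-in for a chat completion request."""
    time.sleep(0.01)  # simulate request latency
    return {"id": "chat-example", "usage": {"total_tokens": len(prompt.split())}}


response, elapsed_ms = timed_call(fake_chat_completion, "hello there")
print(response["id"], f"{elapsed_ms:.1f} ms round trip")
```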
Any help would be appreciated!