Eren
RunPod
Created by JohnDoe on 2/12/2025 in #⚡|serverless
Pulling from the wrong cache when multiple Dockerfiles in same GitHub repo
Hey @flash-singh, let me add a similar issue of mine: 1 repo, 2 branches, 2 deployments. The branches are mostly identical, but each branch's Dockerfile sets a different model name in an ENV variable. During docker build, the model is downloaded using this ENV variable, and the same variable is used at runtime to load that model. I deployed these 2 branches concurrently. After deployment, deployment B (branch B) looked for the correct model name (the ENV at runtime was fine) but couldn't find its model. I suspect it used the cached layer from branch A's deployment for download_models.py: the code is 100% the same, and the file reads model_type from ENV. Redeploying branch B fixed the issue.
24 replies
RunPod
Created by pkpio on 2/4/2025 in #⚡|serverless
Setting up CD for serverless endpoint
it's the one with mutation saveTemplate($input: SaveTemplateInput) {
12 replies
RunPod
Created by pkpio on 2/4/2025 in #⚡|serverless
Setting up CD for serverless endpoint
Yes, it is not well documented. You can open the web UI and watch the requests in the Network tab. The request is triggered when you click New Release -> Save. Copy the request you need as cURL, remove the headers, and add your API key. Then change imageName in it whenever you want to release a new image. This updates the image of the template that the endpoint uses.
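A minimal Python sketch of that release step, assuming you have already copied the `input` object from the captured New Release request. The selection set (`id`, `imageName`), the helper name `build_release_payload`, and the sample values are hypothetical; only the mutation name `saveTemplate`, the `SaveTemplateInput` type, and the `imageName` field come from this thread. It only builds the request body and does not send anything.

```python
import copy
import json

# Assumed selection set; copy the real query text from the captured request.
SAVE_TEMPLATE_MUTATION = """
mutation saveTemplate($input: SaveTemplateInput) {
  saveTemplate(input: $input) {
    id
    imageName
  }
}
"""


def build_release_payload(captured_input: dict, new_image: str) -> str:
    """Return a JSON request body where only imageName differs from the captured input."""
    # Deep copy so the captured input can be reused for later releases.
    template_input = copy.deepcopy(captured_input)
    template_input["imageName"] = new_image
    return json.dumps(
        {"query": SAVE_TEMPLATE_MUTATION, "variables": {"input": template_input}}
    )


if __name__ == "__main__":
    # Hypothetical captured input; the real one will likely carry more fields.
    captured = {"id": "TEMPLATE_ID", "name": "my-template", "imageName": "me/app:v1"}
    print(build_release_payload(captured, "me/app:v2"))
```

Reusing the captured input unchanged, apart from `imageName`, avoids guessing which other fields the mutation expects.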
12 replies
RunPod
Created by Eren on 1/28/2025 in #⚡|serverless
delayTime representing negative value
Hey @yhlong00000, no, I did not save the request ID
4 replies
RunPod
Created by Eren on 1/28/2025 in #⚡|serverless
delayTime representing negative value
No description
4 replies
RunPod
Created by EMPZ on 12/16/2024 in #⚡|serverless
GitHub integration: "exporting to oci image format" takes forever.
I've been following this thread since December and would be very happy to have faster image builds. As Arkadiy said, in my case too the only change is in the last, uncached part at the code level, yet I still get more than 1 hour (sometimes 2-3 or even 4 hours) of "exporting to oci image format. This takes a little bit of time. Please be patient." The build also randomly fails with network errors after a while.
25 replies
RunPod
Created by jackson hole on 1/13/2025 in #⚡|serverless
I want to increase/decrease workers by code, can you help?
🚀 PS: To set Active Workers, set the workersMin variable in the JSON
8 replies
RunPod
Created by jackson hole on 1/13/2025 in #⚡|serverless
I want to increase/decrease workers by code, can you help?
It's actually possible via the GraphQL API; you can find a sample Endpoint mutation call below
curl --location --globoff 'https://api.runpod.io/graphql?api_key={{RUNPOD_API_KEY}}' \
--header 'content-type: application/json' \
--data '{"query":"mutation saveEndpoint($input: EndpointInput!) {\n saveEndpoint(input: $input) {\n gpuIds\n id\n idleTimeout\n locations\n name\n networkVolumeId\n scalerType\n scalerValue\n templateId\n userId\n workersMax\n workersMin\n gpuCount\n __typename\n }\n}","variables":{"input":{"gpuIds":"ADA_24,AMPERE_24,-NVIDIA L4,-NVIDIA RTX A5000","gpuCount":1,"allowedCudaVersions":"","id":"{{ID}}","idleTimeout":1,"locations":null,"name":"faster_whisper -fb","networkVolumeId":null,"scalerType":"QUEUE_DELAY","scalerValue":4,"workersMax":3,"workersMin":0,"executionTimeoutMs":180000}}}'
GraphQL API docs: https://graphql-spec.runpod.io/
Without the API docs: open your dashboard, open Developer Tools with F12, and go to the Network tab. Then make an update to your endpoint and find the mutation API call. You can copy it as cURL and work on it in Postman. Remember to remove all headers and add the api_key query param.
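To change the worker counts from code, the same call can be sketched in Python with only the standard library. This is an illustration that assumes the endpoint URL, the `api_key` query param, and the `saveEndpoint` mutation behave as in the curl call from this thread; the helper name `build_scale_request` and the range check are my own additions, and `endpoint_input` is expected to be the full `input` object you captured for your endpoint.

```python
import json
import urllib.request

GRAPHQL_URL = "https://api.runpod.io/graphql"

# Trimmed selection set; every field here also appears in the curl call.
SAVE_ENDPOINT_MUTATION = (
    "mutation saveEndpoint($input: EndpointInput!) {"
    " saveEndpoint(input: $input) { id name workersMin workersMax } }"
)


def build_scale_request(api_key, endpoint_input, workers_min, workers_max):
    """Build (but do not send) a saveEndpoint request with new worker limits."""
    # Local sanity check (an assumption, not a documented API rule).
    if workers_min < 0 or workers_max < workers_min:
        raise ValueError("expected 0 <= workers_min <= workers_max")
    # Copy the captured input and override only the two worker fields.
    new_input = dict(endpoint_input, workersMin=workers_min, workersMax=workers_max)
    body = json.dumps(
        {"query": SAVE_ENDPOINT_MUTATION, "variables": {"input": new_input}}
    ).encode("utf-8")
    return urllib.request.Request(
        f"{GRAPHQL_URL}?api_key={api_key}",
        data=body,
        headers={"content-type": "application/json"},
        method="POST",
    )


if __name__ == "__main__":
    # Placeholder values; replace them with your captured endpoint input.
    captured = {"id": "ENDPOINT_ID", "name": "my-endpoint", "workersMin": 0, "workersMax": 3}
    request = build_scale_request("RUNPOD_API_KEY", captured, 1, 3)
    print(request.full_url)
    # To actually send it: urllib.request.urlopen(request)
```

Setting `workers_min` above 0 corresponds to the Active Workers setting mentioned in this thread.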
8 replies