codeRetarded
RRunPod
•Created by codeRetarded on 3/12/2024 in #⚡|serverless
Serverless multi gpu
Then you mean exporting the variable before running the code? But I don't seem to understand why does it work correctly for the first time the worker is spawned
13 replies
RRunPod
•Created by codeRetarded on 3/12/2024 in #⚡|serverless
Serverless multi gpu
oh you mean adding devices in the dockerfile while creating the container?
13 replies
RRunPod
•Created by codeRetarded on 3/12/2024 in #⚡|serverless
Serverless multi gpu
I don't know if I should make any changes to runpod source code for multi-gpu?
13 replies
RRunPod
•Created by codeRetarded on 3/12/2024 in #⚡|serverless
Serverless multi gpu
so this is my code, where I am trying to run a chat model, get_chat_response is the handler
13 replies
RRunPod
•Created by codeRetarded on 3/12/2024 in #⚡|serverless
Serverless multi gpu
Update : If I stop for some large amount of time and then send a request then it is working. I think it is working every time after some refresh. Please help.
13 replies
RRunPod
•Created by codeRetarded on 2/2/2024 in #⚡|serverless
Docker daemon is not started by default?
okayy, i have a particular usecase, is there a workaround?
5 replies
RRunPod
•Created by codeRetarded on 1/31/2024 in #⚡|serverless
Best way to deploy a new LLM serverless, where I don't want to build large docker images
@Alpay Ariyak thank you for the suggestion, this is something towards which I was looking for. Reduces docker time and uses serverless but if I have a large model won't the worker download it everytime it is sent a request?
27 replies
RRunPod
•Created by codeRetarded on 1/31/2024 in #⚡|serverless
Best way to deploy a new LLM serverless, where I don't want to build large docker images
@justin your suggestion seems to be creating separate pods for the model and the code, but this will just increase the cost by double if I were to only use serverless and downloaded the model from huggingface/github repos. Thanks for the depot suggestion, it seems interesting for docker interaction
27 replies