letajmal
RRunPod
•Created by letajmal on 3/12/2024 in #⚡|serverless
How can i make a follow up question to the endpoint
How can i make a follow up question to the endpoint like a thread. Eg: Chat in chatGPT
7 replies
RRunPod
•Created by letajmal on 3/8/2024 in #⚡|serverless
I am getting no response from serverless
11 replies
RRunPod
•Created by letajmal on 3/7/2024 in #⚡|serverless
Should i wait for the worker to pull my image
I have a large image (100 GB), should i wait for worker to pull the image before starting any inference
7 replies
RRunPod
•Created by letajmal on 3/1/2024 in #⚡|serverless
What is the recommended System Req for Building Worker Base Image
I was trying to build a custom runpod/worker-vllm:base-0.3.1-cuda${WORKER_CUDA_VERSION} image, but my 16vCPU, 64GB RAM server crashed. What is the recommended system spec for this purpose
19 replies