kaj
RunPod
•Created by openmind on 12/16/2024 in #⚡|serverless
How to deploy ModelsLab/Uncensored-llama3.1-nemotron?
are you quantizing or halving (fp16)? or running full fp32? You will probably need to do both
11 replies
RunPod
•Created by openmind on 12/16/2024 in #⚡|serverless
How to deploy ModelsLab/Uncensored-llama3.1-nemotron?
70B Llama models typically need a little over 48GB of VRAM, so try 80GB GPUs
11 replies
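A rough way to sanity-check the VRAM numbers in this thread: weight memory is roughly parameter count times bytes per parameter. The sketch below is only a back-of-the-envelope estimate. The `BYTES_PER_PARAM` table and the `weight_memory_gb` helper are illustrative names, not part of any RunPod or model API, and the figures cover weights only (the KV cache, activations, and runtime overhead come on top).

```python
# Back-of-the-envelope estimate of weight memory for a model at different precisions.
# Assumption: memory ~= parameter count * bytes per parameter (weights only).
BYTES_PER_PARAM = {
    "fp32": 4.0,  # full precision
    "fp16": 2.0,  # half precision ("halving")
    "int8": 1.0,  # 8-bit quantization
    "int4": 0.5,  # 4-bit quantization
}


def weight_memory_gb(num_params: float, dtype: str) -> float:
    """Return the approximate weight size in GB (1 GB = 1e9 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


if __name__ == "__main__":
    # 70e9 is the nominal parameter count of a "70B" model.
    for dtype in BYTES_PER_PARAM:
        print(f"{dtype}: ~{weight_memory_gb(70e9, dtype):.0f} GB for weights alone")
```

By this estimate a 70B model at fp32 or fp16 does not fit on a single 80GB card, which is why quantizing comes up in the thread.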
RunPod
•Created by jim on 12/12/2024 in #⚡|serverless
Github integration
ah, we shouldn't need write access. cc @Rutvik
9 replies
RunPod
•Created by wuxmes on 12/7/2024 in #⚡|serverless
Template id missing in serverless dashboard
Hey, thanks for the feedback! Would it help if it showed the template name in addition to the docker image?
4 replies
RunPod
•Created by alsruf36 on 12/7/2024 in #⚡|serverless
Disk size when building a github repository as an image on Serverless
no, container disk size only applies to running pods; it has no effect on the docker builders
3 replies
RunPod
•Created by Ben on 12/6/2024 in #⚡|serverless
Can't make serverless endpoints from GHCR container with new Runpod website update
Ah, I can see how that description can be misleading. You can still use images from other registries. I'll update that description, thanks for pointing that out
6 replies
Clarify RAM available
this should already have been fixed. Are you still having issues? It seems to work fine on my end (https://karalite.kaj.rocks/chrome_vCVXLLL3aN.mp4)
12 replies
RunPod
•Created by zkreutzjanz on 5/27/2024 in #⚡|serverless
Clone endpoint failing in UI
not yet, looking into a possible solution
29 replies
RunPod
•Created by zkreutzjanz on 5/27/2024 in #⚡|serverless
Clone endpoint failing in UI
might be that the GPU IDs you're sending aren't quite right?
29 replies
RunPod
•Created by zkreutzjanz on 5/27/2024 in #⚡|serverless
Clone endpoint failing in UI
it doesn't seem to be allowedCudaVersions causing the issue
29 replies
RunPod
•Created by zkreutzjanz on 5/27/2024 in #⚡|serverless
Clone endpoint failing in UI
hm, I can't seem to replicate this on my end. Can you detail the steps you take in the UI to reproduce this issue?
29 replies
RunPod
•Created by ashleyk on 3/11/2024 in #⚡|serverless
What is N95 in serverless metrics?
from what I can see, yes
10 replies