DeepSeek R1 Serverless for coding

I'm interested in running DeepSeek R1 in FP16, and I'm wondering whether Serverless is the way to go or whether a Pod would be better. I need it for 2-3 hours at a time, and I would like 'dedicated' access to the environment. Which DeepSeek R1 model should I pick (GGUF?), and how should I configure the deployment tool in Serverless to get it to run on an H100? Thanks in advance for any help.
6 Replies
lsdvaibhavvvv 4w ago
Hi, I am also trying to host DeepSeek R1 on Serverless. It fails at the endpoint level.
<MarDev/> 4w ago
Yo bro, did you find a solution?
MindDragon (OP) 4w ago
I don't think anyone did. I'm trying a full Pod of 5x 4080s ... idk what else to do.
lsdvaibhavvvv 4w ago
Here is the detailed error:
[Attached image, no description available]
lsdvaibhavvvv 4w ago
Now I am trying to host it using a Docker container, and I'll do the rest of the setup manually on the server side. Let's see.
nerdylive 3w ago
Maybe the endpoint didn't have enough GPU VRAM.
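
To put rough numbers on the VRAM point, here is a minimal Python sketch. It assumes DeepSeek R1's published size of about 671B total parameters, 2 bytes per parameter for FP16, 80 GB per H100 and 16 GB per RTX 4080. It counts the weights only, so KV cache and activation memory would come on top, and the helper names are made up for illustration.

```python
import math

# Assumptions (not from this thread): ~671B total parameters for DeepSeek R1,
# 80 GB of VRAM per H100, 16 GB per RTX 4080.
R1_PARAMS = 671e9
H100_VRAM_GB = 80
RTX_4080_VRAM_GB = 16


def weight_memory_gb(num_params: float, bytes_per_param: float) -> float:
    """Memory needed just to hold the weights, in GB (1 GB = 1e9 bytes)."""
    return num_params * bytes_per_param / 1e9


def gpus_needed(required_gb: float, vram_per_gpu_gb: float) -> int:
    """Smallest GPU count whose combined VRAM covers the weights."""
    return math.ceil(required_gb / vram_per_gpu_gb)


fp16_gb = weight_memory_gb(R1_PARAMS, 2)       # FP16 = 2 bytes per parameter
pod_vram_gb = 5 * RTX_4080_VRAM_GB             # the 5x 4080 Pod mentioned above

print(f"FP16 weights: ~{fp16_gb:.0f} GB")
print(f"H100s needed for weights alone: {gpus_needed(fp16_gb, H100_VRAM_GB)}")
print(f"5x 4080 Pod total VRAM: {pod_vram_gb} GB")
```

Under these assumptions, neither a single H100 nor a 5x 4080 Pod comes close to holding the full model in FP16, so a heavily quantized build or one of the smaller distilled R1 variants is probably the realistic option on that hardware.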
