R
RunPod4w ago
zethos

Cannot find any model weights with `/models/huggingface-cache/hub/models...`

Hi, I made a docker image using the "STEP-2" mentioned in Readme file. I created an template with docker image with below environment variables: MODEL_NAME="migtissera/Tess-3-Mistral-Large-2-123B" MAX_MODEL_LEN=65536 TENSOR_PARALLEL_SIZE=8 GPU_MEMORY_UTILIZATION=0.92 ENABLE_CHUNKED_PREFILL=1 NCCL_P2P_DISABLE=1 OMP_NUM_THREADS=1 ENFORCE_EAGER=1 The docker image: snbhanja/tess3mistrallarge128b:latest I tried to deploy this into a serverless with 8 48GB GPU. I get the below error but I didn't get this error when the very first time it is deployed: RuntimeError: Cannot find any model weights with `/models/huggingface-cache/hub/models--migtissera--Tess-3-Mistral-Large-2-123B/snapshots/8047f7cc9615909650b6a4ae5d13719d3e11594d Even if i delete the serveless endpoint and try to make one using this, it gives same error. Full log: https://github.com/user-attachments/files/18603761/logs.11.txt
1 Reply
yhlong00000
yhlong000003w ago
What if you remove env variables model name

Did you find this page helpful?