The bug I upgraded Immich and the machine learning container fails to start. Output of sudo docker logs -f immich_machine_learning [09/06/23 10:35:59] INFO Booting worker with pid: 4585 [09/06/23 1...
The download only happens if the file doesn’t exist in the cache folder when a request comes in.
My guess for what might be happening is that it downloads to a temp file that it renames to the download path on completion. If this file is corrupt, it might not be deleted when it tries to download the model again, and later attempts fail because it tries to resume the download with a corrupt file. This is just a guess, though.
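To make that guess concrete, here is a minimal Python sketch of the "download to a temp file, then rename" pattern, written so that a stale partial file is thrown away rather than resumed. This is not Immich's actual download code; `ensure_model`, the `.part` suffix, and the 600-second default are assumptions for illustration only.

```python
import os
import shutil
import urllib.request
from pathlib import Path


def ensure_model(url: str, dest: Path, timeout: float = 600.0) -> Path:
    """Download url to dest only if dest is missing (hypothetical helper)."""
    if dest.exists():
        # Cache hit: the download is skipped entirely.
        return dest

    dest.parent.mkdir(parents=True, exist_ok=True)
    part = dest.with_name(dest.name + ".part")

    # Discard any leftover partial file instead of resuming from it,
    # since it may be corrupt.
    part.unlink(missing_ok=True)

    try:
        # Note: this timeout applies to each blocking socket operation,
        # not to the download as a whole.
        with urllib.request.urlopen(url, timeout=timeout) as resp, open(part, "wb") as out:
            shutil.copyfileobj(resp, out)
        # Rename only after the download finished, so dest is never half-written.
        os.replace(part, dest)
    except BaseException:
        # Clean up so the next attempt starts from scratch.
        part.unlink(missing_ok=True)
        raise
    return dest
```

If the real downloader skips the cleanup step and resumes from the partial file, that would match the symptom of every retry failing until the cache is wiped.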
start.sh is in the working directory of immich-machine-learning. There’s a related PR that improves downloading beyond the timeout fix, but you can probably fix this by setting the timeout to a high value like 600s and deleting the model cache volume.
Yeah, the temp file is visual.onnx.part or something, but deleting it didn’t work, and neither did manually placing the fully downloaded file in that directory.
OK, yeah, I saw the PR and the path, but I don’t see it within the Immich container for some reason. Do you know what the path to the working directory might look like?
So if you have this file in the path it's looking for, you should restart the container and run a CLIP job. It should load without any errors. If it still doesn't work, I'd suggest deleting the model cache volume, making sure you're on the latest release (1.78) and trying again.
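Before restarting, it may help to confirm that the file in the cache really is the complete model and not a truncated copy. A rough sketch, assuming you have a known-good copy downloaded by hand; both helper names are hypothetical and any paths you pass in are your own, not paths Immich is known to use:

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so large model files are not read into memory at once."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def same_file_contents(cached: Path, reference: Path) -> bool:
    """Return True only if cached exists, is non-empty, and matches reference byte for byte."""
    if not cached.is_file() or cached.stat().st_size == 0:
        return False
    return sha256_of(cached) == sha256_of(reference)
```

If the two hashes differ, the cached file is the likely culprit and deleting the model cache volume is the simpler fix.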
The bug When I search for anything in immich, I get generic errors in the UI. In docker logs, I can see that something is trying to download Downloading clip model 'ViT-B-32::openai' ... an...