9 Replies
@flash-singh I can't use runpod, for that strange issue. I have the same docker image, but built a near a month ago. It works perfectly.
all the jobs get stuck or just that one?
all jobs in that queue
Looks like the first job from cold worker start always fine, 2+ more that 50% chance to be stucked
ping me endpoint id
the endpoint is bad, or using a bad sdk, can you make sure its updated
i can see the jobs being taken from queue but not being reported back as soon as job is taken
uucgkak7h76hfd
SDK the latest versionI create a new endpoint with the same docker image. Problem almost the same
p6j8tqfojfhmll
I have older docker image, used in production. All works good. Version of that SDK is 1.7.2
My latest works on 1.7.4 and have that problems
I have this pip diff https://www.diffchecker.com/eYoE7Gm2/
Where runpod 1.7.2 is the older working good docker image.
Diffchecker - Compare text online to find the difference between tw...
Diffchecker will compare text to find the difference between two text files. Just paste your files and click Find Difference!
So i can confirm that 1.7.4 contains some bugs around it
1.7.2 works well without any issue