Why it can be stucked IN_PROGRESS?

No description
9 Replies
rougsig
rougsigOP3d ago
@flash-singh I can't use runpod, for that strange issue. I have the same docker image, but built a near a month ago. It works perfectly.
flash-singh
flash-singh3d ago
all the jobs get stuck or just that one?
rougsig
rougsigOP3d ago
all jobs in that queue Looks like the first job from cold worker start always fine, 2+ more that 50% chance to be stucked
flash-singh
flash-singh3d ago
ping me endpoint id the endpoint is bad, or using a bad sdk, can you make sure its updated i can see the jobs being taken from queue but not being reported back as soon as job is taken
rougsig
rougsigOP2d ago
uucgkak7h76hfd SDK the latest version
rougsig
rougsigOP2d ago
I create a new endpoint with the same docker image. Problem almost the same p6j8tqfojfhmll
No description
rougsig
rougsigOP2d ago
I have older docker image, used in production. All works good. Version of that SDK is 1.7.2 My latest works on 1.7.4 and have that problems
rougsig
rougsigOP2d ago
I have this pip diff https://www.diffchecker.com/eYoE7Gm2/ Where runpod 1.7.2 is the older working good docker image.
Diffchecker - Compare text online to find the difference between tw...
Diffchecker will compare text to find the difference between two text files. Just paste your files and click Find Difference!
rougsig
rougsigOP2d ago
So i can confirm that 1.7.4 contains some bugs around it 1.7.2 works well without any issue
Want results from more Discord servers?
Add your server