I want to deploy a serverless endpoint with using Unsloth
Unsloth do bnb qunatization and it's better loading their model, I think. I did training using Unsloth on a pod; I want to deploy it on a serverless endpoint and get the OpenIA client API
1 Reply