is there any method to deploy bert architecture models serverlessly?
Hi
Yes, there is, using Hugging Face's transformers.
You can cache the model first in some path, then load it from that relative/absolute path.
Put the model in the image, or use a network volume.
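The cache-then-load flow above can be sketched like this. To keep the sketch runnable offline it saves a tiny, randomly initialized BERT instead of downloading `bert-base-uncased` (in a real image build you would call `from_pretrained("bert-base-uncased")` once and save it); the directory path is a placeholder, not anything RunPod-specific.

```python
from pathlib import Path
from transformers import BertConfig, BertModel

# Placeholder path: in practice this would live in the image or on a network volume.
MODEL_DIR = Path("/tmp/bert-cache")

# Build-time step: materialize the model weights on disk.
# (Real build: model = BertModel.from_pretrained("bert-base-uncased"))
config = BertConfig(
    hidden_size=32,          # tiny dims so the sketch runs quickly offline
    num_hidden_layers=1,
    num_attention_heads=2,
    intermediate_size=64,
)
model = BertModel(config)
model.save_pretrained(MODEL_DIR)

# Runtime step (inside the handler/container): load from the local path,
# no network access needed.
reloaded = BertModel.from_pretrained(MODEL_DIR)
```

Loading from a local directory this way skips the Hub download on every cold start, which is the main point of baking the model into the image or volume.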
Hi, thanks for the help.
Is there any template for this?
Or should I write the handler myself?
yes
there is
use the vLLM template
Solution
If you need help with configuring that template, there's a setup menu after you click on it.
Yeah, I have tried this.
Oh wait, BERT isn't compatible with that, is it?
But it doesn't support BERT models.
Exactly
Hmm, then you'll have to write a custom handler first for BERT.
Using the transformers pipeline works too.