Issue in pod

I've been facing an issue for the last 2 days: sometimes the RTX 4090 generates 60 tokens/second, and sometimes only 20-30 tokens/second when generating the same response. I don't know what is causing this.