Trying to understand Parallelize

I have some code which I would like to speed up and cannot vectorize since it uses custom types. I am trying to use parallelize to split up the compute and am running in a Docker container on Intel Mac, Mojo 0.7.0. I have tried many combinations of parallelize[func](jobs, workers) but the best improvement I can get is about 2.5x. Why can I not do any better?
Was this page helpful?