kh
Global Networking
I am trying to use Global Networking. I have 1 master and 2 worker GPUs, each on a different Pod but all in the same data centre. It seems that only port 22 is open between the Pods and the other ports are not. I also tried specifying a TCP port to expose when starting the Pods, but that did not work either. I need to allow communication between the Pods for torch.dist
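To narrow down whether a port is actually reachable between two Pods, independent of PyTorch, a plain TCP probe can help. The following is only a minimal sketch: the helper names `port_is_open` and `check_master_from_env` are made up for illustration, and it assumes the job uses the `MASTER_ADDR` and `MASTER_PORT` environment variables (the ones the `env://` initialisation in torch.distributed reads). The fallback port 29500 is an assumption, not something from this setup.

```python
import os
import socket


def port_is_open(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        # create_connection resolves the host and performs the TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name resolution failures.
        return False


def check_master_from_env() -> bool:
    """Probe the rendezvous address a worker would use (hypothetical helper)."""
    # Assumption: MASTER_ADDR / MASTER_PORT are set as for env:// rendezvous.
    host = os.environ.get("MASTER_ADDR", "127.0.0.1")
    port = int(os.environ.get("MASTER_PORT", "29500"))
    reachable = port_is_open(host, port)
    print(f"{host}:{port} reachable: {reachable}")
    return reachable
```

Run `check_master_from_env()` on a worker Pod while something is listening on that port on the master Pod. A `False` result with a listener running suggests the traffic is blocked between the Pods rather than a problem in the training script. Note that the probe only reports `True` when a process is listening, so a closed port and a blocked port look the same from the worker side.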