8000 GitHub - heluocs/mpi-operator: Repository for the MPI operator.
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

heluocs/mpi-operator

 
 

Repository files navigation

MPI Operator

The MPI Operator makes it easy to run allreduce-style distributed training.

Deploy

kubectl create -f deploy/

Test

Launch a multi-node tensorflow benchmark training job:

kubectl create -f examples/tensorflow-benchmarks.yaml

Once everything starts, the logs are available in the launcher pod.

About

Repository for the MPI operator.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Go 96.4%
  • Shell 2.5%
  • Dockerfile 1.1%
0