System is build with CUDA/OpenCL
nodes on top of (GPU) hosts to allow the distributed workloads layered on top of physical and/or virtual hosts.
The Infra & monitoring should be done by invoking you favorite k8s orchestrator eg. Rancher 1.6.x or 2.x
This docker image serves as the base for applications that use OpenCL/OpenGL/Vulkan Compute abstractions and also serves as middle-ware for ThereminQ, Bonsai Nbody and Mandelbulber container images. Recently a gaming tag was added for quick PoC purposes.
Spin up the controller image and you will have a VNC WebUI with vendor/driver support and setting in an X session avaliable at port 6080
Expose the port if you want, an SSL loadbalancer advised as 6080
will be cleartext. Default password is 00000000
docker run --gpus all --device=/dev/kfd --device=/dev/dri:/dev/dri -d twobombs/cudacluster
The worker node sees the underlying hardware PCI bus X-times the amount of nodes on the host so the controller node will see an X-amount of CUDA cores. Both the worker and controller nodes are of the same image; a controller can therefore also work because it has the binaries and settings to do so.
This setting used to optimize the workload and usage of the GPUs, and also allows the abstract of running workload on thousands of GPU cores while only running on a small subset of those cores, therefore accurately simulating scaling. To make deployment for such environments easier one can use k3d in combination with Rancher 2.4+
This container image has drivers and/or configurations for
- AMD ( OCL & ROCm )
- Nvidia ( CUDA, Vulkan & OpenCL )
- Intel ( OCLgrind & Beignet Compute )
- CPU only OpenCL POCL
- VirtualCL for OpenCL clusters
v2023
upgrade to Ubuntu 22.04 and CUDA 12.1v2022c
added virtualcl https://mosix.cs.huji.ac.il/txt_vcl.htmlv2022b
vulkan: added Zink GL https://launchpad.net/~kisak/+archive/ubuntu/kisak-mesav2022a
gaming: added Sunshine for streaming https://github.com/SunshineStream/Sunshinev2021
gaming: added Wine, Steam Lutrix and Vulkan Engine in Gaming tagv2020
iteration: upgraded to CUDA 11+ @ Ubuntu 20.04- OpenCL 1.2+ drivers for the 3 main Compute vendors plus one CPU only POCL version