Tags: Hamidreza-Ramezani/nccl
Tags
2.6.4-1 Add support for network collectives. Add support for XML topology dump/injection. Add text values for GDR and P2P Levels, including "NVL". Add speed detection for PCI, Infiniband and Ethernet cards. Add CPU detection for ARM and AMD CPUs. Add support for adaptive routing on Infiniband. Change NET plugin API to v3 : merge PCI path and GPU pointer capability into a single structure and add other properties.
2.5.6-1 (NVIDIA#255) Add LL128 Protocol. Rewrite the topology detection and tree/ring creation (NVIDIA#179). Improve tree performance by sending/receiving from different GPUs. Add model-based tuning to switch between the different algorithms and protocols. Rework P2P/SHM detection in containers (NVIDIA#155, NVIDIA#248). Detect duplicated devices and return an error (NVIDIA#231). Add tuning for GCP
2.4.8-1 Fix NVIDIA#209: improve socket transport performance Split transfers over multiple sockets Launch multiple threads to drive sockets Detect AWS NICs and set nsockets/nthreads accordingly
PreviousNext