Tags · jesunsahariar/nccl · GitHub

8000 Tags · jesunsahariar/nccl · GitHub

More Web Proxy on the site http://driver.im/

Tags

v2.4.2-1

2.4.2-1

Add tree algorithms for allreduce to improve performance at scale.
Add ncclCommAbort() and ncclCommGetAsyncError() to properly handle
network errors and be permit recover.
Detect initial CPU affinity and no longer escape it.

Jan 29, 2019
1450d42
zip
tar.gz

v2.3.7-1

2.3.7-1

Improved LL tuning for multi-node jobs.
Improved bootstrap for large job scaling.
Fixed a hang during bootstrap due to socket reuse.
Added operation name to the COLL INFO logging.

Oct 24, 2018
b56650c
zip
tar.gz

v2.3.5-5

2.3.5-5

Add support for inter-node communication using sockets and InfiniBand/RoCE.
Improve latency.
Add support for aggregation.
Improve LL/regular tuning.
Remove tests as those are now at github.com/nvidia/nccl-tests .

Sep 25, 2018
f93fe9b
zip
tar.gz

v2.1.2

Fix bug 2011094 : Crash when user stream is NULL.

Make sure we call cudaSetDevice in GroupEnd and if the user stream
is NULL.
Add an API test and an option in the perf tests as well.

Oct 25, 2017
498c402
zip
tar.gz

v1.3.4-1

Added Pascal nvcc flags, bumped version

Mar 24, 2017
649f04d
zip
tar.gz

v1.3.0-1

Add scan tests

Sep 22, 2016
ca330b1
zip
tar.gz

v1.2.3-1+cuda8.0

Preparing for 1.2.3 rebuild

Jun 13, 2016
4d9188a
zip

v1.2.3-1+cuda7.5

Updating for .deb rebuild

Jun 13, 2016
aa8f669
zip
tar.gz

v1.2.2-1+cuda8.0

Gencodes changed to NV recommended

Jun 6, 2016
560f3ff
zip
tar.gz

v1.2.2-1+cuda7.5

Gencodes changed to NV recommended

Jun 6, 2016
177505b
zip
tar.gz

PreviousNext

0