8000 Tags · jesunsahariar/nccl · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

Tags: jesunsahariar/nccl

Tags

v2.4.2-1

Toggle v2.4.2-1's commit message
2.4.2-1

Add tree algorithms for allreduce to improve performance at scale.
Add ncclCommAbort() and ncclCommGetAsyncError() to properly handle
network errors and be permit recover.
Detect initial CPU affinity and no longer escape it.

v2.3.7-1

Toggle v2.3.7-1's commit message
2.3.7-1

Improved LL tuning for multi-node jobs.
Improved bootstrap for large job scaling.
Fixed a hang during bootstrap due to socket reuse.
Added operation name to the COLL INFO logging.

v2.3.5-5

Toggle v2.3.5-5's commit message
2.3.5-5

Add support for inter-node communication using sockets and InfiniBand/RoCE.
Improve latency.
Add support for aggregation.
Improve LL/regular tuning.
Remove tests as those are now at github.com/nvidia/nccl-tests .

v2.1.2

Toggle v2.1.2's commit message
Fix bug 2011094 : Crash when user stream is NULL.

Make sure we call cudaSetDevice in GroupEnd and if the user stream
is NULL.
Add an API test and an option in the perf tests as well.

v1.3.4-1

Toggle v1.3.4-1's commit message
Added Pascal nvcc flags, bumped version

v1.3.0-1

Toggle v1.3.0-1's commit message
Add scan tests

v1.2.3-1+cuda8.0

Toggle v1.2.3-1+cuda8.0's commit message
Preparing for 1.2.3 rebuild

v1.2.3-1+cuda7.5

Toggle v1.2.3-1+cuda7.5's commit message
Updating for .deb rebuild

v1.2.2-1+cuda8.0

Toggle v1.2.2-1+cuda8.0's commit message
Gencodes changed to NV recommended

v1.2.2-1+cuda7.5

Toggle v1.2.2-1+cuda7.5's commit message
Gencodes changed to NV recommended

0