Multi-GPU Communication with NCCL



Official tutorial | Official download page


Download and Installation

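Go to the official website and pick the NCCL build that matches your CUDA version.
For example, the installation commands for CUDA 11.6 are as follows: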

Network installers (x86)

If not already done, make sure you install the network repository:


Network Installer for Ubuntu20.04

- $ wget https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2004/x86_64/cuda-keyring_1.0-1_all.deb
- $ sudo dpkg -i cuda-keyring_1.0-1_all.deb
- $ sudo apt-get update

Network Installer for Ubuntu18.04

- $ wget https://developer.download.nvidia.com/compute/cuda/repos/ubuntu1804/x86_64/cuda-keyring_1.0-1_all.deb
- $ sudo dpkg -i cuda-keyring_1.0-1_all.deb
- $ sudo apt-get update

Network Installer for RedHat/CentOS 8

- $ sudo dnf config-manager --add-repo https://developer.download.nvidia.com/compute/cuda/repos/rhel8/x86_64/cuda-rhel8.repo

Network Installer for RedHat/CentOS 7

- $ sudo yum-config-manager --add-repo https://developer.download.nvidia.com/compute/cuda/repos/rhel7/x86_64/cuda-rhel7.repo
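
The four repository setups above differ only in the distribution tag embedded in the URL. As a rough sketch, that mapping can be written as two small shell functions. The names `cuda_repo_tag` and `cuda_repo_file` are hypothetical helpers made up for this post, not part of any NVIDIA tooling, and only the four distributions listed above are covered. The `ID` and `VERSION_ID` arguments are assumed to come from `/etc/os-release`.

```shell
# Hypothetical helpers (not NVIDIA tooling): map a distribution to the
# repository file used in the network-installer steps above.
cuda_repo_base="https://developer.download.nvidia.com/compute/cuda/repos"

# $1 = ID, $2 = VERSION_ID (as found in /etc/os-release)
cuda_repo_tag() {
    case "$1" in
        ubuntu)      echo "ubuntu$(echo "$2" | tr -d '.')" ;;  # 20.04 -> ubuntu2004
        rhel|centos) echo "rhel${2%%.*}" ;;                    # 8.6   -> rhel8
        *)           echo "unsupported distribution: $1" >&2; return 1 ;;
    esac
}

# $1 = repository tag; only the tags listed in this post are accepted
cuda_repo_file() {
    case "$1" in
        ubuntu2004|ubuntu1804) echo "$cuda_repo_base/$1/x86_64/cuda-keyring_1.0-1_all.deb" ;;
        rhel8|rhel7)           echo "$cuda_repo_base/$1/x86_64/cuda-$1.repo" ;;
        *)                     echo "unknown repository tag: $1" >&2; return 1 ;;
    esac
}

# Example: print the keyring package URL for Ubuntu 20.04.
cuda_repo_file "$(cuda_repo_tag ubuntu 20.04)"
```

On Ubuntu the printed URL is what gets passed to `wget`; on RHEL/CentOS it is the argument to `--add-repo`.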

Then run the following command to install NCCL:

For Ubuntu:

- $ sudo apt install libnccl2=2.12.12-1+cuda11.6 libnccl-dev=2.12.12-1+cuda11.6

For RHEL/CentOS:

- $ sudo yum install libnccl-2.12.12-1+cuda11.6 libnccl-devel-2.12.12-1+cuda11.6 libnccl-static-2.12.12-1+cuda11.6
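
The package names in the install commands follow one pattern: the NCCL version and the CUDA version joined as `<nccl>+cuda<cuda>`. A minimal sketch of that pattern is below. `nccl_packages` is a hypothetical helper written for illustration only, and it assumes that other NCCL/CUDA combinations use the same naming scheme as the 2.12.12-1 / 11.6 pair shown here, which should be checked against the download page.

```shell
# Hypothetical helper (not part of NCCL): build the version-pinned package
# list used in the install commands above.
# $1 = package manager (apt or yum), $2 = NCCL version, $3 = CUDA version
nccl_packages() {
    pin="$2+cuda$3"
    case "$1" in
        apt) echo "libnccl2=${pin} libnccl-dev=${pin}" ;;
        yum) echo "libnccl-${pin} libnccl-devel-${pin} libnccl-static-${pin}" ;;
        *)   echo "unsupported package manager: $1" >&2; return 1 ;;
    esac
}

# Example: print the apt package list for NCCL 2.12.12-1 built against CUDA 11.6.
nccl_packages apt 2.12.12-1 11.6
```

Pinning both the runtime and the development package to the same string keeps the headers and the shared library on the same NCCL build.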


Author: Austin
Copyright notice: Unless otherwise stated, all articles on this blog are licensed under CC BY 4.0. Please credit Austin as the source when reposting.