最近安装maxnet_gpu版本,出现错误,特此记录和给出解决方法。 OSError: libnccl.so.2: cannot open shared object file: No such file or directory mxnet 主要是缺少 libnccl库。由于登录官网下载需要注册,这里记录一下对应cuda10.2的安装命令。

Ubuntu18.04版本的,实测20版本也能用

$ wget https://developer.download.nvidia.com/compute/cuda/repos/ubuntu1804/x86_64/cuda-keyring_1.0-1_all.deb

$ sudo dpkg -i cuda-keyring_1.0-1_all.deb

$ sudo apt-get update

sudo apt install libnccl2=2.15.5-1+cuda10.2 libnccl-dev=2.15.5-1+cuda10.2

CUDA 11.8版本

Network Installer for Ubuntu22.04

$ wget https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64/cuda-keyring_1.0-1_all.deb

$ sudo dpkg -i cuda-keyring_1.0-1_all.deb

$ sudo apt-get update

Network Installer for Ubuntu20.04

$ wget https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2004/x86_64/cuda-keyring_1.0-1_all.deb

$ sudo dpkg -i cuda-keyring_1.0-1_all.deb

$ sudo apt-get update

Network Installer for Ubuntu18.04

$ wget https://developer.download.nvidia.com/compute/cuda/repos/ubuntu1804/x86_64/cuda-keyring_1.0-1_all.deb

$ sudo dpkg -i cuda-keyring_1.0-1_all.deb

$ sudo apt-get update

最后:

sudo apt install libnccl2=2.15.5-1+cuda11.8 libnccl-dev=2.15.5-1+cuda11.8

好文链接

评论可见,请评论后查看内容,谢谢!!!
 您阅读本篇文章共花了: