Skip to content
This repository was archived by the owner on Aug 15, 2025. It is now read-only.

Conversation

atalman
Copy link
Contributor

@atalman atalman commented Jan 9, 2024

Fix: pytorch/pytorch#116977

Nccl 2.19.3 don't exist for cuda 11.8 and cuda 12.1. Refer to https://docs.nvidia.com/deeplearning/nccl/release-notes/rel_2-19-3.html#rel_2-19-3 CUDA 12.0, 12.2, 12.3 are supported.

Hence we do manual build. Follow this build process: https://github.com/NVIDIA/nccl/tree/v2.19.3-1?tab=readme-ov-file#build

We want nccl version be exactly the same as installed here: https://github.com/pytorch/pytorch/blob/main/.github/scripts/generate_binary_build_matrix.py#L45

@huydhn huydhn merged commit 7afbef9 into pytorch:release/2.2 Jan 9, 2024
@Skylion007
Copy link
Contributor

Oh yeah, I asked about this in November. Should have brought this up: NVIDIA/nccl#1093

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants