Skip to content

Don't detect local GPU if $DS_IGNORE_CUDA_DETECTION is set#7896

Merged
tohtana merged 4 commits intodeepspeedai:masterfrom
Flamefire:DS_IGNORE_CUDA_DETECTION-fix
Mar 24, 2026
Merged

Don't detect local GPU if $DS_IGNORE_CUDA_DETECTION is set#7896
tohtana merged 4 commits intodeepspeedai:masterfrom
Flamefire:DS_IGNORE_CUDA_DETECTION-fix

Conversation

@Flamefire
Copy link
Contributor

Make this consistent over all OPs:
For cross-compilation we should not check the local GPU version.

This is also important when having multiple different GPUs in the system, so checking only GPU 0 could be wrong.

It looks like checks for the version of CUDA should be moved out of this and checks for compute capability vs CUDA version moved to filter_ccs instead.

CUDAOpBuilder.is_compatible could call compute_capability_args to see if the CCs after filtering turn up empty.

Make this consistent over all OPs:
For cross-compilation we should not check the local GPU version.
@tohtana tohtana enabled auto-merge (squash) March 24, 2026 14:01
@tohtana tohtana merged commit 26c954f into deepspeedai:master Mar 24, 2026
2 checks passed
@Flamefire Flamefire deleted the DS_IGNORE_CUDA_DETECTION-fix branch March 24, 2026 14:27
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants