Conversation

@eqy (Collaborator) commented Nov 23, 2022

@pytorch-bot added the `release notes: sparse` label (release notes category) on Nov 23, 2022
@pytorch-bot (bot) commented Nov 23, 2022

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/89582

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 Failure

As of commit 39a85f4:

FLAKY - The following jobs failed but were likely due to flakiness present on master:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@yaox12 (Contributor) commented Dec 5, 2022

There is some legacy code wrapped in the `__CUDACC_VER_MAJOR__` macro; should we remove that too? For example:

#if __CUDACC_VER_MAJOR__ < 8 || defined(USE_ROCM)

})
#endif
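
For illustration only, a sketch (not the actual patch) of how such a guard might collapse once the minimum CUDA version is 11: the `__CUDACC_VER_MAJOR__ < 8` half of the condition can no longer be true under a supported nvcc, leaving only the ROCm branch. The function and names below are invented placeholders.

```cpp
// Hypothetical sketch, not real PyTorch code; legacy_fallback_path() is an
// invented placeholder for whatever the guarded block does.
inline void legacy_fallback_path() {}

inline void example() {
  // Before (when CUDA < 11 was still supported):
  //   #if __CUDACC_VER_MAJOR__ < 8 || defined(USE_ROCM)
  //     legacy_fallback_path();
  //   #endif
  //
  // After: with nvcc always >= 11, only the ROCm branch can still fire,
  // so the CUDA-version half of the condition could presumably be dropped:
#if defined(USE_ROCM)
  legacy_fallback_path();
#endif
}
```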

// Workaround for C10_UNUSED because CUDA 10.2 and below fails to handle unused

Review comment (Collaborator): remove this comment also

#if defined(__CUDACC__) && CUDA_VERSION < 11000
#define C10_UNUSED_DISPATCH_CUDA_WORKAROUND
#else
#define C10_UNUSED_DISPATCH_CUDA_WORKAROUND C10_UNUSED

Review comment (Collaborator): probably all C10_UNUSED_DISPATCH... can be replaced by C10_UNUSED now?


Review comment (Collaborator): Nit: can we just replace C10_UNUSED_DISPATCH_CUDA_WORKAROUND with C10_UNUSED everywhere?
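
A hedged sketch of the cleanup the reviewers are suggesting: once CUDA < 11 is unsupported, the `#if` above always takes the `#else` branch, so the workaround macro is just an alias for `C10_UNUSED` and can be deleted, with call sites using `C10_UNUSED` directly. The dispatch-case macro and enum below are simplified, made-up stand-ins, not the real ATen dispatch internals.

```cpp
// Simplified stand-in for the real C10_UNUSED (see c10/macros/Macros.h).
#define C10_UNUSED __attribute__((__unused__))

// Before, a dispatch case would have used the workaround alias:
//   C10_UNUSED_DISPATCH_CUDA_WORKAROUND const auto the_type = enum_type;
// After, C10_UNUSED is used directly and the alias is removed.
#define EXAMPLE_DISPATCH_CASE(enum_type, ...)   \
  case enum_type: {                             \
    C10_UNUSED const auto the_type = enum_type; \
    return __VA_ARGS__();                       \
  }

enum class ExampleScalarType { Float, Double };

inline int example_dispatch(ExampleScalarType t) {
  switch (t) {
    EXAMPLE_DISPATCH_CASE(ExampleScalarType::Float, [] { return 32; })
    EXAMPLE_DISPATCH_CASE(ExampleScalarType::Double, [] { return 64; })
  }
  return 0;
}
```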

template <typename T>
C10_HOST_DEVICE constexpr thrust::complex<T>
cuda101bug_cast_c10_complex_to_thrust_complex(const c10::complex<T>& x) {
#if defined(CUDA_VERSION) && (CUDA_VERSION < 10020)

Review comment (Collaborator): similarly here, cuda101bug... uses should just be replaced with static_cast, as the comment suggests
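
A hedged sketch of the suggested replacement; the wrapper function and its name below are invented for illustration, and this assumes the plain `static_cast` path works on all supported toolkits, as the comment in the original code indicates.

```cpp
#include <thrust/complex.h>

#include <c10/util/complex.h>

// Hypothetical call site: with CUDA 10.1 no longer supported, the
// cuda101bug_cast_c10_complex_to_thrust_complex() workaround can go away
// and a plain static_cast is enough.
template <typename T>
C10_HOST_DEVICE thrust::complex<T> to_thrust(const c10::complex<T>& x) {
  // Before: return cuda101bug_cast_c10_complex_to_thrust_complex(x);
  return static_cast<thrust::complex<T>>(x);
}
```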

@eqy added the `ciflow/trunk` label (Trigger trunk jobs on your pull request) on Dec 6, 2022
@eqy (Collaborator, Author) commented Dec 6, 2022

Hmm, what macro should be used instead of USE_ROCM or TORCH_HIP_VERSION in c10/util/*? The former isn't defined there for HIP builds, and the latter still seems to break on the CI test runs (e.g., https://github.com/pytorch/pytorch/actions/runs/3625677594/jobs/6114427598#logs).

@eqy (Collaborator, Author) commented Dec 6, 2022

CC @jithunnair-amd, who might know more about the last issue.

@eqy changed the title from "[WIP][CUDA] Drop CUDA 10 support" to "[CUDA] Drop CUDA 10 support" on Dec 30, 2022
@eqy (Collaborator, Author) commented Dec 30, 2022

I've left the version guards in the c10/cuda files for now, as no one from AMD/ROCm has responded. Dropping the [WIP] tag.

@eqy added the `ciflow/periodic` label (Trigger jobs ran periodically on master via periodic.yml on the PR) on Dec 30, 2022
@eqy requested a review from ngimel on December 30, 2022 04:39

@eqy (Collaborator, Author) commented Jan 5, 2023

@pytorchmergebot merge -f "ROCM failure appears unrelated"

@pytorchmergebot (Collaborator) commented

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging: check the merge workflow status here.

pytorchmergebot pushed a commit that referenced this pull request Jan 24, 2023
Follow-up to #89582 to drop flags like `CUDA11OrLater` in tests. Note that in some places `TEST_WITH_ROCM` appears to be _implicitly_ guarded against via the `CUDA11OrLater` version check (based on my best guess of how `torch.version.cuda` behaves in ROCm builds), so I've added `not TEST_WITH_ROCM` in cases where ROCm wasn't previously explicitly allowed.

CC @ptrblck @malfet @ngimel
Pull Request resolved: #92605
Approved by: https://github.com/ngimel

Labels

ciflow/periodic (Trigger jobs ran periodically on master via periodic.yml on the PR)
ciflow/trunk (Trigger trunk jobs on your pull request)
Merged
open source
release notes: sparse (release notes category)
