
[CMake] Fix USE_FBGEMM_GENAI option#164165

Closed
malfet wants to merge 2 commits into gh/malfet/542/base from gh/malfet/542/head

Conversation

@malfet
Contributor

@malfet malfet commented Sep 29, 2025

Stack from ghstack (oldest at bottom):


  • cmake_dependent_option condition should be USE_ROCM OR (USE_CUDA AND NOT MSVC) (similar to the one for flash attention)
  • Default settings should be user overridable, i.e. even if one builds for SM_10, they should be able to pass USE_FBGEMM_GENAI=0 and skip the build
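The two fixes above can be sketched roughly as follows (a minimal sketch, not the actual diff; the option description string and the exact default computation are assumptions):

```cmake
# Compute a platform-dependent default, but keep it user-overridable:
# option() never clobbers an existing cache entry, so -DUSE_FBGEMM_GENAI=0
# on the command line always wins over the computed default.
set(USE_FBGEMM_GENAI_DEFAULT OFF)
if(USE_ROCM AND "gfx942" IN_LIST PYTORCH_ROCM_ARCH)
  set(USE_FBGEMM_GENAI_DEFAULT ON)
endif()

include(CMakeDependentOption)
# Condition mirrors the flash-attention option: ROCm, or CUDA without MSVC.
cmake_dependent_option(
  USE_FBGEMM_GENAI
  "Build FBGEMM GenAI kernels"
  "${USE_FBGEMM_GENAI_DEFAULT}"
  "USE_ROCM OR (USE_CUDA AND NOT MSVC)"
  OFF)
```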

[ghstack-poisoned]
@pytorch-bot

pytorch-bot bot commented Sep 29, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/164165

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 58c1c07 with merge base 069ccf5:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

malfet added a commit that referenced this pull request Sep 29, 2025
- `cmake_dependent_option` condition should be `USE_ROCM OR (USE_CUDA AND NOT MSVC)` (similar to the one for flash attention)
- Default settings should be user overridable, i.e. even if one builds for SM_10, they should be able to pass `USE_FBGEMM_GENAI=0` and skip the build

ghstack-source-id: e741c64
Pull Request resolved: #164165
@malfet malfet added release notes: build release notes category topic: bug fixes topic category labels Sep 29, 2025
@malfet malfet added the release notes: build and topic: bug fixes labels Sep 29, 2025
Copy link
Collaborator

@Skylion007 Skylion007 left a comment


Looks good, but I'm confused about why we are removing useful debugging print info from the CMake summary:

```cmake
message(STATUS "  USE_EIGEN_FOR_BLAS    : ${CAFFE2_USE_EIGEN_FOR_BLAS}")
message(STATUS "  USE_EIGEN_FOR_SPARSE  : ${USE_EIGEN_SPARSE}")
message(STATUS "  USE_FBGEMM            : ${USE_FBGEMM}")
message(STATUS "  USE_FBGEMM_GENAI      : ${USE_FBGEMM_GENAI}")
```
Collaborator


Why remove the printing here?

Contributor Author


I'm adding the print statement, aren't I?

```cmake
IF(USE_ROCM AND "gfx942" IN_LIST PYTORCH_ROCM_ARCH)
  message(WARNING "Setting USE_FBGEMM_GENAI for gfx942 to ON by default, doing ROCM build")
  set(USE_FBGEMM_GENAI_DEFAULT ON)
elseif(USE_CUDA AND "$ENV{TORCH_CUDA_ARCH_LIST}" MATCHES "10.0" AND CMAKE_CUDA_COMPILER_VERSION VERSION_GREATER_EQUAL 12.8 AND NOT WIN32)
```
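The excerpt quoted above ends mid-branch; presumably the CUDA case also flips the default and everything else leaves it OFF, so the full logic would look roughly like this (a hedged reconstruction, not the verbatim diff):

```cmake
set(USE_FBGEMM_GENAI_DEFAULT OFF)
if(USE_ROCM AND "gfx942" IN_LIST PYTORCH_ROCM_ARCH)
  message(WARNING "Setting USE_FBGEMM_GENAI for gfx942 to ON by default, doing ROCM build")
  set(USE_FBGEMM_GENAI_DEFAULT ON)
elseif(USE_CUDA AND "$ENV{TORCH_CUDA_ARCH_LIST}" MATCHES "10.0"
       AND CMAKE_CUDA_COMPILER_VERSION VERSION_GREATER_EQUAL 12.8
       AND NOT WIN32)
  # Datacenter Blackwell (sm_100) with CUDA 12.8+ on a non-Windows host.
  set(USE_FBGEMM_GENAI_DEFAULT ON)
endif()
```

Note that `MATCHES "10.0"` is a regex match against the whole environment value, so any arch list containing `10.0` (including `10.0a`) satisfies it.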
Collaborator


Only for datacenter Blackwell? Do we not need it for 11.0 or 12.0 arches too?

Contributor Author


This check is, frankly, insane, and yes, it should probably be enabled for more GPU architectures; but at least this change makes the logic user-overridable, so one can build for SM_12 if they choose to.

Collaborator


Yeah, this is going to be a footgun for consumer Blackwell soon, though.

Contributor Author


I'm not sure it'll work on consumer Blackwell, as it compiles for sm_100a, which is not directly runnable on or translatable to sm_12.

Contributor

@danielvegamyhre danielvegamyhre Sep 29, 2025


The mxfp8 grouped gemm in fbgemm must be built with sm100a and cuda 12.8+, which is why this check exists the way it does. Furthermore, torch nightly builds only set TORCH_CUDA_ARCH_LIST=10.0, never 10.0a due to lack of portability, so we have to add sm100a to the build targets for fbgemm.

I am not sure if sm120 supports the tcgen* PTX instructions needed for the mxfp8 grouped gemm kernel?
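If sm_100a does have to be added explicitly when nightlies export `TORCH_CUDA_ARCH_LIST=10.0`, the append could look roughly like this (a hypothetical sketch; the `fbgemm_genai` target name and the variable are illustrative, not taken from the actual build files):

```cmake
# Nightly builds only export the portable 10.0 arch, but the mxfp8 grouped
# GEMM kernels need the arch-specific sm_100a variant (tcgen* instructions),
# so append it for this target only.
set(FBGEMM_GENAI_CUDA_ARCHS "100")        # from TORCH_CUDA_ARCH_LIST=10.0
if(CMAKE_CUDA_COMPILER_VERSION VERSION_GREATER_EQUAL 12.8)
  list(APPEND FBGEMM_GENAI_CUDA_ARCHS "100a")
endif()
set_property(TARGET fbgemm_genai PROPERTY CUDA_ARCHITECTURES ${FBGEMM_GENAI_CUDA_ARCHS})
```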

@malfet malfet added the ciflow/trunk label Sep 29, 2025
[ghstack-poisoned]
malfet added a commit that referenced this pull request Sep 29, 2025
- `cmake_dependent_option` condition should be `USE_ROCM OR (USE_CUDA AND NOT MSVC)` (similar to the one for flash attention)
- Default settings should be user overridable, i.e. even if one builds for SM_10, they should be able to pass `USE_FBGEMM_GENAI=0` and skip the build

ghstack-source-id: fe63e9b
Pull Request resolved: #164165
@malfet
Contributor Author

malfet commented Sep 29, 2025

@pytorchbot merge

@pytorchmergebot
Collaborator

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status here.

@github-actions github-actions bot deleted the gh/malfet/542/head branch October 31, 2025 02:16

Labels

ciflow/trunk · Merged · release notes: build · topic: bug fixes

4 participants