Add cuda headers automatically for compile_kernel by msaroufim · Pull Request #162634 · pytorch/pytorch

msaroufim · 2025-09-10T20:21:34Z

This issue was pointed out earlier by @ngimel and more recently by @gau-nernst in https://gau-nernst.github.io/nvrtc-matmul/#missing-cuda-and-c-headers-

The benefit is that kernel source passed to compile_kernel can now use `#include <cuda_fp16.h>` without the compile failing.
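
As a rough illustration of what adding the CUDA headers automatically could involve, the sketch below builds an NVRTC-style `-I` flag for the toolkit's `include` directory. The helper name `cuda_include_options`, the `CUDA_HOME`/`CUDA_PATH` lookup order, and the `/usr/local/cuda` fallback are assumptions made for illustration, not the code in this PR.

```python
import os
from typing import List, Optional


def cuda_include_options(cuda_home: Optional[str] = None) -> List[str]:
    """Hypothetical helper: return NVRTC include flags for the CUDA toolkit headers."""
    # Assumption: the toolkit root is given explicitly, or comes from CUDA_HOME
    # or CUDA_PATH, or falls back to the common Linux install location.
    root = (
        cuda_home
        or os.environ.get("CUDA_HOME")
        or os.environ.get("CUDA_PATH")
        or "/usr/local/cuda"
    )
    include_dir = os.path.join(root, "include")

    # Emit the flag only when the directory exists, so a missing toolkit
    # leaves the option list empty instead of pointing at a bogus path.
    if not os.path.isdir(include_dir):
        return []
    return ["-I" + include_dir]
```

With a flag like this passed to NVRTC, kernel source that begins with `#include <cuda_fp16.h>` has a directory in which the header can be found.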

pytorch-bot · 2025-09-10T20:21:39Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/162634


✅ You can merge normally! (1 Unrelated Failure)

As of commit af54ca8 with merge base 1051c7d:

FLAKY - The following job failed but was likely due to flakiness present on trunk:

trunk / linux-jammy-cuda12.8-py3.10-gcc11 / test (distributed, 3, 3, lf.linux.g4dn.12xlarge.nvidia.gpu) (gh) (similar failure)
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! KeyboardInterrupt !!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!

This comment was automatically generated by Dr. CI and updates every 15 minutes.

msaroufim · 2025-09-10T21:03:26Z

@pytorchbot merge

pytorchmergebot · 2025-09-10T21:05:23Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status here

pytorchmergebot · 2025-09-10T23:27:54Z

Merge failed

Reason: 1 jobs have failed, first few of them are: trunk / linux-jammy-cuda12.8-py3.10-gcc11 / test (distributed, 3, 3, lf.linux.g4dn.12xlarge.nvidia.gpu)

Details for Dev Infra team

Raised by workflow job

msaroufim · 2025-09-11T00:02:28Z

@pytorchbot merge -i

pytorchmergebot · 2025-09-11T00:04:12Z

Merge started

Your change will be merged while ignoring the following 1 checks: trunk / linux-jammy-cuda12.8-py3.10-gcc11 / test (distributed, 3, 3, lf.linux.g4dn.12xlarge.nvidia.gpu)


@ngimel

This issue was pointed out earlier by @ngimel and more recently by @gau-nernst in https://gau-nernst.github.io/nvrtc-matmul/#missing-cuda-and-c-headers-

The benefit is that kernel source passed to compile_kernel can now use `#include <cuda_fp16.h>` without the compile failing.

Pull Request resolved: pytorch#162634
Approved by: https://github.com/ngimel


Add cuda headers automatically for compile_kernel

77168e5

msaroufim requested review from eqy and syed-ahmed as code owners September 10, 2025 20:21

msaroufim added the "release notes: cuda" label (release notes category) Sep 10, 2025

msaroufim added 3 commits September 10, 2025 13:31

update with c++ standard lib

bfb5d88

update

c016c89

remove cpp lib

af54ca8

ngimel approved these changes Sep 10, 2025


pytorch-bot bot added the "ciflow/trunk" label (Trigger trunk jobs on your pull request) Sep 10, 2025

pytorchmergebot added the merging label Sep 10, 2025

pytorchmergebot removed the merging label Sep 10, 2025

pytorchmergebot added the merging label Sep 11, 2025

pytorchmergebot added the Merged label Sep 11, 2025

pytorchmergebot closed this in 4fd2a2b Sep 11, 2025

pytorchmergebot removed the merging label Sep 11, 2025

msaroufim mentioned this pull request Sep 17, 2025

compile_kernel: fast inline compilation with nvrtc tracker #163142 (Open, 20 tasks)

github-actions bot deleted the compile_kernel_include_dir branch October 11, 2025 02:07

msaroufim wants to merge 4 commits into main from compile_kernel_include_dir


3 participants
