[BE] Preserve caller source location in the error message by albanD · Pull Request #162808 · pytorch/pytorch

albanD · 2025-09-12T14:23:17Z

Summary:
Currently, C10_CUDA_CHECK reports only a source location inside CUDAException.cpp, as shown below:

Exception raised from c10_cuda_check_implementation at fbcode/caffe2/c10/cuda/CUDAException.cpp:44

This is not useful, because it points at the check helper rather than at the code that called it.

The original diff D39619861, which introduced c10_cuda_check_implementation, suggests that the original macro reported the source location correctly and that moving the check into c10_cuda_check_implementation broke it.

This diff propagates the caller's source location to c10_cuda_check_implementation to fix the issue.
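The general technique can be sketched as follows. This is a minimal, CUDA-free illustration and not the code from this PR: `SKETCH_CHECK`, `check_implementation`, `fake_status_t`, and `failing_caller_message` are hypothetical names, and the message format only imitates the "Exception raised from ..." line quoted above. The macro expands at the call site, so `__FILE__`, `__func__`, and `__LINE__` describe the caller, and the implementation function uses those arguments instead of its own location.

```cpp
#include <cstdint>
#include <sstream>
#include <stdexcept>
#include <string>

// Hypothetical stand-in for a CUDA status code; 0 means success.
using fake_status_t = int32_t;

// Hypothetical implementation function. It receives the caller's location
// as arguments and reports that, not its own file and line.
inline void check_implementation(
    fake_status_t err,
    const char* filename,
    const char* function_name,
    uint32_t line_number) {
  if (err == 0) {
    return;
  }
  std::ostringstream msg;
  msg << "error code " << err << "\n"
      << "Exception raised from " << function_name << " at " << filename
      << ":" << line_number;
  throw std::runtime_error(msg.str());
}

// The macro body is expanded where it is used, so the location macros
// below are evaluated in the caller's function and file.
#define SKETCH_CHECK(EXPR)                    \
  do {                                        \
    const fake_status_t sketch_err_ = (EXPR); \
    check_implementation(                     \
        sketch_err_,                          \
        __FILE__,                             \
        __func__,                             \
        static_cast<uint32_t>(__LINE__));     \
  } while (0)

// Hypothetical caller: triggers a failure with an arbitrary nonzero code
// and returns the resulting error message.
inline std::string failing_caller_message() {
  try {
    SKETCH_CHECK(700);
  } catch (const std::runtime_error& e) {
    return e.what();
  }
  return "";
}
```

Calling `failing_caller_message()` yields a message whose last line names `failing_caller_message` and the file containing it, rather than `check_implementation`. If the implementation function instead built the location from its own `__FILE__` and `__LINE__`, every failure would point at the same helper line, which is the symptom described above.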

Test Plan:
CI

Observed the desired error message after the change:

CUDA error: an illegal memory access was encountered
Search for `cudaErrorIllegalAddress' in https://docs.nvidia.com/cuda/cuda-runtime-api/group__CUDART__TYPES.html for more information.
CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1
Device-side assertion tracking was not enabled by user.
Exception raised from operator() at fbcode/sigrid/predictor/aed/AedContainer.cpp:659 (most recent call first):

Note that the last line reports the actual caller location.

Rollback Plan:

Reviewed By: Raymo111

Differential Revision: D81880552

pytorch-bot · 2025-09-12T14:23:22Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/162808

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 37a1255 with merge base 03798b0:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot · 2025-09-12T14:23:28Z

@albanD has exported this pull request. If you are a Meta employee, you can view the originating diff in D81880552.

albanD · 2025-09-12T14:27:16Z

@pytorchbot merge

pytorchmergebot · 2025-09-12T14:29:10Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status here

pytorchmergebot · 2025-09-12T14:39:51Z

Merge failed

Reason: 1 mandatory check(s) failed. The first few are:

Lint / lintrunner-clang / linux-job

Dig deeper by viewing the failures on hud

Details for Dev Infra team

Raised by workflow job

Failing merge rule: Core Maintainers

albanD · 2025-09-12T14:47:52Z

@pytorchbot merge

pytorchmergebot · 2025-09-12T14:50:03Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status here

pytorchmergebot · 2025-09-12T15:11:28Z

Merge failed

Reason: 1 jobs have failed, first few of them are: trunk / win-vs2022-cuda12.6-py3 / build

Details for Dev Infra team

Raised by workflow job

albanD · 2025-09-15T13:21:59Z

@pytorchbot merge

pytorchmergebot · 2025-09-15T13:24:02Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status here

albanD requested review from eqy and syed-ahmed as code owners September 12, 2025 14:23

facebook-github-bot added fb-exported meta-exported labels Sep 12, 2025

janeyx99 approved these changes Sep 12, 2025

pytorch-bot bot added the ciflow/trunk label Sep 12, 2025

albanD added the topic: not user facing label Sep 12, 2025

janeyx99 added the release notes: cuda label and removed the topic: not user facing label Sep 12, 2025

albanD removed the release notes: cuda label Sep 12, 2025

janeyx99 added the release notes: cuda label Sep 12, 2025

albanD added the topic: bug fixes label Sep 12, 2025

pytorchmergebot added the merging label Sep 12, 2025

pytorchmergebot removed the merging label Sep 12, 2025

lint

f7ee89a

pytorchmergebot added the merging label Sep 12, 2025

pytorchmergebot removed the merging label Sep 12, 2025

c++17 only

37a1255

pytorchmergebot added the merging label Sep 15, 2025

pytorchmergebot closed this in 09cbf34 Sep 15, 2025

pytorchmergebot added Merged and removed merging labels Sep 15, 2025

albanD wants to merge 3 commits into pytorch:main from albanD:export-D81880552
