Skip to content

[xpu][test] Enable profiler test for XPU#165423

Closed
chuanqi129 wants to merge 2 commits intopytorch:mainfrom
chuanqi129:xpu_profiling_test
Closed

[xpu][test] Enable profiler test for XPU#165423
chuanqi129 wants to merge 2 commits intopytorch:mainfrom
chuanqi129:xpu_profiling_test

Conversation

@chuanqi129
Copy link
Collaborator

@chuanqi129 chuanqi129 commented Oct 14, 2025

@chuanqi129 chuanqi129 requested a review from etaf October 14, 2025 08:46
@chuanqi129 chuanqi129 requested a review from a team as a code owner October 14, 2025 08:46
@chuanqi129 chuanqi129 added keep-going Don't stop on first failure, keep running tests until the end ciflow/xpu Run XPU CI tasks labels Oct 14, 2025
@pytorch-bot
Copy link

pytorch-bot bot commented Oct 14, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/165423

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (3 Unrelated Failures)

As of commit 72f466d with merge base 2325c51 (image):

FLAKY - The following jobs failed but were likely due to flakiness present on trunk:

BROKEN TRUNK - The following job failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@etaf
Copy link
Collaborator

etaf commented Oct 14, 2025

Seems the failure in test/test_xpu.py::TestXpuXPU::test_lazy_init_xpu is caused by this PR.

@etaf
Copy link
Collaborator

etaf commented Oct 16, 2025

@pytorchbot rebase -b main

@pytorchmergebot
Copy link
Collaborator

@pytorchbot started a rebase job onto refs/remotes/origin/main. Check the current status here

@pytorchmergebot
Copy link
Collaborator

Successfully rebased xpu_profiling_test onto refs/remotes/origin/main, please pull locally before adding more changes (for example, via git checkout xpu_profiling_test && git pull --rebase)

@mikaylagawarecki mikaylagawarecki added the triaged This issue has been looked at a team member, and triaged and prioritized into an appropriate module label Oct 17, 2025
@etaf
Copy link
Collaborator

etaf commented Oct 20, 2025

@pytorchbot rebase -b main

@pytorchmergebot
Copy link
Collaborator

@pytorchbot started a rebase job onto refs/remotes/origin/main. Check the current status here

@pytorchmergebot
Copy link
Collaborator

Successfully rebased xpu_profiling_test onto refs/remotes/origin/main, please pull locally before adding more changes (for example, via git checkout xpu_profiling_test && git pull --rebase)

@chuanqi129
Copy link
Collaborator Author

@pytorchbot rebase -b main

@pytorchmergebot
Copy link
Collaborator

@pytorchbot started a rebase job onto refs/remotes/origin/main. Check the current status here

@pytorchmergebot
Copy link
Collaborator

Successfully rebased xpu_profiling_test onto refs/remotes/origin/main, please pull locally before adding more changes (for example, via git checkout xpu_profiling_test && git pull --rebase)

@chuanqi129 chuanqi129 force-pushed the xpu_profiling_test branch 2 times, most recently from d8a5ac2 to 920f38b Compare October 27, 2025 15:53
@etaf etaf changed the title [CI] Enable profiler test for XPU [xpu][test] Enable profiler test for XPU Oct 28, 2025
@Stonepia
Copy link
Contributor

Stonepia commented Nov 4, 2025

TODO:

  1. Will enable the tests in [XPU][Test] Enable XPU tests in inductor/test_analysis.py #166840 after this PR merged. Currently, the CI will fail with the err msg:
inductor/test_analysis.py::TestAnalysisXPU::test_combine_profiles_xpu_float16 [W1104 04:37:03.461419559 kineto_shim.cpp:459] Adding profiling metadata requires using torch.profiler with Kineto support (USE_KINETO=1)
[W1104 04:37:03.520726527 Module.cpp:197] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1...

[W1104 04:37:03.520954455 Module.cpp:197] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1...

[W1104 04:37:33.261990690 kineto_shim.cpp:459] Adding profiling metadata requires using torch.profiler with Kineto support (USE_KINETO=1)
[W1104 04:37:59.879077103 kineto_shim.cpp:459] Adding profiling metadata requires using torch.profiler with Kineto support (USE_KINETO=1)
FAILED [66.9644s] [100%]

Copy link
Contributor

@atalman atalman left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

@chuanqi129
Copy link
Collaborator Author

@pytorchbot rebase -b main

@pytorchmergebot
Copy link
Collaborator

@pytorchbot started a rebase job onto refs/remotes/origin/main. Check the current status here

@pytorchmergebot
Copy link
Collaborator

Successfully rebased xpu_profiling_test onto refs/remotes/origin/main, please pull locally before adding more changes (for example, via git checkout xpu_profiling_test && git pull --rebase)

@pytorch-bot
Copy link

pytorch-bot bot commented Nov 9, 2025

❌ 🤖 pytorchbot command failed:

@pytorchbot: error: argument command: invalid choice: 'decided' (choose from 'merge', 'revert', 'rebase', 'label', 'drci', 'cherry-pick')

usage: @pytorchbot [-h] {merge,revert,rebase,label,drci,cherry-pick} ...

Try @pytorchbot --help for more info.

@chuanqi129
Copy link
Collaborator Author

@pytorchbot drci

@chuanqi129
Copy link
Collaborator Author

@pytorchbot drci

@chuanqi129
Copy link
Collaborator Author

@pytorchbot merge

@pytorch-bot pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Nov 10, 2025
@pytorchmergebot
Copy link
Collaborator

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status
here

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ciflow/inductor ciflow/trunk Trigger trunk jobs on your pull request ciflow/xpu Run XPU CI tasks keep-going Don't stop on first failure, keep running tests until the end Merged module: inductor open source topic: not user facing topic category triaged This issue has been looked at a team member, and triaged and prioritized into an appropriate module

Projects

None yet

Development

Successfully merging this pull request may close these issues.

DISABLED test_aoti_user_defined_triton_kernel_profiling_xpu (__main__.AOTInductorTestABICompatibleGpu)

9 participants