-
Notifications
You must be signed in to change notification settings - Fork 26.3k
[Profiler] Specialized AppendOnlyQueue #73409
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
CI Flow Status⚛️ CI FlowRuleset - Version:
|
🔗 Helpful links
💊 CI failures summary and remediationsAs of commit ea050e7 (more details on the Dr. CI page): 💚 💚 Looks good so far! There are no failures yet. 💚 💚 This comment was automatically generated by Dr. CI (expand for details).Please report bugs/suggestions to the (internal) Dr. CI Users group. |
|
This pull request was exported from Phabricator. Differential Revision: D34231072 |
|
This pull request was exported from Phabricator. Differential Revision: D34231072 |
5d755d4 to
8fcf1d0
Compare
|
This pull request was exported from Phabricator. Differential Revision: D34231072 |
8fcf1d0 to
fda6ee7
Compare
|
This pull request was exported from Phabricator. Differential Revision: D34231072 |
fda6ee7 to
f2a2d9f
Compare
|
This pull request was exported from Phabricator. Differential Revision: D34231072 |
f2a2d9f to
64303e5
Compare
Summary: Pull Request resolved: pytorch#73409 We can do better than `vector` or `deque`, and it's sufficiently important to the hot path to justify a custom container. (This is part of the larger queue refactor, but this is a standalone drop-in replacement so we don't need to wait.) Test Plan: It's a pretty simple container type, so I just added a few cpp tests for emplace and read back. I also ran the overhead benchmark (replicates=9) with both `--stressTestKineto` (0.99 -> 0.94 us) and `--stressTestKineto --kinetoProfileMemory` (1.36 -> 1.27 us). Reviewed By: swolchok Differential Revision: D34231072 fbshipit-source-id: fccbc9ab7adaa1645c631fee6aa85787f85df599
|
This pull request was exported from Phabricator. Differential Revision: D34231072 |
64303e5 to
ea050e7
Compare
Summary: Pull Request resolved: #73409 We can do better than `vector` or `deque`, and it's sufficiently important to the hot path to justify a custom container. (This is part of the larger queue refactor, but this is a standalone drop-in replacement so we don't need to wait.) Test Plan: It's a pretty simple container type, so I just added a few cpp tests for emplace and read back. I also ran the overhead benchmark (replicates=9) with both `--stressTestKineto` (0.99 -> 0.94 us) and `--stressTestKineto --kinetoProfileMemory` (1.36 -> 1.27 us). Reviewed By: swolchok Differential Revision: D34231072 fbshipit-source-id: ed57299729d444d59cf843a0d38a3ee2240eeec1
|
Hey @robieta. |
Summary: We can do better than
vectorordeque, and it's sufficiently important to the hot path to justify a custom container. (This is part of the larger queue refactor, but this is a standalone drop-in replacement so we don't need to wait.)Test Plan: It's a pretty simple container type, so I just added a few cpp tests for emplace and read back. I also ran the overhead benchmark (replicates=9) with both
--stressTestKineto(0.99 -> 0.94 us) and--stressTestKineto --kinetoProfileMemory(1.36 -> 1.27 us).Differential Revision: D34231072