Skip to content

[benchmark] Add HF LLM benchmarks#156967

Closed
angelayi wants to merge 8 commits intomainfrom
angelayi/benchmark2
Closed

[benchmark] Add HF LLM benchmarks#156967
angelayi wants to merge 8 commits intomainfrom
angelayi/benchmark2

Conversation

@angelayi
Copy link
Contributor

@angelayi angelayi commented Jun 26, 2025

@angelayi angelayi requested review from anijain2305 and zou3519 June 26, 2025 17:02
@pytorch-bot
Copy link

pytorch-bot bot commented Jun 26, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/156967

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit a20e357 with merge base a749c40 (image):
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@BoyuanFeng
Copy link
Contributor

Thanks for adding more models! A few minor comments. Also, please fix the ci.

@BoyuanFeng
Copy link
Contributor

Curious, will we add these models into existing Huggingface column or a new column called "huggingface_llm"? It might be a bit confusing with two columns starting with "huggingface"..

image

@angelayi angelayi force-pushed the angelayi/benchmark2 branch 4 times, most recently from 0749c30 to bbf4a09 Compare August 11, 2025 15:37
@angelayi
Copy link
Contributor Author

@BoyuanFeng yes! I have updated to merge everything into the huggingface column.

@angelayi angelayi marked this pull request as ready for review August 11, 2025 15:43
elif args.export_nativert:
frozen_model_iter_fn = export_nativert(model, example_inputs)
use_generate_mode = kwargs.get("use_generate_mode", False)
if use_generate_mode:
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I added this use_generate_mode flag so that we only apply torch.compile/export to model.forward, instead of applying it to model.generate

@angelayi angelayi force-pushed the angelayi/benchmark2 branch 2 times, most recently from f48faf2 to 441527d Compare August 12, 2025 04:45
@huydhn
Copy link
Contributor

huydhn commented Aug 20, 2025

@pytorchbot rebase

@pytorchmergebot
Copy link
Collaborator

@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here

@pytorchmergebot
Copy link
Collaborator

Successfully rebased angelayi/benchmark2 onto refs/remotes/origin/viable/strict, please pull locally before adding more changes (for example, via git checkout angelayi/benchmark2 && git pull --rebase)

@huydhn
Copy link
Contributor

huydhn commented Aug 20, 2025

(I'm rebase to bring in the latest transformers from #159291)

@angelayi angelayi force-pushed the angelayi/benchmark2 branch 2 times, most recently from 31864dd to 3edcecb Compare August 26, 2025 22:08
@huydhn
Copy link
Contributor

huydhn commented Aug 27, 2025

@pytorchbot rebase

@pytorchmergebot
Copy link
Collaborator

@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here

@pytorchmergebot
Copy link
Collaborator

Successfully rebased angelayi/benchmark2 onto refs/remotes/origin/viable/strict, please pull locally before adding more changes (for example, via git checkout angelayi/benchmark2 && git pull --rebase)

Signed-off-by: Huy Do <huydhn@gmail.com>
- GPTJForCausalLM
- GPTJForQuestionAnswering
# Model too big
- google/gemma-3-4b-it
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Once this lands, let me see if we could use a100 for all HF models instead. This should resolve this issue

Signed-off-by: Huy Do <huydhn@gmail.com>
@huydhn huydhn requested a review from a team as a code owner September 12, 2025 01:29
Signed-off-by: Huy Do <huydhn@gmail.com>
Signed-off-by: Huy Do <huydhn@gmail.com>
Signed-off-by: Huy Do <huydhn@gmail.com>
Signed-off-by: Huy Do <huydhn@gmail.com>
Signed-off-by: Huy Do <huydhn@gmail.com>
@huydhn
Copy link
Contributor

huydhn commented Sep 14, 2025

@pytorchbot merge -f 'Previous round jobs were all ok'

@pytorchmergebot
Copy link
Collaborator

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use -f as last resort and instead consider -i/--ignore-current to continue the merge ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status
here

jeffdaily added a commit to ROCm/pytorch that referenced this pull request Sep 15, 2025
PR pytorch#156967 added HF LLM benchmarks but did not add the ci expected
accuracy files for ROCm.
pytorchmergebot pushed a commit that referenced this pull request Sep 15, 2025
PR #156967 added HF LLM benchmarks but did not add the ci expected accuracy files for ROCm.

Pull Request resolved: #162965
Approved by: https://github.com/jeffdaily

Co-authored-by: Jeff Daily <jeff.daily@amd.com>
facebook-github-bot pushed a commit to pytorch/benchmark that referenced this pull request Sep 16, 2025
Summary:
Results in https://docs.google.com/spreadsheets/d/1xXOPg9JjEmPx0zc5QBNdyXQq8-K2_r4ybHaiS-q7pZ0/edit?gid=88695043#gid=88695043

X-link: pytorch/pytorch#156967
Approved by: https://github.com/huydhn

Reviewed By: wdvr

Differential Revision: D82462749

fbshipit-source-id: e7f087c0deb38b4441c568f7cca4691506e35a32

Co-authored-by: Huy Do <huydhn@gmail.com>
markc-614 pushed a commit to markc-614/pytorch that referenced this pull request Sep 17, 2025
markc-614 pushed a commit to markc-614/pytorch that referenced this pull request Sep 17, 2025
)

PR pytorch#156967 added HF LLM benchmarks but did not add the ci expected accuracy files for ROCm.

Pull Request resolved: pytorch#162965
Approved by: https://github.com/jeffdaily

Co-authored-by: Jeff Daily <jeff.daily@amd.com>
mansiag05 pushed a commit to mansiag05/pytorch that referenced this pull request Sep 22, 2025
mansiag05 pushed a commit to mansiag05/pytorch that referenced this pull request Sep 22, 2025
)

PR pytorch#156967 added HF LLM benchmarks but did not add the ci expected accuracy files for ROCm.

Pull Request resolved: pytorch#162965
Approved by: https://github.com/jeffdaily

Co-authored-by: Jeff Daily <jeff.daily@amd.com>
cleonard530 pushed a commit to cleonard530/pytorch that referenced this pull request Sep 22, 2025
cleonard530 pushed a commit to cleonard530/pytorch that referenced this pull request Sep 22, 2025
)

PR pytorch#156967 added HF LLM benchmarks but did not add the ci expected accuracy files for ROCm.

Pull Request resolved: pytorch#162965
Approved by: https://github.com/jeffdaily

Co-authored-by: Jeff Daily <jeff.daily@amd.com>
dsashidh pushed a commit to dsashidh/pytorch that referenced this pull request Sep 26, 2025
dsashidh pushed a commit to dsashidh/pytorch that referenced this pull request Sep 26, 2025
)

PR pytorch#156967 added HF LLM benchmarks but did not add the ci expected accuracy files for ROCm.

Pull Request resolved: pytorch#162965
Approved by: https://github.com/jeffdaily

Co-authored-by: Jeff Daily <jeff.daily@amd.com>
@github-actions github-actions bot deleted the angelayi/benchmark2 branch October 15, 2025 02:12
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

6 participants