@wz337 (Contributor) commented Jan 24, 2023

Fixes #92823

@pytorch-bot added the "release notes: distributed (fsdp)" label (release notes category) Jan 24, 2023

@pytorch-bot commented Jan 24, 2023

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/92933

Note: Links to docs will display an error until the docs builds have been completed.

❌ 8 Failures

As of commit cdbb246:

NEW FAILURES - The following jobs have failed:

BROKEN TRUNK - The following jobs failed but were present on the merge base 811e95a:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@wz337 requested a review from kumpera January 24, 2023 23:50
@wz337 marked this pull request as ready for review January 24, 2023 23:52
@kumpera (Contributor) left a comment

👏

@wz337 (Contributor, Author) commented Jan 25, 2023

@pytorchmergebot rebase

@pytorchmergebot (Collaborator)

@pytorchbot successfully started a rebase job. Check the current status here

@pytorchmergebot (Collaborator)

Successfully rebased default_planner_flags onto refs/remotes/origin/viable/strict, please pull locally before adding more changes (for example, via git checkout default_planner_flags && git pull --rebase)

@wz337 (Contributor, Author) commented Jan 25, 2023

@pytorchmergebot merge

@pytorch-bot added the "ciflow/trunk" label (Trigger trunk jobs on your pull request) Jan 25, 2023

@pytorchmergebot (Collaborator)

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status here

@pytorchmergebot (Collaborator)

Merge failed

Reason: 2 mandatory check(s) failed (Rule Distributed). The first few are:

Dig deeper by viewing the failures on hud

Details for Dev Infra team Raised by workflow job

@kumpera (Contributor) commented Feb 7, 2023

@wz337 a lot of DCP tests run only during merging, not during regular CI testing, since they require multiple GPUs.

@wz337 (Contributor, Author) commented Feb 8, 2023

@pytorchmergebot merge -f "unrelated xla, functorch, dynamo, crossref test failures"

@pytorchmergebot (Collaborator)

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes).


Labels

ciflow/trunk (Trigger trunk jobs on your pull request)
Merged
release notes: distributed (fsdp) (release notes category)

Development

Successfully merging this pull request may close these issues.

[PT-D][Checkpoint] Fix load_sharded_optimizer_state_dict() when default flatten_sharded_tensors to True

3 participants