Commit 61ec047

Update on "[DTensor] create default process group when absent"
[ghstack-poisoned]
2 parents 7cb3154 + 9a060dd commit 61ec047

370 files changed, +11896 -6124 lines changed

.github/actions/setup-rocm/action.yml

Lines changed: 2 additions & 0 deletions
@@ -16,6 +16,8 @@ runs:
 # ignore expansion of "docker ps -q" since it could be empty
 # shellcheck disable=SC2046
 docker stop $(docker ps -q) || true
+# Prune all stopped containers.
+docker container prune -f

 - name: Runner health check system info
 if: always()

.github/actions/teardown-rocm/action.yml

Lines changed: 2 additions & 0 deletions
@@ -12,6 +12,8 @@ runs:
 # ignore expansion of "docker ps -q" since it could be empty
 # shellcheck disable=SC2046
 docker stop $(docker ps -q) || true
+# Prune all stopped containers.
+docker container prune -f
 # Prune everything docker if there are more than 10 images (~200GB).
 # This is easier than using a time filter, e.g., "until=24h".
 # Might fail if a prune is already in progress by another runner.

.github/actions/upload-test-artifacts/action.yml

Lines changed: 7 additions & 2 deletions
@@ -49,6 +49,9 @@ runs:
 if [ -f 'usage_log.txt' ]; then
 zip "usage-log-${FILE_SUFFIX}.zip" 'usage_log.txt'
 fi
+if ls test/**/*.log 1> /dev/null 2>&1; then
+zip -r "usage-log-${FILE_SUFFIX}.zip" test -i '*.log'
+fi

 # Windows zip
 - name: Zip JSONs for upload
@@ -76,7 +79,7 @@ runs:
 FILE_SUFFIX: ${{ inputs.file-suffix }}
 run: |
 # -ir => recursive include all files in pattern
-7z a "usage-log-$Env:FILE_SUFFIX.zip" 'usage_log.txt'
+7z a "usage-log-$Env:FILE_SUFFIX.zip" 'usage_log.txt' -ir'!test\*.log'

 # S3 upload
 - name: Store Test Downloaded JSONs on S3
@@ -141,5 +144,7 @@ runs:
 name: usage-log-runattempt${{ github.run_attempt }}-${{ inputs.file-suffix }}.zip
 retention-days: 14
 if-no-files-found: ignore
-path: usage_log.txt
+path: |
+usage_log.txt
+test/**/*.log
 continue-on-error: true
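
Note on the `ls test/**/*.log` guard added above: if that step runs under bash with `globstar` disabled (bash's default), `**` behaves like a single `*`, so the guard only sees logs exactly one directory below `test`, while `zip -r ... -i '*.log'` recurses to any depth. A minimal sketch of a depth-independent check follows; `has_log_files` is a hypothetical helper, not part of this commit, and the temporary directory layout is made up for the demo.

```bash
#!/usr/bin/env bash
# Hypothetical helper (not in the commit): succeed if at least one *.log
# file exists at any depth under the directory given as $1.
has_log_files() {
  [ -n "$(find "$1" -type f -name '*.log' -print 2>/dev/null)" ]
}

# Demo on a throwaway tree with a log nested two levels below test/,
# which a non-globstar test/**/*.log pattern would not match.
demo_dir=$(mktemp -d)
mkdir -p "$demo_dir/test/a/b"
touch "$demo_dir/test/a/b/run.log"

if has_log_files "$demo_dir/test"; then
  echo "logs found"
fi

rm -rf "$demo_dir"
```

Using `find` avoids depending on shell glob options; enabling `shopt -s globstar` before the `ls` would be another way to get recursive matching in bash.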

.github/ci_commit_pins/vision.txt

Lines changed: 1 addition & 1 deletion
@@ -1 +1 @@
-32d254bbfcf14975f846765775584e61ef25a5bc
+7dc5e5bd60b55eb4e6ea5c1265d6dc7b17d2e917

.github/merge_rules.yaml

Lines changed: 1 addition & 0 deletions
@@ -366,6 +366,7 @@
 approved_by:
 - mruberry
 - lezcano
+- Skylion007
 mandatory_checks_name:
 - EasyCLA
 - Lint
Lines changed: 2 additions & 0 deletions
@@ -0,0 +1,2 @@
+typing_extensions
+jinja2

.github/scripts/README.md

Lines changed: 8 additions & 1 deletion
@@ -22,7 +22,7 @@ couple of utility templates used to discern common utilities that can be used am

 You will need `jinja2` in order to regenerate the workflow files which can be installed using:
 ```bash
-pip install -r .github/requirements.txt
+pip install -r .github/requirements/regenerate-requirements.txt
 ```

 Workflows can be generated / regenerated using the following command:
@@ -56,3 +56,10 @@ Exceptions to AWS for self hosted:
 New runner types can be added by committing changes to `.github/scale-config.yml`. Example: https://github.com/pytorch/pytorch/pull/70474

 > NOTE: New runner types can only be used once the changes to `.github/scale-config.yml` have made their way into the default branch
+
+### Testing [pytorch/builder](https://github.com/pytorch/builder) changes
+
+In order to test changes to the builder scripts:
+
+1. Specify your builder PR's branch and repo as `builder_repo` and `builder_branch` in [`.github/templates/common.yml.j2`](https://github.com/pytorch/pytorch/blob/32356aaee6a77e0ae424435a7e9da3d99e7a4ca5/.github/templates/common.yml.j2#LL10C26-L10C32). 2. Regenerate workflow files with `.github/regenerate.sh` (see above).
+3. Submit fake PR to PyTorch. If changing binaries build, add an appropriate label like `ciflow/binaries` to trigger the builds.
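
The first of the README steps above could be scripted roughly as follows. This is a sketch under assumptions: it assumes the template defines the two values with Jinja lines of the form `{%- set builder_repo = "..." -%}` and `{%- set builder_branch = "..." -%}`, which the diff does not show. `set_builder_ref`, `someuser/builder`, and `my-fix` are hypothetical names, and the demo edits a throwaway file rather than the real template.

```bash
#!/usr/bin/env bash
# Hypothetical helper (not in the commit): rewrite the quoted values of
# builder_repo and builder_branch in a Jinja template file.
# Assumes lines shaped like: {%- set builder_repo = "pytorch/builder" -%}
# The values must not contain '|' or '&', which are special to this sed call.
set_builder_ref() {
  local file=$1 repo=$2 branch=$3
  sed -e "s|\(set builder_repo = \)\"[^\"]*\"|\1\"${repo}\"|" \
      -e "s|\(set builder_branch = \)\"[^\"]*\"|\1\"${branch}\"|" \
      "$file" > "$file.tmp" && mv "$file.tmp" "$file"
}

# Demo on a temporary stand-in for the template (assumed line format).
tmpl=$(mktemp)
printf '%s\n' \
  '{%- set builder_repo = "pytorch/builder" -%}' \
  '{%- set builder_branch = "main" -%}' > "$tmpl"

set_builder_ref "$tmpl" "someuser/builder" "my-fix"
cat "$tmpl"
rm -f "$tmpl"

# The remaining steps from the README are run by hand in a real checkout:
#   .github/regenerate.sh            # regenerate the workflow files
#   then open a PR, adding a label such as ciflow/binaries if needed
```

Writing to a temporary file and moving it into place sidesteps the differences between GNU and BSD `sed -i`.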
