torch.hub security improvement: add new trust_repo parameter #72060

vmoens · 2022-01-31T14:45:32Z

As pointed by #71205, torch.hub.load assumes that the user trusts the repo from where the code is gathered and exececuted. We propose a solution to make sure that the user is aware of the security threat that this can represent.

Solution: Adds a trust_repo parameter to the load, list and help functions in torch.hub.
For now, the default trust_repo=None warns that, in the future, the user will need to authorize explicitly every repo before downloading it.
Once the repo has been trusted (via trust_repo=True or via a command prompt input) it will be added to the list of trusted repositories.

pytorch-bot · 2022-01-31T14:45:37Z

CI Flow Status

⚛️ CI Flow

Ruleset - Version: v1
Ruleset - File: https://github.com/pytorch/pytorch/blob/13436bf97aa5572ecb26d188ec65c5797459ba39/.github/generated-ciflow-ruleset.json
PR ciflow labels: ciflow/default
Add ciflow labels to this PR to trigger more builds:

Workflows	Labels (bold enabled)	Status
Triggered Workflows
linux-binary-conda	`ciflow/binaries`, `ciflow/binaries_conda`, `ciflow/default`	✅ triggered
linux-binary-libtorch-cxx11-abi	`ciflow/binaries`, `ciflow/binaries_libtorch`, `ciflow/default`	✅ triggered
linux-binary-libtorch-pre-cxx11	`ciflow/binaries`, `ciflow/binaries_libtorch`, `ciflow/default`	✅ triggered
linux-binary-manywheel	`ciflow/binaries`, `ciflow/binaries_wheel`, `ciflow/default`	✅ triggered
linux-bionic-py3.7-clang9	`ciflow/all`, `ciflow/cpu`, `ciflow/default`, `ciflow/linux`, `ciflow/noarch`, `ciflow/trunk`, `ciflow/xla`	✅ triggered
linux-docs	`ciflow/all`, `ciflow/cpu`, `ciflow/default`, `ciflow/docs`, `ciflow/linux`, `ciflow/trunk`	✅ triggered
linux-vulkan-bionic-py3.7-clang9	`ciflow/all`, `ciflow/cpu`, `ciflow/default`, `ciflow/linux`, `ciflow/trunk`, `ciflow/vulkan`	✅ triggered
linux-xenial-cuda11.3-py3.7-gcc7	`ciflow/all`, `ciflow/cuda`, `ciflow/default`, `ciflow/linux`, `ciflow/trunk`	✅ triggered
linux-xenial-cuda11.3-py3.7-gcc7-bazel-test	`ciflow/all`, `ciflow/bazel`, `ciflow/cpu`, `ciflow/default`, `ciflow/linux`, `ciflow/trunk`	✅ triggered
linux-xenial-py3-clang5-mobile-build	`ciflow/all`, `ciflow/default`, `ciflow/linux`, `ciflow/mobile`, `ciflow/trunk`	✅ triggered
linux-xenial-py3-clang5-mobile-custom-build-static	`ciflow/all`, `ciflow/default`, `ciflow/linux`, `ciflow/mobile`, `ciflow/trunk`	✅ triggered
linux-xenial-py3.7-clang7-asan	`ciflow/all`, `ciflow/cpu`, `ciflow/default`, `ciflow/linux`, `ciflow/sanitizers`, `ciflow/trunk`	✅ triggered
linux-xenial-py3.7-clang7-onnx	`ciflow/all`, `ciflow/cpu`, `ciflow/default`, `ciflow/linux`, `ciflow/onnx`, `ciflow/trunk`	✅ triggered
linux-xenial-py3.7-gcc5.4	`ciflow/all`, `ciflow/cpu`, `ciflow/default`, `ciflow/linux`, `ciflow/trunk`	✅ triggered
linux-xenial-py3.7-gcc7	`ciflow/all`, `ciflow/cpu`, `ciflow/default`, `ciflow/linux`, `ciflow/trunk`	✅ triggered
linux-xenial-py3.7-gcc7-no-ops	`ciflow/all`, `ciflow/cpu`, `ciflow/default`, `ciflow/linux`, `ciflow/trunk`	✅ triggered
pytorch-linux-xenial-py3-clang5-android-ndk-r19c-gradle-custom-build-single	`ciflow/all`, `ciflow/android`, `ciflow/cpu`, `ciflow/default`, `ciflow/linux`, `ciflow/trunk`	✅ triggered
pytorch-linux-xenial-py3-clang5-android-ndk-r19c-gradle-custom-build-single-full-jit	`ciflow/all`, `ciflow/android`, `ciflow/cpu`, `ciflow/default`, `ciflow/linux`, `ciflow/trunk`	✅ triggered
win-vs2019-cpu-py3	`ciflow/all`, `ciflow/cpu`, `ciflow/default`, `ciflow/trunk`, `ciflow/win`	✅ triggered
win-vs2019-cuda11.3-py3	`ciflow/all`, `ciflow/cuda`, `ciflow/default`, `ciflow/trunk`, `ciflow/win`	✅ triggered
windows-binary-libtorch-cxx11-abi	`ciflow/binaries`, `ciflow/binaries_libtorch`, `ciflow/default`	✅ triggered
windows-binary-libtorch-pre-cxx11	`ciflow/binaries`, `ciflow/binaries_libtorch`, `ciflow/default`	✅ triggered
windows-binary-wheel	`ciflow/binaries`, `ciflow/binaries_wheel`, `ciflow/default`	✅ triggered
Skipped Workflows
caffe2-linux-xenial-py3.7-gcc5.4	`ciflow/all`, `ciflow/cpu`, `ciflow/linux`, `ciflow/trunk`	🚫 skipped
docker-builds	`ciflow/all`, `ciflow/trunk`	🚫 skipped
ios-12-5-1-arm64	`ciflow/all`, `ciflow/ios`, `ciflow/macos`, `ciflow/trunk`	🚫 skipped
ios-12-5-1-arm64-coreml	`ciflow/all`, `ciflow/ios`, `ciflow/macos`, `ciflow/trunk`	🚫 skipped
ios-12-5-1-arm64-custom-ops	`ciflow/all`, `ciflow/ios`, `ciflow/macos`, `ciflow/trunk`	🚫 skipped
ios-12-5-1-arm64-full-jit	`ciflow/all`, `ciflow/ios`, `ciflow/macos`, `ciflow/trunk`	🚫 skipped
ios-12-5-1-arm64-metal	`ciflow/all`, `ciflow/ios`, `ciflow/macos`, `ciflow/trunk`	🚫 skipped
ios-12-5-1-x86-64	`ciflow/all`, `ciflow/ios`, `ciflow/macos`, `ciflow/trunk`	🚫 skipped
ios-12-5-1-x86-64-coreml	`ciflow/all`, `ciflow/ios`, `ciflow/macos`, `ciflow/trunk`	🚫 skipped
ios-12-5-1-x86-64-full-jit	`ciflow/all`, `ciflow/ios`, `ciflow/macos`, `ciflow/trunk`	🚫 skipped
libtorch-linux-xenial-cuda10.2-py3.7-gcc7	`ciflow/all`, `ciflow/cuda`, `ciflow/libtorch`, `ciflow/linux`, `ciflow/trunk`	🚫 skipped
libtorch-linux-xenial-cuda11.3-py3.7-gcc7	`ciflow/all`, `ciflow/cuda`, `ciflow/libtorch`, `ciflow/linux`, `ciflow/trunk`	🚫 skipped
linux-bionic-cuda10.2-py3.9-gcc7	`ciflow/all`, `ciflow/cuda`, `ciflow/linux`, `ciflow/slow`, `ciflow/trunk`	🚫 skipped
linux-bionic-rocm4.5-py3.7	`ciflow/linux`, `ciflow/rocm`	🚫 skipped
linux-docs-push	`ciflow/all`, `ciflow/cpu`, `ciflow/linux`, `ciflow/scheduled`	🚫 skipped
linux-xenial-cuda11.3-py3.7-gcc7-no-ops	`ciflow/all`, `ciflow/cuda`, `ciflow/linux`, `ciflow/trunk`	🚫 skipped
macos-10-15-py3-arm64	`ciflow/all`, `ciflow/macos`, `ciflow/trunk`	🚫 skipped
macos-10-15-py3-lite-interpreter-x86-64	`ciflow/all`, `ciflow/macos`, `ciflow/trunk`	🚫 skipped
macos-11-py3-x86-64	`ciflow/all`, `ciflow/macos`, `ciflow/trunk`	🚫 skipped
parallelnative-linux-xenial-py3.7-gcc5.4	`ciflow/all`, `ciflow/cpu`, `ciflow/linux`, `ciflow/trunk`	🚫 skipped
periodic-libtorch-linux-bionic-cuda11.5-py3.7-gcc7	`ciflow/all`, `ciflow/cuda`, `ciflow/libtorch`, `ciflow/linux`, `ciflow/scheduled`	🚫 skipped
periodic-libtorch-linux-xenial-cuda11.1-py3.7-gcc7	`ciflow/all`, `ciflow/cuda`, `ciflow/libtorch`, `ciflow/linux`, `ciflow/scheduled`	🚫 skipped
periodic-linux-bionic-cuda11.5-py3.7-gcc7	`ciflow/all`, `ciflow/cuda`, `ciflow/linux`, `ciflow/scheduled`	🚫 skipped
periodic-linux-xenial-cuda10.2-py3-gcc7-slow-gradcheck	`ciflow/all`, `ciflow/cuda`, `ciflow/linux`, `ciflow/scheduled`, `ciflow/slow`, `ciflow/slow-gradcheck`	🚫 skipped
periodic-linux-xenial-cuda11.1-py3.7-gcc7-debug	`ciflow/all`, `ciflow/cuda`, `ciflow/linux`, `ciflow/scheduled`	🚫 skipped
periodic-win-vs2019-cuda11.1-py3	`ciflow/all`, `ciflow/cuda`, `ciflow/scheduled`, `ciflow/win`	🚫 skipped
periodic-win-vs2019-cuda11.5-py3	`ciflow/all`, `ciflow/cuda`, `ciflow/scheduled`, `ciflow/win`	🚫 skipped
pytorch-linux-xenial-py3-clang5-android-ndk-r19c-build	`ciflow/all`, `ciflow/android`, `ciflow/cpu`, `ciflow/linux`, `ciflow/trunk`	🚫 skipped

facebook-github-bot · 2022-01-31T14:45:38Z

🔗 Helpful links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/72060
Need help or want to give feedback on the CI? Visit our office hours

💊 CI failures summary and remediations

As of commit c03c9dd (more details on the Dr. CI page):

💚 💚 Looks good so far! There are no failures yet. 💚 💚

This comment was automatically generated by Dr. CI (expand for details).

Please report bugs/suggestions to the (internal) Dr. CI Users group.

Click here to manually regenerate this comment.

NicolasHug

Thanks @vmoens , I took a brief look but this looks good. I have a few comments below, LMK what you think. We'll also need to add a few tests ideally (although I'm not sure yet how to handle the prompt)

torch/hub.py

NicolasHug · 2022-01-31T15:00:27Z

torch/hub.py

+        if is_trusted:
+            print("The repository is already trusted.")
+            return


I feel like we should just return without a message here. IIUC this should only ever happen if the user used trust_repo=False while the repo is already trusted?

yes that's the case.
I thought that if one asks for trust_repo=False then answers yes to the prompt, then a notification that this was already trusted could be useful. But happy to remove this if there's no use.

torch/hub.py

Co-authored-by: Nicolas Hug <contact@nicolas-hug.com>

_PREDEFINED_TRUSTED list to tuple

vmoens · 2022-01-31T16:52:11Z

(although I'm not sure yet how to handle the prompt)

I have written 4 tests:

trust_repo = False / response = ''
trust_repo = False / response = 'y'
trust_repo = "check" / response = 'y'
trust_repo = None

torch/hub.py

_TRUSTED_REPO_PREFIXES

NicolasHug

Thanks a lot @vmoens ,

I added a few tests and also made sure to check how many times input() was called, to make sure we prompt the user only when we need to. I also removed the use of the legacy_file and instead just check the content of the cache directly, as discussed offline. I'll approve the PR, if you're OK with my own change let's merge this next week and road-test it :) !

vmoens · 2022-04-01T12:57:32Z

Thanks a lot @vmoens ,

I added a few tests and also made sure to check how many times input() was called, to make sure we prompt the user only when we need to. I also removed the use of the legacy_file and instead just check the content of the cache directly, as discussed offline. I'll approve the PR, if you're OK with my own change let's merge this next week and road-test it :) !

Awesome sounds good to me! Thanks for the many improvements

NicolasHug

@pytorchmergebot please merge this

NicolasHug · 2022-04-05T09:13:37Z

@pytorchmergebot please merge this

NicolasHug · 2022-04-05T09:28:00Z

@pytorchmergebot merge this please .... I'm begging ?

github-actions · 2022-04-05T09:30:09Z

Hey @vmoens.
You've committed this PR, but it does not have both a 'release notes: ...' and 'topics: ...' label. Please add one of each to the PR. The 'release notes: ...' label should represent the part of PyTorch that this PR changes (fx, autograd, distributed, etc) and the 'topics: ...' label should represent the kind of PR it is (not user facing, new feature, bug fix, perf improvement, etc). The list of valid labels can be found here for the 'release notes: ...' and here for the 'topics: ...'.
For changes that are 'topic: not user facing' there is no need for a release notes label.

Summary: As pointed by #71205, `torch.hub.load` assumes that the user trusts the repo from where the code is gathered and exececuted. We propose a solution to make sure that the user is aware of the security threat that this can represent. **Solution**: Adds a `trust_repo` parameter to the `load`, `list` and `help` functions in torch.hub. For now, the default `trust_repo=None` warns that, in the future, the user will need to authorize explicitly every repo before downloading it. Once the repo has been trusted (via `trust_repo=True` or via a command prompt input) it will be added to the list of trusted repositories. Pull Request resolved: #72060 Approved by: https://github.com/NicolasHug Test Plan: contbuild & OSS CI, see https://hud.pytorch.org/commit/pytorch/pytorch/41992791e82566eb977d0a2e062cca0c7fd9d56d Reviewed By: b0noI Differential Revision: D35404306 Pulled By: vmoens fbshipit-source-id: 30eb34135a4961abdc8fba70e3cb085dc5c073d0

vmoens added 2 commits January 31, 2022 14:32

add trust_repo option to load, list and help

b48ddd0

formatting

13436bf

pytorch-bot bot added the ciflow/default label Jan 31, 2022

facebook-github-bot added the cla signed label Jan 31, 2022

vmoens requested a review from NicolasHug January 31, 2022 14:46

vmoens added the module: hub label Jan 31, 2022

vmoens marked this pull request as ready for review January 31, 2022 14:52

vmoens added 2 commits January 31, 2022 14:54

formatting

b377c02

set tests to trust_repo=True

5476c77

NicolasHug reviewed Jan 31, 2022

View reviewed changes

vmoens and others added 9 commits January 31, 2022 15:31

Update torch/hub.py

a0c0b40

Co-authored-by: Nicolas Hug <contact@nicolas-hug.com>

Update torch/hub.py

95e3947

Co-authored-by: Nicolas Hug <contact@nicolas-hug.com>

Update hub.py

269fec8

Update hub.py

ac8e906

Update hub.py

12a4af1

startswith => __eq__

919c18f

Update hub.py

5c81233

_PREDEFINED_TRUSTED list to tuple

some tests

7242fb4

Merge remote-tracking branch 'origin/hub_security' into hub_security

8229a5e

vmoens added 3 commits January 31, 2022 16:59

Update hub.py

2bd2f5e

Update test_utils.py

2a1cc1e

Update hub.py

2cd3082

NicolasHug reviewed Jan 31, 2022

View reviewed changes

torch/hub.py Outdated Show resolved Hide resolved

vmoens added 4 commits February 1, 2022 09:27

Update hub.py

3f19af6

Update hub.py

cc9c3ae

_TRUSTED_REPO_PREFIXES

e82dcc7

_TRUSTED_REPO_PREFIXES

Update hub.py

8a5e88b

vmoens and others added 7 commits March 4, 2022 13:22

make mypy happy

7e3ea9c

testing if I can push, please ignore

ba4531a

Merge branch 'master' of github.com:pytorch/pytorch into hub_security

225b889

More robust with better setup/teardown logic + remove legacy file

e690406

remove pathlib

811963b

Add test for builtin trusted owners

21b9386

hopefully fix test?

f8aa000

suo removed the ciflow/default label Mar 22, 2022

NicolasHug added 7 commits March 23, 2022 09:25

Merge branch 'master' of github.com:pytorch/pytorch into hub_security

b10e5b6

cos

78a247c

remove force_reload everywhere

69aacdc

Merge branch 'master' of github.com:pytorch/pytorch into hub_security

7ce3460

Revert changes to fbgemm

35c9506

Merge branch 'master' of github.com:pytorch/pytorch into hub_security

414e2a0

More robust tests: used mocked input to check how many times it's called

c03c9dd

NicolasHug approved these changes Apr 1, 2022

View reviewed changes

NicolasHug changed the title ~~torch.hub security improvement~~ torch.hub security improvement: add new trust_repo parameter Apr 1, 2022

NicolasHug reviewed Apr 5, 2022

View reviewed changes

pytorchmergebot closed this in 4199279 Apr 5, 2022

NicolasHug added topic: security release notes: hub labels Apr 5, 2022

NicolasHug mentioned this pull request Apr 5, 2022

Fix all .md files to set "trust_repo=True" pytorch/hub#276

Open

NicolasHug mentioned this pull request Apr 14, 2022

use context manager for path extension in torch.hub #75786

Closed

NicolasHug mentioned this pull request May 11, 2022

Update calls to torch.hub.* to use trust_repo=True. pytorch/hub#281

Merged

github-actions bot deleted the hub_security branch February 15, 2024 01:56

torch.hub security improvement: add new trust_repo parameter #72060

torch.hub security improvement: add new trust_repo parameter #72060

Uh oh!

Conversation

vmoens commented Jan 31, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Jan 31, 2022

⚛️ CI Flow

Uh oh!

facebook-github-bot commented Jan 31, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful links

💊 CI failures summary and remediations

Uh oh!

NicolasHug left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

NicolasHug Jan 31, 2022

Choose a reason for hiding this comment

Uh oh!

vmoens Jan 31, 2022

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

vmoens commented Jan 31, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

NicolasHug left a comment

Choose a reason for hiding this comment

Uh oh!

vmoens commented Apr 1, 2022

Uh oh!

NicolasHug left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

NicolasHug commented Apr 5, 2022

Uh oh!

NicolasHug commented Apr 5, 2022

Uh oh!

github-actions bot commented Apr 5, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

vmoens commented Jan 31, 2022 •

edited

Loading

facebook-github-bot commented Jan 31, 2022 •

edited

Loading

vmoens commented Jan 31, 2022 •

edited

Loading

NicolasHug left a comment •

edited

Loading