-
Notifications
You must be signed in to change notification settings - Fork 26.3k
Introduce causal mask #90508
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Introduce causal mask #90508
Conversation
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/90508
Note: Links to docs will display an error until the docs builds have been completed. ❌ 2 FailuresAs of commit 5560eb4: This comment was automatically generated by Dr. CI and updates every 15 minutes. |
|
This pull request was exported from Phabricator. Differential Revision: D41723137 |
1 similar comment
|
This pull request was exported from Phabricator. Differential Revision: D41723137 |
dc8ff81 to
fd44c03
Compare
fd44c03 to
035ce36
Compare
|
This pull request was exported from Phabricator. Differential Revision: D41723137 |
035ce36 to
5fa6d34
Compare
|
This pull request was exported from Phabricator. Differential Revision: D41723137 |
5fa6d34 to
b5e948f
Compare
|
This pull request was exported from Phabricator. Differential Revision: D41723137 |
b5e948f to
9f5c9b2
Compare
|
This pull request was exported from Phabricator. Differential Revision: D41723137 |
1 similar comment
|
This pull request was exported from Phabricator. Differential Revision: D41723137 |
9f5c9b2 to
fb42950
Compare
fb42950 to
552160e
Compare
|
This pull request was exported from Phabricator. Differential Revision: D41723137 |
1 similar comment
|
This pull request was exported from Phabricator. Differential Revision: D41723137 |
552160e to
9c85b9b
Compare
4aa2325 to
c29f729
Compare
|
This pull request was exported from Phabricator. Differential Revision: D41723137 |
c29f729 to
d529803
Compare
|
This pull request was exported from Phabricator. Differential Revision: D41723137 |
albanD
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Change sounds ok. Docstrings need to be updated.
|
This pull request was exported from Phabricator. Differential Revision: D41723137 |
d529803 to
d7a17f1
Compare
|
This pull request was exported from Phabricator. Differential Revision: D41723137 |
d7a17f1 to
abd8ef9
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The change itself sounds ok.
You most likely want to rebase above master to avoid trunk failures.
Summary: Pull Request resolved: pytorch#90508 Introduce causal mask Test Plan: sandcastle & github ci/cd Reviewed By: albanD, mleshen Differential Revision: D41723137 fbshipit-source-id: 5cb222d672e8f174a1447a91414068ba87490dc3
|
This pull request was exported from Phabricator. Differential Revision: D41723137 |
abd8ef9 to
5560eb4
Compare
|
@pytorchbot -f merge |
|
❌ 🤖 pytorchbot command failed: Try |
|
@pytorchbot merge -f |
|
❌ 🤖 pytorchbot command failed: Try |
|
@pytorchbot merge -f "lint runner was broken. Using -f as suggested" |
Merge startedYour change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team |
Summary: Introduce causal mask
This PR introduces a causal mask option is_causal, since current custom kernels do not support arbitrary masks.
(Also, building these masks is expensive, and avoiding to build them is a significant opportunity since causal mask is extremely common for training Transformer models.)
Test Plan: sandcastle & github ci/cd
Differential Revision: D41723137