Skip to content

Conversation

@mikekgfb
Copy link
Contributor

@mikekgfb mikekgfb commented Dec 8, 2022

Summary: Introduce causal mask

This PR introduces a causal mask option is_causal, since current custom kernels do not support arbitrary masks.
(Also, building these masks is expensive, and avoiding to build them is a significant opportunity since causal mask is extremely common for training Transformer models.)

Test Plan: sandcastle & github ci/cd

Differential Revision: D41723137

@pytorch-bot
Copy link

pytorch-bot bot commented Dec 8, 2022

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/90508

Note: Links to docs will display an error until the docs builds have been completed.

❌ 2 Failures

As of commit 5560eb4:

NEW FAILURES - The following jobs have failed:

BROKEN TRUNK - The following jobs failed but were present on the merge base 01e7f46:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D41723137

1 similar comment
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D41723137

@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D41723137

@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D41723137

@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D41723137

@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D41723137

1 similar comment
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D41723137

@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D41723137

1 similar comment
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D41723137

@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D41723137

@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D41723137

Copy link
Collaborator

@albanD albanD left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Change sounds ok. Docstrings need to be updated.

@mikekgfb mikekgfb requested a review from albanD December 16, 2022 10:18
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D41723137

@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D41723137

Copy link
Collaborator

@albanD albanD left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The change itself sounds ok.
You most likely want to rebase above master to avoid trunk failures.

Summary:
Pull Request resolved: pytorch#90508

Introduce causal mask

Test Plan: sandcastle & github ci/cd

Reviewed By: albanD, mleshen

Differential Revision: D41723137

fbshipit-source-id: 5cb222d672e8f174a1447a91414068ba87490dc3
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D41723137

@mikekgfb
Copy link
Contributor Author

@pytorchbot -f merge

@pytorch-bot
Copy link

pytorch-bot bot commented Dec 16, 2022

❌ 🤖 pytorchbot command failed:

@pytorchbot: error: unrecognized arguments: -f

usage: @pytorchbot [-h] {merge,revert,rebase,label,drci} ...

Try @pytorchbot --help for more info.

@mikekgfb
Copy link
Contributor Author

@pytorchbot merge -f

@pytorch-bot
Copy link

pytorch-bot bot commented Dec 16, 2022

❌ 🤖 pytorchbot command failed:

@pytorchbot merge: error: argument -f/--force: expected one argument

usage: @pytorchbot merge [-g | -f MESSAGE | -l] [-r [{viable/strict,master}]]

Try @pytorchbot --help for more info.

@mikekgfb
Copy link
Contributor Author

mikekgfb commented Dec 16, 2022

@pytorchbot merge -f "lint runner was broken. Using -f as suggested"

@pytorchmergebot
Copy link
Collaborator

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status
here

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants