ngimel (Collaborator) commented Mar 12, 2018

Currently, at most 64 blocks of 256 threads are launched, which gives very low occupancy and poor performance on Pascal and Maxwell cards. This PR raises the block limit to 200, the current maximum that MTGP can handle without generating additional starting parameter sets. Longer term, core THC should probably move to Philox (distributions already use it) and aim for 100% occupancy.
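To make the launch-size change concrete, here is a rough host-side sketch of how a block count could be clamped to the MTGP limit. The names `kThreadsPerBlock`, `kMaxMtgpBlocks`, and `num_blocks` are hypothetical and are not taken from the THC sources; only the values 256, 64, and 200 come from the description above.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>

// Hypothetical constants for illustration only (not the THC identifiers).
constexpr std::int64_t kThreadsPerBlock = 256;  // threads per block, as stated in the PR
constexpr std::int64_t kMaxMtgpBlocks = 200;    // block cap after this PR (was 64)

// Number of blocks to launch for `n` elements: enough blocks to cover `n`
// at kThreadsPerBlock threads each, clamped to [1, max_blocks].
std::int64_t num_blocks(std::int64_t n, std::int64_t max_blocks = kMaxMtgpBlocks) {
    assert(n >= 0 && max_blocks > 0);
    const std::int64_t needed = (n + kThreadsPerBlock - 1) / kThreadsPerBlock;
    return std::max<std::int64_t>(1, std::min(needed, max_blocks));
}
```

Under these assumptions, a tensor of 1,048,576 elements would launch 200 blocks rather than 64, so roughly three times as many threads are resident while the kernel loops over the remaining elements.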

ezyang (Contributor) commented Mar 12, 2018

OS X failure is spurious

soumith merged commit c7611f7 into pytorch:master Mar 12, 2018
ngimel deleted the rng branch April 25, 2018 20:10