Enable batched QR decomposition and add a `some` option
#20689
Conversation
@pytorchbot rebase this please
void magmaGeqrf<double>(
    magma_int_t m, magma_int_t n, double* dA, magma_int_t ldda,
    double* tau, double* dT, magma_int_t* info, bool is_v2) {
  if (!is_v2) {
I think we can safely remove the `!is_v2` specializations; we stopped supporting MAGMA v1 long ago.
The `is_v2` specialization is to differentiate between `geqrf2` and `geqrf`. I will add a comment about this to avoid confusion.
soumith left a comment:
Please ignore my comments. I reviewed commit-by-commit, expecting that my reviews for each commit would be preserved, but after reviewing all commits and submitting, only the first commit's comments were posted. All of those were addressed by you in later commits.
@pytorchbot rebase this please
@soumith the pending build / tests have completed successfully. This is the message on the details page:
facebook-github-bot left a comment:
@soumith is landing this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
Summary: This PR covers two important points with respect to the QR decomposition:
- batching of input matrices (#7500)
- adding `some` as an option in `torch.qr` akin to NumPy's `mode` option (#10538)

Changelog:
- Enable batching for inputs to `torch.qr`
- Move QR decomposition implementation to ATen (CPU and CUDA)
- Remove existing implementations in TH/THC
- Add a `some` option to `torch.qr` that will enable users to switch between complete and reduced decomposition
- Modify doc strings

Pull Request resolved: pytorch/pytorch#20689
Differential Revision: D15529230
Pulled By: soumith
fbshipit-source-id: 16af82b1d2db8a3a758fa8a5f798d83f5f950efb
Hi, how can I use this batched implementation of QR now? Passing a batched input gives me an error: `RuntimeError: invalid argument 1: A should be 2 dimensional at /pytorch/aten/src/TH/generic/THTensorLapack.cpp:680`
Please install from source, or use a nightly build dated after May 30. This is not available in any of the current releases.
Hi, I'm curious about the internal implementation of this batched QR. In a high-level sense, does it use a for-loop to iterate over all k tensors of size m × n and compute QR sequentially for each of them, or is it a whole new batched QR implementation that exploits the strengths of the GPU (tensor operations only)? It's terribly slow for millions of 4x4 tensors for me, so I guess it's the former?
Yes, it is the former. The GPU implementation depends on a package called MAGMA. They have batched support for some of their operations, and in fact they have a batched version of `geqrf`, which is used to obtain the R matrix and the Householder reflectors that generate Q. Unfortunately it's not LAPACK-compliant (which makes extracting R difficult), which is why I had to use the former method.
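The for-loop behavior described above can be sketched as follows. This is an illustrative stand-in using NumPy's `np.linalg.qr` per matrix, not the actual PyTorch code (which calls LAPACK/MAGMA `geqrf` under the hood); it only shows the sequential-loop structure:

```python
import numpy as np

# Sketch of the per-matrix loop described above, using NumPy as a
# stand-in for the LAPACK/MAGMA calls the real implementation makes.
def batched_qr(a):
    """QR-factorize each matrix in a stack of shape (k, m, n)."""
    qs, rs = [], []
    for mat in a:                       # sequential loop over the batch
        q, r = np.linalg.qr(mat, mode='reduced')
        qs.append(q)
        rs.append(r)
    return np.stack(qs), np.stack(rs)

batch = np.random.default_rng(0).standard_normal((8, 4, 3))
Q, R = batched_qr(batch)

# Each slice reconstructs its input: Q[i] @ R[i] == batch[i].
assert Q.shape == (8, 4, 3) and R.shape == (8, 3, 3)
assert np.allclose(Q @ R, batch)
```

Because the loop runs one factorization at a time, runtime grows linearly with the batch size, which matches the slowdown reported above for millions of small matrices.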
@vishwakftw is the statement "batched qr/eigh basically uses a for-loop over all matrices in the batch" still true? Do you know how hard it would be to have a truly parallelized implementation? Specifically, in the application I have in mind we only need the Q matrices.
This PR covers two important points with respect to the QR decomposition:
- batching of input matrices (batched QR decomposition #7500)
- adding `some` as an option in `torch.qr` akin to NumPy's `mode` option (QR option to return full matrices #10538)

Changelog:
- Enable batching for inputs to `torch.qr`
- Move QR decomposition implementation to ATen (CPU and CUDA)
- Remove existing implementations in TH/THC
- Add a `some` option to `torch.qr` that will enable users to switch between complete and reduced decomposition
- Modify doc strings

Test plan:
Closes #10538
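Since the PR description says `some` is modeled on NumPy's `mode` option, the shape difference between the two settings can be illustrated with NumPy directly (`some=True` corresponds to the reduced factorization, `some=False` to the complete one; the `torch.qr` correspondence noted in the comments is an assumption drawn from that description):

```python
import numpy as np

# Reduced vs. complete QR of a tall m x n matrix, using NumPy's `mode`
# option to mirror the semantics of torch.qr's `some` flag.
a = np.random.default_rng(1).standard_normal((5, 3))  # m=5, n=3

q_red, r_red = np.linalg.qr(a, mode='reduced')     # ~ torch.qr(a, some=True)
q_full, r_full = np.linalg.qr(a, mode='complete')  # ~ torch.qr(a, some=False)

# Reduced: Q is m x n, R is n x n.  Complete: Q is m x m, R is m x n.
assert q_red.shape == (5, 3) and r_red.shape == (3, 3)
assert q_full.shape == (5, 5) and r_full.shape == (5, 3)

# Both factorizations reconstruct the input.
assert np.allclose(q_red @ r_red, a)
assert np.allclose(q_full @ r_full, a)
```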