Conversation

@mttk (Contributor) commented Apr 24, 2018

As per #6021

  • Removes the nested for loop in favor of list indexing.
  • Obtains a ~10-20x speedup on average.

Following the discussion in the issue, I ran a shootout between the currently implemented initialization, the version proposed by @stefanonardo (Opt1) and the version proposed by @fmassa (Opt2).

I ran sparse init on tensors of sizes [(500, 500), (100, 100), (50, 50), (10, 10)] and sparsities of [1, 0.1, 0.01]. Each initialization was re-run 10 times. What I learned:

  • The original version is consistently >= 10x slower.
  • Opt1 is faster for larger numbers of zero elements (presumably because topk has a complexity of roughly k*n, where k is the number of zeros).
  • Opt2 is faster for smaller numbers of zero elements.

However, in the largest case I tested (500x500 tensor, sparsity=1), Opt2 is about the same speed as the original version, and in the other edge case (500x500 tensor, sparsity=0.01) it is ~27% slower, with a higher deviation in runtime.

Implementation credit goes to @stefanonardo

Full (badly formatted) results are here for the next month: https://pastebin.com/fB4mHqM9
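
For reference, the per-column indexing approach boils down to something like the sketch below (not the exact merged code; names such as num_zeros are only for illustration):

    import math
    import torch

    def sparse(tensor, sparsity, std=0.01):
        # Sparse init: normally distributed entries, with a fixed fraction of
        # randomly chosen entries in each column set to zero.
        rows, cols = tensor.shape
        num_zeros = int(math.ceil(sparsity * rows))

        tensor.normal_(0, std)
        for col_idx in range(cols):
            # Zero out num_zeros randomly chosen rows of this column in one
            # indexing operation instead of assigning element by element.
            row_indices = torch.randperm(rows)
            zero_indices = row_indices[:num_zeros]
            tensor[zero_indices, col_idx] = 0
        return tensor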

@mttk mttk force-pushed the speedup_sparse_init branch from 9890049 to 7e5ebb6 on April 24, 2018 14:32
@fmassa (Member) left a comment


This looks good to me, thanks!

I have a minor comment, but apart from that this is good to merge!

torch/nn/init.py Outdated
- tensor[row_idx, col_idx] = 0
+ t = tensor[:, col_idx]
+ t[zero_indices] = 0


  for col_idx in range(cols):
-     row_indices = list(range(rows))
-     random.shuffle(row_indices)
+     row_indices = torch.randperm(rows)


@fmassa (Member) commented Apr 29, 2018

It looks like there are some tests failing:

17:41:49 ======================================================================
17:41:49 FAIL: test_sparse_default_std (test_nn.TestNNInit)
17:41:49 ----------------------------------------------------------------------
17:41:49 Traceback (most recent call last):
17:41:49   File "test_nn.py", line 5124, in test_sparse_default_std
17:41:49     assert column[column == 0].nelement() >= math.ceil(sparsity * cols)
17:41:49 AssertionError
17:41:49 
17:41:49 ----------------------------------------------------------------------

By looking at the failing test, I think there is a bug in the sparsity check, and it should be math.ceil(sparsity * rows) instead, as in the sparse implementation.

Could you verify that this is indeed the case, and fix the test?
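
If that is indeed the case, the corrected check would look roughly like this (just a sketch, reusing the names already in test_sparse_default_std):

    # each column should contain at least ceil(sparsity * rows) zeros,
    # since the zeros are counted over the rows of a single column
    for col_idx in range(input_tensor.size(1)):
        column = input_tensor[:, col_idx]
        assert column[column == 0].nelement() >= math.ceil(sparsity * rows)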

@fmassa fmassa added the awaiting response (this tag is deprecated) and pytorch labels Apr 29, 2018
@ssnl (Collaborator) commented May 1, 2018

@mttk Thanks for your PR. Could you kindly fix the test please?

@mttk (Contributor, Author) commented May 1, 2018

@ssnl @fmassa sorry -- had an extended weekend so I used it to AFK a bit. Fixed as Francisco suggested, waiting for checks.

Ok -- the int() is necessary. I'm not sure why the original test fails (it reports that too few elements are zero). It's also weird that the tests passed previously.
I'll install py2.7 PyTorch, run the tests locally, and finish this tomorrow.
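
My best guess at why the int() matters: on Python 2, math.ceil returns a float, which can't be used to slice the randperm result. Roughly:

    import math
    import torch

    num_zeros = math.ceil(0.1 * 500)             # 50.0 (a float) on Python 2, 50 (an int) on Python 3
    row_indices = torch.randperm(500)
    zero_indices = row_indices[:int(num_zeros)]  # int() keeps the slice index an integer on both versions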

@mttk mttk force-pushed the speedup_sparse_init branch from ced6c05 to 5cb4de7 on May 3, 2018 10:31
test/test_nn.py Outdated
  for col_idx in range(input_tensor.size(1)):
      column = input_tensor[:, col_idx]
-     assert column[column == 0].nelement() >= math.ceil(sparsity * cols)
+     assert column[column == 0].nelement() >= math.ceil(sparsity * cols), "{} : {}, {}".format(column[column == 0].nelement(), sparsity, cols)


@mttk (Contributor, Author) commented May 3, 2018

@ssnl @fmassa everything works now, thanks Francisco.

@fmassa fmassa merged commit c96f262 into pytorch:master May 3, 2018
@fmassa (Member) commented May 3, 2018

Thanks a lot Martin!

Jorghi12 pushed a commit to wsttiger/pytorch that referenced this pull request May 10, 2018
* Sparse initialization speedup

* +empty line

* simplify indexing

* Can't reproduce locally...

* Can't reproduce locally...+

* Can't reproduce locally...+

* Fix test, cleanup
weiyangfb pushed a commit to weiyangfb/pytorch that referenced this pull request Jun 11, 2018