You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Summary:
Move CPU implementation of the `addcmul` operator to Aten ( #22797 )
### before
```python
In [11]: timeit x.addcmul(a, b)
1.31 ms ± 18.3 µs per loop (mean ± std. dev. of 7 runs, 1000 loops each)
```
### after
```python
In [9]: timeit x.addcmul(a, b)
588 µs ± 22.8 µs per loop (mean ± std. dev. of 7 runs, 1000 loops each)
```
Adding custom code for the case when `value == 1`, doesn't provide significant performance gain.
Pull Request resolved: #22874
Differential Revision: D16359348
Pulled By: VitalyFedyunin
fbshipit-source-id: 941ead835672fca78a1fcc762da052e64308b111
0 commit comments