Commit c74af6a
committed
Update on "[Quant] Add fused linear-leaky_relu op for onednn backend"
**Summary**
Post op fusion can reduce data movement overhead and improve inference performance. This PR adds fused `linear-leaky_relu` op for `onednn` backend, which will be used for int8 inference with `onednn` backend. Cannot call this op with other quantization backends otherwise an error is thrown.
**Test Plan**
python test_quantization.py TestQuantizedLinear
cc jerryzh168 jianyuh raghuramank100 jamesr66a vkuzo jgong5 leslie-fang-intel VitalyFedyunin mingfeima XiaobingSuper sanchitintel ashokei jingxu10
[ghstack-poisoned]File tree
326 files changed
+9713
-4580
lines changed- .circleci/scripts
- .github
- ci_commit_pins
- scripts
- workflows
- .jenkins/pytorch
- aten/src/ATen
- core/dispatch
- cpu/vec/vec256/vsx
- cuda
- functorch
- native
- cpu
- cuda
- cudnn
- mps/operations
- quantized/cpu
- sparse
- test
- benchmarks/dynamo
- c10
- core
- test/util
- util
- caffe2
- perfkernels
- video
- docs/source
- community
- dynamo
- functorch
- _src
- aot_autograd
- eager_transforms
- make_functional
- vmap
- benchmarks
- compile
- experimental
- test
- ao/sparsity
- cpp_extensions
- cpp/jit
- distributed
- _composable
- _tensor
- checkpoint
- fsdp
- tensor/parallel
- distributions
- dynamo
- functorch
- inductor
- onnx
- profiler
- quantization
- ao_migration
- core
- fx
- tools
- autograd
- dynamo
- torch
- _C
- _dynamo
- optimizations
- variables
- _functorch
- _inductor
- codegen
- triton_ops
- ao
- ns
- fx
- pruning/_experimental/pruner
- quantization
- fx
- autograd
- csrc
- autograd/functions
- distributed/c10d
- jit/codegen/cuda/test
- lazy/core
- utils
- distributed
- _composable
- _tensor
- ops
- algorithms/ddp_comm_hooks
- checkpoint
- fsdp
- tensor
- parallel
- distributions
- fx
- experimental
- nn
- onnx
- optim
- profiler
- quantization/fx
- testing
- _internal
- distributed
- _shard/sharded_tensor
- utils/data
- datapipes
- iter
- utils
Some content is hidden
Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.
326 files changed
+9713
-4580
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
8 | 8 | | |
9 | 9 | | |
10 | 10 | | |
11 | | - | |
| 11 | + | |
12 | 12 | | |
13 | 13 | | |
14 | 14 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
47 | 47 | | |
48 | 48 | | |
49 | 49 | | |
50 | | - | |
| 50 | + | |
51 | 51 | | |
52 | 52 | | |
53 | 53 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
98 | 98 | | |
99 | 99 | | |
100 | 100 | | |
101 | | - | |
| 101 | + | |
102 | 102 | | |
103 | 103 | | |
104 | 104 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1 | | - | |
| 1 | + | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
47 | 47 | | |
48 | 48 | | |
49 | 49 | | |
50 | | - | |
| 50 | + | |
51 | 51 | | |
52 | 52 | | |
53 | 53 | | |
54 | 54 | | |
| 55 | + | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
228 | 228 | | |
229 | 229 | | |
230 | 230 | | |
231 | | - | |
232 | 231 | | |
233 | 232 | | |
| 233 | + | |
| 234 | + | |
| 235 | + | |
| 236 | + | |
| 237 | + | |
| 238 | + | |
234 | 239 | | |
235 | 240 | | |
236 | 241 | | |
| |||
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
10 | 10 | | |
11 | 11 | | |
12 | 12 | | |
13 | | - | |
| 13 | + | |
14 | 14 | | |
15 | 15 | | |
16 | 16 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
16 | 16 | | |
17 | 17 | | |
18 | 18 | | |
| 19 | + | |
19 | 20 | | |
20 | 21 | | |
21 | 22 | | |
| |||
215 | 216 | | |
216 | 217 | | |
217 | 218 | | |
| 219 | + | |
218 | 220 | | |
219 | 221 | | |
220 | 222 | | |
| |||
249 | 251 | | |
250 | 252 | | |
251 | 253 | | |
| 254 | + | |
252 | 255 | | |
253 | 256 | | |
254 | 257 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
101 | 101 | | |
102 | 102 | | |
103 | 103 | | |
| 104 | + | |
| 105 | + | |
| 106 | + | |
| 107 | + | |
| 108 | + | |
| 109 | + | |
| 110 | + | |
| 111 | + | |
| 112 | + | |
| 113 | + | |
104 | 114 | | |
105 | 115 | | |
106 | 116 | | |
| |||
148 | 158 | | |
149 | 159 | | |
150 | 160 | | |
| 161 | + | |
151 | 162 | | |
152 | 163 | | |
153 | 164 | | |
| |||
779 | 790 | | |
780 | 791 | | |
781 | 792 | | |
| 793 | + | |
782 | 794 | | |
783 | 795 | | |
784 | 796 | | |
| |||
0 commit comments