Commit bf91e1a
authored
[Transform] SpinQuant R4 (#1746)
## Purpose ##
* Support R4 transforms before R3. R3 requires hooking into the
attention module, where as R4 does not
## Prerequisites ##
* vllm-project/vllm#22486
## Testing ##
* Performed sanity checks with HF and vLLM
---------
Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>1 parent 6af0778 commit bf91e1a
1 file changed
+18
-4
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
104 | 104 | | |
105 | 105 | | |
106 | 106 | | |
107 | | - | |
| 107 | + | |
108 | 108 | | |
109 | 109 | | |
110 | 110 | | |
| |||
237 | 237 | | |
238 | 238 | | |
239 | 239 | | |
240 | | - | |
| 240 | + | |
241 | 241 | | |
242 | 242 | | |
243 | 243 | | |
244 | | - | |
245 | | - | |
| 244 | + | |
| 245 | + | |
| 246 | + | |
| 247 | + | |
| 248 | + | |
| 249 | + | |
| 250 | + | |
| 251 | + | |
| 252 | + | |
| 253 | + | |
| 254 | + | |
| 255 | + | |
| 256 | + | |
| 257 | + | |
| 258 | + | |
| 259 | + | |
246 | 260 | | |
0 commit comments