add int8 matmul support to CUDA backend #3508

verstatx · 2023-10-04T10:52:33Z

Description

Adds support for int8 matmul in the CUDA backend using cublasGemmEx functions. This modifies the gemm functions' api to support a different output array type, so all backends were modified.

This PR depends on s8 support: #3507
Fixes: #1656

Checklist

Rebased on latest master with signed 8-bit integer support #3507
Code compiles
Tests pass
Functions documented

edwinsolisf

Tested on RTX 3070Ti all backends on Ubuntu & Windows

changes to gemm account for differing input/output types

verstatx · 2025-03-17T04:18:29Z

Squashed commits and rebased on master. Included a minor lint fix to whitespace that I apparently missed. My apologies if this interfered with your process!

verstatx marked this pull request as draft October 4, 2023 10:52

melonakos marked this pull request as ready for review February 11, 2025 21:48

melonakos added this to the 3.10 milestone Feb 11, 2025

edwinsolisf previously approved these changes Mar 16, 2025

View reviewed changes

verstatx dismissed edwinsolisf’s stale review via ebbfe11 March 17, 2025 04:09

verstatx force-pushed the int8_matmul branch from 6580141 to ebbfe11 Compare March 17, 2025 04:09

Add int8 matmul support to the CUDA backend

8341aca

changes to gemm account for differing input/output types

verstatx force-pushed the int8_matmul branch from ebbfe11 to 8341aca Compare March 17, 2025 04:14

edwinsolisf self-requested a review March 20, 2025 23:40

edwinsolisf approved these changes Mar 20, 2025

View reviewed changes

edwinsolisf merged commit ccac73e into arrayfire:master Mar 28, 2025
2 of 4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add int8 matmul support to CUDA backend #3508

add int8 matmul support to CUDA backend #3508

Uh oh!

verstatx commented Oct 4, 2023 •

edited

Loading

Uh oh!

edwinsolisf left a comment

Uh oh!

verstatx commented Mar 17, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

add int8 matmul support to CUDA backend #3508

add int8 matmul support to CUDA backend #3508

Uh oh!

Conversation

verstatx commented Oct 4, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Checklist

Uh oh!

edwinsolisf left a comment

Choose a reason for hiding this comment

Uh oh!

verstatx commented Mar 17, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

verstatx commented Oct 4, 2023 •

edited

Loading