adding F-contig layout support + other cleanups by SwayamInSync · Pull Request #108 · numpy/numpy-quaddtype

SwayamInSync · 2026-06-05T12:15:33Z

closes #89

This PR improves the BLAS integration very parallel to the NumPy's own logic.

Mirrors NumPy's own @TYPE@_matmul dispatch. NumPy hands the inner loop raw strides and expects the loop itself to dispatch, so we do the same instead of assuming row-major.
Ports NumPy's is_blasable2d check: an operand is BLAS-able iff its fast axis is contiguous and the slow axis is a valid leading dimension; each operand is tested in both orientations (C- and F-blasable).
BLAS-able operands are passed to QBLAS zero-copy with the matching transpose flag ('N'/'T') and lda derived from strides, so F-contiguous/transposed inputs need no copy (QBLAS already supports transa/transb correctly).
Non-BLAS-able operands (fully strided, negative strides) and non-row-major output are gathered into contiguous temps and scattered back, exactly NumPy's copy fallback.
Oversized (dim > INT_MAX) or empty cores fall back to the naive strided loop, matching NumPy's too_big_for_blas guard.

I think this is a good base to integrate dot operations efficiently in follow-up PRs

SwayamInSync · 2026-06-05T12:16:59Z

Ahh the fix to run older-cpu workflows is done in #104 , once that will merge we can update here.

Copilot

Pull request overview

This PR fixes incorrect matmul results for non-row-major (notably F-contiguous and transposed-view) QuadPrecision inputs by making the QBLAS dispatch stride-aware (mirroring NumPy’s layout/BLAS-ability checks) and expanding test coverage to exercise the full layout matrix and copy-fallback paths.

Changes:

Update the matmul strided loop to detect BLAS-able 2D operands in either orientation and pass the appropriate transa/transb + lda/ldb/ldc, copying only when required.
Remove the prior xfails for Fortran-order/transposed batched cases and add new tests covering layout combinations, output layout, empty-core behavior, and copy-fallback scenarios.
Adjust QBLAS interface edge handling for empty GEMV/GEMM calls.

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 2 comments.

File	Description
`tests/test_dot.py`	Removes `xfail`s for issue #89 and adds comprehensive layout/copy-path tests for `matmul`.
`src/csrc/umath/matmul.cpp`	Implements stride-derived BLAS dispatch (incl. transpose flags) plus gather/scatter copy fallback and size/empty guards.
`src/csrc/quadblas_interface.cpp`	Tweaks empty-dimension behavior for `qblas_gemv` and `qblas_gemm`.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

SwayamInSync · 2026-06-08T19:39:39Z

cc: @ngoldbaum this one too :)

ngoldbaum

Overall looks good, just a few minor suggestions inline.

ngoldbaum · 2026-06-08T21:57:20Z

+        }
        return 0;
    }
    if (!alpha || !A || !x || !beta || !y) {


You already checked !beta above so might as well delete that here

ngoldbaum · 2026-06-08T21:59:41Z

+        if kind == "F":
+            return np.asfortranarray(a)
+        if kind == "T":
+            return np.ascontiguousarray(a.T).T


Isn't this the same as F-order? Maybe makes more sense to just return a.T, which is different from a fortran-order copy.

ngoldbaum · 2026-06-08T22:00:57Z

           Sleef_quad *beta, Sleef_quad *C, size_t ldc)
 {
-    if (m == 0 || n == 0 || k == 0) {
+    if (m == 0 || n == 0) {


Claude says that no tests hit this code change. You could add a test that does np.matmul(a, b) where the contracted dimension is zero to hit this.

ngoldbaum · 2026-06-08T22:01:49Z

+        rng = np.random.default_rng(seed=201)
+        A_f = rng.standard_normal((6, 5))
+        B_f = rng.standard_normal((5, 7))
+        _assert_matmul_matches_float64(_qnd(A_f)[::-1], _qnd(B_f), A_f[::-1], B_f)


Maybe add a test for negative column strides too? e.g. A[:, ::-1].

ngoldbaum · 2026-06-08T22:04:19Z

+        rng = np.random.default_rng(seed=202)
+        A_f = rng.standard_normal((20, 7))
+        B_f = rng.standard_normal((7, 11))
+        out = np.asfortranarray(np.zeros((10, 11), dtype=QuadPrecDType()))


Maybe also add a test for strided out? Something like out=np.empty(14, dtype=QuadPrecDType())[::2]; np.matmul(A, B, out=out).

adding F-contig layout support + other cleanups

2afe729

SwayamInSync requested a review from Copilot June 5, 2026 14:47

Copilot started reviewing on behalf of SwayamInSync June 5, 2026 14:47 View session

SwayamInSync requested a review from ngoldbaum June 5, 2026 14:47

Copilot AI reviewed Jun 5, 2026

View reviewed changes

Comment thread src/csrc/quadblas_interface.cpp

Comment thread src/csrc/umath/matmul.cpp

SwayamInSync added 2 commits June 5, 2026 17:24

Merge branch 'main' into fix-matmul-layout-89

28764ee

gemv null handling

784a75f

SwayamInSync mentioned this pull request Jun 6, 2026

ENH: Support the dot family for user-dtypes numpy/numpy#31574

Open

ngoldbaum approved these changes Jun 8, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

adding F-contig layout support + other cleanups#108

adding F-contig layout support + other cleanups#108
SwayamInSync wants to merge 3 commits into
numpy:mainfrom
SwayamInSync:fix-matmul-layout-89

SwayamInSync commented Jun 5, 2026 •

edited

Loading

Uh oh!

SwayamInSync commented Jun 5, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

SwayamInSync commented Jun 8, 2026

Uh oh!

ngoldbaum left a comment

Uh oh!

ngoldbaum Jun 8, 2026

Uh oh!

ngoldbaum Jun 8, 2026

Uh oh!

ngoldbaum Jun 8, 2026

Uh oh!

ngoldbaum Jun 8, 2026

Uh oh!

ngoldbaum Jun 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

SwayamInSync commented Jun 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

SwayamInSync commented Jun 5, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

SwayamInSync commented Jun 8, 2026

Uh oh!

ngoldbaum left a comment

Choose a reason for hiding this comment

Uh oh!

ngoldbaum Jun 8, 2026

Choose a reason for hiding this comment

Uh oh!

ngoldbaum Jun 8, 2026

Choose a reason for hiding this comment

Uh oh!

ngoldbaum Jun 8, 2026

Choose a reason for hiding this comment

Uh oh!

ngoldbaum Jun 8, 2026

Choose a reason for hiding this comment

Uh oh!

ngoldbaum Jun 8, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

SwayamInSync commented Jun 5, 2026 •

edited

Loading