WIP: Test non-canonical sparse arrays #14

uellue · 2023-06-27T09:35:52Z

scipy.sparse generally supports them, while sparse.pydata.org struggles.

Make sure there are no false positives and update converter matrix to support them correctly.
Report upstream if applicable
Changelog
Minor release

Closes #13

scipy.sparse generally supports them, while sparse.pydata.org struggles. TOD: Make sure there are no false positives and updtae converter matrix to support them correctly. Also, perhaps report upstream to be documented properly?

TODO get COO to work correctly -- it broke from deduplication

* Detection and workaround for cupy/cupy#7713 * Correct coo_matrix scrambling: canonical format attribute is cached and sparse.COO depends on it. Create new scrambled coo_matrix instead of modifying in place so that the attributes are correct. * Update conversion cost

uellue · 2023-07-11T18:30:45Z

I can't reproduce the issues on Linux with scipy.sparse.csc_matrix locally, even with the same versions of Python, NumPy, SciPy. Weird! I'm out of depth what to do here.

uellue · 2023-07-11T18:57:40Z

The error seems to be caused by the duplicate since the last two entries in the array differ. Strange that it doesn't occur locally. Race condition? Compiler/CPU architecture/Undefined Behavior? Hard to report without a local reproducer!

uellue · 2023-07-11T19:02:35Z

Actually got a "hit" on Windows. Different probability points at a race condition?

* Also prune when deduplicating * Deduplicate CSC before densifying

Handle empty cols/rows at the end of the array correctly. Edge cases to be tested, but unlikely to occur.

uellue · 2023-07-11T20:47:02Z

Actually got a "hit" on Windows. Different probability points at a race condition?

Likely cause was an incorrect routine to add duplicates. It added the value to the wrong column/row if the last one was empty.

WIP: Test non-canonical sparse arrays

111edb8

scipy.sparse generally supports them, while sparse.pydata.org struggles. TOD: Make sure there are no false positives and updtae converter matrix to support them correctly. Also, perhaps report upstream to be documented properly?

uellue marked this pull request as draft June 27, 2023 09:36

This was referenced Jul 4, 2023

Constructing GCXS from non-canonical scipy.sparse.csr_matrix results in wrong results pydata/sparse#602

Open

sparse.GCXS silently requires canonical form #13

Closed

uellue added 3 commits July 4, 2023 15:06

WIP: Test operations on array; fixed CSR

52ca394

TODO get COO to work correctly -- it broke from deduplication

Version and changelog entry for minor release

8bc1f9a

uellue marked this pull request as ready for review July 11, 2023 18:37

Very verbose output to catch spurious issues in CI

4292b98

uellue added 4 commits July 11, 2023 21:15

Tentative fix for CSC test failure

8294f78

* Also prune when deduplicating * Deduplicate CSC before densifying

Remove tentative fix since ineffective in CI

6f54e90

Create new CSC/CSR array instead of modifying in-place

c5ffb75

Fix routine that adds duplicates to CSC/CSR

82e13a2

Handle empty cols/rows at the end of the array correctly. Edge cases to be tested, but unlikely to occur.

uellue merged commit dfe02ed into LiberTEM:main Jul 12, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WIP: Test non-canonical sparse arrays #14

WIP: Test non-canonical sparse arrays #14

uellue commented Jun 27, 2023 •

edited

Loading

uellue commented Jul 11, 2023

uellue commented Jul 11, 2023

uellue commented Jul 11, 2023

uellue commented Jul 11, 2023

WIP: Test non-canonical sparse arrays #14

WIP: Test non-canonical sparse arrays #14

Conversation

uellue commented Jun 27, 2023 • edited Loading

uellue commented Jul 11, 2023

uellue commented Jul 11, 2023

uellue commented Jul 11, 2023

uellue commented Jul 11, 2023

uellue commented Jun 27, 2023 •

edited

Loading