Dbit #95

lillux · 2023-12-04T17:03:50Z

This pull-request proposes a reader for DBiT-seq experiments.
The reader handles DBiT-seq experiments with or without histological images. It expects the histological image to be cropped and transformed correctly by the user, to match the area covered by the chip.
The reader accepts one experiment per object.

The barcode_position file is supposed to be a tab separated text file, headerless, with 2 columns: the first for the barcode coordinate (A and B, 50 barcodes each), the second for the barcode sequence.

A1    AACGTGAT
A2    AAACATCG
A3    ATGCCTAA
A4    AGTGGTCA
A5    ACCACTGT
A6    ACATTGGC
A7    CAGATCTG
A8    CATCAAGT
A9    CGCTGATC
A10  ACAAGCTA
...

Here a barcode_position file as a reference:
barcode_list.txt

We are happy to have feedback from you!
Thanks for your support!

for more information, see https://pre-commit.ci

…dbit

for more information, see https://pre-commit.ci

codecov-commenter · 2023-12-05T09:32:33Z

Codecov Report

Attention: 87 lines in your changes are missing coverage. Please review.

Comparison is base (449f4b8) 36.83% compared to head (955a52c) 35.73%.
Report is 1 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main      #95      +/-   ##
==========================================
- Coverage   36.83%   35.73%   -1.10%     
==========================================
  Files          15       16       +1     
  Lines        1075     1192     +117     
==========================================
+ Hits          396      426      +30     
- Misses        679      766      +87

Files	Coverage Δ
src/spatialdata_io/__init__.py	`100.00% <100.00%> (ø)`
src/spatialdata_io/_constants/_constants.py	`100.00% <100.00%> (ø)`
src/spatialdata_io/readers/dbit.py	`21.62% <21.62%> (ø)`

for more information, see https://pre-commit.ci

giovp

hi @lillux , thank you for this PR! and sorry for getting back to you so late. It looks good, I made some few comments that should just be about a little refactoring to strive for simplicity, let me know if anything is unclear, looking forward to get this through the finish line!

src/spatialdata_io/__init__.py

giovp · 2024-01-07T08:46:57Z

src/spatialdata_io/_constants/_constants.py

+    """Keys for DBiT formatted dataset."""
+
+    # files and directories
+    COUNTS_FILE = ".h5ad"


just out of curiosity, but which pipeline/preprocessing output an h5ad directly?

One thing is that DBiT seq can be used for both scATAC and scRNA data. We currently use a simple script to process fastq so that they can be feeded to any scRNA/scATAC processing pipeline. From that point on, it's just SOTA processing producing h5ad files. The obs_name in the h5ad will be then used to map texels on the surface.

great, thank you for the clarification @dawe !

src/spatialdata_io/_constants/_constants.py

src/spatialdata_io/readers/DBiT.py

for more information, see https://pre-commit.ci

lillux

Dear @giovp, we have reviewed the code as suggested. Thanks for the useful comments, let us know if there is something else to fix.

Best,
Lillo

giovp

thank you @lillux and sorry for late reply. So, to be totally honest, I don't have experience with this data, and also am not really able to test it. We also don't really have a triage/experimental section where we could put this yet. However, if it enables yours and other users analysis, and considering that it seems like a widespread enough technology, I would be positive to include this as is in the main module. What @LucaMarconato @melonora @kevinyamauchi think?

If we get couple positive feedback from other core devs, I'd be happy to merge this!

ah one more thing, if you could add a line in the changelog about this would be very helpful! Thank you!

giovp · 2024-02-05T17:17:58Z

src/spatialdata_io/_constants/_constants.py

+    """Keys for DBiT formatted dataset."""
+
+    # files and directories
+    COUNTS_FILE = ".h5ad"


great, thank you for the clarification @dawe !

src/spatialdata_io/readers/dbit.py

melonora · 2024-02-05T19:08:11Z

thank you @lillux and sorry for late reply. So, to be totally honest, I don't have experience with this data, and also am not really able to test it. We also don't really have a triage/experimental section where we could put this yet. However, if it enables yours and other users analysis, and considering that it seems like a widespread enough technology, I would be positive to include this as is in the main module. What @LucaMarconato @melonora @kevinyamauchi think?

If we get couple positive feedback from other core devs, I'd be happy to merge this!

ah one more thing, if you could add a line in the changelog about this would be very helpful! Thank you!

I am not against it. Maybe one thing is that with multiple tables soon fully supported in SpatialData and tables not necessarily having to annotate a SpatialElement we could include the barcode list. We could either wait until it is ready or merge and afterwards open a PR for this.

kevinyamauchi · 2024-02-05T20:48:13Z

I haven't had time to review, but I would be okay with an optimistic merge if it allows people to try out this type of data.

I would vote to merge once @giovp 's comments are addressed. We can circle back to the multiple tables later (as suggested by @melonora) necessary. I think it's better to get it in and iterate rather than wait for the multiple tables to land.

Thanks, @lillux !

LucaMarconato · 2024-02-05T21:45:38Z

Thanks @lillux; very in favor of merging as it could be helpful to other people working with this technology.

On the practical side, if you have some data to suggest/share so that we can test this, please let us know. Otherwise gonna also give a pass to the code before merge. Let us know please 😊

This support DBiT with grids other than 50 x 50.

for more information, see https://pre-commit.ci

lillux · 2024-02-06T20:04:47Z

Thanks all for your comments!

Here are some data to test the reader.

We are available to maintain the reader to keep up with future updates of both the DBiT technology and SpatialData development.

@giovp I just added an entry for the DBiT reader in the Changelog, let me know if that is what you meant.

kevinyamauchi · 2024-02-07T19:47:00Z

We are available to maintain the reader to keep up with future updates of both the DBiT technology and SpatialData development.

That's great to hear, @lillux ! Thanks for the updates to the PR.

I'm not sure how the changelogs work, so I'll let @giovp or @LucaMarconato give the final review and merge.

for more information, see https://pre-commit.ci

LucaMarconato · 2024-02-09T00:06:41Z

@lillux I have just checked and tried the reader, great work! Merging now 😊

We plan to make a release by the end of the month (it's going to take a while because at the moment we are working on some large PRs in spatialdata), but looking forward to have this in PyPI!

kevinyamauchi · 2024-02-09T10:19:03Z

Nice work, everyone! Super cool to see another reader added.

lillux · 2024-02-09T12:34:06Z

Thanks @giovp , @LucaMarconato , @kevinyamauchi , @melonora for your support and advices in bringing this code to production! I'm looking forward to further contribute to the scverse ecosystem!

All the best,
Lillo

Dbit

lillux and others added 16 commits December 1, 2023 11:15

Added draft of DBiT reader

af2a72f

Hardcoded variable for grid scaling

c50fc92

Merge branch 'main' of https://github.com/lillux/spatialdata-io

faeb59d

Added comments

1666707

Ignore spyder data

61b71a0

Remove spyder data

7dec791

Removed metadata

22c25e4

Add symbol export spec in DBiT

8c35aef

Fixed typos

d944741

Added _constants for DBiT, and related check

b8fd2a6

[pre-commit.ci] auto fixes from pre-commit.com hooks

d12c518

for more information, see https://pre-commit.ci

Open .h5ad with anndata instead of scanpy

6ea5614

Merge branch 'dbit' of https://github.com/lillux/spatialdata-io into …

b41451d

…dbit

[pre-commit.ci] auto fixes from pre-commit.com hooks

ed34628

for more information, see https://pre-commit.ci

Fixed import

fb55943

[pre-commit.ci] auto fixes from pre-commit.com hooks

697064e

for more information, see https://pre-commit.ci

lillux and others added 6 commits December 5, 2023 11:22

fixes from pre-commit.ci

f2960aa

[pre-commit.ci] auto fixes from pre-commit.com hooks

1c493e3

for more information, see https://pre-commit.ci

fixed exceptions and type hints

63a916f

[pre-commit.ci] auto fixes from pre-commit.com hooks

3234bba

for more information, see https://pre-commit.ci

Fix typing

9f862da

fixed variable names

d7dc6e2

giovp requested changes Jan 7, 2024

View reviewed changes

lillux and others added 6 commits January 9, 2024 16:33

Merge branch 'scverse:main' into dbit

24bb806

Merge branch 'scverse:main' into dbit

48fca73

code revision started

fbe6c82

[pre-commit.ci] auto fixes from pre-commit.com hooks

3821757

for more information, see https://pre-commit.ci

Code revision part 2

8d4ae3e

[pre-commit.ci] auto fixes from pre-commit.com hooks

92df11d

for more information, see https://pre-commit.ci

lillux and others added 7 commits January 26, 2024 00:11

Fix mypy

74ca806

fix mypy

7578970

Docstring fix

ee959b8

mypy errors fix

c04ba2a

mypy errors

1b95a97

ignore mypy

fd6d9cc

[pre-commit.ci] auto fixes from pre-commit.com hooks

aa58adc

for more information, see https://pre-commit.ci

lillux commented Jan 29, 2024

View reviewed changes

lillux requested a review from giovp January 29, 2024 09:22

Merge branch 'scverse:main' into dbit

a965e32

giovp reviewed Feb 5, 2024

View reviewed changes

lillux and others added 5 commits February 6, 2024 08:21

Now the dimension of the DBiT grid depends on the number of barcodes.

c49c5d5

This support DBiT with grids other than 50 x 50.

Docstrings fix

ffd8a9d

[pre-commit.ci] auto fixes from pre-commit.com hooks

b1173c2

for more information, see https://pre-commit.ci

Added DBiT-seq reader to changelog

2ec3f7a

[pre-commit.ci] auto fixes from pre-commit.com hooks

955a52c

for more information, see https://pre-commit.ci

lillux and others added 3 commits February 7, 2024 22:44

Merge branch 'scverse:main' into dbit

90ba226

Merge branch 'main' into dbit

b9ddfb0

[pre-commit.ci] auto fixes from pre-commit.com hooks

cb94f9b

for more information, see https://pre-commit.ci

LucaMarconato approved these changes Feb 9, 2024

View reviewed changes

LucaMarconato merged commit 94d856f into scverse:main Feb 9, 2024
6 checks passed

lucas-diedrich pushed a commit to lucas-diedrich/spatialdata-io that referenced this pull request Nov 26, 2024

Merge pull request scverse#95 from lillux/dbit

54d0d2a

Dbit

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dbit #95

Dbit #95

lillux commented Dec 4, 2023

codecov-commenter commented Dec 5, 2023 •

edited

Loading

giovp left a comment

giovp Jan 7, 2024

dawe Jan 16, 2024

giovp Feb 5, 2024

lillux left a comment

giovp left a comment •

edited

Loading

giovp Feb 5, 2024

melonora commented Feb 5, 2024

kevinyamauchi commented Feb 5, 2024

LucaMarconato commented Feb 5, 2024

lillux commented Feb 6, 2024

kevinyamauchi commented Feb 7, 2024

LucaMarconato commented Feb 9, 2024

kevinyamauchi commented Feb 9, 2024

lillux commented Feb 9, 2024

Dbit #95

Dbit #95

Conversation

lillux commented Dec 4, 2023

codecov-commenter commented Dec 5, 2023 • edited Loading

Codecov Report

giovp left a comment

Choose a reason for hiding this comment

giovp Jan 7, 2024

Choose a reason for hiding this comment

dawe Jan 16, 2024

Choose a reason for hiding this comment

giovp Feb 5, 2024

Choose a reason for hiding this comment

lillux left a comment

Choose a reason for hiding this comment

giovp left a comment • edited Loading

Choose a reason for hiding this comment

giovp Feb 5, 2024

Choose a reason for hiding this comment

melonora commented Feb 5, 2024

kevinyamauchi commented Feb 5, 2024

LucaMarconato commented Feb 5, 2024

lillux commented Feb 6, 2024

kevinyamauchi commented Feb 7, 2024

LucaMarconato commented Feb 9, 2024

kevinyamauchi commented Feb 9, 2024

lillux commented Feb 9, 2024

codecov-commenter commented Dec 5, 2023 •

edited

Loading

giovp left a comment •

edited

Loading