-
Notifications
You must be signed in to change notification settings - Fork 18
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Improve Wilms Tumor Dataset Annotation (SCPCP000006) - explore predicted.score
and has_cnv.score
thresholds
#856
Comments
Hi @maud-p, glad to see you back here in issues! I wanted to give you a heads up about continuing this module - I am still working behind the scenes on your module to get it all running in CI. I have updated the label transfer code but it's not yet merged into main (but will be within the next 2 weeks I think 🤞), since I am still working in a separate branch to fix some bugs we are now able to find with all code running in CI. You can see code as we work on it in this branch: https://github.com/AlexsLemonade/OpenScPCA-analysis/tree/feature/wilms-tumor-06-azimuth. While I am still working in my fork, rather than sending PRs to main, I am sending them here. Once this is entirely finished, we'll merge that branch into main. FYI - one silly (!!!) bug I found is that somehow we never actually applied the score threshold in inferCNV - woops!! So as part of this, I am making sure we use the threshold in that script too! I think that working on the module while I am still doing this will result in _a lot_of conflicts which will be very challenging to resolve. Also, the results will slightly change because of the new label transfer code, and the actual use of the |
@sjspielman thank you for all your efforts in making the analysis run in CI! I understand and I can wait, no problem at all! |
Hi @maud-p ! We're all done working to update your module to ensure it runs smoothly through CI. We've made a decent number of changes to the module workflow. Here are the most important ones to be aware of:
Before you return to analysis (if you're still interested!), I recommend you take a little time to look over how the module now looks, let us know what questions you have! Let us know if we can help you sync back up too, in case of conflicts when you pull into your fork. Also, FYI - we will be teaching a workshop the week of December 9th, so we may be slower to respond during those few days. |
Hi @sjspielman , Thank you so much! From a first rapid look, this all seems like great changes and really well organized, so thank you so much! I am still willing to continue working on the dataset! I'll try to play with the threshold parameters in the last step I'll add the differential expression analysis to find the marker genes candidate in the module, if you think it can be a good add on, and then we can discuss further how you like to follow the IF/IHC validation, (if you like to!). Time-wise, I plan to focus on that analysis starting from January 2025, I am afraid I won't make it before Christmas. Thank you again, looking forward the next steps! |
Absolutely, this analysis module is all yours! It's here to work on whenever you are able; no rush on our end :) Enjoy your holiday season!! |
If you are filing this issue based on a specific GitHub Discussion, please link to the relevant Discussion.
This issue follows the PR #844 and the 2 comments:
Describe the goals of the changes to the analysis module.
I would like to explore difefrent thresholds for filtering and annotating based on the
predicted.score
andcnv.score
.I would like to:
improve the umpa reduction visualization with a 2-colors plot showing only one annotation and the rest in grey.
look at the distribution of
predicted.score
for each of thepredicted.compartment
andpredicted.cell_type
. So far, we only used thepredicted.score
to select normal cells (i.e. endothelial and immune cells), but don't use it to filter out cells with very low confident annotation (label as unknown).render few notebook with a cnv_threshold of 0, 1 or 2 and evaluate the identification of normal cells. I'd like to check the distribution of the
predicted.score
of endothelial, immune, normal kidney and normal stroma cells using each of the threshold. It can be that, due to false positive cnv, normal cells showed some infered cnv. If this is the case, we should expect to recover more normal cells with highpredicted.score
using highercnv_threshold
.What will your pull request contain?
Few changes in the
07
notebookWill you require additional software beyond what is already in the analysis module?
No response
Will you require different computational resources beyond what the analysis module already uses?
No response
If known, when do you expect to file the pull request?
~ November
The text was updated successfully, but these errors were encountered: