-
Notifications
You must be signed in to change notification settings - Fork 126
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add sintax #577
Add sintax #577
Conversation
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good to me (except that I feel not confident with my review of ampliseq.nf because it seems to convoluted now).
Docs need a bit of polishing before it can be merged, imho.
Because I solved the merge conflict: please pull your branch before you continue working! |
I broke the |
I think |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think that looks great!
Moving dada2 & sintax taxonomy classification to own subworkflows simplifies ampliseq.nf
significantly, I think. But my recommendation to put phylogenetic placement taxonomical assignment into its own subworkflow was a very bad idea, sorry, because that is already a subworkflow.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Great simplification of the DADA2 taxonomic workflow, thanks! Thats so much easier to read now I think.
I tested a little (not everything) and all I saw was fine, except two tiny points:
- the sintax taxonomy table had NA while other (DADA2, QIIME2) are blank when no info is available
- Processes SINTAX_TAXONOMY_WF:* and QIIME2_TAXONOMY:QIIME2_CLASSIFY are shown even when not executed, while DADA2_TAXONOMY_WF is not shown when not executed
The first point might be good to address, the second point doesnt matter I think.
On an unrelated note, I recognized that NFCORE_AMPLISEQ:AMPLISEQ:QIIME2_DIVERSITY:QIIME2_TREE
is not listed and therefore no QIIME2_DIVERSITY:*
is executing with -profile test
. That still works with 2.5.0
but seems already broken in dev
, so seems unrelated to your PR. I might have broken that in the phylogenetic palcement PR, because I fiddled with the tree info. Will need to investigate at another time.
Thanks for the pointers!
True, fixed it.
This I think is because there is a check if DADA2_TAXONOMY_WF should run, while there is no test for e.g. SINTAX_TAXONOMY_WF:*, instead they are fed an empty input channel and therefore not run. I added a test for sintax, but left the qiime things as they are. There are a lot of qiime modules that are run or not depending on their input channels. |
Thanks! |
This PR adds the SINTAX algorithm as an alternative for taxonomy assignment, and addresses issue #471.
If --sintax_ref_taxonomy is given, SINTAX is used for taxonomical classification in addition to DADA2. The SINTAX taxonomy is used instead of any other classification in downstream steps (qiime, sbdiexport). I also added the option --skip_dada_taxonomy, to skip running DADA2 to get taxonomies.
PR checklist
nf-core lint
).nextflow run . -profile test,docker --outdir <OUTDIR>
).docs/usage.md
is updated.docs/output.md
is updated.CHANGELOG.md
is updated.README.md
is updated (including new tool citations and authors/contributors).