Splice types #215

mumichae · 2021-06-24T11:20:04Z

Annotate Splicing outputs with different splice types.

mumichae · 2021-06-24T12:27:42Z

drop/modules/aberrant-splicing-pipeline/Counting/03_filter_expression_FraseR.R

 #'   - theta:  '`sm cfg.getProcessedDataDir()+
 #'                  "/aberrant_splicing/datasets/savedObjects/raw-{dataset}/theta.h5"`'
+#'   - txdb: '`sm cfg.getProcessedDataDir() + "/aberrant_expression/{annotation}/txdb.db"`'


Snakemake cannot find the txdb because the annotation wildcard does not appear in the output of this rule.
I wouldn't annotate the fds here anyway, because it is not meant to be annotation-dependent. It would be more useful to annotate only in the steps after FRASER fitting.
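
To illustrate the wildcard problem, here is a minimal Python sketch of the rule that every wildcard used in an input pattern must also occur in an output pattern. It is a simplified model, not Snakemake's actual resolver; the helper names, the "processed" path prefix, and the output path are hypothetical placeholders.

```python
import re

# Matches simple {name} placeholders (wildcard constraints are ignored here).
WILDCARD = re.compile(r"\{([A-Za-z_]\w*)\}")


def wildcard_names(pattern):
    """Return the set of {name} placeholders in a path pattern."""
    return set(WILDCARD.findall(pattern))


def unresolved_input_wildcards(inputs, outputs):
    """Return wildcards used by input patterns that no output pattern defines."""
    defined = set()
    for pattern in outputs:
        defined |= wildcard_names(pattern)
    missing = set()
    for pattern in inputs:
        missing |= wildcard_names(pattern) - defined
    return missing


# Input patterns mirror the diff above; the output pattern is a made-up example.
inputs = [
    "processed/aberrant_splicing/datasets/savedObjects/raw-{dataset}/theta.h5",
    "processed/aberrant_expression/{annotation}/txdb.db",
]
outputs = ["processed/aberrant_splicing/datasets/savedObjects/{dataset}/fds.done"]

missing = unresolved_input_wildcards(inputs, outputs)
print(missing)  # {'annotation'}
```

Because the output only fixes dataset, there is no value from which annotation could be inferred, so the txdb input stays unresolved unless the output also carries that wildcard.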

c-mertes · 2021-08-13T21:29:58Z

drop/modules/aberrant-splicing-pipeline/FRASER/07_extract_results_FraseR.R

+#'   - spliceTypeSetup: '`sm cfg.AS.getWorkdir() + "/spliceTypeConfig.R"`'
+#'   - addAnnotation:  '`sm cfg.AS.getWorkdir() + "/fds_annotation.R"`'
+#'   - addSpliceType: '`sm cfg.AS.getWorkdir() + "/spliceType_frameshift_annotation.R"`'
+#'   - subtypes: '`sm cfg.AS.getWorkdir() + "/subtypes_exonSkipping_inconclusive.R"`'
+#'   - annotate_blacklist: '`sm str(projectDir / ".drop" / "helpers" / "annotate_blacklist.R")`'


I would strongly advise putting this in a separate package or within FRASER, rather than leaving it as loose scripts lying around within drop. This functionality could even be useful for FRASER by itself, so please think twice before adding it here as is.

c-mertes · 2021-08-13T21:32:07Z

drop/modules/aberrant-splicing-pipeline/Counting/00_define_datasets_from_anno.R

 #'    - ids: '`sm lambda w: sa.getIDsByGroup(w.dataset, assay="RNA")`'
 #'    - fileMappingFile: '`sm cfg.getRoot() + "/file_mapping.csv"`'
 #'  input:
+#'    - setup: '`sm cfg.AS.getWorkdir() + "/config.R"`'


Are we sure we want the config as a requirement (input) and not as params? There should be nothing in there that could change the outcome of the results.
Of course, this also applies to the other scripts.

c-mertes · 2021-08-13T21:32:49Z

drop/modules/aberrant-splicing-pipeline/Counting/03_filter_expression_FraseR.R

+
+# Add the junction annotations to the fds
+#message("03: load db for annotation")
+#txdb <- loadDb(snakemake@input$txdb)


Please clean up if you do not need the lines.

c-mertes · 2021-08-13T22:02:40Z

drop/modules/aberrant-splicing-pipeline/fds_annotation.R

+
+  ### Do the annotation just for the most used intron (highest median expression)
+  print("start calculating annotations")
+  annotations <- sapply(c(1:length(fds_junctions)), function(i){


Not sure how fast this is, but please consider a vectorized version instead. GRanges is optimized to find overlaps between thousands of ranges at once, not one by one over thousands of ranges.
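
The idea of preparing the reference once and then handling all junctions in a single pass can be sketched in Python as below. This is only an illustration under stated assumptions: it assumes the labels start, end, both, and none mean that a junction's start and/or end coordinate matches an annotated intron, and all names and coordinates are made up. In R, the analogous approach would be a single GenomicRanges findOverlaps call on the whole junction set (for example with type="start" and type="end") instead of one call per junction.

```python
from collections import namedtuple

# Hypothetical record type for junctions and annotated introns.
Interval = namedtuple("Interval", ["chrom", "start", "end", "strand"])


def annotate_junctions(junctions, annotated_introns):
    """Label each junction as 'both', 'start', 'end', or 'none'."""
    # Build the lookup tables once for the whole reference annotation.
    known_starts = {(i.chrom, i.strand, i.start) for i in annotated_introns}
    known_ends = {(i.chrom, i.strand, i.end) for i in annotated_introns}

    labels = []
    for j in junctions:
        start_hit = (j.chrom, j.strand, j.start) in known_starts
        end_hit = (j.chrom, j.strand, j.end) in known_ends
        if start_hit and end_hit:
            labels.append("both")
        elif start_hit:
            labels.append("start")
        elif end_hit:
            labels.append("end")
        else:
            labels.append("none")
    return labels


introns = [
    Interval("chr1", 100, 200, "+"),
    Interval("chr1", 300, 400, "+"),
]
junctions = [
    Interval("chr1", 100, 200, "+"),  # both ends annotated
    Interval("chr1", 100, 250, "+"),  # only the start is annotated
    Interval("chr1", 150, 400, "+"),  # only the end is annotated
    Interval("chr2", 100, 200, "+"),  # different chromosome
]

labels = annotate_junctions(junctions, introns)
print(labels)  # ['both', 'start', 'end', 'none']
```

Each junction then costs two constant-time lookups, rather than a fresh scan of the annotation per junction as in the sapply loop.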

c-mertes · 2021-08-13T22:04:38Z

drop/modules/aberrant-splicing-pipeline/fds_annotation.R

@@ -0,0 +1,67 @@
+### 20210604 klutz
+
+### basic annotations (start, end, none, both) for full fds


Please add a bit more detail about what the function is doing. In particular, going through it, it seems you are only looking at the most highly expressed junctions? Why? Please describe this in the documentation. For more details, see roxygen and some Bioconductor packages.

mumichae · 2021-11-17T11:53:06Z

Hi, is this PR still being worked on?

fds annotation code, blacklist files

5ced6c0

mumichae changed the base branch from master to dev June 24, 2021 12:32

lutzkaro added 6 commits June 25, 2021 13:16

fds first annotation, UTR regions

35228e4

splice types, blacklist

ff37aa7

expression blacklist

3f0a51d

small changes

372f41a

small changes

ef755a3

updated from dev

31ea16f

c-mertes requested changes Aug 13, 2021

small changes and comments

0906ad4
