Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

SCT data or scale.data to select top highly expressed genes in scRNAseq to compare with VST-transformed values in bulkRNAseq? #9660

Open
denvercal1234GitHub opened this issue Jan 31, 2025 · 0 comments

Comments

@denvercal1234GitHub
Copy link

denvercal1234GitHub commented Jan 31, 2025

Hi there,

I had a dataset of bulkRNAseq that was processed via DESeq2. For a sample, I selected the top 100 highly expressed genes as below using the vat-transformed values.

vsd <- DESeq2::vst(dds_Exvivo_DatePlSubs, blind = F)
filtered_vsd <- vsd_Tfh[rowMeans(vsd, na.rm = TRUE) > 0, ]
select_vsd <- order(rowMeans(filtered_vsd), decreasing = TRUE)[1:100]

I now want to check whether there was any genes within these 100 top-highly expressed genes by RNAseq that were not detected by scRNAseq.

I don't think I should use scale.data here, but actually scale.data being z-scored/variance-stabilized data would be more equivalent to vst, because SCT data would only be normalized and log (and not variance-stabilized).

My question is would the SCTransform layer of data be appropriate for this purpose, as below? If so, should I use counts(dds_Exvivo_DatePlSubs, normalized = TRUE) instead to select top highly expressed genes in RNAseq to compare?

avg_expr <- rowMeans(GetAssayData(IndividualDatasetMERGED, layer = "data"), na.rm = TRUE)
avg_expr_filtered <- avg_expr[avg_expr > 0]
select_avg <- names(sort(avg_expr_filtered, decreasing = TRUE)[1:100])

Thank you for your help!

Related: https://support.bioconductor.org/p/9161164/#9161168

@denvercal1234GitHub denvercal1234GitHub changed the title SCT data or scale.data to select top highly expressed genes to compare with VST-transformed values in bulkRNAseq? SCT data or scale.data to select top highly expressed genes in scRNAseq to compare with VST-transformed values in bulkRNAseq? Jan 31, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant