Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

bgzipped ref fasta take much longer time #260

Open
baozg opened this issue Nov 12, 2024 · 2 comments
Open

bgzipped ref fasta take much longer time #260

baozg opened this issue Nov 12, 2024 · 2 comments
Labels
performance Issues related to computational perfromance third-party problem Problem is related to other tools, libraries, system etc

Comments

@baozg
Copy link

baozg commented Nov 12, 2024

Hi,

Didi IsoQuant support bgzipped reference fasta? I have a chance to run same bam with / without bgzipped fasta. The uncompressed fasta run took ~1h to finish, but the compressed run more than 24 hours

@andrewprzh
Copy link
Collaborator

@baozg

I think I never tested performance on bzgipped reference. It seems that the library I use (pyfaidx) is not very efficient in this case (and IsoQuant does use reference genome a lot). Although it supposed to make an index.

I will test it and see what can be done, at least a warning can be useful - thanks for the report!

Best
Andrey

@andrewprzh andrewprzh added performance Issues related to computational perfromance third-party problem Problem is related to other tools, libraries, system etc labels Nov 18, 2024
@andrewprzh
Copy link
Collaborator

@baozg

New version 3.6.3 does not use bgzipped reference genomes but extracts it first to ensure better performance. Thanks for the report!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
performance Issues related to computational perfromance third-party problem Problem is related to other tools, libraries, system etc
Projects
None yet
Development

No branches or pull requests

2 participants