You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I've been trying to run vargeno on non-human data and running into problems at the indexing stage. No error is reported during the process, but the .dict files are both empty, and so the genotyping step fails.
I'm working with a fragmentary reference assembly of a grasshopper genome, so both the bioinformatic and biological properties of the data are not at all what vargeno was designed for.
Do you have any tips for troubleshooting? Attached (here) is a sample of the .vcf input. Since my data is not human data and I'm obviously not working with dbSNP it's a little unclear how to properly format this file. Variants were detected with freebayes in the first instance.
Here is the terminal output:
$ vargeno index packardii.sub.fa snp.vcf test
[BloomFilter constructBfFromGenomeseq] bit vector: 755356701/9600000000
[BloomFilter constructBfFromGenomeseq] lite bit vector: 988176227/18400000000
[BloomFilter constructBfFromVCF] bit vector: 0/1120000000
SNP Dictionary
Total k-mers: 21626752
Unambig k-mers: 20575340
Ambig unique k-mers: 296062
Ambig total k-mers: 1051412
Ref Dictionary
Total k-mers: 1305711431
Unambig k-mers: 1130124620
Ambig unique k-mers: 36489256
Ambig total k-mers: 175586811
And here are the output files:
-rw-r--r-- 1 oliver users 12348187 Feb 5 11:42 test.chrlens
-rw-r--r-- 1 oliver users 1200000008 Feb 5 10:43 test.ref.bf
-rw-r--r-- 1 oliver users 2300000008 Feb 5 10:43 test.ref.bf.lite.bf
-rw-r--r-- 1 oliver users 0 Feb 5 14:47 test.ref.dict
-rw-r--r-- 1 oliver users 140000008 Feb 5 11:41 test.snp.bf
-rw-r--r-- 1 oliver users 0 Feb 5 11:42 test.snp.dict
All of the test files (in /vargeno/test) run fine and reproduce the provided output files. I'm running on Ubuntu 18.04.5 in a conda environment with the following packages:
I've been trying to run vargeno on non-human data and running into problems at the indexing stage. No error is reported during the process, but the
.dict
files are both empty, and so the genotyping step fails.I'm working with a fragmentary reference assembly of a grasshopper genome, so both the bioinformatic and biological properties of the data are not at all what vargeno was designed for.
Do you have any tips for troubleshooting? Attached (here) is a sample of the
.vcf
input. Since my data is not human data and I'm obviously not working with dbSNP it's a little unclear how to properly format this file. Variants were detected with freebayes in the first instance.Here is the terminal output:
And here are the output files:
All of the test files (in
/vargeno/test
) run fine and reproduce the provided output files. I'm running on Ubuntu 18.04.5 in a conda environment with the following packages:The text was updated successfully, but these errors were encountered: