-
Notifications
You must be signed in to change notification settings - Fork 14
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Minimum length of scaffolds? #10
Comments
Hi, There is no hard-coded minimum length. All the cutoffs can be specified in the parameters file that is given to Panseq. The If this doesn't answer your question, would you be able to provide a small example? Thanks, |
Hi Chad, Below is a list of genomes with a minimum scaffold length of 1kb. The true length of the genome input vs. the length of the genome as interpreted by the binary_table is then shown, as well as the difference in genome size and the reduction in genome size when comparing the true genome length vs. the genome length as determined by the sum of all fragments in the core and accessory genomes. The number of scaffolds in the input genome is also shown
As you can see, two of the genomes have a pretty large difference in size, >10% However, below is the same analysis, but only counting scaffolds greater than 1.5kb when calculating the length of the original genome:
The two outliers now show much less difference in size, which is what made me think that there was a hard-coded cutoff in the code. This is really no longer a problem for me, as I can just set my minimum contig length threshold to 2kb, but I was just wondering if you had any thoughts about why this could be happening? Thanks for your help- I'm really liking the program a lot |
Hi Matt, Just to check: Chad |
Hi Chad, Yes, that's exactly right -Matt |
Hello,
I was wondering if there is a minimum length requirement for scaffolds somewhere hard-coded in the program?
I performed a test where I compared the length of genomes before running Panseq on them and after (by summing up all of the fragments each genome had in the Pangenome, as well as the length of the fragments). For the most part these numbers agreed very well (1-2% difference), but with highly fragmented genomes (with a lot of small pieces) there was as much as a 13% reduction in genome size.
The smallest scaffolds I used as input are 1kb, but I was wondering if I should make that even higher?
Thank you for the wonderful tool,
-Matt
The text was updated successfully, but these errors were encountered: