Submit a preprint


HMMploidy: inference of ploidy levels from short-read sequencing datause asterix (*) to get italics
Samuele Soraggi, Johanna Rhodes, Isin Altinkaya, Oliver Tarrant, Francois Balloux, Matthew C Fisher, Matteo FumagalliPlease use the format "First name initials family name" as in "Marie S. Curie, Niels H. D. Bohr, Albert Einstein, John R. R. Tolkien, Donna T. Strickland"
<p>The inference of ploidy levels from genomic data is important to understand molecular mechanisms underpinning genome evolution. However, current methods based on allele frequency and sequencing depth variation do not have power to infer ploidy levels at low- and mid-depth sequencing data, as they do not account for data uncertainty. Here we introduce HMMploidy, a novel tool that leverages the information from multiple samples and combines the information from sequencing depth and genotype likelihoods. We demonstrate that HMMploidy outperforms existing methods in most tested scenarios, especially at low-depth with large sample size. We apply HMMploidy to sequencing data from the pathogenic fungus <em>Cryptococcus neoformans </em>and retrieve pervasive patterns of aneuploidy, even when artificially downsampling the sequencing data. We envisage that HMMploidy will have wide applicability to low-depth sequencing data from polyploid and aneuploid species.&nbsp;</p> should fill this box only if you chose 'All or part of the results presented in this preprint are based on data'. URL must start with http:// or https:// should fill this box only if you chose 'Scripts were used to obtain or analyze the results'. URL must start with http:// or https:// should fill this box only if you chose 'Codes have been used in this study'. URL must start with http:// or https://
high-throughput DNA sequencing, ploidy, polyploidy, aneuploidy, hidden Markov model, genotype likelihood
NonePlease indicate the methods that may require specialised expertise during the peer review process (use a comma to separate various required expertises).
Design and analysis of algorithms, Evolutionary Biology, Genetics and population Genetics, Probability and statistics
No need for them to be recommenders of PCI Math Comp Biol. Please do not suggest reviewers for whom there might be a conflict of interest. Reviewers are not allowed to review preprints written by close colleagues (with whom they have published in the last four years, with whom they have received joint funding in the last four years, or with whom they are currently writing a manuscript, or submitting a grant proposal), or by family members, friends, or anyone for whom bias might affect the nature of the review - see the code of conduct
e.g. John Doe []
2021-07-01 05:26:31
Alan Rogers