Submit a preprint

307

HairSplitter: haplotype assembly from long, noisy readsuse asterix (*) to get italics
Roland Faure, Dominique Lavenier, Jean-François FlotPlease use the format "First name initials family name" as in "Marie S. Curie, Niels H. D. Bohr, Albert Einstein, John R. R. Tolkien, Donna T. Strickland"
2024
<p>Long-read assemblers face challenges in discerning closely related viral or<br>bacterial strains, often collapsing similar strains in a single sequence. This limitation has<br>been hampering metagenome analysis, where diverse strains may harbor crucial functional<br>distinctions.<br>We introduce a novel software, HairSplitter, designed to retrieve strains from a<br>strain-oblivious assembly and long reads. The method uses a custom variant calling pro-<br>cess to operate with erroneous long reads and introduces a new read clustering algorithm<br>to recover an a priori unknown number of strains. On noisy long reads, HairSplitter can<br>recover more strains while being faster than state-of-the-art tools, both in the viral and the<br>bacterial case.</p>
https://doi.org/10.5281/zenodo.10495032You should fill this box only if you chose 'All or part of the results presented in this preprint are based on data'. URL must start with http:// or https://
https://You should fill this box only if you chose 'Scripts were used to obtain or analyze the results'. URL must start with http:// or https://
https://github.com/rolandfaure/hairsplitterYou should fill this box only if you chose 'Codes have been used in this study'. URL must start with http:// or https://
Metagenomes; Metaviromes; Haplotyping; Genome assembly; Strain separation
NonePlease indicate the methods that may require specialised expertise during the peer review process (use a comma to separate various required expertises).
Design and analysis of algorithms, Development, Genomics and Transcriptomics, Probability and statistics
Paola Bonizzoni bonizzoni@disco.unimib.it, Tobias Marschall tobias.marshall@hhu.de, Nicola Segata nicola.segata@unitn.it, Sergey Koren sergey.koren@nih.gov, Chirag Jain chirag@iisc.ac.in, Jue Ruan jys@agis.org.cn, Sergey Koren [sergey.koren@nih.gov] suggested: I suggest Arang Rhie (Arang.rhie@nih.gov) or Dmitry Antipov (dmitry.antipov@nih.gov) as alternate reviewers. No need for them to be recommenders of PCI Math Comp Biol. Please do not suggest reviewers for whom there might be a conflict of interest. Reviewers are not allowed to review preprints written by close colleagues (with whom they have published in the last four years, with whom they have received joint funding in the last four years, or with whom they are currently writing a manuscript, or submitting a grant proposal), or by family members, friends, or anyone for whom bias might affect the nature of the review - see the code of conduct
e.g. John Doe [john@doe.com]
2024-02-15 10:17:04
Giulio Ermanno Pibiri