Author(s)
Brandon S Gaut
Published 19 Projects
Abiotic Stress Genetics Scaffolding Evolutionary Biology Plant Biology
Content
Background: Despite marked recent improvements in long-read sequencing technology, the assembly of diploid genomes remains a difficult task. A major obstacle is distinguishing between alternative contigs that represent highly heterozygous regions. If primary and secondary contigs are not properly identified, the primary assembly will overrepresent both the size and complexity of the genome, which complicates downstream analysis such as scaffolding. Results: Here we illustrate a new method, which we call HapSolo, that identifies secondary contigs and defines a primary assembly based on multiple pairwise contig alignment metrics. HapSolo evaluates candidate primary assemblies using BUSCO scores and then distinguishes among candidate assemblies using a cost function. The cost function can be defined by the user but by default considers the number of missing, duplicated and single BUSCO genes within the assembly. HapSolo performs hill climbing to minimize cost over thousands of candidate assemblies. We illustrate the performance of HapSolo on genome data from three species: the Chardonnay grape (Vitis vinifera), with a genome of 490Mb, a mosquito (Anopheles funestus; 200Mb) and the Thorny Skate (Amblyraja radiata; 2,650 Mb). Conclusions: HapSolo rapidly identified candidate assemblies that yield improvements in assembly metrics, including decreased genome size and improved N50 scores. Contig N50 scores improved by 35%, 9% and 9% for Chardonnay, mosquito and the thorny skate, respectively, relative to unreduced primary assemblies. The benefits of HapSolo were amplified by down-stream analyses, which we illustrated by scaffolding with Hi-C data. We found, for example, that prior to the application of HapSolo, only 52% of the Chardonnay genome was captured in the largest 19 scaffolds, corresponding to the number of chromosomes. After the application of HapSolo, this value increased to ~84%. The improvements for mosquito scaffolding were similar to that of Chardonnay (from 61% to 86%), but even more pronounced for thorny skate. We compared the scaffolding results to assemblies that were based on another published method for identifying secondary contigs, with generally superior results for HapSolo. ### Competing Interest Statement The authors have declared no competing interest.
More Projects
Loren Frank
13 views • 2 years ago
Global Immunotalks
390 views • 2 years ago
Laurel Yohe
2 views • 2 years ago
Global Immunotalks
130 views • 2 years ago
Jignesh H. Parmar
0 views • 2 years ago
Winston A. Haynes
0 views • 2 years ago
Noam Mazor
0 views • 2 years ago
Global Immunotalks
182 views • 2 years ago
Cem Yuksel
345 views • 2 years ago
Oscar Gonzalez-Recio
3 views • 2 years ago
Please pick a style:
Sydney C. Weiser. (2021, Nov 2).HapSolo: An optimization approach for removing secondary haplotigs during diploid genome assembly and scaffolding[Video]. Scitok. https://scitok.com/project/p/1dd6162e
A. Solares Edwin. "HapSolo: An optimization approach for removing secondary haplotigs during diploid genome assembly and scaffolding" Scitok, uploaded by C. Weiser Sydney, 2 Nov, 2021, https://scitok.com/project/p1dd6162e
Sydney C. Weiser. "HapSolo: An optimization approach for removing secondary haplotigs during diploid genome assembly and scaffolding" Scitok. (Nov 2, 2021). https://scitok.com/project/p/1dd6162e
Sydney C. Weiser (Nov 2, 2021). HapSolo: An optimization approach for removing secondary haplotigs during diploid genome assembly and scaffolding Scitok. https://scitok.com/project/p/1dd6162e
Sydney C. Weiser. HapSolo: An optimization approach for removing secondary haplotigs during diploid genome assembly and scaffolding[video]. 2021 Nov 2. https://scitok.com/project/p/1dd6162e
@online{al2006link, title={ HapSolo: An optimization approach for removing secondary haplotigs during diploid genome assembly and scaffolding }, author={ C. Weiser, Sydney }, organization={Scitok}, month={ Nov }, day={ 2 }, year={ 2021 }, url = {https://scitok.com/project/p/1dd6162e}, }