Skip to main content
ARS Home » Northeast Area » Orono, Maine » National Cold Water Marine Aquaculture Center » Research » Publications at this Location » Publication #398756

Research Project: Genetic Improvement of North American Atlantic Salmon and the Eastern Oyster for Aquaculture Production

Location: National Cold Water Marine Aquaculture Center

Title: A second unveiling: haplotig masking of the eastern oyster genome improves population-level inference

Author
item PURITZ, JON - University Of Rhode Island
item GUO, XIMING - Rutgers University
item HARE, MATT - Cornell University
item HE, YAN - Rutgers University
item HILLIER, LADEANA - Washington University
item JIN, SHUBO - Rutgers University
item LIU, MING - Rutgers University
item LOTTERHOS, KATIE - Northeastern University
item MINX, PAT - Washington University
item MODAK, TEJASHREE - University Of Rhode Island
item Proestou, Dina
item RICE, EDWARD - Washington University
item TOMLISON, CHAD - Washington University
item WARREN, WESLEY - Washington University
item WITKOP, ERIN - University Of Rhode Island
item ZHAO, HONGGANG - Cornell University
item GOMEZ-CHIARRI, MARTA - University Of Rhode Island

Submitted to: Molecular Ecology Resources
Publication Type: Peer Reviewed Journal
Publication Acceptance Date: 3/20/2023
Publication Date: 4/25/2023
Citation: Puritz, J., Guo, X., Hare, M., He, Y., Hillier, L., Jin, S., Liu, M., Lotterhos, K., Minx, P., Modak, T., Proestou, D.A., Rice, E., Tomlison, C., Warren, W., Witkop, E., Zhao, H., Gomez-Chiarri, M. 2023. A second unveiling: haplotig masking of the eastern oyster genome improves population-level inference. Molecular Ecology Resources. https://doi.org/10.1111/1755-0998.13801.
DOI: https://doi.org/10.1111/1755-0998.13801

Interpretive Summary: High-quality, annotated reference genomes provide a foundation for current and future research. Genome assembly algorithms perform best when the sequenced material is inbred with low levels of heterozygosity. Marine invertebrates are among the most heterozygous, polymorphic species, making contiguous genome assembly a challenge. Here we report post-hoc improvement of the eastern oyster, Crassostrea virginica, genome that resulted in a complete, highly contiguous, more acccurate reference assembly. The post-hoc improvement involved masking haplotigs, or polymorphic alleles of the same genetic locus that were incorporated in the original assembly as distinct loci. Removing haplotigs had a large impact on downstream analyses including SNP discovery, estimates of genetic diversity, and population genetic analyses. Moving forward, the haplotic masked genome should be applied to research projects aimed at supporting eastern oyster aquaculture.

Technical Abstract: Genome assembly can be challenging for species, such as terrestrial arthropods and broadcast-spawning marine invertebrates, that are characterized by high amounts of polymorphism and heterozygosity, large effective population sizes, and low levels of population differentiation. High levels of heterozygosity can result in genome mis-assemblies and a larger than expected genome size due to the haplotig versions of a single locus being assembled as separate loci. Here, we assembled and annotated the first chromosome-level genome for the eastern oyster, Crassostrea virginica. Publicly released and annotated in 2017, the assembly is still one of the most complete and contiguous molluscan genomes to date. We also developed a post hoc method to break up chimeric contigs and mask haplotigs in published heterozygous genomes and evaluated improvements to the accuracy of downstream genome analysis. Masking haplotigs had a large impact on SNP discovery and estimates of nucleotide diversity and had more subtle and nuanced effects on estimates of heterozygosity, population structure analysis, and outlier detection. We show that haplotig-masking can be a powerful tool for improving genomes and we present a halpotig-masked genome assembly for the eastern oyster that will be a critical resource for foundational research into molluscan adaptation to a changing environment and for selective breeding for the aquaculture industry.