Publication #411284

Research Project: Biology and Management of Dipteran Pests of Livestock and Other Animals

Location: Arthropod-borne Animal Diseases Research

Title: FAIR Header Reference genome: A TRUSTworthy standard

Author
Wright, Adam - Ontario Cancer Institute
Wilkinson, Mark - Universidad Politécnica de Madrid
Mungall, Chris - Berkeley National Laboratory
Cain, Scott - Ontario Cancer Institute
Richards, Stephen - Baylor College of Medicine
Sternberg, Paul - California Institute of Technology
Provin, Ellen - Texas A&M University
Jacobs, Jonathan - American Type Culture Collections
Geib, Scott
Raciti, Daniela - California Institute of Technology
Yook, Karen - California Institute of Technology
Stein, Lincoln - Ontario Cancer Institute
Molik, David

Submitted to: Briefings in Bioinformatics
Publication Type: Peer Reviewed Journal
Publication Acceptance Date: 2/22/2024
Publication Date: 3/28/2024
Citation: Wright, A., Wilkinson, M.D., Mungall, C., Cain, S., Richards, S., Sternberg, P., Provin, E., Jacobs, J.L., Geib, S.M., Raciti, D., Yook, K., Stein, L., Molik, D.C. 2024. FAIR Header Reference genome: A TRUSTworthy standard. Briefings in Bioinformatics. 25(3). Article bbae122. https://doi.org/10.1093/bib/bbae122.
DOI: https://doi.org/10.1093/bib/bbae122

Interpretive Summary: Scientists often share genomic information to study diseases and develop treatments. However, the different websites that store this information may not work well together, which makes it hard to combine the data and learn more from it. Important details about where the data came from can also get lost. To fix this problem, a new set of rules called the FAIR Header Reference genome (FHR) was created to make sure genomic data is shared in a way that is easy to find, access, and use across different websites. FHR is also designed to preserve important details about the data so researchers can see where it came from. Using FHR is easy because it supports common ways of storing data and requires only a few important details to be shared.

Technical Abstract: The lack of interoperable data standards among reference genome data-sharing platforms inhibits cross-platform analysis while increasing the risk of data provenance loss. Here, we describe the FAIR-bioHeaders Reference genome (FHR), a metadata standard guided by the principles of Findability, Accessibility, Interoperability, and Reuse (FAIR) in addition to the principles of Transparency, Responsibility, User focus, Sustainability, and Technology (TRUST). The objective of FHR is to provide an extensive set of data serialisation methods and minimum data field requirements while still maintaining extensibility, flexibility, and expressivity in an increasingly decentralised genomic data ecosystem. The effort needed to implement FHR is low; FHR's design philosophy ensures easy implementation while retaining the benefits gained from recording both machine and human-readable provenance.