Skip to main content
ARS Home » Midwest Area » Columbia, Missouri » Plant Genetics Research » Research » Publications at this Location » Publication #324578

Title: MeSH-informed enrichment analysis and MeSH-guided semantic similarity among functional terms and gene products in chicken

Author
item MOROTA, GOTA - University Of Nebraska
item Beissinger, Timothy
item PENAGARICANO, FRANCISCO - University Of Florida

Submitted to: G3, Genes/Genomes/Genetics
Publication Type: Peer Reviewed Journal
Publication Acceptance Date: 5/31/2016
Publication Date: 8/9/2016
Publication URL: https://handle.nal.usda.gov/10113/5508058
Citation: Morota, G., Beissinger, T.M., Penagaricano, F. 2016. MeSH-informed enrichment analysis and MeSH-guided semantic similarity among functional terms and gene products in chicken. G3, Genes/Genomes/Genetics. 6(8):2447-2453. doi: 10.1534/g3.116.031096.

Interpretive Summary: Medical Subject Headings (MeSH), the National Library of Medicine's controlled vocabulary for indexing scientific publications, is making inroads as a compliment to gene ontology (GO) for assisting in the interpretation of genomic studies. Here, we report the MeSH annotation of the chicken genome. We find that analyzing the genome using MeSH in addition to GO enhances biological interpretations that are possible from GO alone. Our findings show the benefits of using MeSH as an alternative choice of annotation in order to draw biological inferences from a list of genes of interest. The use of MeSH in conjunction with GO will be instrumental in facilitating the understanding of the genetic basis of complex traits not only in poultry, but also in other agricutural species, including crops. Such an understanding is also critical in breeding programs for imprved livestock and crops and this work will accelerate these efforts.

Technical Abstract: Such Biomedical vocabularies and ontologies aid in recapitulating biological knowledge. The annotation of gene products is mainly accelerated by Gene Ontology (GO) and more recently by Medical Subject Headings (MeSH). MeSH is the National Library of Medicine's controlled vocabulary and it is making inroads into gene annotation and enrichment analysis. The availability of MeSH annotation in farm animals poses both new challenges and opportunities for downstream analysis in functional genomic studies. Here we report the MeSH annotation of the chicken genome and we illustrate some features of different MeSH-based analyses, including MeSH-informed enrichment analysis and MeSH-guided semantic similarity among terms and gene products, using two lists of chicken genes available in public repositories. The two published datasets that were employed represent (i) differentially expressed genes detected in a genome-wide gene expression study and (ii) candidate genes under selective sweep or epistatic selection that were detected in a whole-genome scan study. The comparison of MeSH with GO overrepresentation analyses suggested not only that MeSH supports the findings obtained from GO analysis but also that MeSH is able to further enrich the representation of biological knowledge. Based on the hierarchical structures of MeSH and GO, we computed semantic similarities among vocabularies as well as semantic similarities among selected genes. The respective hierarchies of MeSH and GO yielded the similarity levels between significant functional terms, and the annotation of each term yielded then the measures of gene similarity. Our findings show the benefits of using MeSH as an alternative choice of annotation in order to draw biological inferences from a list of genes of interest. We argue that the use of MeSH in conjunction with GO will be instrumental in facilitating the understanding of the genetic basis of complex traits in poultry and other farm species.