Skip to main content
ARS Home » Research » Publications at this Location » Publication #258229

Title: Systematic identification of conserved regulatory elements in upstream promoter regions of the cattle genome

Author
item Liu, Ge - George
item HANSON, RICHARD - Case Western Reserve University (CWRU)
item YANG, JIANQI - Case Western Reserve University (CWRU)

Submitted to: Meeting Abstract
Publication Type: Abstract Only
Publication Acceptance Date: 8/27/2010
Publication Date: 9/2/2010
Citation: Liu, G., Hanson, R.W., Yang, J. 2010. Systematic identification of conserved regulatory elements in upstream promoter regions of the cattle genome. Meeting Abstract. No. 10.

Interpretive Summary: We developed a systematic approach, combining position-specific weight matrixes (JASPAR) and phylogenetic footprinting algorithm (TFLOC), to identify transcription factor binding sites (TFBSs) in mammalian promoter regions. Using this approach, we studied TFBSs in human-cattle-dog alignments. To test its usage in other vertebrates including cattle, we applied it to various species combinations including human-chimpanzee-macaque, human-mouse-rat and human-cattle-dog. The best prediction was produced by the human-cattle-dog comparison with a higher sensitivity and a higher true-positive rate. These results highlight the importance of choosing species at proper evolutionary distance for comparative genomics studies. The cattle TFBS dataset will be made freely available at the earliest opportunity to the scientific community through a public accessible website.

Technical Abstract: Cross-species DNA sequence comparison is the primary approach to discover regulatory elements by identifying highly conserved sequences due to evolutionary constraints. Previously, we reported that a systematic approach, combining position-specific weight matrixes (JASPAR) and phylogenetic footprinting algorithm (TFLOC), was implemented and optimized to identify transcription factor binding sites (TFBSs) in mammalian promoter regions within human-mouse-rat alignments. To generate a dataset of conserved TFBSs in the cattle genome, we further applied this approach to human-cattle-dog alignments. We first estimated the impact of evolutionary distance on predictive power. TFLOC and PhastCons were applied to various species combinations including human-chimpanzee-macaque, human-mouse-rat and human-cattle-dog. Computational prediction was compared with previously known sites at diverse genomic loci. Those newly discovered sites were further confirmed by experimental verifications including gel shifting and reporter assays. The best prediction was produced by the human-cattle-dog comparison with a higher sensitivity and a higher true-positive rate. The closer human-chimpanzee-macaque comparison produced more spurious sites, while the more distant human-mouse-rat comparison had a lower sensitivity. These results highlight the importance of choosing species at proper evolutionary distance for comparative genomics studies. The cattle TFBS dataset will be made freely available at the earliest opportunity to the scientific community through a public accessible website.