Book contents
- Frontmatter
- Contents
- Preface
- SECTION I INTRODUCTION AND BIOLOGICAL DATABASES
- SECTION II SEQUENCE ALIGNMENT
- SECTION III GENE AND PROMOTER PREDICTION
- 8 Gene Prediction
- 9 Promoter and Regulatory Element Prediction
- SECTION IV MOLECULAR PHYLOGENETICS
- SECTION V STRUCTURAL BIOINFORMATICS
- SECTION V GENOMICS AND PROTEOMICS
- APPENDIX
- Index
- Plate section
- References
9 - Promoter and Regulatory Element Prediction
Published online by Cambridge University Press: 05 June 2012
- Frontmatter
- Contents
- Preface
- SECTION I INTRODUCTION AND BIOLOGICAL DATABASES
- SECTION II SEQUENCE ALIGNMENT
- SECTION III GENE AND PROMOTER PREDICTION
- 8 Gene Prediction
- 9 Promoter and Regulatory Element Prediction
- SECTION IV MOLECULAR PHYLOGENETICS
- SECTION V STRUCTURAL BIOINFORMATICS
- SECTION V GENOMICS AND PROTEOMICS
- APPENDIX
- Index
- Plate section
- References
Summary
An issue related to gene prediction is promoter prediction. Promoters are DNA elements located in the vicinity of gene start sites (which should not be confused with the translation start sites) and serve as binding sites for the gene transcription machinery, consisting of RNA polymerases and transcription factors. Therefore, these DNA elements directly regulate gene expression. Promoters and regulatory elements are traditionally determined by experimental analysis. The process is extremely time consuming and laborious. Computational prediction of promoters and regulatory elements is especially promising because it has the potential to replace a great deal of extensive experimental analysis.
However, computational identification of promoters and regulatory elements is also a very difficult task, for several reasons. First, promoters and regulatory elements are not clearly defined and are highly diverse. Each gene seems to have a unique combination of sets of regulatory motifs that determine its unique temporal and spatial expression. There is currently a lack of sufficient understanding of all the necessary regulatory elements for transcription. Second, the promoters and regulatory elements cannot be translated into protein sequences to increase the sensitivity for their detection. Third, promoter and regulatory sites to be predicted are normally short (six to eight nucleotides) and can be found in essentially any sequence by random chance, thus resulting in high rates of false positives associated with theoretical predictions.
- Type
- Chapter
- Information
- Essential Bioinformatics , pp. 113 - 124Publisher: Cambridge University PressPrint publication year: 2006