8 - Gene Prediction

Published online by Cambridge University Press:  05 June 2012

Jin Xiong
Affiliation:
Texas A & M University

Summary

With the rapid accumulation of genomic sequence information, there is a pressing need to use computational approaches to accurately predict gene structure. Computational gene prediction is a prerequisite for detailed functional annotation of genes and genomes. The process includes detection of the location of open reading frames (ORFs) and delineation of the structures of introns as well as exons if the genes of interest are of eukaryotic origin. The ultimate goal is to describe all the genes computationally with near 100% accuracy. The ability to accurately predict genes can significantly reduce the amount of experimental verification work required.

However, this may still be a distant goal, particularly for eukaryotes, because many problems in computational gene prediction are still largely unsolved. Gene prediction, in fact, represents one of the most difficult problems in the field of pattern recognition. This is because coding regions normally do not have conserved motifs. Detecting coding potential of a genomic region has to rely on subtle features associated with genes that may be very difficult to detect.

Through decades of research and development, much progress has been made in the prediction of prokaryotic genes. A number of gene prediction algorithms for prokaryotic genomes have been developed with varying degrees of success. Algorithms for eukaryotic gene prediction, however, have yet to reach satisfactory results. This chapter describes a number of commonly used prediction algorithms, their theoretical basis, and their limitations.
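As a rough illustration of the ORF detection step mentioned in the summary, the sketch below scans all six reading frames of a DNA string for stretches that begin with ATG and end at the first in-frame stop codon. It is only a naive baseline under stated assumptions, not one of the algorithms the chapter describes: the function names, the `min_codons` threshold, and the use of the standard stop codons (TAA, TAG, TGA) are choices made here for illustration, and the scan ignores introns, alternative start codons, and any statistical measure of coding potential.

```python
# Naive six-frame ORF scan (illustrative sketch, not a real gene predictor).
# Assumptions: uppercase/lowercase A, C, G, T input; ATG as the only start
# codon; standard stop codons; no intron handling.

STOP_CODONS = {"TAA", "TAG", "TGA"}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def find_orfs(seq, min_codons=3):
    """Return a list of (strand, frame, start, end, orf_sequence) tuples.

    Coordinates are 0-based, half-open, and refer to the strand that was
    scanned (the reverse complement for strand "-"). The codon count used
    for the min_codons filter includes the start and stop codons.
    """
    seq = seq.upper()
    orfs = []
    for strand, s in (("+", seq), ("-", reverse_complement(seq))):
        for frame in range(3):
            start = None  # position of the first ATG in the current ORF
            for i in range(frame, len(s) - 2, 3):
                codon = s[i:i + 3]
                if start is None and codon == "ATG":
                    start = i
                elif start is not None and codon in STOP_CODONS:
                    end = i + 3
                    if (end - start) // 3 >= min_codons:
                        orfs.append((strand, frame, start, end, s[start:end]))
                    start = None  # look for the next ORF in this frame
    return orfs


if __name__ == "__main__":
    for orf in find_orfs("CCATGAAATTTTAGGG"):
        print(orf)
```

A scan like this may be a reasonable first pass for intron-free prokaryotic sequence, but it reports every ATG-to-stop stretch regardless of whether it is actually a gene, which is part of why the statistical methods surveyed in the chapter are needed.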

Type
Chapter
Information
Publisher: Cambridge University Press
Print publication year: 2006

  • Gene Prediction
  • Jin Xiong, Texas A & M University
  • Book: Essential Bioinformatics
  • Online publication: 05 June 2012
  • Chapter DOI: https://doi.org/10.1017/CBO9780511806087.009