Book contents
- Frontmatter
- Contents
- List of contributors
- Foreword: The future of drug discovery and healthcare
- Acknowledgments
- 1 The art and science of the drug discovery pipeline
- 2 Computational approaches to drug target identification
- 3 Understanding human disease knowledge through text mining
- 4 Integrating translational biomarkers into drug development
- 5 Computational phenotypic assessment of small molecules in drug discovery
- 6 Data visualization and the DDP process
- 7 Information visualization – important IT considerations
- 8 Example of computational biology at the new drug application (NDA) and regulatory approval stages
- 9 Clinical trial failures and drug repositioning
- Appendix I Additional knowledge-based analysis approaches
- Appendix II Open source tools and public data sources
- Index
- Plate section
Appendix II - Open source tools and public data sources
Published online by Cambridge University Press: 05 February 2016
- Frontmatter
- Contents
- List of contributors
- Foreword: The future of drug discovery and healthcare
- Acknowledgments
- 1 The art and science of the drug discovery pipeline
- 2 Computational approaches to drug target identification
- 3 Understanding human disease knowledge through text mining
- 4 Integrating translational biomarkers into drug development
- 5 Computational phenotypic assessment of small molecules in drug discovery
- 6 Data visualization and the DDP process
- 7 Information visualization – important IT considerations
- 8 Example of computational biology at the new drug application (NDA) and regulatory approval stages
- 9 Clinical trial failures and drug repositioning
- Appendix I Additional knowledge-based analysis approaches
- Appendix II Open source tools and public data sources
- Index
- Plate section
Summary
As technology evolves, the drug discovery science is quickly becoming data science rather than purely biological sciences or chemistry in the traditional sense. Scientists are generating experimental and clinical data at ever-faster paces, many of which become publicly available. How to best use these data and realize the full potential of the data becomes a continuous topic throughout the drug discovery process.
In this section, rather than provide an exhaustive list of the resources available to the scientist, we focus on the few widely used (free or commercial) data sources and analysis tools.
Literature and textual resources
1. eUtils
The Entrez Programming Utilities (E-utilities, or eUtils) are a set of nine server-side programs that provide a stable interface into the Entrez query and database system at the National Center for Biotechnology Information (NCBI). The E-utilities use a fixed URL syntax that translates a standard set of input parameters into the values necessary for various NCBI software components to search for and retrieve the requested data. The E-utilities are therefore the structured interface to the Entrez system, which currently includes 38 databases covering a variety of biomedical data, including nucleotide and protein sequences, gene records, three-dimensional molecular structures, and the biomedical literature.
2. EMBASE
With extensive international journal and conference coverage, Embase is a key resource for generating systematic reviews, supporting effective evidence-based medicine and drug and medical device tracking. Embase facilitates the clinical decision-making process and allows you to get to market faster, while still ensuring the required drug safety and pharmacovigilance.
3. MeSH
MeSH is the National Library of Medicine's controlled vocabulary thesaurus. It consists of sets of terms naming descriptors in a hierarchical structure that permits searching at various levels of specificity.
Genomics resources
1. OMIM
OMIM is a comprehensive, authoritative compendium of human genes and genetic phenotypes that is freely available and updated daily. OMIM is authored and edited at the McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, under the direction of Dr. Ada Hamosh. Its official home is omim.org.
2. dbVar
A database of genomic structural variation including insertions, deletions, duplications, inversions, deletion-insertions, mobile element insertions, translocations, and complex rearrangements.
3. GEO
GEO is a public functional genomics data repository supporting MIAME-compliant data submissions. Array- and sequence-based data are accepted. Tools are provided to help users query and download experiments and curated gene expression profiles.
- Type
- Chapter
- Information
- Publisher: Cambridge University PressPrint publication year: 2016