Review of proteomics with applications to genetic epidemiology
- 23 January 2003
- journal article
- review article
- Published by Wiley in Genetic Epidemiology
- Vol. 24 (2) , 83-98
- https://doi.org/10.1002/gepi.10226
Abstract
Mapping of the human genome has the potential to transform the traditional methods of genetic epidemiology. The complete draft sequence of the 3.3 billion nucleotides comprising the genome is now available over the Internet, including the location and nearly complete sequence of the 26,000 to 31,000 protein‐encoding genes. However, aside from water, almost everything in the human body is either made of, or by, proteins. Although the DNA code provides the instructions for their amino acid sequence, there are an estimated 1.5 million proteins. Thus, the correlation between DNA sequence and protein is low, reflecting alternate splicing as well as post‐translational modification. The purpose of this article is to explore ways in which the emerging field of proteomics, the study of proteins in a cell, may inform our approach to gene mapping. This article reviews the various technical approaches currently available for proteomics. Technologies are available to quantify protein expression (and compare normal versus disease states), identify proteins through comparison with sequence information in databases or direct sequencing (which can then be mapped to chromosomal locations to ensure appropriate markers), elucidate protein‐protein interactions (which may underlie disease), determine localization of proteins within the cell (abnormal trafficking of proteins could have an inherited basis), and characterize modifications of proteins (which is relevant to modifier gene candidates). Several examples are presented to illustrate the potential application of proteomics to the field of genetic epidemiology, and we conclude with various considerations regarding design and analysis. Genet Epidemiol 24:83–98, 2003.Keywords
This publication has 76 references indexed in Scilit:
- Comparative assessment of large-scale data sets of protein–protein interactionsNature, 2002
- Gene expression profiling predicts clinical outcome of breast cancerNature, 2002
- Probing the proteome – protein arrays and their applicationsDrug Discovery Today, 2001
- Solution and chip arrays in protein profilingTrends in Biotechnology, 2001
- A map of human genome sequence variation containing 1.42 million single nucleotide polymorphismsNature, 2001
- Initial sequencing and analysis of the human genomeNature, 2001
- The language of covalent histone modificationsNature, 2000
- A strategy for the identification of proteins localized to subcellular spaces: Application to E. coli periplasmic proteinsInternational Journal of Mass Spectrometry and Ion Processes, 1997
- An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein databaseJournal of the American Society for Mass Spectrometry, 1994
- Basic local alignment search toolJournal of Molecular Biology, 1990