Revealing the human proteome
NIH | JUNE 10, 2014
Researchers completed a draft map of the human proteome—the set of all proteins in the human body. The accomplishment will help advance a broad range of research into human health and disease.
In 2003, the Human Genome Project completed its map of the human genome, the full set of human DNA, including all the genes. Genomics has since driven many advances in medical science.
Genes control the most basic functions of the cell, including what proteins to make and when. Researchers have identified more than 20,000 protein-coding genes. However, scientific understanding of the proteome has lagged behind that of the genome, partly because of the proteome’s complexities.
The relationship between genes and proteins isn’t a simple matter of one gene coding for one protein. Stretches of DNA can be read and translated into proteins in different ways. Proteins are also more difficult to sequence than genes.
Several projects are now underway to characterize the human proteome. In their new study, a team of researchers headed by Drs. Akhilesh Pandey at Johns Hopkins University and Harsha Gowda at the Institute of Bioinformatics in Bangalore, India, used an advanced form of mass spectrometry to sequence proteins and create a draft map of the human proteome.
The team examined 30 normal human tissue and cell types: 17 adult tissues, 7 fetal tissues, and 6 blood cell types. Samples from 3 people per tissue type were processed through several steps, and the resulting protein fragments (peptides) were analyzed on high-resolution Fourier-transform mass spectrometers. The amino acid sequences obtained were then compared to known sequences.
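The comparison step can be pictured with a small sketch. It is only an illustration and not the team's actual pipeline: it assumes peptide sequences are already known and looks for exact matches inside a made-up set of protein sequences. The function name `match_peptides`, the protein names, and all sequences are hypothetical.

```python
def match_peptides(peptides, known_proteins):
    """Map each peptide to the names of known proteins that contain it.

    Toy exact-substring matching; real proteomics software scores
    mass spectra against a database rather than comparing strings.
    """
    matches = {}
    for peptide in peptides:
        matches[peptide] = sorted(
            name for name, seq in known_proteins.items() if peptide in seq
        )
    return matches


# Hypothetical reference sequences (not real proteins)
known_proteins = {
    "protein_A": "MKTAYIAKQRQISFVK",
    "protein_B": "MSHFSRQLEERLGLIEVQ",
}

# Hypothetical peptides read off the instrument
peptides = ["AYIAK", "QLEER", "WWWWW"]

result = match_peptides(peptides, known_proteins)

# Peptides with no match in the known sequences
unmatched = [p for p, hits in result.items() if not hits]
```

In this toy setup, a peptide that matches nothing in the reference set ends up in `unmatched`. In a real study, such peptides would need much closer scrutiny before being treated as evidence of a previously unrecognized protein.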
The resulting draft human proteome map includes proteins encoded by more than 17,000 genes—about 84% of the total known protein-coding genes. Among these are hundreds of proteins from regions already known to encode other proteins. The map also includes 193 novel proteins from regions previously thought to be non-coding.
“Housekeeping genes” that are expressed in all tissues and cell types have been thought to be involved in basic cellular functions. However, the resulting “housekeeping proteins” haven’t been well understood.
The team detected proteins encoded by 2,350 genes in all of the tissues and cell types examined. These housekeeping proteins made up about 75% of total protein mass. They included histones, ribosomal proteins, metabolic enzymes, and cytoskeletal proteins.
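Finding genes whose proteins turn up in every sample amounts to taking the intersection of the per-tissue detection lists. A minimal sketch of that idea, assuming each tissue's detected genes are available as a set; the function name `housekeeping_genes`, the tissue labels, and the gene names are placeholders, not data from the study:

```python
def housekeeping_genes(detected_by_tissue):
    """Return genes whose proteins were detected in every tissue or cell type."""
    gene_sets = [set(genes) for genes in detected_by_tissue.values()]
    if not gene_sets:
        return set()
    # Keep only the genes present in all of the sets
    return set.intersection(*gene_sets)


# Hypothetical detections per tissue or cell type
detected_by_tissue = {
    "liver": {"GENE1", "GENE2", "GENE3"},
    "heart": {"GENE1", "GENE3", "GENE4"},
    "b_cells": {"GENE1", "GENE3"},
}

shared = housekeeping_genes(detected_by_tissue)
```

Here `shared` holds only the genes found in all three placeholder samples. A gene missing from even one sample is excluded.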
The study also revealed new insights into how genes are expressed. For instance, nearly 200 genes begin at locations other than those predicted based on genetic sequence.