Researchers at the University of Maryland School of Medicine (UMSOM) coauthored a study, published February 25, 2021, in the journal Science, that details the sequencing of 32 full human genomes. The reference dataset draws on individuals from around the world and better captures the genetic diversity of the human species. Among other applications, the work will enable population-specific studies of genetic predispositions to human diseases, as well as the discovery of more complex forms of genetic variation.
Twenty years ago this month, the International Human Genome Sequencing Consortium announced the first draft of the human genome reference sequence. The Human Genome Project, as it was called, required 11 years of work and involved more than 1,000 scientists from 40 countries. This reference, however, did not represent a single individual; it was a composite of several people and could not accurately capture the complexity of human genetic variation.
Building on this, scientists have conducted several sequencing projects over the last 20 years to identify and catalog genetic differences between an individual and the reference genome. Those projects usually focused on small, single-base changes and missed larger genetic alterations. Current technologies are beginning to detect and characterize these larger differences, called structural variants, such as insertions of new genetic material. Structural variants are more likely than smaller genetic differences to interfere with gene function.
The new study in Science presents a significantly more comprehensive reference dataset, obtained using a combination of advanced sequencing and mapping technologies. The dataset comprises 32 assembled human genomes representing 25 different human populations from across the globe. Importantly, each genome was assembled without guidance from the first human genome composite. As a result, the new dataset better captures genetic differences among human populations.
"We've entered a new era in genomics where whole human genomes can be sequenced with exciting new technologies that provide more substantial and accurate reads of the DNA bases," said study coauthor Scott Devine, PhD, associate professor of medicine at UMSOM and faculty member of the Institute of Genome Science (IGS). "This is allowing researchers to study areas of the genome that previously were not accessible but are relevant to human traits and diseases."
IGS's Genome Resource Center (GRC) was one of three sequencing centers, along with The Jackson Laboratory and the University of Washington, that generated the data using a new sequencing technology recently developed by Pacific Biosciences. The GRC was one of only five early-access centers asked to test the new platform.
Devine helped lead the sequencing efforts for this study and also led the subgroup of authors who discovered the presence of "mobile elements" (pieces of DNA that can move around and be inserted into other areas of the genome). Other members of IGS at UMSOM are among the 65 coauthors. Luke Tallon, PhD, scientific director of the GRC, worked with Devine to generate one of the first human genome sequences produced on the Pacific Biosciences platform, which was contributed to this study. Nelson Chuang, a graduate student in Devine's lab, also contributed to the project.
"The landmark new research demonstrates a giant step forward in our understanding of the underpinnings of genetically-driven health conditions," said E. Albert Reece, MD, PhD, MBA, executive vice president for medical affairs, University of Maryland Baltimore, and the John Z. and Akiko K. Bowers Distinguished Professor and dean, UMSOM. "This advance will hopefully fuel future studies aimed at understanding the impact of human genome variation on human diseases."
- This press release was originally published on the University of Maryland School of Medicine website