Nature recently published the results of the world’s largest and most comprehensive study on the effects of common genetic variation on proteins circulating in the blood and how these associations can contribute to disease. This unprecedented population-scale investigation of proteins, powered by turning biological samples into data from the UK Biobank, will help scientists better understand how and why diseases develop, which could help drive the development of new diagnostics and treatments for a wide range of health conditions.
To develop this unique and unparalleled dataset, researchers measured the abundance of nearly 3,000 circulating proteins, many of which were previously difficult to capture, from over 54,000 participants in the UK Biobank—which has been collecting data and tracking the health of 500,000 volunteer participants enrolled between 2006 and 2010. The study identified over 14,000 associations between common genetic variants and proteins circulating in the blood, over 80 percent of which were previously unknown. Scientists worldwide will be able to access the proteomic data in the coming weeks.
This landmark study was commissioned, funded, and carried out by the Pharma Proteomics Project, a collaboration between 13 leading biopharmaceutical companies. The team analyzed the data, demonstrating the vast potential for future research using the study. These include:
Genome-wide association studies to build an open-access library of all the common gene variants that influence protein levels in blood. This can be used to study complex biological processes, find proteins that are key players in causing disease, identify new drug targets, potentially shorten development time for earlier-stage drug candidates, and increase success rates for clinical trials.
Profiling of blood protein levels across the top 20 most common health conditions in UK Biobank. This revealed that, for example, inflammatory proteins, long thought to contribute towards mental health conditions, are significantly higher in patients with depression.
Training machine learning models to determine how successfully blood proteins can predict demographic factors. This analysis found that blood proteins can predict age, sex, and body mass index (BMI) with very high accuracy. In the future, this technology could be used to compare chronological age with biological age and determine how this is related to the risk of future diseases.
Naomi Allen, MSc, DPhil, professor in epidemiology and chief scientist of UK Biobank, said: “This momentous study offers whole new avenues of research to the biomedical community, and is a leading example of how cross-sector collaboration can bring about results that are so much greater than the sum of their parts. All of these data will soon be available to bona fide researchers across the globe, alongside the existing genomic, lifestyle, and health data that UK Biobank holds for its 500,000 volunteers. I am excited for researchers to use these data to identify patterns that could transform our understanding of how diseases develop, and to identify potential new treatment pathways.”
Other future innovative work expected to result from this study includes using proteins circulating in the blood to predict whether someone will develop a disease several years before the condition occurs, classifying diseases into distinct biological subtypes, and using proteins in the blood to predict drug efficacy and safety prior to clinical trials.
- This press release was originally published on the UK Biobank website