NIH’s Proteogenomic Dataset to Propel Cancer Research
Pan-cancer dataset built from multiomic analyses and proteomic studies is publicly available for global use
The National Institutes of Health is releasing a comprehensive dataset that standardizes genomic, proteomic, imaging, and clinical data from individual studies of more than 1,000 tumors across 10 cancer types.
Researchers from around the world will be able to use this publicly available resource to uncover new molecular insights into how cancers develop and progress. The dataset was generated by the Clinical Proteomic Tumor Analysis Consortium (CPTAC) at the National Cancer Institute, part of the National Institutes of Health.
The pan-cancer proteogenomic dataset, described in a paper published in Cancer Cell, builds on decades of technological advances in proteomic science. The launch of this dataset supports the Biden-Harris Administration’s Cancer MoonshotSM goal of accelerating cancer research through improved data sharing.
Two additional research papers published in Cell by CPTAC investigators demonstrate the dataset's potential as a valuable resource for scientific discovery. In the first paper, multiomic analyses are used to link cancer driver mutations with protein patterns. The second paper delves into protein modifications that regulate cell signaling and physiology to show associations with DNA repair, metabolism, and immunity across different tumor types.
The pan-cancer proteogenomic dataset will be publicly available through the NCI Cancer Research Data Commons repositories.
- This press release was originally published on the National Institutes of Health website