NIH | National Cancer Institute | NCI Wiki  


Proteomics Data Management has three subprojects: the Proteomic Data Commons (PDC), the Clinical Proteomics Tumor Analysis Consortium (CPTAC) Data Assay Portal, and the CPTAC Data Coordinating Center (DCC). PDC starts on April 1, 2020, while the other two projects will start on August 31, 2021. All three projects are already underway with ESAC, Inc. The PDC is part of NCI's Cancer Research Data Commons (CRDC). Reports are provided to the Division of Cancer Biology (DCB) and through the BIDS monthly reporting system. The anticipated period of performance is 4 years, with severable funding. EPLC is needed, and an ATO is already in place (held with ESAC).


Project Details:

YT20-068 (FCAS ID is NCI-CSSI-20-YT0057)

Task Order Title and Number: NCI Operational Task Order: 75N91019F00129

Contract type: Severable

Base period of performance: Date of Award - August 30, 2020

PIDS:

  • 600.129.27.01.014.002.0001 – Proteomic Data Mgmt
  • 600.129.27.01.014.002.0001.001 – Proteomic Data Mgmt;Gen
  • 600.129.27.01.014.002.0001.007 – Proteomic Data Mgmt;TPM
  • 600.129.27.01.014.002.0001.003 – Proteomic Data Mgmt;DEV


Scope:

The vision for a Cancer Research Data Commons (CRDC) is a virtual, expandable infrastructure that will eventually support collaboration among researchers, computational scientists, and tool developers. It will house multiple cloud-based Commons Nodes for multiple data types, initially including genomic, imaging, and proteomics data. In the future, additional nodes will support other data types. Genomics data in the CRDC is supported by the Genomic Data Commons (GDC), which provides a means of data submission, user interfaces, and search & visualization tools. The proteomics data node, as described below, will incorporate data from the Clinical Proteomics Tumor Analysis Consortium (CPTAC) and other sources. The Office of Cancer Clinical Proteomics Research works closely with CBIIT to integrate the PDC with CRDC.

The Proteomic Data Commons (PDC) is a node of the CRDC that collects, harmonizes, and hosts proteomic data across a variety of sample types. It incorporates data from various programs, including CPTAC and the International Cancer Proteogenome Consortium (ICPC), where the majority of data are generated from quantitative, mass spectrometry-based targeted proteomic assays. The PDC is open access and provides a space to query, visualize, and download available datasets. PDC has migrated to production and can be viewed here: https://pdc.esacinc.com/pdc/

Anticipated PDC benefits:

  • Enhance access to NCI and research community-generated cancer proteomic data and tools
  • Take advantage of increases in compute efficiency and scalability
  • Accelerate bioinformatics tool development
  • Serve the individual researcher who has limited or no access to high-performance computing
  • Make proteomic data and analysis tools readily accessible to the broad cancer research community
  • Support collaboration across research institutes and scientific disciplines
  • Reduce data storage redundancies



To do:

  • Hold a project kickoff meeting


Dates To Remember:


Contacts:


Excerpt Include: GDL:Cancer Data Service Project Plan

PDC

United States Human Proteomic Organization (US HUPO)

Clinical Proteomics Tumor Analysis Consortium (CPTAC)

The Applied Proteogenomics OrganizationaL Learning and Outcomes (APOLLO)

International Cancer Proteogenome Consortium (ICPC) 


The Global Alliance for Genomics and Health (GA4GH)

Fence (GitHub)

IndexD (GitHub)