NIH | National Cancer Institute | NCI Wiki  

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Data repositories are important tools in cancer research, providing . They provide safe and sustainable locations to store data, providing provide access to input data for meta-analyses, and allowing allow researchers to collaborate and share information across a common resource.

The problem is that repositories are often not flexible enough to store data that do not conform to known standards. Genomics, for example, benefits from community genomics standards groups that develop standard programmatic interfaces for managing, describing, and annotating genomic data (attribution: https://gdc.cancer.gov/about-data/data-standards), but not all . Many other fields of study and application are as lucky. Emerging data types such as...do not yet have data storage standardsare without such standards, yet generate significant amounts of data.

The Center for Strategic Scientific Initiatives Data Coordinating Center (CSSI DCC) stores and manages access to data generated in support of cancer research funded or supported by the CSSI. Frederick National Lab, under the leadership of Andrew Quong, developed the CSSI DCC Portal, the repository for CSSI DCC data. The data currently in the DCC conform to the standard Investigation-Study-Assay tab-delimited format (ISA-TAB) format, which describes a scientific investigation, its study or studies, and each study's assay(s).

...