Skip Navigation
NIH | National Cancer Institute | NCI Wiki   New Account Help Tips
Skip to end of metadata
Go to start of metadata

A data freeze is the capture of publication data used and produced by an Analysis Working Group and the distribution of this information via a Publication page. A data freeze allows published results to be reproduced at a later date if desired.

  1. For disease marker papers (a global analysis publication), the DCC is responsible for working with the AWG to put together a publication page referring to all the published data in the publication.
  2. For disease follow-on papers, where the TCGA Network is one of the authors, the authors have the option to ask the DCC to host a publication page referring to published DCC data. The DCC is not responsible for creating the publication page. Authors are responsible for creating a publication page using a DCC supplied template. Two weeks notification to the DCC is required prior to generating this publication page.
  3. For disease follow-on papers, where the TCGA Network is not one of the authors, the DCC is not involved with the publication page.

Older archives submitted to the DCC are periodically transferred to backup tapes and removed from the file system. Data freezes incorporate specific revisions of archives that may not always remain as the latest archive versions and could be rendered inaccessible for download by being removed from the file system. It is imperative to retain these archives on the file system so that publication analyses can be reproduced to match the outcome reported at the time it was published.

Capturing a data freeze involves marking which archives need to be frozen under a data freeze set and also linking the freeze set to a publication.

The relationships between publication, data freeze set and archive are described as follows:

  • A publication can have one or more data freeze sets (for example, an analysis group uses sample subsets that reference different archives).
  • A data freeze set can belong to one or more publications (for example, one data freeze set is used to write two papers).
  • A data freeze set can span one or more disease studies (for example, lung study = LUAD + LUSC).
  • A data freeze set can contain one or more archives.
  • An archive can belong to one or more data freeze sets (for example, the same COAD archive is frozen for a colorectal and cross-cancer comparison data freeze).
  • No labels