NIH | National Cancer Institute | NCI Wiki  

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: For CDP-1471: Updated download statements.
Panel
titleContents of this Page
Table of Contents

The CSSI DCC Portal is a public repository of experiment-related information describing cancer research investigations. You can use the portal to browse, search, and download eight Physical Sciences Oncology Network (PSON) access data generated through CSSI funded projects and other user uploaded data sets. These This data sets are is in ISA-TAB Tab format, which organizes investigation, study, and assay data according to the rules in the ISA-TAB Tab specification

Multiexcerpt include
MultiExcerptNameExitDisclaimer
nopaneltrue
PageWithExcerptwikicontent:Exit Disclaimer to Include
 For more information about the ISA-Tab format, refer to the Understanding ISA-Tab page.  

Each data set contains three files--investigation, study, and assay--that conform to the ISA-TAB Tab structure and naming conventions. Within this structure are fields that are standard for each type of file, though null values are allowed; that is, not every data set includes values for each field. The portal allows you to filter these fields in an interactive way so that you can visualize the data in a pie chart or list.

You can also search investigations, studies, and assays using any keyword. You can download selected files, the entire data archive, or only the metadata associated with a study.

This chapter provides detailed instructions on how to browse, search, and download data.

...

You can browse and explore investigation data from the investigation, study, and assay files.

To browse investigations

...

The

...

Understanding the Pie Charts and Investigations List

The Browse Investigation page has two interactive components:

...

CSSI DCC Portal allows you to perform various tasks:

  • Search investigations, studies, and assays using any keyword.
  • Download the data.
  • Download the metadata.
  • Download selected files, with the metadata.
  • Upload your own investigation data to the portal and request open access for your uploaded data. (For more information, refer to Uploading Investigation Data and Requesting Open Access.)

The following page family describes how to browse, search, and download data.

Page Tree
expandDepth3
root@self

...

Adding and Removing Fields

You can customize which pie charts appear at the top of the page. Since the pie charts control how you filter the investigation data, you may prefer something other than, or in addition to, the three default fields of Study Protocol Name, Study Assay Technology Type, and Study Assay Measurement Type.

To add or remove fields

...

Exploring Investigation Details

You can continue exploring investigation data by clicking a link to investigation, study, or assay details. Links to these details are on the Browse Investigations page or on the Search Investigations page, after you search on a keyword or phrase. These details include counts of studies, assays, samples, and files. The metadata available for an investigation determines if other entities, such as sources and collections, are available for counts. From these details pages, you can visualize the structure of the investigation and download selected study files, download the full archive, and download only the metadata.

To explore investigation details

  1. Browse investigations or search investigations until you find an investigation, a study, or an assay in which you are interested.
    The search results appear.
    Image Removed

  2. Click the link corresponding with the investigation, study, or assay you are interested in exploring.
    The respective investigation details, study details, or assay details page appears.

  3. You can do the following from the details pages:

Investigation Details Page

The investigation details page shows the investigation name at the top followed by a visualization of the investigation filename, study filename and number of samples in the study, and assay filename and number of files (and total file size) in the assay. Below the visualization are links to download all or part of the investigation, its identifier, and its description. All of the icons on the page are clickable links.

Info

The currently selected entity has a green box behind its icon in the visualization. When the investigation is selected, its study/studies, and assay(s) are also selected. In the following screenshot, the investigation is selected.

Image Removed

Study Details Page

...

Assay Details Page

The assay details page shows the study name at the top followed by a visualization of the investigation filename, study filename and number of samples in the study, and assay filename and number of files (and total file size) in the assay. Below the visualization are links to download all or part of the assay, its file name, measurement type, and technology type. In the Processes and Filters section, the relationship of the study to its processes, and its processes to its files, are depicted in clickable icons. Click any icon to further filter the investigation data and download only a selected portion of it.

Assay Details pageImage Removed

Visualizing and Filtering Data

Once you browse or search the CSSI DCC data sets and reach a selected investigation details, study details, or assay details page, you can continue exploring the data. The ISA-TAB format is hierarchical, with investigation components becoming more granular as you proceed down the hierarchy. The largest organizing entity is the investigation, which holds one or more studies. Each study includes one or more assays. Assays are composed of samples, which in turn are composed of protocols. Data files are often associated with a protocol.

The following image depicts the hierarchy, without the samples and protocols.

Structure of ISA data model as described in the text on this page.Image Removed

Visualizing and Filtering Investigations

The investigation details page shows icons that represent the relationship of the investigation to its studies and assays. In the case of PSON Cell Line Genomic Characterization - mRNA, the investigation has one study and one assay.
Image Removed

You cannot filter data currently at the investigation level any further in the CSSI DCC Portal. You can only download the investigation's full data at this point, or start exploring its studies and assays.

Visualizing and Filtering Study Data

If you select the study in the PSON Cell Line Genomic Characterization - mRNA investigation or arrive at any other Study Details page through a search, you can visualize the study's file structure and filter on any field.

To visualize and select study data

...

Image Removed

In the Visualize and Select area, click one of the entities.

Info
titleStudy Hierarchy

The hierarchy of entities for studies according to the ISA-TAB standard is as follows, from less granular to more granular:

Source > Protocol > Sample

For example, click the Source. Metadata for that source appears.

Info

Follow this same procedure if you want to filter on Protocol or Sample instead of Source.

...

Visualizing and Filtering Assay Data

If you select the assay in the PSON Cell Line Genomic Characterization - mRNA investigation or arrive at any other Assay Details page through a search, you can visualize the assay's file structure and filter on any field.

To visualize and select assay data

...

Image Removed

In the Visualize and Select area, click one of the entities. Note that the width of some visualizations require you to scroll by clicking the arrows at the bottom of the page.

Info
titleAssay Hierarchy

The hierarchy of entities for assays according to the ISA-TAB standard is as follows, from less granular to more granular:

Sample > Protocol > Data File

For example, click the sample. Metadata for the sample appears.

Info

Follow this same procedure if you want to filter on protocol instead of sample.

...

Searching Investigation Data

You can search investigations, studies, or assays using any keyword.

To search investigation data

...

Downloading Investigation Data (not yet complete)

You can download the full data of an investigation, only selected metadata, or only selected data (which includes that data's metadata). All of the data currently in the portal is public.

The full data associated with an investigation is a compressed (.zip) file that combines the investigation, study, and assay text files, and associated data files. Each text file contains only metadata, which appear as column names when imported into a spreadsheet. The data files often contain image files and spreadsheets and can be a large file size.

You can also filter an investigation and download only the metadata associated with that selection. The metadata includes the assay, investigation, and study text files of that selected data. Metadata describes the structure of the data collected in an investigation and translates to the file columns, field definitions, and placeholders that appear in a spreadsheet. It is in Investigation-Study-Assay tab-delimited format (ISA-TAB), which is based on the ISA-TAB specification

Multiexcerpt include
MultiExcerptNameExitDisclaimer
nopaneltrue
PageWithExcerptwikicontent:Exit Disclaimer to Include
.

Finally, you can filter an investigation and download that selected data. Selected data includes both the data files and associated metadata.

To download investigation files

  1. Filter the investigation data until you reach the entity you want to download.
    The entity's details page appears. It may involve a selected investigation, study, assay, source, protocol, or sample.
  2. Decide if you want to download the full data, selected metadata, or selected data by clicking the respective button.

...

 

  • Your browser's file download dialog box appears, which prompts you to open or save the file.
    Firefox File Download Dialog BoxImage Removed

To download multiple files

...

Downloading Full Data

The full data associated with an investigation combines the assay, investigation, and study files with the metadata and data files. Data files often include images. While it is compressed as a .zip file, it can still have a large file size. Before you choose to download an investigation's full data, note the file size listed on the Download Full Data button.

To download an archive

...

Downloading Selected Data

 

Downloading Selected Metadata

The metadata includes only the assay, investigation, and study text files, which are the file columns, field definitions, and placeholders that describe the structure of the data collected in this study. They are in Investigation-Study-Assay tab-delimited format (ISA-TAB), which is based on the ISA-TAB specification Exit Disclaimer logoImage Removed .

To download metadata

...

Open the file or save it for later analysis.

Info
titleMetadata File Contents

The metadata file is in compressed format. It contains one folder called metadata and a text file for each of the following: assay, investigation, and study.

The contents of the archive folder look similar to the following (10290 is just an example of the file identifier):

metadata/

   a_10290.txt

   i_10290.txt

   s_10290.txt