NIH | National Cancer Institute | NCI Wiki  

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: For CDP-1471: Updated download statements.
Panel
titleContents of this Page
Table of Contents

The CSSI DCC Portal is a public repository of experiment-related information describing cancer research investigations. You can use the portal to browse, search, and download eight Physical Sciences Oncology Network (PSON) and access data generated through CSSI funded projects and other user uploaded data sets. These This data sets are is in ISA-TAB Tab format, which organizes investigation, study, and assay data according to the rules in the ISA-TAB specification.   Tab specification

Multiexcerpt include
MultiExcerptNameExitDisclaimer
nopaneltrue
PageWithExcerptwikicontent:Exit Disclaimer to Include
.  For more information about the ISA-Tab format, refer to the Understanding ISA-Tab page.  

Each data set contains three files--investigation, study, and assay--that conform to the ISA-TAB Tab structure and naming conventions. Within this structure are field names fields that are standard for each type of file, though null values are allowed; that is, not every data set includes values for each field. The portal allows you to filter these fields in a hierarchical an interactive way so that you can visualize the data in a pie chart or list.

You can also search investigations, studies, and assays using any keyword. This chapter provides detailed instructions on how to browse, search, and download data.

Browsing Investigations

You can browse and explore investigation data from the investigation, study, and assay files.

To browse investigations

...

The CSSI DCC Portal allows you to perform various tasks:

  • Search investigations, studies, and assays using any keyword.
  • Download the data.
  • Download the metadata.
  • Download selected files, with the metadata.
  • Upload your own investigation data to the portal and request open access for your uploaded data. (For more information, refer to Uploading Investigation Data and Requesting Open Access.)

The following page family describes how to browse, search, and download data.

Page Tree
expandDepth3
root@self

...

Exploring Investigation Data

On the Browse Investigations page, you can explore investigation data in the following ways:

...

Info

Note that not all investigations in the portal have a value for each field. For example, in the Characterization of Circulating Tumor Cells (CTCs) study in the screenshot above, there is no value for Study Assay Technology Type. This is reflected in second pie chart for Study Assay Technology Type at the top of the screen. In that pie chart, five  investigations have no value for this field while the remaining three have the value of "nucleotide sequencing."

...

Filtering Investigation Data

You can filter the investigation data on the Browse Investigations page in the following ways:

  • By adding and removing fields to and from the pie charts.
  • By clicking one or more pie slices.

Filter the list to see fewer results that match your interests. To filter the list, click the checkbox next to one or more categories in the Current Investigations list. For example, click the Biomedical Investigations 9909 category to see the current investigations that match that category.
Image Removed

View the buttons under the study title, which show the number of studies, assays, and files associated with this investigation.
No. of Studies (1), No. of Assays (1), No. of Files (2)Image Removed

Viewing Study Details

While the Search Result page includes summaries of all of the studies meeting your filters or search criteria, additional details are available about each study. Click the study title to view the study's measurement types, experimental factors, study description, protocols, and files.

The data archive is a compressed (.zip) file that combines the assay, investigation, and study text files, and associated study data files. The study text files contain the metadata. Downloading the data archive means that you do not need to also download the metadata and files for that investigation.

If you only need the metadata, you can download it as a compressed file (.zip). The metadata includes only the assay, investigation, and study text files, which are the file columns, field definitions, and placeholders that describe the structure of the data collected in this study. They are in Investigation-Study-Assay tab-delimited format (ISA-TAB), which is based on the ISA-TAB specification

Multiexcerpt include
MultiExcerptNameExitDisclaimer
nopaneltrue
PageWithExcerptwikicontent:Exit Disclaimer to Include
.

While they are also included in the archive if you choose to download that, you can also download selected study files. These study files might be image files, spreadsheets, or any other data associated with the selected study that is not the metadata.

Info

Your role determines which studies you can view in the CSSI DCC Portal.

To view study details

  1. Browse experiments.
    The Search Result page appears.
    Search Result pageImage Removed
  2. Click a study title, for example, "Muscle Satellite Cells: MyoD and p53 genes".
    The study details page appears.
    Image Removed

  3. Click Expand All to view the investigation's associated protocols and files.
    The Protocols section is a table showing Protocol Name, Protocol Type, and Description.
    Image Removed
    The Files section lists File Name, Size, Type, and Submission Date. Enter any keyword from these column names into the search box to search this list. You can also download selected files.
    Files list with a Search for files box, Download Selected button, and file information such as   ks121202MGC3a.CEL  9 MB  CEL  21 MAy 2015Image Removed

    Info

    Clicking Modify Search returns you to the Search Result page where you can change how you filter the current investigations.

Searching Within Results

An alternative to filtering by investigation is to narrow down a long list of search results by searching within those results.

To search within results

  1. Browse experiments.
    The Search Result page appears.
    Search Result pageImage Removed
  2. Locate the search box at the top of the page.
    Image Removed
  3. In the search box, enter any keyword that may appear in the investigation metadata. This can include any keyword or part of a word. The number of results updates as you enter your keyword. In the following example, only the first two characters belonging to the word "cell" were entered, narrowing the number of search results from 18 to 15.
    Image Removed

Searching Investigations

You can search investigations by keywords and filter the results. You do not need to log in to CSSI DCC Portal to search investigations.

To search experiments

...

 

Downloading Study Files

You can download selected files, the entire data archive, or only the metadata associated with a study. All of the data currently in the Portal is public.

To download study files

  1. Select an investigation and view its details.
    The study details page appears.
    Image Removed
  2. In the Files area, click the down arrow to expand it.
    Files list with a Search for files box, Download Selected button, and file information such as   ks121202MGC3a.CEL  9 MB  CEL  21 MAy 2015Image Removed

To download a single file

  • To select only one file, click its File Name link.
    Your browser's file download dialog box appears, which prompts you to open or save the file.
    Firefox File Download Dialog BoxImage Removed

To download multiple files

  1. Click the box in the Select File column for any of the files in the list.
    Image Removed
  2. ClickDownload Selected buttonImage Removed.
    The Download window appears.
    Download link will be sent via email. Enter email address below to receive download instructions.Image Removed
  3. Enter your email address and click OK.
    The Success window appears.
    Your request has been received. You'll receive an email with a link to download files once your download request is processedImage Removed
  4. Click OK.
  5. Look for an email in your inbox with the subject line File Download.
  6. Save or open the file.

Downloading Archives

Once you select an investigation and view its study details, you can choose to download the entire archive. An archive is a compressed file (.zip) that combines the assay, investigation, and study files with the metadata and data files.

You do not need to log in to CSSI DCC Portal to download an archive.

To download an archive

  1. Select an investigation and view its details.
    The study details page appears.
    Image Removed
  2. Click Download Archive ButtonImage Removed.
    The Download window appears.
    Download link will be sent via email. Enter email address below to receive download instructions.Image Removed
  3. Enter your email address and click OK.
    The Success window appears.
    Your request has been received. You'll receive an email with a link to download files once your download request is processedImage Removed
  4. Click OK.
  5. Look for an email in your inbox with the subject line File Download.
  6. Save or open the file.

Downloading Metadata

You do not need to log in to CSSI DCC Portal to download an investigation's metadata. The metadata includes only the assay, investigation, and study text files, which are the file columns, field definitions, and placeholders that describe the structure of the data collected in this study. They are in Investigation-Study-Assay tab-delimited format (ISA-TAB), which is based on the ISA-TAB specification Exit Disclaimer logoImage Removed .

To download metadata

  1. Select an investigation and view its details.
    The study details page appears.
    Image Removed
  2. Click Download Metadata ButtonImage Removed.
    Your browser opens a download dialog box with your download file in it.
    Opening Archive10290_metadata.zip dialog boxImage Removed
  3. Open the file or save it for later analysis.

    Info
    titleMetadata File Contents

    The metadata file is in compressed format. It contains one folder called metadata and a text file for each of the following: assay, investigation, and study.

    The contents of the archive folder look similar to the following (10290 is just an example of the file identifier):

    metadata/

       a_10290.txt

       i_10290.txt

       s_10290.txt