NIH | National Cancer Institute | NCI Wiki  

Error rendering macro 'rw-search'

null

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 24 Next »

Contents of this Page

The CSSI DCC Portal is a public repository of experiment-related information describing cancer research investigations. You can use the portal to browse, search, and download eight Physical Sciences Oncology Network (PSON) data sets. These data sets are in ISA-TAB format, which organizes investigation, study, and assay data according to the rules in the ISA-TAB specification.   

Each data set contains three files--investigation, study, and assay--that conform to the ISA-TAB structure and naming conventions. Within this structure are field names that are standard for each type of file, though null values are allowed; that is, not every data set includes values for each field. The portal allows you to filter these fields in a hierarchical way so that you can visualize the data in a pie chart or list.

You can also search investigations, studies, and assays using any keyword. This chapter provides detailed instructions on how to browse, search, and download data.

Browsing Investigations

You can browse and explore investigation data from the investigation, study, and assay files.

To browse investigations

  • From the CSSI Data Portal home page, click the Browse button.
    Browse button
    The Browse Investigations page appears. A subset of it appears in the following screenshot.

    Browse Investigations page showing filters and the CTCs and PSON Cell Line Genomic Characterization - Exome investigations
    The Browse Investigation page has two sections:
    • Pie charts that show fields from the investigations. The three default pie charts show combined statistics from the following investigation fields for all eight PSON data sets: Study Protocol Name, Study Assay Measurement Type, and Study Assay Technology Type. You can customize which fields appear in the pie charts and click pie slices to drill down to detailed data. You can return to the default pie charts anytime by clicking either Reset All or selecting Investigations > Browse.

    • A list of investigation details below the pie charts. These investigation details match the fields you have selected in the pie charts; for example, in the screenshot above, each investigation includes the same investigation fields as the pie chart--Study Protocol Name, Study Assay Measurement Type, and Study Assay Technology Type–plus Study Title and Description. The default list shows every investigation available in the portal, but when you add or remove fields from the pie charts, the number of investigations may change. You can return to the default (full) list at anytime by selecting Investigations > Browse.

Exploring Investigation Data

On the Browse Investigations page, you can explore investigation data in the following ways:

    • By filtering the investigations in the portal so that only those matching the fields you select appear in the pie charts and the list. If you do not filter the investigations, you see a pie chart analysis of the three default filters: Study Protocol Name, Study Assay Measurement Type, and Study Assay Technology Type. All of the investigations appear in the list.

      Note that not all investigations in the portal have a value for each field. For example, in the Characterization of Circulating Tumor Cells (CTCs) study in the screenshot above, there is no value for Study Assay Technology Type. This is reflected in second pie chart for Study Assay Technology Type at the top of the screen. In that pie chart, five  investigations have no value for this field while the remaining three have the value of "nucleotide sequencing."

    • By clicking the study title to view and filter study details, download selected study files, download the entire archive, and download only the metadata.

Filtering Investigation Data

You can filter the investigation data on the Browse Investigations page in the following ways:

  • By adding and removing fields to and from the pie charts.
  • By clicking one or more pie slices.

Filter the list to see fewer results that match your interests. To filter the list, click the checkbox next to one or more categories in the Current Investigations list. For example, click the Biomedical Investigations 9909 category to see the current investigations that match that category.

View the buttons under the study title, which show the number of studies, assays, and files associated with this investigation.
No. of Studies (1), No. of Assays (1), No. of Files (2)

Viewing Study Details

While the Search Result page includes summaries of all of the studies meeting your filters or search criteria, additional details are available about each study. Click the study title to view the study's measurement types, experimental factors, study description, protocols, and files.

The data archive is a compressed (.zip) file that combines the assay, investigation, and study text files, and associated study data files. The study text files contain the metadata. Downloading the data archive means that you do not need to also download the metadata and files for that investigation.

If you only need the metadata, you can download it as a compressed file (.zip). The metadata includes only the assay, investigation, and study text files, which are the file columns, field definitions, and placeholders that describe the structure of the data collected in this study. They are in Investigation-Study-Assay tab-delimited format (ISA-TAB), which is based on the ISA-TAB specification Exit Disclaimer logo .

While they are also included in the archive if you choose to download that, you can also download selected study files. These study files might be image files, spreadsheets, or any other data associated with the selected study that is not the metadata.

Your role determines which studies you can view in the CSSI DCC Portal.

To view study details

  1. Browse experiments.
    The Search Result page appears.
    Search Result page
  2. Click a study title, for example, "Muscle Satellite Cells: MyoD and p53 genes".
    The study details page appears.

  3. Click Expand All to view the investigation's associated protocols and files.
    The Protocols section is a table showing Protocol Name, Protocol Type, and Description.

    The Files section lists File Name, Size, Type, and Submission Date. Enter any keyword from these column names into the search box to search this list. You can also download selected files.
    Files list with a Search for files box, Download Selected button, and file information such as   ks121202MGC3a.CEL  9 MB  CEL  21 MAy 2015

    Clicking Modify Search returns you to the Search Result page where you can change how you filter the current investigations.

Searching Within Results

An alternative to filtering by investigation is to narrow down a long list of search results by searching within those results.

To search within results

  1. Browse experiments.
    The Search Result page appears.
    Search Result page
  2. Locate the search box at the top of the page.
  3. In the search box, enter any keyword that may appear in the investigation metadata. This can include any keyword or part of a word. The number of results updates as you enter your keyword. In the following example, only the first two characters belonging to the word "cell" were entered, narrowing the number of search results from 18 to 15.

Searching Investigations

You can search investigations by keywords and filter the results. You do not need to log in to CSSI DCC Portal to search investigations.

To search experiments

  1. Navigate to the CSSI Data Portal.
  2. Click the Search button.
     Search Search experiments by keywords and filter results or look up the activity of genes of interest.
    The Search Result page appears, displaying only a search box.
    Search Result page with Search keyword box
  3. In the search box, enter a keyword. Keywords are any word that appears in the investigation's metadata. This includes all file columns, field definitions, and placeholders that describe the structure of the data collected in this study.

    As you enter the characters in the keyword, the Search Result page immediately refreshes with the search results.

 

Downloading Study Files

You can download selected files, the entire data archive, or only the metadata associated with a study. All of the data currently in the Portal is public.

To download study files

  1. Select an investigation and view its details.
    The study details page appears.
  2. In the Files area, click the down arrow to expand it.
    Files list with a Search for files box, Download Selected button, and file information such as   ks121202MGC3a.CEL  9 MB  CEL  21 MAy 2015

To download a single file

  • To select only one file, click its File Name link.
    Your browser's file download dialog box appears, which prompts you to open or save the file.
    Firefox File Download Dialog Box

To download multiple files

  1. Click the box in the Select File column for any of the files in the list.
  2. ClickDownload Selected button.
    The Download window appears.
    Download link will be sent via email. Enter email address below to receive download instructions.
  3. Enter your email address and click OK.
    The Success window appears.
    Your request has been received. You'll receive an email with a link to download files once your download request is processed
  4. Click OK.
  5. Look for an email in your inbox with the subject line File Download.
  6. Save or open the file.

Downloading Archives

Once you select an investigation and view its study details, you can choose to download the entire archive. An archive is a compressed file (.zip) that combines the assay, investigation, and study files with the metadata and data files.

You do not need to log in to CSSI DCC Portal to download an archive.

To download an archive

  1. Select an investigation and view its details.
    The study details page appears.
  2. Click Download Archive Button.
    The Download window appears.
    Download link will be sent via email. Enter email address below to receive download instructions.
  3. Enter your email address and click OK.
    The Success window appears.
    Your request has been received. You'll receive an email with a link to download files once your download request is processed
  4. Click OK.
  5. Look for an email in your inbox with the subject line File Download.
  6. Save or open the file.

Downloading Metadata

You do not need to log in to CSSI DCC Portal to download an investigation's metadata. The metadata includes only the assay, investigation, and study text files, which are the file columns, field definitions, and placeholders that describe the structure of the data collected in this study. They are in Investigation-Study-Assay tab-delimited format (ISA-TAB), which is based on the ISA-TAB specification Exit Disclaimer logo .

To download metadata

  1. Select an investigation and view its details.
    The study details page appears.
  2. Click Download Metadata Button.
    Your browser opens a download dialog box with your download file in it.
    Opening Archive10290_metadata.zip dialog box
  3. Open the file or save it for later analysis.

    Metadata File Contents

    The metadata file is in compressed format. It contains one folder called metadata and a text file for each of the following: assay, investigation, and study.

    The contents of the archive folder look similar to the following (10290 is just an example of the file identifier):

    metadata/

       a_10290.txt

       i_10290.txt

       s_10290.txt

 

  • No labels