This chapter describes the processes for searching studies within caIntegrator.

Topics in this chapter include:

Search Overview

The search and browse functions in caIntegrator allow you to search for subject annotation data, genomic data or imaging data that were uploaded into the application as part of a study. When gene expression and imaging data are uploaded into a caIntegrator study, mapping files that correlate sample IDs in those files to subject IDs (patient IDs) in the subject annotation data file must also be uploaded. When you launch a search, caIntegrator finds and integrates the subject annotation, genomic and imaging data based on the mapping files and the criteria that you define in the search query.

In a search query, you can specify criteria for just one of the data types, or configure complex search criteria that join two or three data types. The available criteria for the query were defined when the study was deployed.

The basic workflow for a study search follows these steps:

  1. Select the study to be searched.
  2. Select one data type:
  3. Define criteria for the search in the selected data type and run the search.
  4. For a more complex search, select multiple criteria from more than one data type.
  5. Specify whether you want subject/imaging annotations to display or genomic data to display.
  6. Review search results.
  7. Configure results column and sorting display settings. You can do this before or after you run a search. If you choose to do it after, you must re-run the search.
  8. Download annotation search results as a CSV file. The CSV file contains only the data you specified in the annotation and display configurations.
  9. Follow links to NBIA in the search results to view or download images located in the search.

Searching a caIntegrator Study

To initiate a search of all annotations and/or other data in a study, follow these steps:

  1. In caIntegrator, in the upper right hand corner, select the study you want to browse or perform a simple search.
  2. On the left sidebar, under the first section that displays the study name, click Search [Study Name]. This opens a simple search query page with five tabs.
  3. On the Criteria tab, shown in the following figure, in the drop-down list, select the type of data you want to search.
    Criteria tab, described in text
  4. You can perform a search using one or more criteria you set in one of the data types, or you can define criteria in more than one data type per query, creating a more complex search.
  5. Click Add to further define criteria for the search.
    Continue with:
    Annotation and Image Data Searches
    Gene Expression Searches
    Copy Number Searches
    CGHCalls Searches
  6. To add additional criteria for the search, repeat steps 4 and 5, as appropriate. You can set more than one data type or more than one criterion for a data type. The criteria become cumulative, thus refining the search.
  7. Once you have configured the query criteria, select the Boolean Or or Andsearch operator at the bottom of the page.
  8. Click the Remove button to clear any data elements you have defined.
  9. You can launch the search from this Criteria tab. Click the Run Search button. For information about the search results, see 5 - Viewing Query Results. You may want to run the search first to see what kind of results you get before you configure the data display, described in step 10.
    – or –
  10. On the Results Type tab, you can specify the columns you want to display in the search results data. On the Sorting tab, you can specify how the data is to be sorted. For more information, see Results Type Tab and Sorting Tab.

    As long as you are still in the current query session, you can return to the Criteria, Columns and Sorting tabs to add, modify or remove data and display criteria and re-run the search. If you configure another query without saving the first, the first query will be lost. If you save the query, your current search criteria are saved.

Annotation and Image Data Searches

If the study manager defined the study's own annotation groups, then those group names are listed in the criteria drop-down list. If the study manager did not define the study's annotation groups when the study was created, then all annotations are placed, by default, in a group called "Annotations default".

Once you select an annotation group data type, an additional drop-down list displays data elements that are annotation definitions specified for that data type when the data was uploaded into the study. An example is shown in the following figure.
Annotation definition examples, described in text

  1. Select a search criterion from among the options. You can make only one selection at a time.

    If the study includes imaging data, imaging annotations should be available in the Annotations list, as shown in the following figure.

    Example imaging definitions, described in text

  2. Each choice opens other fields relevant to the selection where you can further define your search query.
  3. Continue with step 6 in Searching a caIntegrator Study.

Gene Expression Searches

For a gene expression search, follow these steps:

For the Gene Expression selection, select Gene Name, Expression Level or Fold Change. Follow the links for descriptions about each selection. Additional search fields will display upon selection. If the study includes multiple platforms, a Platform option is also visible.

Expression Level Selection

Additional fields display for the Expression Level selection.

  1. Select the range type from the following options:
  2. Enter the Expression Level appropriate for your query. The value that you enter is compared to the expression levels that are reported by the probes on the array platform. The probe value is typically a normalized value.This criteria changes slightly based the range type you select.
    The default value of 100 is a fixed default and does not reflect any values on the array. It simply represents a starting point for the query.
  3. Continue with step 6 in Searching a caIntegrator Study.

Fold Change Selection

Additional fields display for the Fold Change selection. The fold change option appears only if genomic control samples have been uploaded to the study. Fold change identifies genes with expression differences compared to control samples, as defined when the study was deployed in caIntegrator. You can enter query values in greater/lesser-than-or-equal-to arguments.

  1. Select or enter data for the Fold change fields shown in the following figure and described below.
    Fold change fields, described in text
  2. Continue with step 6 in Searching a caIntegrator Study.

Copy Number Searches

In some diseases, like cancer, cells that are abnormal can exhibit a change in the chromosomal structure in that parts of a chromosome can be amplified or deleted. 'Copy number' experiments that measure variation in genomic structure use molecular markers to detect amplification or deletion of chromosomal segments. Typically, copy number alteration experiments compare a genomic sample from a diseased tissue (for example, a tumor) to a control sample (for example, blood).

The Copy Number query option, as described in Searching a caIntegrator Study, appears only if copy number data have been uploaded to the study. A copy number search identifies subjects or samples that have a copy number amplification or deletion in the genome range specified. Searches can be constructed with gene names, chromosome number and/or chromosome coordinates. You can enter query values in greater/lesser-than-or-equal-to arguments.

For the Copy Number selection, select Gene Name or Segmentation.

Segmentation Search

Additional fields display for the Segmentation selection. Segmentation is the process of defining the chromosomal boundaries (coordinates) of the region deleted or amplified in the sample.

  1. Select or enter data for the copy number query fields shown in the following figure and described below.
    copy number query fields, described in text.
  2. Continue with step 6 in Searching a caIntegrator Study.

CGHCalls Searches

Additional fields display for the Calls selection. CGHCalls calls aberrations for array CGH data using a six state mixture model.

  1. Select or enter data for the copy number query fields shown in the following figure and described below.
    copy number query fields, described in text
  2. Continue with step 6 in Searching a caIntegrator Study.

Configuring Query Results Display

You can specify columns for the way you want the search results to display either before or after you run the search. If you run the search directly from the Criteria tab before setting the results type/sorting features, by default only the Subject Identifiers display on the Search Results tab. You can then come back to the Results Type Tab and Sorting Tab to expand the display options and re-run the search, having set the display parameters.

For more information, see 5 - Viewing Query Results.

Results Type Tab

The selection you make on the Results Type tab determines whether caIntegrator displays search results for subject annotation or for genomic data. It filters the search based on the criteria you set on the Criteria tab, whether it is annotation, gene expression or image series data type(s). In other words, if you select annotation criteria on the Criteria tab, but select Genomic on the Results Type tab, the data subset that displays on the Search Results tab is genomic data that is filtered by the annotation criteria you defined on the Criteria tab.

  1. On the Results Type tab, shown in the following figure, select the Annotation, Copy Number or Genomic radio button to search annotation data. Search criteria are described below.
    Result Type tab, described in text
  2. Annotation- Select the annotation elements that you want to display in the search results. All elements listed are column headers in the data uploaded to the study. You can make multiple selections on this list.

    For subject annotations, the Patient or Subject Identifier displays by default in the search results.


    Results display as tabular data.

  3. Copy Number - This option appears only if the open study includes copy number data. If you select this option, the annotation elements initially displayed on this tab disappear, and you are asked to run the query again. Based on the criteria you defined, the Query Results tab shows a data matrix containing samples against the genomic region you specified. For more information, see Copy Number Searches and Copy Number Data.
  4. Gene Expression - Select the Reporter Type and Results Orientation.
  5. Imaging - If imaging annotations have been added to the study, annotation elements also display on the lower right section of this page when you select Annotation. All elements listed are column headers in the image annotation data uploaded to the study. You can make multiple selections on this list.

    If you select even one Image Annotation on the Results Type tab, the Image Series IDs display by default in the search results. If you select no Image Annotations on the Results Type tab, however, even if you have selected image series criteria on the Criteria tab, no image series IDs display in the search results. The fact that images can be located, however, in NBIA is indicated by two image-related buttons at the bottom of the Query Results page. You can open the images in NBIA, but they will be at StudyInstance UID level. See Relationship of Subject to Study to Series to Images.

  6. Use the Select All or Unselect All buttons to aid you in making your selections.
    Results display as tabular data. For more information, see Subject Annotation and Imaging Data.

The column selection is saved as part of the query if you save it. See Saving a Query.

Sorting Tab

On the Sorting tab, you can set the sort order for data columns in the query results. You can also indicate whether column contents are sorted in ascending or descending order.

The columns that display on the Sorting tab are those criteria that you selected on the Results Type Tab for an Annotation Results type search.

Sorting is not applicable to copy number search results. For those results, no options are available on the Sorting tab.

  1. Select the Sorting tab and indicate the left to right column order of the Search Results by changing one or more numbers in the Column Order column in the table shown in the following figure.
    Sorting tab where user can set the sort order for selected columns of query results. Refer to text.
  2. In the Row Order column, indicate how you want columns sorted, Ascending or Descending, or leave the default, No Sort, if you choose.
  3. Click Run Query at the bottom of the page to execute your sorting changes in the search results. When you do so, the change in column order is visible on the Query Results tab, as well as on the Sorting tab. For example, any column that you have indicated to be number "1" now appears in Query Results immediately after the Subject Identifier column and at the top of the Set Sort Order table on the Sorting tab.
    Sorting parameters are saved as part of the query if you choose to save it using the Save Query feature. See Saving a Query.
  4. If you click the Reset button before running the query from the Sorting tab, the original column settings are restored.

For information about the search results, see 5 - Viewing Query Results.

Managing Queries

When you create a search query in caIntegrator, you can save the query for later use or edit it.

Saving a Query

To save a query, follow these steps:

  1. Click the Save As tab and enter a Search Name and Search Description, unique to the search. Example: Batch ID 6 and female
  2. Click Save.

Once the query is saved, it is listed by its name under the Study Data > Queries > My Queries in the left sidebar, whenever the study to which the query applies is selected. Click on the saved query in this list to either edit or re-run the query. Click on the query name to retrieve query results. If you hover over the Name text for the query, a pop-up displays the query description.

Editing a Query

To edit a query, follow these steps:

  1. Select a query in the left sidebar under the Study Data > Queries > My Queries.
  2. Click the Edit icon (edit icon) corresponding to the study.
  3. Change the query and display criteria on the Criteria, Columns and Sorting tabs.
  4. On the Save As tab, check the appropriate options and click Save As. You can use the same name as the original query or modify the name as needed.

Exporting Query Results

After running a search, you can export the result set or a subset as a tab-delimited text file. For more information, see Exporting Data.