
Searching a caIntegrator Study

This chapter describes the processes for searching studies within caIntegrator.
...

Search Overview

The search and browse functions in caIntegrator allow you to search for subject annotation data, genomic data or imaging data that were uploaded into the application as part of a study. When gene expression and imaging data are uploaded into a caIntegrator study, mapping files that correlate sample IDs in those files to subject IDs (patient IDs) in the subject annotation data file must also be uploaded. When you launch a search, caIntegrator finds and integrates the subject annotation, genomic and imaging data based on the mapping files and the criteria that you define in the search query.
In a search query, you can specify criteria for just one of the data types, or configure complex search criteria that join two or three data types. The available criteria for the query were defined when the study was deployed.
The basic workflow for a study search follows these steps:

  • Select the study to be searched.
  • Select one data type:
      • [Annotations] – Annotation data is labeled 'default', or carries the annotation group name when the study manager has specified annotation groups (for example, chronologic, therapy, diagnosis, patient, or other annotation group types). This selection searches one or more uploaded CSV files for data identifiers or annotations (column headers) specified during study creation.
      • [Genomic] – Genomic data can be gene expression or copy number data. This selection searches caArray experiment samples uploaded in the study for gene expression or copy number data by gene name, reporter ID, chromosome number, chromosome coordinates and/or segmentation values representing amplification or deletion.
      • Image Data – Searches NBIA imaging files uploaded in the study for image annotations or links to images, identified by subject identifiers or image series IDs.
  • Define criteria for the search in the selected data type and run the search.
  • For a more complex search, select multiple criteria from more than one data type.
  • Specify whether you want subject/imaging annotations to display or genomic data to display.
  • Review search results.
  • Configure results column and sorting display settings. You can do this before or after you run a search. If you choose to do it after, you must re-run the search.
  • Download annotation search results as a CSV file. The CSV file contains only the data you specified in the annotation and display configurations.
  • Follow links to NBIA in the search results to view or download images located in the search.
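
The integration described in the overview can be pictured as a join on subject ID. The following minimal Python sketch is not caIntegrator code: the record layouts, the field names and the `integrate` helper are hypothetical, chosen only to illustrate how a mapping file that ties sample IDs to subject IDs lets annotation and genomic data be combined.

```python
# Illustrative sketch only: all names and data here are hypothetical,
# not part of caIntegrator's own API or file formats.

def integrate(annotations, sample_to_subject, expression):
    """Join expression values to subject annotations through a sample-to-subject mapping."""
    rows = []
    for sample_id, values in expression.items():
        subject_id = sample_to_subject.get(sample_id)
        if subject_id is None or subject_id not in annotations:
            # A sample without a mapped, annotated subject cannot be integrated.
            continue
        row = {"subject_id": subject_id, "sample_id": sample_id}
        row.update(annotations[subject_id])  # subject annotation columns
        row.update(values)                   # genomic values for this sample
        rows.append(row)
    return rows

# Subject annotation data, keyed by subject (patient) ID.
annotations = {"P1": {"diagnosis": "GBM"}, "P2": {"diagnosis": "Astrocytoma"}}
# Mapping file content: sample ID -> subject ID. S12 maps to an unknown subject.
sample_to_subject = {"S10": "P1", "S11": "P2", "S12": "P9"}
# Gene expression values, keyed by sample ID.
expression = {"S10": {"EGFR": 250.0}, "S11": {"EGFR": 80.0}, "S12": {"EGFR": 120.0}}

integrated = integrate(annotations, sample_to_subject, expression)
```

In this sketch, sample S12 is dropped because its subject has no annotation record, which is why complete mapping files matter for integrated searches.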

...

Searching a Study

To initiate a search of all annotations and/or other data in a study, follow these steps:

  • In caIntegrator, in the upper right-hand corner, select the study you want to browse or search.
  • On the left sidebar, under the first section that displays the study name, click Search [Study Name]. This opens a simple search query page with five tabs.
    Figure: Search page
  • On the Criteria tab, in the drop-down list, select the type of data you want to search:
      • Annotations (listed as 'default', or by annotation group name if groups were specified when the study was created)
      • Gene Expression or Copy Number
      • Image Series
    Figure: Default or defined annotation data types are available in the search criteria drop-down list
    You can perform a search using one or more criteria you set in one of the data types, or you can define criteria in more than one data type per query, creating a more complex search.
  • Click Add to further define criteria for the search.

Continue with:

  • To add additional criteria for the search, repeat the preceding steps as appropriate. You can set more than one data type or more than one criterion for a data type. The criteria become cumulative, thus refining the search.
  • Once you have configured the query criteria, select the Boolean Or or And search operator at the bottom of the page.
      • Or finds a data subset that meets at least one of the search criteria.
      • And finds a data subset that meets all of the search criteria.
  • Click the Remove button to clear any data elements you have defined.
  • You can launch the search from this tab by clicking the Run Search button. You may want to run the search first to see what kind of results you get before you configure the data display, described in the following step.

– or –

  • On the Results Type tab, you can specify the columns you want to display in the search results data. On the Sorting tab, you can specify how the data is to be sorted. For more information, see Results Type Tab and Sorting Tab.
  • As long as you are still in the current query session, you can return to the Criteria, Columns and Sorting tabs to add, modify or remove data and display criteria and re-run the search. If you configure another query without saving the first, the first query will be lost. If you save the query, your current search criteria are saved.
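
The effect of the Or and And operators in the steps above can be sketched in a few lines of Python. This is an illustration under stated assumptions, not caIntegrator's implementation: the subject records, the criteria and the `run_search` helper are all hypothetical.

```python
# Hypothetical sketch: each criterion is a plain predicate over a subject record.

def run_search(subjects, criteria, operator="and"):
    """Return subjects matching all criteria ("and") or at least one ("or")."""
    if operator not in ("and", "or"):
        raise ValueError("operator must be 'and' or 'or'")
    combine = all if operator == "and" else any
    return [s for s in subjects if combine(c(s) for c in criteria)]

# Invented example data.
subjects = [
    {"id": "P1", "age": 61, "gender": "F"},
    {"id": "P2", "age": 45, "gender": "F"},
    {"id": "P3", "age": 70, "gender": "M"},
]
criteria = [
    lambda s: s["age"] >= 60,      # first criterion
    lambda s: s["gender"] == "F",  # second criterion
]

and_ids = [s["id"] for s in run_search(subjects, criteria, "and")]
or_ids = [s["id"] for s in run_search(subjects, criteria, "or")]
```

With And, each added criterion can only narrow the result set; with Or, each added criterion can only widen it.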

...

Annotation and Image Data Searches

  • If the study manager defined the study's own annotation groups, then those group names are listed in the criteria drop-down list. If the study manager did not define the study's annotation groups when the study was created, then all annotations are placed, by default, in a group called "Annotations default".
  • Once you select an annotation group data type, an additional drop-down list displays data elements that are annotation definitions specified when the data was uploaded into the study.
    Figure: Annotation data elements available in the search criteria drop-down list reflect definitions specified in the corresponding study
  • Select a search criterion from among the options. You can make only one selection at a time.
  • If the study includes imaging data, imaging annotations should be available in the Annotations list.

...

    Figure: Annotation search criteria, including criteria for imaging
  • Each choice opens other fields relevant to the selection where you can further define your search query.
  • If permissible values were added when the annotation was defined, you must select among the values in a drop-down list that displays on the right side of the page.
  • If no permissible values were defined as part of the annotation, you have the option to enter descriptive text in a text box on the right side of the page.
    Figure: You may be able to further define search criteria when you select a specific subject annotation or imaging annotation element
  • When working with image data, if only an Imaging Mapping file was uploaded when the study was created and not an Image Series Annotation file, you cannot enter image search criteria. The search results will, however, display a link that allows you to view the associated images in NBIA.

Continue with the remaining steps in Searching a Study.

Gene Expression Data Searches

  • For the Gene Expression selection, select Gene Name, Expression Level or Fold Change. If the study includes multiple platforms, a Platform option is also visible.
  • Gene Name, Expression Level or Fold Change – Enter one or more gene symbols in the text box or click the icons to locate genes in the following databases. If you enter more than one gene in the text box, separate the entries by commas. If multiple platforms are part of the study, your platform selection in the Fold Change query criteria determines the control samples that are available.
  • If you leave the gene symbols field blank, caIntegrator searches all gene symbols for a match to the other criteria you specify.

...

The default value of 100 is a fixed default and does not reflect any values on the array. It simply represents a starting point for the query.
Additional fields display for the Fold Change selection.
The fold change option appears only if genomic control samples have been uploaded to the study. Fold change identifies genes with expression differences compared to control samples, as defined when the study was deployed in caIntegrator. You can enter query values in greater/lesser-than-or-equal-to arguments.

  • Select or enter data for the Fold Change fields:
    Figure: Fields for identifying fold change search criteria
      • Control Sample Set – Select from the drop-down list the name of the uploaded control sample set to serve as the fold change reference.
      • Regulation Type – Select the term that describes the gene expression in comparison with the control samples: Up is increased expression; Down is decreased expression; Up or Down is increased or decreased; Unchanged means no change in expression.
      • Up-Regulation Folds – Enter a numerical value representing fold change. The number you enter here depends on the Regulation Type you selected:
          • Up = Up Regulation Folds – Samples with a fold change greater than this value, when compared to the control samples, will be returned.
          • Down = Down Regulation Folds – Samples with a fold change less than this value, when compared to the control samples, will be returned.
          • Up or Down = Down Regulation Folds, Up Regulation Folds – Samples with a fold change either up or down, when compared to the control samples, will be returned.
          • Unchanged = Samples with a fold change between the two specified values will be returned.

For example, if you enter 2.0 in this field, after selecting Up in the previous field, the search will locate genes whose expression is 2 times (2-fold up regulation) the base value.
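
The regulation types above can be illustrated with a short Python sketch. It rests on assumptions that this guide does not state: fold change is taken as the ratio of a sample's expression value to the mean of the control samples, and a Down Regulation Folds value of N is read as "expression below 1/N of the control". The `matches_fold_change` function and its parameters are hypothetical, not caIntegrator's implementation.

```python
# Hypothetical sketch of the four Regulation Type choices.
# Assumption: fold change = sample value / mean control value.
# Assumption: "down" folds of N means the ratio falls below 1/N.

def matches_fold_change(sample_value, control_mean, regulation,
                        up_folds=2.0, down_folds=2.0):
    ratio = sample_value / control_mean
    is_up = ratio > up_folds
    is_down = ratio < 1.0 / down_folds
    if regulation == "Up":
        return is_up
    if regulation == "Down":
        return is_down
    if regulation == "Up or Down":
        return is_up or is_down
    if regulation == "Unchanged":
        # Between the two thresholds: neither up- nor down-regulated.
        return not (is_up or is_down)
    raise ValueError(f"unknown regulation type: {regulation}")

# A sample at 500 against a control mean of 200 has a ratio of 2.5.
up_hit = matches_fold_change(500.0, 200.0, "Up", up_folds=2.0)
# A sample at 50 against a control mean of 200 has a ratio of 0.25.
down_hit = matches_fold_change(50.0, 200.0, "Down", down_folds=2.0)
```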
Continue with the remaining steps in Searching a Study.

Copy Number Searches

In some diseases, like cancer, abnormal cells can exhibit changes in chromosomal structure in which parts of a chromosome are amplified or deleted. 'Copy number' experiments that measure variation in genomic structure use molecular markers to detect amplification or deletion of chromosomal segments. Typically, copy number alteration experiments compare a genomic sample from a diseased tissue (for example, a tumor) to a control sample (for example, blood).
The Copy Number query option appears only if copy number data have been uploaded to the study. A copy number search identifies patients or samples that have a copy number amplification or deletion in the genome range specified. Searches can be constructed with gene names, chromosome number and/or chromosome coordinates. You can enter query values in greater/lesser-than-or-equal-to arguments.

...

  • Select or enter data for the copy number query fields.
    Figure: Fields for identifying copy number search criteria

Segmentation is the process of defining the chromosomal boundaries (coordinates) of the region deleted or amplified in the sample.

...

  • Genome Interval > Chromosome Number – In the text box that opens, enter the chromosome number you want the query to search against.
  • Genome Interval > Chromosome Coordinates – In the From and To text boxes that open, enter the range on the chromosome you want to search. This defines the chromosomal boundaries of the region with the suspected copy number variations.
    Figure: Fields for identifying copy number chromosome coordinates values

The Bioconductor DNAcopy algorithm identifies the location of the amplification or deletion and then reports it as the base pair at the start and stop of the segment. Each segment is then catalogued with chromosome number, start coordinate, stop coordinate, genes in the segment, and the segment mean value.
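
A search over catalogued segments can be sketched as an interval-overlap filter. The Python below is a minimal illustration, not caIntegrator or DNAcopy code: the `Segment` record, the `find_segments` helper, the sample data and the threshold values are hypothetical, and the segment mean is assumed to be a log-ratio in which positive values suggest amplification and negative values suggest deletion.

```python
from dataclasses import dataclass

# Hypothetical record for one catalogued segment.
@dataclass
class Segment:
    sample_id: str
    chromosome: str
    start: int      # first base pair of the segment
    end: int        # last base pair of the segment
    mean: float     # segment mean value (assumed to be a log-ratio)

def find_segments(segments, chromosome, start, end, min_mean=None, max_mean=None):
    """Return segments on the chromosome that overlap [start, end] and pass the mean limits."""
    hits = []
    for seg in segments:
        if seg.chromosome != chromosome:
            continue
        if seg.end < start or seg.start > end:
            continue  # no overlap with the requested genome interval
        if min_mean is not None and seg.mean < min_mean:
            continue
        if max_mean is not None and seg.mean > max_mean:
            continue
        hits.append(seg)
    return hits

# Invented example data.
segments = [
    Segment("S10", "7", 54_000_000, 56_000_000, 0.9),
    Segment("S11", "7", 10_000_000, 20_000_000, 0.05),
    Segment("S12", "9", 21_000_000, 22_500_000, -1.1),
]

# Segments on chromosome 7 overlapping the interval with a mean of at least 0.3.
amplified = find_segments(segments, "7", 55_000_000, 55_300_000, min_mean=0.3)
# Segments on chromosome 9 overlapping the interval with a mean of at most -0.3.
deleted = find_segments(segments, "9", 21_900_000, 22_000_000, max_mean=-0.3)
```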
Additional fields display for the Calls selection.

  • Select or enter data for the copy number query fields described below.
    Figure: Fields for identifying CGHCalls search options

...

CGHCalls calls aberrations for array CGH data using a six-state mixture model.

...

Continue with the remaining steps in Searching a Study.

Choosing Genes

caIntegrator provides three methods whereby you can obtain gene names for a gene expression search.

  • caBIO – This link searches caBIO, then pulls identified genes into caIntegrator for analysis.
      • Click the caBIO icon.
      • Enter Search Terms. Note that caIntegrator can perform a search on a partial HUGO symbol. For example, a search using ACH would find matches with 'achalasia' and 'arachidonate'.
      • From the drop-down list, select whether you want to search in Gene Keywords, Gene Symbols, Gene Alias, Database Cross Reference Identifier or Pathways.
          • Gene Keywords searches the description field in caBIO; the result displays in the Full Name column.
          • Gene Symbols searches only the Unigene and HUGO gene symbols in caBIO.
          • Gene Alias searches for one or more gene symbols that are synonyms of the current gene symbol.
          • Database Cross Reference Identifier searches for the symbol for this gene as it appears in other databases.
          • Pathways searches only the pathway names in caBIO. Note that searching in Pathways is a two-step process. First, the initial Pathway search produces search results that are pathways. Second, from the pathway search results screen, you must select pathways of interest, then click Search Pathways for Genes to obtain a list of genes related to the selected pathways.
      • Select the Any or All choice to determine how your search terms will be matched. Any finds any match for any search term you entered. All finds only results that match all of the search terms.
      • Choose the Taxon from the drop-down list and click Search. The search results display.
        Figure: Example caBIO gene search criteria and search results
      • In the search results, use the check boxes to identify the genes whose symbols you want to use in the gene expression analysis.
      • Click Use Genes at the bottom of the page. This pulls the checked genes into the Criteria tab.
        Figure: Genes pulled in from caBIO display on the Criteria tab
  • Gene List – This link locates gene lists saved in caIntegrator.
      • Click the Gene List icon to open a Gene List Picker dialog.

...

      • GISTIC Amplified genes is a list of gene symbols in which the corresponding regions of the genome are significantly amplified.
      • GISTIC Deleted genes is a list of gene symbols in which the corresponding regions of the genome are significantly deleted.
      • In the drop-down menu that lists previously saved gene lists, select a gene list. In the list that appears, use the check boxes to identify the genes whose symbols you want to use in the gene expression analysis.
      • Click Use Genes at the bottom of the dialog. This pulls the checked genes into the Search Criteria tab.
  • CGAP – Use this directory to identify genes. Before clicking this link you must enter gene symbols in the text box. This link does not pull anything into caIntegrator but does provide information about the gene(s) whose names you entered.
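
The difference between the Any and All choices in the caBIO search, combined with partial-term matching, can be sketched as follows. This Python example is an assumption-laden illustration and does not reproduce caBIO's actual matching rules: the `match_genes` helper and the case-insensitive substring matching are hypothetical.

```python
# Hypothetical sketch of Any/All matching with partial (substring) terms.

def match_genes(genes, terms, mode="any"):
    """genes maps symbol -> full name; returns symbols whose text matches the terms."""
    if mode not in ("any", "all"):
        raise ValueError("mode must be 'any' or 'all'")
    combine = any if mode == "any" else all
    lowered = [t.lower() for t in terms]
    result = []
    for symbol, full_name in genes.items():
        text = f"{symbol} {full_name}".lower()
        # A term matches if it appears anywhere in the symbol or full name.
        if combine(t in text for t in lowered):
            result.append(symbol)
    return result

genes = {
    "EGFR": "epidermal growth factor receptor",
    "FGFR1": "fibroblast growth factor receptor 1",
    "TP53": "tumor protein p53",
}

any_hits = match_genes(genes, ["epidermal", "fibroblast"], mode="any")
all_hits = match_genes(genes, ["growth", "fibroblast"], mode="all")
```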

Query Results

You can specify the columns you want the search results to display either before or after you run the search. If you run the search directly from the Criteria tab before setting the results type and sorting features, by default only the Subject Identifiers display on the Search Results tab. You can then return to the Results Type tab and the Sorting tab to expand the display options and re-run the search with the display parameters set.

Results Type Tab

The selection you make on the Results Type tab determines whether caIntegrator displays search results for subject annotation or genomic data. It filters the search based on the criteria you set on the Criteria tab, whether it is annotation, gene expression or image series data type(s). In other words, if you select annotation criteria on the Criteria tab, but select Genomic on the Results Type tab, the data subset that displays on the Search Results tab is genomic data that is filtered by the annotation criteria you defined on the Criteria tab.

  • On the Results Type tab, select the Annotation, Copy Number or Genomic radio button to choose which type of data the search results display.
    Figure: Results Type tab, annotation options

Annotation – Select the annotation elements that you want to display in the search results. All elements listed are column headers in the data uploaded to the study. You can make multiple selections on this list.

...

The column selection is saved as part of the query if you save it. See Saving a Query.

Sorting Tab

On the Sorting tab, you can set the sort order for data columns in the query results. You can also indicate whether column contents are sorted in ascending or descending order.
The columns that display on the Sorting tab are those that you selected on the Results Type tab for an Annotation results type search.

  • Sorting is not applicable to copy number search results. For those results, no options are available on the Sorting tab.
  • Select the Sorting tab and indicate the left-to-right column order of the Search Results by changing one or more numbers in the Column Order column of the table.
    Figure: Sorting tab
  • In the Row Order column, indicate how you want columns sorted, Ascending or Descending, or leave the default, No Sort, if you choose.
  • Click Run Query at the bottom of the page to execute your sorting changes in the search results. When you do so, the change in column order is visible on the Query Results tab, as well as on the Sorting tab. For example, any column that you have indicated to be number "1" now appears in Query Results immediately after the Subject Identifier column and at the top of the Set Sort Order table on the Sorting tab.
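
The combination of a left-to-right column order and a per-column Ascending or Descending choice can be sketched in Python. This is a hypothetical illustration, not caIntegrator's implementation: the `arrange_results` helper, the row layout and the rule that earlier entries in the sort list take priority are assumptions.

```python
# Hypothetical sketch of applying column order and row sorting to result rows.

def arrange_results(rows, column_order, sort_spec):
    """column_order: columns left to right, after the subject identifier.
    sort_spec: list of (column, "asc" or "desc"), highest priority first."""
    ordered = list(rows)
    # Python's sort is stable, so applying the lowest-priority key first
    # leaves the highest-priority key as the dominant ordering.
    for column, direction in reversed(sort_spec):
        ordered.sort(key=lambda r: r[column], reverse=(direction == "desc"))
    columns = ["subject_id"] + column_order  # identifier column always comes first
    return [[r[c] for c in columns] for r in ordered]

# Invented example data.
rows = [
    {"subject_id": "P1", "age": 61, "gender": "F"},
    {"subject_id": "P2", "age": 45, "gender": "M"},
    {"subject_id": "P3", "age": 70, "gender": "F"},
]

# Gender is column 1 (ascending); age is column 2 (descending).
table = arrange_results(rows, ["gender", "age"], [("gender", "asc"), ("age", "desc")])
```

Columns left at No Sort would simply be omitted from the sort list in this sketch.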

...


Managing Queries

When you create a search query in caIntegrator, you can save the query for later use or edit it.
For more information, see the topics that follow.

Saving a Query

To save a query, follow these steps:

...

Once the query is saved, it is listed by name under Study Data > Queries > My Queries in the left sidebar whenever the study to which the query applies is selected. Click the saved query in this list to either edit or re-run the query. Click the query name to retrieve query results. If you hover over the Name text for the query, a pop-up displays the query description.

Editing a Query

To edit a query, follow these steps:

  • Select the query in the left sidebar under Study Data > Queries > My Queries.
  • Click the Edit icon corresponding to the query.
  • Change the query and display criteria on the Criteria, Columns and Sorting tabs.
  • On the Save As tab, check the appropriate options and click Save As. You can use the same name as the original query or modify the name as needed.

...

Exporting Query Results

After running a search, you can export the result set or a subset as a tab-delimited text file.
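
For readers who process exported results programmatically, the following Python sketch shows what writing a tab-delimited text file can look like. It uses the standard `csv` module; the `export_tab_delimited` helper, the column names and the rows are hypothetical and do not reflect the exact layout of a caIntegrator export.

```python
import csv
import io

# Hypothetical sketch: write result rows as tab-delimited text.

def export_tab_delimited(rows, columns, out):
    """Write a header line and one line per row, with fields separated by tabs."""
    writer = csv.writer(out, delimiter="\t", lineterminator="\n")
    writer.writerow(columns)
    for row in rows:
        # Missing values are written as empty fields.
        writer.writerow([row.get(c, "") for c in columns])

results = [{"subject_id": "P1", "age": 61}, {"subject_id": "P2", "age": 45}]

buffer = io.StringIO()  # stands in for a file opened with open(path, "w", newline="")
export_tab_delimited(results, ["subject_id", "age"], buffer)
exported = buffer.getvalue()
```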