NIH | National Cancer Institute | NCI Wiki  

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  1. Select the study whose data you want to analyze in the upper right portion of the caIntegrator page. You must select a study saved as a subject annotation study, but which has genomic data.
  2. Click GenePattern Analysis in the left sidebar of caIntegrator. This opens the GenePattern Analysis Status page.
  3. In the GenePattern Analysis Status page, select Comparative Marker Selection (Grid Service) from the drop down list and click New Analysis Job. This opens the Comparative Marker Selection Analysis page, shown in the following figure.
    ”Comparative Marker Selection analysis parameters”
  4. Select or define CMS analysis parameters, described in the following table. An asterisk indicates required fields. The default settings are valid; they should provide valid results.

    CMS Parameter

    Description

    Job Name*

    Assign a unique name to the analysis you are configuring.

    Preprocess Server*

    A server which hosts the grid-enabled data GenePattern PreProcess Dataset module. Select one from the list and caIntegrator will use the selected server for this portion of the processing.

    Comparative Server*

    A server which hosts the grid-enabled data GenePattern Comparative Marker Selection module. Select one from the list and caIntegrator will use the selected server for this portion of the processing.

    Annotation Queries and Lists*

    All subject annotation queries and gene lists with appropriate data for the analysis are listed. Select and move two or more queries from the All Available Queries panel to the Selected Queries panel using the Add > and Remove < buttons.
    <ac:structured-macro ac:name="unmigrated-wiki-markup" ac:schema-version="1" ac:macro-id="87fd5e9862fa20aa-352c2870-414645fd-a7c4a86b-9937abeed27352343d3fae54"><ac:plain-text-body><![CDATA[Note: The [SL] and [Q] prefixes to list names indicate "Subject Lists" or "Saved Queries". A "G" in the prefix indicates the list is Global. For more information, see [Creating a Gene or Subject List

    https://wiki.nci.nih.gov/x/FoDnAg#4-ViewingQueryResults-CreatingaGeneorSubjectList].

    ]]></ac:plain-text-body></ac:structured-macro>

    Filter Flag

    Variation filter and thresholding flag

    Preprocessing Flag*

    Discretization and normalization flag

    Min Change*

    Minimum fold change for filter

    Min Delta*

    Minimum delta for filter

    Threshold*

    Value for threshold

    Ceiling*

    Value for ceiling

    Max Sigma Binning*

    Maximum sigma for binning

    Probability Threshold*

    Value for uniform probability threshold filter

    Num Exclude*

    Number of experiments to exclude (max & min) before applying variation filter

    Log Base Two

    Whether to take the log base two after thresholding; default setting is "Yes".

    Number of Columns Above Threshold*

    Remove row if n columns are not >= than the given threshold
    In other words, the module can remove rows in which the given number of columns does not contain a value greater or equal to a user defined threshold.

    Test Direction*

    The test to perform (up-regulated for class0; up-regulated for class1, two sided). By default, Comparative Marker Selection performs the two-sided test.

    Test Statistic*

    Select the statistic to use.

    Min Std*

    The minimum standard deviation if test statistic includes the min std option. Used only if test statistic includes the min std option.

    Number of Permutations*

    The number of permutations to perform. (Use 0 to calculate asymptotic P-values.) The number of permutations you specify depends on the number of hypotheses being tested and the significance level that you want to achieve (3). The greater the number of permutations, the more accurate the P-value.
    Complete – Perform all possible permutations. By default, complete is set to No and Number of Permutations determines the number of permutations performed. If you have a small number of samples, you might want to perform all possible permutations.
    Balanced – Perform balanced permutations

    Random Seed*

    The seed for the random number generator.

    Smooth P-values

    Whether to smooth P-values by using the Laplace's Rule of Succession. By default, Smooth P-values is set to Yes, which means P-values are always less than 1.0 and greater than 0.0.

    Phenotype Test*

    Tests to perform when class membership has more than 2 classes: one versus-all, all pairs.
    Note: The P-values obtained from the one-versus-all comparison are not fully corrected for multiple hypothesis testing.

  5. When you have completed the form, click Perform Analysis.

...

  1. Select the study whose data you want to analyze in the upper right portion of the caIntegrator page. You must select a study with copy number (either Affymetrix SNP or Agilent Copy Number) data.
  2. Click GenePattern Analysis in the left sidebar of caIntegrator. This opens the GenePattern Analysis Status page.
  3. In the GenePattern Analysis Status page, select GISTIC (Grid Service) from the drop down list and click New Analysis Job. This opens the GISTIC Analysis page, shown in the following figure.
    ”GISTIC analysis criteria”
  4. Select or define GISTIC analysis parameters, described in the following table. You must indicate a Job Name, but you can accept the other default settings, which are valid and should produce valid results. Asterisks identify required fields.

    GISTIC Parameters

    Description

    Job Name*

    Assign a unique name to the analysis you are configuring.

    GISTIC Service Type*

    Select whether to use the GISTIC web service or grid service and provide or select the service address. If the web service is selected, authentication information is also required

    GenePattern User Name/Password

    Include these to log into GenePattern for the analysis.

    Annotation Queries and Lists

    All annotation queries display in this list as well as an option to select all non-control samples. Select an annotation query if you wish to run GISTIC on a subset of the data and select all non-control samples if wish to include all samples.

    Select Platform

    This option appears only if more than one copy number platform exists in the study. Select the appropriate platform from the drop-down list ().

    Exclude Sample Control Set*

    From the drop-down list, select the name of the control set you want to exclude from the analysis. Click None if that is applicable.

    Amplifications Threshold*

    Threshold for copy number amplifications. Regions with a log2 ratio above this value are considered amplified. Default = 0.1.

    Deletions Threshold*

    Threshold for copy number deletions. Regions with a log2 ratio below the negative of this value are considered deletions. Default = 0.1.

    Join Segment Size*

    Smallest number of markers to allow in segments from the segmented data. Segments that contain fewer than this number of markers are joined to the neighboring segment that is closest in copy number. Default = 4.

    <ac:structured-macro ac:name="unmigrated-wiki-markup" ac:schema-version="1" ac:macro-id="67bcd04eda9b16cd-dc7ded55-43834860-b51bbf2e-5340a27c13083a44c57b739f"><ac:plain-text-body><![CDATA[

    QV Thresh[hold]*

    Threshold for q-values. Regions with q-values below this number are considered significant. Default = 0.25.

    ]]></ac:plain-text-body></ac:structured-macro>

    Remove X*

    Flag indicating whether to remove data from the X-chromosome before analysis. Allowed values = {1,0}. Default = 1(yes).

    cnv File

    This selection is optional.
    Browse for the file. There are two options for the CNV file.
    Option #1 enables you to identify CNVs by marker name. Permissible file format is described as follows:
    A two column, tab-delimited file with an optional header row. The marker names given in this file must match the marker names given in the markers_file. The CNV identifiers are for user use and can be arbitrary. The column headers are:
    Marker Name
    CNV Identifier
    Option #2 enables you to identify CNVs by genomic location. Permissible file format is described as follows:
    A 6 column, tab-delimited file with an optional header row. The 'CNV Identifier', 'Narrow Region Start' and 'Narrow Region End' are for user use and can be arbitrary. The column headers are:
    CNV Identifier
    Chromosome
    Narrow Region Start
    Narrow Region End
    Wide Region Start
    Wide Region End

  5. When you have completed the form, click Perform Analysis.
  6. When the job is complete, the system displays a completion date on the GenePattern Analysis status page. Click the Download link. This downloads zipped result files to your local work station. The number of files and their file type will vary according to the processing. The results format is compatible with GenePattern visualizers and can be uploaded within GenePattern.
  7. Additionally, upon completion of a successful GISTIC anaylsis, caIntegrator automatically displays the two gene lists that it generates in the Gene List Picker so that you can use them in a caIntegrator query or plot calculation. The lists are visible only to your userID. For more information, see #Choosing Genes. The genes will also display in Saved Copy Number Analyses in the left sidebar. See on #Editing a GISTIC Analysis.
    Warning
    titleCaution

    If samples from a copy number source are deleted, the GISTIC job in which they are appear is also deleted.

...

See also Creating a Gene or Subject List and Editing a Gene or Subject List.

Viewing Data with the Integrative Genomics Viewer

Once you have run a query for gene expression, or for copy number, you can view results in the Integrative Genomics Viewer (IGV).

The IGV is a high-performance visualization tool for interactive exploration of large, integrated datasets. It supports a wide variety of data types including sequence alignments, microarrays, and genomic annotations.

Info
titleNote

For more information about the Integrative Genomics Viewer or to connect independently to the IGV home page, see Integrative Genomics Viewer login. You may also want to refer to the IGV User Guide. The IGV viewer and the NCI Heat Map viewer both require you to install a version of Java containing Java Web Start. For more information, see #Java for IGV and Heat Map Viewewr.

There are two ways to integrate caIntegrator with the IGV. To configure the connection to IGV, follow one of these methods.

...

Method 1 IGV

...

  1. With the appropriate study open, at the bottom of the Query Results page, click the View in Integrative Genomics Viewer button.
  2. If you click the button at the bottom of the page with any of the query results line items selected, caIntegrator creates IGV files, with a monitor informing you of this. After the files are created, click the Launch Integrative Viewer hypertext link.
  3. Follow the instructions through the intermediate dialog boxes. After clicking Open with the Java program listed, the IGV.jnlp opens, displaying the dataset in the computer screen. An example displays in the following figure.
    ”IGV Viewer displaying expression results from data isolated in caIntegrator”Image Removed
  4. Move your mouse to hover over the genes graphic at the bottom of the page, indicated in the figure.
  5. Click the mouse when you've identified a gene of interest.
    This opens the genome site at UCSC, where you can learn more about the gene. The following figures exhibits the kind of metadata you can expect from the UCSC genome site.
    ”Example of the kind of metadata you can learn about a gene at the UCSC genome website”Image Removed

...

Method 2 IGV

...

  1. With the appropriate study open, click Integrative Genomics Viewer on the left sidebar. This opens the View IGV Selector page, shown in the following figure.
    ”The page for configuring the connection to the IGV”Image Removed
  2. In the drop-down list, select the Gene Expression Platform for the data you want to view.
  3. Select the Copy Number Platform ID.
  4. The Annotations - Default panel displays existing annotation fields for the gene expression data in the open study. Select those fields you want to view when you open the IGV. Use the buttons for convenience if you want to Select All or Unselect All, when all are checked.
  5. Click View to see the data in the Integrative Genomic Viewer. caIntegrator creates IGV files of the data.
  6. After the files are created, click the Launch Integrative Viewer hypertext link that appears.
  7. Continue with Step 3 in #Method 1 IGV.

Viewing Data with Heat Map Viewer

Once you have run a query for gene expression, or for copy number, you can view results in the Heat Map Viewer (HMV).

Info
titleNote

For more information about the Heat Map Viewer or to connect independently to the HMV home page, see Heat Map Viewer documentation. For HMV documentation, see https://cgwb.nci.nih.gov/goldenPath/heatmap/documentation/index.html. The IGV viewer and the NCI Heat Map viewer both require you to install a version of Java containing Java Web Start. For more information, see #Java for IGV and Heat Map Viewer..

There are two ways to integrate caIntegrator with the Heat Map Viewer. To configure the connection, follow one of these methods.

...

Method 1 HMV

...

  1. With the appropriate study open, at the bottom of the Query Results page, click the View in Heat Map Viewer button.
  2. If you click the button at the bottom of the page with any of the query results line items selected, caIntegrator creates HMV files, with a monitor informing you of this. After the files are created, click the Launch Heat Map Viewer hypertext link.
  3. Follow the instructions through the intermediate dialog boxes. After clicking Open with the Java program listed, the runs, displaying the dataset in the computer screen. An example displays in the following figure.
    ”Data display in Heat Map Viewer”Image Removed

...

Method 2 HMV

...

  1. With the appropriate study open, click Heat Map Viewer on the left sidebar. This opens the View Heat Map Viewer Selector page, shown in the following figure.
    ”View Heat Map Selector page”Image Removed
  2. Select the appropriate Copy Number Platform in the drop down list.
  3. The Annotations - Default panel displays existing annotation fields for the gene expression data in the open study. Select one or more annotations in the annotation list. For convenience, you can use the Select All or Unselect All buttons.
  4. Click View to view the data you select in Heat Map Viewer. caIntegrator creates Heat Map Viewer files of the data.
  5. After the files are created, click the Launch Heat Map Viewer hypertext link that appears.
  6. Continue with Step 3 in #Method 1 HMV.
    Info
    titleNote

    For interpretation of the results and using HMV features, see the help files opened from HMV.

Java for IGV and Heat Map Viewer

To use the IGV and the NCI Heat Map viewer, described in #Viewing Data with the Integrative Genomics Viewer and #Viewing Data with Heat Map Viewer, you must install a version of Java containing Java Web Start. You must install recent versions of the Java Development Kit (JDK 1.5.0 aka JDK 5.0 or newer) or Java Runtime Environment (JRE 1.5.0 aka JRE 5.0 or newer). The easiest option is to install JRE 5.0. For more information, see http://www.java.com/en/download/faq/java_webstart.xml.

Without Java Web Start, when you click Launch Integrative Genomics Viewer or Launch Heat Map Viewer, a dialog box displays in your browser giving you the option to save or open with igv.jnlp (IGV) or retrieveFile.jnlp (HMV). Clicking the Open option starts the Java Web Start Launcher (default), installing the Java app so that you can view the files.

Info
titleNote

The first time you launch the IGV or HMV with Java properly installed, regardless of browser type, a warning may appear: the "the digital signature cannot be verified". Click Run to proceed with opening the viewer.

...

Scrollbar
iconsfalse