NIH | National Cancer Institute | NCI Wiki  

The information and links on this page are no longer being updated and are provided for reference purposes only.


The NCI instance of the caArray database is now retired. For details about the retirement, see the caArray Retirement Announcement.

Contents of this Page
At a Glance Details
  • Version Number and Release Date: 2.5.3 May 2014
  • Primary audience: Microarray data producers and application developers
  • Grid Enabled? Yes
  • Installation Level: Intermediate - technical assistance may be required, download may require supporting infrastructure or software
  • System Requirements: Refer to the System Requirements section.

Tool Overview

caArray logo caArray is an open-source, web and programmatically accessible microarray data management system that supports the annotation of microarray data using MAGE-TAB Exit Disclaimer logo and web-based forms. Data and annotations may be kept private to the owner, shared with user-defined collaboration groups, or made public.

Installation and Downloads


System Requirements

While caArray is a web-based application, it does have a dependent software stack, which includes: MySQL 5.0.27 database; JBoss 5.1. (hosts the caArray grid service and the caArray application); and JEMS installer 1.0.2 GA (supports EJB 3.0 specification)

Dependent Software



J2SE Development Kit

6.0 (J2SE 6.0)

Apache Ant




JBoss for UPT


JBOSS for Grid Service


JBOSS for caArray application


GitHub Repositories

Support for caArray

You can contact the NCI CBIIT Application Support if you have any questions or concerns.


For a list of articles that discuss caArray, visit caArray Citations.

Integration with other tools

Integration with geWorkbench

geWorkbench can query for data in an instance of caArray via a Java API. Several operations are supported. You can:

  1. Query for experiments, for example by platform or by species.
  2. Select from which hybridizations in an experiment you wish to download data.
  3. Download derived (summarized, probeset-level) data that has been parsed into the database. An example is the values loaded from Affymetrix CHP files (created through GCOS/MAS5). geWorkbench does not currently support retrieving or analyzing Affy CEL files directly.

The current release of geWorkbench, version 2.5.0, connects with caArray 2.5.0 and 2.5.1
For more information, visit the geWorkbench project page. Exit Disclaimer logo

The software can be downloaded.

Integration with caIntegrator

caIntegrator can extract the genomic data from caArray hosted on either a private server can:

  1. Extract the genomic data into caIntegrator.
  2. Integrate the genomic data with the clinical data and imaging data in caIntegrator.
  3. Perform genomic data analysis with the built-in GenePattern Tool within caIntegrator.

The current release of caIntegrator, version 1.4, works with caArray 2.5.

To install a local copy, download a copy of the software.

Integration with GenePattern

GenePattern has a module that can retrieve data from caArray using its Java API. The caArray2.3.0 Importer lets you retrieve raw data files from caArray by experiment name or id. You can specify the extension of the data file that you want to download. The default is .CEL.

You can process CEL files using the GenePattern module ExpressionFileCreator. Once you have used ExpressionFileCreator, the data can be further analyzed using many different GenePattern analysis modules.

The caArray 2.3.0 Importer module is compatible with caArray v2.3.x. You may download the caArray2.3.0Importer.

The caArray 2.1.0 Importer module is compatible with caArray v2.1.x and v2.2.0. You may download the caArray2.1.0Importer.

The GenePattern team has recently devleoped caArray2ImportViewer, a visualizer to import data from caArray into GenePattern. It is compatible with caArray 2.4.0.x. Please see the documentation to learn more.

You may also download GenePattern. Exit Disclaimer logo

Once you've fetched your microarray data from caArray from any of the above versions of the caArrayImporter, you can use that data in any GenePattern module or pipeline that accepts gene expression data. To view compatible modules from within GenePattern you can click the context arrow next to the imported file (the result file of your caArrayImporter job in Recent Jobs) to see which modules can accept this dataset.

Help Downloading Files

For help accessing PDF, audio, video, and compressed files on this wiki, go to Help Downloading Files.

  • No labels