NIH | National Cancer Institute | NCI Wiki  

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Wiki Markup
{scrollbar:icons=false}

h1. Question: What are MAGE-TAB Files?

*Topic*: caArray Usage

*Release*: caArray 2.X

*Date entered*: 02/12/2009

Topics on this page include the following:
{toc:minLevel=2}

h2. Answer

MAGE-TAB (MicroArray Gene Expression Tabular) format files refer to simple tab-delimited, spreadsheet-based files, which can be used for annotating and communicating microarray data in a [MIAME compliant fashion|http://www.mged.org/Workgroups/MIAME/miame.html]. MAGE-TAB specification is based on the [Microarray and Gene Expression|http://www.mged.org/Workgroups/MAGE/mage.html] Object Model ([MAGE-OM|http://www.mged.org/Workgroups/MIAME/miame_mage-om.html]). MAGE-TAB [specification|http://www.mged.org/mage-tab/spec1.0.html] and related [publications|http://www.biomedcentral.com/1471-2105/7/489] provide more details on the format.

The following table summarizes the definition of each MAGE-TAB format file. 

NoteFor more thatinformation asee [MAGE-TAB format Array Design File (ADF) is not mandatory in caArray, since array design files for the common arrays are usually available from their respective array vendors. If an array design file (which may not be in MAGE-TAB format) is available from its array provider, you should choose it over an ADF. Third party array design files are uploaded using the [Manage Array Design --Overview|https://wiki.nci.nih.gov/x/Nxo9Ag#6Oxo9Ag#7-CurationToolsSubmittingDatatoanExperiment-ManagingArrayDesigns] interface under caArray's *Curation* tab. An ADF file, on the other hand, can be uploaded together with the rest of MAGE-TAB files. An ADF file is not be validated or parsed by caArray. It will be imported directly into caArray according to the table that follows.MAGETABincaArrayOverview]

{text-extractor}{multi-excerpt-include:MAGE-TAB_definitions |name=relationship|nopanel=true}

h3. MAGE-TAB Formatted Files
|| Abbreviation || File Type || Description || caArray compatible? || Processed by caArray? ||
| IDF | Investigation Design File | Provides an overview of the experiment, including the experimental variables (factors) used,  protocols, quality control strategy, publication information and contact details | Yes | Yes: Parsed, Validated before import |
| SDRF | Sample Data Relationship File | Describes relationships between samples, arrays, data files, protocols,imported factorinto values etc. It is a table in which each hybridization channel is represented by a row, and columns represent the steps of the experiment. The ordering of these columns is important, and reads left-to-right in chronological order. | Yes | Yes: Parsed, Validated before import |
| ADF | Array Design File | Provides the array-level annotation for the experiment. It relates the row-level identifiers in the caARRAY map to data files to biological sequence annotation | Yes | No. Directly Import |
| TXT or other | Data Matrix | Contains processed array data files in tab-delimited text format. Rows may represent genes/ exons/ genomic locations. Columns represent samples or experimental conditions. | Yes | No. Directly Import |

h3. What are MAGE-TAB Files?

The term of "MAGE-TAB Files" (refer to [caArray 002 - How do I upload MicroArray Gene Expression Data into caArray?|caArray 002 - How do I upload MicroArray Gene Expression Data into caArray?], step 2 for an example), has been used to refer not only MAGE-TAB formatted files as summarized in the table, but also files that are supported by MAGE-TAB parser mentioned in the last section. To be more specific, MAGE-TAB files also include the third party's raw array data files, derived data files and array design files from array providers, as shown in the illustration.

In summary, MAGE-TAB files map to each other. Together they represent the complete experiment.

also imported into the same caARRAY experiment. The following figure illustrates the process.

{color:red} replacing this diagram{color}
*Diagram Identifying Content of MAGE-TAB Files*
!MAGE-TAB-Files.jpg|align=center,alt="Diagram Identifying Content of MAGE-TAB Files"!

_According to the figure above, MAGE-TAB files can be divided into two categories: array data files, which are in ASCII or binary format, and descriptive files, which contain information about array design and investigation data. These two categories can be further subdivided as follows:_


_Array data files can either be raw (i.e., unprocessed) data files whose format is specified by a third-party vendor, or they can be derived (i.e., normalized) data files. The latter are either generic data in MAGE-TAB format, or Affymetrix .CHP files that are not in MAGE-TAB format but can still be parsed by caArray._


_Descriptive files can contain information about investigation data or array design. Investigation data files conform to one of two specifications: Investigation Description File (IDF) or Sample and Data Relationship File (SDRF). Array design files conform to one of two specifications: MAGE-TAB formatted design files with a .ADF file extension, or non-MAGE-TAB formatted design files from third-party vendors that can still be parsed by caArray._

h3. Building MAGE-TAB Formatted files

Because of the varied nature of every caArray experiment and its corresponding data, it is impossible to outline the exact steps for creating a caArray-compatible MAGE-TAB file. There are strict guidelines, however, for characteristics of MAGE-TAB files that meet the criteria. Refer to [Appendix A - MAGE-TAB in caArray|https://wiki.nci.nih.gov/x/Rho9Ag] in the caArray User's Guide for specific details regarding caArray-compatible MAGE-TAB files. To provide context, see the [MAGE-TAB Specification document|http://www.mged.org/mage-tab/spec1.0.html].

To get started, you can generate a MAGE-TAB template file from [EMBL-EBI's MAGE TAB site|http://www.mged.org/mage-tab/tools.html], or create your own IDF and SDRF files based on the [Sourceforge MAGE-TAB documentation|http://tab2mage.sourceforge.net/docs/magetab_docs.html].

Be sure and review the details in [Appendix A - MAGE-TAB in caArray|https://wiki.nci.nih.gov/x/Rho9Ag] to ensure that your MAGE-TAB files meet the specifications for compatibility with caArray.

For more information about when to use MAGE-TAB annotation files in caArray, refer to [caArray 008 - How do I use the Annotation Tab versus MAGE-TAB Annotation Files in caArray?|caArray 008 - How do I use the Annotation Tab versus MAGE-TAB Annotation Files in caArray?].  For more information on how to upload MAGE-TAB files, refer to [Importing MAGE_TAB Files|https://wiki.nci.nih.gov/x/Oxo9Ag#7-SubmittingDatatoanExperiment-ImportingMAGETABData] in the caArray User's Guide.

h2. Have a comment?

Please leave your comment in the [caArray End User Forum|https://cabig-kc.nci.nih.gov/Molecular/forums/viewtopic.php?f=6&t=577].

{scrollbar:icons=false}