NIH | National Cancer Institute | NCI Wiki  

Question: How do I add an organism to the Organism List in caArray?

Topic: caArray Usage

Release: Up to caArray 2.2

Date entered: 03/06/2009

Details about the Question

I did not see my sample organism on the Organism list. How can I add mine to the list?

Answer

Organism List

"Organism" represents the source of sample biomaterial used in an experiment. It is one of the mandatory fields describing each caArray experiment. An initialization script, which is part of the caArray installation package, will insert the most commonly used organisms into caArray's database table: Organism. The organisms can then be selected from two drop-down lists in following sections:

  • Create New Experiment
  • Import New Array Design

Manually Adding Organism

If the organism list in caArray does not contain the organism for your samples, there are two other ways to add new organisms into caArray.

  • Any new organism defined in the MAGE-TAB Annotation files (caArray 007 - What are the MAGE-TAB Files?) that are used to annotate experiments will be automatically added to caArray's database during the data file import. The organism is defined in MAGE-TAB's SDRF file following the sample source definition. It has the column heading "Characteristics[MATKC:Organism].
  • New organisms can also be manually added at the database level. The SQL statement below is a simple insert statement that can be used to add a new entry to Organism table:
 insert into organism
   (ncbi_taxonomy_id, common_name, scientific_name, taxonomy_rank, term_source)
 select 9606, 'human', 'Homo sapiens', 'species', id
 from term_source where term_source.name='ncbitax';

There are two fields that need to be clarified in this SQL statement:

  • term_source: This is a numeric field. It is really the term source identifier, ID, from term_source table. In the example above, the scientific name is defined in the ncbitax taxonomy, a known source already defined in the term_source table. If your taxonomy source is not in the Term_Source table yet, it needs to be added into the Term_Source table prior to executing this insert statement.
  • ncbi_taxonomy_id: it is the taxonomy ID used by ncbitax.

Prior to any database modification, a database backup is highly recommended.

Have a comment?

Please leave your comment in the caArray End User Forum.