Page History
...
Note | ||
---|---|---|
| ||
The NCI Thesaurus has grown large enough that it can no longer be loaded on many typical desktop machines. We recommend a 64-bit operating system running on a multiprocessor computer with a minimum of 4g of memory. Server class Linux machines are the typical target for these loads. The time to load NCI Thesaurus will vary depending on machine, memory, and disk speed. Expect a couple of hours for a higher end machine. Take a look at Best Practices for recommendations for loading large terminologies like NCI Thesaurus in optimal way. |
Step | Action | ||||||
---|---|---|---|---|---|---|---|
1 | Using a web or ftp client go to the URL: ftp://ftp1.nci.nih.gov/pub/cacore/EVS/ | ||||||
2 | Select the version of NCI Thesaurus OWL file you wish to download. Save the file to a directory on your machine. | ||||||
3 | Extract the OWL file from the ZIP download and save in a directory on your machine. This directory will be referred to as NCI_THESAURUS_DIRECTORY in script examples. | ||||||
4 | Create Manifest and Preferences files. (optional)
| ||||||
5 | Using the LexEVS command line, load the NCI Thesaurus with no options:
For Windows installation use the following command:
For Linux installation use the following command:
This should work best with a "by code" type Thesaurus source terminology. | ||||||
6 | Using the LexEVS command line, load the NCI Thesaurus with options:
For Windows installation use the following command:
For Linux installation use the following command:
|
Example output from load of NCI Thesaurus 05.12f
Code Block |
---|
…
[LexBIG] Processing TOP Node... Retired_Kind
[LexBIG] Clearing target of NCI_Thesaurus...
[LexBIG] Writing NCI_Thesaurus to target...
[LexBIG] Finished loading DB - loading transitive expansion table
[LexBIG] ComputeTransitive - Processing Anatomic_Structure_Has_Location
[LexBIG] ComputeTransitive - Processing Anatomic_Structure_is_Physical_Part_of
[LexBIG] ComputeTransitive - Processing Biological_Process_Has_Initiator_Process
[LexBIG] ComputeTransitive - Processing Biological_Process_Has_Result_Biological_Process
[LexBIG] ComputeTransitive - Processing Biological_Process_Is_Part_of_Process
[LexBIG] ComputeTransitive - Processing Conceptual_Part_Of
[LexBIG] ComputeTransitive - Processing Disease_Excludes_Finding
[LexBIG] ComputeTransitive - Processing Disease_Has_Associated_Disease
[LexBIG] ComputeTransitive - Processing Disease_Has_Finding
[LexBIG] ComputeTransitive - Processing Disease_May_Have_Associated_Disease
[LexBIG] ComputeTransitive - Processing Disease_May_Have_Finding
[LexBIG] ComputeTransitive - Processing Gene_Product_Has_Biochemical_Function
[LexBIG] ComputeTransitive - Processing Gene_Product_Has_Chemical_Classification
[LexBIG] ComputeTransitive - Processing Gene_Product_is_Physical_Part_of
[LexBIG] ComputeTransitive - Processing hasSubtype
[LexBIG] Finished building transitive expansion - building index
[LexBIG] Getting a results from sql (a page if using mysql)
[LexBIG] Indexed 0 concepts.
[LexBIG] Indexed 5000 concepts.
[LexBIG] Indexed 10000 concepts.
[LexBIG] Indexed 15000 concepts.
[LexBIG] Indexed 20000 concepts.
[LexBIG] Indexed 25000 concepts.
[LexBIG] Indexed 30000 concepts.
[LexBIG] Indexed 35000 concepts.
[LexBIG] Indexed 40000 concepts.
[LexBIG] Indexed 45000 concepts.
[LexBIG] Indexed 46000 concepts.
[LexBIG] Getting a results from sql (a page if using mysql)
[LexBIG] Closing Indexes Mon, 27 Feb 2006 01:44:22
[LexBIG] Finished indexing
|
...
Step | Action | ||||||
---|---|---|---|---|---|---|---|
1 | Using a web or ftp client go to the URL: http://ncicb.nci.nih.gov/download/evsportal.jsp | ||||||
2 | Select the version of NCI Metathesaurus RRF you wish to download. There may only be one. Save the file to a directory on your machine. | ||||||
3 | Extract the files from the ZIP download and save to a directory on your machine. This directory will be referred to as NCI_METATHESAURUS_DIRECTORY. RELASE_INFO.RRF is required to be present for the load utility to work. | ||||||
4 | Check that you are able to open a large number of files before starting the load.
Usually having around 10,000 available open files is sufficient. If your limit is set to low this will need to be raised. | ||||||
5 | Using the LexEVS utilities load the NCI Thesaurus:
For Windows installation use the following command:
For Linux installation use the following command:
|
...