NIH | National Cancer Institute | NCI Wiki  

Error rendering macro 'rw-search'

null

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

This section describes the steps to download and install a full version of the NCI Thesaurus for the LexEVS Service.

Note
titleNote

The NCI Thesaurus has grown large enough that it can no longer be loaded on many typical desktop machines. We recommend a 64-bit operating system running on a multiprocessor computer with a minimum of 4g of memory. Server class Linux machines are the typical target for these loads. The time to load NCI Thesaurus will vary depending on machine, memory, and disk speed. Expect a couple of hours for a higher end machine.

Step

Action

1

Using a web or ftp client go to the URL: ftp://ftp1.nci.nih.gov/pub/cacore/EVS/ screenshot of FTP directory

2

Select the version of NCI Thesaurus OWL you wish to download. Save the file to a directory on your machine.

3

Extract the OWL file from the zip download and save in a directory on your machine. This directory will be referred to as NCI_THESAURUS_DIRECTORY

4

Using the LexEVS utilities load the NCI Thesaurus:

Code Block
cd {LEXEVS_HOME}/admin

For Windows installation use the following command:

Code Block
LoadOWL.bat –in "file:///{NCI_THESAURUS_DIRECTORY}/Thesaurus_10.10d.owl

For Linux installation use the following command:

Code Block
LoadOWL.sh –in "file:///{NCI_THESAURUS_DIRECTORY}/Thesaurus_10.10d.owl
Note
titleNote

The NCI Thesaurus has grown large enough that it can no longer be loaded on many typical desktop machines. We recommend a 64-bit operating system running on a multiprocessor computer with a minimum of 4g of memory. Server class Linux machines are the typical target for these loads. The time to load NCI Thesaurus will vary depending on machine, memory, and disk speed. Expect a couple of hours for a higher end machine.

Example output from load of NCI Thesaurus 05.12f

...

This section describes the steps to download and install a full version of the NCI Metathesaurus for the LexEVS Service.

Note
titleNote

NCI Metathesaurus contains many individual vocabularies some of which are large vocabularies in and of themselves. It requires many hours to load and index. It can require 36 hours on a multiprocessor machine with 6g plus memory. The total time to load NCI MetaThesaurus will vary depending on machine, memory, and disk speed. Because this loader uses a batch loading strategy it is less dependent on memory, but some users will see 3 or 4 day load times with average multiprocessor processing power.

Step

Action

1

Using a web or ftp client go to the URL: ftp://ftp1.nci.nih.gov/pub/cacore/EVS/ screenshot of FTP directory

2

Select the version of NCI Metathesaurus RRF you wish to download. Save the file to a directory on your machine.

3

Extract the RRF files from the zip download and save in a directory on your machine. This directory will be referred to as NCI_METATHESAURUS_DIRECTORY.

Info
titleNote

RELASE_INFO.RRF is required to be present for the load utility to work.

4

Using the LexEVS utilities load the NCI Thesaurus:

Code Block
{LEXEVS_HOME}/admin

For Windows installation use the following command:

Code Block
LoadMetaBatch.bat –in "file:///{NCI_METATHESAURUS_DIRECTORY}/"

For Linux installation use the following command:

Code Block
LoadMetaBatch.sh –in "file:///{NCI_THESAURUS_DIRECTORY}/"

...

titleNote

...

Resuming Loads

Since this loader is resource hungry we provide the option to restart should you find your resource settings to be inadequate. Resuming loads which have crashed or been interrupted by server problems is possible using the ResumeBatchLoad script set.

...