NIH | National Cancer Institute | NCI Wiki  

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Because this default is very taxing on the index processing at the end of the load, we recommend changing it to SEQUENTIAL_INTEGER for any terminology unless you have a priority need for Global Unique Identifiers. Note that this setting is final once any terminology is loaded. You can not change this after it is in effect. Even launching any LexEVS administrative command or opening the LexEVS administrative GUI will make this permanent. The only way to start over and change the setting is to change the lbconfig.props file, drop the database created for LexEVS, and recreate the database. If you are going to make the change this setting then do so before you do anything with LexEVS.

Database Connection setting for loading large terminologies

When large terminologies like NCI Metathesaurus is being loaded, the database connection might get suspended while waiting for the lucene indexing to complete. To mitigate this problem set the autoReconnect to true in the database URL in the lbconfig.props file. If it is MySQL, the URL would look like:

DB_URL=jdbc:mysql://bmidev4:3307/testLoad2010?autoReconnect=true

and you will have to restart MySQL database using following parameters:
--tmpdir /data/mysqltmp
--wait_timeout=100000

The tmpdir could be pointing to any directory but make sure it has enough space (in case NCI Metathesaurus, around 20Gb will be ideal). The --wait_timeout=100000 which is the number of seconds for 27 hours, will hold the connection for 27 hours.

Monitoring a load

While terminologies are being loaded, you can monitor the progress using the LexEVS logs (both 'load' and 'full' log) and if using MySQL, use INNODB tools to monitor Inserts per second. ( SHOW INNODB STATUS )

Setting a terminology as active

...