NIH | National Cancer Institute | NCI Wiki  

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

The extent of text match algorithms in LexEVS has grown quite a lot over the decade the application has been in existence.   Many matching algorithms overlap in their functionality and dependencies.  We've created a review of each of these algorithms with notes on their index dependencies and search focus with an eye towards simplifying and updating the search functionality.  NCI should review and decide if any of these can be removed or updated.

Current Text Matches

  • Lucene Query
  • phrase
  • contains
  • leading and trailing wild card
  • exact match
  • substring
  • spelling error tolerant substring match
  • stemmed lucene query
  • literal contains
  • starts with
  • non leading wild card literal substring
  • literal
  • Weighted double metaphone lucene query
  • literal substring
  • Double metaphone lucene query
  • Regular expression

...