NIH | National Cancer Institute | NCI Wiki  

Error rendering macro 'rw-search'

null

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 5 Next »

Contents of this Page

 

Contains Algorithm Implementation Details

Equivalent to ' term* ' - in other words - a trailing wildcard on a term (but no leading wild card) and the term can appear at any position.   Searches on property value only.

Algorithm:

The contains search has the following characteristics:

  • It only searches on the property value. 
  • A trailing wild card is added to all tokens in the search text.
  • Lowercase and special characters removed during query parser parse
  • Parsing is done with Lucene's StandardAnalyzer.

Description of Algorithm:

 

Example of use:

Example based on the Automobiles coding scheme.

Search string: automob

Lucene query: +propertyValue:automob* literal_propertyValue:automob^50.0

Result: 1 result

  • entity code: A0001
  • entity description: Automobile

Implementation Details:

 

Associated JUnits:

Junits for contains tests can be found here: https://github.com/lexevs/lexevs/blob/master/lbTest/src/test/java/org/LexGrid/LexBIG/Impl/function/query/lucene/searchAlgorithms/TestContains.java

 

 

  • No labels