NIH | National Cancer Institute | NCI Wiki  

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  • This search is case in-sensitive. 
  • It only searches on the untokenizedLCPropertyValue and the property value.
  • The literal property part of the query is boosted by 50.  This gives a literal match priority.
  • A trailing wild card is on the term (but no leading wild card) and the term can appear at any position.
  • Lowercase and special characters removed during query parser parse.
  • Parsing is done with the following analyzers: 
    • untokenizedLCPropertyValue - Analyzers are not applied to property value.  However, the expression is lower cased (this is an explicit step done outside of Lucene by LexEVS code). 
    • literal_propertyValue - Uses our custom literal analyzer.  This literal analyzer uses Lucene's WhitspaceTokenizer WhitespaceTokenizer with Lucene's LowerCaseFilter.

...