NIH | National Cancer Institute | NCI Wiki  

Comment: Migration of unmigrated content due to installation of a new plugin
...

Equivalent to '* term* *'. In other words, the term carries a trailing wildcard (but no leading wildcard) and can appear at any position.

Algorithm:

The contains Starts With search has the following characteristics:

  • This search is case-insensitive.
  • It only searches on the untokenizedLCPropertyValue and the property value.
  • The literal property part of the query is boosted by 50.  This gives a literal match priority.
  • A trailing wild card is on the term (but no leading wild card) and the term can appear at any position.
  • The search text is lowercased and special characters are removed when the query parser parses it.
  • Parsing is done with the following analyzers: 
    • untokenizedLCPropertyValue - Analyzers are not applied to the property value.  However, the expression is lowercased (this is an explicit step done outside of Lucene's StandardAnalyzer by LexEVS code).
    • literal_propertyValue - Uses our custom literal analyzer.  This literal analyzer uses Lucene's WhitespaceTokenizer with Lucene's LowerCaseFilter.
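
The characteristics above can be sketched as a small string-building routine. This is only an illustration of how the query text shown in the examples on this page could be assembled; it is not the LexEVS implementation. The function name `build_query` is hypothetical, and the single-term versus multi-term rendering is an assumption inferred from the two example queries on this page.

```python
def build_query(search_text: str) -> str:
    """Render a Lucene-style query string for the search described above.

    Hypothetical helper: mirrors the textual form of the example queries,
    not the actual LexEVS code path.
    """
    # Explicit lowercasing step (done outside of any Lucene analyzer).
    lowered = search_text.strip().lower()
    if not lowered:
        raise ValueError("search text must not be empty")

    # Required clause: trailing wildcard on the untokenized, lowercased value.
    wildcard_clause = f"+untokenizedLCPropertyValue:{lowered}*"

    # Literal clause: split on whitespace only (as a whitespace tokenizer
    # followed by a lowercase filter would), then boost by 50.
    tokens = lowered.split()
    if len(tokens) == 1:
        literal_clause = f"literal_propertyValue:{tokens[0]}^50.0"
    else:
        required = " ".join(f"+literal_propertyValue:{t}" for t in tokens)
        literal_clause = f"(({required})^50.0)"

    return f"{wildcard_clause} {literal_clause}"


if __name__ == "__main__":
    print(build_query("automob"))
    print(build_query("Car (with special) charaters!"))
```

Because the literal clause is optional and boosted, a value that matches the literal field exactly would score higher than one that only matches the wildcard clause.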

Example of use:

The following examples are based on the Automobiles coding scheme.

...

Lucene query: +untokenizedLCPropertyValue:automob* literal_propertyValue:automob^50.0

Complete query:

  • +*:* +(entityType:concept)
  • +*:* +isAnonymous:F
  • +*:* +(+untokenizedLCPropertyValue:automob* literal_propertyValue:automob^50.0) +(propertyType:presentation)
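
As a rough sketch, the three parts of the complete query listed above can be produced by wrapping the text query with the restriction clauses. The function name `complete_queries` is hypothetical, and the fixed restrictions (concept entities, non-anonymous, presentation properties) are copied from this example only; other searches may use different restrictions.

```python
from typing import List


def complete_queries(text_query: str) -> List[str]:
    """Return the three query parts shown in the example above.

    Hypothetical helper: the restriction clauses are taken verbatim from
    the example and are assumptions for any other search.
    """
    return [
        "+*:* +(entityType:concept)",
        "+*:* +isAnonymous:F",
        f"+*:* +({text_query}) +(propertyType:presentation)",
    ]


if __name__ == "__main__":
    text = "+untokenizedLCPropertyValue:automob* literal_propertyValue:automob^50.0"
    for part in complete_queries(text):
        print(part)
```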

Result: 1 result

  • entity code: A0001
  • entity description: Automobile

...

Example 2:

Search string: Car (with special) charaters!

Lucene query: +untokenizedLCPropertyValue:car (with special) charaters!* ((+literal_propertyValue:car +literal_propertyValue:(with +literal_propertyValue:special) +literal_propertyValue:charaters!)^50.0)

Complete query:

... : ... + ... untokenizedLCPropertyValue:car (with special) charaters!* ((+literal_propertyValue:car +literal_propertyValue:(with +literal_propertyValue:special) +literal_propertyValue:charaters!)^50.0 ... )

Result: 1 result

  • entity code: C0001
  • entity description: Car

...