NIH | National Cancer Institute | NCI Wiki  

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: Migration of unmigrated content due to installation of a new plugin
Scrollbar
iconsfalse

...

Panel
titleContents of this Page
Table of Contents
minLevel2

...

  • This search is case in-sensitive. 
  • It searches on the double metaphone property value and literal property value.  
  •  The The  literal property part (without the wild cards) of the query is boosted by .5.  This gives a literal match priority.  
  • Parsing is done with the following analyzers:

    • dm_propertyValue - Uses our custom double metaphone analyzer.  This has the following filters:

      • LowerCaseFilter - for setting to lowercase
      • StopFilter - to remove stop words (the, a, etc.) from the search
      • DoubleMetaphoneFilter - for testing double metaphone sounds
    • literal_propertyValue - Uses our custom literal analyzer.  This literal analyzer uses Lucene's WhitespaceTokenizer with Lucene's

    StandardAnalyzer
    • LowerCaseFilter.

     

Example of use:

The following examples are based on the Automobiles coding scheme.

...

Lucene query: dm_propertyValue:"KR" literal_propertyValue:"car"^0.5

Complete query:

  • +*:* +(entityType:concept)
  • +*:* +isAnonymous:F
  • +*:* +(dm_propertyValue:"KR" literal_propertyValue:"car"^0.5) +isPreferred:T +(propertyType:presentation)

Result: 2 results

  • Result 1
    • entity code: C0001
    • entity description: Car
  • Result 2
    • entity code: C0002
    • entity description: Kar

...

Lucene query: dm_propertyValue:"JNRL KNRL MTRS" literal_propertyValue:"general motors"^0.5

Complete query:

...

Result: 1 result

  • entity code: GM
  • entity description: General Motors

...