NIH | National Cancer Institute | NCI Wiki  

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Code Block
languagejava
titleCurrent Analyzer Classes
//Wraps the other Analyzers. This is a Lucene class and it's implementation has been updated to
//add analyzers during initialization (In the constructor).  This will need to be adjusted in our 
//use of it.
org.apache.lucene.analysis.PerFieldAnalyzerWrapper

//This analyzer is an implementation of a similar class in Lucene.  Presumably we are using 
//custom lists of characters to escape or otherwise treat as white space. A number of other 
//custom analyzers rely on this class for a token stream.  It's central to the implementaion.
edu.mayo.informatics.indexer.lucene.analyzers.WhiteSpaceLowerCaseAnalyzer

//This would be a good candidate for replacement by a standard Lucene analyzer, but it is fed
//a token stream by the WhiteSpaceLowerCaseAnalyzer before it does it's processing so we'll make
//the necessary adaptations
edu.mayo.informatics.indexer.lucene.analyzers.SnowballAnalyzer

//Appears to be a fully custom analyzer that acts as a utility for other analyzers.  
//Not sure if we ever encode with it, but it is initialized
//at query time.  It may be a candidate for removal.
edu.mayo.informatics.indexer.lucene.analyzers.FieldSkippingAnalyzer

//AppearsNot to standdependent on it's own without using WhiteSpaceLowerCaseAnalyzer. Not sure that
//It can be replaced, but SimpleAnalyzer in the Lucene libraries might be a candidateWraps a Lucene KeywordAnalyzer class.
edu.mayo.informatics.indexer.lucene.analyzers.StringAnalyzer

//Encodes for DoubleMetaphone.  Would replace with Lucene implementation of a Double Metaphone Analyzer
//if possible but this is also fed a WhiteSpaceLowerCaseAnalyzer token stream for processing
edu.mayo.informatics.indexer.lucene.analyzers.EncoderAnalyzer

//This normalizing analyzer appears to be disabled in the LuceneLoaderCode class.
//We should remove it from the code path if it is not being used.
edu.mayo.informatics.indexer.lucene.analyzers.NormAnalyzer


Â