![]() Starting with Solr, the primary mechanism for delivering spelling corrections is through a Search Component called the SpellCheckComponent. There are several working parts to the spell checker, some in Solr and some in Lucene. ![]() To understand spell checking in Solr, it is helpful to know a bit more about what is going on underneath the hood. For instance, it may be the case that a word is so often misspelled by writers in your corpus that the best suggestion just might be an incorrectly spelled word, even if the user’s original query was properly spelled! For some background on building the foundation of a spell checker, see Peter Norvig’s excellent post. It’s not that we don’t want to correct misspelled words, it’s more that we want to give suggestions for words that will lead to better results based on the way things are spelled in the index as well as other factors like past user behavior, the “correct” spelling of the word and any other apriori information, such as business goals, we might have. Recently, I did some minor work on improving the usability of the Lucene spell checker (see LUCENE-2479, LUCENE-2608 and the associated Solr work) and it got me thinking that a post on spell checking in Solr would be useful.įor those who aren’t familiar, the notion of spell checking in search (often called Did You Mean?) is slightly different from the notion of simply correcting spelling errors.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |