Saturday, March 10, 2018

FW: Solr search engine configuration

-----Original Message-----
From: PeterKerk [mailto:petervdkerk@hotmail.com]
Sent: 11 March 2018 03:37
To: solr-user@lucene.apache.org
Subject: Solr search engine configuration

Since Google's onsite search reaches end of life on April 1, 2018, I'm trying to set up my own onsite search engine that indexes my site's content and makes it searchable.

My data config successfully loads data from my database (products, companies, blogs) into the fields.

I then try to search in both the title and the description fields, with weights. For example, when users search for "dieren" (Dutch for "animals"), I use:

&q=(title_search_global:(dieren) OR
description_search_global:(dieren))&qf=title_search_global+title_exactmatch^1000+description_search_global+description_exactmatch^100

I get results with "dieren", "huisdieren", but I also get undesired results with "manieren" and "versieren".

What I want is to find text using the following logic (all case insensitive):


Exact match "dieren": boost result with 1000
Partial match "huisdieren": boost result with 500
Stem match "dier": boost result with 100
Stem partial match "huisdier": boost result with 70
Other partial matches "die": boost result with 10
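
To make the target concrete, here is a rough sketch of the request I think I am after, written in Python only to build the parameters. It assumes the eDisMax query parser (defType=edismax), where qf takes one field^boost pair per field, and it assumes one indexed field per match type. Only title_exactmatch comes from my current query; title_partial, title_stem, title_stem_partial and title_ngram are hypothetical names that do not exist in my schema yet, and how to analyze them is exactly the part I can't figure out.

```python
from urllib.parse import urlencode

# Boost tiers from the list above. Only "title_exactmatch" appears in my
# current query; the other field names are hypothetical placeholders for
# fields that would each need their own analyzer chain in schema.xml.
BOOSTS = {
    "title_exactmatch": 1000,    # exact match: "dieren"
    "title_partial": 500,        # partial match: "huisdieren" (hypothetical)
    "title_stem": 100,           # stem match: "dier" (hypothetical)
    "title_stem_partial": 70,    # stem partial match: "huisdier" (hypothetical)
    "title_ngram": 10,           # other partial matches: "die" (hypothetical)
}


def build_params(term, boosts=BOOSTS):
    """Return Solr request parameters that weight each field by its tier."""
    # qf is a space-separated list of field^boost pairs.
    qf = " ".join(f"{field}^{weight}" for field, weight in boosts.items())
    return {"defType": "edismax", "q": term, "qf": qf}


def build_query_string(term):
    """URL-encode the parameters for use after /select? in a request."""
    return urlencode(build_params(term))


print(build_query_string("dieren"))
```

This only expresses the weights. Whether "manieren" and "versieren" stop matching depends on how the low-boost fields are tokenized, which the sketch does not address.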

My current schema.xml is here: http://www.telefonievergelijken.nl/schema.xml
I tried the Solr admin tool for tokenization, but I can't figure out how to get to the above logic.
I also Googled for an example Solr schema.xml configuration for building your own onsite search engine, and I'm really surprised I couldn't find one.



--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html
