Loading...
 

Tikiwiki-devel (mailman list mirror)


Tracker -> field content with spaces not searchable with UnifiedSearchIndex+MySQL-FulltextSearch?

posts: 1545

Hello everybody,

I have a project, where we collect specific types court verdicts in a
tracker.

One field is the case number and this must be searchable.

All over the place (documents, websites etc.) the case numbers have a
specific syntax which contains spaces.

It is important that we use the same syntax.

Example:

case number (=tracker field content - field type: text field):
120 ABC 13/01

When I use this, the tiki-searchindex.php search find 0 results.

Would I rename the case number to this:
120_ABC_13/01

the search finds 1 result (the correct tracker item).

BUT then it is not possible to use externally researched case numbers
copy/paste to our tracker and additionally nobody could find related
documents in the internet by copy/pasting the case number from our site
to a search engine like google.
It is really essential, that we stick to the correct syntax.

Is there any setting I am missing or how can I make the correct syntax
"120 ABC 13/01" searchable?
I guess that "120 ABC 13/01" is not recognised as one single string.

Hints appreciated.

Regards,
Torsten

------------------------------------------------------------------------------
Attend Shape: An AT&T Tech Expo July 15-16. Meet us at AT&T Park in San
Francisco, CA to explore cutting-edge tech and listen to tech luminaries
present their vision of the future. This family event has something for
everyone, including kids. Get more information and register today.
http://sdm.link/attshape
___
TikiWiki-devel mailing list
TikiWiki-devel at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/tikiwiki-devel

posts: 126261

Hi Torsten,

I think you're hitting the mysql fulltext search minimum word length ft_min_word_len, which defaults to 4 (you're using the mysql unified engine i presume?) and it looks like you need to be 2.

You can change that in your my.cnf file if you have access to that, instructions are here:

https://dev.mysql.com/doc/refman/5.6/en/fulltext-fine-tuning.html

If you can't change the mysql setup you'll need to switch to using elastic search (preferably, but that needs installing or renting as a service) or use the PHP Lucene as a last resort.

HTH

jonny



> On 24 Jun 2016, at 17:53, Torsten <torsten@tiki.org> wrote:
>
> Hello everybody,
>
> I have a project, where we collect specific types court verdicts in a
> tracker.
>
> One field is the case number and this must be searchable.
>
> All over the place (documents, websites etc.) the case numbers have a
> specific syntax which contains spaces.
>
> It is important that we use the same syntax.
>
> Example:
>
> case number (=tracker field content - field type: text field):
> 120 ABC 13/01
>
> When I use this, the tiki-searchindex.php search find 0 results.
>
> Would I rename the case number to this:
> 120_ABC_13/01
>
> the search finds 1 result (the correct tracker item).
>
> BUT then it is not possible to use externally researched case numbers
> copy/paste to our tracker and additionally nobody could find related
> documents in the internet by copy/pasting the case number from our site
> to a search engine like google.
> It is really essential, that we stick to the correct syntax.
>
> Is there any setting I am missing or how can I make the correct syntax
> "120 ABC 13/01" searchable?
> I guess that "120 ABC 13/01" is not recognised as one single string.
>
> Hints appreciated.
>
> Regards,
> Torsten
>
> ------------------------------------------------------------------------------
> Attend Shape: An AT&T Tech Expo July 15-16. Meet us at AT&T Park in San
> Francisco, CA to explore cutting-edge tech and listen to tech luminaries
> present their vision of the future. This family event has something for
> everyone, including kids. Get more information and register today.
> http://sdm.link/attshape
> ___
> TikiWiki-devel mailing list
> TikiWiki-devel at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/tikiwiki-devel
>


------------------------------------------------------------------------------
Attend Shape: An AT&T Tech Expo July 15-16. Meet us at AT&T Park in San
Francisco, CA to explore cutting-edge tech and listen to tech luminaries
present their vision of the future. This family event has something for
everyone, including kids. Get more information and register today.
http://sdm.link/attshape
___
TikiWiki-devel mailing list
TikiWiki-devel at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/tikiwiki-devel

posts: 1545

Thank you Jonny for your indeed very helpful and thoughtful answer.

Best regards,
Torsten

On 25.06.2016 12:25, Jonny Bradley wrote:
> Hi Torsten,
>
> I think you're hitting the mysql fulltext search minimum word length ft_min_word_len, which defaults to 4 (you're using the mysql unified engine i presume?) and it looks like you need to be 2.
>
> You can change that in your my.cnf file if you have access to that, instructions are here:
>
> https://dev.mysql.com/doc/refman/5.6/en/fulltext-fine-tuning.html
>
> If you can't change the mysql setup you'll need to switch to using elastic search (preferably, but that needs installing or renting as a service) or use the PHP Lucene as a last resort.
>
> HTH
>
> jonny
>
>
>
>> On 24 Jun 2016, at 17:53, Torsten <torsten@tiki.org> wrote:
>>
>> Hello everybody,
>>
>> I have a project, where we collect specific types court verdicts in a
>> tracker.
>>
>> One field is the case number and this must be searchable.
>>
>> All over the place (documents, websites etc.) the case numbers have a
>> specific syntax which contains spaces.
>>
>> It is important that we use the same syntax.
>>
>> Example:
>>
>> case number (=tracker field content - field type: text field):
>> 120 ABC 13/01
>>
>> When I use this, the tiki-searchindex.php search find 0 results.
>>
>> Would I rename the case number to this:
>> 120_ABC_13/01
>>
>> the search finds 1 result (the correct tracker item).
>>
>> BUT then it is not possible to use externally researched case numbers
>> copy/paste to our tracker and additionally nobody could find related
>> documents in the internet by copy/pasting the case number from our site
>> to a search engine like google.
>> It is really essential, that we stick to the correct syntax.
>>
>> Is there any setting I am missing or how can I make the correct syntax
>> "120 ABC 13/01" searchable?
>> I guess that "120 ABC 13/01" is not recognised as one single string.
>>
>> Hints appreciated.
>>
>> Regards,
>> Torsten
>>
>> ------------------------------------------------------------------------------
>> Attend Shape: An AT&T Tech Expo July 15-16. Meet us at AT&T Park in San
>> Francisco, CA to explore cutting-edge tech and listen to tech luminaries
>> present their vision of the future. This family event has something for
>> everyone, including kids. Get more information and register today.
>> http://sdm.link/attshape
>> ___
>> TikiWiki-devel mailing list
>> TikiWiki-devel at lists.sourceforge.net
>> https://lists.sourceforge.net/lists/listinfo/tikiwiki-devel
>>
>
> ------------------------------------------------------------------------------
> Attend Shape: An AT&T Tech Expo July 15-16. Meet us at AT&T Park in San
> Francisco, CA to explore cutting-edge tech and listen to tech luminaries
> present their vision of the future. This family event has something for
> everyone, including kids. Get more information and register today.
> http://sdm.link/attshape
> ___
> TikiWiki-devel mailing list
> TikiWiki-devel at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/tikiwiki-devel


------------------------------------------------------------------------------
Attend Shape: An AT&T Tech Expo July 15-16. Meet us at AT&T Park in San
Francisco, CA to explore cutting-edge tech and listen to tech luminaries
present their vision of the future. This family event has something for
everyone, including kids. Get more information and register today.
http://sdm.link/attshape
___
TikiWiki-devel mailing list
TikiWiki-devel at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/tikiwiki-devel


Why Register?

Register at tiki.org and you'll be able to use the account at any *.tiki.org site, thanks to the InterTiki feature. A valid email address is required to receive site notifications and occasional newsletters. You can opt out of these items at any time.