For Recoll versions 1.18 and later, and when working with a raw index (not the default), searches can be made sensitive to character case and diacritics. How this happens is controlled by configuration variables and what search data is entered.
The general default is that searches are insensitive to case
and diacritics. An entry of resume
will match any
of Resume
, RESUME
,
résumé
, Résumé
etc.
Two configuration variables can automate switching on sensitivity:
- autodiacsens
If this is set, search sensitivity to diacritics will be turned on as soon as an accented character exists in a search term. When the variable is set to true,
resume
will start a diacritics-unsensitive search, butrésumé
will be matched exactly. The default value is false.- autocasesens
If this is set, search sensitivity to character case will be turned on as soon as an upper-case character exists in a search term except for the first one. When the variable is set to true,
us
orUs
will start a diacritics-unsensitive search, butUS
will be matched exactly. The default value is true (contrary toautodiacsens
).
As in the past, capitalizing the first letter of a word will turn off its stem expansion and have no effect on case-sensitivity.
You can also explicitely activate case and diacritics
sensitivity by using modifiers with the query
language. C
will make the term case-sensitive, and
D
will make it
diacritics-sensitive. Examples:
"us"C
will search for the term us
exactly
(Us
will not be a match).
"resume"D
will search for the term resume
exactly
(résumé
will not be a match).
When either case or diacritics sensitivity is activated, stem expansion is turned off. Having both does not make much sense.