|
a/src/doc/man/recoll.conf.5 |
|
b/src/doc/man/recoll.conf.5 |
|
... |
|
... |
26 |
[~/somedirectory-with-utf8-txt-files]
|
26 |
[~/somedirectory-with-utf8-txt-files]
|
27 |
defaultcharset = utf-8
|
27 |
defaultcharset = utf-8
|
28 |
|
28 |
|
29 |
.fi
|
29 |
.fi
|
30 |
.LP
|
30 |
.LP
|
31 |
There are three kinds of lines:
|
31 |
There are three kinds of lines:
|
32 |
.RS
|
32 |
.RS
|
33 |
.IP \(bu
|
33 |
.IP \(bu
|
34 |
Comment or empty
|
34 |
Comment or empty
|
35 |
.IP \(bu
|
35 |
.IP \(bu
|
36 |
Parameter affectation
|
36 |
Parameter affectation
|
|
... |
|
... |
56 |
.TP
|
56 |
.TP
|
57 |
.BI "topdirs = "string
|
57 |
.BI "topdirs = "string
|
58 |
Space-separated list of files or
|
58 |
Space-separated list of files or
|
59 |
directories to recursively index. Default to ~ (indexes
|
59 |
directories to recursively index. Default to ~ (indexes
|
60 |
$HOME). You can use symbolic links in the list, they will be followed,
|
60 |
$HOME). You can use symbolic links in the list, they will be followed,
|
61 |
independantly of the value of the followLinks variable.
|
61 |
independently of the value of the followLinks variable.
|
62 |
.TP
|
62 |
.TP
|
63 |
.BI "skippedNames = "string
|
63 |
.BI "skippedNames = "string
|
64 |
Files and directories which should be ignored.
|
64 |
Files and directories which should be ignored.
|
65 |
White space separated list of wildcard patterns (simple ones, not paths,
|
65 |
White space separated list of wildcard patterns (simple ones, not paths,
|
66 |
must contain no / ), which will be tested against file and directory
|
66 |
must contain no / ), which will be tested against file and directory
|
67 |
names. The list in the default configuration does not exclude hidden
|
67 |
names. The list in the default configuration does not exclude hidden
|
68 |
directories (names beginning with a dot), which means that it may index
|
68 |
directories (names beginning with a dot), which means that it may index
|
69 |
quite a few things that you do not want. On the other hand, email user
|
69 |
quite a few things that you do not want. On the other hand, email user
|
70 |
agents like Thunderbird usually store messages in hidden directories, and
|
70 |
agents like Thunderbird usually store messages in hidden directories, and
|
71 |
you probably want this indexed. One possible solution is to have '.*' in
|
71 |
you probably want this indexed. One possible solution is to have '.*'
|
72 |
'skippedNames', and add things like '~/.thunderbird' '~/.evolution' to
|
72 |
in 'skippedNames', and add things like '~/.thunderbird' '~/.evolution'
|
73 |
'topdirs'. Not even the file names are indexed for patterns in this
|
73 |
to 'topdirs'. Not even the file names are indexed for patterns in this
|
74 |
list, see the 'noContentSuffixes' variable for an alternative approach
|
74 |
list, see the 'noContentSuffixes' variable for an alternative approach
|
75 |
which indexes the file names. Can be redefined for any
|
75 |
which indexes the file names. Can be redefined for any
|
76 |
subtree.
|
76 |
subtree.
|
77 |
.TP
|
77 |
.TP
|
78 |
.BI "noContentSuffixes = "string
|
78 |
.BI "noContentSuffixes = "string
|
|
... |
|
... |
90 |
Paths we should not go into. Space-separated list of
|
90 |
Paths we should not go into. Space-separated list of
|
91 |
wildcard expressions for filesystem paths. Can contain files and
|
91 |
wildcard expressions for filesystem paths. Can contain files and
|
92 |
directories. The database and configuration directories will
|
92 |
directories. The database and configuration directories will
|
93 |
automatically be added. The expressions are matched using 'fnmatch(3)'
|
93 |
automatically be added. The expressions are matched using 'fnmatch(3)'
|
94 |
with the FNM_PATHNAME flag set by default. This means that '/' characters
|
94 |
with the FNM_PATHNAME flag set by default. This means that '/' characters
|
95 |
must be matched explicitely. You can set 'skippedPathsFnmPathname' to 0
|
95 |
must be matched explicitly. You can set 'skippedPathsFnmPathname' to 0
|
96 |
to disable the use of FNM_PATHNAME (meaning that '/*/dir3' will match
|
96 |
to disable the use of FNM_PATHNAME (meaning that '/*/dir3' will
|
97 |
'/dir1/dir2/dir3'). The default value contains the usual mount point for
|
97 |
match '/dir1/dir2/dir3'). The default value contains the usual mount point
|
98 |
removable media to remind you that it is a bad idea to have Recoll work
|
98 |
for removable media to remind you that it is a bad idea to have Recoll work
|
99 |
on these (esp. with the monitor: media gets indexed on mount, all data
|
99 |
on these (esp. with the monitor: media gets indexed on mount, all data
|
100 |
gets erased on unmount). Explicitely adding '/media/xxx' to the topdirs
|
100 |
gets erased on unmount). Explicitly adding '/media/xxx' to the topdirs
|
101 |
will override this.
|
101 |
will override this.
|
102 |
.TP
|
102 |
.TP
|
103 |
.BI "skippedPathsFnmPathname = "bool
|
103 |
.BI "skippedPathsFnmPathname = "bool
|
104 |
Set to 0 to
|
104 |
Set to 0 to
|
105 |
override use of FNM_PATHNAME for matching skipped
|
105 |
override use of FNM_PATHNAME for matching skipped
|
106 |
paths.
|
106 |
paths.
|
107 |
.TP
|
107 |
.TP
|
108 |
.BI "daemSkippedPaths = "string
|
108 |
.BI "daemSkippedPaths = "string
|
109 |
skippedPaths equivalent specific to
|
109 |
skippedPaths equivalent specific to
|
110 |
real time indexing. This enables having parts of the tree
|
110 |
real time indexing. This enables having parts of the tree
|
111 |
which are initially indexed but not monitored. If daemSkippedPaths is
|
111 |
which are initially indexed but not monitored. If daemSkippedPaths is
|
|
... |
|
... |
113 |
.TP
|
113 |
.TP
|
114 |
.BI "zipSkippedNames = "string
|
114 |
.BI "zipSkippedNames = "string
|
115 |
Space-separated list of wildcard expressions for names that should
|
115 |
Space-separated list of wildcard expressions for names that should
|
116 |
be ignored inside zip archives. This is used directly by
|
116 |
be ignored inside zip archives. This is used directly by
|
117 |
the zip handler, and has a function similar to skippedNames, but works
|
117 |
the zip handler, and has a function similar to skippedNames, but works
|
118 |
independantly. Can be redefined for subdirectories. Supported by recoll
|
118 |
independently. Can be redefined for subdirectories. Supported by recoll
|
119 |
1.20 and newer. See
|
119 |
1.20 and newer. See
|
120 |
https://bitbucket.org/medoc/recoll/wiki/Filtering%20out%20Zip%20archive%20members
|
120 |
https://bitbucket.org/medoc/recoll/wiki/Filtering%20out%20Zip%20archive%20members
|
121 |
|
121 |
|
122 |
.TP
|
122 |
.TP
|
123 |
.BI "followLinks = "bool
|
123 |
.BI "followLinks = "bool
|
|
... |
|
... |
171 |
.BI "systemfilecommand = "string
|
171 |
.BI "systemfilecommand = "string
|
172 |
Command used to guess
|
172 |
Command used to guess
|
173 |
MIME types if the internal methods fails This should be a
|
173 |
MIME types if the internal methods fails This should be a
|
174 |
"file -i" workalike. The file path will be added as a last parameter to
|
174 |
"file -i" workalike. The file path will be added as a last parameter to
|
175 |
the command line. 'xdg-mime' works better than the traditional 'file'
|
175 |
the command line. 'xdg-mime' works better than the traditional 'file'
|
176 |
command, and is now the configured default (with a hard-coded fallback to
|
176 |
command, and is now the configured default (with a hard-coded fallback
|
177 |
'file')
|
177 |
to 'file')
|
178 |
.TP
|
178 |
.TP
|
179 |
.BI "processwebqueue = "bool
|
179 |
.BI "processwebqueue = "bool
|
180 |
Decide if we process the
|
180 |
Decide if we process the
|
181 |
Web queue. The queue is a directory where the Recoll Web
|
181 |
Web queue. The queue is a directory where the Recoll Web
|
182 |
browser plugins create the copies of visited pages.
|
182 |
browser plugins create the copies of visited pages.
|
|
... |
|
... |
213 |
documents with huge amounts of numbers in them, where setting nonumbers
|
213 |
documents with huge amounts of numbers in them, where setting nonumbers
|
214 |
will reduce the index size. This can only be set for a whole index, not
|
214 |
will reduce the index size. This can only be set for a whole index, not
|
215 |
for a subtree.
|
215 |
for a subtree.
|
216 |
.TP
|
216 |
.TP
|
217 |
.BI "dehyphenate = "bool
|
217 |
.BI "dehyphenate = "bool
|
218 |
Determines if we index
|
218 |
Determines if we index 'coworker' also when the input is 'co-worker'.
|
219 |
'coworker' also when the input is 'co-worker'. This is new
|
|
|
220 |
in version 1.22, and on by default. Setting the variable to off allows
|
219 |
This is new in version 1.22, and on by default. Setting the variable to off
|
221 |
restoring the previous behaviour.
|
220 |
allows restoring the previous behaviour.
|
222 |
.TP
|
221 |
.TP
|
223 |
.BI "nocjk = "bool
|
222 |
.BI "nocjk = "bool
|
224 |
Decides if specific East Asian
|
223 |
Decides if specific East Asian
|
225 |
(Chinese Korean Japanese) characters/word splitting is turned
|
224 |
(Chinese Korean Japanese) characters/word splitting is turned
|
226 |
off. This will save a small amount of CPU if you have no CJK
|
225 |
off. This will save a small amount of CPU if you have no CJK
|
|
... |
|
... |
311 |
conversion to metadata fields. This probably needs to be
|
310 |
conversion to metadata fields. This probably needs to be
|
312 |
set if testmodifusemtime is set.
|
311 |
set if testmodifusemtime is set.
|
313 |
.TP
|
312 |
.TP
|
314 |
.BI "metadatacmds = "string
|
313 |
.BI "metadatacmds = "string
|
315 |
Define commands to
|
314 |
Define commands to
|
316 |
gather external metadata, e.g. tmsu tags.
|
315 |
gather external metadata, e.g. tmsu tags.
|
317 |
There can be several entries, separated by semi-colons, each defining
|
316 |
There can be several entries, separated by semi-colons, each defining
|
318 |
which field name the data goes into and the command to use. Don't forget the
|
317 |
which field name the data goes into and the command to use. Don't forget the
|
319 |
initial semi-colon. All the field names must be different. You can use
|
318 |
initial semi-colon. All the field names must be different. You can use
|
320 |
aliases in the "field" file if necessary.
|
319 |
aliases in the "field" file if necessary.
|
321 |
As a not too pretty hack conceded to convenience, any field name
|
320 |
As a not too pretty hack conceded to convenience, any field name
|
|
... |
|
... |
453 |
only errors and warnings. 3 will print information like document updates,
|
452 |
only errors and warnings. 3 will print information like document updates,
|
454 |
4 is quite verbose and 6 very verbose.
|
453 |
4 is quite verbose and 6 very verbose.
|
455 |
.TP
|
454 |
.TP
|
456 |
.BI "logfilename = "fn
|
455 |
.BI "logfilename = "fn
|
457 |
Log file destination. Use 'stderr' (default) to write to the
|
456 |
Log file destination. Use 'stderr' (default) to write to the
|
458 |
console.
|
457 |
console.
|
459 |
.TP
|
458 |
.TP
|
460 |
.BI "idxloglevel = "int
|
459 |
.BI "idxloglevel = "int
|
461 |
Override loglevel for the indexer.
|
460 |
Override loglevel for the indexer.
|
462 |
.TP
|
461 |
.TP
|
463 |
.BI "idxlogfilename = "fn
|
462 |
.BI "idxlogfilename = "fn
|
464 |
Override logfilename for the indexer.
|
463 |
Override logfilename for the indexer.
|
465 |
.TP
|
464 |
.TP
|
466 |
.BI "daemloglevel = "int
|
465 |
.BI "daemloglevel = "int
|
467 |
Override loglevel for the indexer in real time
|
466 |
Override loglevel for the indexer in real time
|
468 |
mode. The default is to use the idx... values if set, else
|
467 |
mode. The default is to use the idx... values if set, else
|
469 |
the log... values.
|
468 |
the log... values.
|
|
... |
|
... |
613 |
.BI "mhmboxquirks = "string
|
612 |
.BI "mhmboxquirks = "string
|
614 |
Enable thunderbird/mozilla-seamonkey mbox format quirks Set this for the directory where the email mbox files are
|
613 |
Enable thunderbird/mozilla-seamonkey mbox format quirks Set this for the directory where the email mbox files are
|
615 |
stored.
|
614 |
stored.
|
616 |
|
615 |
|
617 |
.SH SEE ALSO
|
616 |
.SH SEE ALSO
|
618 |
.PP
|
617 |
.PP
|
619 |
recollindex(1) recoll(1)
|
618 |
recollindex(1) recoll(1)
|