|
a/src/INSTALL |
|
b/src/INSTALL |
|
... |
|
... |
182 |
|
182 |
|
183 |
--------------------------------------------------------------------------
|
183 |
--------------------------------------------------------------------------
|
184 |
|
184 |
|
185 |
Prev Home Next
|
185 |
Prev Home Next
|
186 |
Packages needed for external file types Up Configuration overview
|
186 |
Packages needed for external file types Up Configuration overview
|
|
|
187 |
Link: HOME
|
|
|
188 |
Link: UP
|
|
|
189 |
Link: PREVIOUS
|
|
|
190 |
|
|
|
191 |
Recoll user manual
|
|
|
192 |
Prev Chapter 4. Installation
|
|
|
193 |
|
|
|
194 |
--------------------------------------------------------------------------
|
|
|
195 |
|
|
|
196 |
4.4. Configuration overview
|
|
|
197 |
|
|
|
198 |
There are two sets of configuration files. The system-wide files are kept
|
|
|
199 |
in a directory named like /usr/[local/]share/recoll/examples; they define
|
|
|
200 |
default values for the system. A parallel set of files exists by default
|
|
|
201 |
in the .recoll directory in your home. This directory can be changed with
|
|
|
202 |
the RECOLL_CONFDIR environment variable or the -c option parameter to
|
|
|
203 |
recoll and recollindex.
|
|
|
204 |
|
|
|
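As an illustration of the lookup described above, here is a minimal Python sketch of how the configuration directory could be resolved. The function name `resolve_confdir` is hypothetical, and the precedence (an explicit -c value first, then RECOLL_CONFDIR, then ~/.recoll) is an assumption: the text names both mechanisms but does not say which one wins.

```python
import os


def resolve_confdir(cli_dir=None, environ=None):
    """Return the configuration directory a Recoll command would use.

    Assumption: a -c value takes priority over RECOLL_CONFDIR. The manual
    lists both ways of changing the directory but not their precedence.
    """
    environ = os.environ if environ is None else environ
    if cli_dir:
        return cli_dir
    if environ.get("RECOLL_CONFDIR"):
        return environ["RECOLL_CONFDIR"]
    # Documented default: the .recoll directory in the user's home.
    return os.path.join(os.path.expanduser("~"), ".recoll")
```

Passing the environment as a parameter keeps the sketch easy to try out without modifying the real environment.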
205 |
If the .recoll directory does not exist when recoll or recollindex is
|
|
|
206 |
started, it will be created with a set of empty configuration files.
|
|
|
207 |
recoll will give you a chance to edit the configuration file before
|
|
|
208 |
starting indexing. recollindex will proceed immediately.
|
|
|
209 |
|
|
|
210 |
Most of the parameters specific to the recoll GUI are set through the
|
|
|
211 |
Preferences menu and stored in the standard Qt place ($HOME/.qt/recollrc).
|
|
|
212 |
You probably do not want to edit this by hand.
|
|
|
213 |
|
|
|
214 |
For other options, Recoll uses text configuration files. You will have to
|
|
|
215 |
edit them by hand for now (there is still some hope for a GUI
|
|
|
216 |
configuration tool in the future). The most accurate documentation for the
|
|
|
217 |
configuration parameters is given by comments inside the default files,
|
|
|
218 |
and we will just give a general overview here.
|
|
|
219 |
|
|
|
220 |
All configuration files share the same format. For example, a short
|
|
|
221 |
extract of the main configuration file might look as follows:
|
|
|
222 |
|
|
|
223 |
# Space-separated list of directories to index.
|
|
|
224 |
topdirs = ~/docs /usr/share/doc
|
|
|
225 |
|
|
|
226 |
[~/somedirectory-with-utf8-txt-files]
|
|
|
227 |
defaultcharset = utf-8
|
|
|
228 |
|
|
|
229 |
|
|
|
230 |
There are three kinds of lines:
|
|
|
231 |
|
|
|
232 |
* Comment (starts with #) or empty.
|
|
|
233 |
|
|
|
234 |
* Parameter assignment (name = value).
|
|
|
235 |
|
|
|
236 |
* Section definition ([somedirname]).
|
|
|
237 |
|
|
|
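The three kinds of lines listed above can be told apart with a few string checks. The following Python sketch is illustrative only: `classify_line` is a hypothetical helper, not part of Recoll, and it ignores any details of the real parser that the text does not describe.

```python
def classify_line(line):
    """Classify one configuration line as comment, section or parameter.

    Returns a (kind, payload) tuple. Only the three line kinds described
    in the text are handled; this is not Recoll's actual parser.
    """
    stripped = line.strip()
    if not stripped or stripped.startswith("#"):
        return ("comment", None)
    if stripped.startswith("[") and stripped.endswith("]"):
        # Section definition: the payload is the directory name.
        return ("section", stripped[1:-1].strip())
    name, sep, value = stripped.partition("=")
    if not sep:
        raise ValueError("not a comment, section or assignment: %r" % line)
    # Parameter assignment: name = value.
    return ("param", (name.strip(), value.strip()))
```

Applied to the extract shown earlier, the `topdirs` line yields a parameter, and the bracketed line yields a section for that directory.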
238 |
Section lines allow redefining some parameters for a directory subtree.
|
|
|
239 |
Some of the parameters used for indexing are looked up hierarchically from
|
|
|
240 |
the more to the less specific. Not all parameters can be meaningfully
|
|
|
241 |
redefined; this is specified for each in the next section.
|
|
|
242 |
|
|
|
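One way to picture the "more specific to less specific" lookup is to walk from a file's directory up through its parents, and fall back to the top-level value when no section defines the parameter. This Python sketch assumes that section names have already been expanded to absolute paths. The name `lookup` and the dictionary layout are hypothetical.

```python
import posixpath


def lookup(name, path, sections, top_level):
    """Find the value of parameter `name` that applies to file `path`.

    sections: dict mapping an absolute directory to its parameter dict.
    top_level: dict of parameters set outside any section.
    """
    directory = posixpath.dirname(path)
    while True:
        params = sections.get(directory)
        if params is not None and name in params:
            return params[name]  # most specific enclosing section wins
        parent = posixpath.dirname(directory)
        if parent == directory:  # reached the root, no section matched
            break
        directory = parent
    return top_level.get(name)
```

With a section for /home/me/utf8 setting `defaultcharset`, a file anywhere below that directory gets the section value, and files elsewhere get the top-level one.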
243 |
The tilde character (~) is expanded in file names to the name of the
|
|
|
244 |
user's home directory.
|
|
|
245 |
|
|
|
246 |
White space is used for separation inside lists. Elements with embedded
|
|
|
247 |
spaces can be quoted using double-quotes.
|
|
|
248 |
|
|
|
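The list and tilde rules above can be approximated in Python with the standard `shlex` module. This is only a sketch: `split_list` is a hypothetical helper, and `shlex` accepts more quoting forms (single quotes, backslash escapes) than the text describes, so it is a superset of the documented behaviour rather than a faithful copy.

```python
import os
import shlex


def split_list(value, home=None):
    """Split a white-space separated list, honouring double quotes,
    then expand a leading ~ to the user's home directory."""
    home = os.path.expanduser("~") if home is None else home
    items = shlex.split(value)
    return [
        home + item[1:] if item == "~" or item.startswith("~/") else item
        for item in items
    ]
```

For example, `split_list('~/docs "/mnt/my files" /usr/share/doc', home="/home/me")` keeps the quoted element with its embedded space as a single entry.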
249 |
4.4.1. Main configuration file
|
|
|
250 |
|
|
|
251 |
recoll.conf is the main configuration file. It defines things like what to
|
|
|
252 |
index (top directories and things to ignore), and the default character
|
|
|
253 |
set to use for document types which do not specify it internally.
|
|
|
254 |
|
|
|
255 |
The default configuration will index your home directory. If this is not
|
|
|
256 |
appropriate, start recoll to create a blank configuration, click Cancel,
|
|
|
257 |
and edit the configuration file before restarting the command. This will
|
|
|
258 |
start the initial indexing, which may take some time.
|
|
|
259 |
|
|
|
260 |
Parameters:
|
|
|
261 |
|
|
|
262 |
topdirs
|
|
|
263 |
|
|
|
264 |
Specifies the list of directories or files to index (recursively
|
|
|
265 |
for directories). The indexer will not follow symbolic links
|
|
|
266 |
inside the indexed trees. If an entry in the topdirs list is a
|
|
|
267 |
symbolic link, indexing will not start and will generate an error.
|
|
|
268 |
|
|
|
269 |
dbdir
|
|
|
270 |
|
|
|
271 |
The name of the Xapian data directory. It will be created if
|
|
|
272 |
needed when the index is initialized. If this is not an absolute
|
|
|
273 |
path, it will be interpreted relative to the configuration
|
|
|
274 |
directory.
|
|
|
275 |
|
|
|
276 |
skippedNames
|
|
|
277 |
|
|
|
278 |
A space-separated list of patterns for names of files or
|
|
|
279 |
directories that should be completely ignored. The list defined in
|
|
|
280 |
the default file is:
|
|
|
281 |
|
|
|
282 |
*~ #* bin CVS Cache caughtspam tmp
|
|
|
283 |
|
|
|
284 |
The list can be redefined for subdirectories, but is only actually
|
|
|
285 |
changed for the top level ones in topdirs.
|
|
|
286 |
|
|
|
287 |
The top-level directories are not affected by this list (that is,
|
|
|
288 |
a directory in topdirs might match and would still be indexed).
|
|
|
289 |
|
|
|
290 |
The list in the default configuration does not exclude hidden
|
|
|
291 |
directories (names beginning with a dot), which means that it may
|
|
|
292 |
index quite a few things that you do not want. On the other hand,
|
|
|
293 |
mail user agents like thunderbird usually store messages in hidden
|
|
|
294 |
directories, and you probably want this indexed. One possible
|
|
|
295 |
solution is to have .* in skippedNames, and add things like
|
|
|
296 |
~/.thunderbird or ~/.evolution in topdirs.
|
|
|
297 |
|
|
|
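To get a feel for what the default skippedNames list excludes, and for the effect of adding .* to it, the following Python sketch matches names against the patterns. It assumes the patterns behave like shell-style wildcards, as implemented by Python's `fnmatch` module. The text does not spell out the matching rules, and `is_skipped` is a hypothetical helper.

```python
from fnmatch import fnmatchcase

# Default list quoted in the text above.
DEFAULT_SKIPPED = "*~ #* bin CVS Cache caughtspam tmp".split()


def is_skipped(name, patterns=DEFAULT_SKIPPED):
    """Return True if a file or directory name matches any pattern.

    Assumption: shell-style wildcard matching on the name alone.
    """
    return any(fnmatchcase(name, pattern) for pattern in patterns)
```

With the default list, a name such as .thunderbird is not skipped. With .* added to the list it is, which is why the text suggests listing such directories explicitly in topdirs.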
298 |
loglevel
|
|
|
299 |
|
|
|
300 |
Verbosity level for recoll and recollindex. A value of 4 lists
|
|
|
301 |
quite a lot of debug/information messages. 2 only lists errors.
|
|
|
302 |
|
|
|
303 |
logfilename
|
|
|
304 |
|
|
|
305 |
Where the messages should go. 'stderr' can be used as a special
|
|
|
306 |
value, and is the default.
|
|
|
307 |
|
|
|
308 |
filtersdir
|
|
|
309 |
|
|
|
310 |
A directory to search for the external filter scripts used to
|
|
|
311 |
index some types of files. The value should not be changed, except
|
|
|
312 |
if you want to modify one of the default scripts. The value can be
|
|
|
313 |
redefined for any subdirectory.
|
|
|
314 |
|
|
|
315 |
indexstemminglanguages
|
|
|
316 |
|
|
|
317 |
A list of languages for which the stem expansion databases will be
|
|
|
318 |
built. See recollindex(1) for possible values. You can add a stem
|
|
|
319 |
expansion database for a different language by using recollindex
|
|
|
320 |
-s, but it will be deleted during the next indexing. Only
|
|
|
321 |
languages listed in the configuration file are permanent.
|
|
|
322 |
|
|
|
323 |
defaultcharset
|
|
|
324 |
|
|
|
325 |
The name of the character set used for files that do not contain a
|
|
|
326 |
character set definition (e.g. plain text files). This can be
|
|
|
327 |
redefined for any subdirectory. If it is not set at all, the
|
|
|
328 |
character set used is the one defined by the nls environment
|
|
|
329 |
(LC_ALL, LC_CTYPE, LANG), or iso8859-1 if nothing is set.
|
|
|
330 |
|
|
|
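The fallback order for defaultcharset can be sketched in Python as follows. The function name `effective_charset` is hypothetical. Taking the character set from the part of the locale value after the dot (for example UTF-8 in en_US.UTF-8), and skipping locale values that carry no such part, are assumptions about how the nls environment is interpreted.

```python
import os


def effective_charset(configured=None, environ=None):
    """Pick the character set for files that do not declare one."""
    environ = os.environ if environ is None else environ
    if configured:
        return configured  # defaultcharset from the configuration
    for var in ("LC_ALL", "LC_CTYPE", "LANG"):
        value = environ.get(var, "")
        if "." in value:
            # e.g. fr_FR.ISO-8859-15@euro -> ISO-8859-15
            return value.split(".", 1)[1].split("@", 1)[0]
    return "iso8859-1"  # documented last resort
```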
331 |
guesscharset
|
|
|
332 |
|
|
|
333 |
Decide if we try to guess the character set of files if no
|
|
|
334 |
internal value is available (e.g. for plain text files). This does
|
|
|
335 |
not work well in general, and should probably not be used.
|
|
|
336 |
|
|
|
337 |
usesystemfilecommand
|
|
|
338 |
|
|
|
339 |
Decide if we use the file -i system command as a final step for
|
|
|
340 |
determining the mime type for a file (the main procedure uses
|
|
|
341 |
suffix associations as defined in the mimemap file). This can be
|
|
|
342 |
useful for files with suffixless names, but it will also cause the
|
|
|
343 |
indexing of many bogus "text" files.
|
|
|
344 |
|
|
|
345 |
indexallfilenames
|
|
|
346 |
|
|
|
347 |
Recoll indexes file names in a special section of the database to
|
|
|
348 |
allow specific file name searches using wildcards. This
|
|
|
349 |
parameter decides if file name indexing is performed only for
|
|
|
350 |
files with mime types that would qualify them for full text
|
|
|
351 |
indexing, or for all files inside the selected subtrees,
|
|
|
352 |
independent of mime type.
|
|
|
353 |
|
|
|
354 |
idxabsmlen
|
|
|
355 |
|
|
|
356 |
Recoll stores an abstract for each indexed file inside the
|
|
|
357 |
database. This is so that they can be displayed inside the result
|
|
|
358 |
lists without decoding the original file. This parameter defines
|
|
|
359 |
the size of the stored abstract (which can come from an actual
|
|
|
360 |
section or just be the beginning of the text). The default value
|
|
|
361 |
is 250.
|
|
|
362 |
|
|
|
363 |
iconsdir
|
|
|
364 |
|
|
|
365 |
The name of the directory where recoll result list icons are
|
|
|
366 |
stored. You can change this if you want different images.
|
|
|
367 |
|
|
|
368 |
4.4.2. The mimemap file
|
|
|
369 |
|
|
|
370 |
mimemap specifies the file name extension to mime type mappings.
|
|
|
371 |
|
|
|
372 |
For file names without an extension, or with an unknown one, the system's
|
|
|
373 |
file -i command will be executed to determine the mime type (this can be
|
|
|
374 |
switched off inside the main configuration file).
|
|
|
375 |
|
|
|
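The two-step procedure described above (suffix lookup first, file -i as a last resort) could look like this in Python. Everything here is a sketch: `mime_type`, `file_command_type` and the `suffix_map` dictionary are hypothetical and do not reflect the actual mimemap syntax. The -b flag (brief output, no file name prefix) is an addition made here to simplify parsing, and option spelling for the file command may differ between systems.

```python
import os
import subprocess


def file_command_type(path):
    """Ask the system's file command for a mime type.

    -i is the option named in the text; -b is added in this sketch.
    """
    result = subprocess.run(
        ["file", "-b", "-i", path],
        capture_output=True, text=True, check=True,
    )
    # Output looks like "text/plain; charset=us-ascii".
    return result.stdout.split(";", 1)[0].strip()


def mime_type(path, suffix_map, use_file_command=True, probe=file_command_type):
    """Suffix association first, then the file command if allowed."""
    suffix = os.path.splitext(path)[1]
    if suffix in suffix_map:
        return suffix_map[suffix]
    if use_file_command:
        return probe(path)
    return None
```

The `use_file_command` flag plays the role of the switch in the main configuration file, and `probe` can be replaced by a stub when the file command is not available.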
376 |
The mappings can be specified on a per-subtree basis, which may be useful
|
|
|
377 |
in some cases. Example: gaim logs have a .txt extension but should be
|
|
|
378 |
handled specially, which is possible because they are usually all located
|
|
|
379 |
in one place.
|
|
|
380 |
|
|
|
381 |
mimemap also has a recoll_noindex variable which is a list of suffixes.
|
|
|
382 |
Matching files will be skipped (avoids unnecessary decompressions or file
|
|
|
383 |
executions). This is partially redundant with skippedNames in the main
|
|
|
384 |
configuration file, with two differences: it will not affect directories,
|
|
|
385 |
and it can be changed for any subdirectory.
|
|
|
386 |
|
|
|
387 |
4.4.3. The mimeconf file
|
|
|
388 |
|
|
|
389 |
mimeconf specifies how the different mime types are handled for indexing,
|
|
|
390 |
and for display.
|
|
|
391 |
|
|
|
392 |
Changing the indexing parameters is probably not a good idea except if you
|
|
|
393 |
are a Recoll developer.
|
|
|
394 |
|
|
|
395 |
You may want to adjust the external viewers defined in it (e.g. html is either
|
|
|
396 |
previewed internally or displayed using firefox, but you may prefer
|
|
|
397 |
mozilla, your openoffice.org program might be named ooffice instead of
|
|
|
398 |
openoffice ...). Look for the [view] section.
|
|
|
399 |
|
|
|
400 |
You can also change the icons which are displayed by recoll in the result
|
|
|
401 |
lists (the values are the basenames of the png images inside the iconsdir
|
|
|
402 |
directory specified in recoll.conf).
|
|
|
403 |
|
|
|
404 |
--------------------------------------------------------------------------
|
|
|
405 |
|
|
|
406 |
Prev Home
|
|
|
407 |
Building from source Up
|