a/src/INSTALL b/src/INSTALL
                               Recoll user manual
                            Chapter 4. Installation

   --------------------------------------------------------------------------

                          4.4. Configuration overview

   There are two sets of configuration files. The system-wide files are kept
   in a directory such as /usr/[local/]share/recoll/examples and define
   default values for the system. A parallel set of files exists by default
   in the .recoll directory in your home directory. This directory can be
   changed with the RECOLL_CONFDIR environment variable or the -c option to
   recoll and recollindex.
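
   The choice of the personal configuration directory can be sketched as
   follows. This is only an illustration, not Recoll code: the function name
   resolve_confdir is hypothetical, and the precedence between the -c option
   and RECOLL_CONFDIR is an assumption, since the text above does not state
   which one wins.

```python
import os
from pathlib import Path
from typing import Mapping, Optional


def resolve_confdir(cli_confdir: Optional[str] = None,
                    environ: Optional[Mapping[str, str]] = None,
                    home: Optional[str] = None) -> Path:
    """Return the personal configuration directory (hypothetical helper).

    Assumed precedence (not stated in the manual): the -c value first,
    then RECOLL_CONFDIR, then the default .recoll in the home directory.
    """
    env = os.environ if environ is None else environ
    if cli_confdir:
        return Path(cli_confdir)
    if env.get("RECOLL_CONFDIR"):
        return Path(env["RECOLL_CONFDIR"])
    base = Path(home) if home is not None else Path.home()
    return base / ".recoll"
```

   For example, resolve_confdir(None, {}, home="/home/me") yields
   /home/me/.recoll.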

   If the .recoll directory does not exist when recoll or recollindex is
   started, it will be created with a set of empty configuration files.
   recoll will give you a chance to edit the configuration file before
   starting indexing. recollindex will proceed immediately.

   Most of the parameters specific to the recoll GUI are set through the
   Preferences menu and stored in the standard Qt location
   ($HOME/.qt/recollrc). You probably do not want to edit this file by hand.

   For other options, Recoll uses text configuration files. You will have to
   edit them by hand for now (there is still some hope for a GUI
   configuration tool in the future). The most accurate documentation for the
   configuration parameters is given by the comments inside the default
   files, so only a general overview is given here.

   All configuration files share the same format. For example, a short
   extract of the main configuration file might look as follows:

         # Space-separated list of directories to index.
         topdirs =  ~/docs /usr/share/doc

         [~/somedirectory-with-utf8-txt-files]
         defaultcharset = utf-8

   There are three kinds of lines:

     * Comment (starts with #) or empty.

     * Parameter assignment (name = value).

     * Section definition ([somedirname]).

   Section lines allow redefining some parameters for a directory subtree.
   Some of the parameters used for indexing are looked up hierarchically,
   from the most specific section to the least specific one. Not all
   parameters can be meaningfully redefined; whether a parameter can be is
   specified for each one in the next section.
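
   The hierarchical lookup described above can be pictured with a small
   sketch. It assumes the configuration has already been read into a
   dictionary that maps section names (absolute directory paths, with ""
   standing for the top level) to their parameters. The names Config and
   lookup are hypothetical, and this is not how Recoll itself is
   implemented.

```python
from pathlib import PurePosixPath
from typing import Dict, Optional

# Section name ("" means the top level of the file) -> {parameter: value}.
Config = Dict[str, Dict[str, str]]


def lookup(config: Config, name: str, path: str) -> Optional[str]:
    """Return the value of `name` that applies to `path`.

    The section for the path itself is tried first, then the section of
    each parent directory (most specific first), then the top level.
    """
    current = PurePosixPath(path)
    for directory in [current, *current.parents]:
        section = config.get(str(directory))
        if section is not None and name in section:
            return section[name]
    return config.get("", {}).get(name)


# Illustrative data only.
example = {
    "": {"defaultcharset": "iso8859-1", "loglevel": "2"},
    "/home/me/utf8": {"defaultcharset": "utf-8"},
}
```

   With this data, a file under /home/me/utf8 gets utf-8 as its
   defaultcharset, while loglevel still comes from the top level.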

   The tilde character (~) is expanded in file names to the name of the
   user's home directory.

   White space is used for separation inside lists. Elements with embedded
   spaces can be quoted using double-quotes.
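
   As an illustration of the format rules above (three kinds of lines, tilde
   expansion, quoted list elements), here is a minimal reader. It is a
   sketch under stated assumptions and not the Recoll parser: the function
   names are hypothetical, lines that fit none of the three kinds are simply
   skipped, and Python's shlex is used for list splitting, which also
   honours single quotes and backslashes (the manual only mentions
   double-quotes).

```python
import shlex
from typing import Dict, List


def expand_tilde(name: str, home: str) -> str:
    """Replace a leading ~ with the given home directory."""
    if name == "~" or name.startswith("~/"):
        return home + name[1:]
    return name


def parse_config(text: str, home: str) -> Dict[str, Dict[str, str]]:
    """Read comment, 'name = value' and '[section]' lines.

    Returns {section: {name: value}}; "" is the top-level section.
    """
    config: Dict[str, Dict[str, str]] = {"": {}}
    section = ""
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue  # comment or empty line
        if line.startswith("[") and line.endswith("]"):
            section = expand_tilde(line[1:-1].strip(), home)
            config.setdefault(section, {})
            continue
        name, sep, value = line.partition("=")
        if not sep:
            continue  # assumption: unrecognised lines are ignored
        config[section][name.strip()] = value.strip()
    return config


def split_list(value: str, home: str) -> List[str]:
    """Split a white-space separated list, honouring double-quotes."""
    return [expand_tilde(item, home) for item in shlex.split(value)]


sample = """
# Space-separated list of directories to index.
topdirs =  ~/docs /usr/share/doc

[~/somedirectory-with-utf8-txt-files]
defaultcharset = utf-8
"""
parsed = parse_config(sample, "/home/me")
```

   Here parsed[""]["topdirs"] holds the raw list, and split_list turns it
   into individual directory names with the tilde expanded.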

4.4.1. Main configuration file

   recoll.conf is the main configuration file. It defines things like what to
   index (top directories and things to ignore), and the default character
   set to use for document types which do not specify it internally.

   The default configuration will index your home directory. If this is not
   appropriate, start recoll to create a blank configuration, click Cancel,
   and edit the configuration file before restarting the command.
   Restarting will begin the initial indexing, which may take some time.

   Parameters:

   topdirs

           Specifies the list of directories or files to index (recursively
           for directories). The indexer will not follow symbolic links
           inside the indexed trees. If an entry in the topdirs list is a
           symbolic link, indexing will not start and an error will be
           reported.

   dbdir

           The name of the Xapian data directory. It will be created if
           needed when the index is initialized. If this is not an absolute
           path, it will be interpreted relative to the configuration
           directory.

   skippedNames

           A space-separated list of patterns for names of files or
           directories that should be completely ignored. The list defined in
           the default file is:

           *~ #* bin CVS  Cache caughtspam  tmp

           The list can be redefined for subdirectories, but the change only
           takes effect for the top-level directories listed in topdirs.

           The top-level directories themselves are not affected by this list
           (that is, a directory in topdirs might match and would still be
           indexed).

           The list in the default configuration does not exclude hidden
           directories (names beginning with a dot), which means that it may
           index quite a few things that you do not want. On the other hand,
           mail user agents like Thunderbird usually store messages in hidden
           directories, and you probably want these indexed. One possible
           solution is to have .* in skippedNames, and add things like
           ~/.thunderbird or ~/.evolution in topdirs.
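
           The effect of skippedNames can be approximated with the sketch
           below. It assumes that the patterns are shell-style wildcards
           matched against the bare file or directory name, which the text
           above suggests but does not spell out. The function names are
           hypothetical and this is not the indexer's actual code.

```python
import fnmatch
from typing import Sequence

# The default list quoted above, split on white space.
DEFAULT_SKIPPED = "*~ #* bin CVS Cache caughtspam tmp".split()


def is_skipped(name: str, patterns: Sequence[str] = DEFAULT_SKIPPED) -> bool:
    """True if the bare name matches one of the skippedNames patterns."""
    return any(fnmatch.fnmatchcase(name, pattern) for pattern in patterns)


def should_visit(name: str, is_topdir: bool,
                 patterns: Sequence[str] = DEFAULT_SKIPPED) -> bool:
    """Entries of topdirs are indexed even if their name matches."""
    return is_topdir or not is_skipped(name, patterns)


# Adding .* hides dot-directories unless they are listed in topdirs.
with_hidden = DEFAULT_SKIPPED + [".*"]
```

           With with_hidden, .thunderbird is skipped when met inside a tree
           but is still visited when it is itself an entry in topdirs.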

   loglevel

           Verbosity level for recoll and recollindex. A value of 4 lists
           quite a lot of debug/information messages. A value of 2 lists
           only errors.

   logfilename

           Where the messages should go. 'stderr' can be used as a special
           value, and is the default.

   filtersdir

           A directory to search for the external filter scripts used to
           index some types of files. The value should not be changed unless
           you want to modify one of the default scripts. The value can be
           redefined for any subdirectory.

   indexstemminglanguages

           A list of languages for which the stem expansion databases will be
           built. See recollindex(1) for possible values. You can add a stem
           expansion database for a different language by using recollindex
           -s, but it will be deleted during the next indexing. Only
           languages listed in the configuration file are permanent.

   defaultcharset

           The name of the character set used for files that do not contain a
           character set definition (e.g. plain text files). This can be
           redefined for any subdirectory. If it is not set at all, the
           character set used is the one defined by the NLS environment
           (LC_ALL, LC_CTYPE, LANG), or iso8859-1 if nothing is set.
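
           The fallback order can be sketched as follows. This is an
           illustration only: the function names are hypothetical, and the
           way the character set is extracted from the locale name (the part
           after the dot, before any @modifier) is an assumption about the
           usual language_TERRITORY.codeset form, as is falling back to
           iso8859-1 when the first variable that is set names no codeset.

```python
from typing import Mapping, Optional


def nls_charset(environ: Mapping[str, str]) -> str:
    """Character set from LC_ALL, LC_CTYPE, LANG, else iso8859-1."""
    for var in ("LC_ALL", "LC_CTYPE", "LANG"):
        value = environ.get(var, "")
        if not value:
            continue
        # Assumed form: language_TERRITORY.codeset@modifier
        codeset = value.partition(".")[2].partition("@")[0]
        # Assumption: the first variable that is set decides the result.
        return codeset or "iso8859-1"
    return "iso8859-1"


def effective_charset(defaultcharset: Optional[str],
                      environ: Mapping[str, str]) -> str:
    """defaultcharset wins when set; otherwise use the NLS environment."""
    return defaultcharset or nls_charset(environ)
```

           For instance, with no defaultcharset and LANG set to
           en_US.UTF-8, the sketch selects UTF-8.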

   guesscharset

           Decides whether to try to guess the character set of files when
           no internal value is available (e.g. for plain text files). This
           does not work well in general, and should probably not be used.

   usesystemfilecommand

           Decides whether the file -i system command is used as a final
           step for determining the mime type of a file (the main procedure
           uses suffix associations as defined in the mimemap file). This can
           be useful for files with suffixless names, but it will also cause
           the indexing of many bogus "text" files.

   indexallfilenames

           Recoll indexes file names in a special section of the database to
           allow file name searches using wildcards. This parameter decides
           whether file name indexing is performed only for files with mime
           types that would qualify them for full text indexing, or for all
           files inside the selected subtrees, independent of mime type.

   idxabsmlen

           Recoll stores an abstract for each indexed file inside the
           database. This is so that they can be displayed inside the result
           lists without decoding the original file. This parameter defines
           the size of the stored abstract (which can come from an actual
           section or just be the beginning of the text). The default value
           is 250.

   iconsdir

           The name of the directory where recoll result list icons are
           stored. You can change this if you want different images.

4.4.2. The mimemap file

   mimemap specifies the file name extension to mime type mappings.

   For file names without an extension, or with an unknown one, the system's
   file -i command will be executed to determine the mime type (this can be
   switched off inside the main configuration file).
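
   The two-step procedure (suffix table first, then file -i) might be
   sketched like this. It is not Recoll's implementation: the function
   names and the example table are hypothetical, and the parsing assumes
   that file -i prints something of the form "name: type/subtype;
   charset=...", which can differ between systems.

```python
import os
import subprocess
from typing import Callable, Mapping, Optional


def parse_file_output(output: str) -> Optional[str]:
    """Extract 'type/subtype' from an assumed 'name: type/subtype; ...'."""
    mime = output.strip().rpartition(": ")[2].split(";")[0].strip()
    return mime or None


def file_command_mime(path: str) -> Optional[str]:
    """Run the system's file -i command on the path."""
    result = subprocess.run(["file", "-i", path],
                            capture_output=True, text=True, check=False)
    return parse_file_output(result.stdout)


def guess_mime(path: str, mimemap: Mapping[str, str],
               use_file_command: bool = True,
               probe: Callable[[str], Optional[str]] = file_command_mime
               ) -> Optional[str]:
    """Suffix lookup first; file -i only as a fallback, if enabled."""
    suffix = os.path.splitext(path)[1]
    if suffix in mimemap:
        return mimemap[suffix]
    return probe(path) if use_file_command else None


# Illustrative table only; the real mappings live in the mimemap file.
example_map = {".txt": "text/plain", ".html": "text/html"}
```

   The probe argument exists only so that the fallback can be replaced in
   tests; use_file_command=False mirrors switching the fallback off in the
   main configuration file.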

   The mappings can be specified on a per-subtree basis, which may be useful
   in some cases. Example: gaim logs have a .txt extension but should be
   handled specially, which is possible because they are usually all located
   in one place.

   mimemap also has a recoll_noindex variable which is a list of suffixes.
   Matching files will be skipped (avoids unnecessary decompressions or file
   executions). This is partially redundant with skippedNames in the main
   configuration file, with two differences: it will not affect directories,
   and it can be changed for any subdirectory.

4.4.3. The mimeconf file

   mimeconf specifies how the different mime types are handled for indexing,
   and for display.

   Changing the indexing parameters is probably not a good idea unless you
   are a Recoll developer.

   You may want to adjust the external viewers defined in mimeconf (e.g.
   html is either previewed internally or displayed using firefox, but you
   may prefer mozilla, your openoffice.org program might be named ooffice
   instead of openoffice ...). Look for the [view] section.

   You can also change the icons which are displayed by recoll in the result
   lists. The values are the basenames of the png images inside the iconsdir
   directory (specified in recoll.conf).