|
a/src/INSTALL |
|
b/src/INSTALL |
|
... |
|
... |
331 |
|
331 |
|
332 |
Recoll indexing options are set inside text configuration files located in
|
332 |
Recoll indexing options are set inside text configuration files located in
|
333 |
a configuration directory. There can be several such directories, each of
|
333 |
a configuration directory. There can be several such directories, each of
|
334 |
which defines the parameters for one index.
|
334 |
which defines the parameters for one index.
|
335 |
|
335 |
|
336 |
The configuration files can be edited by hand or through the Indexing
|
336 |
The configuration files can be edited by hand or through the Index
|
337 |
configuration dialog (Preferences menu). The GUI tool will try to respect
|
337 |
configuration dialog (Preferences menu). The GUI tool will try to respect
|
338 |
your formatting and comments as much as possible, so it is quite possible
|
338 |
your formatting and comments as much as possible, so it is quite possible
|
339 |
to use both ways.
|
339 |
to use both ways.
|
340 |
|
340 |
|
341 |
The most accurate documentation for the configuration parameters is given
|
341 |
The most accurate documentation for the configuration parameters is given
|
|
... |
|
... |
524 |
want to index very big text files as it will both reduce memory
|
524 |
want to index very big text files as it will both reduce memory
|
525 |
usage at index time and help with loading data to the preview
|
525 |
usage at index time and help with loading data to the preview
|
526 |
window. A size of a few megabytes would seem reasonable (default:
|
526 |
window. A size of a few megabytes would seem reasonable (default:
|
527 |
1MB).
|
527 |
1MB).
|
528 |
|
528 |
|
|
|
529 |
membermaxkbs
|
|
|
530 |
|
|
|
531 |
This defines the maximum size in kilobytes for an archive member
|
|
|
532 |
(zip, tar or rar at the moment). Bigger entries will be skipped.
|
|
|
533 |
|
529 |
indexallfilenames
|
534 |
indexallfilenames
|
530 |
|
535 |
|
531 |
Recoll indexes file names in a special section of the database to
|
536 |
Recoll indexes file names in a special section of the database to
|
532 |
allow specific file name searches using wildcards. This
|
537 |
allow specific file name searches using wildcards. This
|
533 |
parameter decides if file name indexing is performed only for
|
538 |
parameter decides if file name indexing is performed only for
|
|
... |
|
... |
559 |
|
564 |
|
560 |
Changing some of these parameters will imply a full reindex. Also, when
|
565 |
Changing some of these parameters will imply a full reindex. Also, when
|
561 |
using multiple indexes, it may not make sense to search indexes that don't
|
566 |
using multiple indexes, it may not make sense to search indexes that don't
|
562 |
share the values for these parameters, because they usually affect both
|
567 |
share the values for these parameters, because they usually affect both
|
563 |
search and index operations.
|
568 |
search and index operations.
|
|
|
569 |
|
|
|
570 |
indexStripChars
|
|
|
571 |
|
|
|
572 |
Decide if we strip characters of diacritics and convert them to
|
|
|
573 |
lower-case before terms are indexed. If we don't, searches
|
|
|
574 |
sensitive to case and diacritics can be performed, but the index
|
|
|
575 |
will be bigger, and some marginal weirdness may sometimes occur.
|
|
|
576 |
The default is a stripped index (indexStripChars = 1) for now.
|
|
|
577 |
When using multiple indexes for a search, this parameter must be
|
|
|
578 |
defined identically for all. Changing the value implies an index
|
|
|
579 |
reset.
|
|
|
580 |
|
|
|
581 |
maxTermExpand
|
|
|
582 |
|
|
|
583 |
Maximum expansion count for a single term (e.g.: when using
|
|
|
584 |
wildcards). The default of 10000 is reasonable and will avoid
|
|
|
585 |
queries that appear frozen while the engine is walking the term
|
|
|
586 |
list.
|
|
|
587 |
|
|
|
588 |
maxXapianClauses
|
|
|
589 |
|
|
|
590 |
Maximum number of elementary clauses we can add to a single Xapian
|
|
|
591 |
query. In some cases, the result of term expansion can be
|
|
|
592 |
multiplicative, and we want to avoid using excessive memory. The
|
|
|
593 |
default of 100 000 should be both high enough in most cases and
|
|
|
594 |
compatible with current typical hardware configurations.
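
As an illustration, the three parameters above could be written out
explicitly in the configuration file. This is a minimal sketch that only
restates the defaults quoted in the descriptions; the "name = value"
layout is assumed from the indexStripChars = 1 notation used above.

```
# Stripped index: case and diacritics are ignored (stated default)
indexStripChars = 1
# Upper bound on the expansion of a single term, e.g. a wildcard
maxTermExpand = 10000
# Upper bound on the elementary clauses in one Xapian query
maxXapianClauses = 100000
```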
|
564 |
|
595 |
|
565 |
nonumbers
|
596 |
nonumbers
|
566 |
|
597 |
|
567 |
If this is set to true, no terms will be generated for numbers. For
|
598 |
If this is set to true, no terms will be generated for numbers. For
|
568 |
example "123", "1.5e6", 192.168.1.4, would not be indexed
|
599 |
example "123", "1.5e6", 192.168.1.4, would not be indexed
|
|
... |
|
... |
697 |
documents, which gives little memory usage control, as memory
|
728 |
documents, which gives little memory usage control, as memory
|
698 |
usage depends on average document size. The default value is 10.
|
729 |
usage depends on average document size. The default value is 10.
|
699 |
|
730 |
|
700 |
5.4.1.4. Miscellaneous parameters:
|
731 |
5.4.1.4. Miscellaneous parameters:
|
701 |
|
732 |
|
|
|
733 |
autodiacsens
|
|
|
734 |
|
|
|
735 |
If the index is not stripped, decide if we automatically trigger
|
|
|
736 |
diacritics sensitivity if the search term has accented characters
|
|
|
737 |
(not in unac_except_trans). Else you need to use the query
|
|
|
738 |
language and the D modifier to specify diacritics sensitivity.
|
|
|
739 |
Default is no.
|
|
|
740 |
|
|
|
741 |
autocasesens
|
|
|
742 |
|
|
|
743 |
If the index is not stripped, decide if we automatically trigger
|
|
|
744 |
character case sensitivity if the search term has upper-case
|
|
|
745 |
characters in any but the first position. Else you need to use the
|
|
|
746 |
query language and the C modifier to specify character-case
|
|
|
747 |
sensitivity. Default is yes.
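
A possible combination of these settings is sketched below. It assumes
a raw (unstripped) index and that boolean values are written as 0 and 1,
as in the indexStripChars = 1 notation; the values given for the two
auto* parameters are the defaults stated above.

```
# Keep case and diacritics in the index (changing this resets the index)
indexStripChars = 0
# Accented search terms do not trigger diacritics sensitivity (default)
autodiacsens = 0
# Upper-case after the first position triggers case sensitivity (default)
autocasesens = 1
```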
|
|
|
748 |
|
702 |
loglevel,daemloglevel
|
749 |
loglevel,daemloglevel
|
703 |
|
750 |
|
704 |
Verbosity level for recoll and recollindex. A value of 4 lists
|
751 |
Verbosity level for recoll and recollindex. A value of 4 lists
|
705 |
quite a lot of debug/information messages. 2 only lists errors.
|
752 |
quite a lot of debug/information messages. 2 only lists errors.
|
706 |
The daemloglevel value is specific to the indexing monitor daemon.
|
753 |
The daemloglevel value is specific to the indexing monitor daemon.
|
|
... |
|
... |
734 |
monauxinterval
|
781 |
monauxinterval
|
735 |
|
782 |
|
736 |
Period (in seconds) at which the real time monitor will regenerate
|
783 |
Period (in seconds) at which the real time monitor will regenerate
|
737 |
the auxiliary databases (spelling, stemming) if needed. The
|
784 |
the auxiliary databases (spelling, stemming) if needed. The
|
738 |
default is one hour.
|
785 |
default is one hour.
|
|
|
786 |
|
|
|
787 |
monioniceclass, monioniceclassdata
|
|
|
788 |
|
|
|
789 |
These allow defining the ionice class and data used by the indexer
|
|
|
790 |
(default class 3, no data).
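
For example, the stated default could be written out as below. This is
only a sketch: class 3 is the idle class of the Linux ionice command, and
leaving monioniceclassdata unset is assumed to correspond to "no data".

```
# Idle I/O scheduling class for the indexer (stated default)
monioniceclass = 3
```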
|
739 |
|
791 |
|
740 |
filtermaxseconds
|
792 |
filtermaxseconds
|
741 |
|
793 |
|
742 |
Maximum filter execution time, after which it is aborted. Some
|
794 |
Maximum filter execution time, after which it is aborted. Some
|
743 |
postscript programs just loop...
|
795 |
postscript programs just loop...
|
|
... |
|
... |
779 |
|
831 |
|
780 |
If this is set, the aspell dictionary generation is turned off.
|
832 |
If this is set, the aspell dictionary generation is turned off.
|
781 |
Useful for cases where you don't need the functionality or when it
|
833 |
Useful for cases where you don't need the functionality or when it
|
782 |
is unusable because aspell crashes during dictionary generation.
|
834 |
is unusable because aspell crashes during dictionary generation.
|
783 |
|
835 |
|
|
|
836 |
mhmboxquirks
|
|
|
837 |
|
|
|
838 |
This allows defining location-related quirks for the mailbox
|
|
|
839 |
handler. Currently only the tbird flag is defined, and it should
|
|
|
840 |
be set for directories which hold Thunderbird data, as their
|
|
|
841 |
folder format is weird.
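
A minimal sketch of such a setting follows. It assumes that the
per-directory section syntax of the main configuration file applies
here; the path is hypothetical and must be adapted to the actual
location of the Thunderbird data.

```
# Hypothetical location of a Thunderbird profile directory
[/home/me/.thunderbird]
mhmboxquirks = tbird
```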
|
|
|
842 |
|
784 |
5.4.2. The fields file
|
843 |
5.4.2. The fields file
|
785 |
|
844 |
|
786 |
This file contains information about dynamic fields handling in Recoll.
|
845 |
This file contains information about dynamic fields handling in Recoll.
|
787 |
Some very basic fields have hard-wired behaviour, and, mostly, you should
|
846 |
Some very basic fields have hard-wired behaviour, and, mostly, you should
|
788 |
not change the original data inside the fields file. But you can create
|
847 |
not change the original data inside the fields file. But you can create
|
|
... |
|
... |
883 |
link in a result list. E.g.: HTML is normally displayed using firefox, but
|
942 |
link in a result list. E.g.: HTML is normally displayed using firefox, but
|
884 |
you may prefer Konqueror, your openoffice.org program might be named
|
943 |
you may prefer Konqueror, your openoffice.org program might be named
|
885 |
oofice instead of openoffice etc.
|
944 |
oofice instead of openoffice etc.
|
886 |
|
945 |
|
887 |
Changes to this file can be done by direct editing, or through the recoll
|
946 |
Changes to this file can be done by direct editing, or through the recoll
|
888 |
user preferences dialog.
|
947 |
GUI preferences dialog.
|
889 |
|
948 |
|
890 |
If "Use desktop preferences to choose document editor" is checked in the
|
949 |
If "Use desktop preferences to choose document editor" is checked in the
|
891 |
Recoll GUI user preferences, all mimeview entries will be ignored except
|
950 |
Recoll GUI preferences, all mimeview entries will be ignored except the
|
892 |
the one labelled application/x-all (which is set to use xdg-open by
|
951 |
one labelled application/x-all (which is set to use xdg-open by default).
|
893 |
default).
|
952 |
|
|
|
953 |
In this case, the xallexcepts top level variable defines a list of mime
|
|
|
954 |
type exceptions which will be processed according to the local entries
|
|
|
955 |
instead of being passed to the desktop. This is so that specific Recoll
|
|
|
956 |
options such as a page number or a search string can be passed to
|
|
|
957 |
applications that support them, such as the evince viewer.
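
As a sketch, an exception list could look like the line below, placed at
the top level of mimeview (before any section). The space-separated list
format and the choice of mime types are assumptions made for
illustration, not the shipped default.

```
# Hypothetical exception list: these types keep using their local entries
xallexcepts = application/pdf application/postscript application/x-dvi
```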
|
894 |
|
958 |
|
895 |
As for the other configuration files, the normal usage is to have a
|
959 |
As for the other configuration files, the normal usage is to have a
|
896 |
mimeview inside your own configuration directory, with just the
|
960 |
mimeview inside your own configuration directory, with just the
|
897 |
non-default entries, which will override those from the central
|
961 |
non-default entries, which will override those from the central
|
898 |
configuration file.
|
962 |
configuration file.
|
899 |
|
963 |
|
900 |
Please note that these entries must be placed under a [view] section.
|
964 |
All viewer definition entries must be placed under a [view] section.
|
901 |
|
965 |
|
902 |
The keys in the file are normally mime types. You can add an application
|
966 |
The keys in the file are normally mime types. You can add an application
|
903 |
tag to specialize the choice for an area of the filesystem (using a
|
967 |
tag to specialize the choice for an area of the filesystem (using a
|
904 |
localfields specification in mimeconf). The syntax for the key is
|
968 |
localfields specification in mimeconf). The syntax for the key is
|
905 |
mimetype|tag
|
969 |
mimetype|tag
|
|
... |
|
... |
924 |
on the container type. If this appears in the command line, Recoll
|
988 |
on the container type. If this appears in the command line, Recoll
|
925 |
will not create a temporary file to extract the subdocument, expecting
|
989 |
will not create a temporary file to extract the subdocument, expecting
|
926 |
the called application (possibly a script) to be able to handle it.
|
990 |
the called application (possibly a script) to be able to handle it.
|
927 |
|
991 |
|
928 |
* %M. Mime type
|
992 |
* %M. Mime type
|
|
|
993 |
|
|
|
994 |
* %p. Page index. Only significant for a subset of document types,
|
|
|
995 |
currently only PDF, Postscript and DVI files. Can be used to start the
|
|
|
996 |
editor at the right page for a match or snippet.
|
|
|
997 |
|
|
|
998 |
* %s. Search term. The value will only be set for documents with indexed
|
|
|
999 |
page numbers (e.g.: PDF). The value will be one of the matched search
|
|
|
1000 |
terms. It would allow pre-setting the value in the "Find" entry inside
|
|
|
1001 |
Evince for example, for easy highlighting of the term.
|
929 |
|
1002 |
|
930 |
* %U, %u. Url.
|
1003 |
* %U, %u. Url.
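
The substitutions above could be combined in a viewer entry such as the
following sketch. It assumes that %f stands for the file name and relies
on the --page-index and --find command line options of evince; treat the
exact command as an illustration rather than as the shipped default.

```
[view]
# Open the PDF at the matching page, with the term preset in "Find"
application/pdf = evince --page-index=%p --find=%s %f
```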
|
931 |
|
1004 |
|
932 |
In addition to the predefined values above, all strings like %(fieldname)
|
1005 |
In addition to the predefined values above, all strings like %(fieldname)
|
933 |
will be replaced by the value of the field named fieldname for the
|
1006 |
will be replaced by the value of the field named fieldname for the
|