None
closed
nobody
None
2018-09-03
2018-07-30
Anonymous
No

Hi,
recollindex is using huge amounts of RAM, up to 12.5GB, when indexing ~500GB of files.
This makes the computer unstable and unusable. This is Recoll 1.23.3 on Linux (Gentoo), running on a quad-core i7 (8 threads) with 16GB of RAM and a 250GB SSD.
Please advise on how to proceed!

Discussion

  • Anonymous
    2018-07-30

    Are you sure that it is the recollindex process itself which is using a lot of memory, and not one of the input handler processes (e.g. the ppt format handler, or some other)? When this happens, it is usually caused by a specific document, so we need to look closer.
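One way to check this on Linux is to list the processes holding the most resident memory while indexing is running. The following is only a minimal sketch, not part of Recoll: it reads the `Name` and `VmRSS` fields from `/proc/<pid>/status`, and the function names `parse_status` and `top_memory_users` are made up for this illustration. Handler process names vary by file type, so look for anything started around the same time as recollindex.

```python
#!/usr/bin/env python3
"""List the processes using the most resident memory (Linux only, reads /proc)."""
import os
import re


def parse_status(text):
    """Return (name, rss_kib) parsed from the contents of a /proc/<pid>/status file."""
    name = re.search(r"^Name:\s*(.+)$", text, re.MULTILINE)
    rss = re.search(r"^VmRSS:\s*(\d+)\s*kB", text, re.MULTILINE)
    # Kernel threads have no VmRSS line; report them as 0 KiB.
    return (name.group(1).strip() if name else "?",
            int(rss.group(1)) if rss else 0)


def top_memory_users(limit=10, proc_root="/proc"):
    """Return up to `limit` tuples (rss_kib, pid, name), largest first."""
    try:
        entries = os.listdir(proc_root)
    except OSError:
        return []  # no /proc here (not Linux, or not mounted)
    rows = []
    for entry in entries:
        if not entry.isdigit():
            continue  # only numeric directories are processes
        try:
            with open(os.path.join(proc_root, entry, "status")) as f:
                name, rss = parse_status(f.read())
        except OSError:
            continue  # the process exited while we were scanning
        rows.append((rss, int(entry), name))
    rows.sort(reverse=True)
    return rows[:limit]


if __name__ == "__main__":
    for rss, pid, name in top_memory_users():
        print(f"{rss / 1024:10.1f} MiB  pid {pid:<7} {name}")
```

If recollindex is at the top of the list, the indexer itself is holding the memory; if a handler process is, the file it is currently working on is the likely culprit.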

    To make the diagnostics easier, you can turn off multithreading in the indexer by adding the following to recoll.conf (typically ~/.recoll/recoll.conf):

              thrQSizes = -1 -1 -1
    
     
    Last edit: medoc 2018-07-30
  • medoc
    2018-09-03

    Closing this for lack of feedback. To anybody reading this: excessive memory usage by the recoll indexer is usually due to an issue with one or several specific files, and often with the input handler for the file type, not with the indexer itself. So fixing the problem means determining which files are causing the issue, which is usually not difficult. At this point, and this has been true for a number of years, there are no known memory leaks or excessive memory usage issues in the indexer core.

     
  • medoc
    2018-09-03

    • status: open --> closed
    • milestone: -->
     
