None
closed
nobody
2017-09-07
2017-08-29
Anonymous
No

from: https://www.linuxquestions.org/questions/linux-software-2/recoll-works-wonderfully-in-linux-mint-fails-in-opensuse-tumbleweed-and-antergos-4175612825/

I have used and depended on Recoll in Linux Mint for several years. Version 1.21.5 in Mint reliably indexes a large directory on an external drive, without error messages. The configuration indicates the top directory to be indexed and the database directory, and nothing else. I have no settings in Local parameters, or in a recoll.conf file.

I recently configured boot partitions with openSUSE Tumbleweed and Antergos. I installed Recoll 1.23 on both, and couldn't get them to work. Each run reported thousands of errors, and tests afterward proved that Recoll failed to index many files.

I checked for missing helpers, experimented with local parameters and recoll.conf settings, but made no headway.

I haven't been able to find a log file of skipped/missed files, and don't see any mention in the documentation of where I can find such a log.

Any suggestions from Recoll users will be welcome. If others have had problems with Recoll in other-than-Debian-based distributions, I'd like to know that, too.

Discussion

  • Anonymous
    Anonymous
    2017-08-30

    Thanks very much for the reply. I ran the tests you suggest in Tumbleweed, and see that the Recoll version is 1.23.1 with Xapian 1.4.4, installed from the repository at: download.opensuse.org/repositories/KDE:/Extra/openSUSE_Tumbleweed/

    I added loglevel = 6, logfilename = stderr and thrQSizes = -1 -1 -1 to my recoll.conf file.

    Further investigation suggests that this installation of Recoll may have more trouble with documents created some years ago in Windows. It failed to index all 2010 .doc files in a testing directory, although it correctly indexed two vintage .pdf files, as well as two more recent .odt files. The .doc files may have been renamed from documents with .rtf extensions; I've forgotten.

    The Recoll version in Linux Mint 18 has no trouble with these documents.

    I edited and ran recollindex -i myunfindablefile.xxx > /tmp/myindexlog 2>&1, and am cut and pasting the results below:

    :4:common/rclconfig.cpp:563::RclConfig::initThrConf: chosen config (ql,nt): (-1, 0) (-1, 0) (-1, 0)
    :5:common/rclinit.cpp:346::rclinit: will use vfork() for starting commands
    :3:index/recollindex.cpp:518::recollindex: changing current directory to [/tmp]
    :4:utils/execmd.cpp:457::ExecCmd::startExec: (0|0) /usr/share/recoll/filters/rclcheckneedretry.sh
    :4:utils/execmd.cpp:961::ExecCmd::wait: got status 0x256
    :3:index/recollindex.cpp:548::recollindex: starting up
    :4:utils/execmd.cpp:457::ExecCmd::startExec: (0|0) /usr/bin/ionice {-c} {3} {-p} {17626}
    :4:utils/execmd.cpp:961::ExecCmd::wait: got status 0x0
    :4:rcldb/rcldb.cpp:758::Db::open: m_isopen 0 m_iswritable 0 mode 1
    :5:rcldb/stoplist.cpp:35::StopList::StopList: file_to_string(/home/username/.recoll/stoplist.txt) failed: open/stat: errno: 2 :
    :4:rcldb/rcldb.cpp:230::RclDb:: threads: haveWriteQ 0, wqlen -1 wqts 0
    :4:rcldb/rcldb.cpp:796::Db::open: lastdocid: 20
    :4:index/fsindexer.cpp:135::FsIndexer: threads: haveIQ 0 iql -1 iqts 0 haveSQ 0 sql -1 sqts 0
    :4:index/fsindexer.cpp:322::FsIndexer::indexFiles
    :4:index/fsindexer.cpp:288::FsIndexer::indexFiles: skipping [/mnt/username/doc/testing/2010earthquakeinsurancepremiums.doc] (ntd)
    :4:index/fsindexer.cpp:386::Indexfiles: purging orphans
    :4:index/fsindexer.cpp:398::FsIndexer::indexFiles: done
    :4:rcldb/rcldb.cpp:855::Db::i_close(0): m_isopen 1 m_iswritable 1
    :4:rcldb/rcldb.cpp:869::Rcl::Db:close: xapian will close. May take some time
    :4:rcldb/rcldb.cpp:873::Rcl::Db:close() xapian close done.
    :4:internfile/mimehandler.cpp:129::clearMimeHandlerCache()
    :4:rcldb/rcldb.cpp:737::Db::~Db: isopen 0 m_iswritable 0
    :4:rcldb/rcldb.cpp:855::Db::i_close(1): m_isopen 0 m_iswritable 0

     
  • medoc
    medoc
    2017-08-30

    The (ntd) in the following line:

    :4:index/fsindexer.cpp:288::FsIndexer::indexFiles: skipping [/mnt/username/doc/testing/2010earthquakeinsurancepremiums.doc] (ntd)

    means that you moved the file out of the indexed area (one of the topdirs), and it was skipped.

    From what you write, the first thing I would check would be the 'missing' file inside the config directory (or the missing apps entry in the GUI help menu). One possibility would be that antiword is not installed.

     
  • Anonymous
    Anonymous
    2017-08-30

    Thank you again for your efforts to help.

    openSUSE Tumbleweed YAST reports that antiword version 0.37-1.33 is installed. I also added a .doc file created in Linux to the test directory, and Recoll indexed it successfully.

    This time, I configured Recoll's top directory as: /mnt/username/doc/clip/test , created the directory test_subdir beneath it, and put all the files in test_subdir. The new log file is below. Thank you once more!


    :4:common/rclconfig.cpp:563::RclConfig::initThrConf: chosen config (ql,nt): (-1, 0) (-1, 0) (-1, 0)
    :5:common/rclinit.cpp:346::rclinit: will use vfork() for starting commands
    :3:index/recollindex.cpp:518::recollindex: changing current directory to [/tmp]
    :4:utils/execmd.cpp:457::ExecCmd::startExec: (0|0) /usr/share/recoll/filters/rclcheckneedretry.sh
    :4:utils/execmd.cpp:961::ExecCmd::wait: got status 0x256
    :3:index/recollindex.cpp:548::recollindex: starting up
    :4:utils/execmd.cpp:457::ExecCmd::startExec: (0|0) /usr/bin/ionice {-c} {3} {-p} {4330}
    :4:utils/execmd.cpp:961::ExecCmd::wait: got status 0x0
    :4:rcldb/rcldb.cpp:758::Db::open: m_isopen 0 m_iswritable 0 mode 1
    :5:rcldb/stoplist.cpp:35::StopList::StopList: file_to_string(/home/username/.recoll/stoplist.txt) failed: open/stat: errno: 2 :
    :4:rcldb/rcldb.cpp:230::RclDb:: threads: haveWriteQ 0, wqlen -1 wqts 0
    :4:rcldb/rcldb.cpp:796::Db::open: lastdocid: 24
    :4:index/fsindexer.cpp:135::FsIndexer: threads: haveIQ 0 iql -1 iqts 0 haveSQ 0 sql -1 sqts 0
    :4:index/fsindexer.cpp:322::FsIndexer::indexFiles
    :4:rcldb/rcldb.cpp:1827::Db::needUpdate:yes: olsig [45551504084396+] new [45551504084396] [Q/mnt/username/doc/clip/test/test_subdir/2010earthquakeinsurancepremiums.doc|]
    :4:index/fsindexer.cpp:639::processone: needupdate 1 noretry 0 existing 8 oldsig []
    :5:index/fsindexer.cpp:672::processone: processing: [5 KB ] /mnt/username/doc/clip/test/test_subdir/2010earthquakeinsurancepremiums.doc
    :5:internfile/internfile.cpp:122::FileInterner::FileInterner(fn=/mnt/username/doc/clip/test/test_subdir/2010earthquakeinsurancepremiums.doc)
    :4:internfile/internfile.cpp:167::FileInterner::init fn [/mnt/username/doc/clip/test/test_subdir/2010earthquakeinsurancepremiums.doc] mime [(null)] preview 0
    :4:internfile/mimehandler.cpp:268::getMimeHandler: mtype [application/msword] filtertypes 1
    :4:internfile/mimehandler.cpp:64::getMimeHandlerFromCache: ff44fdc414957c55aa0c8bcbcb767db1 cache size 0
    :4:internfile/mimehandler.cpp:80::getMimeHandlerFromCache: ff44fdc414957c55aa0c8bcbcb767db1 not found
    :4:internfile/internfile.cpp:250::FileInterner:: init ok application/msword [/mnt/username/doc/clip/test/test_subdir/2010earthquakeinsurancepremiums.doc]
    :4:internfile/internfile.cpp:743::FileInterner::internfile. ipath []
    :4:internfile/mh_execm.cpp:157::MimeHandlerExecMultiple::next_document(): [/mnt/username/doc/clip/test/test_subdir/2010earthquakeinsurancepremiums.doc]
    :4:internfile/mh_execm.cpp:39::MimeHandlerExecMultiple::startCmd
    :4:utils/execmd.cpp:457::ExecCmd::startExec: (1|1) /usr/share/recoll/filters/rcldoc.py
    /mnt/username/doc/clip/test/test_subdir/2010earthquakeinsurancepremiums.doc is not a Word Document. It is probably a Rich Text Format file
    /mnt/username/doc/clip/test/test_subdir/2010earthquakeinsurancepremiums.doc is not a Word Document.
    RCLMFILT: rcldoc.py: extractone: [/mnt/username/doc/clip/test/test_subdir/2010earthquakeinsurancepremiums.doc] returncode 1
    RCLMFILT: rcldoc.py: rcldoc.py: actual MIME type text/rtf
    RCLMFILT: rcldoc.py: rcldoc.py: returning cmd ['/usr/bin/python', '/usr/share/recoll/filters/rclrtf.py', '-s']
    No config directories. Searched: /etc/unrtf//usr/share/unrtf//
    RCLMFILT: rclrtf.py: extractone: [/mnt/username/doc/clip/test/test_subdir/2010earthquakeinsurancepremiums.doc] returncode 1
    Got error, eof 2
    RCLMFILT: rcldoc.py: extractone: [/mnt/username/doc/clip/test/test_subdir/2010earthquakeinsurancepremiums.doc] returncode 1
    :4:internfile/mh_execm.cpp:246::MHExecMultiple: got EOFNOW
    :4:internfile/mh_execm.cpp:252::MHExecMultiple: got SUBDOCERROR
    :4:internfile/mh_execm.cpp:97::MHExecMultiple: Got empty line
    :4:internfile/internfile.cpp:601::collectIpath..: fbytes->4555
    :2:internfile/internfile.cpp:738::FileInterner::internfile: next_document error [/mnt/username/doc/clip/test/test_subdir/2010earthquakeinsurancepremiums.doc] application/msword
    :4:internfile/mimehandler.cpp:99::returnMimeHandler: returning filter for application/msword cache size 0
    :4:internfile/internfile.cpp:857::FileInterner::internfile: conversion ended with no doc
    :4:rcldb/rcldb.cpp:1241::Db::add: udi [/mnt/username/doc/clip/test/test_subdir/2010earthquakeinsurancepremiums.doc|] parent []
    :5:rcldb/rcldb.cpp:1344::Db::add: field [containerfilename] pfx [XCFN] inc 1: [2010earthquakeinsurancepremiums.doc]
    :5:rcldb/rcldb.cpp:1344::Db::add: field [filename] pfx [XSFN] inc 1: [2010earthquakeinsurancepremiums.doc]
    :5:rcldb/rcldb.cpp:1572::Rcl::Db::add: new doc record:
    url=file:///mnt/username/doc/clip/test/test_subdir/2010earthquakeinsurancepremiums.doc
    mtype=application/msword
    fmtime=01299989802
    origcharset=
    fbytes=4555
    pcbytes=4555
    dbytes=0
    sig=45551504084396+
    filename=2010earthquakeinsurancepremiums.doc

    :3:rcldb/rcldb.cpp:609::Db::add: docid 8 updated [/mnt/username/doc/clip/test/test_subdir/2010earthquakeinsurancepremiums.doc|]
    :4:index/fsindexer.cpp:386::Indexfiles: purging orphans
    :4:index/fsindexer.cpp:398::FsIndexer::indexFiles: done
    :4:rcldb/rcldb.cpp:855::Db::i_close(0): m_isopen 1 m_iswritable 1
    :4:rcldb/rcldb.cpp:869::Rcl::Db:close: xapian will close. May take some time
    :4:rcldb/rcldb.cpp:873::Rcl::Db:close() xapian close done.
    :4:internfile/mimehandler.cpp:129::clearMimeHandlerCache()
    :4:utils/execmd.cpp:274::ExecCmd: killpg(4339, SIGTERM)
    :4:rcldb/rcldb.cpp:737::Db::~Db: isopen 0 m_iswritable 0
    :4:rcldb/rcldb.cpp:855::Db::i_close(1): m_isopen 0 m_iswritable 0

     
  • Anonymous
    Anonymous
    2017-08-30

    I apologize. I believe that I ran the last log check with Recoll loaded. I just ran it again, and have:

    :4:common/rclconfig.cpp:563::RclConfig::initThrConf: chosen config (ql,nt): (-1, 0) (-1, 0) (-1, 0)
    :5:common/rclinit.cpp:346::rclinit: will use vfork() for starting commands
    :3:index/recollindex.cpp:518::recollindex: changing current directory to [/tmp]
    :4:utils/execmd.cpp:457::ExecCmd::startExec: (0|0) /usr/share/recoll/filters/rclcheckneedretry.sh
    :4:utils/execmd.cpp:961::ExecCmd::wait: got status 0x256
    :3:index/recollindex.cpp:548::recollindex: starting up
    :4:utils/execmd.cpp:457::ExecCmd::startExec: (0|0) /usr/bin/ionice {-c} {3} {-p} {4730}
    :4:utils/execmd.cpp:961::ExecCmd::wait: got status 0x0
    :4:rcldb/rcldb.cpp:758::Db::open: m_isopen 0 m_iswritable 0 mode 1
    :5:rcldb/stoplist.cpp:35::StopList::StopList: file_to_string(/home/username/.recoll/stoplist.txt) failed: open/stat: errno: 2 :
    :4:rcldb/rcldb.cpp:230::RclDb:: threads: haveWriteQ 0, wqlen -1 wqts 0
    :4:rcldb/rcldb.cpp:796::Db::open: lastdocid: 24
    :4:index/fsindexer.cpp:135::FsIndexer: threads: haveIQ 0 iql -1 iqts 0 haveSQ 0 sql -1 sqts 0
    :4:index/fsindexer.cpp:322::FsIndexer::indexFiles
    :2:index/fsindexer.cpp:361::FsIndexer::indexFiles: (l)stat /mnt/username/doc/clip/test/test_subdir/2010earthquakeinsurancepremium.doc: No such file or directory
    :4:index/fsindexer.cpp:386::Indexfiles: purging orphans
    :4:index/fsindexer.cpp:398::FsIndexer::indexFiles: done
    :4:rcldb/rcldb.cpp:855::Db::i_close(0): m_isopen 1 m_iswritable 1
    :4:rcldb/rcldb.cpp:869::Rcl::Db:close: xapian will close. May take some time
    :4:rcldb/rcldb.cpp:873::Rcl::Db:close() xapian close done.
    :4:internfile/mimehandler.cpp:129::clearMimeHandlerCache()
    :4:rcldb/rcldb.cpp:737::Db::~Db: isopen 0 m_iswritable 0
    :4:rcldb/rcldb.cpp:855::Db::i_close(1): m_isopen 0 m_iswritable 0

     
  • Anonymous
    Anonymous
    2017-08-30

    (With embarrassment) I ran the log file above for 2010earthquakeinsurancepremium.doc, and not 2010earthquakeinsurancepremiums.doc. The missing 's' at the end mattered!

    I'll try once again. The command issued from the prompt, with Recoll closed:

    recollindex -i /mnt/username/doc/clip/test/test_subdir/2010earthquakeinsurancepremiums.doc > /tmp/myindexlog 2>&1

    The log file:


    :4:common/rclconfig.cpp:563::RclConfig::initThrConf: chosen config (ql,nt): (-1, 0) (-1, 0) (-1, 0)
    :5:common/rclinit.cpp:346::rclinit: will use vfork() for starting commands
    :3:index/recollindex.cpp:518::recollindex: changing current directory to [/tmp]
    :4:utils/execmd.cpp:457::ExecCmd::startExec: (0|0) /usr/share/recoll/filters/rclcheckneedretry.sh
    :4:utils/execmd.cpp:961::ExecCmd::wait: got status 0x256
    :3:index/recollindex.cpp:548::recollindex: starting up
    :4:utils/execmd.cpp:457::ExecCmd::startExec: (0|0) /usr/bin/ionice {-c} {3} {-p} {4785}
    :4:utils/execmd.cpp:961::ExecCmd::wait: got status 0x0
    :4:rcldb/rcldb.cpp:758::Db::open: m_isopen 0 m_iswritable 0 mode 1
    :5:rcldb/stoplist.cpp:35::StopList::StopList: file_to_string(/home/username/.recoll/stoplist.txt) failed: open/stat: errno: 2 :
    :4:rcldb/rcldb.cpp:230::RclDb:: threads: haveWriteQ 0, wqlen -1 wqts 0
    :4:rcldb/rcldb.cpp:796::Db::open: lastdocid: 24
    :4:index/fsindexer.cpp:135::FsIndexer: threads: haveIQ 0 iql -1 iqts 0 haveSQ 0 sql -1 sqts 0
    :4:index/fsindexer.cpp:322::FsIndexer::indexFiles
    :4:rcldb/rcldb.cpp:1827::Db::needUpdate:yes: olsig [45551504084396+] new [45551504084396] [Q/mnt/username/doc/clip/test/test_subdir/2010earthquakeinsurancepremiums.doc|]
    :4:index/fsindexer.cpp:639::processone: needupdate 1 noretry 0 existing 8 oldsig []
    :5:index/fsindexer.cpp:672::processone: processing: [5 KB ] /mnt/username/doc/clip/test/test_subdir/2010earthquakeinsurancepremiums.doc
    :5:internfile/internfile.cpp:122::FileInterner::FileInterner(fn=/mnt/username/doc/clip/test/test_subdir/2010earthquakeinsurancepremiums.doc)
    :4:internfile/internfile.cpp:167::FileInterner::init fn [/mnt/username/doc/clip/test/test_subdir/2010earthquakeinsurancepremiums.doc] mime [(null)] preview 0
    :4:internfile/mimehandler.cpp:268::getMimeHandler: mtype [application/msword] filtertypes 1
    :4:internfile/mimehandler.cpp:64::getMimeHandlerFromCache: ff44fdc414957c55aa0c8bcbcb767db1 cache size 0
    :4:internfile/mimehandler.cpp:80::getMimeHandlerFromCache: ff44fdc414957c55aa0c8bcbcb767db1 not found
    :4:internfile/internfile.cpp:250::FileInterner:: init ok application/msword [/mnt/username/doc/clip/test/test_subdir/2010earthquakeinsurancepremiums.doc]
    :4:internfile/internfile.cpp:743::FileInterner::internfile. ipath []
    :4:internfile/mh_execm.cpp:157::MimeHandlerExecMultiple::next_document(): [/mnt/username/doc/clip/test/test_subdir/2010earthquakeinsurancepremiums.doc]
    :4:internfile/mh_execm.cpp:39::MimeHandlerExecMultiple::startCmd
    :4:utils/execmd.cpp:457::ExecCmd::startExec: (1|1) /usr/share/recoll/filters/rcldoc.py
    /mnt/username/doc/clip/test/test_subdir/2010earthquakeinsurancepremiums.doc is not a Word Document. It is probably a Rich Text Format file
    /mnt/username/doc/clip/test/test_subdir/2010earthquakeinsurancepremiums.doc is not a Word Document.
    RCLMFILT: rcldoc.py: extractone: [/mnt/username/doc/clip/test/test_subdir/2010earthquakeinsurancepremiums.doc] returncode 1
    RCLMFILT: rcldoc.py: rcldoc.py: actual MIME type text/rtf
    RCLMFILT: rcldoc.py: rcldoc.py: returning cmd ['/usr/bin/python', '/usr/share/recoll/filters/rclrtf.py', '-s']
    No config directories. Searched: /etc/unrtf//usr/share/unrtf//
    RCLMFILT: rclrtf.py: extractone: [/mnt/username/doc/clip/test/test_subdir/2010earthquakeinsurancepremiums.doc] returncode 1
    Got error, eof 2
    RCLMFILT: rcldoc.py: extractone: [/mnt/username/doc/clip/test/test_subdir/2010earthquakeinsurancepremiums.doc] returncode 1
    :4:internfile/mh_execm.cpp:246::MHExecMultiple: got EOFNOW
    :4:internfile/mh_execm.cpp:252::MHExecMultiple: got SUBDOCERROR
    :4:internfile/mh_execm.cpp:97::MHExecMultiple: Got empty line
    :4:internfile/internfile.cpp:601::collectIpath..: fbytes->4555
    :2:internfile/internfile.cpp:738::FileInterner::internfile: next_document error [/mnt/username/doc/clip/test/test_subdir/2010earthquakeinsurancepremiums.doc] application/msword
    :4:internfile/mimehandler.cpp:99::returnMimeHandler: returning filter for application/msword cache size 0
    :4:internfile/internfile.cpp:857::FileInterner::internfile: conversion ended with no doc
    :4:rcldb/rcldb.cpp:1241::Db::add: udi [/mnt/username/doc/clip/test/test_subdir/2010earthquakeinsurancepremiums.doc|] parent []
    :5:rcldb/rcldb.cpp:1344::Db::add: field [containerfilename] pfx [XCFN] inc 1: [2010earthquakeinsurancepremiums.doc]
    :5:rcldb/rcldb.cpp:1344::Db::add: field [filename] pfx [XSFN] inc 1: [2010earthquakeinsurancepremiums.doc]
    :5:rcldb/rcldb.cpp:1572::Rcl::Db::add: new doc record:
    url=file:///mnt/username/doc/clip/test/test_subdir/2010earthquakeinsurancepremiums.doc
    mtype=application/msword
    fmtime=01299989802
    origcharset=
    fbytes=4555
    pcbytes=4555
    dbytes=0
    sig=45551504084396+
    filename=2010earthquakeinsurancepremiums.doc

    :3:rcldb/rcldb.cpp:609::Db::add: docid 8 updated [/mnt/username/doc/clip/test/test_subdir/2010earthquakeinsurancepremiums.doc|]
    :4:index/fsindexer.cpp:386::Indexfiles: purging orphans
    :4:index/fsindexer.cpp:398::FsIndexer::indexFiles: done
    :4:rcldb/rcldb.cpp:855::Db::i_close(0): m_isopen 1 m_iswritable 1
    :4:rcldb/rcldb.cpp:869::Rcl::Db:close: xapian will close. May take some time
    :4:rcldb/rcldb.cpp:873::Rcl::Db:close() xapian close done.
    :4:internfile/mimehandler.cpp:129::clearMimeHandlerCache()
    :4:utils/execmd.cpp:274::ExecCmd: killpg(4793, SIGTERM)
    :4:rcldb/rcldb.cpp:737::Db::~Db: isopen 0 m_iswritable 0
    :4:rcldb/rcldb.cpp:855::Db::i_close(1): m_isopen 0 m_iswritable 0

     
  • Anonymous
    Anonymous
    2017-08-30

    After looking at the log file above, I verified that unrtf is installed, renamed the file extension to rtf, and updated the index. Recoll still did not see the file. I re-ran the command to create my indexlog, and now have:


    :4:common/rclconfig.cpp:563::RclConfig::initThrConf: chosen config (ql,nt): (-1, 0) (-1, 0) (-1, 0)
    :5:common/rclinit.cpp:346::rclinit: will use vfork() for starting commands
    :3:index/recollindex.cpp:518::recollindex: changing current directory to [/tmp]
    :4:utils/execmd.cpp:457::ExecCmd::startExec: (0|0) /usr/share/recoll/filters/rclcheckneedretry.sh
    :4:utils/execmd.cpp:961::ExecCmd::wait: got status 0x256
    :3:index/recollindex.cpp:548::recollindex: starting up
    :4:utils/execmd.cpp:457::ExecCmd::startExec: (0|0) /usr/bin/ionice {-c} {3} {-p} {5266}
    :4:utils/execmd.cpp:961::ExecCmd::wait: got status 0x0
    :4:rcldb/rcldb.cpp:758::Db::open: m_isopen 0 m_iswritable 0 mode 1
    :5:rcldb/stoplist.cpp:35::StopList::StopList: file_to_string(/home/username/.recoll/stoplist.txt) failed: open/stat: errno: 2 :
    :4:rcldb/rcldb.cpp:230::RclDb:: threads: haveWriteQ 0, wqlen -1 wqts 0
    :4:rcldb/rcldb.cpp:796::Db::open: lastdocid: 24
    :4:index/fsindexer.cpp:135::FsIndexer: threads: haveIQ 0 iql -1 iqts 0 haveSQ 0 sql -1 sqts 0
    :4:index/fsindexer.cpp:322::FsIndexer::indexFiles
    :4:rcldb/rcldb.cpp:1827::Db::needUpdate:yes: olsig [45551504086217+] new [45551504086217] [Q/mnt/username/doc/clip/test/test_subdir/2010earthquakeinsurancepremiums.rtf|]
    :4:index/fsindexer.cpp:639::processone: needupdate 1 noretry 0 existing 8 oldsig []
    :5:index/fsindexer.cpp:672::processone: processing: [5 KB ] /mnt/username/doc/clip/test/test_subdir/2010earthquakeinsurancepremiums.rtf
    :5:internfile/internfile.cpp:122::FileInterner::FileInterner(fn=/mnt/username/doc/clip/test/test_subdir/2010earthquakeinsurancepremiums.rtf)
    :4:internfile/internfile.cpp:167::FileInterner::init fn [/mnt/username/doc/clip/test/test_subdir/2010earthquakeinsurancepremiums.rtf] mime [(null)] preview 0
    :4:internfile/mimehandler.cpp:268::getMimeHandler: mtype [text/rtf] filtertypes 1
    :4:internfile/mimehandler.cpp:64::getMimeHandlerFromCache: efa6b42f5b41b994b6a0b99321a1119c cache size 0
    :4:internfile/mimehandler.cpp:80::getMimeHandlerFromCache: efa6b42f5b41b994b6a0b99321a1119c not found
    :4:internfile/internfile.cpp:250::FileInterner:: init ok text/rtf [/mnt/username/doc/clip/test/test_subdir/2010earthquakeinsurancepremiums.rtf]
    :4:internfile/internfile.cpp:743::FileInterner::internfile. ipath []
    :4:utils/execmd.cpp:457::ExecCmd::startExec: (0|1) /usr/bin/unrtf {--nopict} {--html} {/mnt/username/doc/clip/test/test_subdir/2010earthquakeinsurancepremiums.rtf}
    No config directories. Searched: /etc/unrtf//usr/share/unrtf//
    :5:utils/netcon.cpp:277::Netcon::selectloop: fd 11 has 0x0 mask, erasing
    :5:utils/execmd.cpp:795::ExecCmd::doexec: selectloop returned 0
    :4:utils/execmd.cpp:961::ExecCmd::wait: got status 0x256
    :2:internfile/mh_exec.cpp:178::MimeHandlerExec: command status 0x256 for /usr/bin/unrtf
    :4:internfile/internfile.cpp:601::collectIpath..: fbytes->4555
    :2:internfile/internfile.cpp:738::FileInterner::internfile: next_document error [/mnt/username/doc/clip/test/test_subdir/2010earthquakeinsurancepremiums.rtf] text/rtf
    :4:internfile/mimehandler.cpp:99::returnMimeHandler: returning filter for text/rtf cache size 0
    :4:internfile/internfile.cpp:857::FileInterner::internfile: conversion ended with no doc
    :4:rcldb/rcldb.cpp:1241::Db::add: udi [/mnt/username/doc/clip/test/test_subdir/2010earthquakeinsurancepremiums.rtf|] parent []
    :5:rcldb/rcldb.cpp:1344::Db::add: field [containerfilename] pfx [XCFN] inc 1: [2010earthquakeinsurancepremiums.rtf]
    :5:rcldb/rcldb.cpp:1344::Db::add: field [filename] pfx [XSFN] inc 1: [2010earthquakeinsurancepremiums.rtf]
    :5:rcldb/rcldb.cpp:1572::Rcl::Db::add: new doc record:
    url=file:///mnt/username/doc/clip/test/test_subdir/2010earthquakeinsurancepremiums.rtf
    mtype=text/rtf
    fmtime=01299989802
    origcharset=
    fbytes=4555
    pcbytes=4555
    dbytes=0
    sig=45551504086217+
    filename=2010earthquakeinsurancepremiums.rtf

    :3:rcldb/rcldb.cpp:609::Db::add: docid 8 updated [/mnt/username/doc/clip/test/test_subdir/2010earthquakeinsurancepremiums.rtf|]
    :4:index/fsindexer.cpp:386::Indexfiles: purging orphans
    :4:index/fsindexer.cpp:398::FsIndexer::indexFiles: done
    :4:rcldb/rcldb.cpp:855::Db::i_close(0): m_isopen 1 m_iswritable 1
    :4:rcldb/rcldb.cpp:869::Rcl::Db:close: xapian will close. May take some time
    :4:rcldb/rcldb.cpp:873::Rcl::Db:close() xapian close done.
    :4:internfile/mimehandler.cpp:129::clearMimeHandlerCache()
    :4:rcldb/rcldb.cpp:737::Db::~Db: isopen 0 m_iswritable 0
    :4:rcldb/rcldb.cpp:855::Db::i_close(1): m_isopen 0 m_iswritable 0

     
  • medoc
    medoc
    2017-08-30

    It looks like unrtf is not working for some reason (maybe because of the environment recoll uses to executing it). The 'No config directories' message is a fatal unrtf error.

    Could you try the following (one line, the site is breaking it) on the command line ?

    unrtf --noppict --html /mnt/username/doc/clip/test/test_subdir/2010earthquakeinsurancepremiums.rtf

    This should output html data on stdout.

     
  • Anonymous
    Anonymous
    2017-08-30

    I cut and pasted this command into the terminal. The results:

    No config directories. Searched: /etc/unrtf//usr/share/unrtf//

    In /etc/unrtf I find the following files: html.conf, latex.conf,rtf.conf,text.conf,troff_mm.conf,vt.conf

    In /usr/share/unrtf, I find: SYMBOL.charmap

     
    Last edit: Anonymous 2017-08-30
  • medoc
    medoc
    2017-08-30

    This looks like an unrtf configuration issue. The search path is supposed to be colon-separated. Please retry the command as:

    UNRTF_SEARCH_PATH=/etc/unrtf:/usr/share/unrtf unrtf --noppict --html /mnt/username/doc/clip/test/test_subdir/2010earthquakeinsurancepremiums.rtf

    (Still one line)

     
  • Anonymous
    Anonymous
    2017-08-30

    Certainly!

    UNRTF_SEARCH_PATH=/etc/unrtf:/usr/share/unrtf unrtf --noppict --html /mnt/username/doc/clip/test/test_subdir/2010earthquakeinsurancepremiums.rtf

    failed to find noppict.conf in search path dirs

     
  • medoc
    medoc
    2017-08-30

    Oops sorry, --nopict

     
  • Anonymous
    Anonymous
    2017-08-31

    Very different results this time!

    Command:

    UNRTF_SEARCH_PATH=/etc/unrtf:/usr/share/unrtf unrtf --nopict --html /mnt/username/doc/clip/test/test_subdir/2010earthquakeinsurancepremiums.rtf

    results:

    <!DOCTYPE html PUBLIC -//W3C//DTD HTML 4.01 Transitional//EN>



    -->
    -->

    Firms find cost of earthquake insurance too big a jolt

    SMALL BUSINESS

    Many forgo coverage because of the high premiums and deductibles.

    April 19, 2010|By Cyndia Zwahlen and Nathan Olivarez-Giles



    Like small-business owners across Southern California, Hilario Sanchez knows all too well the shaking, the damage and, of course, the inevitable cleanup after an earthquake.


    ... followed by the text of the clipped article, and the HTML formatting codes "span," "font," "body" and "html" (stripped of HTML formatting here, to insure that the codes appear in this message.)

     
    • medoc
      medoc
      2017-08-31

      Check if you have a bad UNRTF_SEARCH_PATH in your environment:

      echo $UNRTF_SEARCH_PATH

      If it is set, unset it (in your shell init file), or change it to the correct value.

      If this is not set, it would appear that the default value compiled in the unrtf command you have is incorrect, I'm not too sure of what happened, you will have to see with the packager.

       
      Last edit: medoc 2017-08-31
  • Anonymous
    Anonymous
    2017-08-31

    First, medoc, thanks again for your help in troubleshooting this problem.

    The command echo $UNRTF_SEARCH_PATH produces no results.

    As for the rest: I haven't edited a shell init file before, and am unsure of what to do next. I'll be happy to create a ~/.bashrc file or a file in /etc/profile.d (or elsewhere) if it will fix this problem, but don't know what to put in the file. I'm in new territory.

    FWIW: Although I no longer have Arch-based Antergos on disk and can't easily test, my problems running Recoll in that distro were about the same. I wonder if unrtf is being incorrectly configured by more than one packager.

     
  • medoc
    medoc
    2017-08-31

    if you are using bash, you probably already have a .bashrc in hour home. Actually it is sourced by ~/.bash_profile.

    just add:

    export UNRTF_SEARCH_PATH=/etc/unrtf:/usr/share/unrtf

    at the end.

    I don't understand why unrtf is misconfigured on both systems,this is weird. I am currently installing OpenSUSE tumbleweed to have a look

     
  • Anonymous
    Anonymous
    2017-08-31

    (with a smile) Success! Adding export UNRTF_SEARCH_PATH=/etc/unrtf:/usr/share/unrtf to the existing .bashrc file didn't produce results, but I re-read your message, created a one line .bash_profile file, and tried again. Recoll indexed everything in the test directory.

    For me, the timing is perfect; I'm about to leave town for a week, and was about to apologize for having to bow out of this troubleshooting thread. I'll be back on September 6, and can try other steps then, if you wish. In the meantime -- thanks to your patient help -- it looks like I have a functional Recoll in openSUSE. I'm grateful, medoc!

     
  • medoc
    medoc
    2017-09-01

    This definitely looks like an unrtf (probably packaging) issue.

    As far as I can see there is no official package for unrtf on opensuse. Where did your unrtf come from ?

     
  • Anonymous
    Anonymous
    2017-09-06

    I just checked YAST --> Software Management. It says:

    021.51-1.33-x86_64 from openSUSE_Tumbleweed ... and vendor obs://build.opensuse.org/KDE:Extra

    That's the same repository that Recoll came from. I don't know of another one for openSUSE Tumbleweed. If my memory isn't playing tricks on me, I didn't install unrtf separately. I think it was installed automatically with Recoll.

    Here's the file list, from Software Management:

    /etc/unrtf
    /etc/unrtf/html.conf
    /etc/unrtf/latex.conf
    /etc/unrtf/rtf.conf
    /etc/unrtf/text.conf
    /etc/unrtf/troff_mm.conf
    /etc/unrtf/vt.conf
    /usr/bin/unrtf
    /usr/share/doc/packages/unrtf
    /usr/share/doc/packages/unrtf/AUTHORS
    /usr/share/doc/packages/unrtf/COPYING
    /usr/share/doc/packages/unrtf/ChangeLog
    /usr/share/doc/packages/unrtf/NEWS
    /usr/share/doc/packages/unrtf/README
    /usr/share/man/man1/unrtf.1.gz
    /usr/share/unrtf
    /usr/share/unrtf/SYMBOL.charmap

     
    Last edit: Anonymous 2017-09-06
  • medoc
    medoc
    2017-09-07

    • status: open --> closed
    • milestone: -->
     

Cancel   Add attachment