None
wont-fix
nobody
None
2017-09-01
2017-08-17
Anonymous
No

Xubuntu 17.04
Recoll 1.23.2 from PPA recoll-backports/recoll-1.15-on

Report:
:2:internfile/internfile.cpp:738::FileInterner::internfile: next_document error [/home/(path)/(filename).xls] application/vnd.ms-excel

Discussion

  • medoc
    2017-08-17

    Hi, and thanks for reporting this. Actually there are quite a few XLS documents which the text extractor is unable to process. Could you please report the output of:

    /usr/share/recoll/filters/rclxls.py /path/to/problem.xls

     
  • Anonymous
    2017-08-18

    Report:
    "Traceback (most recent call last):
      File "/usr/share/recoll/filters/rclxls.py", line 80, in <module>
        rclexecm.main(proto, extract)
      File "/usr/share/recoll/filters/rclexecm.py", line 427, in main
        ok, data, ipath, eof = extract.getnext(params)
      File "/usr/share/recoll/filters/rclexec1.py", line 125, in getnext
        ret= self.extractone(params)
      File "/usr/share/recoll/filters/rclexec1.py", line 103, in extractone
        ok, data = self.runCmd(cmd, fn, postproc, opt)
      File "/usr/share/recoll/filters/rclexec1.py", line 85, in runCmd
        return True, postproc.wrapData()
      File "/usr/share/recoll/filters/rclxls.py", line 39, in wrapData
        data = xml.sax.parseString(self.xmldata, handler)
      File "/usr/lib/python2.7/xml/sax/__init__.py", line 49, in parseString
        parser.parse(inpsrc)
      File "/usr/lib/python2.7/xml/sax/expatreader.py", line 110, in parse
        xmlreader.IncrementalParser.parse(self, source)
      File "/usr/lib/python2.7/xml/sax/xmlreader.py", line 125, in parse
        self.close()
      File "/usr/lib/python2.7/xml/sax/expatreader.py", line 225, in close
        self.feed("", isFinal = 1)
      File "/usr/lib/python2.7/xml/sax/expatreader.py", line 217, in feed
        self._err_handler.fatalError(exc)
      File "/usr/lib/python2.7/xml/sax/handler.py", line 38, in fatalError
        raise exception
    xml.sax._exceptions.SAXParseException: <unknown>:1:0: no element found"

     
  • medoc
    2017-08-18

    I think that this means that the xls text extractor can't grok the file, or does not find any data in it. There is not much I can do about it.

     
  • medoc
    2017-09-01

    Not much I can do...

     
  • medoc
    2017-09-01

    • status: open --> wont-fix
    • milestone: -->
     
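For readers who hit the same traceback: it ends inside xml.sax.parseString, called from wrapData in rclxls.py, with "no element found" at line 1, column 0. One input that produces exactly this error is an empty document, which would fit the suggestion above that the extractor found no data. Whether that is what happened for this particular file is an assumption. The sketch below only demonstrates the parser behaviour; parse_extractor_output is a hypothetical helper written for illustration and is not part of the Recoll filters.

```python
import xml.sax
from xml.sax.handler import ContentHandler


def parse_extractor_output(xmldata):
    """SAX-parse a byte string and return (ok, message).

    Hypothetical helper for illustration only; it is not part of rclxls.py.
    """
    try:
        # ContentHandler() is the do-nothing base handler; only
        # well-formedness of the input matters for this check.
        xml.sax.parseString(xmldata, ContentHandler())
    except xml.sax.SAXParseException as exc:
        # getMessage() returns the parser's error text without the
        # "<unknown>:line:column:" location prefix.
        return False, exc.getMessage()
    return True, ""


# Empty input, as if the extractor had produced no XML at all.
print(parse_extractor_output(b""))

# A minimal well-formed document parses without error.
print(parse_extractor_output(b"<doc>text</doc>"))
```

A check of this kind could be used to tell an empty extractor result apart from other failures, but it does not make the XLS file itself readable.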
