Xubuntu 17.04
Recoll 1.23.2 from PPA recoll-backports/recoll-1.15-on
Report:
:2:internfile/internfile.cpp:738::FileInterner::internfile: next_document error [/home/(path)/(filename).xls] application/vnd.ms-excel
Discussion
-
medoc
2017-08-17Hi, and thanks for reporting this. Actually there are quite a few XLS documents which the text extractor is unable to process. Could you please report the output of:
/usr/share/recoll/filters/rclxls.py /path/to/problem.xls
-
Anonymous
2017-08-18Report:
"Traceback (most recent call last):
File "/usr/share/recoll/filters/rclxls.py", line 80, in
rclexecm.main(proto, extract)
File "/usr/share/recoll/filters/rclexecm.py", line 427, in main
ok, data, ipath, eof = extract.getnext(params)
File "/usr/share/recoll/filters/rclexec1.py", line 125, in getnext
ret= self.extractone(params)
File "/usr/share/recoll/filters/rclexec1.py", line 103, in extractone
ok, data = self.runCmd(cmd, fn, postproc, opt)
File "/usr/share/recoll/filters/rclexec1.py", line 85, in runCmd
return True, postproc.wrapData()
File "/usr/share/recoll/filters/rclxls.py", line 39, in wrapData
data = xml.sax.parseString(self.xmldata, handler)
File "/usr/lib/python2.7/xml/sax/init.py", line 49, in parseString
parser.parse(inpsrc)
File "/usr/lib/python2.7/xml/sax/expatreader.py", line 110, in parse
xmlreader.IncrementalParser.parse(self, source)
File "/usr/lib/python2.7/xml/sax/xmlreader.py", line 125, in parse
self.close()
File "/usr/lib/python2.7/xml/sax/expatreader.py", line 225, in close
self.feed("", isFinal = 1)
File "/usr/lib/python2.7/xml/sax/expatreader.py", line 217, in feed
self._err_handler.fatalError(exc)
File "/usr/lib/python2.7/xml/sax/handler.py", line 38, in fatalError
raise exception
xml.sax._exceptions.SAXParseException: :1:0: no element found"
-
medoc
2017-08-18I think that this means that the xls text extractor can't grok the file, or does not find any data in it. There is not much I can do about it.
-
medoc
2017-09-01Not much I can do...
-
medoc
2017-09-01- status: open --> wont-fix
- milestone: -->