Switch to side-by-side view

--- a
+++ b/website/BUGS.html
@@ -0,0 +1,312 @@
+<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">
+<html>
+  <head>
+    <title>Recoll known bugs</title>
+
+    <meta name="generator" content="HTML Tidy, see www.w3.org">
+    <meta name="Author" content="Jean-Francois Dockes">
+    <meta name="Description" content=
+    "recoll is a simple full-text search system for unix and linux
+    based on the powerful and mature xapian engine">
+    <meta name="Keywords" content=
+    "full text search, desktop search, unix, linux">
+    <meta http-equiv="Content-language" content="en">
+    <meta http-equiv="content-type" content="text/html; charset=iso-8859-1">
+    <meta name="robots" content="All,Index,Follow">
+
+    <link type="text/css" rel="stylesheet" href="styles/style.css">
+  </head>
+
+  <body>
+    
+    <div class="rightlinks">
+      <ul>
+	<li><a href="index.html">Home</a></li>
+	<li><a href="download.html">Downloads</a></li>
+	<li><a href="doc.html">Documentation</a></li>
+      </ul>
+    </div>
+    
+    <div class="content">
+
+      <h1>Known bugs in current and older versions</h1>
+
+      <p><i>Bugs that are listed in an older version section are
+	  supposedly fixed in later versions. Bugs listed in the
+	  topmost section may also exist in older versions.</i></p>
+
+      <h2>Latest (recoll 1.11.0 + xapian 1.0.x)</h2>
+      <ul>
+
+	<li> When Recoll is built with qt 4.4.0, the icons in the
+	  result list are all displayed at the top of the page and
+	  garbled. This appears to be a qt bug, fixed in 4.4.1. Use
+	  either qt 4.3.x or 4.4.1
+
+	<li> If the user-chosen result list entry format results in
+	  several paragraphs (in the qt textedit sense), right clicks
+	  will only work inside the first one for each entry.
+
+	<li> When a mime type has an external viewer defined, but the
+	  actual file is compressed (ie: xxx.txt.gz), recoll will try
+	  to start the external viewer on the compressed file, which
+	  will not work in most cases.
+
+	<li> NEAR crashes: 1.6 has added NEAR searches. Unlike what
+	  recoll did with PHRASES, stemming expansion is performed on
+	  terms inside NEAR clauses (except if prevented by a
+	  capitalized entry of course). There is a bug in Xapian (all
+	  versions as far as I know), where NEAR does not support
+	  multiple OR subclauses, as would result from a multiple
+	  expansion. This manifests itself by a 'not implemented'
+	  Xapian exception. Workarounds:
+	  <ul>
+	    <li>Prevent expansion of NEAR terms (possibly except one) by
+              capitalizing them.
+
+	    <li>Or apply the following patch to xapian, inside the
+              "api/" directory: 
+              http://www.recoll.org/xapian/xapNearDistrib-1.0.patch
+              or fetch the already patched source:
+	      http://www.recoll.org/xapian/xapian-core-1.0.7-recollNEARpatch.tar.gz
+              then recompile, and install.
+	    </li>
+	  </ul>
+
+	  I hope that an equivalent fix will make it into xapian at
+	  some point (the current fix is not completely correct but
+	  still handles most useful cases).</li>
+
+	<li> If you are seeing a delay of a few seconds before the
+	  result list displays for the first query of a recoll
+	  instance, try changing the result list font in the query
+	  preferences. This is not a recoll problem, I don't know the
+	  exact cause (I've seen it happen with "Sans Serif" and go
+	  away with Helvetica or Arial).
+
+	<li> Under some versions of KDE (ie: Fedora FC5 KDE
+	  3.5.4-0.5.fc5), there is a problem with the window stacking
+	  order. Opening the "browse" file selection dialog from the
+	  advanced search dialog will stack the latter under the main
+	  window, possibly making it invisible. This is quite probably
+	  a Kwin bug, possibly related to
+	  http://bugs.kde.org/show_bug.cgi?id=79183 or a correction
+	  thereof.
+
+	<li> Under Solaris, it is necessary to perform initial indexing with the
+	  recollindex program (the recoll index thread doesn't work for creating
+	  the database). Don't know the reason. Only idea I have is problem with
+	  exception handling (recoll catches an exception while trying the
+	  yet inexistant db).</li>
+      </ul>
+
+      <h2>1.10.6</h2>
+      <ul>
+	<li> If the locale is not utf-8, non-ascii command line
+	  arguments to recoll and recollq are not converted to utf-8,
+	  which may prevent, for example, the kde applet from
+	  working. The workaround is to apply the following one-line
+	  fix to qtgui/main.cpp, recompile and install recoll:
+	  <pre>
+	    386c386
+	    &lt;        sSearch->setSearchString(QString::fromUtf8(qstring.c_str()));
+	    ---
+	    &gt;        sSearch->setSearchString(QString::fromLocal8Bit(qstring.c_str()));
+	  </pre>
+	</li>
+      </ul>
+
+      <h2>1.10.1</h2>
+
+      <ul>
+	<li> A relatively simple error case can cause the indexer to
+	  stop processing an mbox file (forgetting all subsequent
+	  messages). More specifically, this happens when encountering
+	  more than than a few dozen errors while handling
+	  attachments. This is relatively common: for exemple if an
+	  external helper application is missing and multiple
+	  attachments of the affected type are found (ie: multiple
+	  images and no exiftool). Workaround: install the helper
+	  application.
+	<li> The decoding of base-64 data in emails fails in a relatively uncommon 
+	  but sometimes encountered case.
+	<li> In a preview window, when walking the search term hits with the
+	  Previous/Next buttons, 'Previous' actually acts as 'Next' (it does work
+	  normally for the local search).
+	<li> Problems in detecting message separators inside Thunderbird mailboxes
+	  (quite probably mainly for messages imported from outlook?). Can lead to
+	  unindexed messages, and even apparently indexer crashes in some cases.
+	<li> File names indexed as terms can sometimes overflow the maximum term
+	  size, halting the indexing.
+	<li> For Phrase/Near searches, only the first term group is highlighted in
+	  preview. 
+      </ul>
+
+      <h2>1.10.0</h2>
+      <ul>
+
+	<li> If a filter fails while trying to extract the data from a file, the file
+	  will not be indexed at all (not even the file name). The file
+	  name should be indexed in this case. This happens in particular in the
+	  very common case where the helper application is not installed (ie:
+	  missing Exiftool -> no *.jpg names in the index).
+
+	<li> If several query language "ext:" qualifiers are specified, they will be
+	  joined by an AND instead of OR, resulting in no results. Using an
+	  explicit OR doesn't work (actually OR + field names is generally
+	  broken). In some cases, you can use a "type:" qualifier as a workaround.
+
+
+      </ul>
+      <h2>1.9.x</h2>
+      <ul>
+	<li> Problems have been reported indexing big mailstores (several hundreds of
+	  thousands of messages): resulting in a very big database and even
+	  crashes.
+
+      </ul>
+      <h2>1.8.2</h2>
+      <ul>
+	<li> Under ubuntu (at least, maybe debian too), the default awk interpreter
+	  (mawk) is ancient, and the recoll pdf input filter does not
+	  work (removes all space characters). This can be solved by installing the
+	  gawk package. 
+  	  $ apt-get install gawk
+	  $ update-alternatives --set awk /usr/bin/gawk
+
+	<li> There are sometimes problems with document deletions: the index can
+	  get in a state where deleted or moved documents are not purged from the
+	  index (the log file says that the doc are deleted, but they aren't
+	  actually). When this happens, the only solution currently is to reindex
+	  from scratch (recollindex -z). This is due to a xapian bug, which is
+	  fixed in xapian 1.0.2, or you can apply the following patch to xapian
+	  1.0.1 to fix it:
+	  http://www.lesbonscomptes.com/recoll/xapian/xapian-delete-document.patch 
+
+	<li> The dates shown for email attachments in a result list are the email
+	  folder modification date. This should be inherited from the parent
+	  message instead.
+
+	<li> There are a few problems in the qt4 version of recoll: 
+	<li> Some accelerators (esc-spc, ctl-arrow) do not work, neither do
+	  copy/paste between the result list and preview windows and x11
+	  applications. 
+	<li> The qt4 q3textedit::find() method is extremely slow, so that
+	  positionning to first search term in Recoll preview has been disabled,
+	  and the application will sometimes appear to be looping when using the
+	  find feature in the preview window (it's not looping, it's searching...)
+
+      </ul>
+      <h2>1.8.1</h2>
+      <ul>
+	<li> This is not really a bug but .beagle really should be included in
+	  "skippedNames", or you end up indexing the beagle text cache, which is
+	  not really desirable.
+	<li> Doc bug: the manual states that the query language supports a "mime:"
+	  switch to filter mime types. There is currently no such thing.
+
+
+      </ul>
+      <h2>1.7.5</h2>
+      <ul>
+	<li> Debian and Ubuntu: the rclsoff Openoffice filter doesn't work,
+	  because of an incorrect shell syntax (understood by bash but not sh). To
+	  fix, you edit /usr[/local]/share/recoll/filters/rclsoff and can change
+	  the line:
+	  trap cleanup EXIT SIGHUP SIGQUIT SIGINT SIGTERM
+	  into:
+	  trap cleanup EXIT HUP QUIT INT TERM
+	  or download the updated filter from the filters page: 
+	  http://www.recoll.org/filters/filters.html
+
+      </ul>
+      <h2>1.7.3</h2>
+      <ul>
+	<li> Processing will stop on first error while indexing an mbox file. This
+	  could happen just because an attachment could not be decoded, and can
+	  cause non-indexing of many messages. The most probable cause of error is
+	  a missing filter (ie for ms-word files), so the temporary workaround
+	  would be to install the missing filters. This bug is specific to 1.7 and
+	  1.6 users need not worry. A correction will be issued very soon.
+	<li> Messages of type multipart/signed are not indexed. 
+
+      </ul>
+      <h2>1.6.2</h2>
+      <ul>
+	<li> Relatively unfrequent issue with message boundary detection in mbox
+	  files, could cause miscellaneous problems.
+	<li> Executing an external viewer for a file with single-quotes in the name
+	  would not work.
+
+      </ul>
+      <h2>1.5.10</h2>
+      <ul>
+	<li> If a defaultcharset was set in the configuration file for a subdirectory,
+	  it would stay in effect for all subsequent files/directories (except if
+	  explicitely overridden), potentially causing many transcoding errors.
+
+      </ul>
+      <h2>1.5.[1-7]</h2>
+      <ul>
+	<li> Dates in result list come from the file's ctimes, which may be confusing
+	<li> Some rare MIME messages with null boundaries can crash the indexer.
+
+      </ul>
+      <h2>1.5.0</h2>
+      <ul>
+	<li> Under some conditions, recoll startup and exit could be very slow: the
+	  simple search history list had serious problems with non-ascii strings,
+	  whose size sometimes doubled at each program startup/stop.
+
+      </ul>
+      <h2>1.3.3</h2>
+      <ul>
+
+	<li> Several of the external filters did not handle path names with embedded
+	  spaces (rcluncomp rclsoff rclps rclmedia rcldjvu). This is fixed in 1.4.
+
+	<li> If your QT installation is built with the QT_NO_STL flag, Recoll will not
+	  compile. I have a patch for this (will be fixed in the next release),
+	  contact me if you get the problem. Typical error message:
+	  main.cpp:160: error: no match for 'operator+=' in 'msg += reason'
+
+	<li> The 'None of these words' field in the complex search does not work if
+	  there are no other filled fields (it transforms into an ordinary
+	  search). Workaround: enter very common term(s) in the 'any of these
+	  words' field.
+
+	<li> Indexing cannot currently be conveniently and cleanly
+	  stopped when it's started. You can kill the process, and
+	  keyboard interrupt might work, but this may leave the
+	  database in a bad state. This is fixed in the upcoming
+	  release, there is no current workaround.
+      </ul>
+
+      <h2>1.2.2</h2>
+      <ul>
+	<li> The preview window is supposed to scroll after loading the document so
+	  that the first search term is visible. This does not work in many cases.
+	<li> The result list title is not shown for sorted lists
+
+	  Notes on older versions:
+	<li> Trouble compiling on some linux systems (Gentoo and Slackware?). There
+	  existed a quite common issue where the Recoll link will fail trying to
+	  use a libstdc++.la file. This was due to a problem with the xapian-config
+	  program. A workaround has been included in the configure script for
+	  recoll 1.2.2, and the problem should not occur any more.
+
+	<li> Case-insensitive search should now work in most cases
+	(used to not work except for accented ascii).
+
+	<li> All directories and files with names beginning with a dot were ignored
+	  by the skippedNames directive in the default recoll.conf file from
+	  older versions (no indexation of mozilla or thunderbird email !). An
+	  upgrade will not fix this (it will not modify an existing
+	  configuration). You need to edit recoll.conf by hand and remove the .*
+	  from skippedNames.</li>
+
+      </ul>
+
+    </div>
+  </body>
+</html>