Recoll journal of user-visible changes
1.11.0
Recoll release 1.11 has relatively extensive changes that have necessitated a modification of the index format. Hence installing this release implies a full re-indexing, which is enforced by the software.
- Filtering on category (message/text/media etc.) is now available directly from the main window for quick access.
- Use html for preview when available (ex: html files or "colorized" python) instead of converting to text. This can be turned off in the preferences.
- New Python query and index interfaces. The Python query interface will be used for building a Xesam adapter for Recoll when the specification is stabilized, and could be useful for other things, such as indexing contents from an RDBMS (see the manual for details). Restructured and cleaned up internal Recoll interfaces.
- Improved filter framework. Can now process either html or text output from the filters, and more easily execute "raw" commands instead of Recoll scripts. Avoided wasteful repeated execution of filters for which the helper application is missing.
- Query language now closer to the Xesam specification (but still far from a complete implementation). See the Recoll manual and http://www.xesam.org/main/XesamUserSearchLanguage
- Much improved configuration for fields. Fields like "author" can now be specified as storable (displayable in results) and/or indexed (searchable). Added alias facility for translating from user-level names to internal ones.
- Added "recipient" as an indexed/searchable field for emails.
- rcltext filter for processing text such as C code for which no specific processing is needed when indexing but a specific viewer is desired.
1.10.6
- Fix a simple and mildly nasty bug that would cause the indexer to stop indexing an mbox on encountering a specific but not exceptional error condition (like a few dozen errors while indexing attachments for which no filter was installed).
1.10.5
- Ensure that file names indexed as terms don't overflow the maximum term size.
- Handle non-standard date format in mbox separator lines sometimes generated by thunderbird.
- Use attachment file names to help identify a better mime type for parts only described as application/octet-stream
- For Phrase/Near searches, highlight all term groups in preview, not just the first
- Added Open XML filters
1.10.2
- Fixed openSUSE 11 compile issues.
- Fixed bug in interpreting email mime structure, which resulted in base-64 decoding errors.
- Fixed "Prev" button in preview window. Would actually go forward when walking the search terms.
- Allow setting the highlight color for search terms in result list and preview (yes: feature change, should have waited for major release...)
- Added svg filter
1.10.1
- Ensure that in case the data of a file can't be indexed because of some error, at least the file name is indexed.
- Improve query language to support OR queries of terms with field specifications (ie: title:someterm OR author:someauthor).
- Fix filename search to split patterns on white space, so that a "*.jpg *.jpeg" search does what's expected. Means you now need to use double-quotes if there is actual embedded white space.
- Jump directly to the external editor choice dialog instead of opening preferences when an external viewer is not found.
- Allow stopping indexing through menu action (only works with qt4 for now).
- Create an "indexedmimetypes" configuration variable to allow explicitly restricting the file types which do get indexed.
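The filename search change in this release (patterns split on white space, double quotes protecting embedded white space) can be illustrated with a short Python sketch. This is not Recoll's implementation: the function names are hypothetical, and the standard library's shlex and fnmatch modules merely stand in for Recoll's own splitting and wildcard matching (shlex also honours single quotes and backslashes, which Recoll may not).

```python
import fnmatch
import shlex


def split_patterns(search):
    """Split a search string on white space, keeping quoted parts whole."""
    return shlex.split(search)


def matching_names(search, names):
    """Return the names matching at least one pattern of the search string."""
    patterns = split_patterns(search)
    return [n for n in names
            if any(fnmatch.fnmatchcase(n, p) for p in patterns)]


names = ["a.jpg", "b.jpeg", "my photo.png", "notes.txt"]

# Two separate patterns: a name matches if either pattern matches.
print(matching_names("*.jpg *.jpeg", names))

# Double quotes keep the embedded white space inside a single pattern.
print(matching_names('"my photo.*"', names))
```

Without the quotes, the second search would be split into the two patterns "my" and "photo.*", neither of which matches the file name.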
1.10.0
- Added a GUI dialog to configure the indexing parameters.
- Added better support for indexing CJK text (Chinese, Japanese, Korean). Please note that:
  - You will need a full reindex to take good advantage of this. (You *don't* need to reindex if you don't need to search CJK, even if there is some in your index).
  - When entering CJK search terms, words (single or multiple characters) should be separated with white space.
  - The specific CJK processing can be turned off by setting the nocjk variable to true in the configuration file (this may make sense if you have a mixed cjk/other document base and don't want to index the cjk part, as it will save some disk space and a minuscule amount of cpu).
- Changed the way Recoll handles searches including composite words (like an email address). The new approach looks saner, but could have side-effects. Please report any problems in this area.
- The query language got a new "dir:" specifier to filter results on location.
- New rclimg perl filter for better indexing of picture tags, thanks to Cedric Scott. This depends on Exiftool.
- New rcltex filter.
- Changed and improved how the preview window local search finds the query terms; this does not involve weird characters any more. The display is cleaner and cut and paste works better.
- Fixed the fact that a newline-separated word list in simple search would wrongly trigger a phrase search.
- Fixed the way we input text to the preview textedit (the old way would sometimes confuse the window into displaying tags instead of acting on them).
- Fixed transcoding to utf-8 for text/plain email attachments
- Improved mbox From_ line detection
- Added indexedmimetypes variables to allow restricting the list of indexed mime types.
- KDE kicker applet: start a recoll search from the panel and get a Recoll window. This is a clone from the find_applet, originally meant to start a Tracker search. Not so useful presently because it will start a new Recoll instance for every search. Not part of the main source (the configure script is a whopping 1MB...), linked from the download page.
- Added recoll command line options to define a query and execute it immediately when the program starts. This is used in practice from the applet and could be used from other programs. There is also a new option to not start the GUI and print the results to stdout.
1.9.0
- Incompatible change: the icon image reference is now part of the result list paragraph format string:
  - If you had a standard config, you need do nothing.
  - If you had a custom format string, you need to add the icon image reference at its beginning to get the same result as before.
  - If you had unchecked the "show icons" option, you need to remove the icon image reference from the paragraph format to make the icons go away.
  Changes to the format string are performed in the "Preferences->Query Configuration->User Interface" dialog tab.
- New filters: wordperfect, abiword and kword, rcljpeg, rclflac, rclogg (contributed filters). The jpeg and audio filters should be extended to make use of the new field indexing/search capability (hint :) )
- When searching for an empty string inside the preview window, position the window to the next occurrence of a primary search term.
- Added ext: and mime: selectors to the query language.
- Added an adjustable flush threshold during indexing: should help control memory usage. See the idxflushmb configuration variable.
- Added a check for file system free space. Indexing will stop if the threshold is reached. See the maxfsoccuppc configuration parameter.
- Added 'followLinks' configuration option to have the indexer follow symbolic links while walking the tree (the default is false).
- Allow symbolic links as 'topdirs' members. These are always followed.
- Add preference option to remember sort tool state between program invocations (it is reset to inactive by default)
- Added File menu entry to erase document history.
- Bound the space and backspace keys to PgUp/PgDown in preview.
- (Hopefully) Improved abstract (keyword in context) generation
- Added support for arbitrary fields. Filters can now produce any number of fields which will be selectively searchable through the query language. This could be useful, for example, for the mp3 and jpeg filters (but it is not currently used).
- Improved qt4 build: no more need for --enable-qt4. Note: the qt4 build still needs the qt3 support library.
- Changed the icon to an ugly one. The previous one was nicer but looked too much like Xapian's.
- Added some kind of support for a stopword list.
- Have email attachments inherit date and author from their parent message (instead of mail folder).
- Fix bus error on rclmon exit
- Better handling of aspell errors inside rclmon
- Fixed a number of qt4 glitches: selection and keyboard shortcuts.
- New query configuration parameter to set the maximum text size beyond which text won't be highlighted before preview (takes too much time). This was a fixed value in 1.8.
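The file system free space check added in this release (see maxfsoccuppc above) amounts to comparing the occupied percentage of the file system against a threshold. The following Python sketch shows the idea only; it is not Recoll's code. The function names are hypothetical, and treating a threshold of 0 as "no check" is an assumption of this sketch.

```python
import shutil


def occupancy_percent(total_bytes, free_bytes):
    """Percentage of the file system currently in use."""
    return 100.0 * (total_bytes - free_bytes) / total_bytes


def should_stop_indexing(path, max_occupancy_pc):
    """True when the file system holding path has reached the threshold.

    A threshold of 0 (or less) disables the check: an assumption of this sketch.
    """
    if max_occupancy_pc <= 0:
        return False
    usage = shutil.disk_usage(path)
    return occupancy_percent(usage.total, usage.free) >= max_occupancy_pc


# Example: stop indexing once the file system holding "." is 90% full.
print(should_stop_indexing(".", 90))
```

An indexer would call such a check periodically during a pass, for instance after each index flush, rather than once at startup.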
1.8.2 2007-05-19
- Fixed method name for compatibility with xapian 1.0.0
- Add .beagle to default list of skipped names (avoids indexing beagle document cache...)
- Fix configure.ac to use $libdir instead of /usr/lib
- Fix recollinstall to properly copy translations and pictures for qt4
1.8.1 2007-02-20
- Add a small query language with some field-based searches (author, title, etc.)
- Add wildcard handling everywhere. *, ?, [] can be used in any search. Warning: using a wild card at the left of a term can make for a very slow search.
- Allow skipping specific paths during indexing (in addition to file name patterns)
- Improved external index choice dialog, accessible from the top-level menu.
- Many small bugs fixed: stemming language choice ignored in term explorer, qt4 preview window reentrancy crashes, issues with saving the default advanced search file, type filter, display more clearly missing helper errors, etc.
- Option to use the desktop defaults (with xdg-open) to choose the native viewer for files (instead of recoll's mimeview).
1.7.6 2007-01-30
- Fixes an issue with the openoffice filter on debian systems.
- Adds Scribus and Lyx filters.
1.7.5 2007-01-15
- Fixes two email indexing bugs in 1.7.3, which would bail out from an mbox folder on the first attachment filtering error, and would decline to handle multipart/signed bodies. You may need to run a full indexing pass (recollindex -z), to force reindexing of old folders.
1.7.3 2007-01-09
- Email attachments are now indexed.
- Right-click menu option to access the parent document of an embedded result (ie from mail attachment to parent message), or the parent folder of a given file (which is opened with the application configured for directories)
- The sort tool has been improved: no need to restart the query after sort criteria change.
- Support for real-time indexing with inotify is now enabled by default when appropriate.
- Recoll now warns when the configured native viewer cannot be found and starts an interface for choosing another one.
- Categories (text, presentation, spreadsheets, etc.) can be used instead of raw mime types when filtering on file types in advanced search.
- The port to qt4 is functional and can be enabled with configure --enable-qt4
- 'autophrase' option improved and may now actually be useful.
- Improved highlighting (again...)
- Display term frequencies in term explorer.
- New recollindex -e option to remove data from the index for listed files.
- Directory names now indexed. Directories can be 'edited' with the configured application (rox by default)
1.6.3
- Fixed problem with bad detection of mbox message boundaries. Upgrading can change the message numbering in some cases, and you should perform a full index update (recollindex -z) after installing the new version.
- Fixed problem with execution of external viewer for files with single-quotes in the name.
1.6.2
- Minor solaris compilation glitches only.
1.6.1
- Term explorer: a multimode wildcard-regexp-spell/phonetic tool to search the index for terms. This uses aspell for the orthographic/phonetic part.
- A more dynamic advanced search window. You now have a choice of the top level conjunction (OR/AND) and of any number of clauses, including NEAR and PHRASE clauses with an adjustable proximity parameter.
- User-settable format for the result-list entries, which use an HTML string with %xx printf-like replacements (accessible from the user preferences).
- Real time monitoring/indexing support. This is not configured by default, and must be specified at build time (configure --help).
- Improved phrase/group highlighting in abstracts and preview
- Better sample selection for synthetic abstracts.
- Improved performance of the text splitter, good for indexing and previewing.
- Shift+click link to open new preview window instead of tab in existing window.
- The key sequence for term completion in the simple search entry was changed from CTRL+TAB to "Escape Space" to avoid interaction with window managers.
- Improved recall for phrases with composite words like email addresses.
Updating from 1.2 to 1.3 or 1.4 or 1.5:
--------------------------------------
From version 1.3 up, there is a new feature to search specifically for file names (with wildcard processing). If you want to take full advantage of this, you should perform a full reindex after installing the new version (ie: use recollindex -z, or delete ~/.recoll/xapiandb). Also, we now use the central copies of configuration files for default values, and the user ones only for overrides. Your old configuration files will still work, but you may want to remove them if they are unmodified, or keep only the modified parameters.
1.5.9
- Fix bad timezone conversion in email dates. Display timezone in result list dates.
1.5.8
- Fix stored and displayed dates which used to come from the file's ctime, now use mtime (which was already used for deciding re-indexing).
- Fix problem with some weird MIME messages (with null boundaries) which crashed the indexer.
1.5.6
- Small fixes dealing with the build process or compiler issues. 1.5.6 has updated Ukrainian and Russian messages. Otherwise no functional changes, and no need to upgrade from 1.5.1.
1.5.1
- Fix serious bug with non ascii strings in simple search history
- Improve synthetic abstracts: remove size limitations, handle overlapping extracts, avoid printing several terms from the same position.
1.5.0 2006-09-20
- Added support for powerpoint and excel files, with the catdoc package.
- Allow viewing consecutive documents from the result list inside a single preview window using the shift-arrow-up and shift-arrow-down keys.
- Colorize search terms in abstracts in the result list.
- A number of elements are now remembered between program invocations: sort criteria, list of ignored file types (always starts inactive), subtree restriction, better handling of the recent searches listbox, the buildAbstract and replaceAbstract settings are not forgotten any more.
- New option to automatically add a phrase to simple searches.
- Possibility to adjust the length and context width for synthetic abstracts.
- Handle weird html better.
- When indexing mail messages, walk the full mime tree instead of staying at the top level, index all text parts and attachment file names.
- Add -c option to recoll and recollindex to specify the configuration directory on the command line
- Better synchronization between the active preview and the highlighted paragraph inside the list
- Improved recall for some special cases of stemming.
- Much better handling of email dates, allowing better email sorting by date (previously the message date was quite often the date when the file was indexed).
- Store the external database lists in the configuration directory, not the qt preferences.
- Ensure dialogs are sized according to font size
1.4.3 2006-05-07
- Multiple search databases.
- Optionally auto-search when a word is entered in the simple search field.
- Show possible term completions in simple search by typing CTRL+TAB
- Add 'more like this' option to result list right-click menu, to look for documents related to the current result.
- Double-click in preview or result list adds the selected word to the simple search text field.
- The simple search text entry field is now a combobox and remembers previous searches.
- Additional OR field in complex search.
- Improved indexing cancellability (interrupting recollindex or closing recoll with an indexing thread active), and status reporting.
- Fixed filters to handle file paths with embedded spaces.
- Misc small bug and memory leak fixes.
- More compact result list.
- Set mode 0700 on .recoll directory by default
1.3.3 2006-04-04
- Implement specific search on file names with wildcard support. Indexation can optionally process all file names or only those with mime types supported for normal indexation. UPDATING: you need a full re-indexation to take advantage of this.
- Use links and a right-click popup menu to replace confusing use of mouse clicks and double-clicks inside the result list.
- The 'example' configuration files are now used as default, and are not copied any more to the user directory during installation. Overrides can be set in the personal files for any value that the user wishes to modify, with unchanged formats and file names (so that the files from previous versions remain valid, but you may wish to trim them of values that duplicate the central ones).
- Use NLS information (LC_CTYPE, LANG) to determine default charset when possible.
- Mp3 file indexing, either filenames only or also id3 tags if id3info is available. c/c++ ext edit. Use gnuclient instead of xemacs for text files.
- Russian and Ukrainian translations and many improvement ideas thanks to Michael Shigorin.
1.2.3 2006-03-03
- Added support for dvi (with dvips), and djvu (with DjVuLibre).
- Ensure that configure and make use the same qt version.
- Fix sorted sequence title display.
- Discriminate fatal errors and missing docs while loading a doc list.
- Improved and cleaned up way to position a preview on the first search term.
1.2.2 2006-02-02
- Fix minor compilation glitches (FreeBSD 4, QT 3.1, xapian-config problem)
1.2.0 2006-02-01
- Improved preview loading: don't highlight very big documents (over 1Mb), allow cancellation while loading.
- Abstracts generated in the result list by looking at search term contexts. This can slow down result list display for big documents, and can be turned off in the preferences menu.
- Wrap query detail line displayed when clicking on result list header.
- Text splitting cleanup with less spurious terms should result in slightly smaller databases.
- Slightly improved presentation in preview, esp. line breaks.
- Color icons...
- Let the user select the html browser used for help display.
- autoconf/Makefile change: allow building UI from inside the qtgui directory.
- autoconf/Makefile: improved search and diagnostics for qt/qmake.
- Internal code cleanup for maintainability: text splitting, user interface.
- Added prototype kio_slave to show result inside Konqueror, doesn't seem particularly useful.
1.1.0 2006-01-12
- A much better user manual, which can be browsed from the help menu.
- man pages for recoll, recollindex, recoll.conf
- User/query interface configuration dialog.
- Click on result list header will display the exact boolean search which was used.
- recollindex can be used to create stem expansion databases independently of a full indexing pass.
- Misc user interface improvements, like an 'all terms' checkbox for simple search.
- Fixed case-insensitivity issues. Probably needs more testing.
1.0.16 2006-01-05
- Minor installation tweaks for rpm compatibility
1.0.15
- Fix problems with prefix != /usr/local
- Remove '.*' from the default list of ignored file/dir names: this prevented mozilla/thunderbird mail indexing.
- Fix some 64-bit issues
1.0.14
- Small changes for FreeBSD 4 compilation.
1.0.13
- Install of recollinstall program not done or needed any more.
1.0.12
- Fixed nasty html parsing bug introduced in 1.0.9. Html parsing failed whenever the document charset name differed from the default only in character case or punctuation.
1.0.11
- Create personal configuration on first start.
- Use qt toolbars.
- Also index terms in file paths.
- Tool for sorting on dates or mime types.
- Fixed pdf filter which was broken by more recent xpdf
- Filters now installed/executed from /usr/local
1.0.10
- Added tool to manage the history of consulted documents.
- Try harder to convert email messages with wrongly declared charsets.
- Add option to reset the database before indexing (easier than rm -rf).
- Small gui improvements.
- Install partial French translation as a tease for future translators...
1.0.9
- Fixed 2 really annoying bugs in 1.0.8: wouldn't preview 2nd document from same file + spurious db close when filter could not be executed.
1.0.8
- Add support for rtf and gaim logs
- Optionally show icons to indicate mime types in result list
- Better (but imperfect) feedback during the preview loading for big files
- Remember main window geometry when closing
- Fix stem expansion in advanced search
- Some autoconf
- Option to use the system's 'file' command as a final step of identification for suffix-less or unknown files.
- Typo had removed support for .Z compression
- Use more appropriate conjunction operators when computing the advanced search query (OP_AND_MAYBE, OP_FILTER instead of OP_AND)