|
a/website/features.html |
|
b/website/features.html |
|
... |
|
... |
85 |
<p>Qt versions from 3.1 to 4.7</p>
|
85 |
<p>Qt versions from 3.1 to 4.7</p>
|
86 |
|
86 |
|
87 |
<h2><a name="doctypes">Document types</a></h2>
|
87 |
<h2><a name="doctypes">Document types</a></h2>
|
88 |
|
88 |
|
89 |
<p>Recoll can index many document types (along with their
|
89 |
<p>Recoll can index many document types (along with their
|
90 |
compressed versions). Some types are handled internally (no
|
90 |
compressed versions). Some types are handled internally (no
|
91 |
external application needed). Other types need a separate
|
91 |
external application needed). Other types need a separate
|
92 |
application to be installed to extract the text. Types that
|
92 |
application to be installed to extract the text. Types that
|
93 |
only need very common utilities (awk/sed/groff etc.) are
|
93 |
only need very common utilities (awk/sed/groff/Python etc.)
|
94 |
listed in the native section.</p>
|
94 |
are listed in the native section.</p>
|
95 |
|
95 |
|
96 |
<h4>File types indexed natively</h4>
|
96 |
<h4>File types indexed natively</h4>
|
97 |
|
97 |
|
98 |
<ul>
|
98 |
<ul>
|
99 |
<li><span class="literal">text</span>.</li>
|
99 |
<li><span class="literal">text</span>.</li>
|
100 |
|
100 |
|
101 |
<li><span class="literal">html</span>.</li>
|
101 |
<li><span class="literal">html</span>.</li>
|
102 |
|
102 |
|
103 |
<li><span class="literal">maildir</span> and <span class=
|
103 |
<li><span class="literal">maildir</span> and
|
104 |
"literal">mailbox</span> (<span class=
|
104 |
<span class="literal">mailbox</span> (
|
105 |
"literal">Mozilla</span>, <span class=
|
105 |
<span class="literal">Mozilla</span>,
|
106 |
"literal">Thunderbird</span> and <span class=
|
106 |
<span class="literal">Thunderbird</span> and
|
107 |
"literal">Evolution</span> mail ok).</li>
|
107 |
<span class="literal">Evolution</span>mail ok).
|
|
|
108 |
</li>
|
108 |
|
109 |
|
109 |
<li><span class="literal">gaim</span> and <span class=
|
110 |
<li><span class="literal">gaim</span> and
|
110 |
"literal">purple</span> log files.</li>
|
111 |
<span class="literal">purple</span> log files.</li>
|
111 |
|
112 |
|
112 |
<li><span class="literal">Lyx</span> files (needs <span
|
113 |
<li><span class="literal">Lyx</span> files (needs
|
113 |
class="literal">Lyx</span> to be installed).</li>
|
114 |
<span class="literal">Lyx</span> to be installed).</li>
|
114 |
|
115 |
|
115 |
<li><span class="literal">Scribus</span> files.</li>
|
116 |
<li><span class="literal">Scribus</span> files.</li>
|
116 |
|
117 |
|
117 |
<li><span class="literal">Man pages</span> (need <span
|
118 |
<li><span class="literal">Man pages</span> (needs
|
118 |
class="command">groff</span>).</li>
|
119 |
<span class="command">groff</span>).</li>
|
|
|
120 |
|
|
|
121 |
<li><span class="literal">Dia</span> diagrams.</li>
|
119 |
</ul>
|
122 |
</ul>
|
120 |
|
123 |
|
121 |
<h4>File types indexed with external helpers</h4>
|
124 |
<h4>File types indexed with external helpers</h4>
|
122 |
|
125 |
|
123 |
<p>Many document types need the <span class="command">iconv</span>
|
126 |
<p>Many document types need the <span class="command">iconv</span>
|
124 |
command in addition to the applications specifically listed.</p>
|
127 |
command in addition to the applications specifically listed.</p>
|
125 |
|
128 |
|
126 |
<h5>The XML ones</h5>
|
129 |
<h5>The XML ones</h5>
|
|
|
130 |
|
127 |
<p>The following types need <span class=
|
131 |
<p>The following types need <span class="command">
|
128 |
"command">xsltproc</span> from the <b>libxslt</b> package.
|
132 |
xsltproc</span> from the <b>libxslt</b> package.
|
129 |
Quite a few also need <span class="command">unzip</span>:</p>
|
133 |
Quite a few also need <span class="command">unzip</span>:</p>
|
130 |
|
134 |
|
131 |
<ul>
|
135 |
<ul>
|
132 |
<li><span class="literal">Abiword</span> files.</li>
|
136 |
<li><span class="literal">Abiword</span> files.</li>
|
133 |
|
137 |
|
134 |
<li><span class="literal">Fb2</span> ebooks.</li>
|
138 |
<li><span class="literal">Fb2</span> ebooks.</li>
|
|
... |
|
... |
145 |
<li><span class="literal">Okular</span> annotations files.</li>
|
149 |
<li><span class="literal">Okular</span> annotations files.</li>
|
146 |
|
150 |
|
147 |
</ul>
|
151 |
</ul>
|
148 |
|
152 |
|
149 |
<h5>Other formats</h5>
|
153 |
<h5>Other formats</h5>
|
|
|
154 |
|
|
|
155 |
<p>The following need miscellaneous helper programs to decode
|
|
|
156 |
the internal formats.</p>
|
150 |
|
157 |
|
151 |
<ul>
|
158 |
<ul>
|
152 |
<li><span class="literal">pdf</span> with the <span class=
|
159 |
<li><span class="literal">pdf</span> with the <span class=
|
153 |
"command">pdftotext</span> command, which can be installed
|
160 |
"command">pdftotext</span> command, which can be installed
|
154 |
as part of <a href="http://www.foolabs.com/xpdf/">xpdf</a>
|
161 |
as part of <a href="http://www.foolabs.com/xpdf/">xpdf</a>
|
|
... |
|
... |
164 |
<li><span class="literal">Powerpoint</span> and <span
|
171 |
<li><span class="literal">Powerpoint</span> and <span
|
165 |
class="literal">Excel</span> with the <a href=
|
172 |
class="literal">Excel</span> with the <a href=
|
166 |
"http://vitus.wagner.pp.ru/software/catdoc/">catdoc</a> utilities.</li>
|
173 |
"http://vitus.wagner.pp.ru/software/catdoc/">catdoc</a> utilities.</li>
|
167 |
|
174 |
|
168 |
<li><span class="literal">CHM (Microsoft help)</span> files
|
175 |
<li><span class="literal">CHM (Microsoft help)</span> files
|
|
|
176 |
with <span class="command">Python,
|
169 |
with <span class="command">Python, <a href="http://gnochm.sourceforge.net/pychm.html">pychm</a>
|
177 |
<a href="http://gnochm.sourceforge.net/pychm.html">pychm</a>
|
170 |
and <a href="http://www.jedrea.com/chmlib/">chmlib</a></span>.</li>
|
178 |
and <a href="http://www.jedrea.com/chmlib/">chmlib</a></span>.</li>
|
171 |
|
179 |
|
172 |
<li><span class="literal">GNU info</span> files
|
180 |
<li><span class="literal">GNU info</span> files
|
173 |
with <span class="command">Python</span> and the
|
181 |
with <span class="command">Python</span> and the
|
174 |
<span class="command">info</span> command.</li>
|
182 |
<span class="command">info</span> command.</li>
|