2001-05-14 16:03:20 +02:00
|
|
|
External converter script for ht://Dig (version 3.1.4 and later), that
|
|
|
|
converts Microsoft Word, Excel and Powerpoint files, and PDF,
|
|
|
|
PostScript, RTF, and WordPerfect files to text (in HTML form) so they
|
|
|
|
can be indexed. Uses a variety of conversion programs:
|
|
|
|
|
|
|
|
wp2html - to convert Wordperfect and Word7 & 97 documents to HTML
|
|
|
|
catdoc - to extract text from Word documents
|
2003-05-06 19:40:18 +02:00
|
|
|
rtf2html - to convert RTF documents to HTML
|
|
|
|
pdftotext - to extract text from Adobe PDFs
|
2001-05-14 16:03:20 +02:00
|
|
|
ps2ascii - to extract text from PostScript
|
|
|
|
pptHtml - to convert Powerpoint files to HTML
|
|
|
|
xlHtml - to convert Excel spreadsheets to HTML
|
|
|
|
or
|
2003-05-06 19:40:18 +02:00
|
|
|
xls2csv - to obtain data from Excel spreadsheets.
|
2001-05-14 16:03:20 +02:00
|
|
|
|
2003-05-06 19:40:18 +02:00
|
|
|
Written by David Adams (University of Southampton), and based on the
|
2001-05-14 16:03:20 +02:00
|
|
|
conv_doc.pl script by Gilles Detillieux.
|