Commit graph

11 commits

Author SHA1 Message Date
minskim
5ad246fc2a Update to 1.3.2.
Changes:
  - Added command line option '-ascii' for straight ascii output
  (instead of ISO-8859-1, which is the default). (html2text.C, sgml.C)
  - Implemented rendering of most SGML entities introduced in
  HTML-4. As a limitation, those entities not present in
  HTML-3.2/ISO-8859-1 will be recognized only if represented as "named
  entities" and not with their numeric values, e.g. "&trade;" will be
  rendered, "&#8482;" won't. (html2text.C, sgml.C)
  - Element closing as done in XHTML (e.g. "<br />") is now tolerated.
  (HTMLControl.C)
  - The program now ignores the content of <SCRIPT> or <STYLE> elements
  within TABLEs, even if it is not commented out. (HTMLParser.y)
  - Fixed urlistream.h (fd_ might be uninitialized). (urlistream.h)
2004-02-13 14:43:18 +00:00
jschauma
3a2379e7ac Make this compile with gcc-3.3. Fixes problem seen in agc's latest bulk build. 2004-01-11 20:31:44 +00:00
jschauma
ff0a12552e Update to version 1.3.1, closing PR pkg/21342. This version is now
licensed under the GPL, has a new home and the following changes:

Multiple-line DOCTYPE declarations are now accepted (HTMLControl.C:352).
------------------------------
Bad initialisation of "fd_" in "urlistream" fixed (urlistream.h), so that an
error message is printed if a remote document could not be retrieved.
------------------------------
A missing node now is interpreted as node '/' (urlistream.C).
------------------------------
Closing DIV, FORM and BLOCKQUOTE tags as well as the closing UL, OL and PRE
tags are now optional (HTMLParser.y), so that the program won't die any longer
if one of them is omitted in the document.
------------------------------
Some difficulties with non-ASCII chars fixed (HTMLControl.C).
------------------------------
Colons in elements and attributes now are tolerated and elements are not any
longer implicitly closed at a newline (HTMLControl.C), in order to handle
MS-Word's HTML better.
------------------------------
We do not use '/dev/stdin' as file descriptor any longer (html2text.C and
urlistream.C), in order to make reading from STDIN finally work.
------------------------------
Block elements are not enclosed in -implicit- Paragraphs any longer
(HTMLParser.y), in order to avoid superfluous newlines in the output.
------------------------------
Fixed segmentation fault on tables with "border" attribute: we now assume that
any TABLE has at least one row and one column (table.C).
------------------------------
Fixed format.C to avoid an excessive increase in runtime when parsing deeply
nested block elements.
------------------------------
Some changes in configure, Makefile.in and the documentation.
------------------------------

------------------------------
Ported to g++ version 3.0.
This uses the 'istream.h' header file from the g++3's 'backward' directory.
------------------------------
Bugfix: '-' did not work as synonym for STDIN.
------------------------------
Added support for the EURO-sign (well, almost).
------------------------------
Finally the GNU GPL as new copyright terms for all parts of the program, after
GMRS agreed to change the program's license terms to it.
------------------------------

------------------------------
Fixed coredump when parsing empty <SCRIPT> or <STYLE> elements (HTMLControl.C).
------------------------------
New image handling: <IMG alt=""> no longer returns the value of the SRC
attribute nor "[]". Added new ~/.html2textrc options: IMG.replace.{all noalt}
and IMG.alt.{prefix suffix} with new defaults in pretty-style mode. Added
method for checking whether an attribute was set, even to a zero-value.
------------------------------
(New) Copyright terms for all changes we made since version 1.2.2.
------------------------------
Some minor changes in configure and html2text.C.
------------------------------
Updated the documentation.
------------------------------


------------------------------
Fixed segmentation fault when parsing <H5> tags (typo in format.C).
------------------------------
2003-07-31 02:49:51 +00:00
grant
ca3be631f2 s/netbsd.org/NetBSD.org/ 2003-07-17 22:50:55 +00:00
zuntum
d038a73ebd Move pkg/ files into package's toplevel directory 2001-10-31 22:52:58 +00:00
skrll
88af130dfa Move to sha1 digests, and add distfile sizes. 2001-04-20 15:04:54 +00:00
agc
bbc67fac91 + move the distfile digest/checksum value from files/md5 to distinfo
+ move the patch digest/checksum values from files/patch-sum to distinfo
2001-04-17 11:53:33 +00:00
wiz
ad02d69ab2 Update to new COMMENT style: COMMENT var in Makefile instead of pkg/COMMENT. 2001-02-17 17:31:58 +00:00
rh
0630b58a79 Update html2text to 1.2.2. Changes are bugfixes only. 2000-04-19 05:44:56 +00:00
abs
df05aef71f Strip trailing '.', and/or leading '(a|an) ' 2000-01-05 15:37:50 +00:00
rh
393e51a3f7 Initial import of html2text-1.2.1, an advanced HTML to text converter. 1999-12-28 10:42:20 +00:00