Commit graph

26 commits

Author SHA1 Message Date
heinz
872a263420 Update to 3.31.
Better compatibility with Mozilla/MSIE behaviour.


==== Changes since 3.27 ====

2003-08-19   Gisle Aas <gisle@ActiveState.com>

     Release 3.31

     The -DDEBUGGING fix in 3.30 was not really there :-(

2003-08-17   Gisle Aas <gisle@ActiveState.com>

     Release 3.30

     The previous release failed to compile on a -DDEBUGGING perl
     like the one provided by Redhat 9.

     Got rid of references to perl-5.7.

     Further fixes to avoid warnings from Visual C.
     Patch by Steve Hay <steve.hay@uk.radan.com>.

2003-08-14   Gisle Aas <gisle@ActiveState.com>

     Release 3.29

     Setting xml_mode now implies strict_names also for end tags.

     Avoid warning from Visual C.  Patch by <gsar@activestate.com>.

     64-bit fix from Doug Larrick <doug@ties.org>
     http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=195500

     Try to parse similar to Mozilla/MSIE in certain edge cases.
     All these are outside of the official definition of HTML but
     HTML spam often tries to take advantage of these.

       - New configuration attribute 'strict_end'.  Unless enabled
         we will allow end tags to contain extra words or stuff
         that look like attributes before the '>'.  This means that
         tags like these:

            </foo foo="<ignored>">
            </foo ignored>
            </foo ">" ignored>

         are now all parsed as a 'foo' end tag instead of text.
         Even if the extra stuff looks like attributes they will not
         be reported if requested via the 'attr' or 'tokens' argspecs
         for the 'end' handler.

       - Parse '</:comment>' and '</ comment>' as comments unless
         strict_comment is enabled.  Previous versions of the parser
         would report these as text.  If these comments contain
         quoted words prefixed by space or '=' these words can
         contain '>' without terminating the comment.

       - Parse '<! "<>" foo>' as comment containing ' "<>" foo'.
         Previous versions of the parser would terminate the comment
         at the first '>' and report the rest as text.

       - Legacy comment mode:  Parse with comments terminated with a
         lone '>' if no '-->' is found before eof.

       - Incomplete tag at eof is reported as a 'comment' instead
         of 'text' unless strict_comment is enabled.

2003-04-16   Gisle Aas <gisle@ActiveState.com>

     Release 3.28

     When 'strict_comment' is off (which it is by default)
     treat anything that matches <!...> a comment.

     Should now be more efficient on threaded perls.
2003-08-25 00:00:08 +00:00
martti
33e2a02324 COMMENT should start with a capital letter. 2003-07-22 04:14:17 +00:00
grant
ca3be631f2 s/netbsd.org/NetBSD.org/ 2003-07-17 22:50:55 +00:00
martti
5bbada05c5 Updated p5-HTML-Parser to 3.27
* Typo fixes to the documentation
* Added HTML::Entities::escape_entities_numeric
2003-04-12 15:40:38 +00:00
seb
0f6263bb26 Use perl5/module.mk. 2002-10-20 15:05:09 +00:00
jlam
41ed4d01d8 buildlink1 -> buildlink2 2002-09-21 07:44:28 +00:00
wiz
750e7cddee Update to 3.26:
Avoid core dump in some cases where the callback croaks.
The perl_call_method and perl_call_sv needs G_EVAL flag
to be safe.

New parser attributes; 'attr_encoded' and 'case_sensitive'.
Contributed by Guy Albertelli II <guy@albertelli.com>.

HTML::Entities
    - don't encode \r by default as suggested by Sean M. Burke.

HTML::HeadParser
    - ignore empty http-equiv
    - allow multiple <link> elements.  Patch by
      Timur I. Bakeyev <timur@gnu.org>

Avoid warnings from bleadperl on the uentities test.
2002-07-01 12:12:01 +00:00
jlam
cc4128d97e Buildlinkify, in the sense that only the perl headers are found in
${PREFIX} -- everything else is pickup up from ${BUILDLINK_DIR}.
2001-11-26 06:49:36 +00:00
zuntum
431e7a7dda Move pkg/ files into package's toplevel directory 2001-11-01 02:15:23 +00:00
veego
63ad910dee SVR4 packages have a limit of 9 chars for a package name.
The automatic truncation in gensolpkg doesn't work for packages which
have the same package name for the first 5-6 chars.
e.g. amanda-server and amanda-client would be named amanda and amanda.
Now, we add a SVR4_PKGNAME and use amacl for amanda-client and amase for
amanda-server.
All svr4 packages also have a vendor tag, so we have to reserve some chars
for this tag, which is normaly 3 or 4 chars. Thats why we can only use 6
or 5 chars for SVR4_PKGNAME. I used 5 for all the packages, to give the
vendor tag enough room.
All p5-* packages and a few other packages have now a SVR4_PKGNAME.
2001-10-18 15:20:01 +00:00
nra
16055cce35 Update to HTML-Parser-3.25.
2001-05-11   Gisle Aas <gisle@ActiveState.com>

     Release 3.25

     Minor tweaks for build failures on perl5.004_04, perl-5.6.0,
     and for macro clash under Windows.

     Improved parsing of <plaintext>...  :-)
2001-06-21 17:30:39 +00:00
zuntum
64aa918370 Update p5-HTML-Parser to 3.24
Changelog at http://search.cpan.org/doc/GAAS/HTML-Parser-3.24/Changes

Fixes pkg/12896 by Nathan Ahlstrom <nrahlstr@winternet.com>
2001-05-14 13:40:39 +00:00
agc
8f972b049a + move the distfile digest/checksum value from files/md5 to distinfo
+ move the patch digest/checksum values from files/patch-sum to distinfo
2001-04-17 12:12:16 +00:00
drochner
17ac015854 update to 3.19 (3.13 isn't available on CPAN archives anymore)
Changes:
     If a handler triggered by flushing text at eof called the
     eof method then infinite recursion occurred.  Fixed.
     Bug discovered by Jonathan Stowe <gellyfish@gellyfish.com>.

     Allow <!doctype ...> to be parsed as declaration.

     HTML::TokeParser's get_tag() method now takes multiple
     tags to match.  Hopefully the documentation is also a bit clearer.

     #define PERL_NO_GET_CONTEXT: Should speed up things for thread
     enabled versions of perl.

     Quote some more entities that also happens to be perl keywords.
     This avoids warnings on perl-5.004.

     Unicode entities only triggered for perl-5.7.0 or higher.

     The unbroken_text option now works across ignored tags.

     Fix casting of pointers on some 64 bit platforms.

     Fix decoding of Unicode entities.  Only optionally available for
     perl-5.7.0 or better.

     Expose internal decode_entities() function at the Perl level.

     Reindented some code.

     The 3.16 release broke MULTIPLICITY builds.  Fixed.

     There was a C++ style comment left in util.c.  Strict C
     compilers do not like that kind of stuff.

     Avoid the entity2char global.  That should make the module
     more thread safe.   Patch by Gurusamy Sarathy <gsar@ActiveState.com>.
2001-03-12 17:32:53 +00:00
wiz
a0745845b3 Update to new COMMENT style: COMMENT var in Makefile instead of pkg/COMMENT. 2001-02-17 17:21:28 +00:00
wiz
baa07b823e regen 2000-10-19 11:29:06 +00:00
jlam
a9ac1ca26d Update p5-HTML-Parser to 3.13. Make me the maintainer. Changes from
version 3.11:

     Experimental support for decoding of Unicode entities.

     HTML::Entities documentation spelling.
2000-10-15 02:19:38 +00:00
jlam
62d5616f2d Add dependency on p5-HTML-Tagset. 2000-08-27 22:10:27 +00:00
jlam
80353f6221 Update p5-HTML-Parser to 3.11. Changes from version 3.10:
HTML::LinkExtor and eg/hrefsub now obtain %linkElements from
     the HTML::Tagset module.
2000-08-27 21:53:02 +00:00
jlam
18cba9d7fa Convert packages to use PERL5_PACKLIST (part 1). These were the easy
ones to do, and each compiled and installed/de-installed apparently
correctly.

As a side effect of the dynamic PLIST, we no longer need to have separate
-static and -shared PLISTs.  It's now easier than ever to make a perl5
package for NetBSD :)
2000-08-27 07:10:59 +00:00
mjl
695fbcd3cf Update to 3.10. Too many changes to include here, please
refer to the Changes file.
2000-07-03 09:17:55 +00:00
wiz
e133ff3249 changed PKGNAME to p5-${DISTNAME} 1999-12-28 12:49:56 +00:00
rh
c555298c4b Update p5-HTML-Parser to 2.25.
Changes include:
 * Allow ":" in attribute names as a workaround for Microsoft Excel
   2000 which generates such files.

 * Make depreciate warning if netscape_buggy_comment() method is
   used.  The method to use in strict_comment().

 * Avoid duplication of parse_file() method in HTML::HeadParser.

 * $p->parse_file() will not close a handle passed to it any more.
   If passed a filename that can't be opened it will return undef
   instead of raising an exception, and strings like "*STDIN" are not
   treated as globs any more.

 * HTML::LinkExtor knowns about background attribute of <tables>.
   Patch by Clinton Wong <clintdw@netcom.com>

 * HTML::TokeParser will parse large inline strings much faster now.
   The string holding the document must not be changed during parsing.

 * Documentation updates.
1999-12-01 16:59:01 +00:00
tron
23e0063766 Completely replace "MASTER_SITE_SUBDIR" and "PATCH_SITE_SUBDIR" with
variable substituition of "MASTER_SITES" and "PATCH_SITES".
1999-04-15 20:39:38 +00:00
agc
79e49eaba2 s/netbsd/${LOWER_OPSYS}/g 1999-04-08 18:51:48 +00:00
kim
da1348ca63 HTML::Parse for parsing HTML (SGML). Used to be part of libwww-perl. 1999-01-11 22:22:50 +00:00