Commit graph

8 commits

Author SHA1 Message Date
wiz
b133fd21b2 Update to 2.10:
2.10  Sat Jul 15 20:50:41 EDT 2006
        - minor bug fixed in HTML repair routines (thanks to Dave Gray)

2.09  Thu Jun  8 15:46:17 EDT 2006
        - Tweaked rasterizer to handle some situations where the HTML is
          broken but tables can still be inferred.
        - Fixed TREE() definition for situations where import() is
          not invoked. (thanks to DDICK on cpan.org)
2007-02-24 23:05:42 +00:00
gson
270f578c77 Update to 2.08:
2.08  Wed May  3 17:17:33 EDT 2006
        - Implemented new rasterizer for grid mapping. Thanks to Roland
          Schar for a tortuous example of span issues.
        - Regular extraction and TREE mode are using the same
          rasterizer now.
        - Fixed HTML stripping for a header matching bug on single word
          text in keep_html mode (thanks to Michael S. Muegel for
          pointing the bug out)

2.07  Sun Feb 19 13:40:44 EST 2006
        - Fixed subtable slicing bug
        - Fixed hrow() attachment bug
        - Added tests
2006-05-08 16:38:11 +00:00
wiz
2b8550401e Update to 2.06:
2.06  Tue Oct 18 13:13:52 EDT 2005
        - Tightened up element interactions in TREE() mode when examing
          rows, columns, cells, etc. Was running into trouble with
          dereferencing scalars vs objects.
        - Documented space() H::TE::T method, added tests
        - Added POD tests
        - Documentation updates and fixes

2.05  Tue Oct  4 16:00:02 EDT 2005
        - Fixed a TREE() definition bug and class method assignments
        - Fixed a 'row above header' bug, added tests

2.04  Wed Aug  3 14:42:23 EDT 2005
        - Fixed some conditional optional dependency tests in order to
          avoid falure assertions on some test boxes.

2.03  Wed Jul 20 12:45:56 EDT 2005
        - Fixed greedy attribute bug (non qualifying tables were being
          selected under certain circumstances)
        - Moved more completely to File::Spec operations in testload.pm
          in order to make windows boxes happy.

2.02  Thu Jun 23 12:42:44 EDT 2005
        - squelched TREE() creation warnings for subclasses
        - fixed a rows() bug involving keep_headers

2.01  Tue Jun 21 22:05:53 EDT 2005
        - fixed some test changes

2.00  Fri Jun 17 17:28:10 EDT 2005
        - Can now return parsed tables as HTML::TableElement objects
          within an HTML::Element tree structure (via HTML::TreeBuilder)
          for such purposes as in-line editing of table content within
          documents. Invoked via 'use HTML::TableExtract qw(tree);'.
        - Added columns(), row(), column(), and cell() methods.
        - Added some handy reporting methods: tables_report() and
          tables_dump(). These are almost always handy while first
          analyzing a new HTML document for table content.
        - Debugging and error output can now be assigned to arbitrary
          file handles.
        ! Old 'table_state' methods are now merely 'table' methods,
          though the old table_state style is still supported.
        ! Chains have been dropped. Though interesting (think xpath),
          they needlessly complicated matters as they were nearly
          universally unused.
2005-10-25 13:40:24 +00:00
chris
8b88d7c065 Update HTML-TableExtract to v1.10. v1.09 appears to have disappeared from
cpan mirrors.

Changes:
1.10  Mon Mar  7 10:45:54 EST 2005
        - Minor fix to resets during calls to parse(). Resets must be
          requested manually.
        - Docs update to reflect impending arrival of 2.x series.
2005-03-24 18:01:52 +00:00
wiz
e28945f61b Update to 1.09:
1.09  Fri Feb 25 17:49:00 EST 2005
        - Tables can now be selected by table tag attributes
        - lineage() method now returns row and column information, as
          well as depth and count, for each ancestor (potential
          backwards incompatability, entries are now 4 element arrays
          now rather than 2)
        - header matching and column retention enhancements
        - header retention
        - old-style procedures deprecated in prepration for them to
          become methods
        - various bug fixes
2005-02-27 13:24:13 +00:00
wiz
809ad6f2f7 Add RMD160 checksums. 2005-02-24 14:08:26 +00:00
jmmv
ef84b5e257 Update to 1.08:
1.08  Thu Apr  4 11:26:27 CST 2002
	- Added some more crufty HTML tolerance -- not
	  PC (puristicly correct) but HTML correctness
	  is probably of no interest to those merely
	  trying to extract information *out* of HTML.
	- Fixed a mapback problem with the legacy methods

1.07  Wed Aug 22 06:14:24 CDT 2001
	- Added keep_html option for HTML retention
	- bug fix for depth/count targets
2003-07-25 12:30:53 +00:00
damon
805ad950ee Initial Version 2001-10-30 13:34:47 +00:00