www/p5-WWW-Mechanize from 1.71 to 1.72.
pkgsrc changes:
- adjust dependencies
- update some required modules first (and hopefully don't introduced
circular dependencies)
Upstream changes:
1.72 Thu Feb 2 18:37:28 EST 2012
========================================
[DEPENDENCIES]
Bumped the HTML::Form dependency to fix failures on CentOS 5
1.71 Tue Nov 14 13:50:41 EDT 2011
========================================
[ENHANCEMENTS]
Recognise application/xhtml+xml as HTML.
[DOCUMENTATION]
Improved docs about support of JavaScript
Typo fixes.
[TESTS]
Updated tests as oops-music.com is in utf-8 now
Changes from previous:
1.70 Fri Aug 26 13:46:30 EDT 2011
========================================
[ENHANCEMENTS]
Mech now defaults to _not_ running live tests by default.
You can still enable them by running "perl Makefile.PL --live"
Thanks to RJBS for the suggestion
1.69_01
========================================
[INTERNALS]
The test suite for the local tests was updated
1.68 Fri Apr 22 01:10:40 EST 2011
========================================
No changes from 1.67_01
1.67_01
========================================
[ANNOUNCE]
As of this release, Jesse Vincent has taken over maintenance of
WWW-Mechanize. The project's repository can be found at:
https://github.com/bestpractical/www-mechanize
[FIXED]
Added prereq for HTML::TreeBuilder.
1.66 Fri Sep 10 16:25:44 CDT 2010
========================================
[FIXED]
Fixed prerequisites on HTTP::Server::Simple on Windows.
DNS checks in t/autocheck.t and t/local/failure.t improved. Thanks,
Schwern.
[ENHANCEMENTS]
New $mech->text method returns the text from your HTML page. The
exact rendering of this text is simply removing all the HTML
tags, but this will change. It's pretty ugly. If anyone wants to
work on a better-looking text dump, I'd love to see it.
Added mech-dump --text.
[DOCUMENTATION]
Improvements to the docs explaining explicitly about the subclassed
methods we inherit from LWP::UserAgent. Thanks, Lyle Hopkins!
to trigger/signal a rebuild for the transition 5.10.1 -> 5.12.1.
The list of packages is computed by finding all packages which end
up having either of PERL5_USE_PACKLIST, BUILDLINK_API_DEPENDS.perl,
or PERL5_PACKLIST defined in their make setup (tested via
"make show-vars VARNAMES=..."), minus the packages updated after
the perl package update.
sno@ was right after all, obache@ kindly asked and he@ led the
way. Thanks!
Upstream changes:
1.64 Thu Jul 1 10:41:00 CDT 2010
========================================
[THINGS THAT MAY BREAK YOUR CODE]
If you've been accessing $mech->{forms} or $mech->{form} values
directly, instead of going through the $mech->forms or $mech->current_form
accessors, respectively, then this version of Mech will break your
code.
[ENHANCEMENTS]
Parsing of forms has been delayed until they're actually needed.
If don't use forms on a page, you'll no longer waste time and memory
parsing them.
$mech->title now caches the title of the page after parsing the
page to find it.
mech-dump now takes a --cookie-file parameter for keeping cookies
between calls.
[DOCUMENTATION]
Typo fixes.
Upstream changes:
NEXT Sat Apr 10 23:10:07 CDT 2010
========================================
[FIXED]
Fixed a declaration in the Movable Type example in
WWW::Mechanize::Examples.
Quiet warnings if %ENV has undef values.
$mech->follow_link() no longer dies with an inappropriate error if
the link is not found.
$mech->click_button() now checks to see if a form is selected.
[INCOMPATIBILITIES]
$mech->form_name() and $mech->form_number() no longer throw warnings
if they can't find the form specified. They still return undef,
though.
[DOCUMENTATION]
More additions to the FAQ.
Upstream changes:
1.60 Mon Aug 17 00:41:39 CDT 2009
========================================
No new features. Exists only to skip tests that always fail on
Windows.
Fixed up some minor documentation problems.
- Updating package for p5 module WWW::Mechanize from 1.54 to 1.58
- Adjusting license and dependencies according to META.yml and Makefile.PL
Upstream changes:
1.58 Mon Jul 13 22:32:23 CDT 2009
========================================
No new features. If you have 1.56 installed OK, you do NOT need
to install 1.58.
[FIXES]
Removed prereq of HTTP::Response::Encoding, even though it was never
used. Thanks for the catch, Gisle.
1.56 Thu Jul 9 00:36:54 CDT 2009
========================================
[THINGS THAT MAY BREAK YOUR CODE]
For a while, Mech used HTTP::Response::Encoding to try to suss out
the proper encoding of the page it receives. Now, it lets
LWP::UserAgent do the work, and no longer requires
HTTP::Response::Encoding.
[ENHANCEMENTS]
Added a new dump_headers() method to dump the HTTP response headers.
Added --headers flag to mech-dump to dump the HTTP response headers.
[FIXES]
Now requires LWP version 5.829 because HTTP::Response has memory
cycle bugs.
[DOCUMENTATION]
Added a few notes to the FAQ, and fixed some incorrect docs.
1.55_01 Mon Jul 6 12:17:10 CDT 2009
========================================
This is mostly a bug fix release. There will be a number of other
bug fix releases in the next few days.
[FIXED]
New test server now randomizes the port it runs on.
t/cookies.t should not hang on Windows any more.
META.yml has been updated so the search.cpan.org links should be
correct.
Passing no_proxy would make LWP::UserAgent barf. Thanks to Mike
Schilli for the fix.
Cookies test would fail under Windows. Fixed, thanks to many people
reporting it.
[ENHANCEMENTS]
$mech->submit_form() now can specify the form by ID using the form_id
parameter.
[DOCUMENTATION]
The docs used to say that ->stack_depth(0) was an infinite stack
size. This is wrong. Zero will tell Mech not to keep any history.
- Updating package for p5 module of WWW::Mechanize from 1.52 to 1.54
- Adding some test dependencies
- Setting license to gnu-gpl-v2
Upstream changes:
1.54 Mon Jan 12 00:36:08 CST 2009
========================================
[FIXED]
Removed the computers4sure test that was failing.
Pkgsrc changes:
o Changed MAINTAINER to pkgsrc-users@, hope that's ok
o Adjusted dependencies according to new requirements
Upstream changes:
1.52 Tue Nov 25 09:52:30 CST 2008
========================================
[FIXED]
Improved some error messages in $mech->submit_form(). Thanks to
Norbert Buchmuller.
1.51_03 Thu Nov 20 11:05:49 CST 2008
========================================
[FIXED]
The $mech->clone() method was not passing the cookie jar to its
clone properly. Thanks to David Sainty.
The $mech->back() can fail if there's nothing on the stack to go
back to. Thanks to Dave Page.
$mech->follow_link() did not complain if a link could not be found,
even with autocheck on. Now it does. Thanks, Flavio Poletti.
[ENHANCEMENTS]
Added a $mech->form_id() method so you can look up forms by ID.
Added $mech->content_type(), because $mech->ct() is too cryptic.
1.51_02 Tue Nov 18 01:30:54 CST 2008
========================================
[STILL BROKEN]
t/local/click_button.t is still failing its tests for calling ->click
on an HTML::Form object. I suspect this is an LWP change, but I
haven't dug into it enough yet.
[FIXES]
Fixed the bad credentials API that stomped on LWP::UserAgent's
credentials() method. Thanks to Max Maschien and Matt Lawrence.
The $mech->links method now finds <link href="..."> links. Thanks
to H.Merijn Brand.
Makefile.PL explicitly requires Perl 5.8.0.
URI.pm has to be version 1.36 or else URIs don't get encoded
correctly.
LWP has to be 5.819 or we have encoding problems.
1.51_01 Thu Nov 6 15:13:03 CST 2008
========================================
[FIXES]
Page history is now working much better. The $mech->back() method
should behave more like a browser now. Most notably, it no longer
restores the cookie state, just like your browser doesn't restore
cookie state when you page back. It also should use much less
memory.
1.50 Sun Sun Oct 26 22:42:46 CDT 2008
========================================
[THINGS THAT MAY BREAK YOUR CODE]
WWW::Mechanize now requires version 5.815 of LWP. This in itself
may cause problems for you because of changes in how LWP does
authentication.
1.49_01 Sat Sep 27 23:50:04 CDT 2008
========================================
[THINGS THAT MAY BREAK YOUR CODE]
The autocheck argument to the constructor is now ON by default,
unless WWW::Mechanize is being subclassed. There are so many new
programmers whose ->get() calls fail unchecked that I'm now putting
on the seat belts for them.
[FIXES]
I do believe we are on the way to having all the encoding problems
ironed out. This version incorporates a patch from here:
http://code.google.com/p/www-mechanize/issues/detail?id=61
and tests from Miyagawa's WWW::Mechanize::DecodedContent
http://search.cpan.org/dist/WWW-Mechanize-DecodedContent/
to finally fix this.
[ENHANCEMENTS]
You can now specify not to set up the proxy, if there is one. The
proxy causes problems for Crypt::SSLeay. For details see:
http://code.google.com/p/www-mechanize/issues/detail?id=39
[DOCUMENTATION]
Fixed internal links.
[INTERNALS]
Lots of refactoring based on Schwern's "Skimmable Code" talk.
http://use.perl.org/~schwern/journal/36704http://schwern.org/~schwern/talks/Skimmable%20Code%20-%20YAPC-NA-2008.pdf
to trigger/signal a rebuild for the transition 5.8.8 -> 5.10.0.
The list of packages is computed by finding all packages which end
up having either of PERL5_USE_PACKLIST, BUILDLINK_API_DEPENDS.perl,
or PERL5_PACKLIST defined in their make setup (tested via
"make show-vars VARNAMES=...").
1.34 Mon Dec 10 00:30:39 CST 2007
========================================
[FIXES]
Many fixes to make the test suite more portable.
1.32 Tue Oct 30 12:02:17 CDT 2007
========================================
[ENHANCEMENTS]
Added dump methods to mirror mech-dump:
* $mech->dump_images()
* $mech->dump_links()
* $mech->dump_forms()
* $mech->dump_all()
Sanity checks in the WWW::Mechanize::Image constructor. Every Image
must have a "url" and "tag" field passed in to it.
1.31_02 Thu Oct 25 11:48:29 CDT 2007
========================================
[ENHANCEMENTS]
Added class, class_regex, id and id_regex limiters to find_link()
and find_all_links(). Thanks to Adriano Ferreira.
1.31_01 Mon Sep 17 23:38:03 CDT 2007
========================================
[FIXES]
Mech tests now pass even if your DNS server gives A records for
anything (like OpenDNS). Thanks, Miyagawa!
Searching for the <base href> is now case-inensitive. A better
solution would be to actually parse the HTML.
[ENHANCEMENTS]
mech-dump now handles --user and --password arguments for sites
that require authentication.
Changes since version 1.24
==========================
1.30 Thu May 24 21:31:10 CDT 2007
========================================
[DOCUMENTATION]
* Minor doc fixes. Thanks David Steinbrunner.
1.29_01 Tue May 22 14:02:55 CDT 2007
========================================
Kevin Falcone and I ask for your assistance in figuring out how to
handle the warnings thrown by the tests, other than hiding them.
[FIXES]
* Overhauled how tainting was done. Stole code directly from
Test::Taint.
* Have LWP only handle decoding of Content-Encoding, not charset.
[DOCUMENTATION]
* Fixed the docs for $mech->submit_form()'s with_fields arg.
Thanks, Peteris Krumins.
1.26 Wed May 16 14:21:29 CDT 2007
========================================
[FIXES]
* Re-reversed the content decoding. This is critical for reading from
sites with gzip on the fly, like Wikipedia.
* Content is now properly tainted.
[ENHANCEMENTS]
* mech-dump can now pass --agent and --agent-alias flags so you can
fetch from sites like Wikipedia that block LWP user agents.
[INSTALLATION]
* The mech-dump program is now always installed. It no longer is
presented as an option.
1.24 Fri May 11 15:57:56 CDT 2007
========================================
[FIXES]
* Fixed failures in "make test" with some versions of HTTP::Server::Simple
* RT #26593: Improved handling of charsets. Thanks Kevin Falcone.
* RT #24354: find_link now handles http-equivs with quoted URLs.
* Reverses the change in 1.21_01 where it decodes the content.
[ENHANCEMENTS]
* Added find_all_inputs() and find_all_submits() methods. Thanks,
Mike O'Regan.
* Test::LongString is no longer needed, so has been removed as a
requirement.
[TESTS]
* Added a test for save_content()
1.22 Fri Mar 2 00:05:57 CST 2007
[INTERNALS]
Added new tests.
Added Perl::Critic changes and a perlcriticrc file.
1.21_04 Sat Oct 7 21:35:42 CDT 2006
[FIXES]
* $mech->content( type => 'text' ) was not freeing memory. Thanks to
Cat Okita for finding it.
[INTERNALS]
* Made the order of parms to $mech->content() not relevant.
1.21_03 Sat Oct 7 01:21:46 CDT 2006
[THINGS THAT MAY BREAK YOUR CODE]
* The methods $mech->form() and $mech->follow() have been removed.
They've been deprecated since 1.10, which was released in Feb 2005.
[ENHANCEMENTS]
* I'm trying to nail down what seems to be a memory leak on long-running
Mech programs. I'm stringifying URI::URL objects wherever I can.
[INTERNALS]
* No longer uses UNIVERSAL.
1.21_02 Wed Oct 4 13:14:30 CDT 2006
[ENHANCEMENTS THAT MAY BREAK YOUR CODE]
* The $mech->stack_depth() setting had no way to say "don't cache any
pages at all". How silly!
Now, if you set $mech->stack_depth(0), no history of pages will be kept.
In the past, it would mean "Keep all pages." This means that if you want
to set it to keep all pages, set it to some ridiculously large number.
[DOCUMENTATION]
* The docs previously refered to Compress::Gzip instead of Compress::Zlib.
1.21_01 Mon Sep 18 17:18:43 CDT 2006
[ENHANCEMENTS]
* If Compress::Zlib is installed, gzipped content is now
accepted and transparently decoded. No additional syntax needed!
This should save time and bandwidth in a number of cases.
(Mark Stosberg)
* Added a put() method. It also calls a subfunction called
_SUPER_put that will be removed once LWP::UserAgent supports put().
1.20
[ENHANCEMENTS]
* Added new two-argument form of credentials() method.
$mech->credentials($username, $password);
That provides simpler visiting of password-protected
resources in the vast majority of cases and still
allows the other cases to be supported. (Peter Scott)
[BUG FIXES]
* autocheck no longer is triggered when informational
responses are returned. (Mark Stosberg)
[INTERNALS]
* test suite no longer fails when Test::Warn is missing.
(CPAN testers, Mark Stosberg)
* Removed all the testing against live sites. The networking
code is not actually in Mech anway, and they were prone to
breaking, as the live sites changed. (Mark Stosberg)
1.19_02 Mon Aug 7 23:57:56 CDT 2006
[ENHANCEMENTS]
* Add new Do-What-I-Mean submit_form() option.
$mech->submit_form(
with_fields => \%data
);
That expresses that you want to select the first form contains all
fields in \%data, and then submit the data to that form. See the docs
for form_with_fields() and submit_form() for details.
(Mark Stosberg, inspired by RT#6100)
[BUG FIXES]
* The behavior of clone() now copies over the cookie jar, which
is probably what you expected it did in the first place.
This fixes bug RT#13541 filed against Test::WWW::Mechanize,
which was using clone() internally. (Mark Stosberg)
* The correct URL is returned after redirecting. This a regression
from 1.04 and was reported as RT#9059, RT#12882, and RT#12786.
The documentation about this has also been clarified that we
return a URI object, but that it stringifies to the URI itself.
[DOCUMENTATION]
* Fixed a misleading parm in the constructor.
* Document the return value of set_visible (RT#6071, MJD,
Mark Stosberg)
* Document that form_name and form_number return an HTML::Form
object (Mark Stosberg)
[INTERNALS]
* Made lots of little cleanups based on Perl::Critic
* Fix Taint-mode warnings with Perl 5.6.1 (RT#16945)
1.18 Thu Feb 2 00:11:26 CST 2006
[TESTS]
* Makefile.PL now takes four new parms:
* --live/nolive turns on/off the live tests
* --local/nolocal turns on/off the local tests
* --mech-dump/nomech-dump installs/doesn't the mech-dump program
* --all turns on all tests and installs mech-dump
* Fixed some failures in tests. Non-existent URLs now have a
"." postpended to them, so if someone's got a search domain
with a wildcard (i.e. ignore.us) it'll ignore that. Also,
Google's second link is now a https:// link, which some Mechs
can't handle. Added a 'url_regex' which now makes it look at
the second non-https link. Thanks to Pete Krawczyk.
1.16 Fri Oct 28 17:34:20 CDT 2005
[ENHANCEMENTS]
* Sped up Mech significantly (~20% in some cases). Images and
links are extracted from the HTML, and objects are created,
only when they're actually needed. This will be a speedup for
pages where you're only following links, or vice versa.
[THINGS THAT MAY BREAK YOUR CODE]
* If you've been relying on the $mech->{images} and $mech->{links}
fields being populated so that you can bypass the $mech->images()
and $mech->links() accessors, your code will break. That's OK,
because you should have been using the accessors all along.
1.14 Tue Aug 30 17:17:40 CDT 2005
[DOCUMENTATION]
* Added lots of new FAQs. Thanks to Peter Stevens.
[INTERNALS]
* Now requires Test::LongString. That's not too odious.
[FIXES]
* Tests now pass with the shuffling around that Google did.
1.13_01 Tue Apr 12 14:11:18 CDT 2005
[ENHANCEMENTS]
* Now dies if you call submit_form() with a non-existsing
form_number or form_name. Before, it would just warn.
[DOCUMENTATION]
* Added an example of using credentials() in the cookbook.
1.12 Thu Feb 24 23:38:44 CST 2005
[FIXES]
* Fixed RT #9026: hang in t/local/back.t under Windows XP.
Thanks Andrew Savige. It also should no longer complain
about being unable to clean up a temp file.
1.11_01 Mon Feb 14 00:12:48 CST 2005
[THINGS THAT MAY BREAK YOUR CODE]
* Removed deprecated _parse_html() method.
[FIXES]
* Was incorrectly looking for INPUT tags TYPE="SUBMIT" as images.
Thanks to Abe Timmerman.
[ENHANCEMENTS]
* Calling $mech->set_fields() with no current form now dies.
Thanks to Julien Beasley.
1.10
[FIXES]
* Fixed bug where images inside of links would not be found.
* Fixed test failures because of Google changes. Thanks to
Offer Kaye and others who sent in patches.
[DOCUMENTATION]
* More samples in the FAQ. Thanks to Joshua Gatcomb.
[INTERNALS]
* Added explanation of running live tests against Google in Makefile.PL.
1.08 Fri Dec 24 01:01:06 CST 2004
[ENHANCEMENTS]
* Added find_image() and find_all_images().
1.06 Wed Dec 8 14:58:39 CST 2004
[INTERNALS]
* Now uses the base pragma instead of setting @ISA.
1.05_04 Fri Nov 5 23:35:38 CST 2004
[ENHANCEMENTS]
* Added WWW::Mechanize::Image object for representing images.
* Improved the regex on the URL for META tags.
* Added --images flag to mech-dump.
[FIXES]
* When parsing urls out of meta refresh tags, "url" may now be
uppercase (RT#8230)
* Behavior of back() fixed in a number of cases (RT#8109 reported by
Josh Purinton, patched by Dominique Quatravaux)
[INTERNALS]
* Mark figured out to how to prevent his text editor from putting tabs
into the code. Andy's blood pressure dropped slightly.
1.05_03 Sun Oct 31 20:54:33 CST 2004
[ENHANCEMENTS]
* click_button() has a new input option for HTML::Form::SubmitInput
objects (DOMQ)
* content() has new options to return the page formatted
as text, with a <base href> added. (RT#8087, patch by
Dominique Quatravaux)
* update_html() method has been added, which can be used
to modify the HTML that Mech parses. It should be sub-classed
instead of _parse_html(), which has been deprecated.
(RT#8087, patch by Dominique Quatravaux)
* select() has new option to select an option by number
(RT#5789, Scott Lanning)
* WWW::Mechanize::Link now has support providing all the
attributes of the link through a new attrs() method, which
returns them as a hashref. This is a replacement for the
alt() method, added in 1.05_01. It's not backwards
compatible with that, but, hey, that's what developer
releases are for. (RT#8092, Rob Casey and Mark Stosberg)
[FIXES]
* Upload <input type="file" ... > does not use the default
value to prevent attacks, patch by Jan Pazdziora (RT #7843).
[INTERNALS]
* Improved tests and documentation for select() (RT#5789,
Scott Lanning)
* Improve taint-safeness on Perl 5.6.1 (RT#8042, patch by
Dominique Quatravaux)
* Added tests for click_button() (RT#8061, by Dominique
Quatravaux)
* Require URI 1.25, fixing bug which exposed itself in
WWW::Mechanize (RT#3048)
* Move select() to better location in docs. Document and
test the return values. The return value is now "1" on
success instead of the undocumented behavior of returning
a form value. (RT#6138, spotted by MJD, patched by Mark
Stosberg)
* Possible matching tags for the find_link() 'tag_regex'
attribute are now documented. (RT#2989, by Mark Stosberg)
* refactored find_link() to avoid use of eval(). This should
improve performance a bit and avoid potential security
issues. (Mark Stosberg)
1.05_02 Sat Oct 2 16:55:59 CDT 2004
[ENHANCEMENTS]
* Added the $mech->save_content( $filename ) function, so you
can dump stuff to files easily.
1.05_01 Thu Sep 30 21:04:44 CDT 2004
[FIXES]
* set_visible() doesn't stop setting values when it finds a zero.
[ENHANCEMENTS]
* WWW::Mechanize::Link has a new, easier to remember constructor
interface. The old one is still supported. Support for including
an 'alt' attribute was added, which is useful for <area> links.
(RT #3317). Thanks to Mark Stosberg.
* When links are extracted from <area> tags, the ALT attribute will
be captured and become part of the WWW::Mechanize::Link object.
(RT #3317). Patch by Mark Stosberg.
[INTERNALS]
* t/mech-dump.t is now more portable (RT #7690)
* t/local/follow.t has new tests to confirm that 'follow*' functions
work with characters like o-umlaut, even when the o-umlaut is
encoded in the HTML, but not in the call to follow(). (RT #2416)
By Mark Stosberg.
module directory has changed (eg. "darwin-2level" vs.
"darwin-thread-multi-2level").
binary packages of perl modules need to be distinguishable between
being built against threaded perl and unthreaded perl, so bump the
PKGREVISION of all perl module packages and introduce
BUILDLINK_RECOMMENDED for perl as perl>=5.8.5nb5 so the correct
dependencies are registered and the binary packages are distinct.
addresses PR pkg/28619 from H. Todd Fujinaka.
Changes since 1.02:
[ENHANCEMENTS]
* $mech->get() now accepts a WWW::Mechanize::Link object.
* $mech->stack_depth(n) lets you set the depth of the mech
object's page stack. This way, if you have a Mech that does
lots of stuff and never/rarely goes back(), you won't be eating
up memory. Thanks to BooK and Chi-Fung. (RT #5362)
[FIXES]
* Fixed tests that fail under LWP >= 5.800.
* Added a workaround for LWP::UserAgent->clone() when ->{proxy}
is undef. (RT #6443)
* The Referer was getting passed as a URI object sometimes,
and that caused sadness. Eugene Haimov supplied a workaround.
(RT #6372)
[DOCUMENTATION]
* Added Ian Langworth's listmod and John Beppu's photobucket
uploader programs to WWW::Mechanize::Examples.
* Minor doc tweak for find_link()
* Finally added a value() func. Thanks to Spoon,
who even now, months after his passing, is still contributing
to Mechanize.
"WWW::Mechanize", or Mech for short, helps you automate interaction
with a website. It supports performing a sequence of page fetches
including following links and submitting forms. Each fetched page
is parsed and its links and forms are extracted. A link or a form
can be selected, form fields can be filled and the next page can
be fetched. Mech also stores a history of the URLs you've visited,
which can be queried and revisited.