3ad4440fb1
- update py-beautyfulsoup to current stable branch (4.1.1) - pass maintainership of this new port to submitter - set USE_PYTHON to 2.7, because it doesn't packages with python3 (port modification needed) - chase dependency update to deskutils/calibre - add UPDATING entry PR: 168372 (based on) Submitted by: William Grzybowski <william88 at gmail dot com> Approved by: Mike Meyer <mwm at mired dot org> (maintainer)
12 lines
630 B
Text
12 lines
630 B
Text
Beautiful Soup parses arbitrarily invalid XML- or HTML-like substance
|
|
into a tree representation. It provides methods and Pythonic idioms
|
|
that make it easy to search and modify the tree.
|
|
|
|
A well-formed XML/HTML document will yield a well-formed data
|
|
structure. An ill-formed XML/HTML document will yield a
|
|
correspondingly ill-formed data structure. If your document is only
|
|
locally well-formed, you can use this library to find and process the
|
|
well-formed part of it. The BeautifulSoup class has heuristics for
|
|
obtaining a sensible parse tree in the face of common HTML errors.
|
|
|
|
WWW: http://www.crummy.com/software/BeautifulSoup/
|