2008-09-05 17:46:51 +02:00
Beautiful Soup parses arbitrarily invalid XML- or HTML-like markup
into a tree representation. It provides methods and Pythonic idioms
that make it easy to search and modify the tree.

A well-formed XML/HTML document will yield a well-formed data
structure. An ill-formed XML/HTML document will yield a correspondingly
ill-formed data structure. If your document is only locally
well-formed, you can use this library to find and process the
well-formed part of it. The BeautifulSoup class has heuristics for
obtaining a sensible parse tree in the face of common HTML errors.

This package contains v3 of the module.
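
To illustrate the idea of turning ill-formed markup into a searchable tree, here is a minimal sketch. It does not use Beautiful Soup: it is built on Python 3's standard html.parser module, and the names Node, TolerantTreeBuilder, and parse are hypothetical, chosen only for this example. Its single recovery rule (ignore stray end tags, implicitly close anything left open) is an assumption and is far simpler than the heuristics in the BeautifulSoup class.

```python
from html.parser import HTMLParser

# Elements that never have content, so they must not become the open element.
VOID_TAGS = {"br", "hr", "img", "input", "meta", "link"}


class Node:
    """One element of the tree (hypothetical helper, not a Beautiful Soup class)."""

    def __init__(self, name, attrs=None, parent=None):
        self.name = name
        self.attrs = dict(attrs or [])
        self.parent = parent
        self.children = []  # child Node objects and plain text strings

    def find_all(self, name):
        """Yield every descendant element with the given tag name, in document order."""
        for child in self.children:
            if isinstance(child, Node):
                if child.name == name:
                    yield child
                yield from child.find_all(name)

    def text(self):
        """Concatenate all text found below this element."""
        return "".join(c if isinstance(c, str) else c.text() for c in self.children)


class TolerantTreeBuilder(HTMLParser):
    """Builds a Node tree and never raises on unbalanced tags."""

    def __init__(self):
        super().__init__()
        self.root = Node("[document]")
        self.current = self.root

    def handle_starttag(self, tag, attrs):
        node = Node(tag, attrs, self.current)
        self.current.children.append(node)
        if tag not in VOID_TAGS:
            self.current = node

    def handle_endtag(self, tag):
        # Close the nearest open element with this name; this implicitly
        # closes anything opened inside it. A stray end tag is ignored.
        node = self.current
        while node is not self.root and node.name != tag:
            node = node.parent
        if node is not self.root:
            self.current = node.parent

    def handle_data(self, data):
        self.current.children.append(data)


def parse(markup):
    builder = TolerantTreeBuilder()
    builder.feed(markup)
    builder.close()
    return builder.root


# Unclosed <p> and <a>, a misnested </b>, and a stray </i>.
tree = parse("<p>One <b>bold<p>Two <a href='x.html'>link</b></i>")
hrefs = [a.attrs["href"] for a in tree.find_all("a")]
paragraphs = [p.text() for p in tree.find_all("p")]
```

In this sketch the second p element ends up nested inside the b element, which mirrors the point above that ill-formed input yields a correspondingly ill-formed structure, while the link inside it can still be found and processed.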