| @@ -1,12 +1,10 @@ | | | @@ -1,12 +1,10 @@ |
1 | Beautiful Soup parses arbitrarily invalid XML- or HTML-like substance | | 1 | Beautiful Soup parses arbitrarily invalid XML- or HTML-like substance |
2 | into a tree representation. It provides methods and Pythonic idioms | | 2 | into a tree representation. It provides methods and Pythonic idioms |
3 | that make it easy to search and modify the tree. | | 3 | that make it easy to search and modify the tree. |
4 | | | 4 | |
5 | A well-formed XML/HTML document will yield a well-formed data | | 5 | A well-formed XML/HTML document will yield a well-formed data |
6 | structure. An ill-formed XML/HTML document will yield a | | 6 | structure. An ill-formed XML/HTML document will yield a correspondingly |
7 | correspondingly ill-formed data structure. If your document is only | | 7 | ill-formed data structure. If your document is only locally |
8 | locally well-formed, you can use this library to find and process the | | 8 | well-formed, you can use this library to find and process the |
9 | well-formed part of it. The BeautifulSoup class has heuristics for | | 9 | well-formed part of it. The BeautifulSoup class has heuristics for |
10 | obtaining a sensible parse tree in the face of common HTML errors. | | 10 | obtaining a sensible parse tree in the face of common HTML errors. |
11 | | | | |
12 | WWW: http://www.crummy.com/software/BeautifulSoup/ | | | |
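
For context, here is a minimal sketch of the behavior the description refers to: searching a tree built from markup with unclosed tags. It assumes the Beautiful Soup 4 package (`bs4`) is installed and uses Python's built-in `html.parser` backend. The helper name `extract_links` and the sample markup are hypothetical, chosen only for illustration, and are not part of this port.

```python
def extract_links(markup):
    """Return (text, href) pairs for every <a> tag found in the markup.

    Assumption: the third-party `bs4` package is available. It is imported
    here so the sketch only needs it when the function is actually called.
    """
    from bs4 import BeautifulSoup

    # "html.parser" is the parser bundled with Python; other backends
    # (lxml, html5lib) may repair broken markup differently.
    soup = BeautifulSoup(markup, "html.parser")

    # find_all searches the whole tree, however the unclosed tags ended up nested.
    return [(a.get_text(strip=True), a.get("href")) for a in soup.find_all("a")]


# Hypothetical ill-formed input: neither <p> is closed, nor is the last <a>.
BROKEN = '<html><body><p>Intro <a href="/one">One</a><p>More <a href="/two">Two'

if __name__ == "__main__":
    for text, href in extract_links(BROKEN):
        print(text, href)
```

The exact shape of the tree for malformed input depends on the chosen parser, so the sketch only relies on the links being findable, not on how the unclosed `<p>` tags are nested.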