@@ -31,19 +31,21 @@ Usage
     >> doc = Document(response.text)
     >> doc.title()
     >> 'Example Domain'
+    >> doc.summary()
+    >> u'<html><body><div><body id="readabilityBody">\n<div>\n <h1>Example Domain</h1>\n
+    <p>This domain is established to be used for illustrative examples in documents. You may
+    use this\n domain in examples without prior coordination or asking for permission.</p>
+    \n <p><a href="http://www.iana.org/domains/example">More information...</a></p>\n</div>
+    \n</body>\n</div></body></html>'
 
 Change Log
 ----------
 
-- 0.7 Improved HTML5 tags handling. Heuristics were changed for a lot of sites: Fixed an important
-  bug with stripping unwanted HTML nodes (only first matching node was removed before).
-- 0.6 Finally a release which supports Python versions 2.6, 2.7, 3.3
-  and 3.4
-- 0.5 Preparing a release to support Python versions 2.6, 2.7, 3.3 and
-  3.4
+- 0.7 Improved HTML5 tags handling. Fixed stripping unwanted HTML nodes (only first matching node was removed before).
+- 0.6 Finally a release which supports Python versions 2.6, 2.7, 3.3 and 3.4
+- 0.5 Preparing a release to support Python versions 2.6, 2.7, 3.3 and 3.4
 - 0.4 Added Videos loading and allowed more images per paragraph
-- 0.3 Added Document.encoding, positive\_keywords and
-  negative\_keywords
+- 0.3 Added Document.encoding, positive\_keywords and negative\_keywords
 
 Licensing
 =========
@@ -54,14 +56,9 @@ This code is under `the Apache License
 Thanks to
 ---------
 
-- Latest
-  `readability.js <https://github.com/MHordecki/readability-redux/blob/master/readability/readability.js>`__
+- Latest `readability.js <https://github.com/MHordecki/readability-redux/blob/master/readability/readability.js>`__
 - Ruby port by starrhorne and iterationlabs
-- `Python port <https://github.com/gfxmonk/python-readability>`__ by
-  gfxmonk
-- `Decruft
-  effort <http://www.minvolai.com/blog/decruft-arc90s-readability-in-python/>`__
-  to move to lxml
-- "BR to P" fix from readability.js which improves quality for smaller
-  texts
+- `Python port <https://github.com/gfxmonk/python-readability>`__ by gfxmonk
+- `Decruft effort <http://www.minvolai.com/blog/decruft-arc90s-readability-in-python/>`__ to move to lxml
+- "BR to P" fix from readability.js which improves quality for smaller texts
 - Github users contributions.
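
The 0.7 entry in the change log above mentions that only the first matching node used to be removed when stripping unwanted HTML. The following is a minimal sketch of that class of bug, not the project's actual code: it uses the standard library's `xml.etree.ElementTree` instead of lxml, and the names `strip_first_match`, `strip_all_matches`, `render` and `MARKUP` are hypothetical.

```python
import xml.etree.ElementTree as ET


def strip_first_match(root, tag):
    """Buggy variant: returns after removing the first matching node."""
    for parent in root.iter():
        for child in list(parent):
            if child.tag == tag:
                parent.remove(child)
                return  # later matches are left in the tree


def strip_all_matches(root, tag):
    """Fixed variant: removes every matching node in the tree."""
    # Snapshot the traversal so removals do not disturb the iteration.
    for parent in list(root.iter()):
        for child in list(parent):
            if child.tag == tag:
                parent.remove(child)


def render(markup, strip):
    """Parse markup, strip <script> nodes with the given function, serialize."""
    root = ET.fromstring(markup)
    strip(root, "script")
    return ET.tostring(root, encoding="unicode")


# Hypothetical input with two unwanted nodes.
MARKUP = "<div><script>a()</script><p>Text</p><script>b()</script></div>"

print(render(MARKUP, strip_first_match))
print(render(MARKUP, strip_all_matches))
```

With the first function the second `<script>` element survives; with the second, only the `<p>` element remains inside the `<div>`.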