BeautifulSoup: Difference between revisions
(New page: Beautiful Soup is a Python library for manipulating HTML pages. !! Examples A function to replace the contents of a tag: <source lang="python"> import BeautifulSoup soup = BeautifulSoup....) |
No edit summary |
||
Line 1: | Line 1: | ||
Beautiful Soup is a Python library for manipulating HTML pages. | Beautiful Soup is a Python library for manipulating HTML pages. | ||
* [[http://www.crummy.com/software/BeautifulSoup/]] | |||
* [[http://www.crummy.com/software/BeautifulSoup/documentation.html documentation]] | |||
== Code Examples == | |||
A function to replace the contents of a tag: | A function to replace the contents of a tag: |
Revision as of 11:42, 12 June 2008
Beautiful Soup is a Python library for manipulating HTML pages.
- [[1]]
- [documentation]
Code Examples
A function to replace the contents of a tag:
import BeautifulSoup
soup = BeautifulSoup.BeautifulSoup("<ul><li>one</li><li>two</li></ul>")
def setcontents (tag, val):
# remove previous contents
for c in tag.contents:
c.extract()
# insert the new
tag.insert(0, val)
items = soup.findAll("li")
for item in items:
setcontents(item, "foo")
print soup.prettify()