Jump to navigation Jump to search
Scraping (also Screen Scraping) is the process of extracting data out of something.
Other interesting libraries to consider:
- Mechanize in essence simulates a browser in Python, that can "remember" things (like cookies / sessions) between pages
- lxml which can apparently deal with "mal-formed" HTML and quickly convert them to xml trees
- http://scrapy.org/ Python framework for custom scrapers