Sniff, Scrape, Crawl (Prototyping)
Revision as of 14:30, 19 May 2014 by Michael Murtaugh (talk | contribs)
In 2011, Sniff, Scrape, Crawl was a thematic project led by Aymeric Mansoux, Renee Turner, and Michael Murtaugh.
This prototyping module covers some of the core themes and tools around the practice of "scraping", with the goal to better familiarize yourself with the possibilities of this technique and to develop strategic uses of the tools for your specific research.
Meeting 1
Scraping Tools
- S: Simple Web Spider in Python
- M: Scrapy
- L: Heritrix
Afternoon: Meeting to discuss / develop / brainstorm project ideas
Some Examples
- focused_crawls
- Lasse's Tumblr Jumper
- Birgit Bachler's Bonus Card Friends