User:Lucia Dossin/Protyping/Modules/Scraping: Difference between revisions

From XPUB & Lens-Based wiki
No edit summary
No edit summary
Line 1: Line 1:
<h1>Let me Think</h1>
<h1>Let me Think</h1>


For this module, I tried out 'manual' x 'automated' scraping in my self-directed research - [[User:Lucia Dossin/Self-directed Research/Exercise1|exercise 1]].
For this module, I tried out 'manual' x 'automated' scraping in my self-directed research - exercise #1.
 
The exercise consists of a script that runs in Greasemonkey and it's meant to be used also by someone with little knowledge of code. More info can be found [[User:Lucia Dossin/Self-directed Research/Exercise1|here]].
 
I manually inspected the HTML/CSS and built the plugin code accordingly. But the HTML/CSS may change in the future. In this case, being able to use Firebug is an important skill. That's what will allow the user to change the code if necessary.
 
For the SDR exercise I put up a tutorial website. But a tutorial on Firebug on top of that seemed just too much information. So, I built a simple form that will scrape a URL, search for a few elements and return their CSS identifiers as result.

Revision as of 17:01, 25 August 2014

Let me Think

For this module, I tried out 'manual' x 'automated' scraping in my self-directed research - exercise #1.

The exercise consists of a script that runs in Greasemonkey and it's meant to be used also by someone with little knowledge of code. More info can be found here.

I manually inspected the HTML/CSS and built the plugin code accordingly. But the HTML/CSS may change in the future. In this case, being able to use Firebug is an important skill. That's what will allow the user to change the code if necessary.

For the SDR exercise I put up a tutorial website. But a tutorial on Firebug on top of that seemed just too much information. So, I built a simple form that will scrape a URL, search for a few elements and return their CSS identifiers as result.