Mia/scraping images

From XPUB & Lens-Based wiki

SCRAPING IMAGES WITH A WEB BROWSER EXTENSION

Download Firefox: https://www.mozilla.org/nl/firefox/new/

Install the "Download all Images" add-on: https://addons.mozilla.org/nl/firefox/addon/save-all-images-webextension/?src=search
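
For comparison, what such an extension does can be roughly sketched in Python: read the HTML of a page and list the address of every image in it. This is only an illustration of the idea, not how the add-on itself works. The names ImageCollector and find_image_urls and the sample page are made up for this example.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class ImageCollector(HTMLParser):
    """Collect the absolute URL of every <img src="..."> in a page."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.image_urls = []

    def handle_starttag(self, tag, attrs):
        if tag != "img":
            return
        src = dict(attrs).get("src")
        if src:
            # Relative paths are resolved against the page address.
            self.image_urls.append(urljoin(self.base_url, src))


def find_image_urls(html, base_url):
    collector = ImageCollector(base_url)
    collector.feed(html)
    return collector.image_urls


# Hypothetical page content, used here instead of a real download.
sample_html = '<p><img src="/img/bear.jpg"><img src="https://example.org/b.png"></p>'
print(find_image_urls(sample_html, "https://example.org/gallery/"))
```

Each address in the list could then be saved to disk, for example with urllib.request.urlretrieve.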


SCRAPING IMAGES WITH A PYTHON SCRIPT

Download "Google images download": https://github.com/hardikvasa/google-images-download

Unzip "Google images download" by double-clicking on it.


Open the Terminal:

Open Spotlight search, type "terminal" and press the key "Enter"

Type: cd (followed by the key "Space")

Drag the folder "google_images_download", which is in Downloads/google-images-download-master, into the Terminal window

Press the key "Enter"

Type: python google_images_download.py --keywords "Polar bears, balloons, Beaches" --limit 20


See all the available options on the script's page: https://github.com/hardikvasa/google-images-download
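
If you want to start the download from your own Python script instead of typing the command, something like the sketch below could work. It assumes it is run with the "google_images_download" folder available as script_folder. The function names build_command and download are made up for this example. Only the --keywords and --limit options shown above are used.

```python
import subprocess
import sys


def build_command(keywords, limit):
    """Return the argument list for google_images_download.py."""
    return [
        sys.executable,                     # the Python that runs this script
        "google_images_download.py",
        "--keywords", ",".join(keywords),   # one comma-separated string
        "--limit", str(limit),
    ]


def download(keywords, limit, script_folder):
    # script_folder is the folder you would otherwise "cd" into.
    subprocess.run(build_command(keywords, limit), cwd=script_folder, check=True)


print(build_command(["Polar bears", "balloons", "Beaches"], 20))
```

Calling download(["Polar bears", "balloons", "Beaches"], 20, "path/to/google_images_download") would then run the script, where the path is a placeholder for wherever you unzipped it.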


SCRAPING VIDEOS WITH YOUTUBE-DL

FOR PC:

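Open the Command Prompt

Type cd and press the key "Space", then drag the folder where youtube-dl.exe is (Documents) into the window and press the key "Enter"

Then drag the youtube-dl.exe file into the window

Press the key "Space" and add the URL of the video

Press the key "Enter". The videos are saved in the same folder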


FOR MAC:

Open Spotlight search, type "terminal" and press the key "Enter"

  • Paste the following command, which installs Homebrew:

/bin/bash -c "$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/HEAD/install.sh)" (And press the key "Enter")

  • Then type the following in the Terminal and press the key "Enter":

brew install youtube-dl


To download a video:

youtube-dl VIDEO_URL


For example: youtube-dl "https://www.youtube.com/watch?v=zcGOoDThC1E"


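To force the download of the best available quality, add "-f best -citw". "-f best" selects the best-quality format, and "-citw" resumes partial downloads, continues past errors, uses the video title as the file name and does not overwrite existing files:

youtube-dl -f best -citw "https://www.youtube.com/watch?v=zcGOoDThC1E"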
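
The bundled short flags are easier to read when written out. The sketch below builds the same command from Python with the long option names as I understand them from the youtube-dl help (--title is marked as deprecated there, so check "youtube-dl --help" for your version). The function names are made up for this example, and youtube-dl is assumed to be installed and on the PATH.

```python
import subprocess

# Long names of the short flags bundled in "-citw".
FLAGS = {
    "-c": "--continue",        # resume partially downloaded files
    "-i": "--ignore-errors",   # keep going if one video fails
    "-t": "--title",           # use the video title in the file name
    "-w": "--no-overwrites",   # do not overwrite existing files
}


def build_youtube_dl_command(url):
    """Return the argument list equivalent to: youtube-dl -f best -citw URL"""
    return ["youtube-dl", "-f", "best", *FLAGS.values(), url]


def download_video(url):
    # Passing a list (not one string) means the URL needs no shell quoting.
    subprocess.run(build_youtube_dl_command(url), check=True)


print(build_youtube_dl_command("https://www.youtube.com/watch?v=zcGOoDThC1E"))
```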


To download an entire YouTube channel:

youtube-dl -f best -citw ytuser:<USER>


For example: youtube-dl -f best -citw ytuser:tate


To download all videos on a playlist:

youtube-dl -f best -citw <PLAYLIST URL>

For example (put the URL in quotes, otherwise the Terminal treats the "&" as the end of the command): youtube-dl -f best -citw "https://www.youtube.com/watch?v=wPI1uRr4GJo&list=PL7E618BEB1EEEC4C3"
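
A playlist address contains an "&", which the Mac Terminal reads as a command separator unless the address is quoted. The small sketch below shows how Python's standard library can add the quotes for you and pull the playlist id out of the address. The function names playlist_id and shell_safe_command are made up for this example, and shlex.quote targets Unix-style shells like the Mac Terminal, not the Windows Command Prompt.

```python
import shlex
from urllib.parse import parse_qs, urlparse

playlist_url = "https://www.youtube.com/watch?v=wPI1uRr4GJo&list=PL7E618BEB1EEEC4C3"


def playlist_id(url):
    """Return the value of the "list" parameter, or None if there is none."""
    query = parse_qs(urlparse(url).query)
    return query.get("list", [None])[0]


def shell_safe_command(url):
    # shlex.quote wraps the address in single quotes when it contains
    # characters such as "&" or "?" that the shell would interpret.
    return "youtube-dl -f best -citw " + shlex.quote(url)


print(playlist_id(playlist_url))         # PL7E618BEB1EEEC4C3
print(shell_safe_command(playlist_url))
```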


To stop the script, click on the Terminal window, hold down the key "CTRL" and, without releasing it, press the key "c"