User:Francg/expub/thesis/prototype: Difference between revisions
Revision as of 12:57, 5 October 2017
Prototype
Extracting data (scraping URLs / web links from content only)
from: https://www.reddit.com/
Run Python (I did it from a virtual environment on my laptop),
then enter the following commands:
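from bs4 import BeautifulSoup
import requests
# Ask for the path to append to the base URL (Python 3; on Python 2 use raw_input)
url = input("https://www.reddit.com/")
r = requests.get("https://www.reddit.com/" + url)
data = r.text
# Name the parser explicitly so BeautifulSoup does not have to guess one
soup = BeautifulSoup(data, "html.parser")
# Print the href of every <a> tag on the page
for link in soup.find_all('a'):
    print(link.get('href'))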
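The loop above prints every href on the page, including empty values, relative paths, and Reddit's own navigation links. A minimal sketch of one way to narrow this down is given here. It assumes Python 3, and it assumes that "content only" means links pointing away from reddit.com; the function name content_links and the BASE constant are hypothetical and only for illustration.

```python
from urllib.parse import urljoin, urlparse

# Assumption: the site being scraped, as in the commands above
BASE = "https://www.reddit.com/"

def content_links(hrefs, base=BASE):
    """Return unique absolute http(s) links that point away from the base site."""
    base_host = urlparse(base).netloc
    seen = set()
    result = []
    for href in hrefs:
        # link.get('href') gives None for <a> tags that have no href attribute
        if not href:
            continue
        # Resolve relative paths such as "/r/python" against the base URL
        absolute = urljoin(base, href)
        parts = urlparse(absolute)
        # Skip javascript:, mailto: and similar non-web schemes
        if parts.scheme not in ("http", "https"):
            continue
        # Skip links that stay on the site itself (navigation, login, etc.)
        if parts.netloc == base_host:
            continue
        # Keep the first occurrence of each link, in page order
        if absolute not in seen:
            seen.add(absolute)
            result.append(absolute)
    return result
```

With the soup object from the commands above, it could be used as content_links(link.get('href') for link in soup.find_all('a')). Comparing hosts is a rough filter: it would also drop links to other reddit.com pages that are part of a post, so the rule may need adjusting.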