User:Bohye Woo/Prototyping: Difference between revisions

From XPUB & Lens-Based wiki
No edit summary
No edit summary
 
(37 intermediate revisions by the same user not shown)
Line 38: Line 38:


[[Motivational messages]]
[[Motivational messages]]
[[Category:Prototyping]]
= Py.rate.chinic workshop #1=
==Workshop #1==
Using Selenium to scrap the Youtube comments, and using text processor to rank the most frequent words.
<source lang=python>
import re
import string
frequency = {}
document_text = open('4.txt', 'r')
text_string = document_text.read().lower()
match_pattern = re.findall(r'\b[a-z]{4,15}\b', text_string)
for word in match_pattern:
    count = frequency.get(word,0)
    frequency[word] = count + 1
   
frequency_list = frequency.keys()
print (frequency)
for word in sorted(frequency, key=frequency.get):
print (word, frequency[word])
</source>
Link: [[.py.rate.chnic sessions]]
=Pandoc converter=
A universal document converter - converts from one markup language onto another
https://pandoc.org/MANUAL.html
Use: convert downloaded wiki pages onto HTML files
extensive documentation in [https://pandoc.org/MANUAL.html Pandoc’s Manual] or <code>man pandoc</code>
==calibre ebook program==
https://calibre-ebook.com/
==Pandoc (Convert Docx file to Media Wiki)==
===Pandoc common arguments===
'''-f''' - option standing for “from”, is followed by the input format;
'''-t''' - option standing for “to”, is followed by the output format;
'''-s''' - option standing for “standalone”, produces output with an appropriate header and footer;
'''-o''' - option for file output;
'''mediawiki''' - mediawiki input filename - you need to replace it by its actual name
<source lang=python>
echo texts.docx
echo '<h1>Hello</h1>'
</source>
<source lang=python>
# (Convert Docx file to Markdown)
echo '<h1>Hello</h1>' | pandoc -f (from) html -t(to) markdown
# (Convert Docx file to Media Wiki)
echo '<h1>Hello</h1>' | pandoc -f (from) html -t(to) mediawiki
# (Convert Docx file to Latex)
echo '<h1>Hello</h1>' | pandoc -f (from) html -t(to) latex
</source>
<source lang=python>
#Content is stored in a file now
#start Pandoc program
pandoc texts.docx -f docx -t mediawiki
#convert to wiki file to get output of texts.wiki
pandoc texts.docx -f docx -t mediawiki -o texts.wiki
#check the text
less text.wiki
</source>
==From Wiki to Html to PDF==
example: https://gitlab.com/Mondotheque/RadiatedBook/blob/master/resources/style.pdf.css
Weasyprint filename.html  -s style.css filename.pdf
<source lang=python>
#download the page
#STEP 1
#take content from wiki
./wiki-download.py --pages 'Entreprecariat reader synopses and abstracts'
#STEP 2
#converting mediawiki file to html file:
pandoc 'Entreprecariat reader synopses and abstracts' -f mediawiki -t html -s -o 'Entreprecariat_reader.html'
#STEP 3
#convert html to pdf
weasyprint Entreprecariat_reader.html -s style.css output.pdf
</source>
[[Program Languages]]

Latest revision as of 12:38, 23 October 2018

Motivational messages - work groups

Bo, Bi, Pedro, Rita group (BBPR)

Pad:https://pad.xpub.nl/p/LINKEDIN

outcome

http://145.24.139.232/~pedrosaclout/linkedinproject/

In my role as Head of recruiting for technology product development in India, I have had the exciting opportunity to was director of Open State Foundation, a non-profit organization. I am Isla Garcia and I . I take responsibility and pride myself in being strategic yet adaptable. I have an entrepreneurial spirit in that I enjoy taking on new challenges, creating new opportunities and designing new programs. My passions lie in reinforcement learning. When Iâm not focused on my professional endeavors, you can find me go 14,000 feet above sea level hiking a mountain. My goal is to be a good social responsibility person in society.

In my role as Co-Founder, I have had the exciting opportunity to was director of Open State Foundation, a non-profit organization. I am Isla Garcia and I . I take responsibility and pride myself in being strategic yet adaptable. I have an entrepreneurial spirit in that I enjoy taking on new challenges, creating new opportunities and designing new programs. My passions lie in GANs. When Iâm not focused on my professional endeavors, you can find me go 14,000 feet above sea level hiking a mountain. My goal is to become a good software engineer in software field.

script

/home/pedrosaclout/public_html/linkedinproject/generator.sh

#!/bin/sh
dir=/home/pedrosaclout/public_html/linkedinproject
profession=`cat $dir/professions.txt | sort -R | head -n 1`
subject=`cat $dir/subject.txt | sort -R | head -n 1`
goal=`cat $dir/goal.txt | sort -R | head -n 1`
education=`cat $dir/education.txt | sort -R | head -n 1`
quotes=`cat $dir/quotes.txt | sort -R | head -n 1`
adjectives=`cat $dir/adjectives.txt | sort -R | head -n 1`
name=`cat $dir/names.txt | sort -R | head -n 1`
hobby=`cat $dir/hobby.txt | sort -R | head -n 1`
experience=`cat $dir/experience.txt | sort -R | head -n 1`

template=`cat $dir/template.txt| sort -R | head -n 1`


echo $template | sed "s/PROFESSION/$profession/g"  | sed "s/EXPERIENCE/$experience/g" | sed "s/SUBJECT/$subject/g" | sed "s/GOAL/$goal/g" | sed "s/EDUCATION/$education/g" | sed "s/QUOTE/$quotes/g" | sed "s/ADJECTIVES/$adjectives/g" | sed "s/NAME/$name/g" | sed "s/HOBBY/$hobby/g" | sed "s/EXPERIENCE/$experience/g" > $dir/index.html


 from nltk.corpus import wordnet synonyms = [] for syn in wordnet.synsets('Computer'): for lemma in syn.lemmas(): synonyms.append(lemma.name()) print(synonyms)

Motivational messages


Py.rate.chinic workshop #1

Workshop #1

Using Selenium to scrap the Youtube comments, and using text processor to rank the most frequent words.

import re
import string
frequency = {}
document_text = open('4.txt', 'r')
text_string = document_text.read().lower()
match_pattern = re.findall(r'\b[a-z]{4,15}\b', text_string)
 
for word in match_pattern:
    count = frequency.get(word,0)
    frequency[word] = count + 1
     
frequency_list = frequency.keys()
print (frequency)

for word in sorted(frequency, key=frequency.get):
	print (word, frequency[word])

Link: .py.rate.chnic sessions

Pandoc converter

A universal document converter - converts from one markup language onto another https://pandoc.org/MANUAL.html

Use: convert downloaded wiki pages onto HTML files

extensive documentation in Pandoc’s Manual or man pandoc

calibre ebook program

https://calibre-ebook.com/

Pandoc (Convert Docx file to Media Wiki)

Pandoc common arguments

-f - option standing for “from”, is followed by the input format;

-t - option standing for “to”, is followed by the output format;

-s - option standing for “standalone”, produces output with an appropriate header and footer;

-o - option for file output;

mediawiki - mediawiki input filename - you need to replace it by its actual name


echo texts.docx 
echo '<h1>Hello</h1>'
# (Convert Docx file to Markdown)
echo '<h1>Hello</h1>' | pandoc -f (from) html -t(to) markdown

# (Convert Docx file to Media Wiki)
echo '<h1>Hello</h1>' | pandoc -f (from) html -t(to) mediawiki

# (Convert Docx file to Latex)
echo '<h1>Hello</h1>' | pandoc -f (from) html -t(to) latex
#Content is stored in a file now

#start Pandoc program
pandoc texts.docx -f docx -t mediawiki

#convert to wiki file to get output of texts.wiki
pandoc texts.docx -f docx -t mediawiki -o texts.wiki

#check the text
less text.wiki

From Wiki to Html to PDF

example: https://gitlab.com/Mondotheque/RadiatedBook/blob/master/resources/style.pdf.css

Weasyprint filename.html -s style.css filename.pdf

#download the page 

#STEP 1
#take content from wiki
./wiki-download.py --pages 'Entreprecariat reader synopses and abstracts'

#STEP 2
#converting mediawiki file to html file:
pandoc 'Entreprecariat reader synopses and abstracts' -f mediawiki -t html -s -o 'Entreprecariat_reader.html' 

#STEP 3
#convert html to pdf
weasyprint Entreprecariat_reader.html -s style.css output.pdf

Program Languages