Calendars:Networked Media Calendar/Networked Media Calendar/16-03-2011 -Event 1
Latest revision as of 22:59, 28 March 2011
11-18 | Nicolas Maleve - Thematic Project
Cookbook Recipes for Goodiff Workshop
- Simplifying HTML by removing "invisible" parts
- Stripping all the tags from HTML to get pure text
- Looking up synonym-sets for a word
- Splitting text into sentences
- Removing common words / stopwords
- Finding capitalized words
- Extracting parts of an HTML document
- Extracting the text contents of a node
- Turning part of a page back into code (aka serialization)
- TOS selected words frequency in time (by Dusan and Natasa): [[Goodiff_TOS_word_frequency | source code]]
  - [https://spreadsheets.google.com/pub?key=0AgT6KLPteXsOdF84Y0F3RWpxQnQ2ODFOLVA3RG9XWFE&output=html Facebook TOS]
  - [https://spreadsheets.google.com/pub?key=0AgT6KLPteXsOdHRuczQxUEU4dWxjWmNjaUtKb2JfM1E&single=true&gid=0&output=html Skype TOS]
- Simple statistics TOS
  - [[16-03-2011 Laura Amy Laurier | process]]
- TOS Game
  - [[16-03-2011_Danny_Fabien_Mirjam | Lost in TOS]]