User:Manetta/wordnet/: Difference between revisions
(Created page with "=WordNet= 900px * WordNet words (about) <br> User:Manetta/prototyping/conversational-in...") |
(→so far) |
||
(4 intermediate revisions by the same user not shown) | |||
Line 1: | Line 1: | ||
<div style="width:100%;max-width:800px;"> | |||
__NOTOC__ | |||
=WordNet= | =WordNet= | ||
[[File:Mb-wordnet-tour-interface-root.png| | [[File:Mb-wordnet-tour-interface-root.png|800px]] | ||
<small> prototype for a WordNet tour. [http://213.167.241.137/cgi-bin/wordnet-tour-hypernyms.cgi take a tour here]</small> | |||
==*== | |||
WordNet is a lexical dataset and a primary resource in the field of Knowlegde Discovery in Data processes (also known as the field of data-mining and big-data). WordNet is built with word-'synsets' (where a word could have multiple entries according to multiple meanings), which are related to eachother by various relations, like: word-type, categorie or synonyms. This dataset has been developed since 1985, and is basically a norm in the field, used during training processes of data-mining algorithms. Although the focus on word-synsets is an attempt to create a nuanced model of a human language, the dataset is still a model, and will always be 'imperfect'. | |||
As written language is regarded as 'data' today, data-mining techniques 'read' written text to return 'information'. It's a constructive truth instead, and datasets as WordNet are functioning as the 'norm' of such truths. | |||
Looking closer into WordNet is an attempt to reflect on methods in which meaningless data is transformed into semantic data (in this case according to WordNet's norms). How can this normalizing resource be a design tool? And where would we like to apply it onto? And would it be possible to reflect on the question why there is still the aim to built such universal systems? | |||
==so far== | |||
[[User:Manetta/wordnet/wordnetwords | * WordNet words (about) ]]<br> | [[User:Manetta/wordnet/wordnetwords | * WordNet words (about) ]]<br> | ||
[[User:Manetta/prototyping/conversational-interfaces-WordNet-tour | * WordNet | [[User:Manetta/scripts/videogrep-wordnet-2001 | #prototype: WordNet & Videogrep, editing on hypernym (=wordgroup)]]<br> | ||
[[User:Manetta/prototyping/conversational-interfaces-WordNet-tour | #prototype: WordNet tour (part of conversational interfaces)]]<br> | |||
[[User:Manetta/i-could-have-written-that/wordnet-case-studies | * WordNet in the wild (case-studies)]]<br> | |||
[http://pzwart1.wdka.hro.nl/~manetta/i-could-have-written-that/elements/classification_what-happened_roget---wordnet/classification_what-happened_roget---wordnet.html * on the position of 'information', in Roget's thesaurus (1911) & WordNet 3.0 (2006)]<br> | |||
==notes== | |||
no | |||
ne | |||
[[User:Manetta/ | |||
-------------------------------------------- | |||
<gallery mode="nolines" height="50px"> | |||
File:Mb-WordNet-teologicals-proto.png|[[User:Manetta/wordnet/wordnetwords#teleological_links|WordNet dive]] | |||
File:Mb-WordNet-alive-tweets-01.png|[[User:Manetta/wordnet/wordnetwords|WordNet alive]] | |||
</gallery> | |||
</div> |
Latest revision as of 22:05, 25 November 2015
WordNet
prototype for a WordNet tour. take a tour here
*
WordNet is a lexical dataset and a primary resource in the field of Knowlegde Discovery in Data processes (also known as the field of data-mining and big-data). WordNet is built with word-'synsets' (where a word could have multiple entries according to multiple meanings), which are related to eachother by various relations, like: word-type, categorie or synonyms. This dataset has been developed since 1985, and is basically a norm in the field, used during training processes of data-mining algorithms. Although the focus on word-synsets is an attempt to create a nuanced model of a human language, the dataset is still a model, and will always be 'imperfect'.
As written language is regarded as 'data' today, data-mining techniques 'read' written text to return 'information'. It's a constructive truth instead, and datasets as WordNet are functioning as the 'norm' of such truths.
Looking closer into WordNet is an attempt to reflect on methods in which meaningless data is transformed into semantic data (in this case according to WordNet's norms). How can this normalizing resource be a design tool? And where would we like to apply it onto? And would it be possible to reflect on the question why there is still the aim to built such universal systems?
so far
* WordNet words (about)
#prototype: WordNet & Videogrep, editing on hypernym (=wordgroup)
#prototype: WordNet tour (part of conversational interfaces)
* WordNet in the wild (case-studies)
* on the position of 'information', in Roget's thesaurus (1911) & WordNet 3.0 (2006)
notes
no ne