User:Manetta/thesis/thesis-outline: Difference between revisions
No edit summary |
No edit summary |
||
Line 2: | Line 2: | ||
=== intro=== | === intro=== | ||
* | * NLP, natural language processing | ||
* | * current focus: data-mining field (a data-fashion) | ||
* | * mining as ideology | ||
* | ** from: mining natural resources, to: data mining ''[what are the differences?]'' | ||
** 'data mining'? | |||
* text processing | |||
** from: able to check results with senses (OCR), to: intuition (data-mining) ''[what are the differences?]'' | |||
** parsing, how text is treated: as n-grams, chunks, bag-of-words, characters | |||
* data as autonomous entity | |||
** from: information, to: data science ''[what are the differences?]'' | |||
* concern | |||
===hypothesis=== | ===hypothesis=== | ||
The results of data-mining software are not mined, results are created. | |||
What elements do allow for algorithmic agreeability? | |||
=== algorithmic agreeability - case study objects=== | |||
=== algorithmic agreeability - | * anthropomorphism: | ||
** data ''mining'' | |||
** machine ''learning'' | |||
* wordclouds | * wordclouds | ||
* | * workflow mining-software (eg. Pattern, Wecka) | ||
* data as autonomous entity | * data as autonomous entity | ||
Line 27: | Line 32: | ||
[[User:Manetta/i-could-have-written-that/little-glossary | → little glossary]]<br> | [[User:Manetta/i-could-have-written-that/little-glossary | → little glossary]]<br> | ||
=== | ===mining as ideology=== | ||
[[User:Manetta/i-could-have-written-that/ | [[User:Manetta/i-could-have-written-that/from-mining-minerals-to-mining-data | * from mining minerals to mining data]]<br> | ||
'''anthropomorphism''' | |||
[[User:Manetta/i-could-have-written-that/anthropomorphic-qualities | * anthropomorphic qualities of a computer (?)]]<br> | [[User:Manetta/i-could-have-written-that/anthropomorphic-qualities | * anthropomorphic qualities of a computer (?)]]<br> | ||
[[User:Manetta/i-could-have-written-that/the-data-apparatus | * the photographic apparatus → the data apparatus ( | [[User:Manetta/i-could-have-written-that/the-data-apparatus | * the photographic apparatus → the data apparatus (annotations)]] <br> | ||
[http://pzwart1.wdka.hro.nl/~manetta/i-could-have-written-that/elements/joseph-s_questions/joseph-s_questions.html * Joseph's (Weizenbaum) questions on Computer Power and Human Reason]<br> | [http://pzwart1.wdka.hro.nl/~manetta/i-could-have-written-that/elements/joseph-s_questions/joseph-s_questions.html * Joseph's (Weizenbaum) questions on Computer Power and Human Reason]<br> | ||
===text processing=== | |||
[http://pzwart1.wdka.hro.nl/~manetta/i-could-have-written-that/elements/semantic-math-averaging/semantic-math-averaging.html * semantic math: averaging polarity rates in Pattern (text mining software package)]<br> | |||
[[User:Manetta/i-could-have-written-that/wordclouds | * notes on wordclouds]]<br> | |||
[http://pzwart1.wdka.hro.nl/~manetta/i-could-have-written-that/elements/automatic-reading-machines/automatic-reading-machines.html * automatic reading machines; from encoding-decoding to constructed-truths]<br> | |||
[http://pzwart1.wdka.hro.nl/~manetta/i-could-have-written-that/elements/wordnet-skeleton/wordnet-skeleton.html * index of WordNet 3.0 (2006)]<br> | |||
===data as autonomous entity=== | |||
[http://pzwart1.wdka.hro.nl/~manetta/i-could-have-written-that/elements/knowlegde-driven-by-the-data/knowlegde-driven-by-the-data.html * knowledge driven by data - ''whenever i fire a linguist, the results improve'']<br> | |||
===other=== | ===other=== | ||
[http://pzwart1.wdka.hro.nl/~manetta/i-could-have-written-that/elements/ | [http://pzwart1.wdka.hro.nl/~manetta/i-could-have-written-that/elements/i-am-sorry-but-these-are-the-words-laughter/i-am-sorry-but-these-are-the-words-laughter.html * (laughter) - ''it's embarrassing but these are the words'']<br> | ||
[[User:Manetta/i-could-have-written-that/syntactic-view | * call for a syntactic view; Florian Cramer & Benjamin Bratton (text)]] <br> | |||
[[User:Manetta/i-could-have-written-that/sentiment-analysis-phd-presentation | * EUR PhD presentation 'Sentiment Analysis of Text Guided by Semantics and Structure' (13-11-2015) ]]<br> | [[User:Manetta/i-could-have-written-that/sentiment-analysis-phd-presentation | * EUR PhD presentation 'Sentiment Analysis of Text Guided by Semantics and Structure' (13-11-2015) ]]<br> | ||
[http://pzwart1.wdka.hro.nl/~manetta/i-could-have-written-that/elements/roget-s_thesaurus-of-english-words-and-phrases/roget-s_thesaurus-of-english-words-and-phrases.html * index of Roget's thesaurus (1805)]<br> | |||
[http://pzwart1.wdka.hro.nl/~manetta/i-could-have-written-that/elements/classification_what-happened_roget---wordnet/classification_what-happened_roget---wordnet.html * comparing the classification of the word 'information' Thesaurus (1911) vs. WordNet 3.0 (2006)]<br> | |||
Line 52: | Line 61: | ||
* Alan Turing - Computing Machinery and Intelligence (1936) | * Alan Turing - Computing Machinery and Intelligence (1936) | ||
* The Journal of Typographic Research - OCR-B: A Standardized Character for Optical Recognition this article (V1N2) (1967); [http://pzwart1.wdka.hro.nl/~manetta/i-could-have-written-that/elements/automatic-reading-machines/automatic-reading-machines.html → abstract] | * The Journal of Typographic Research - OCR-B: A Standardized Character for Optical Recognition this article (V1N2) (1967); [http://pzwart1.wdka.hro.nl/~manetta/i-could-have-written-that/elements/automatic-reading-machines/automatic-reading-machines.html → abstract] | ||
* Ted Nelson - Computer Lib & Dream Machines (1974); | * Ted Nelson - Computer Lib & Dream Machines (1974); | ||
* Joseph Weizenbaum - Computer Power and Human Reason (1976); [http://pzwart1.wdka.hro.nl/~manetta/i-could-have-written-that/elements/joseph-weizenbaum_computer-power-and-human-reason/joseph-weizenbaum_computer-power-and-human-reason.html → annotations] | * Joseph Weizenbaum - Computer Power and Human Reason (1976); [http://pzwart1.wdka.hro.nl/~manetta/i-could-have-written-that/elements/joseph-weizenbaum_computer-power-and-human-reason/joseph-weizenbaum_computer-power-and-human-reason.html → annotations] | ||
Line 73: | Line 81: | ||
=bibliography (five key texts)= | =bibliography (five key texts)= | ||
* Language, Florian Cramer (2008); [http://pzwart1.wdka.hro.nl/~manetta/annotations/html/txt/florian-cramer_language.html → annotations] | * Language, Florian Cramer (2008); [http://pzwart1.wdka.hro.nl/~manetta/annotations/html/txt/florian-cramer_language.html → annotations] | ||
* Antoinette Rouvroy - All Watched Over By Algorithms - Transmediale (Jan. 2015); [http://pzwart1.wdka.hro.nl/~manetta/annotations/html/events%2btalks/transmediale_all-watched-over-by-algorithms_2015.html → annotations] | * Antoinette Rouvroy - All Watched Over By Algorithms - Transmediale (Jan. 2015); [http://pzwart1.wdka.hro.nl/~manetta/annotations/html/events%2btalks/transmediale_all-watched-over-by-algorithms_2015.html → annotations] | ||
* | * The Journal of Typographic Research - OCR-B: A Standardized Character for Optical Recognition this article (V1N2) (1967); [http://pzwart1.wdka.hro.nl/~manetta/i-could-have-written-that/elements/automatic-reading-machines/automatic-reading-machines.html → abstract] | ||
* | * |
Revision as of 11:37, 5 January 2016
thesis outline
intro
- NLP, natural language processing
- current focus: data-mining field (a data-fashion)
- mining as ideology
- from: mining natural resources, to: data mining [what are the differences?]
- 'data mining'?
- text processing
- from: able to check results with senses (OCR), to: intuition (data-mining) [what are the differences?]
- parsing, how text is treated: as n-grams, chunks, bag-of-words, characters
- data as autonomous entity
- from: information, to: data science [what are the differences?]
- concern
hypothesis
The results of data-mining software are not mined, results are created. What elements do allow for algorithmic agreeability?
algorithmic agreeability - case study objects
- anthropomorphism:
- data mining
- machine learning
- wordclouds
- workflow mining-software (eg. Pattern, Wecka)
- data as autonomous entity
research material
→ filesystem interface, collecting research related material (+ about the workflow)
→ wikipage for 'i-could-have-written-that' (list of prototypes & inquiries)
→ little glossary
mining as ideology
* from mining minerals to mining data
anthropomorphism
* anthropomorphic qualities of a computer (?)
* the photographic apparatus → the data apparatus (annotations)
* Joseph's (Weizenbaum) questions on Computer Power and Human Reason
text processing
* semantic math: averaging polarity rates in Pattern (text mining software package)
* notes on wordclouds
* automatic reading machines; from encoding-decoding to constructed-truths
* index of WordNet 3.0 (2006)
data as autonomous entity
* knowledge driven by data - whenever i fire a linguist, the results improve
other
* (laughter) - it's embarrassing but these are the words
* call for a syntactic view; Florian Cramer & Benjamin Bratton (text)
* EUR PhD presentation 'Sentiment Analysis of Text Guided by Semantics and Structure' (13-11-2015)
* index of Roget's thesaurus (1805)
* comparing the classification of the word 'information' Thesaurus (1911) vs. WordNet 3.0 (2006)
annotations
- Alan Turing - Computing Machinery and Intelligence (1936)
- The Journal of Typographic Research - OCR-B: A Standardized Character for Optical Recognition this article (V1N2) (1967); → abstract
- Ted Nelson - Computer Lib & Dream Machines (1974);
- Joseph Weizenbaum - Computer Power and Human Reason (1976); → annotations
- Water J. Ong - Orality and Literacy (1982);
- Vilem Flusser - Towards a Philosophy of Photography (1983); → annotations
- Christiane Fellbaum - WordNet, an Electronic Lexical Database (1998);
- Charles Petzold - Code, the hidden languages and inner structures of computer hardware and software (2000); → annotations
- John Hopcroft, Rajeev Motwani, Jeffrey Ullman - Introduction to Automata Theory, Languages, and Computation (2001);
- James Gleick - The Information, a History, a Theory, a Flood (2008); → annotations
- Matthew Fuller - Software Studies. A lexicon (2008);
- Language, Florian Cramer; → annotations
- Algorithm, Andrew Goffey;
- Marissa Meyer - the physics of data, lecture (2009); → annotations
- Matthew Fuller & Andrew Goffey - Evil Media (2012); → annotations
- Antoinette Rouvroy - All Watched Over By Algorithms - Transmediale (Jan. 2015); → annotations
- Benjamin Bratton - Outing A.I., Beyond the Turing test (Feb. 2015) → annotations
- Ramon Amaro - Colossal Data and Black Futures, lecture (Okt. 2015); → annotations
- Benjamin Bratton - On A.I. and Cities : Platform Design, Algorithmic Perception, and Urban Geopolitics (Nov. 2015);
bibliography (five key texts)
- Language, Florian Cramer (2008); → annotations
- Antoinette Rouvroy - All Watched Over By Algorithms - Transmediale (Jan. 2015); → annotations
- The Journal of Typographic Research - OCR-B: A Standardized Character for Optical Recognition this article (V1N2) (1967); → abstract