User:Manetta/thesis/thesis-outline

From XPUB & Lens-Based wiki
< User:Manetta
Revision as of 11:37, 5 January 2016 by Manetta (talk | contribs)

thesis outline

intro

  • NLP, natural language processing
  • current focus: data-mining field (a data-fashion)
  • mining as ideology
    • from: mining natural resources, to: data mining [what are the differences?]
    • 'data mining'?
  • text processing
    • from: able to check results with senses (OCR), to: intuition (data-mining) [what are the differences?]
    • parsing, how text is treated: as n-grams, chunks, bag-of-words, characters
  • data as autonomous entity
    • from: information, to: data science [what are the differences?]
  • concern

hypothesis

The results of data-mining software are not mined, results are created. What elements do allow for algorithmic agreeability?

algorithmic agreeability - case study objects

  • anthropomorphism:
    • data mining
    • machine learning
  • wordclouds
  • workflow mining-software (eg. Pattern, Wecka)
  • data as autonomous entity


research material

→ filesystem interface, collecting research related material (+ about the workflow)
→ wikipage for 'i-could-have-written-that' (list of prototypes & inquiries)
→ little glossary

mining as ideology

* from mining minerals to mining data

anthropomorphism

* anthropomorphic qualities of a computer (?)
* the photographic apparatus → the data apparatus (annotations)
* Joseph's (Weizenbaum) questions on Computer Power and Human Reason

text processing

* semantic math: averaging polarity rates in Pattern (text mining software package)
* notes on wordclouds
* automatic reading machines; from encoding-decoding to constructed-truths
* index of WordNet 3.0 (2006)

data as autonomous entity

* knowledge driven by data - whenever i fire a linguist, the results improve

other

* (laughter) - it's embarrassing but these are the words
* call for a syntactic view; Florian Cramer & Benjamin Bratton (text)
* EUR PhD presentation 'Sentiment Analysis of Text Guided by Semantics and Structure' (13-11-2015)
* index of Roget's thesaurus (1805)
* comparing the classification of the word 'information' Thesaurus (1911) vs. WordNet 3.0 (2006)


annotations

  • Alan Turing - Computing Machinery and Intelligence (1936)
  • The Journal of Typographic Research - OCR-B: A Standardized Character for Optical Recognition this article (V1N2) (1967); → abstract
  • Ted Nelson - Computer Lib & Dream Machines (1974);
  • Joseph Weizenbaum - Computer Power and Human Reason (1976); → annotations
  • Water J. Ong - Orality and Literacy (1982);
  • Vilem Flusser - Towards a Philosophy of Photography (1983); → annotations
  • Christiane Fellbaum - WordNet, an Electronic Lexical Database (1998);
  • Charles Petzold - Code, the hidden languages and inner structures of computer hardware and software (2000); → annotations
  • John Hopcroft, Rajeev Motwani, Jeffrey Ullman - Introduction to Automata Theory, Languages, and Computation (2001);
  • James Gleick - The Information, a History, a Theory, a Flood (2008); → annotations
  • Matthew Fuller - Software Studies. A lexicon (2008);
  • Marissa Meyer - the physics of data, lecture (2009); → annotations
  • Matthew Fuller & Andrew Goffey - Evil Media (2012); → annotations
  • Antoinette Rouvroy - All Watched Over By Algorithms - Transmediale (Jan. 2015); → annotations
  • Benjamin Bratton - Outing A.I., Beyond the Turing test (Feb. 2015) → annotations
  • Ramon Amaro - Colossal Data and Black Futures, lecture (Okt. 2015); → annotations
  • Benjamin Bratton - On A.I. and Cities : Platform Design, Algorithmic Perception, and Urban Geopolitics (Nov. 2015);


bibliography (five key texts)

  • Language, Florian Cramer (2008); → annotations
  • Antoinette Rouvroy - All Watched Over By Algorithms - Transmediale (Jan. 2015); → annotations
  • The Journal of Typographic Research - OCR-B: A Standardized Character for Optical Recognition this article (V1N2) (1967); → abstract