User:Bohye Woo/Degrees of colonially in Terms of Service

From XPUB & Lens-Based wiki

DoCiToS — general description

DoCiToS (Degrees of Coloniality in Terms of Service) is a modern-colonial publishing archive, a series of numerical analysis that reveal the colonial contexts in 'Terms of Service (ToS)'. ToS is a modern working contract in digital colonial society, where big companies are dominating the power structures via ToS to deceive users. The project develops the colonization of metrics by measuring how colonized particular terms are being used in ToS.

The role of DoCiToS

DoCiToS...

  • introduces how modern digital colonialism is showing in digital society by mapping out the whole colonial architecture of Terms of Service.
  • archives news articles, videos, books, tweets ... to proffer knowledges related with the project.
  • provides a basic analysis on each web platforms by using 'MediaWiki' to store information.
  • collects Terms of Services from major websites to investigate the degrees of coloniality: every platform has different tones of coloniality, different words being used. To what extent does the platforms have coloniality in Terms of Service?
  • measures the degrees of coloniality: I specifically focus on measuring words within the realm of Terms of Service. I narrow down the context to that of ToS to prevent generalizing the measurement.
  • develops colonial language into colonial glossary.
  • annotates numerical datas on analyzing the colonial context of Terms of Service.

Importance of language

Language is important because:

  • Language as a colonial tool: Treaties in colonial times and Terms of Service in modern time both are to control the power structure. it shows a way to indoctrinate/manipulate the colonize subject via ToS.
  • Polarity of calming language: How specific words can play as colonial words? Using their comfort language become a colonial language. Comforting language for the company that are using highly polarized terms sometimes obfuscates or conceal users labour to deceive them.
  • Language as a colonial waterway: Shiproute in colonial times that used to sail to exchange/extract/export the goods. Language itself as a medium by which to interface with the colonizer.

How to publish/archive?

The archive will contain different analyzations on the colonial words and context.

  • In order to archive, I've been gathering some resources related to this project from websites to books.
  • In order to provide a basic analysis on each web platforms, I use 'MediaWiki' to store in a database.
  • In order to collect Terms of Services from major websites to investigate the degrees of coloniality, I keep reading, annotating and analyzing the context of ToS.
  • In order to measure the degrees of coloniality and annotates numerical datas, I use NLTK's POS tag and modify a Python script called 'Modality.py'. I will also use this technic to rank the most frequent colonial words in each ToS and show the relation between words that are in the same degree of coloniality.
  • In order to develop colonial language into colonial glossary, I build a colonial-glossary JSON file that contains all the analysed and categorised words

DoCiToS — Process

Tools

1. The Semantic MediaWiki extension allows for managing structured data in my wiki and for querying that data to create dynamic representations into timeline based website. (https://www.semantic-mediawiki.org/wiki/Semantic_MediaWiki)

2. NLTK will help to analyse and categorise the words from ToS.

3. Pattern is a web mining module for the Python programming language. A Python script 'Modality.py' is written in Pattern, it it a rule-based text mining program.

A list of Terms of Services

These companies are officially not a colony, but in a pratical term: a colony. There is a relationship that is unfair regardless what their intention is.

PokemonGo: https://www.pokemon.com/us/privacy-notice/
TikTok: https://www.tiktok.com/legal/privacy-policy?lang=cn#privacy-row
Snapchat: https://www.snap.com/en-GB/privacy/privacy-policy/ / https://www.snap.com/en-GB/terms/
FaceApp: https://www.faceapp.com/privacy-en.html / https://www.faceapp.com/terms-en.html
Netflix: https://help.netflix.com/legal/termsofuse / https://help.netflix.com/legal/privacy
Amazon: https://www.amazon.com/gp/help/customer/display.html?ie=UTF8&nodeId=468496&ref_=footer_privacy / https://www.amazon.com/gp/help/customer/display.html?ie=UTF8&nodeId=508088&ref_=footer_cou
Sony - Aibo: https://us.aibo.com/terms/aibo-privacy.html
FaceBook
Instagram

How to categorise colonial words?

STEP 1
Using a Standford Log-linear Part-Of-Speech Tagger in NLTK. A Part-Of-Speech Tagger (POS Tagger) is a piece of software that reads text in some language and assigns parts of speech to each word (and other token), such as noun, verb, adjective, etc. (https://nlp.stanford.edu/software/tagger.shtml)

The list of part of speech

  1. MD = would, could...
  2. RB = adverb 'very', 'slightly'...
  3. VB = verb
  4. JJ = adjective 'big'...
  5. NN = noun
  6. CC = coordinating conjunction 'and', 'or'...
  7. PRP = personal pronoun 'I', 'he', 'she'...

... more and more

STEP 2
When it's first categorized, I will establish sub-categories into the degree of coloniality.

degree of coloniality (gradation of intensity words)
100.00 = absolute level of coloniality
90.00 = extreme level of coloniality
80.00 = heavy level of coloniality
70.00 = high level of coloniality
60.00 = significant level of coloniality
50.00 =
40.00 = relative level of coloniality
30.00 = moderate level of coloniality
20.00 = reasonable level of coloniality
10.00 = fair level of coloniality
0.00 = neutral level of coloniality

Example

epistemic_VB = { #verbs from FaceApp ToS
    100.00: d("must", "agree","use"),
    90.00: d("use", "bound", "access", "allow", "acknowlegde", "reproduce"),
    80.00: d("choose","claim", "permit", "collect" ),
    70.00: d("change", ),
    60.00: d("create"),
    50.00: d(),
    40.00: d("maintain"),
    30.00: d("support"),
    20.00: d("identify"),
    10.00: d("may"),
    0.00: d(),
}

Project overview

Reference

https://www.cnet.com/how-to/amazon-and-google-are-listening-to-your-voice-recordings-heres-what-we-know/
https://www.cnet.com/news/faceapp-says-its-not-uploading-all-your-photos/
https://twitter.com/rycrist/status/1151479283661115392
FaceBook https://www.wired.com/story/faceapp-privacy-backlash-facebook/
Aibo https://www.cnet.com/news/yes-the-robot-dog-ate-your-privacy/