User:Pleun/grad/altrightlexicon-practice

From XPUB & Lens-Based wiki
< User:Pleun
Revision as of 15:58, 13 November 2017 by Pleun (talk | contribs) (Created page with "The implementation and part-creation of a tool that will gather jargon on (a) main right-wing forum(s), through Pattern (a Natural Language Processing library in Python), beca...")
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

The implementation and part-creation of a tool that will gather jargon on (a) main right-wing forum(s), through Pattern (a Natural Language Processing library in Python), because of my believe this jargon captures the core of their ideology. I will focus on Reddits subforum The Red Pill first, and then on 4chans subforum /pol/ second. I believe those are two of the most highly influential subforums in the Manosphere and the Alt-Right media-bubble.

I plan to use text scraping to scrape the forum and then start to filter out content so I would be left with non-dictionary words. I could use a text minus text method, where you, for instance, extract all the words used in a New York Times article, from the words used in a forum thread. Then I also want to look at which words, nouns/adjectives are used most, maybe if they are used in a positive or negative way. Next, I can also look at sentiment. Is a text negative or positive and to which standards? The difficulty of a text could be tested with the Flesch/Kincaid Readability Test, where word-length, syllables and sentence length are taken into account.

Urban Dictionary could provide meaning to the jargon that's found.


  • Natural Language Processing

https://pypi.python.org/pypi/redditnlp/0.1.3
– Python Pattern Library: Sentiment

  • Bag-of-words model
  • Urban Dictionary

http://api.urbandictionary.com/v0/define?term=word
https://market.mashape.com/community/urban-dictionary

  • Flesch/Kincaid Readability Test

https://en.wikipedia.org/wiki/Flesch%E2%80%93Kincaid_readability_tests