User:Andre Castro/2/spam: Difference between revisions

From XPUB & Lens-Based wiki
No edit summary
Line 36: Line 36:


A character describes an individual that:
A character describes an individual that:
* is present on one or more emails
* can present on one or more emails
* is related to other character (present or not within the emails)
* can related to other character (present or not within the emails)
* has a role in the emails
* has a role in the emails
* has a history to herself
* has a history to herself
Line 44: Line 44:


'''It would be interesting to translate the relationships of characters to a graph. GRAPHYS??'''
'''It would be interesting to translate the relationships of characters to a graph. GRAPHYS??'''
==db==
In the db I used a '''Character table'''.  The elements of this table are related to the emails, where they are referred. They are also related to other characters.
<pre>
        EMAIL A                  EMAIL B
        /    \                        |
        /      \                      |
CHARACTER A    CHARACTER B      CHARACTER C    CHARACTER D
    |  |        |                    | |            |
    |  -----------                    | |            |
    -----------------------------------  -------------
</pre>
* Who is each of the characters?
* How is he related to other characters?
===in django===
Currently in django I have the following table
<source lang="sql">
    "name" varchar(200) NOT NULL UNIQUE,
    "birth" date,
    "death" date,
    "nationality" varchar(200),
    "occupation" varchar(200),
    "bio" text,
    "in_media" text,
    "other" text,
    "photo" varchar(100),
    "rate" integer NOT NULL
</source>

Revision as of 11:51, 21 January 2013

Characters

I began identifying and describing the characters I find in the spam messages from the database.

several reasons sparked my interest for the characters found in spam:

  • one character reappears across emails
  • often, if portrayed as a rich character happens to be a real person
  • the rich (real) character (often):
    • is known publicly
    • is rich
    • got killed
    • is referred to in mass-media, specially due to its controversial death
    • left money behind, who the narrator-sender is in possession but needs the email recipient's help to get hold of it
    • is related to other characters (either mentioned or not mentioned on the emails)

What characters do we find?

in spanish prison/fraud emails

  • sender
    • A)simple kind a guy. Working on a bank or insurance company who proposes a business with you, for the profit of both. It is somehow a corrupt character, but not utterly corrupt. Only wants a bit more than he has, a better living. He is not greedy, or at least is greedy as any of us is. Often he is Africa.
    • B) A sick and religious widow, who want to donate her deceased husband fortune. She has no family left, therefore asks for the recipient's help to help her transfer the money to charity. She trust in the good-willingness and loyalty of the recipient.
    • C) An America soldier in Iraq or Afghanistan who comes across a large some of money and needs the recipient' s help to take it out the country he is in. (Goes often by the name of David Joseph - clear Christian tone)


  • the rich man - the source of the money
    • A) often a rich man, with businesses going on, often in an African country, but he is a foreigner. Often he appears to be related to other characters that the email doesn't mention.
    • B) A wealthy man who died in either Iraq or Afghanistan. His fortune was left to his wife.


in dating/come and marry emails

method

I have added to the database of spam a new table: Character

A character describes an individual that:

  • can present on one or more emails
  • can related to other character (present or not within the emails)
  • has a role in the emails
  • has a history to herself
  • has a future


It would be interesting to translate the relationships of characters to a graph. GRAPHYS??

db

In the db I used a Character table. The elements of this table are related to the emails, where they are referred. They are also related to other characters.

        EMAIL A                  EMAIL B
         /    \                        |
        /      \                       |
CHARACTER A    CHARACTER B       CHARACTER C    CHARACTER D
    |  |         |                    | |             |
    |  -----------                    | |             |
    -----------------------------------  -------------


  • Who is each of the characters?
  • How is he related to other characters?

in django

Currently in django I have the following table

    "name" varchar(200) NOT NULL UNIQUE,
    "birth" date,
    "death" date,
    "nationality" varchar(200),
    "occupation" varchar(200),
    "bio" text,
    "in_media" text,
    "other" text,
    "photo" varchar(100),
    "rate" integer NOT NULL