User:Tancre/Special Issue 8/The new annotating reader
Revision as of 01:40, 21 February 2019
The new annotated reader
I've added:
- a reader sample
- a home page (only Gramsci's link is working so far)
- a folder with the texts as txt files, ready to be translated into html
- an html template to use for each text's page
+++++
- a sketch of the #Text format
- something to add to the #Bot
+++++
- you can read some stuff I'm writing and researching related to the annotated reader
Things to do:
_Text Format
- format texts
- recognize h1 and h2 when translating into html
- translate into the line-numbered structure
- html template
- create all the pages + working links
- (tools and scripts to format) not needed now
_Bot
__chat
- add commands (list of texts, search, comment)
- add classes (text, img, video) to the tag
- handle pdfs, images from a link, and video from a link (needs a particular preset)
__html
- style pages
- logo/title
- infos
Text format
The text's formatting process is the translation of a text (pdf, html, epub, txt) into an html page with the text in a line-numbered structure.
1. copy the text into a txt file
2. format the text so that it can be converted into structured html
- first line as title/author + empty line
- subtitles as emptyline/line/emptyline
- body of the text in lines of 60-70 characters maximum
3. the formatted txt is translated into html with a line-numbered structure
4. the html with the line-numbered structure is inserted in the body of an html template
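A minimal sketch of step 2, assuming the txt already separates title, subtitles and paragraphs with empty lines (getting there is the manual part). The function name `format_text` and the default width of 70 are choices made for this example, not part of the existing scripts. It only re-wraps each block so that no line is longer than the maximum.

```python
import re
import textwrap


def format_text(raw, width=70):
    """Re-wrap a plain text so that no line exceeds `width` characters.

    Assumption: blocks (title, subtitles, paragraphs) are separated by
    one or more empty lines in the input.
    """
    # Split on empty lines and drop blocks that contain only whitespace.
    blocks = re.split(r"\n\s*\n", raw.replace("\r\n", "\n"))
    blocks = [block.strip() for block in blocks if block.strip()]

    formatted = []
    for block in blocks:
        # Join the block into one string, then wrap it at `width`.
        text = " ".join(line.strip() for line in block.splitlines())
        formatted.append("\n".join(textwrap.wrap(text, width=width)))

    # One empty line between blocks: a one-line block therefore ends up
    # in the emptyline/line/emptyline form used for subtitles.
    return "\n\n".join(formatted) + "\n"
```

A short paragraph that fits on a single line will look exactly like a subtitle after this step, so such cases still need a manual check.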
To sum up:
- original text > txt (manual process) //maybe this process can be automated by giving a link
- txt > formatted txt (python script) //apparently this is the most difficult part, we need a set of tools to help this process
- formatted txt > line-numbered structure (python script) //we need to add: the first line becomes h1; emptyline/line/emptyline becomes h2
- line-numbered structure > html (python script + html template) //probably the html template can be inserted in the python script
If step 2 works, everything can go into one script. Otherwise we need to keep step 2 separate, but we can merge steps 3 and 4.
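A possible sketch of steps 3 and 4 merged into one script. The markup is an assumption made for this example (each body line as `<p class="line" id="line-N">`), and so are the names `TEMPLATE`, `to_numbered_html` and `render_page`; the real template and tags may differ. The first non-empty line becomes h1, a line with an empty line before and after it becomes h2, and every other line gets a number.

```python
import html

# Hypothetical stand-in for the real html template.
TEMPLATE = """<!DOCTYPE html>
<html>
<head><meta charset="utf-8"><title>{title}</title></head>
<body>
{body}
</body>
</html>
"""


def to_numbered_html(formatted):
    """Translate a formatted txt into html with numbered lines."""
    lines = formatted.splitlines()
    out = []
    number = 0
    for i, line in enumerate(lines):
        text = line.strip()
        if not text:
            continue
        safe = html.escape(text)
        prev_empty = i > 0 and not lines[i - 1].strip()
        next_empty = i + 1 < len(lines) and not lines[i + 1].strip()
        if not out:
            # First non-empty line: title/author.
            out.append(f"<h1>{safe}</h1>")
        elif prev_empty and next_empty:
            # emptyline/line/emptyline: subtitle.
            out.append(f"<h2>{safe}</h2>")
        else:
            number += 1
            out.append(
                f'<p class="line" id="line-{number}">'
                f'<span class="number">{number}</span> {safe}</p>'
            )
    return "\n".join(out)


def render_page(formatted):
    """Insert the numbered html into the body of the template."""
    non_empty = [line.strip() for line in formatted.splitlines() if line.strip()]
    title = html.escape(non_empty[0]) if non_empty else ""
    return TEMPLATE.format(title=title, body=to_numbered_html(formatted))
```

Headings are not numbered here, so line numbers count only the body of the text; whether titles should take a number too is an open choice.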
Bot
The bot communicates with the xmpp chatroom. All kinds of comments are accepted (img, text, pdf, audio); if a comment cannot be displayed, its link is shown as text.
From chat to html
The line-numbered structure works fine, but it is important to:
- insert the comment div after the closing tag of the sentence (a dedicated comment tag could be created) >> see the html code of this
- add a class related to the content (text, img, video) >> see the html and css code of this
- decide what happens if there are too many comments on a line
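The first two points could look roughly like this. It is only a sketch: it assumes each line of the page is a `<p class="line" id="line-N">` element (hypothetical markup), guesses the class from the file extension of a link, and uses made-up names (`comment_class`, `render_comment`, `add_comment`). It does not answer the question of too many comments on one line.

```python
import html
import re

# Assumed extensions; extend as needed.
IMG_EXT = (".png", ".jpg", ".jpeg", ".gif")
VIDEO_EXT = (".mp4", ".webm", ".ogv")


def comment_class(comment):
    """Return 'img', 'video' or 'text' depending on the comment content."""
    lowered = comment.strip().lower()
    if lowered.startswith(("http://", "https://")):
        if lowered.endswith(IMG_EXT):
            return "img"
        if lowered.endswith(VIDEO_EXT):
            return "video"
    return "text"


def render_comment(comment):
    """Build the comment div, with a class related to its content."""
    kind = comment_class(comment)
    safe = html.escape(comment.strip(), quote=True)
    if kind == "img":
        inner = f'<img src="{safe}">'
    elif kind == "video":
        inner = f'<video src="{safe}" controls></video>'
    else:
        # Plain text, or a link that cannot be displayed: shown as text.
        inner = safe
    return f'<div class="comment {kind}">{inner}</div>'


def add_comment(page_html, line_number, comment):
    """Insert the comment div right after the closing tag of the line."""
    pattern = re.compile(
        r'<p class="line" id="line-%d">.*?</p>' % line_number, re.DOTALL
    )
    match = pattern.search(page_html)
    if match is None:
        raise ValueError(f"line {line_number} not found")
    end = match.end()
    return page_html[:end] + "\n" + render_comment(comment) + page_html[end:]
```

With this version each new comment lands directly under the line, so the newest comment comes first.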
Usage
We need to decide how the text to comment on is chosen and how to find the line (for example if I'm reading the text from a pdf or an actual book). I was thinking of these three usages and commands to address them:
- I want to know which texts are present
command(titles) > number(X) + author + title + date
titles
> 1 Gramsci - On Ideology - 1930 / 2 Marx, Engels - The Ruling Ideas - 1883 / ...
- I want to know which line contains this text 'text to search'
X(number of the text you want to search in) + text to search > line number(Y) or numbers
1 groups of States
> 11
- I want to comment on text X at line number Y
X + Y + comment > add your comment to the chosen line
2 27 ciaociaociao
> the comment is added
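The three commands above could be parsed along these lines. This is a sketch and not the actual bot code: the function `handle_message`, the shape of the `texts` dictionary and the error replies are all assumptions, and the xmpp part is left out entirely.

```python
def handle_message(message, texts):
    """Parse one chat message and return the bot's reply as a string.

    Assumed shape of `texts`:
    {1: {"author": ..., "title": ..., "date": ...,
         "lines": [...], "comments": {}}, ...}
    """
    parts = message.strip().split()
    if not parts:
        return "empty message"

    # titles > number(X) + author + title + date
    if parts[0].lower() == "titles":
        return " / ".join(
            f"{number} {text['author']} - {text['title']} - {text['date']}"
            for number, text in sorted(texts.items())
        )

    if not parts[0].isdigit() or int(parts[0]) not in texts:
        return "unknown command or text number"
    text = texts[int(parts[0])]

    # X + Y + comment > add the comment to line Y of text X
    if len(parts) >= 3 and parts[1].isdigit():
        line_number = int(parts[1])
        if not 1 <= line_number <= len(text["lines"]):
            return "no such line"
        comment = " ".join(parts[2:])
        text["comments"].setdefault(line_number, []).append(comment)
        return "the comment is added"

    # X + text to search > line number(s)
    query = " ".join(parts[1:]).lower()
    if not query:
        return "nothing to search"
    hits = [
        str(number)
        for number, line in enumerate(text["lines"], start=1)
        if query in line.lower()
    ]
    return " ".join(hits) if hits else "not found"
```

Two limits of this version: a search that starts with a number followed by more words is read as a comment, and a phrase split across two lines is not found. Both would need a decision, for example a separate keyword for search.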