PythonLabZalan: Difference between revisions

From XPUB & Lens-Based wiki
No edit summary
Line 1: Line 1:
== Optical character recognition + Tesseract ==
== Optical character recognition + Tesseract ==
Firstly I experimented  in Terminal how to translate PDF or JPG to .txt files with tesseract and imagemagick (convert).


[[Optical character recognition]]
[[Optical character recognition]]
Line 8: Line 10:
imagemagick  
imagemagick  
* Mac <code>brew install imagemagick</code>
* Mac <code>brew install imagemagick</code>


== Python3 ==
== Python3 ==

Revision as of 15:41, 24 March 2018

Optical character recognition + Tesseract

Firstly I experimented in Terminal how to translate PDF or JPG to .txt files with tesseract and imagemagick (convert).

Optical character recognition

Tesseract (with languages you will be using)

  • Mac brew install tesseract --all-languages

imagemagick

  • Mac brew install imagemagick

Python3

Natural Language Tool Kit

DrawBot

ACCP (Analogue Circular Communication Protocol