User:Claxhanson/proto2: Difference between revisions

From XPUB & Lens-Based wiki
(Created page with " ==Tesseract== <br><br> OCR : <br> "Optical character recognition is the conversion of images of text into machine-encoded text. It’s a common method of digitizing printed t...")
 
No edit summary
Line 1: Line 1:


==Tesseract==
=Tesseract=
<br><br>
<br>
OCR : <br>
OCR : <br>
"Optical character recognition is the conversion of images of text into machine-encoded text. It’s a common method of digitizing printed texts so that they can be electronically edited, searched, or stored more compactly, displayed on-line, and used in machine processes.  
"Optical character recognition is the conversion of images of text into machine-encoded text. It’s a common method of digitizing printed texts so that they can be electronically edited, searched, or stored more compactly, displayed on-line, and used in machine processes.  
Line 14: Line 14:


[[File:Tesseractt.png]]
[[File:Tesseractt.png]]
<br>
==ETHER2HTML==
WORKSHOP WITH OSP (Open Source Publishing)
http://osp.kitchen/tools/ether2html/

Revision as of 13:33, 19 April 2020

Tesseract


OCR :
"Optical character recognition is the conversion of images of text into machine-encoded text. It’s a common method of digitizing printed texts so that they can be electronically edited, searched, or stored more compactly, displayed on-line, and used in machine processes.

Tesseract is a way of OCR’ing your scans or pictures though Terminal

How-to :

Install tesseract with brew 🍺 (brew install tesseract) Change directory to the file’s emplacement tesseract filename.jpg filename txt

Tesseractt.png

ETHER2HTML

WORKSHOP WITH OSP (Open Source Publishing)

http://osp.kitchen/tools/ether2html/