orc

OCR Qt C++ server using tesseract. It can process PDF and some kind of images that ImageMagick can read. Connect to the server using socket (Port 5321) and base 64 encoding/decoding. The list of commands are:

Installation

Just install Tesseract and ImageMagick (convert) command line tools. Then compile the OCR using Qt Creator. To test the system, use the OCR Client (https://github.com/paulocesar/ocr-client).

Commands

Add document

Upload and add one document to the stack:

add|http://www.tcpdf.org/examples/example_001.pdf

it will return a code like 5ac8395a9bed59cba8b6abc9. Use this code to access the document's information.

Remove document

Remove the document and clean the source downloaded execute:

remove|5ac8395a9bed59cba8b6abc9

Get processed page

Get the text from a processed pages:

page|5ac8395a9bed59cba8b6abc9|<num_page>

Get information about the document

Get information about the document like number of pages and already processed pages:

info|5ac8395a9bed59cba8b6abc9

Close connection

quit

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
OCR		OCR
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

orc

Installation

Commands

Add document

Remove document

Get processed page

Get information about the document

Close connection

About

Uh oh!

Releases

Packages

Languages

License

paulocesar/ocr

Folders and files

Latest commit

History

Repository files navigation

orc

Installation

Commands

Add document

Remove document

Get processed page

Get information about the document

Close connection

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages