Extract text from images using tesseract-ocr #14

@ubmarco

To detect headlines (see issue #13), the font size and style (bold/italic) should also be extracted.
See https://stackoverflow.com/questions/39324626/get-font-size-in-python-with-tesseract-and-pyocr and tesseract-ocr/tesseract#1074, especially this comment in that issue: tesseract-ocr/tesseract#1074 (comment).
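
As a rough starting point, font size could be approximated from word bounding-box heights rather than read from Tesseract's font attributes. The sketch below is only a heuristic under stated assumptions: it expects a dict shaped like the output of `pytesseract.image_to_data(image, output_type=pytesseract.Output.DICT)` (keys `page_num`, `block_num`, `par_num`, `line_num`, `height`, `text`), and the helper names `line_heights` and `headline_candidates` as well as the `ratio` threshold are hypothetical. It does not detect bold or italic.

```python
from collections import defaultdict
from statistics import median


def line_heights(data):
    """Group word boxes by text line.

    Returns {line_key: (median_word_height, line_text)}.
    `data` is assumed to be shaped like pytesseract's image_to_data DICT output.
    """
    words = defaultdict(list)
    for i, text in enumerate(data["text"]):
        if not text.strip():
            continue  # skip rows without recognized text (page/block/line rows)
        key = (
            data["page_num"][i],
            data["block_num"][i],
            data["par_num"][i],
            data["line_num"][i],
        )
        words[key].append((data["height"][i], text))
    return {
        key: (median(h for h, _ in items), " ".join(t for _, t in items))
        for key, items in words.items()
    }


def headline_candidates(data, ratio=1.3):
    """Return lines whose height is at least `ratio` times the median line height.

    `ratio` is an arbitrary, hypothetical threshold and needs tuning per document.
    """
    lines = line_heights(data)
    if not lines:
        return []
    body_height = median(h for h, _ in lines.values())
    return [text for h, text in lines.values() if h >= ratio * body_height]
```

Word box height depends on which glyphs a word contains (ascenders, descenders), so this is a coarse proxy for font size; the hOCR output discussed in the linked Tesseract issue may give better values.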

Labels

enhancement: New feature or request
