OCR problem

Asked by saifin on 2018-11-20

so i m trying to capture a text from a image using .text() method
the problem is it captures the text by skipping spaces
i mean if the text is "hello world" in the image it captures as helloworld and ignores the spaces and sometimes it also gives false special characters

Question information

Language:
English Edit question
Status:
Answered
For:
Sikuli Edit question
Assignee:
No assignee Edit question
Last query:
2018-11-21
Last reply:
2018-11-21
RaiMan (raimund-hocke) said : #1

you might try with version 1.1.4

If you stick with 1.1.3 or earlier, you have to live with the poor OCR

saifin (saifin) said : #2

i checked 1.1.4 and it still shows some characters incorrect

below is the text in the image

"Problem List has not been verified"

and below is what i get

"Problem List has not been veri?ed"

RaiMan (raimund-hocke) said : #3

You have to live with that, since I did not find a solution for that until now:
In some fonts an fi is read as the unicode character fi (u'\ufb01') (not sure what that is)

Can you help with this problem?

Provide an answer of your own, or ask saifin for more information if necessary.

To post a message you must log in.