Unable to extract the exact text from an image

Asked by siddu

Hi,

I'm using Sikuli to test an app and it is working fine overall, but I have run into the issue below while extracting text from images.

Issue 1:

The image contains the text "Quantity value is invalid". When I read it with the code below, it returns "Quantityvalue is invalid": the space between "Quantity" and "value" is missing, so the exact text is not returned.

# Find the message on screen, highlight it for 1 second, then read its text via OCR
textSearch = find("sikuli.png").highlight(1)
textFound = textSearch.text()
print "Grepped text is", textFound

Output:
Actual output: Grepped text is Quantityvalue is invalid.
Expected output: Grepped text is Quantity value is invalid.

I'm using Sikuli 1.1.0 (with options 1 and 3) on a Windows machine.

Thanks in advance.

Regards,
Siddu

Question information

Language: English
Status: Solved
For: SikuliX
Assignee: No assignee
Solved by: RaiMan
Revision history for this message
Launchpad Janitor (janitor) said :
#1

This question was expired because it remained in the 'Open' state without activity for the last 15 days.

RaiMan (raimund-hocke) said :
#2

Be aware that the current implementation of the Text/OCR feature is problematic, so you will run into many oddities and bugs.

There are many Q&As on that.

siddu (siddum-meti) said :
#3

Will these Text/OCR issues be fixed in the next release of Sikuli, and may I know when the next release will be?

Thanks and regards,
Siddu

Best answer: RaiMan (raimund-hocke) said :
#4

I will surely optimize the text/OCR features in version 2, but I cannot tell you now what will actually be fixed by then or which new features will be available.

I guess that pre-release versions with revised text/OCR features will be available beginning April 2016.

But you always have the option of using the Tesseract package standalone, at least for your OCR purposes.
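
For anyone wanting to try the standalone route, here is a minimal sketch of calling the Tesseract command-line tool from a plain Python script. It assumes Tesseract is installed separately and that its executable is reachable as "tesseract" on the PATH. The helper names build_tesseract_command and ocr_image, and the file name error_message.png, are made up for illustration and are not part of Sikuli or Tesseract. It relies only on the basic invocation "tesseract <image> <outputbase>", which writes the recognized text to <outputbase>.txt.

```python
import os
import shutil
import subprocess
import tempfile


def build_tesseract_command(image_path, output_base, executable="tesseract"):
    """Return the argument list for: tesseract <image> <output_base>.

    Tesseract itself appends ".txt" to output_base when it writes the result.
    The default executable name assumes Tesseract is on the PATH.
    """
    return [executable, image_path, output_base]


def ocr_image(image_path, executable="tesseract"):
    """Run the Tesseract command-line tool on image_path and return its text."""
    work_dir = tempfile.mkdtemp()
    try:
        output_base = os.path.join(work_dir, "ocr_result")
        # Raises CalledProcessError if Tesseract exits with an error,
        # or OSError if the executable cannot be found.
        subprocess.check_call(
            build_tesseract_command(image_path, output_base, executable)
        )
        with open(output_base + ".txt") as result_file:
            return result_file.read().strip()
    finally:
        # Remove the temporary directory and the result file inside it.
        shutil.rmtree(work_dir, ignore_errors=True)


# Hypothetical usage, with error_message.png being a saved screenshot
# of the region that shows the message:
#     text = ocr_image("error_message.png")
#     print(text)
```

Whether standalone Tesseract keeps the space between "Quantity" and "value" depends on the image and the Tesseract version, so the result still needs to be checked against the actual screenshot.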

siddu (siddum-meti) said :
#5

Thanks RaiMan