Can Sikuli identify text...

Asked by Vaishali

Can sikuli identify same text given in different fonts... is there any way to identify it...

Question information

Language:
English Edit question
Status:
Answered
For:
SikuliX Edit question
Assignee:
No assignee Edit question
Last query:
Last reply:
Revision history for this message
RaiMan (raimund-hocke) said :
#1

Wether Sikuli can read text from a screen or image totally depends on the features of the used package Tesseract.

Sikuli tries to hand over the image containing the text to Tesseract optimised in a way (resized, grayscale), that it is likely, that Tesseract returns the correct text. Depending on many aspects of the font and the used font features, the result might be not useable to 100% correct.

Standard fonts used in the web in normal font sizes (small and extra large are critical) on high contrast, even backgrounds normally return the best results.

You do not get the information of the font, that was used.

Later this year, I will rewrite the text features based on the Tess4J package, that implements a Java API to the Tesseract features, so the C++ layer is obsolete.

Revision history for this message
Eugene S (shragovich) said :
#2

Hi,

It's not clear what you're asking:
   1. Is it possible to recognize SAME text with different fonts
   2. Identify that the same text was represented in a DIFFERENT font

If the question is 1, then the answer is yes, given the text recognition known limitations.
If the question is 2, then some "hacking" will be needed. For example if you have the text appearing on your screen BEFORE its font was changed, you can create a pattern of that text and search for it later. If the pattern does not exist anymore, you can assume that its appearance (e.g. the font) has changed.

If you have a specific scenario and the above is not working for you, please share it.

Cheers,
Eugene

Can you help with this problem?

Provide an answer of your own, or ask Vaishali for more information if necessary.

To post a message you must log in.