How Sikuli uses Tesseract

Asked by Alfredo Adragna

Our application uses a font that is not recognized by Sikuli and Tesseract.
We trained tesseract to recognize the font, producing a new eng2.traineddata file.
The file has been verified by means of the command:

tesseract image.png test -l eng2

How can we use this new file?
Does Sikuli use the traineddata files in the standard Tesseract directory?

Thanks in advance
Alfredo & Andrea

Question information

Language: English
Status: Answered
For: SikuliX
RaiMan (raimund-hocke) said :
#1

With version 1.1.0+, after an appropriate setup and a run that uses the text feature, the tessdata folder can be found at <app-data-folder>/SikulixTesseract/tessdata (see docs).

You can then simply replace the bundled standard files with your custom files, keeping the standard naming.
As long as this folder exists, it will not be replaced by the bundled files.
If it does not exist, it will be created as needed with the bundled standard files.
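
The replacement step described above could be scripted roughly as follows. This is only a sketch: the function name `install_traineddata` and its parameters are made up for illustration, and the tessdata path must be supplied by the caller because <app-data-folder> differs per system.

```python
import os
import shutil


def install_traineddata(custom_file, tessdata_dir, standard_name="eng.traineddata"):
    """Copy a custom traineddata file into tessdata_dir under the standard name.

    Hypothetical helper, not part of SikuliX. If a file with the standard
    name already exists, a backup copy with a ".bak" suffix is made first.
    """
    if not os.path.isdir(tessdata_dir):
        # The folder is created by SikuliX itself on the first run that
        # uses the text feature, so it is not created here.
        raise IOError("tessdata folder not found: %s" % tessdata_dir)
    target = os.path.join(tessdata_dir, standard_name)
    if os.path.exists(target):
        shutil.copy2(target, target + ".bak")
    shutil.copy2(custom_file, target)
    return target
```

It would be called as `install_traineddata("eng2.traineddata", tessdata_dir)`, where `tessdata_dir` is the SikulixTesseract/tessdata folder under your own <app-data-folder>.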

Alfredo Adragna (aadragna) said :
#2

Thanks for the quick answer.
We tried what you suggested.
In particular, we removed all the files in <app-data-folder>/SikulixTesseract/tessdata and copied in just our eng.traineddata.
The result is very different from that of the tesseract command specified in the question.
Any other suggestion?
Thanks in advance again.
Alfredo & Andrea

RaiMan (raimund-hocke) said :
#3

OK, then this might be caused by the very basic implementation in SikuliX and the fact that it is not really adapted to the Tesseract 3.0+ features (no changes in the last 5 years).

I am sorry, but you have to take the feature set as is. Only version 2, later this year, will have some enhancements and more options in line with the current state of Tesseract version 3.

You might try the Tess4J package (I will use it in version 2) or, if timing is not a problem, run the Tesseract command in a subprocess and exchange data via files.
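
A minimal sketch of the subprocess approach, assuming the `tesseract` executable is on the PATH and eng2.traineddata is installed in its tessdata directory, as in the command from the question. The function names are made up for illustration, and `image_path` is any image already saved to disk.

```python
import io
import os
import subprocess
import tempfile


def build_tesseract_command(image_path, output_base, lang="eng2"):
    """Build the same command line as in the question:
    tesseract <image> <output base> -l <lang>
    """
    return ["tesseract", image_path, output_base, "-l", lang]


def ocr_with_tesseract(image_path, lang="eng2"):
    """Run tesseract in a subprocess and return the recognized text.

    Tesseract writes its result to <output_base>.txt, which is read back
    afterwards. The temporary folder is left for the caller to clean up.
    """
    work_dir = tempfile.mkdtemp()
    output_base = os.path.join(work_dir, "result")
    # check_call raises CalledProcessError if tesseract exits with an error.
    subprocess.check_call(build_tesseract_command(image_path, output_base, lang))
    with io.open(output_base + ".txt", encoding="utf-8") as f:
        return f.read().strip()
```

Each call starts a new process and goes through the file system, so this is noticeably slower than an in-process library; that is the timing caveat mentioned above.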
