Text recognition able or not

Asked by emmanuelbu

Hi,

I have problem in opening pdf format that contain tibetan writings cause they are completely modified.
I just try to make layout modification of this document.

Is it possible to turn off text recognition or something else ?

Thanks for awnser !

Question information

Language:
English Edit question
Status:
Solved
For:
Inkscape Edit question
Assignee:
No assignee Edit question
Solved by:
su_v
Solved:
Last query:
Last reply:
Revision history for this message
mahfiaz (mahfiaz) said :
#1

The problem is that this PDF uses a font with false letters (such as symbol font on windows) and you don't have that font installed. E.g you have character a but in font it looks like a car (or tibetian q or something). If it was unicode font, you wouldn't have that problem. To fix it you could get that font, convert pdf text to outlines (reprinting with acrobat pro might help, not sure) or replace all wrong characters with right unicode ones and use a proper font (tedious, messes up your layout and probably is easier to recreate from scratch).

Revision history for this message
Best su_v (suv-lp) said :
#2

Inkscape has no option to disable "text recognition" when
importing/loading PDF files (yet). It's a known feature request though.

Depending on which OS/platform you use or have access to, there are
workarounds to create an SVG file with outlined glyphs (the text is no
longer a text but converted to paths or clones of paths for glyphs used
multiple times within the text in the document.

If you work on linux, an easy method is to open the PDF file in Evince,
and print to SVG file (in the print dialog, choose 'Print to File', and
select SVG as output format). The resulting SVG file can then be opened
in Inkscape and edited with the usual tools (select tool, node tool, etc.).

Alternatively, there is a command line tool available with more recent
versions of poppler (check the packages available fror your distro),
which can do the same conversion:

$ pdftocairo -svg file.pdf file.svg

The same results might be achieved with the tool pdf2svg from
<http://www.cityinthesky.co.uk/opensource/pdf2svg>
though I haven't tested the rsults of this tool lately myself.

As last option, you could get hold of a current development snapshot
build of Inkscape (via PPA for linux/Ubuntu, or development snapshot
builds for Windows (see download page on inkscape.org)) - trunk builds
have an additional (experimental) PDF import format ("Adobe PDF via
poppler-cairo (*.pdf)") which also converts all texts to outlined paths
(open the PDF file from within Inkscape and choose the specific file
format in the file chooser dialog).

Revision history for this message
emmanuelbu (emmanuel-burnel) said :
#3

Thanks a lot for your answer, the support and the work done to give a such great tool to everyone.
Greetings for all the community

"Inkscape has no option to disable "text recognition" when
importing/loading PDF files (yet). It's a known feature request though."

It could be a really nice feature, in my case. Thank you again.

Revision history for this message
emmanuelbu (emmanuel-burnel) said :
#4

Thanks ~suv, that solved my question.