Stucked to the failed analysis of an answer pdf

Asked by Mitvailer

Hello,

I am testing queXF to choose the best ORM regarding a problematic we have to answer to in my company.

I am working on Windows, with an install of the last version (I downloaded the self-extracting package available on sourceForge : queXSuite-queXML-1.4.0-queXS-1.10.3-queXF-1.15.0-queXC-0.9.4.exe). Therefore, I run all the services using xampp.

You can find all the files (original XML, questionnaire PDF, banding XML, scanned pdf answer and print screen of the results) in my dropbox : https://www.dropbox.com/sh/h9zy0as5qjsnccm/AAAWouTe-ESNKa1GG29ozGyVa?dl=0

I took the XML example provided in the package, modified it a bit (changed a few labels and incremented the id of the questionnaire to prevent conflicts).

I generated the pdf and the banding XML, loaded the pdf with queXF and banded it with the banding XML.

Then I printed two questionnaires, filled them, and loaded them using the "Import a directory of PDF files". I also checked with the "Test form compatibility with queXF" tool that the corners and the barcode are in the right area.

Nevertheless, when I verify the result as an opearator (the questionnaire corresponding to the answer is well detected), the result is a real mess. I think only one box was correctly detected, but that I removed it by clicking on it (section 2, sub question 2, the blue box was sligthly to the upper right of the real box).

My questions are the following :
   - Why is the tool not recognizing my answers?
   - Why is the displayed pdf of far lower quality than the one I provided ? Is it of any importance for the analysis, or is it just the display to the operator ?
   - Is it possible for an operator to manually define the answers of a questionnaire, by clicking on the areas of the boxes for example, in the case where the optical recognition did not work well?
   - Is it possible for an operator to Reject a pdf which optical recognition failed?
I only saw the verifier can accept it or add a note on it, but not reject it or skip it. Additionally, I could not accept it, probably since fields where missing to the analyzer. I didn't even find a way to delete it as an admin, unless there is a duplicate form or something like that, so I found myself locked as a verifier, and went to the db to remove stuff.
   - What is a duplicate form / What is the PFID ? How are they defined ?
I suppose it is what allows to associate, when we have multiple pages, a page 2 of an aswer to the page 1 of the same answer, and not another page 1 of another answer of the same questionnaire. If not, how is this done ?
   - Can an admin delete an answer to a questionnaire, instead of just deleting duplicate forms or allowing it to be imported again?

This is a lot of questions, so thank you for your time, and I hope it will help other new users later on, as there is no use manual at the moment ^^

Thank you!

Question information

Language:
English Edit question
Status:
Answered
For:
queXF Edit question
Assignee:
No assignee Edit question
Last query:
Last reply:
Revision history for this message
Adam Zammit (adamzammit) said :
#1

Dear Mitvailer,

Thank you for providing the files which have assisted my responses below:

- Why is the tool not recognizing my answers?

I believe you have not completed the "page setup" step. Please go to "Page setup" choose your form, click on the page number, then click on "Finished page setup" This writes the correct edge values to the database. You will then need to import your forms again (can use the "allow import again" field under "Successfully imported files")

- Why is the displayed pdf of far lower quality than the one I provided ? Is it of any importance for the analysis, or is it just the display to the operator ?

The display to the verifier is currently resized to the config.inc.php directive DISPLAY_PAGE_WIDTH pixels (default 800) therefore the quality is reduced but it is a smaller file and fits on most screens. It is just for display purposes.

- Is it possible for an operator to manually define the answers of a questionnaire, by clicking on the areas of the boxes for example, in the case where the optical recognition did not work well?

Yes this is the purpose of verification. The operator can either click on the boxes or press the number keys on the keyboard then the "Enter" key to proceed to the next question to verify.

  - Is it possible for an operator to Reject a pdf which optical recognition failed? I only saw the verifier can accept it or add a note on it, but not reject it or skip it. Additionally, I could not accept it, probably since fields where missing to the analyzer. I didn't even find a way to delete it as an admin, unless there is a duplicate form or something like that, so I found myself locked as a verifier, and went to the db to remove stuff.

Currently an operator cannot reject a PDF. This would be a good feature to add. Currently for our operations we ask our verification staff to add a page note, then click on the "Accept page" link to proceed to the end of the form. We then review the page notes for any forms to re-import.

   - What is a duplicate form / What is the PFID ? How are they defined ? I suppose it is what allows to associate, when we have multiple pages, a page 2 of an aswer to the page 1 of the same answer, and not another page 1 of another answer of the same questionnaire. If not, how is this done ?

A duplicate form is a form that has been allowed to be imported again more than once (using the function "Successfully imported files"). Usually when queXF scans a directory for files to import it will not import if there is a matching filename in the database, or if there is a matching SHA1 sum of the file in the database. The PFID field is supposed to be unique for a file. Once a form is allowed to be imported again, it becomes a "duplicate form" and should share a PFID with the previously imported form(s).

 - Can an admin delete an answer to a questionnaire, instead of just deleting duplicate forms or allowing it to be imported again?

Currently this would need to be done in the database manually or by excluding records when exporting data.

- This is a lot of questions, so thank you for your time, and I hope it will help other new users later on, as there is no use manual at the moment ^^

You're welcome. Please let me know if I have not answered all your queries clearly.

Regards,
Adam Zammit

Revision history for this message
Adam Zammit (adamzammit) said :
#2

Also I noticed with your original form - an additional reason for the decrease in quality is that it doesn't appear to be scanned in monochrome (1 bit) mode at 300DPI. This is required to get the best quality result.

Can you help with this problem?

Provide an answer of your own, or ask Mitvailer for more information if necessary.

To post a message you must log in.