I have a lot of documents to scan and convert to PDF.
The goal is one of three things: an image-only PDF (which is nothing more than a bundle of JPGs), a searchable image PDF (an invisible text layer is hidden behind the page image so the file can be searched), or a full-text PDF.
Strangely, I couldn't find any OCR software tests anywhere. The other topics on HFR are a bit dated. I mostly scan documents for archiving and for sending legal documents via FTP.
I tried OCR with Adobe Acrobat 7 and my scanners (an old Agfa 1212 and a recent Epson 2480), and I'm not really satisfied with the results. I'm looking for the best compromise between fidelity to the original document and file size.
I experimented with different resolutions and the result is always awful (even when I increase the resolution a lot, I sometimes get strange effects). That is to say, I often end up with half of a sentence left as a bitmap and the other half converted to text but in different fonts, etc.