David's technobabble Rotating Header Image

Posts Tagged ‘OCR’

OCR’ing all of the PDF files in a SharePoint Document Library using PowerShell and Solid PDF Tools

A recent review of the PDF Documents in our Document Control Library, revealed that most were “image only” PDF’s. We’ve run our document control system on different versions of SharePoint technologies since SharePoint Portal Server 2001. We are currently running SharePoint 2007. I’m surprised that someone did not previously notice that most of our PDF [...]

A walkthrough the code to extend the HP Digital Sending Software

Recently, I wrote about extending the HP Digital Sending Software to “close the gap” and email the OCR’ed result. I received a request to release the code, so I’ve decided to do so with a bit a of documentation.

Making OCR’ed PDF’s using the HP Digital Sending Software

We recently became aware that our fancy HP workgroup printers which can copy a document and email the result as pdf to a set of email addresses only creates image pdf’s. None of the text in the pdf is searchable. After some investigation, we discovered that we needed to install the HP Digital Sending Software [...]

Bad Behavior has blocked 596 access attempts in the last 7 days.