Question

Pega OCR

Hello

I am trying to use the built-in OCR functionality Pega has recently released.

The article below outlines a new component Pega has built, the "DocumentOCR" component.

https://community.pega.com/knowledgebase/articles/extract-content-image-based-files

I have made sure to install the latest version of the Studio environment, and I have downloaded and run the exe from the "Robotic Automation OCR Essentials - for Robotics SP1 2003 and higher" download. I have an existing web automation. I know I have installed OCR Essentials correctly because I am able to use methods like "ClickByOcr" on certain windows, for example when I interrogated a "Save as" window.

My goal is to use the new functionality in the Document OCR component to take an existing PDF that contains a scanned image and use the ProcessToPDF method outlined in the linked article to convert the scanned image into text. For some reason I am unable to find the ProcessToPDF method in the Toolbox, nor am I able to find the Document OCR component. Any assistance with setting up the Document OCR component or using its methods would be very welcome.

Thanks

Comments

November 2, 2018 - 6:13am

Hello Team,

Please guide us in finding the document OCR component.

November 6, 2018 - 10:22am
Response to RajasekharReddy

Hi,

Please right-click somewhere inside the Toolbox and click "Choose Items". Then find DocumentOcr under the ".Net Framework Components" tab, select its checkbox, and click OK.

After that, it is available in the Toolbox.

Drag and drop it onto the Global Container and you will be able to use the "ProcessTo..." methods.

December 27, 2018 - 2:39am

Hi,

I still cannot get Document OCR to appear under ".Net Framework Components".

Is there anything I am missing?

Please help, it is urgent!

Thanks

Kathyayini

December 27, 2018 - 9:59am
Response to KathyayiniG

Hi,

Please check your Pega Robotics version: it should be 8.0.2006 or higher. Make sure you have installed the OCR Essentials component, and try reinstalling it to see whether any errors occur during installation. The ".NET Framework Components" tab takes time to load all available components, so give it a little time before searching. If none of this helps, you may want to open an SR to get hands-on assistance.

Pega
December 27, 2018 - 11:34am
Response to KathyayiniG
  1. First double-check that you are on a version that has DocumentOcr (8.0.2006+).
  2. Confirm that you have the DLL available by checking your installation directory. You should see a DLL called OpenSpan.DocumentOcr.dll.
  3. Go back to Studio, right-click an area in the Toolbox and select "Choose Items..."
    1. Filter for "Ocr".
    2. If you do not see the DocumentOcr component, then manually add the namespace: select "Browse" and select the DLL OpenSpan.Document.dll in the installation directory.
    3. Check for DocumentOcr again.
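
Steps 1 and 2 above can also be checked with a small script before opening Studio. The following is only a rough Python sketch, not a Pega tool: the function names and the installation path in the usage section are hypothetical, and the version string has to be supplied by hand (for example from the Studio About dialog). The minimum version and the DLL file name are the ones given in this thread.

```python
from pathlib import Path
from typing import Tuple

# Minimum version and DLL name as stated in this thread.
MIN_VERSION = (8, 0, 2006)
DLL_NAME = "OpenSpan.DocumentOcr.dll"


def parse_version(text: str) -> Tuple[int, ...]:
    """Turn a dotted version string such as '8.0.2006' into a tuple of ints."""
    return tuple(int(part) for part in text.strip().split("."))


def meets_minimum(installed: str, minimum: Tuple[int, ...] = MIN_VERSION) -> bool:
    """Return True if the installed version is at least the minimum."""
    return parse_version(installed) >= minimum


def has_dll(install_dir: str, dll_name: str = DLL_NAME) -> bool:
    """Return True if the named DLL is a file directly inside install_dir."""
    return (Path(install_dir) / dll_name).is_file()


if __name__ == "__main__":
    # Hypothetical values: replace with your own version and install folder.
    installed_version = "8.0.1089"
    install_dir = r"C:\path\to\your\robotics\install"

    print("Version OK:", meets_minimum(installed_version))
    print("DLL found:", has_dll(install_dir))
```

If the version check fails, upgrading is the first thing to try. If the version is fine but the DLL is missing, reinstalling OCR Essentials as suggested above is the likely next step.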

 

December 27, 2018 - 11:42pm

Hi 

Thanks for the response.

I have the 1089 version which I downloaded from the academy course.

Where can I get the latest version?

Thanks

Kathyayini

Pega
December 28, 2018 - 8:42am
Response to KathyayiniG

If your company has purchased licenses, you should be able to get the latest version from Digital Delivery - https://community1.pega.com/community/pega-support/question/pega-digital-delivery-pilot

May 2, 2019 - 9:50am

Hi 

I am not able to find the latest Pega Robotics installer. Can anyone share the download link for the latest version so that I can download it and use the OCR component?

Thanks!

May 2, 2019 - 3:10pm
Response to SagarM13

The latest software available to you will be here on the Digital Software Delivery page.

Marissa | Community Moderator | Pegasystems Inc.

October 16, 2019 - 12:43pm