DropOCR – version 1.2.5 available

New in DropOCR version 1.2.5:

  • The AutoOCR processing profile can be selected directly from the context menu of the tray icon application
  • New “Cancel all jobs” function, which immediately cancels all currently running transfers and processes
  • The “AutoStart” option is now enabled by default
  • The maximum page count is now preset to 1000
  • The connection data for the AutoOCR test server is preconfigured during installation

Screenshots: DropOCR - context menu of the tray icon application; DropOCR - configuration settings 1.2.5

Download – DropOCR >>>

FileConverterPro (FCpro) – PDF(/A) conversion service with SOAP / REST web service

FileConverterPro is installed as a Windows service and converts the most important document formats to PDF or PDF/A, including OCR, through a web service interface (SOAP or REST).

FCpro uses the same base components as FileConverterPDFMerge and AutoOCR. Adjustments and extensions are therefore available in the same way for all of these applications.

The web service interface of FCpro is compatible with that of AutoOCR, so client applications run against either service without modification. For example, our Alfresco / ifresco Transformer integration can be operated with AutoOCR for pure OCR processing, or with FCpro to process all document formats including OCR.

As with AutoOCR, a ready-to-use .NET / C# sample application is available for the FCpro service, both as an EXE and as source code. It can be used to test the FCpro functions immediately, or the code can serve as a basis for integration into your own applications.

PDF or PDF/A conversion – all important file formats (MS Office, image, e-mail, HTML and so on) are converted to PDF or PDF/A automatically. Normally neither MS Office nor any other component is required: the conversion takes place directly, without additional applications or printer drivers. Optionally, MS Office 2010/2013 can be used as the converter component if it is available or if a “high fidelity” conversion of Office formats is required. Image and PDF documents can be made searchable with the integrated iOCR. With an additional license, the Abbyy OCR engine can also be used.

Supported source file formats:

  • DOC, DOCX, RTF, TXT
  • XLS, XLSX
  • PPT, PPTX, PPS, PPSX
  • FDF, XFDF (Adobe forms)
  • XML
  • PNG, BMP, TIF, TIFF, JPG, JPEG, GIF
  • ZIP, RAR, 7Z
  • MSG, EML
  • PDF
  • HTM, HTML, MHTML
  • PMTX (PDFMerge)
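
A client might want to check a file against this list before uploading it. The following is a minimal Python sketch of such a check, not part of FCpro. The extension set is copied from the list above, while the function name `is_supported` is hypothetical and chosen only for illustration.

```python
from pathlib import Path

# Extensions copied from the "Supported source file formats" list.
SUPPORTED_EXTENSIONS = {
    "doc", "docx", "rtf", "txt",
    "xls", "xlsx",
    "ppt", "pptx", "pps", "ppsx",
    "fdf", "xfdf",
    "xml",
    "png", "bmp", "tif", "tiff", "jpg", "jpeg", "gif",
    "zip", "rar", "7z",
    "msg", "eml",
    "pdf",
    "htm", "html", "mhtml",
    "pmtx",
}


def is_supported(filename: str) -> bool:
    """Return True if the file extension is in the list (case-insensitive).

    Hypothetical helper: it only inspects the name, not the file content.
    """
    suffix = Path(filename).suffix  # e.g. ".PDF", or "" if there is none
    return suffix[1:].lower() in SUPPORTED_EXTENSIONS
```

The check looks only at the file name. How the service itself identifies formats is not described here.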

Functions – general:

  • MS Windows service application with a SOAP / REST web service interface for converting Office, PDF, image, HTML, ZIP, MSG and e-mail documents to PDF or PDF/A. Communication is encrypted via HTTPS.
  • Processing profiles – all settings can be preconfigured in profiles and then retrieved and applied.
  • Direct conversion without requiring the original applications.
  • For “high fidelity” conversion of MS Office documents, MS Office 2010 / 2013 can additionally be installed and used.
  • Unpacking and conversion of container files (ZIP, 7ZIP, RAR, MSG, EML, PMTX) into a single overall document. The container structure is shown as bookmarks, and placeholder pages are inserted for documents that cannot be converted.
  • Images and scans (TIF, JPEG, PNG, BMP, GIF, PDF) can be converted to searchable PDFs with the integrated iOCR. The Abbyy OCR engine is available as an option.
  • Parallel processing with a configurable number of processes and priorities makes optimal use of the hardware and ensures fast throughput.
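
To illustrate how a container's folder structure can map onto a bookmark hierarchy, here is a small Python sketch that turns the paths inside a ZIP archive into an indented outline. It is an illustration only and makes no claim about how FCpro does this internally. The function names `bookmark_outline` and `sample_archive` are hypothetical, and no PDF is produced.

```python
import io
import zipfile


def bookmark_outline(zip_bytes: bytes) -> list[str]:
    """Return an indented outline mirroring the folder structure of a ZIP."""
    with zipfile.ZipFile(io.BytesIO(zip_bytes)) as archive:
        # Skip explicit directory entries; folders are derived from file paths.
        names = sorted(n for n in archive.namelist() if not n.endswith("/"))

    outline = []
    seen_folders = set()
    for name in names:
        parts = name.split("/")
        # Emit each folder once, the first time a file below it is encountered.
        for depth in range(len(parts) - 1):
            folder = "/".join(parts[: depth + 1])
            if folder not in seen_folders:
                seen_folders.add(folder)
                outline.append("  " * depth + parts[depth] + "/")
        outline.append("  " * (len(parts) - 1) + parts[-1])
    return outline


def sample_archive() -> bytes:
    """Build a small in-memory ZIP with made-up file names for demonstration."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as archive:
        archive.writestr("invoices/2014/a.pdf", b"")
        archive.writestr("invoices/b.docx", b"")
        archive.writestr("readme.txt", b"")
    return buffer.getvalue()


if __name__ == "__main__":
    for line in bookmark_outline(sample_archive()):
        print(line)
```

Each folder becomes a parent entry and each file a leaf beneath it, which matches the nesting that PDF bookmarks can express.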

Special functions:

  • ZIP/RAR/7Z containers – all contained, supported documents are extracted automatically, converted and merged into a single overall PDF document. The folder structure of the container is shown in the PDF output document as the bookmark structure.
  • MSG / EML – e-mails can contain any attachments, including nested ones. These documents are extracted as well. Placeholder pages are inserted for formats that cannot be converted, and the structure is shown via PDF bookmarks.
  • PDF/A conversion – FCpro is also a PDF to PDF/A converter. Converted documents can be produced as PDF/A-1b or, with embedded source documents, in the ISO-standardized PDF/A-3b format. This makes the FCpro service ideally suited to long-term archiving of documents and e-mails.
  • PMTX – an XML data format from PDFMerge that contains structure and processing information as well as the documents themselves. From it, PDFMerge creates a single overall PDF file consisting of the converted and merged individual files. The PDFMerge structure is shown via the PDF bookmarks.
  • FDF, XFDF – PDF form data can be merged with PDF forms and converted to a “normal” PDF.
  • Stamps, watermarks, stationery – can be configured and applied
  • Intelligent OCR of PDF – PDF documents are analyzed page by page to determine whether OCR is required. Pages that already contain text are not OCR-processed again, and bookmarks and links are preserved. This saves time and resources and improves quality.
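
The page-by-page decision described in the last item can be sketched as follows. This is a simplified Python illustration under stated assumptions, not FCpro's actual logic. The `Page` model, the `min_chars` threshold and the rule "image content but no text layer" are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Page:
    """Hypothetical, simplified view of one PDF page."""
    number: int
    text: str        # text already present in the page's text layer
    has_image: bool  # whether the page contains raster image content


def pages_needing_ocr(pages, min_chars=1):
    """Return the numbers of pages that have image content but fewer than
    min_chars characters of existing text (assumed heuristic)."""
    return [
        page.number
        for page in pages
        if page.has_image and len(page.text.strip()) < min_chars
    ]


if __name__ == "__main__":
    document = [
        Page(1, "Invoice 2014-001", True),  # already searchable: skipped
        Page(2, "", True),                  # scan without text: needs OCR
        Page(3, "   ", True),               # whitespace only: needs OCR
        Page(4, "", False),                 # empty page, nothing to recognize
    ]
    print(pages_needing_ocr(document))
```

Skipping pages that already carry text is what saves processing time in this sketch. A real analysis would need to inspect the PDF content itself.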

Functions – PDF-export settings – part of the processing profiles

  • Filling PDF profile fields with fixed values or variables (original values, profile name, date, time, PC name, user, file name, application, pages, PDF level, user variables)
  • Web-optimization (yes / no)
  • Preserve existing bookmarks (yes / no)
  • Settings for opening the PDF
  • Security settings – password-opening, system, restrictions
  • Pagination – position, start, offset, text (current page, pages), font, color, masking underlying area
  • Stationery / PDF watermarks – underlay / overlay, file selection, opacity (%), position
  • Text stamp – one or more stamps, text or variables (like profile fields incl. bookmarks), start, offset, font, style (incl. outline), size, color, opacity (%), angle
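
As a rough illustration of how a pagination text with variables for the current page and the page count could be expanded, consider the Python sketch below. The `$page` / `$pages` placeholder syntax and the meaning given to `start` (first page that receives a label) and `offset` (value added to the printed numbers) are assumptions made for this example. The actual FCpro variable syntax and semantics are not described here.

```python
from string import Template


def page_labels(total_pages, template="Page $page of $pages", start=1, offset=0):
    """Return {physical page number: label text}.

    ASSUMPTIONS (hypothetical semantics, not taken from FCpro):
    - labels are applied from physical page `start` onwards
    - `offset` is added to both the printed page number and the page count
    """
    labels = {}
    for page in range(start, total_pages + 1):
        labels[page] = Template(template).substitute(
            page=page + offset,
            pages=total_pages + offset,
        )
    return labels


if __name__ == "__main__":
    for number, label in page_labels(3, start=2, offset=10).items():
        print(number, label)
```

The same substitution approach would apply to text stamps that use variables such as the profile name or file name.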

FCpro user interface:

Screenshots:

  • Config of web-service endpoints
  • Conversion profiles
  • Advanced settings
  • Advanced settings - web-service user config and rights
  • Advanced settings - service account config
  • Advanced settings - MIME types config
  • Icon tray functions

FCpro – conversion profile:

Screenshots:

  • Conversion profile config - office documents
  • Conversion profile config - image documents
  • Conversion profile config - HTML documents
  • Conversion profile config - XML
  • Conversion profile config - PDF/A and PDF export settings
  • Conversion profile config - FDF / XFDF forms
  • Conversion profile config - OCR settings

FCpro – conversion profile – OCR:

Screenshots:

  • iOCR settings #1
  • iOCR settings #2 - image processing
  • iOCR settings #3 - language selection
  • iOCR settings #4 - language selection
  • Abbyy OCR settings - predefined profiles
  • Abbyy OCR settings - general settings
  • Abbyy OCR settings - recognition - image processing
  • Abbyy OCR settings - recognition - page analysis
  • Abbyy OCR settings - recognition - page synthesis
  • Abbyy OCR settings - PDF export parameter

Available FCpro applications / clients:

The FCpro server provides its functionality to other applications through a SOAP / REST web service interface. The following applications and integrations are available for FCpro or use its functions:

1.)    FileConverterPro – WCF service sample – this client application is installed along with the FCpro setup. It can be used to try out and test all functions available via the web service. Besides the EXE, the application is also available as C# source code, so that FCpro functions can be used quickly and easily from your own applications.

2.)    DropConvert – convert documents to PDF(/A) via drag & drop or folder monitoring. DropConvert is a Windows client application that communicates with the FCpro service. Documents that are dragged onto the always-on-top “DropZone” or placed in a monitored folder are converted to PDF or PDF/A. The result documents are transferred back to the client and stored in a configurable output folder. The FCpro server is called over HTTPS, either on the local network or externally over the internet.

3.)    EMail Archiver – the EMail Archiver is an MS Outlook 2010 / 2013 plug-in that converts single or multiple selected e-mails, or even whole e-mail folders and subfolders with all contained e-mail messages, to PDF or PDF/A directly from MS Outlook. Processing and conversion run on the FCpro server, which is called over HTTPS on the local network or externally over the internet. The resulting PDF(/A) files are stored in the file system under a configured start folder, with a path built from variable information extracted from the e-mail.

PDF/A, and PDF/A-3 in particular, is well suited to ISO-standardized long-term archiving of e-mails. With PDF/A-3 the original MSG / EML messages are also embedded in the PDF container.

4.)    Alfresco / ifresco – Transformer – installing the “ifresco Transformer” AMPs for Alfresco enables PDF(/A) conversion and OCR processing through the FileConverterPro server. If only OCR is required, the AutoOCR server can be used instead. Processing of the supported document formats to PDF, PDF/A and/or with OCR is then available via Java, JavaScript, REST, as a “transform” action on folders, and in Alfresco Share as a “transform” document action.

FCpro – versions, licensing, scope

FileConverterPro is available in a basic version and in an extended version including PDF/A and OCR. With the extended version, the Abbyy OCR engine can optionally be licensed in addition to iOCR. Abbyy licenses are available either page-based (monthly or total volume) or processor-based. The FCpro standard license is per server, but there are also “Enterprise” licenses for any number of servers per company and “OEM” licenses for developers who integrate FCpro into their own applications.

The FCpro server setup also includes the WCF service sample application, including source code, and the MS Windows application “DropConvert”, which can be installed and used on any workstation without restrictions.

Download – FileConverterPro (FCpro) ~250MB >>>

DropOCR – version 1.2.1 available

New in DropOCR version 1.2.1:

  • User interface switchable between German and English
  • HTTP and HTTPS support
  • Logging of conversion processes, with an option to delete the log file
  • AutoStart function to start the application when the PC is started
  • Double-click on the Drop Zone opens the destination folder
  • AutoOCR test server preconfigured

Our AutoOCR test server is reachable via the following URL and may be used for testing purposes:

  • https://autoocr.may.co.at:8001/AutoOCRService2/
  • User: admin
  • Password: autoocr
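
For readers who want to script a first connection attempt, the sketch below prepares an HTTPS request to the test server with Python's standard library, without sending it. The URL and credentials are the ones listed above. The use of HTTP Basic authentication is an assumption made for illustration only, since the authentication scheme of the service is not stated here, and the helper name `build_request` is hypothetical.

```python
import base64
import urllib.request

# Values published above for the AutoOCR test server.
TEST_SERVER_URL = "https://autoocr.may.co.at:8001/AutoOCRService2/"
TEST_USER = "admin"
TEST_PASSWORD = "autoocr"


def build_request(url: str, user: str, password: str) -> urllib.request.Request:
    """Prepare (but do not send) a GET request carrying a Basic auth header.

    ASSUMPTION: Basic authentication is used purely as an example; the
    service may expect a different scheme.
    """
    token = base64.b64encode(f"{user}:{password}".encode("utf-8")).decode("ascii")
    request = urllib.request.Request(url, method="GET")
    request.add_header("Authorization", f"Basic {token}")
    return request


request = build_request(TEST_SERVER_URL, TEST_USER, TEST_PASSWORD)
```

Passing the prepared object to `urllib.request.urlopen(request)` would actually send it. That step needs network access and is left out here.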

Screenshots: DropOCR configuration; DropOCR - context menu - DropZone

Download – DropOCR >>>

ifresco AutoOCR Transformer – OCR processing integrated with Alfresco Share

The AutoOCR Server is integrated via REST as a dynamically configurable Alfresco document transformer. AutoOCR creates searchable PDFs or other document formats such as TXT, DOC(X), XLS(X), PPT(X), XML, RTF and HTML from image or PDF files. The OCR functions can be used via Java, JavaScript or as a document transformer. Configuration is done from the Share UI, which also gains a new document action “Transform” that gives access to all Alfresco transformers.

AutoOCR is an OCR server / service based on the OCR engine from Abbyy. The AutoOCR server has a REST web service interface, which was used to integrate it with Alfresco. AutoOCR converts image or PDF files to searchable PDFs. In addition to PDF, other document formats such as TXT, DOC(X), XLS(X), PPT(X), XML, RTF and HTML can also be created.

Configuration is simple and uses OCR profiles that bundle all available settings. The direct integration of AutoOCR into Alfresco is delivered as an AMP install module. OCR functions are available in Alfresco as a dynamically configurable transformer. Appropriate bindings also allow the OCR services to be used from JavaScript and Java. From Alfresco 4.0 on, configuration and monitoring are done directly in the Share administration console.

In addition, we have extended the Alfresco Share document actions with the Alfresco Transformer integration. Transformer functions are available on any document via the Share interface and allow documents to be converted into different formats.

AutoOCR as Alfresco Transformer:

The OCR function can be bound to a folder as an action. If, for example, a scanned document is placed in this folder, processing starts automatically and the document is passed to the AutoOCR server. The result is a searchable PDF or other document format that can immediately be found through the Alfresco full-text index.

AutoOCR JavaScript binding for Alfresco:

The JavaScript API allows direct access to the AutoOCR service from Alfresco scripts. All features of the AutoOCR API can be addressed from repository JavaScript (web script controller scripts, scripted actions). This API is completely independent of the integration of the AutoOCR service as an Alfresco transformer.

Alfresco Share – “Transform” document action

Adding the “transform” document action to the Share UI lets you use all of your Alfresco transformers, not only the AutoOCR transformers. The “transform” action is implemented generically and is not OCR-specific.

Highlights / features:

  • Direct AutoOCR integration as Alfresco transformer with REST web service interface.
  • Separate AutoOCR service / server which does not strain the Alfresco server
  • Based on ABBYY – the leading OCR engine
  • Easy configuration by selecting OCR profiles – each profile bundles all available ABBYY OCR engine settings.
  • In addition to PDF other output formats can be generated (TXT, RTF, DOC, etc.)
  • Dynamic transformer configuration at runtime using the Alfresco Share Admin interface.
  • JavaScript client for the AutoOCR service, available in Alfresco repository scripts (WebScripts, actions, etc.)
  • Java client for the AutoOCR service, for use in Java code.
  • The Java client itself has no dependencies on Alfresco.
  • New Share document action “Transform” enhances Share not only with OCR but with all supported transformers.

Requirements:

  • Alfresco 4.x – dynamic configuration via the Share user interface
  • Alfresco 3.x – manual configuration without the Share UI
  • AutoOCR version 1.9.8 or later, running as a service on Microsoft Windows
  • ABBYY FineReader Engine 10 (starting with 10,000 pages per month)

Screenshots: AutoOCR admin status; AutoOCR admin transformer configuration; AutoOCR admin jobs; AutoOCR action menu; Share action dialog; Share action transform waiting; Share action results; Share action transformed documents

Step by Step – Setup & Installation documentation for ifresco AutoOCR Transformer >>>

Test and Demo version is available – please contact us for details >>>

Price information can be found here >>>