iOCR / vsOCR Setup divided into standard and additional languages

The iOCR / vsOCR setup containing the language and dictionary files of our standard OCR engine is more than 270MB in size. In order to make the downloads and the setups smaller, we decided to split the iOCR / vsOCR into a “base” and an “additional setup”. The basic setup, which is available through our applications, eg. AutoOCR, FileConverterPro, or PDFmdx now only contains a selection of major European languages and has been reduced to 127MB.

If all available languages are to be installed, this is possible at any time. The additionally available “exotic languages” can be installed via a separate setup.

iOCR Basic-languages:

Danish, German, English, Finnish, French, Italian, Catalan, New Greek, Dutch, Norwegian, Polish, Portuguese, Romanian, Russian, Swedish, Slovakian, Slovenian, Spanish, Czech, Turkish, Ukrainian, Hungarian

iOCR extende languages:

Afrikaanis, Albanian, Arabic, Azerbaijani, Bahasa Indonesian, Bengali, Bulgarian, Cherokee, Chinese – Traditional, Chinese – Simplified, Estonian, Franconian, Gallic, Hebrew, Hindi, Icelandic, Japanese, Korean, Croatian, Latvian, Lithuanian, Macedonian, Malay , Serbian, Swahili, Tagalog, Tamil, Telugu, Thai, Vietnamese, Belarusian

Download – iOCR (vsOCR) Setup – Basis Sprachen (ca. 127MB) >>>

Download – iOCR (vsOCR) Setup – zusätzliche Sprachen (ca. 200MB) >>>

 

FileConverter – processing of folders and subfolders – configuration and features

The FileConverter (FC) has several options to control the processing via folders and subfolders.

The first thing important to know is that the processing takes place “transaction oriented”. That means the FC needs to know when the processing of the files from the in-folder and the subfolders it possibly contains can be started – because there could be new files added any time.

FileConverter Verarbeitungsoptionen für Unterordner

Trigger to start the processing – there are 2 possibilities for this:

  • The time of the last writing process of the files is used plus a setable delay. If there are no new files added in this period of time all files of the in-folder get recognized as a transaction and the processing starts.

Caution: If an entire folder or folder structure is copied into the in-folder the “old” creation date of the single documents will be preserved. It only gets set newly if the files and not the entire folder gets copied. In this case “ready” files have to be used or the processing can also be initiated with a “stop” and new “start” of the FC-services.

  • A “ready” file (ready.rd) is used. As soon as this file appears the available content of the in-folder at this moment is recognized as transaction and processed. The contents of the ready file don’t matter – it also can be empty. The name is configurable. If a ready file is used it has to be available in every folder which should be processed – therefor also in the subfolders of the subfolder processing was activated.

Process subfolders – yes / no:

  • If this option is not active only files from the root-in-folder get processed. Possibly underlying subfolders get ignored. Inside the out-folder an unique with date and time as name gets created for every transaction. All files created from the transaction get put into this folder.
  • If this option is active and the option subfolder processing from level” is inactive – all subfolders inside the in-folder get processed also. For each folder / subfolder from the in-folder, independent from the level in which it is located a folder with the same name gets created in the root level of the out-folder. In this case a possibly present folder structure from the in-folder isn’t created in the out-folder. This only happens if the option “subfolder processing from level” was activated.

Process all files – yes / no:

  • With this option it can be controlled what is going to happen of a file from a transaction couldn’t be converted or creates an error. If this option is active all files get processed – if an error occurs the concerning file gets marked (renamed with .err or moved to an error folder). All other files from the transaction get processed though.
  • If this option is not active the whole transaction gets aborted and “faulty” with the occuring of the first error. No other files get processed.

Subfolder processing from level:

With this option it can be controlled from which level the subfolder-processing should start. If e.g. 1 is configured all folders and underlying levels inside the in-folder get processed. The files which are located in the input-folder directly don’t get processed however.

  • If this option is active the same folder structure as in the in-folder (beginning at the defined level) gets build in the out-folder.
  • If this option is not active the folder structure doesn’t get taken over from the in-folder to the out-folder. For every folder (level-independent) a new folder with the same name gets created in the root of the out-folder. Therefor all in-folders get created in one level in the out-folder.

Download – FileConverter – documentc & e-mails to PDF, PDF/A and TIFF >>>

FileConverter – Version 1.0.40 available – new additional HTML converter

Innovations & improvements – FileConverter 1.0.40:

  • With the processing of e-mails (EML, MSG) now also e-mails which contain MSG or EML attachments themselves can be converted
  • An error at the processing of MS-Exchange e-mail boxes via the webservice interface was fixed – the error blocked the service at times by which the service had to be restarted.
  • The underlying converter component was actualized and adjusted to the current state of the FileConverterPro.
  • Especially for the HTML conversion a new converter engine (ASP-direct) was implemented. There are now 3 for choice
  • HiQ-direct = previous „direct conversion“
  • ASP-direct = new HTML converter
  • MS-Office – same as so far

We recommend to use ASP-direct as standard because this converter displays the fonts bigger and therefor the created PDF’s are more readable.

Download – FileConverter – documents & e-mails to PDF, PDF/A and TIFF >>>

FileConverter / FileConverter Pro – install silent – without user input

“msiexec” /qn /uninstall {<productcode>}

Caution: A “silent – uninstall” is only possible as an “administrator” therefor the command has to be given from “CMD”, started as “administrator”, or with a .BAT – file, executed as “administrator”.

Productcode FileConverter:

  • English installation: {E30FC554-4891-4EF0-A30C-F19CA574A39E}
  • German installation: {44D4015B-0570-42F7-92EF-8E4A0DD89ED9}

Productcode FileConverter Pro:

  • English installation: {096365FF-8127-4678-B9E3-7ACB50BAA597}
  • German installation: {11B8A908-9014-4700-81AF-0205D8848D6A}

FileConverter – version 1.0.27 – supports MS-Office 2010 as converter

With the new version 1.0.27 of the FileConverter service now also MS-Office 2010 is supported for the conversion of MS-Word, MS-Excel and MS-PowerPoint files to PDF, PDF/A and TIFF. MS-Office can be configured and used parallel to the consisting, direct conversion or mixed, after folders or e-mail boxes. The conversion is, like with the direct conversion, done silent in the background via the FileConverter Windows service. Required is the installation of the 32bit version of MS-Office on the computer and that the user of the service has opened the MS-Office applications at least once.

With the usage of MS-Office as converter engine 100% quality and the support of all MS-Office features can be guaranteed, which can only can only be reached with the usage of the original application.

To take advantage of the available computer resources or to guarantee an optimal throughput the conversions get, depending on the configuration, processed parallel also. The set standard is 5 parallel processes.

FileConverter - MS-Office as converter

Download – FileConverter – documents & e-mails to PDF, PDF/A and TIFF >>>

FileConverter – automatically convert documents and e-mails from folders or e-mail boxes to PDF, PDF/A and TIFF

The FileConverter is an application, installable as service in MS-Windows (32 and 64bit), to monitor folders and e-mail boxes and automatically convert the contained documents to the PDF, PDF/A or TIFF file format. With that, multiple folders or also MS-Exchange and POP3 mailboxes can be configured and monitored.

The following input-documentformats are supported:

  • DOC, DOCX, RTF, TXT,
  • XLS, XLSX,
  • PPT, PPTX,
  • XFDF, FDF,
  • PNG, BMP, TIF, TIFF, JPG, JPEG
  • ZIP, RAR, 7Z,
  • MSG, EML,
  • PDF,
  • HTM, HTML, MHTML,
  • PMT, PMTX

file format – features:

  • With ZIP/RAR/7Z containers, all containing and supported documents get automatically extracted and converted. The containing folder structure of the container gets build in de output directory.
  • PMT and PMTX – are PDFMerge XML dataformats – which contain hierarchic structure information as well as links to the documents or the documents themself. The FileConverter produces from this files, like the PDFMerge server, a single total PDF file, which is merged from the to PDF converted single documents. The structure defined in the XML gets displayed as PDF-bookmarks.

Conversion:

  • The PDF/TIFF conversion takes place directly without the usage of the source application. So for the processing, no installation of MS-Office or Adobe Acrobat is necessary. Optional, the PDF’s also can be exported in the ISO standardized PDF/A-1b format.
  • In the standard scope also the iOCR engine, for creation of searchable PDF(/A)’s out of PDF or image documents, is implemented. Optional – also Abbyy, the most efficient OCR engine at the moment, can be installed. With the OCR processing, PDF documents get analyzed page by page and only documents which don’t include text information yet get processed (intelligent OCR processing) – this saves resources and increases the quality and the processing speed.

Functions – general:

  • MS-Windows service application for document conversion of MS-Office, PDF, image, HTML, ZIP, MSG and e-mail to PDF, PDF/A or TIFF
  • Multiple folders as well as MS-Exchange and POP3 e-mail boxes can be monitored and processed parallel.
  • Direct conversion without usage of additional necessary source applications (MS-Office, Adobe Acrobat)  or printer drivers.
  • Flattened of filled PDF forms: PDF forms (XFDF,FDF) can be converted into normal PDF documents. The forms either can be deposited fixed or newly loaded every time.
  • Parallel processing with configurable amount of processes – allows the optimal exploitation of the hardware und garants the fast processing.
  • Logging of all conversion instances, forwarding of failed e-mail conversions or sending of error – e-mails via SMTP

In / out folder processing:

  • Processing of files and folders out of configured in / out – folders via time lapse or “ready” file, incl. subfolder processing (one level)
  • Erstellen einer Index-Text-Datei über alle bei einem Verarbeitungsvorgang erzeugten Dateien.
  • After the processing: deleting, moving into archive folder, renaming – of the files or folders (.con / .err)
  • Configuration of the filename extension which shouldn’t be converted – these get ignored and not processed. E-mails with attachments and not identifyable extensions get handled as errors and forwarded to an e-mail address.
  • Single page output with configurable amount of locations for the site index
  • Configuration of the TIFF conversion – compression / color depth / resolution / JPEG-quality
  • extensive parameters for the OCR processing – iOCR or Abbyy – the FileConverter has the same OCR functions as AutoOCR
  • Parameters for the HTML conversion – page size and margins – HTML document and e-mails get scaled automatically.

Processing of e-mail boxes:

  • Processing of POP3 / MS-Exchange e-mail boxes – forwarding  or deleting at successful or incorrect processing, or moving into an archive / error folder under MS-Exchange. Direct access to MS-Exchange 2007/2010/2013 through the SOAP web-service-interface.
  • EML and MSG – body and attachments get converted – generation of the e-mail header information in the body document – from, date, to, subject
  • Output of a XML-file with the processed e-mails with the metadata and file-links – configurable: from, to , cc, bcc, received, subject, body, attachments
  • Output per e-mail in separated subfolders or “flat” in the destination folder.

 

1_FileConverter - general settings - email & folder processing 2_FileConverter - processing options  3_FileConverter - service configuration  4_Fileconverter - SMTP server configuration  5_FileConverter - configuration folder processing  6_FileConverter - configuration e-mail box processing  7_FileConverter - MS-Exchange configuration  8_FileConverter - POP3 configuration  9_FileConverter - TIFF conversion settings  10_FileConverter - OCR settings  11_FileConverter - HTML conversion settings  12_FileConverter - Log

  Download – FileConverter – documents & e-mails to PDF, PDF/A and TIFF >>>

Windows Service – access to network resources – what to consider?

Our document conversion tools – FileConverter, AutoOCR and FileConverterPro – are used to monitor one or more input folders and to automatically start processing for new documents. This can be done either via local drives or via network resources.

Particularly when installing the applications as a service and when using network resource, some things have to be considered with regard to the correct configuration:

  • The service that has access to network resources must run under a user account and not as a system account.
  • The user at which the service runs must have the appropriate rights (read / write / delete) on the network resource.
  • You must not use a mapped drive to access the network resources (in / out / error / archive / log folder), but the direct network Share (UNC path) must be used.
  • The processing option for folder monitoring must be changed from “File System Events” to “Read File Blocks“.

The network connection to a drive letter is managed via the “Network Connection Service”. Mapping a drive so there are some things you should know

  • Option – Reconnect at the logon – Used to automatically restore the drive mapping at the next login.
  • The mapping of the drives is implemented via users – if the user is not logged in, the mapped drives are not available.
  • Mapped drives are not available via a service – regardless of whether the user is running the same account as the user currently logged in – because a service runs only under the “user credentials”, but is not logged in.

In general – even if the applications are not installed and operated as a service – it is recommended to use network shares (UNC paths) instead of mapped drives. A network share – the direct access to the network resource – is always available for the service (under a user account) as well as for normal applications and is defined by the remote server. However, this does not apply to the local system account – it does not have access to network resources and therefore can not be used for a service that must have access to network resources.