PDF2PDFA-CL version 1.1.7 adds an extended command line parameter for ZUGFeRD processing: /zfxml=pdfname. With this parameter, the application looks for an XML file with the same name as the PDF in the folder of the PDF file and uses it to create the ZUGFeRD PDF.
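The lookup described above can be pictured with a minimal Python sketch. It is not the PDF2PDFA-CL implementation; the function name find_zugferd_xml is hypothetical, and the ".xml" extension is an assumption, since the text only says the XML has the same name as the PDF.

```python
from pathlib import Path
from typing import Optional


def find_zugferd_xml(pdf_path: str) -> Optional[Path]:
    """Return the XML file that sits next to the PDF and shares its base name.

    Hypothetical helper: assumes the XML uses the '.xml' extension.
    Returns None when no such file exists.
    """
    candidate = Path(pdf_path).with_suffix(".xml")
    return candidate if candidate.is_file() else None
```

For a file such as invoice.pdf, the sketch would look for invoice.xml in the same folder and return None if it is missing.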
Month: November 2019
Until now, PDF2DOCX (PDF to DOCX) could only convert all pages of a PDF file. It can now also convert individual pages and page ranges “from/to”.
Pages are separated by “,” (comma), page ranges are written from/to with “-” (hyphen), and the last page can be specified with the “$” (dollar) character. Example: “1,5-8,10,13-$”
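To illustrate how such a page specification can be read, here is a small Python sketch. It is not taken from PDF2DOCX; the function name parse_page_spec and the error handling are assumptions made for the example.

```python
from typing import List


def parse_page_spec(spec: str, page_count: int) -> List[int]:
    """Expand a specification such as '1,5-8,10,13-$' into page numbers.

    '$' stands for the last page of the document (page_count).
    Hypothetical helper: raises ValueError for pages outside 1..page_count.
    """
    def resolve(token: str) -> int:
        token = token.strip()
        return page_count if token == "$" else int(token)

    pages: List[int] = []
    for part in spec.split(","):
        part = part.strip()
        if not part:
            continue
        if "-" in part:
            start_text, end_text = part.split("-", 1)
            start, end = resolve(start_text), resolve(end_text)
        else:
            start = end = resolve(part)
        if not 1 <= start <= end <= page_count:
            raise ValueError(f"invalid page range: {part!r}")
        pages.extend(range(start, end + 1))
    return pages


# For a 15-page document the example from the text expands to
# [1, 5, 6, 7, 8, 10, 13, 14, 15]
print(parse_page_spec("1,5-8,10,13-$", 15))
```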
HTML2PDF-CL (HTML to PDF) is a command line application that converts HTML to PDF. Several HTML files can also be converted at once and merged, sorted by name, into one total PDF file. Existing PDF files can be used as input as well, alone or mixed with HTML files. To make it possible to jump to the individual documents, a bookmark is added to the total PDF file for each input file.
- Command line application to convert HTML to PDF.
- Several HTML and PDF files can be combined into one total PDF file.
- Processes ZIP, CHTML and CHTM container files, or input given as file and folder specifications.
- CHTML and CHTM files are ZIP files that bundle all required files (HTML, external images, PDF) in a single container file.
- Input files (HTML, PDF) are sorted by name or according to a TXT file.
- Control of the conversion parameters (page size, margins …) via profiles.
- Profiles are managed and configured in a user interface.
- Merge / Single file processing – Yes / No.
- Subfolder Processing – Yes / No.
- Delete the output file – Yes / No.
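The combination of name-based sorting and one bookmark per input file can be sketched as follows. This is only an illustration, not the HTML2PDF-CL code: the function name plan_bookmarks is hypothetical, the page counts are assumed to be known after conversion, and case-insensitive sorting is an assumption.

```python
from typing import Dict, List, Tuple


def plan_bookmarks(page_counts: Dict[str, int]) -> List[Tuple[str, int]]:
    """Return (file name, first page in the total PDF) for each input file.

    page_counts maps each input file name to its number of pages.
    Files are ordered by name (assumption: case-insensitive).
    """
    bookmarks: List[Tuple[str, int]] = []
    next_page = 1
    for name in sorted(page_counts, key=str.lower):
        bookmarks.append((name, next_page))
        next_page += page_counts[name]
    return bookmarks


# [('a.pdf', 1), ('b.html', 4), ('c.html', 6)]
print(plan_bookmarks({"b.html": 2, "a.pdf": 3, "c.html": 1}))
```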
New features of PDFmdx version 3.8.1:
- Pitney Bowes DataMatrix barcode:
With the Pitney Bowes Relay inserting system, letters or invoices can be inserted automatically. The inserter has a camera that recognizes and reads a DataMatrix 2D barcode printed on the page and uses it to control the inserter. The 2D barcode must have a certain structure. It contains a 14-digit identifier of the document (e.g. the invoice number), the page number within the document, the number of pages of the document and, at the end, a counter that must be continuous throughout the document. With this code, the inserter can recognize when a new letter begins and determine whether a sheet is missing or out of order in the stack.
The stamp variable definition now has its own “Pitney Bowes” checkbox to create such a predefined structure and apply it to the individual pages as a DataMatrix 2D barcode.
Both the input and the output files are processed and written in file-name order, e.g. sorted by the invoice number read from the invoice file. The Merge2Print command line application can then be used to create a sorted total PDF file for printing. Because the entire process must be sorted, only the executable EXE processor of PDFmdx can be used, not the PDFmdx service. In addition, “block processing” must be activated.
- Start processing via *.rd file:
Previously, PDFmdx processing (executable EXE application or Windows service) could be started either by schedule (Timer, Date, Daily, Weekly) or by placing PDF files in a monitored folder. However, there are applications in which all files must be present in the input folder before sorted processing starts. For this purpose there is now the *.rd option. If this option is activated, processing starts only when a *.rd file, e.g. “Ready.rd”, is copied to the monitored folder. This allows processing to be started in a controlled manner at the desired time.
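The *.rd trigger can be pictured with a short Python sketch. It only illustrates the idea and does not reproduce PDFmdx internals; the function name files_ready_for_processing is hypothetical, and matching only lower-case ".pdf" names is a simplification.

```python
from pathlib import Path
from typing import List


def files_ready_for_processing(folder: str) -> List[Path]:
    """Return the PDF files to process, sorted by name, once a *.rd file exists.

    Hypothetical helper: as long as no *.rd trigger file (e.g. 'Ready.rd')
    is present in the monitored folder, nothing is returned.
    """
    inbox = Path(folder)
    if not any(inbox.glob("*.rd")):
        return []
    return sorted(inbox.glob("*.pdf"), key=lambda p: p.name.lower())
```

A caller would poll the monitored folder, process the returned files in order and then remove the trigger file; that last step is an assumption of this sketch.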
Info: As of PDFmdx version 3.8.0, .NET Runtime version 4.5 is required.
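The kind of check an inserter can perform with the Pitney Bowes barcode fields (document identifier, page number, number of pages, counter) can be sketched in Python. This is an illustration under assumptions: the class SheetCode and the function check_stack are hypothetical, the exact barcode encoding is not modelled, and the counter is assumed to increase by one from sheet to sheet.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class SheetCode:
    document_id: str  # 14-digit identifier, e.g. the invoice number
    page: int         # page number within the document
    page_count: int   # number of pages of the document
    counter: int      # running counter (assumed to increase by 1 per sheet)


def check_stack(codes: List[SheetCode]) -> List[str]:
    """Return a list of problems found in a stack of scanned sheets."""
    problems: List[str] = []
    previous: Optional[SheetCode] = None
    for index, code in enumerate(codes):
        if previous is not None and code.counter != previous.counter + 1:
            problems.append(
                f"sheet {index}: counter jumps from {previous.counter} to {code.counter}"
            )
        if previous is None or code.document_id != previous.document_id:
            # A new letter begins; the previous one must have been complete.
            if previous is not None and previous.page != previous.page_count:
                problems.append(f"sheet {index}: document {previous.document_id} is incomplete")
            expected_page = 1
        else:
            expected_page = previous.page + 1
        if code.page != expected_page:
            problems.append(f"sheet {index}: expected page {expected_page}, found {code.page}")
        previous = code
    return problems
```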
Merge2Print is an interactive and command line application that merges several single PDF files from one input folder into one or more total PDF files in an output folder. The individual files are merged in alphabetical order by name. The name of the output file can be configured via variables (counter, date, time). An additional parameter sets the maximum number of pages an output file may contain, which makes it possible to create PDF “print files” with a given maximum page count. A total PDF file can be used to ensure that the documents/pages are sent to a printer in the intended order.
- Merge single PDF files to total PDF file(s)
- Interactive / Command line application
- Processing via input/output folder
- Delete source files – Yes/No
- Max. pages for total PDF file – 0 = no max. pages
- Counter, date, time as variables for the output file
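How alphabetical merging and the maximum page count interact can be sketched in Python. This is not the Merge2Print implementation; the function name plan_print_files is hypothetical, and the sketch assumes that a document is never split across two output files and that an oversized document gets an output file of its own.

```python
from typing import Dict, List


def plan_print_files(page_counts: Dict[str, int], max_pages: int) -> List[List[str]]:
    """Group input files, in alphabetical order, into total PDF files.

    page_counts maps each input file name to its number of pages.
    max_pages = 0 means no maximum, so everything goes into one output file.
    """
    batches: List[List[str]] = []
    current: List[str] = []
    current_pages = 0
    for name in sorted(page_counts, key=str.lower):
        pages = page_counts[name]
        if max_pages > 0 and current and current_pages + pages > max_pages:
            batches.append(current)
            current, current_pages = [], 0
        current.append(name)
        current_pages += pages
    if current:
        batches.append(current)
    return batches


# [['a.pdf', 'b.pdf'], ['c.pdf'], ['d.pdf']]
print(plan_print_files({"a.pdf": 3, "b.pdf": 4, "c.pdf": 2, "d.pdf": 6}, 7))
```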
pdfFM – Folder Merge – can be run interactively or as a command line application. During processing, the three configured inbox folders are searched for files with the same name. Documents with the same name are merged, in the order of the specified folders, into a new complete PDF and stored in the destination folder.
An additional parameter can be used as a page threshold for a second output folder. All documents with fewer pages than the threshold are placed in the first output folder and all other files in the second. A “Pair check – yes/no” option determines whether files with the same name are required in all input folders. If the “Pair check” option is active and a file is missing in one of the folders, the files are not processed but moved to the error folder. Processing is recorded in a log file. In addition to merging into a single PDF, the output file can also be converted to an ISO PDF/A-1b, 2b or 3b file.
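The pair check and the page threshold can be sketched in Python. This is an illustration only, not pdfFM code: the function names are hypothetical, folder contents are modelled as sets of file names, and a document with exactly the threshold number of pages is assumed to go to the second folder.

```python
from typing import Dict, List, Set, Tuple


def plan_folder_merge(
    folders: List[Set[str]], pair_check: bool
) -> Tuple[Dict[str, List[int]], List[str]]:
    """Decide which same-named files are merged and which go to the error folder.

    folders holds the file names of each inbox folder in the configured order.
    Returns (merges, errors): merges maps a file name to the indexes of the
    folders that contribute to it, in merge order.
    """
    merges: Dict[str, List[int]] = {}
    errors: List[str] = []
    for name in sorted(set().union(*folders)):
        present = [i for i, names in enumerate(folders) if name in names]
        if pair_check and len(present) < len(folders):
            errors.append(name)  # a partner file is missing
        else:
            merges[name] = present
    return merges, errors


def output_folder(page_count: int, threshold: int) -> str:
    """Fewer pages than the threshold: first folder, otherwise second."""
    return "first" if page_count < threshold else "second"
```

With the pair check active, a file that exists in only two of three folders would end up in the error list; with the pair check off, the two existing parts would be merged.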
New features of PDFmdx Editor version 3.7.4:
- Automatic backup of templates at startup: The function can be activated and the backup path configured. Backups are marked with date and time and replaced in rotation.
- Search function for conditions: Forward/backward search, full-text search in the conditions. Using the context menu, the layout associated with the condition can be called up and opened directly.
- Comment / separator lines in the condition editor can be deleted or moved up/down.
- Warning for empty condition nodes: Empty condition nodes can lead to unpredictable results during processing. These are now recognized in the condition editor. A warning is displayed to perform a cleanup.
- NOT for conditions: Reverses the logic of a condition.
- Extensible fields: For fields of a moving group, not every record may have the same number of rows, so a field with a fixed vertical size may capture either too many rows or not all of them. With this option, the field can be defined vertically smaller, and all subsequent lines up to the next record are captured in the field. The character inserted at the end of each merged line is configurable (space, semicolon, comma).
- Align the field position and adjust the optimal size: For capturing records of a moving group / subgroup, it is important that the fields are all at a roughly similar vertical position and have the correct vertical size. The size is optimal if the field just encloses the text area vertically, neither larger nor smaller. With narrow lines, setting the size manually can be difficult. There is now a function that automatically aligns the fields vertically and sets them to the optimal size.
- Invert area before OCR detection: OCR only works with dark text on a light background. For light writing on a dark background, the area must be inverted before the OCR recognition. There is now a special image processing function that can be activated for a field and executed before the integrated OCR recognition.
- Always run OCR: A PDF does not always have the correct text in its text layer, for example if the document contains inverted areas with white text on a black background. If “SmartOCR” processing is enabled, an area OCR is only executed if there is no text in the area. It can now be specified for individual areas that OCR is always executed despite existing text, e.g. to invert the area beforehand and get a usable result.
- Compound fields: You can now also create fields that are composed of other fields and texts. These fields can be used for the output.
- Default values for fields can be assigned based on the layout and not just globally.
- Numeric fields can also accept negative values.
- Create a new template from an existing template without the layouts it contains.
- Transfer settings of a template to other templates: Selection of the settings tabs of the source template as well as selection of the target templates.
- PDFSign component has been integrated to protect and sign the generated PDF.
- PDFCompressor component has been integrated to create optimized and compact PDF files.
- Export record filter: Conditions can be used to filter the data record export. Records that meet one of the defined conditions are filtered and not output. Filtered records are marked red in the test function. Conditions can be built from text strings, substrings, regular expressions or “empty” checks on fields, layouts and selection level (document, group, subgroup), combined with AND/OR or NOT relationships.
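The export record filter can be pictured as a set of composable conditions. The following Python sketch is not the PDFmdx implementation; all function names are hypothetical, records are modelled as simple dictionaries of field values, and layouts and selection levels are left out.

```python
import re
from typing import Callable, Dict, List

Record = Dict[str, str]
Condition = Callable[[Record], bool]


def equals(field: str, text: str) -> Condition:
    return lambda record: record.get(field, "") == text


def contains(field: str, text: str) -> Condition:
    return lambda record: text in record.get(field, "")


def matches(field: str, pattern: str) -> Condition:
    compiled = re.compile(pattern)
    return lambda record: compiled.search(record.get(field, "")) is not None


def is_empty(field: str) -> Condition:
    return lambda record: record.get(field, "").strip() == ""


def all_of(*conditions: Condition) -> Condition:  # AND
    return lambda record: all(c(record) for c in conditions)


def any_of(*conditions: Condition) -> Condition:  # OR
    return lambda record: any(c(record) for c in conditions)


def negate(condition: Condition) -> Condition:  # NOT
    return lambda record: not condition(record)


def export_records(records: List[Record], filters: List[Condition]) -> List[Record]:
    """Keep only records that meet none of the filter conditions."""
    return [r for r in records if not any(f(r) for f in filters)]
```

A filter list such as [is_empty("total"), all_of(contains("name", "TEST"), negate(matches("number", r"^9")))] would drop records without a total as well as test records whose number does not start with 9.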
New features of PDFmdx Processor version 3.7.4:
- Call a command line application: After all documents from the input area of a job have been processed, a command line application can be called, for example pdfFM to merge files with the same name from multiple folders into a single PDF. If processing takes place via the PDFmdx Windows service, the command line application must not display a dialog and must run “silent”.
- Locked files are detected and not processed: If a file to be processed is locked, it can neither be processed nor moved to an error folder. Such files are marked with a *.lock file and are not processed further. To process such a file later, only the *.lock file has to be deleted.
- Output – repetition: If an output device (share / network drive) is not immediately available or responds too slowly, the waiting time and the number of repetitions can now be set before processing treats this as an error and stops.
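The output repetition can be sketched as a simple retry loop. This Python example only illustrates the idea and is not PDFmdx code; the function name wait_for_output and its parameters are hypothetical, and checking for an existing directory is an assumption about what “available” means.

```python
import os
import time
from typing import Callable


def wait_for_output(
    path: str,
    retries: int,
    wait_seconds: float,
    exists: Callable[[str], bool] = os.path.isdir,
    sleep: Callable[[float], None] = time.sleep,
) -> bool:
    """Check the output location, retrying before giving up.

    Returns True as soon as the location is available. Returns False after
    the initial attempt plus `retries` repetitions have all failed, at which
    point the caller would treat the situation as an error.
    """
    for attempt in range(retries + 1):
        if exists(path):
            return True
        if attempt < retries:
            sleep(wait_seconds)
    return False
```

The exists and sleep parameters are injectable so that the waiting behaviour can be tested without a real network share.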