iText pdf2Data comprises the pdf2Data SDK (available for either Java or .NET), pdf2Data CLI, and the browser-based pdf2Data Template Editor.

As of iText pdf2Data 3.1.0, the Template Editor is also available as a Docker container.

pdf2Data SDK

Java SDK 

Requirements

  • Java 8
  • Recommended minimal hardware configuration:
    • 2 core CPU
    • Memory: 2 GB
    • Temp storage: 2 GB free disk space

The pdf2Data Java SDK requires Java 8 to be installed on your system. iText guarantees software compatibility with both the Oracle JRE and the OpenJDK JRE. We recommend using Java 8 with the latest updates, as pdf2Data might not work properly on older update releases.
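
As a quick way to confirm which Java version a given environment actually runs, the following minimal sketch reads the standard `java.specification.version` system property and reduces it to a major version number. The class and method names are illustrative only and are not part of pdf2Data.

```java
public class JavaVersionCheck {

    /** Converts a java.specification.version value ("1.8", "11", ...) to a major version number. */
    static int majorVersion(String specVersion) {
        String[] parts = specVersion.split("\\.");
        // Java 8 and earlier report "1.x"; Java 9 and later report "9", "10", "11", ...
        if (parts[0].equals("1") && parts.length > 1) {
            return Integer.parseInt(parts[1]);
        }
        return Integer.parseInt(parts[0]);
    }

    public static void main(String[] args) {
        String spec = System.getProperty("java.specification.version");
        int major = majorVersion(spec);
        System.out.println("Java specification version: " + spec + " (major " + major + ")");
        if (major != 8) {
            // This page lists Java 8 as the requirement, so anything else is flagged.
            System.out.println("Warning: the documented requirement is Java 8.");
        }
    }
}
```

Running this with the same `java` binary that will host the SDK shows whether the runtime matches the documented requirement.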

While the Java SDK will work fine on a single core, we recommend using multiple cores if you process documents in parallel using separate threads (one document per thread). We also recommend allocating at least 1.5 GB of Java heap space, for example by starting the JVM with -Xmx1536m.
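
The one-document-per-thread pattern and the heap recommendation can be sketched as follows. This is a minimal illustration under stated assumptions: the per-document `task` is a placeholder for your own extraction code, and `ParallelExtractionSketch`, `processAll`, and `heapMeetsRecommendation` are hypothetical names, not pdf2Data SDK calls.

```java
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.function.Function;

public class ParallelExtractionSketch {

    // 1.5 GB, the recommended minimum heap size.
    static final long RECOMMENDED_HEAP_BYTES = 1536L * 1024 * 1024;

    /** Returns true if the JVM's maximum heap (set with -Xmx) meets the recommendation. */
    static boolean heapMeetsRecommendation() {
        return Runtime.getRuntime().maxMemory() >= RECOMMENDED_HEAP_BYTES;
    }

    /**
     * Applies the task to each document, one document per worker thread at a time.
     * The task is a placeholder for your own extraction logic (not a pdf2Data API).
     * Results are returned in the same order as the input list.
     */
    static <R> List<R> processAll(List<Path> documents, Function<Path, R> task, int threads)
            throws Exception {
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        try {
            List<Future<R>> futures = new ArrayList<>();
            for (Path doc : documents) {
                Callable<R> job = () -> task.apply(doc);
                futures.add(pool.submit(job));
            }
            List<R> results = new ArrayList<>();
            for (Future<R> future : futures) {
                results.add(future.get()); // rethrows any failure from the worker thread
            }
            return results;
        } finally {
            pool.shutdown();
        }
    }

    public static void main(String[] args) throws Exception {
        if (!heapMeetsRecommendation()) {
            System.out.println("Warning: maximum heap is below 1.5 GB; consider -Xmx1536m or more.");
        }
        // Hypothetical input files; replace the lambda with real per-document processing.
        List<Path> docs = Arrays.asList(Paths.get("invoice1.pdf"), Paths.get("invoice2.pdf"));
        int threads = Runtime.getRuntime().availableProcessors();
        List<String> names = processAll(docs, p -> p.getFileName().toString(), threads);
        System.out.println("Processed: " + names);
    }
}
```

Sizing the pool to the number of available cores keeps each thread on its own document without oversubscribing the CPU. Heap usage grows with the number of documents in flight, so the heap size may need to be raised accordingly.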

.NET SDK 

Requirements

  • .NET Framework 4.6.1+
  • Recommended minimal hardware configuration:
    • 2 core CPU
    • Memory: 2 GB
    • Temp storage: 2 GB free disk space

pdf2Data CLI

Requirements

  • Java 8
  • Recommended minimal hardware configuration:
    • 2 core CPU
    • Memory: 2 GB
    • Temp storage: 2 GB free disk space

It is possible to use pdf2Data from the command line as long as you have Java 8 installed.

Template Editor

The pdf2Data Template Editor is a browser-based application. It is traditionally shipped as a Java web application archive (.war), which requires Apache Tomcat for deployment. However, since the release of iText pdf2Data 3.1.0, it is also available as a Docker container.

WAR

Requirements

  • Apache Tomcat 7 (≥ 7.0.77) or 8
  • Java 8
  • Recommended minimal hardware configuration:
    • 2 core CPU
    • Memory: 4 GB
    • Temp storage: 2 GB free disk space

The minimum supported version of Apache Tomcat is 7.0.77. We recommend using Tomcat 8+.
The amount of free disk space required depends on the number of extraction templates you are going to support; we recommend having at least 2 GB of free space on the disk where Tomcat's temp storage is located.
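
To verify the free-space recommendation before deployment, a minimal sketch such as the following can be run against the temp directory. It assumes the directory of interest is either passed as the first argument or is the JVM's `java.io.tmpdir`. Tomcat's startup scripts normally point that property at Tomcat's temp directory, but only for the Tomcat process itself, so pass the path explicitly when running the check standalone. The class and method names are illustrative only.

```java
import java.io.File;

public class TempSpaceCheck {

    // 2 GB, the recommended minimum free space for temp storage.
    static final long REQUIRED_BYTES = 2L * 1024 * 1024 * 1024;

    /** Returns true if the partition holding dir has at least requiredBytes usable. */
    static boolean hasEnoughSpace(File dir, long requiredBytes) {
        return dir.getUsableSpace() >= requiredBytes;
    }

    public static void main(String[] args) {
        // Use the first argument if given, otherwise the JVM's default temp directory.
        String path = args.length > 0 ? args[0] : System.getProperty("java.io.tmpdir");
        File tmp = new File(path);
        long usableMb = tmp.getUsableSpace() / (1024 * 1024);
        System.out.println("Usable space in " + tmp + ": " + usableMb + " MB");
        if (!hasEnoughSpace(tmp, REQUIRED_BYTES)) {
            System.out.println("Warning: less than 2 GB free for temp storage.");
        }
    }
}
```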

Docker

Since pdf2Data 3.1.0, the Template Editor is also shipped as a Docker container that already includes all the required software.
Deploying the Docker container requires Docker with Linux containers to be installed on your system. The required Docker Engine version is 19.03.0 or later.

Requirements

  • Docker 19.03.0+
  • Recommended minimal hardware configuration:
    • 2 core CPU
    • Memory: 4 GB
    • Temp storage: 2 GB free disk space


The volume of required free disk space depends on the number of extraction templates you are going to support; we don't perform auto-cleanup.