Download the PHP package joest8/pdfinterpreter without Composer

On this page you can find all versions of the php package joest8/pdfinterpreter. It is possible to download/install these versions without Composer. Possible dependencies are resolved automatically.

FAQ

After the download, you have to make one include require_once('vendor/autoload.php');. After that you have to import the classes with use statements.

Example:
If you use only one package a project is not needed. But if you use more then one package, without a project it is not possible to import the classes with use statements.

In general, it is recommended to use always a project to download your libraries. In an application normally there is more than one library needed.
Some PHP packages are not free to download and because of that hosted in private repositories. In this case some credentials are needed to access such packages. Please use the auth.json textarea to insert credentials, if a package is coming from a private repository. You can look here for more information.

  • Some hosting areas are not accessible by a terminal or SSH. Then it is not possible to use Composer.
  • To use Composer is sometimes complicated. Especially for beginners.
  • Composer needs much resources. Sometimes they are not available on a simple webspace.
  • If you are using private repositories you don't need to share your credentials. You can set up everything on our site and then you provide a simple download link to your team member.
  • Simplify your Composer build process. Use our own command line tool to download the vendor folder as binary. This makes your build process faster and you don't need to expose your credentials for private repositories.
Please rate this library. Is it a good library?

Informations about the package pdfinterpreter

Pdf Interpreter

Introduction

This class is designed to convert multiple PDF files, whether image-based or text-based, into an array of data. The class uses user-defined templates containing regular expressions to control the data extraction process, allowing for customized and flexible output.

Table of Contents

This README is divided into several sections:

Installation

Console Applications

To use this class, you'll need to install the following applications:

  1. Poppler (necessary to convert pdf to text and get information about number of pages in file)
  2. Tesseract (necessary to read and interpret png file)
  3. ImageMagick (necessary to convert pdf->png)

Make sure you have a package-manager installed on your system.

Automated installation

Run the following code from the source folder to autoinstall all dependencies and tesseract language files:

Manual installation with homebrew

If homebrew is installed run the following commands to install the Homebrew packages:

Manual installation of Tesseract Language Files

You also need to install the required Tesseract language files. You can check the available languages at: https://github.com/tesseract-ocr/tessdata_best/

Download the necessary language files and place them in the appropriate directory. To find the directory use:

Usage

Create Object

Get Sample Output

Using the get_sample_output-Method will allow you to get a sample of a text output without any interpretation of patterns.

Set new template

Using the add_new_template-Method will help you to create a new template. For more informations about the demanded parameters read the DocBloc of the method.

Add pattern to template

Using the add_pattern_to_template-Method will help you to add a new pattern to an existing template. For more informations about the demanded parameters read the DocBloc of the method.

Get Template

Using the get_template-Method will return the entire template. For more informations about the demanded parameters read the DocBloc of the method.

Delete Template

Using the delete_template-Method will delete the entire template. For more informations about the demanded parameters read the DocBloc of the method.

Convert Files from Folder

Using the convert_folder-Method will convert all files from a folder into an array of data. For more informations about the demanded parameters read the DocBloc of the method.

Convert File

Using the convert_file-Method will convert a single file into an array of data. For more informations about the demanded parameters read the DocBloc of the method.


All versions of pdfinterpreter with dependencies

PHP Build Version
Package Version
Requires php Version ^8.2
symfony/process Version ^6.3
Composer command for our command line client (download client) This client runs in each environment. You don't need a specific PHP version etc. The first 20 API calls are free. Standard composer command

The package joest8/pdfinterpreter contains the following files

Loading the files please wait ....