Download the PHP package content-extract/content-processor without Composer

On this page you can find all versions of the php package content-extract/content-processor. It is possible to download/install these versions without Composer. Possible dependencies are resolved automatically.

FAQ

After the download, you have to make one include require_once('vendor/autoload.php');. After that you have to import the classes with use statements.

Example:
If you use only one package a project is not needed. But if you use more then one package, without a project it is not possible to import the classes with use statements.

In general, it is recommended to use always a project to download your libraries. In an application normally there is more than one library needed.
Some PHP packages are not free to download and because of that hosted in private repositories. In this case some credentials are needed to access such packages. Please use the auth.json textarea to insert credentials, if a package is coming from a private repository. You can look here for more information.

  • Some hosting areas are not accessible by a terminal or SSH. Then it is not possible to use Composer.
  • To use Composer is sometimes complicated. Especially for beginners.
  • Composer needs much resources. Sometimes they are not available on a simple webspace.
  • If you are using private repositories you don't need to share your credentials. You can set up everything on our site and then you provide a simple download link to your team member.
  • Simplify your Composer build process. Use our own command line tool to download the vendor folder as binary. This makes your build process faster and you don't need to expose your credentials for private repositories.
Please rate this library. Is it a good library?

Informations about the package content-processor

Content Processor

Production-ready PHP library for batch document processing with intelligent content extraction and structuring.

Framework-agnostic, scalable, and optimized for real-world document pipelines from day one.

๐ŸŽฏ Purpose

Process multiple documents (PDFs, text files, images, etc.), extract their content, and convert it into configurable JSON structures ready for bulk loading into databases or services.

Quick Example

๐Ÿ“ฆ Installation

Or add to your composer.json:

๐Ÿ—๏ธ Project Structure

โšก Quick Start

1. Define Your Schema

2. Configure the Processor

3. Consume Results

๐Ÿงช Testing

Run Examples

Full Test Suite

Code Quality

๐Ÿ”Œ Available Interfaces

ExtractorInterface

StructurerInterface

SchemaInterface

๐Ÿ“‹ Processor Options

โœ… Implemented Features (Blocks 1-5)

Block 1: Core โœ…

Block 2: PDF Support โœ…

Block 3: Semantic Structuring โœ…

Block 4: Final Result API โœ…

Block 5: Security & Hardening โœ…

Block 6: OCR Support (v1.5.0+) ๐Ÿš€

๐Ÿ” OCR Support (Optional)

This library supports OCR for scanned PDFs using Tesseract OCR.

Requirements

Automatic Fallback

OCR is automatically used when:

Example with OCR

Important Notes

๐Ÿ“š Documentation

๐Ÿ”Œ API Reference

FinalResult

๐Ÿš€ Production Ready

The library is tested and ready for production deployment. See SECURITY.md for deployment recommendations.

๐Ÿ“‹ Requirements

๐Ÿ“„ License

MIT


All versions of content-processor with dependencies

PHP Build Version
Package Version
Requires php Version >=8.1
smalot/pdfparser Version ^2.0
Composer command for our command line client (download client) This client runs in each environment. You don't need a specific PHP version etc. The first 20 API calls are free. Standard composer command

The package content-extract/content-processor contains the following files

Loading the files please wait ...