Download the PHP package falkemedia/pdf-extractor without Composer
On this page you can find all versions of the php package falkemedia/pdf-extractor. It is possible to download/install these versions without Composer. Possible dependencies are resolved automatically.
Download falkemedia/pdf-extractor
More information about falkemedia/pdf-extractor
Files in falkemedia/pdf-extractor
Package pdf-extractor
Short Description This package automates the generation of an SQLite database that you can use to do a full-text search on a PDF.
License MIT
Homepage https://github.com/falkemedia/pdf-extractor
Informations about the package pdf-extractor
PDF Extractor
This package automates the generation of an SQLite database that you can use to do a full-text search on a PDF. Meaning you take your PDF, use this tool to generate a database and then query the database and not the PDF for any text search.
This tool also generates thumbnails that you can use to display your search results however you like.
This is heavily inspired spatie/pdf-to-image
and has a dependency of spatie/pdf-to-text
Installation
You can install the package via composer:
This package requires the installation of ImageMagic and the imagick php extension.
Instructions for macOS Catalina + PHP 7.3:
If there are any errors with imagemagic I suggest reading through this guide
Also, behind the scenes this package leverages pdftotext. On a mac you can install the binary using brew
Usage
examples/extract_pdf_data.php
If you have a saved sqlite database you can do full-text queries like for example:
Testing
Changelog
Please see CHANGELOG for more information what has changed recently.
Contributing
Please see CONTRIBUTING for details.
Security
If you discover any security related issues, please email [email protected] instead of using the issue tracker.
Credits
- falkemedia
- Robin Reiter
- All Contributors
License
The MIT License (MIT). Please see License File for more information.
PHP Package Boilerplate
This package was generated using the PHP Package Boilerplate.
All versions of pdf-extractor with dependencies
ext-imagick Version *
ext-sqlite3 Version *
intervention/image Version ^2.5
spatie/pdf-to-text Version ^1.3.0