Download the PHP package mostlyserious/craft-text-extractor without Composer
On this page you can find all versions of the PHP package mostlyserious/craft-text-extractor. These versions can be downloaded and installed without Composer. Dependencies are resolved automatically.
Package: craft-text-extractor
Short Description: A tool to extract text from documents.
License
Information about the package craft-text-extractor
Text Extractor
A tool to extract text from documents and insert it into Craft CMS Asset Elements.
Requirements
This plugin requires Craft CMS 5.0.0 or later, and PHP 8.2 or later.
Features
- Supports PDF (.pdf) and MS Word (.docx) files
  - Password-protected PDF files are not supported.
- Extracts text on Asset creation and when Asset files are replaced
- Includes an Action to extract text from the Assets index view.
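To illustrate the PDF side of the extraction step, here is a rough sketch built on smalot/pdfparser, one of the dependencies listed for this package. It is not the plugin's actual code: the function name `extractPdfText` is hypothetical, and the sketch assumes Composer's autoloader is already loaded (as it is inside a Craft project).

```php
<?php
// Sketch only, not the plugin's implementation.
// Assumes smalot/pdfparser is installed and Composer's autoloader is loaded.

use Smalot\PdfParser\Parser;

/**
 * Return the plain text of a PDF file, or null when the file cannot be
 * parsed (for example, a password-protected PDF).
 *
 * extractPdfText() is a hypothetical helper name used for illustration.
 */
function extractPdfText(string $path): ?string
{
    try {
        // parseFile() reads the PDF and returns a Document object.
        $document = (new Parser())->parseFile($path);

        // getText() concatenates the text of every page.
        return trim($document->getText());
    } catch (\Exception $e) {
        // The parser throws on files it cannot read, including secured PDFs.
        return null;
    }
}
```

Inside Craft, the returned string could then be stored on the Asset with `$asset->setFieldValue('body', $text)` followed by `Craft::$app->getElements()->saveElement($asset)`, where `body` is the default field handle described under Configuration.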
Configuration
Extracted document text is inserted into the custom field whose handle is set in the plugin's configuration. The default field handle is body.
You can customize the handle by adding a plugin config file.
The target field must be a Text field or a CKEditor field.
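A minimal sketch of such a config file follows, assuming Craft's usual convention that plugin settings are overridden by a PHP file in the project's config/ folder named after the plugin handle. Both the file name `config/text-extractor.php` and the setting key `fieldHandle` are assumptions, not confirmed by this page; check the plugin's settings model for the real names.

```php
<?php
// config/text-extractor.php  (file name is an assumption: Craft expects
// config/<plugin-handle>.php, and the plugin handle is not stated here)

return [
    // Hypothetical setting key. The value is the handle of the Text or
    // CKEditor field that should receive the extracted text.
    // The default handle, per the plugin, is "body".
    'fieldHandle' => 'documentText',
];
```

The field with the chosen handle also needs to be added to the field layout of the Asset volume, otherwise there is nowhere for the text to be saved.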
Usage
- Upload files with supported extensions and enjoy!
Thank you to the following packages:
Future Plans and Other Document Parsers
The PHPWord library and other PHPOffice tools look promising, but were more complex than needed for this project at this time.
All versions of craft-text-extractor with dependencies
craftcms/cms Version ^5.0.0
label305/docx-extractor Version ^0.2.3
smalot/pdfparser Version ^2.12.0