Libraries tagged by extract text
smalot/pdfparser
38991108 Downloads
Pdf parser library. Can read and extract information from pdf file.
spatie/pdf-to-text
7368976 Downloads
Extract text from a pdf
nojimage/twitter-text-php
1949416 Downloads
A library of PHP classes that provide auto-linking and extraction of usernames, lists, hashtags and URLs from tweets.
vaites/php-apache-tika
1505786 Downloads
Apache Tika bindings for PHP: extracts text from documents and images (with OCR), metadata and more...
prinsfrank/pdfparser
28553 Downloads
PHP library to Read and extract text & images from PDFs - Fast & Low memory - Built from scratch
crodas/text-rank
60643 Downloads
Extract relevant keywords from a given text
helgesverre/receipt-scanner
8944 Downloads
Use OpenAI to extract structured receipt and invoice data from Text, Html, Images and PDFs.
nilgems/laravel-textract
5695 Downloads
A Laravel package to extract text from files like DOC, XL, Image, Pdf and more. I've developed this package by inspiring "npm textract".
aymanrb/php-unstructured-text-parser
23489 Downloads
A PHP library to help extract text out of text documents
bureaupartners/extract-address-from-text
7255 Downloads
With this package you can extract the address from a unformatted text string
flow-php/etl-adapter-text
56870 Downloads
PHP ETL - Adapter - Text
aspose/pdf-sdk-php
28440 Downloads
Aspose.PDF Cloud is a REST API for creating and editing PDF files. It can also be used to convert PDF files to different formats like DOC, HTML, XPS, TIFF and many more. Aspose.PDF Cloud gives you control: create PDFs from scratch or from HTML, XML, template, database, XPS or an image. Render PDFs to image formats such as JPEG, PNG, GIF, BMP, TIFF and many others. Aspose.PDF Cloud helps you manipulate elements of a PDF file like text, annotations, watermarks, signatures, bookmarks, stamps and so on. Its REST API also allows you to manage PDF pages by using features like merging, splitting, and inserting. Add images to a PDF file or convert PDF pages to images.
ottosmops/pdftotext
147100 Downloads
Extract text from PDF
sgh/pdfbox
100219 Downloads
PHP5 wrapper for the Apache PdfBox ExtractText utility.
hexydec/htmldoc
10849 Downloads
A token based HTML document parser and minifier. Minify HTML documents including inline CSS, Javascript, and SVG's on the fly. Extract document text, attributes, and fragments. Full test suite.