Libraries tagged by extract text
aspose/pdf-sdk-php
24344 Downloads
Aspose.PDF Cloud is a REST API for creating and editing PDF files. It can also convert PDF files to other formats such as DOC, HTML, XPS and TIFF. Create PDFs from scratch or from HTML, XML, a template, a database, XPS or an image, and render PDF pages to image formats such as JPEG, PNG, GIF, BMP and TIFF. The API also lets you manipulate elements of a PDF file, including text, annotations, watermarks, signatures, bookmarks and stamps, manage pages by merging, splitting and inserting them, and add images to a PDF file.
ipwsystems/rtftools
34462 Downloads
Library used to extract raw text from an RTF file
keyword-extractor/keyword-extractor
8278 Downloads
A package to extract keywords from text
linguistic/ngramextractor
1954 Downloads
Extracts ngrams from a given text and does linguistic pre-processing like stopword removal
bureaupartners/extract-address-from-text
2880 Downloads
With this package you can extract the address from an unformatted text string
aspose/pdf
562 Downloads
A powerful library for manipulating and converting PDF files.
oneofftech/parse-client
352 Downloads
Parses PDF documents while keeping their structure.
mdoteu/pdfparser
124 Downloads
Fork of Smalot's PDF parser library with modifications.
lmasforne/pdfparser
20935 Downloads
PDF parser library. Can read and extract information from PDF files.
antonizer/pdfparser
437 Downloads
PDF parser library. Can read and extract information from PDF files. Fork of https://github.com/smalot/pdfparser
hocvt/php-apache-tika
1543 Downloads
Apache Tika bindings for PHP: extracts text from documents and images (with OCR), metadata and more...
venveo/craft-documentsearch
5305 Downloads
Extract the contents of text documents and add to Craft's search index
humanmade/entity-base
657 Downloads
Entity Base is a content analysis plugin that uses the TextRazor Natural Language Processor to extract entities from post content.
hejunjie/address-parser
621 Downloads
An intelligent shipping-address parser that extracts name, mobile phone number, ID number, province/city/district, and detailed address from unstructured text. Suitable for e-commerce, logistics, and CRM systems.
jbpapp/pdf-to-text
872 Downloads
Extract text from a PDF file using the pdf-to-text binary.