Libraries tagged by Text extractor
cpierce/pdf2text
15962 Downloads
Client library for extracting text from PDF files
hexydec/htmldoc
8125 Downloads
A token-based HTML document parser and minifier. Minify HTML documents, including inline CSS, JavaScript, and SVGs, on the fly. Extract document text, attributes, and fragments. Full test suite.
nlpcloud/nlpcloud-client
17029 Downloads
NLP Cloud serves high-performance pre-trained or custom models for NER, sentiment analysis, classification, summarization, paraphrasing, grammar and spelling correction, keyword and keyphrase extraction, chatbot, product description and ad generation, intent classification, text generation, image generation, code generation, question answering, automatic speech recognition, machine translation, language detection, semantic search, semantic similarity, tokenization, POS tagging, speech synthesis, embeddings, and dependency parsing. It is ready for production and served through a REST API. This is the PHP client for the API. More details here: https://nlpcloud.com. Documentation: https://docs.nlpcloud.com. GitHub: https://github.com/nlpcloud/nlpcloud-php
hello-solucoes/pdf-to-text
21678 Downloads
Extract text from a PDF
nilgems/laravel-textract
3462 Downloads
A Laravel package to extract text from files such as DOC, XL, image, PDF and more. This package was inspired by "npm textract".
rosette/api
21123 Downloads
PHP Interface for Babel Street Text Analytics
flow-php/etl-adapter-excel
608 Downloads
PHP ETL - Adapter - Excel
aspose/pdf-sdk-php
24333 Downloads
Aspose.PDF Cloud is a REST API for creating and editing PDF files. It can also be used to convert PDF files to different formats like DOC, HTML, XPS, TIFF and many more. Aspose.PDF Cloud gives you control: create PDFs from scratch or from HTML, XML, template, database, XPS or an image. Render PDFs to image formats such as JPEG, PNG, GIF, BMP, TIFF and many others. Aspose.PDF Cloud helps you manipulate elements of a PDF file like text, annotations, watermarks, signatures, bookmarks, stamps and so on. Its REST API also allows you to manage PDF pages by using features like merging, splitting, and inserting. Add images to a PDF file or convert PDF pages to images.
ipwsystems/rtftools
34430 Downloads
Library used to extract raw text from an RTF file
linguistic/ngramextractor
1953 Downloads
Extracts ngrams from a given text and does linguistic pre-processing like stopword removal
bureaupartners/extract-address-from-text
2880 Downloads
With this package you can extract the address from an unformatted text string
oneofftech/parse-client
352 Downloads
Parse PDF documents while keeping their structure.
aspose/pdf
562 Downloads
A powerful library for manipulating and converting PDF files.
mdoteu/pdfparser
124 Downloads
Fork of Smalot's PDF parser library with modifications.
lmasforne/pdfparser
20935 Downloads
PDF parser library. Can read and extract information from PDF files.