Libraries tagged by text-extraction

kreuzberg/kreuzberg

8444 Favers
164 Downloads

High-performance document intelligence library


silverstripe/textextraction

9 Favers
186816 Downloads

Text Extraction API for SilverStripe CMS (mostly used with 'fulltextsearch' module)


iamgerwin/php-pdf-to-markdown-parser

5 Favers
4443 Downloads

A lightweight PHP library to convert PDF documents into clean, structured Markdown. Supports text extraction, headings, lists, tables, diagrams and code blocks for easier content reuse and publishing.


oxide/pdf-oxide

836 Favers
3 Downloads

PDF processing toolkit (Rust-backed, FFI-bound) for PHP


keyvan/german-ocr

109 Favers
0 Downloads

High-performance German document OCR - Local & Cloud API


jcfrane/pdf-text-extractor

2 Favers
197 Downloads

A Laravel PDF text extraction package with multiple strategies (PdfParser, XObject, AWS Textract, Tesseract OCR). Handles Canva-generated PDFs, scanned documents, and other edge cases with automatic fallback.


daniel-jorg-schuppelius/php-pdf-toolkit

0 Favers
304 Downloads

PHP 8.2+ library for PDF text extraction with automatic reader selection. Supports embedded text and scanned documents via OCR.


moinul/laravel-pdf-to-html

0 Favers
100 Downloads

A Laravel package to convert PDF files to HTML using poppler-utils


manofstrong/sitescrapper

6 Favers
71 Downloads

A Package to Scrape Websites from their Sitemaps and Extract Relevant Content from the Webpage and Upload to a Database


aspose/pdf

2 Favers
850 Downloads

A powerful library for manipulating and converting PDF files.


xatham/text-extraction

1 Faver
17 Downloads

Easy text extraction for many different file types


teon/text-extraction

1 Faver
607 Downloads

Text Extraction Library


centertap/tika-all-the-files

0 Favers
107 Downloads

MediaWiki extension that extracts searchable text and metadata from uploaded files via Apache Tika


mayaram/laravel-ocr

64 Favers
1330 Downloads

Laravel OCR & Document Data Extractor - A powerful OCR and document parsing engine for Laravel


cryde/json-text-extractor

12 Favers
8578 Downloads

Helper that will extract JSON from plain text
