Libraries tagged by document parser
sokil/php-vast
164144 Downloads
Generator and parser for VAST documents
yusufkandemir/microdata-parser
57842 Downloads
Parse microdata from HTML documents with ease. PHP Implementation of W3C Microdata to JSON Specification.
iamgerwin/php-pdf-to-markdown-parser
5553 Downloads
A lightweight PHP library to convert PDF documents into clean, structured Markdown. Supports text extraction, headings, lists, tables, diagrams and code blocks for easier content reuse and publishing.
deft/mrz-parser
107328 Downloads
Library to parse machine readable zones (MRZ) of passports and ID cards
aymanrb/php-unstructured-text-parser
23940 Downloads
A PHP library to help extract text out of text documents
hexydec/htmldoc
11091 Downloads
A token based HTML document parser and minifier. Minify HTML documents including inline CSS, Javascript, and SVG's on the fly. Extract document text, attributes, and fragments. Full test suite.
jkphl/rdfa-lite-microdata
174712 Downloads
RDFa Lite 1.1 and HTML Microdata parser for web documents (HTML, SVG, XML)
adianti/html-document
3896 Downloads
HTML Document parser
webignition/yaml-document-set-parser
76786 Downloads
Separate a collection of yaml documents into an array of yaml documents
webignition/yaml-document-generator
41348 Downloads
Generate a yaml document from an array of data
sourcepot/php-ole-msg-parser
364 Downloads
Minimal PHP library for parsing Outlook .msg files incl. attachments stored in OLE compound documents.
soothsilver/dtd-parser
6639 Downloads
Simple fully compliant DTD parser that allows you to extract information from Document Type Definition files.
hexydec/cssdoc
11691 Downloads
A token based CSS Document parser and minifier written in PHP
ordinary9843/ghostscript
24119 Downloads
Use Ghostscript to merge / split all PDF files or guess and convert PDF file version, and transform PDF into images. Fix FPDI error by Ghostscript: This document PDF probably uses a compression technique which is not supported by the free parser shipped with FPDI.
andrewandante/silverstripe-document-parser
1473 Downloads
Adds DocumentParser package to extract contents of .doc, .docx, .rtf and .txt files for search etc.