Libraries tagged by HTML Extractor
jkphl/micrometa
158287 Downloads
A meta parser for extracting micro information out of web documents, currently supporting Microformats 1+2, HTML Microdata, RDFa Lite 1.1 and JSON-LD
helgesverre/receipt-scanner
5372 Downloads
Use OpenAI to extract structured receipt and invoice data from text, HTML, images, and PDFs.
lysice/php-simple-html-dom-parser
28787 Downloads
Composer adaptation of: An HTML DOM parser written in PHP 5+ that lets you manipulate HTML very easily. Requires PHP 5+. Supports invalid HTML. Find tags on an HTML page with selectors just like jQuery. Extract contents from HTML in a single line.
hexydec/htmldoc
8133 Downloads
A token-based HTML document parser and minifier. Minify HTML documents, including inline CSS, JavaScript, and SVGs, on the fly. Extract document text, attributes, and fragments. Full test suite.
linclark/microdata-php
42167 Downloads
Extracts microdata from HTML using PHP.
crwlr/schema-org
13111 Downloads
Extract schema.org structured data from HTML documents.
grom/tube-link
14061 Downloads
Extract video/music information from any URL and render HTML
nilgems/laravel-textract
3470 Downloads
A Laravel package to extract text from files like DOC, XL, images, PDF, and more. Inspired by the npm package "textract".
aspose/pdf-sdk-php
24354 Downloads
Aspose.PDF Cloud is a REST API for creating and editing PDF files. It can also be used to convert PDF files to different formats like DOC, HTML, XPS, TIFF and many more. Aspose.PDF Cloud gives you control: create PDFs from scratch or from HTML, XML, template, database, XPS or an image. Render PDFs to image formats such as JPEG, PNG, GIF, BMP, TIFF and many others. Aspose.PDF Cloud helps you manipulate elements of a PDF file like text, annotations, watermarks, signatures, bookmarks, stamps and so on. Its REST API also allows you to manage PDF pages by using features like merging, splitting, and inserting. Add images to a PDF file or convert PDF pages to images.
lavatech/php-simple-html-dom-parser
14685 Downloads
Composer adaptation of: An HTML DOM parser written in PHP 5+ that lets you manipulate HTML very easily. Requires PHP 5+. Supports invalid HTML. Find tags on an HTML page with selectors just like jQuery. Extract contents from HTML in a single line.
itul/php-simple-html-dom-parser
1939 Downloads
A modified version that works with PHP 7.4+. Composer adaptation of: An HTML DOM parser written in PHP 5+ that lets you manipulate HTML very easily. Requires PHP 5+. Supports invalid HTML. Find tags on an HTML page with selectors just like jQuery. Extract contents from HTML in a single line.
dotpack/php-boiler-pipe
4763 Downloads
PhpBoilerPipe. Boilerplate removal and full-text extraction from HTML pages.
imelgrat/opml-parser
437 Downloads
OPML Parser Class: Extract the properties of content from OPML files.
aspose/pdf
562 Downloads
A powerful library for manipulating and converting PDF files.
nfservice/doc-cfe
389 Downloads
Creates the CF-e extract (statement) for export to PDF and HTML