Libraries tagged by HTML Extractor

atrox/matcher

94 Favers
93841 Downloads

Powerful XML and HTML matching and data extraction library

Go to Download


jkphl/micrometa

119 Favers
159428 Downloads

A meta parser for extracting micro information out of web documents, currently supporting Microformats 1+2, HTML Microdata, RDFa Lite 1.1 and JSON-LD

Go to Download


helgesverre/receipt-scanner

137 Favers
6748 Downloads

Use OpenAI to extract structured receipt and invoice data from Text, Html, Images and PDFs.

Go to Download


lysice/php-simple-html-dom-parser

6 Favers
30465 Downloads

Composer adaptation of: A HTML DOM parser written in PHP5+ let you manipulate HTML in a very easy way! Require PHP 5+. Supports invalid HTML. Find tags on an HTML page with selectors just like jQuery. Extract contents from HTML in a single line.

Go to Download


nilgems/laravel-textract

19 Favers
3747 Downloads

A Laravel package to extract text from files like DOC, XL, Image, Pdf and more. I've developed this package by inspiring "npm textract".

Go to Download


hexydec/htmldoc

24 Favers
8487 Downloads

A token based HTML document parser and minifier. Minify HTML documents including inline CSS, Javascript, and SVG's on the fly. Extract document text, attributes, and fragments. Full test suite.

Go to Download


linclark/microdata-php

118 Favers
42365 Downloads

Extracts microdata from HTML using PHP.

Go to Download


crwlr/schema-org

14 Favers
13825 Downloads

Extract schema.org structured data from HTML documents.

Go to Download


grom/tube-link

30 Favers
14351 Downloads

Extract video/music information from any URL and render HTML

Go to Download


dotpack/php-boiler-pipe

17 Favers
4843 Downloads

PhpBoilerPipe. Boilerplate Removal and Fulltext Extraction from HTML pages

Go to Download


aspose/pdf-sdk-php

8 Favers
24863 Downloads

Aspose.PDF Cloud is a REST API for creating and editing PDF files. It can also be used to convert PDF files to different formats like DOC, HTML, XPS, TIFF and many more. Aspose.PDF Cloud gives you control: create PDFs from scratch or from HTML, XML, template, database, XPS or an image. Render PDFs to image formats such as JPEG, PNG, GIF, BMP, TIFF and many others. Aspose.PDF Cloud helps you manipulate elements of a PDF file like text, annotations, watermarks, signatures, bookmarks, stamps and so on. Its REST API also allows you to manage PDF pages by using features like merging, splitting, and inserting. Add images to a PDF file or convert PDF pages to images.

Go to Download


lavatech/php-simple-html-dom-parser

0 Favers
15016 Downloads

Composer adaptation of: A HTML DOM parser written in PHP5+ let you manipulate HTML in a very easy way! Require PHP 5+. Supports invalid HTML. Find tags on an HTML page with selectors just like jQuery. Extract contents from HTML in a single line.

Go to Download


itul/php-simple-html-dom-parser

0 Favers
2193 Downloads

This is a modified version to work with PHP 7.4+. Composer adaptation of: A HTML DOM parser written in PHP5+ let you manipulate HTML in a very easy way! Require PHP 5+. Supports invalid HTML. Find tags on an HTML page with selectors just like jQuery. Extract contents from HTML in a single line.

Go to Download


p1ho/accessibility-checker

3 Favers
540 Downloads

Accessibility Testing Suite on raw HTML extracted from Content Management Systems

Go to Download


nfservice/doc-cfe

0 Favers
470 Downloads

Cria extrato da CF-e para exportar em PDF e HTML

Go to Download


<< Previous Next >>