Libraries tagged by HTML Extractor
bitandblack/document-crawler
390 Downloads
Extract different parts of an HTML or XML document.
ahmaadkhader/pdf-to-html
87 Downloads
Standalone PHP library for extracting semantic HTML from PDF files. Detects headings, lists, tables, links, and inline styles from PDF content.
lavatech/php-simple-html-dom-parser
15675 Downloads
Composer adaptation of: An HTML DOM parser written in PHP 5+ that lets you manipulate HTML in a very easy way. Requires PHP 5+. Supports invalid HTML. Find tags on an HTML page with selectors, just like jQuery. Extract contents from HTML in a single line.
webignition/html-document-type-parser
1019 Downloads
Parse a public HTML document type (with many caveats) and extract the FPI and URI.
dotpack/php-boiler-pipe
4913 Downloads
PhpBoilerPipe. Boilerplate Removal and Fulltext Extraction from HTML pages
nicolas-joubert/grabit-bundle
195 Downloads
Grab it is an R&D crawler for grabbing HTML pages.
zoon/microdata-php
6451 Downloads
Extracts microdata from HTML using PHP.
nfservice/doc-cfe
843 Downloads
Creates a CF-e statement for export to PDF and HTML.
dealnews/metadata
860 Downloads
Extracts metadata (using oEmbed, Open Graph, Twitter Cards, scraping the HTML, etc.) from web pages.
yubarajshrestha/html-dom-parser
24 Downloads
This package is a simple wrapper around the Simple Html Dom Parser library. It provides a simple way to parse HTML and extract data from it.
itul/php-simple-html-dom-parser
2807 Downloads
This is a modified version that works with PHP 7.4+. Composer adaptation of: An HTML DOM parser written in PHP 5+ that lets you manipulate HTML in a very easy way. Requires PHP 5+. Supports invalid HTML. Find tags on an HTML page with selectors, just like jQuery. Extract contents from HTML in a single line.
moinul/laravel-pdf-to-html
104 Downloads
A Laravel package to convert PDF files to HTML using poppler-utils
mage2kishan/module-html-sitemap
34 Downloads
Theme-agnostic HTML sitemap page for Magento 2 (Hyva + Luma). Renders categories (tree), products (grid), CMS pages, store switcher, and custom links at /sitemap with a built-in client-side search. Extracted from Panth_AdvancedSEO for independent installation.
arleyoliveira/extrato-cfe
76 Downloads
Creates a CF-e statement for export to PDF and HTML.
p1ho/accessibility-checker
612 Downloads
Accessibility testing suite for raw HTML extracted from content management systems.