Libraries tagged by HTML extraction
ngfw/webparser
6 Downloads
WebParser is a PHP library that allows developers to parse and query webpages using an ORM-like syntax. It facilitates the extraction of HTML elements by chaining operations such as filtering by ID or class, ordering results, and limiting output. WebParser offers a flexible interface for exploring and extracting data from the web, making it ideal for web scraping and data analysis tasks.
clientbg/php-boiler-pipe
39 Downloads
PhpBoilerPipe. Boilerplate Removal and Fulltext Extraction from HTML pages. Based on dotpack's PHP implementation.
ahadabasi/php-readability
1 Downloads
Automatic article extraction from HTML
hstanleycrow/easyphparticleextractor
17 Downloads
Free PHP library to extract the main content from an article post or news post, including images and HTML
einfacharchiv/microdata
298 Downloads
Extract billing data from HTML (supporting Microdata and JSON-LD)
ouxsoft/livingmarkup
3830 Downloads
A Processor for Markup written in PHP. Allows extraction of Markup into a data structure, orchestrated nested manipulation of said structure, and output as (optimized) Markup.
jkphl/micrometa
156002 Downloads
A meta parser for extracting micro information out of web documents, currently supporting Microformats 1+2, HTML Microdata, RDFa Lite 1.1 and JSON-LD
teners/laravel-link-preview
1936 Downloads
A Laravel package for extracting link previews with customizable parsers, and caching support
mylukin/textractor
130 Downloads
An efficient class library for extracting text from HTML.
imelgrat/feed-finder
34 Downloads
A PHP class for extracting the URLs of RSS (1.0 and 2.0) and ATOM feeds associated to a page, as well as OPML outline documents.
stamina/phpquery-tools
33 Downloads
phpQuery tools for extracting links and sanitizing HTML documents using phpQuery
michaelstivala/micrometa
8 Downloads
A meta parser for extracting micro information out of web documents, currently supporting Microformats 1+2, HTML Microdata, RDFa Lite 1.1 and JSON-LD
eborges78/micrometa
4 Downloads
A meta parser for extracting micro information out of web documents, currently supporting Microformats 1+2, HTML Microdata, RDFa Lite 1.1 and JSON-LD
ankurgoels/micrometa
195 Downloads
A meta parser for extracting micro information out of web documents, currently supporting Microformats 1+2, HTML Microdata, RDFa Lite 1.1 and JSON-LD
shiba/textractor
11 Downloads
An efficient class library for extracting text from HTML.