Libraries tagged by HTML extraction
j0k3r/php-readability
639498 Downloads
Automatic article extraction from HTML
atrox/matcher
91011 Downloads
Powerful XML and HTML matching and data extraction library
pforret/pf-article-extractor
181 Downloads
PhpArticleExtractor. Boilerplate Removal and Fulltext Extraction from HTML pages
dotpack/php-boiler-pipe
4734 Downloads
PhpBoilerPipe. Boilerplate Removal and Fulltext Extraction from HTML pages
aspose/pdf
395 Downloads
A powerful library for manipulating and converting PDF files.
anshu-krishna/html-scraper
19 Downloads
A set of PHP classes to simplify data extraction from HTML.
moinul/laravel-pdf-to-html
15 Downloads
A Laravel package to convert PDF files to HTML using poppler-utils
vanry/readability
42 Downloads
Automatic article content extraction from HTML, plus an HTML parser.
sleimanx2/grawler
296 Downloads
A guided HTML crawler with media meta extraction
ncjoes/pdf-suite
247 Downloads
A high level wrapper over Poppler-Php for PDF content extraction and conversion using Poppler utils
hxtree/livingmarkup
1357 Downloads
A processor for markup, written in PHP. Allows extraction of markup into a data structure, orchestrated nested manipulation of that structure, and output as (optimized) markup.
matejch/html_helpers
6 Downloads
Helper class for removing elements and content, and extracting file paths
martinille/meta-tag-extraction
11 Downloads
PHP library for fetching and parsing meta tags from web pages using a given URL or HTML source.
gregpriday/laravel-zyte-api
30 Downloads
A Laravel package for seamless integration with Zyte's web scraping API, offering functionalities for extracting raw HTML, browser-rendered HTML, and structured article content.
tacman/php-readability
46 Downloads
Automatic article extraction from HTML, fork of j0k3r/php-readability