Libraries tagged with "HTML extraction"
j0k3r/php-readability
667705 Downloads
Automatic article extraction from HTML
atrox/matcher
92695 Downloads
Powerful XML and HTML matching and data extraction library
pforret/pf-article-extractor
275 Downloads
PhpArticleExtractor. Boilerplate Removal and Fulltext Extraction from HTML pages
dotpack/php-boiler-pipe
4761 Downloads
PhpBoilerPipe. Boilerplate Removal and Fulltext Extraction from HTML pages
aspose/pdf
539 Downloads
A powerful library for manipulating and converting PDF files.
vanry/readability
43 Downloads
Automatic article content extraction from HTML, plus an HTML parser.
sleimanx2/grawler
297 Downloads
A guided HTML crawler with media meta extraction
ncjoes/pdf-suite
250 Downloads
A high level wrapper over Poppler-Php for PDF content extraction and conversion using Poppler utils
einfacharchiv/microdata
303 Downloads
Extract billing data from HTML (supporting Microdata and JSON-LD)
anshu-krishna/html-scraper
19 Downloads
A set of PHP classes to simplify data extraction from HTML.
moinul/laravel-pdf-to-html
24 Downloads
A Laravel package to convert PDF files to HTML using poppler-utils
matejch/html_helpers
7 Downloads
Helper class for removing elements and content, and extracting file paths
martinille/meta-tag-extraction
12 Downloads
PHP library for fetching and parsing meta tags from web pages using a given URL or HTML source.
gregpriday/laravel-zyte-api
30 Downloads
A Laravel package for seamless integration with Zyte's web scraping API, offering functionalities for extracting raw HTML, browser-rendered HTML, and structured article content.
tacman/php-readability
48 Downloads
Automatic article extraction from HTML, fork of j0k3r/php-readability