Libraries tagged with "HTML extraction"
j0k3r/php-readability
692894 Downloads
Automatic article extraction from HTML
atrox/matcher
93416 Downloads
Powerful XML and HTML matching and data extraction library
pforret/pf-article-extractor
399 Downloads
PfArticleExtractor. Boilerplate Removal and Fulltext Extraction from HTML pages
dotpack/php-boiler-pipe
4795 Downloads
PhpBoilerPipe. Boilerplate Removal and Fulltext Extraction from HTML pages
moinul/laravel-pdf-to-html
33 Downloads
A Laravel package to convert PDF files to HTML using poppler-utils
vanry/readability
43 Downloads
Automatic article content extraction from HTML, plus an HTML parser.
sleimanx2/grawler
297 Downloads
A guided HTML crawler with media meta extraction
ncjoes/pdf-suite
251 Downloads
A high level wrapper over Poppler-Php for PDF content extraction and conversion using Poppler utils
aspose/pdf
575 Downloads
A powerful library for manipulating and converting PDF files.
anshu-krishna/html-scraper
19 Downloads
A set of PHP classes to simplify data extraction from HTML.
matejch/html_helpers
7 Downloads
Helper class for removing elements and content, and extracting file paths
martinille/meta-tag-extraction
12 Downloads
PHP library for fetching and parsing meta tags from web pages using a given URL or HTML source.
gregpriday/laravel-zyte-api
30 Downloads
A Laravel package for seamless integration with Zyte's web scraping API, offering functionalities for extracting raw HTML, browser-rendered HTML, and structured article content.
tacman/php-readability
48 Downloads
Automatic article extraction from HTML, fork of j0k3r/php-readability
ngfw/webparser
7 Downloads
WebParser is a PHP library that allows developers to parse and query webpages using an ORM-like syntax. It facilitates the extraction of HTML elements by chaining operations such as filtering by ID or class, ordering results, and limiting output. WebParser offers a flexible interface for exploring and extracting data from the web, making it ideal for web scraping and data analysis tasks.