Libraries tagged by HTML extraction
tacman/php-readability
50 Downloads
Automatic article extraction from HTML, fork of j0k3r/php-readability
ngfw/webparser
8 Downloads
WebParser is a PHP library that allows developers to parse and query webpages using an ORM-like syntax. It facilitates the extraction of HTML elements by chaining operations such as filtering by ID or class, ordering results, and limiting output. WebParser offers a flexible interface for exploring and extracting data from the web, making it ideal for web scraping and data analysis tasks.
clientbg/php-boiler-pipe
40 Downloads
PhpBoilerPipe. Boilerplate Removal and Fulltext Extraction from HTML pages. Based on dotpack's PHP implementation.
ahadabasi/php-readability
1 Downloads
Automatic article extraction from HTML
shibashish/pdf-reader
0 Downloads
A comprehensive Laravel package for extracting text, HTML, images, and metadata from PDF files using Poppler utilities.
llm-html-extractor/symfony-bundle
2 Downloads
Symfony bundle for extracting structured data from HTML using LLM providers
hstanleycrow/easyphparticleextractor
20 Downloads
Free PHP library to extract the main content from an article post or news post, including images and HTML
einfacharchiv/microdata
332 Downloads
Extract billing data from HTML (supporting Microdata and JSON-LD)
balintpethe/laravel-universal-scraper
2 Downloads
Universal web scraping toolkit for Laravel applications.
ouxsoft/livingmarkup
3831 Downloads
A Processor for Markup written in PHP. Allows extraction of Markup into a data structure, orchestrated nested manipulation of said structure, and output as (optimized) Markup.
hxtree/livingmarkup
1357 Downloads
A Processor for Markup written in PHP. Allows extraction of Markup into a data structure, orchestrated nested manipulation of said structure, and output as (optimized) Markup.
jkphl/micrometa
172482 Downloads
A meta parser for extracting micro information out of web documents, currently supporting Microformats 1+2, HTML Microdata, RDFa Lite 1.1 and JSON-LD
teners/laravel-link-preview
3856 Downloads
A Laravel package for extracting link previews with customizable parsers, and caching support
ahmaadkhader/pdf-to-html
164 Downloads
Standalone PHP library for extracting semantic HTML from PDF files. Detects headings, lists, tables, links, and inline styles from PDF content.
mylukin/textractor
131 Downloads
An efficient class library for extracting text from HTML.