Libraries tagged by Content extractor
vanry/readability
35 Downloads
Automatic article content extraction from html and html parser.
esplora/decompresso
5 Downloads
PHP library for extracting contents from various archive formats with ease.
manofstrong/sitescrapper
70 Downloads
A Package to Scrape Websites from their Sitemaps and Extract Relevant Content from the Webpage and Upload to a Database
ncjoes/pdf-suite
236 Downloads
A high level wrapper over Poppler-Php for PDF content extraction and conversion using Poppler utils
torann/dom-parser
444 Downloads
A HTML DOM parser written in PHP7 let you manipulate HTML in a very easy way! Supports invalid HTML. Find tags on an HTML page with selectors just like jQuery. Extract contents from HTML in a single line.
farzinft/php-simple-html-dom-parser
694 Downloads
Composer adaptation of: A HTML DOM parser written in PHP5+ let you manipulate HTML in a very easy way! Require PHP 5+. Supports invalid HTML. Find tags on an HTML page with selectors just like jQuery. Extract contents from HTML in a single line.
p1ho/accessibility-checker
419 Downloads
Accessibility Testing Suite on raw HTML extracted from Content Management Systems
se7enxweb/xrowextract
71 Downloads
This extenstion delivers tools for extracting/exporting content object data to csv
imelgrat/opml-parser
426 Downloads
OPML Parser Class: Extract the properties of content from OPML files.
aramonc/docblock-parser
22 Downloads
Parses strings for docBlock like portions and then extracts the annotations, descriptions, and optional document content. This should not be used as an annotation parser for PHP code, at least not on it's own. If you're looking to do something with the docBlocks you might want to use something like https://github.com/schmittjoh/metadata better. This is more for if you're trying to get metadata from a plain text file. Look through the tests for examples.
sters/extract-content
885 Downloads
Extract web articles
xtroo/php-client
12 Downloads
Xtroo PHP Client Library
gregpriday/laravel-zyte-api
27 Downloads
A Laravel package for seamless integration with Zyte's web scraping API, offering functionalities for extracting raw HTML, browser-rendered HTML, and structured article content.
ahadabasi/php-readability
1 Downloads
Automatic article extraction from HTML
matejch/html_helpers
5 Downloads
Helper class for removing elements and content, and extracting file paths