Libraries tagged by crawling
duzun/hquery
98080 Downloads
An extremely fast web scraper that parses megabytes of HTML in a blink of an eye. No dependencies. PHP5+
crwlr/crawler
6333 Downloads
Web crawling and scraping library.
crawlbase/crawlbase
13089 Downloads
A lightweight, dependency free PHP class that acts as wrapper for Crawlbase API
proxycrawl/proxycrawl
72315 Downloads
A lightweight, dependency free PHP class that acts as wrapper for ProxyCrawl API
stil/curl-easy
194229 Downloads
cURL wrapper for PHP. Supports parallel and non-blocking requests. For high speed crawling, see stil/curl-robot.
cyber-duck/silverstripe-seo
48603 Downloads
A SilverStripe module to optimise the Meta, crawling, indexing, and sharing of your website content
crwlr/robots-txt
10440 Downloads
Robots Exclusion Standard/Protocol Parser for Web Crawling/Scraping
cleantalk/anti-ddos-lite
1274 Downloads
A small PHP app to protect your site against DDoS attack or crawling web site by bots.
renoki-co/clusteer
1188 Downloads
Clusteer is a Puppeteer wrapper written for PHP, with the super-power of parallelizing pages across multiple browser instances.
crawlzone/crawlzone
5084 Downloads
Crawlzone is a fast asynchronous internet crawling framework aiming to provide open source web search and testing solution. It can be used for a wide range of purposes, from extracting and indexing structured data to monitoring and automated testing.
plasticstudio/silverstripe-seo
4970 Downloads
A SilverStripe module to optimise the Meta, crawling, indexing, and sharing of your website content (forked from Cyber-Duck/Silverstripe-SEO)
fourlabs/robots-bundle
21494 Downloads
Symfony2 bundle to control X-Robots-Tag HTTP header via annotations
dexiio/dexi-api-client
9980 Downloads
Dexi API Client for PHP 5.3+
crwlr/crawler-ext-browser
544 Downloads
Extension for the crwlr/crawler package containing steps utilizing a headless browser.
stil/curl-robot
22784 Downloads
Parallel URL crawling extension to curl-easy.