Libraries tagged by craw
crwlr/crawler
10271 Downloads
Web crawling and scraping library.
crawlbase/crawlbase
33214 Downloads
A lightweight, dependency free PHP class that acts as wrapper for Crawlbase API
vipnytt/robotstxtparser
699040 Downloads
Robots.txt parsing library, with full support for every directive and specification.
spatie/http-status-check
47843 Downloads
CLI tool to crawl a website and check HTTP status code
proxycrawl/proxycrawl
86703 Downloads
A lightweight, dependency free PHP class that acts as wrapper for ProxyCrawl API
stil/curl-easy
201825 Downloads
cURL wrapper for PHP. Supports parallel and non-blocking requests. For high speed crawling, see stil/curl-robot.
baba/sitemap-crawler
Downloads
smochin/instagram-php-crawler
8465 Downloads
A simple PHP Crawler for Instagram
nmure/crawler-detect-bundle
271956 Downloads
A Symfony bundle for the Crawler-Detect library (detects bots/crawlers/spiders via the user agent)
nadar/crawler
22228 Downloads
A highly extendible, dependency free Crawler for HTML, PDFS or any other type of Documents.
dachcom-digital/dynamic-search-data-provider-crawler
26011 Downloads
aoepeople/crawler
287892 Downloads
Crawler extension for TYPO3
vipnytt/useragentparser
876382 Downloads
User-Agent parser for robot rule sets
tomverran/robots-txt-checker
54036 Downloads
Given a robots.txt file, user agent and URL path will tell you whether you're allowed to access a page
sleeping-owl/apist
5033 Downloads
Package to provide api-like access to foreign sites based on html parsing