Libraries tagged by crawl
aoepeople/crawler
280071 Downloads
Crawler extension for TYPO3
vipnytt/robotstxtparser
561234 Downloads
Robots.txt parsing library, with full support for every directive and specification.
spatie/http-status-check
47262 Downloads
CLI tool to crawl a website and check HTTP status code
proxycrawl/proxycrawl
62610 Downloads
A lightweight, dependency free PHP class that acts as wrapper for ProxyCrawl API
stil/curl-easy
189582 Downloads
cURL wrapper for PHP. Supports parallel and non-blocking requests. For high speed crawling, see stil/curl-robot.
baba/sitemap-crawler
Downloads
smochin/instagram-php-crawler
5890 Downloads
A simple PHP Crawler for Instagram
nmure/crawler-detect-bundle
253547 Downloads
A Symfony bundle for the Crawler-Detect library (detects bots/crawlers/spiders via the user agent)
nadar/crawler
18955 Downloads
A highly extendible, dependency free Crawler for HTML, PDFS or any other type of Documents.
dachcom-digital/dynamic-search-data-provider-crawler
13033 Downloads
zrashwani/arachnid
20189 Downloads
A crawler to find all unique internal pages on a given website
gsouf/chromium
378 Downloads
Instrument headless chrome/chromium instances from PHP
vipnytt/useragentparser
716039 Downloads
User-Agent parser for robot rule sets
tomverran/robots-txt-checker
31339 Downloads
Given a robots.txt file, user agent and URL path will tell you whether you're allowed to access a page
sleeping-owl/apist
4424 Downloads
Package to provide api-like access to foreign sites based on html parsing