Libraries tagged by crawl
aoepeople/crawler
279654 Downloads
Crawler extension for TYPO3
gsouf/chromium
376 Downloads
Instrument headless chrome/chromium instances from PHP
vipnytt/robotstxtparser
556928 Downloads
Robots.txt parsing library, with full support for every directive and specification.
spatie/http-status-check
47243 Downloads
CLI tool to crawl a website and check HTTP status code
jyggen/curl
141934 Downloads
A simple and lightweight cURL library with support for asynchronous requests.
stil/curl-easy
189179 Downloads
cURL wrapper for PHP. Supports parallel and non-blocking requests. For high speed crawling, see stil/curl-robot.
baba/sitemap-crawler
Downloads
nmure/crawler-detect-bundle
252946 Downloads
A Symfony bundle for the Crawler-Detect library (detects bots/crawlers/spiders via the user agent)
nadar/crawler
18781 Downloads
A highly extendible, dependency free Crawler for HTML, PDFS or any other type of Documents.
dachcom-digital/dynamic-search-data-provider-crawler
12662 Downloads
zrashwani/arachnid
20184 Downloads
A crawler to find all unique internal pages on a given website
vipnytt/useragentparser
711051 Downloads
User-Agent parser for robot rule sets
tomverran/robots-txt-checker
31020 Downloads
Given a robots.txt file, user agent and URL path will tell you whether you're allowed to access a page
sleeping-owl/apist
4414 Downloads
Package to provide api-like access to foreign sites based on html parsing
opensearchserver/opensearchserver
61591 Downloads
PHP library for OpenSearchServer: professionnal search engine, crawlers (web, file, database), REST APIs, .... This library uses OpenSearchServer's V2 API.