Libraries tagged by craw

vipnytt/robotstxtparser

27 Favers
593519 Downloads

Robots.txt parsing library, with full support for every directive and specification.


duzun/hquery

357 Favers
96036 Downloads

An extremely fast web scraper that parses megabytes of HTML in a blink of an eye. No dependencies. PHP5+


gsouf/chromium

2192 Favers
503 Downloads

Instrument headless Chrome/Chromium instances from PHP


stil/curl-easy

329 Favers
192575 Downloads

cURL wrapper for PHP. Supports parallel and non-blocking requests. For high-speed crawling, see stil/curl-robot.


baba/sitemap-crawler

Favers
Downloads


smochin/instagram-php-crawler

47 Favers
6182 Downloads

A simple PHP Crawler for Instagram


nmure/crawler-detect-bundle

26 Favers
257553 Downloads

A Symfony bundle for the Crawler-Detect library (detects bots/crawlers/spiders via the user agent)


dachcom-digital/dynamic-search-data-provider-crawler

8 Favers
16308 Downloads


crawlbase/crawlbase

12 Favers
8073 Downloads

A lightweight, dependency-free PHP class that acts as a wrapper for the Crawlbase API


vipnytt/useragentparser

2 Favers
753426 Downloads

User-Agent parser for robot rule sets


tomverran/robots-txt-checker

13 Favers
35613 Downloads

Given a robots.txt file, a user agent, and a URL path, tells you whether you're allowed to access a page


spatie/http-status-check

598 Favers
47524 Downloads

CLI tool to crawl a website and check the HTTP status code of every link


sleeping-owl/apist

313 Favers
4584 Downloads

Package that provides API-like access to third-party sites based on HTML parsing


opensearchserver/opensearchserver

52 Favers
62754 Downloads

PHP library for OpenSearchServer: professional search engine, crawlers (web, file, database), REST APIs, and more. This library uses OpenSearchServer's V2 API.


kiddyu/beanbun

1250 Favers
4056 Downloads

Beanbun is a multi-process web crawler framework written in PHP, designed to be open and highly extensible
