Libraries tagged by crawl

monperrus/crawler-user-agents

1281 Favers
2033 Downloads

This repository contains a list of of HTTP user-agents used by robots, crawlers, and spiders as in single JSON file.

Go to Download


crawlbase/crawlbase

16 Favers
29016 Downloads

A lightweight, dependency free PHP class that acts as wrapper for Crawlbase API

Go to Download


vipnytt/robotstxtparser

27 Favers
683860 Downloads

Robots.txt parsing library, with full support for every directive and specification.

Go to Download


stil/curl-easy

326 Favers
200193 Downloads

cURL wrapper for PHP. Supports parallel and non-blocking requests. For high speed crawling, see stil/curl-robot.

Go to Download


baba/sitemap-crawler

Favers
Downloads

Go to Download


smochin/instagram-php-crawler

47 Favers
8121 Downloads

A simple PHP Crawler for Instagram

Go to Download


nmure/crawler-detect-bundle

26 Favers
269557 Downloads

A Symfony bundle for the Crawler-Detect library (detects bots/crawlers/spiders via the user agent)

Go to Download


friends-of-hyva/magento2-crawler-session

13 Favers
3320 Downloads

Prevent crawlers from creating a session

Go to Download


dachcom-digital/dynamic-search-data-provider-crawler

8 Favers
24421 Downloads

Go to Download


aoepeople/crawler

57 Favers
287391 Downloads

Crawler extension for TYPO3

Go to Download


luka-dev/headless-task-server-php

10 Favers
8158 Downloads

Helper for sending requests to luka-dev/headless-task-server

Go to Download


vipnytt/useragentparser

2 Favers
858194 Downloads

User-Agent parser for robot rule sets

Go to Download


tomverran/robots-txt-checker

13 Favers
51635 Downloads

Given a robots.txt file, user agent and URL path will tell you whether you're allowed to access a page

Go to Download


spatie/http-status-check

601 Favers
47740 Downloads

CLI tool to crawl a website and check HTTP status code

Go to Download


sleeping-owl/apist

312 Favers
4951 Downloads

Package to provide api-like access to foreign sites based on html parsing

Go to Download


<< Previous Next >>