Libraries tagged by crawler
zenstruck/dom
3306 Downloads
DOM crawler with advanced selector API and assertions.
xcrawler/xcrawler
880 Downloads
A fast, simple and powerful PHP web crawler (scraper/spider) 快速、简洁且强大的爬虫/采集框架
silverstripe-labs/googleanalytics
10125 Downloads
The Google Analytics module consists of 2 components that can be employed independently: The Google Logger injects the google analytics javascript snippet into your source code and logs relevant events (as of now only crawler visits) The Analyzer adds the Google Analytics UI to your CMS.
octopoda/octopus
4423 Downloads
PHP Sitemap crawler
neclimdul/coveo-push-api
1256 Downloads
The Push API allows you to *push* items and security identities, as opposed to letting standard Coveo Cloud V2 crawlers *pull* this data from a content repository. This is especially useful when you need to index content from a cloud or on-premises system for which no dedicated source type exists in the Coveo Cloud V2 platform.
flancer32/mage2_ext_bot_sess
10943 Downloads
Magento2: prevent session creation for bots & crawlers.
crwlr/utils
12086 Downloads
Utilities that are needed in multiple crawler packages.
bugbuster/contao-botdetection-bundle
39119 Downloads
Contao bundle helper class to detect search engines, bots, spiders, crawlers ...
codeguy/arachnid
9738 Downloads
A crawler to find all unique internal pages on a given website
yurunsoft/crawler
82 Downloads
宇润爬虫框架(Yurun Crawler) 是一个低代码、高性能、分布式爬虫采集框架,这可能是最一把梭的爬虫框架。
yuan1994/z-crawler
53 Downloads
【正方教务】爬虫,支持成绩查询、考试查询、课表查询、四六级成绩查询、四六级报名、选课查询、修改密码、获取用户菜单等功能,并且解析数据成易读格式,符合 psr 规范,拿来即用
webprofil/crawler
520 Downloads
Crawls Website and can generate backstop json
survos/crawler-bundle
3842 Downloads
Provides a way to create tests that crawl a site
shel/crawler
1876 Downloads
Allows crawling of sitemaps and node-trees
migliori/sitemap-crawler
2149 Downloads
Sitemap crawler/generator. For the given URL it will return sitemap XML file with URLs and images.