Download the PHP package daa/web-scraping-sdk without Composer

On this page you can find all versions of the php package daa/web-scraping-sdk. It is possible to download/install these versions without Composer. Possible dependencies are resolved automatically.

FAQ

After the download, you have to make one include require_once('vendor/autoload.php');. After that you have to import the classes with use statements.

Example:
If you use only one package a project is not needed. But if you use more then one package, without a project it is not possible to import the classes with use statements.

In general, it is recommended to use always a project to download your libraries. In an application normally there is more than one library needed.
Some PHP packages are not free to download and because of that hosted in private repositories. In this case some credentials are needed to access such packages. Please use the auth.json textarea to insert credentials, if a package is coming from a private repository. You can look here for more information.

  • Some hosting areas are not accessible by a terminal or SSH. Then it is not possible to use Composer.
  • To use Composer is sometimes complicated. Especially for beginners.
  • Composer needs much resources. Sometimes they are not available on a simple webspace.
  • If you are using private repositories you don't need to share your credentials. You can set up everything on our site and then you provide a simple download link to your team member.
  • Simplify your Composer build process. Use our own command line tool to download the vendor folder as binary. This makes your build process faster and you don't need to expose your credentials for private repositories.
Please rate this library. Is it a good library?

Informations about the package web-scraping-sdk

Web Scraping PHP SDK

This is a composer package that simplifies web content scraping providing a lightweight and easy to use code base.

Simply extend the Scraper class provided and implement the gather() method to extract the desired content using xpaths. You can then output this content to a file, store in a database, return a json string, etc.

Highlights:

Packagist link: https://packagist.org/packages/daa/web-scraping-sdk

Usage

Add the following requirement to your composer file and do a composer install/update:

Write your own scraper class which extends Scraper\Sdk\WebScraper and implements the gather method:

Now call your class, for example from a script that is executed by a cron job:

With troublesome sources you can specify the retry configuration (default is 3 retries with a 3 second pause in between)

You can use the same instance to scrape several urls with the same structure:

Check out the examples folder for more details and fully working examples.


All versions of web-scraping-sdk with dependencies

PHP Build Version
Package Version
Requires php Version >=5.3.3
ext-curl Version *
daa/restful-curl-php-wrapper Version 1.*
Composer command for our command line client (download client) This client runs in each environment. You don't need a specific PHP version etc. The first 20 API calls are free. Standard composer command

The package daa/web-scraping-sdk contains the following files

Loading the files please wait ....