PHP download

Download the PHP package rajanrx/php-scrape without Composer

On this page you can find all versions of the php package rajanrx/php-scrape. It is possible to download/install these versions without Composer. Possible dependencies are resolved automatically.

Table of contents
Download rajanrx/php-scrape
More information about rajanrx/php-scrape
Files in rajanrx/php-scrape

Vendor rajanrx
Package php-scrape
Short Description A scraping framework written in PHP
License MIT
Homepage https://github.com/rajanrx/php-scrape

Keywords framework php scrape

FAQ

After the download, you have to make one include require_once('vendor/autoload.php');. After that you have to import the classes with use statements.

Example:

If you use only one package a project is not needed. But if you use more then one package, without a project it is not possible to import the classes with use statements.

In general, it is recommended to use always a project to download your libraries. In an application normally there is more than one library needed.

Some PHP packages are not free to download and because of that hosted in private repositories. In this case some credentials are needed to access such packages. Please use the auth.json textarea to insert credentials, if a package is coming from a private repository. You can look here for more information.

Some hosting areas are not accessible by a terminal or SSH. Then it is not possible to use Composer.
To use Composer is sometimes complicated. Especially for beginners.
Composer needs much resources. Sometimes they are not available on a simple webspace.
If you are using private repositories you don't need to share your credentials. You can set up everything on our site and then you provide a simple download link to your team member.
Simplify your Composer build process. Use our own command line tool to download the vendor folder as binary. This makes your build process faster and you don't need to expose your credentials for private repositories.

Please rate this library. Is it a good library?

Example code of rajanrx/php-scrape

Informations about the package php-scrape

PHP Scrape

A simple, easy to use, scalable scraping framework written in PHP

About PHP Scrape

Php Scrape is a basic scraping framework for PHP based on configuration first concept. i.e once implemented changes should be made on configuration file as far as possible avoiding need for code update/addition. Also, you can extend/Customize this framework to any level or use components (Extractor, Crawler) separately if you just need to use them.

Following are the key points which you can use/expect in future:

[x] Scrape in console or browser
[x] Use hash to escape duplicate scraping (or halt further scraping)
[x] Generate editable configuration file using PHP code
[x] Ability to extend own scraping components
[ ] Add complete wiki for general and advance usage instructions
[x] Add test coverage for command line scraping (> 80%)
[ ] Add test coverage for Javascript scraping
[ ] Allow use of proxy to scrape anonymously
[ ] Generate automated integration test for scraping to ensure data integrity

Why Need For yet another git repo ?

One of the biggest problem in scraping data is the source gets changed and we have to update our codebase to get it working. As the codebase increases it is harder to maintain and even annoying looking for the place to update if someone new to codebase has to maintain it. Also different projects has their own unique requirements (made even harder by varieties/complexity of data sources) which might not be addressed by lots of libraries for not being generic enough. So in order to help facilitate developers tackle these problems, I have tried to come up with a generic, flexible solution that might help them to write easily configurable, maintainable and (extend/customize)able scraping projects.

Getting Started

The easiest way to use PHP Scrape is via Composer.

You need to create configuration file to start scraping. You can do it either by creating a config JSON file or via using php (Highly recommended as its easier to maintain and scale ) to generate one.

Once you have a configuration file you can start scraping by writing few lines of code

will return result like

As simple as that. Docs in detail will be updated soon.Meanwhile until the doc is not available please see Multi Row Extractor Test to figure out how you can scrape paginated records.

Please let me know if you have any suggestions to make this codebase better. I am happy to assist if you get stuck on your scraping project :). Feel free to ping me. Interested contributors are welcome.

Partners

BrowserStack is supporting PHP Scrape, allowing us to use their service and infrastructure to test the code in this repository. Thank you for supporting the open source community!

License

This framework is open-sourced software licensed under the MIT license.

If you are happy and want to buy me a coffee then why not :).

All versions of php-scrape with dependencies

PHP Build Version

Package Version

Version 1.1.1 Release 22. Jun 2017
create-project require 1 people chose require and
0 people chose create-project.

Download

Download latest version of php-scrape from vendor rajanrx

Requires php Version >=5.6.0
behat/mink Version ~1.6
behat/mink-browserkit-driver Version ~1.2
behat/mink-goutte-driver Version ~1.1
behat/mink-selenium2-driver Version ~1.2
behat/mink-zombie-driver Version ~1.2
php-curl-class/php-curl-class Version 3.5.5
guzzlehttp/cache-subscriber Version 0.1.0

Composer command for our command line client (download client) This client runs in each environment. You don't need a specific PHP version etc. The first 20 API calls are free. Standard composer command

The package rajanrx/php-scrape contains the following files

Loading the files please wait ....