Download the PHP package mfsousa/goutte without Composer
On this page you can find all versions of the PHP package mfsousa/goutte. You can download and install these versions without Composer; any dependencies are resolved automatically.
Package: goutte
Short description: A simple PHP Web Scraper
License: MIT
Homepage: https://github.com/fabpot/Goutte
Information about the package goutte
Goutte, a simple PHP Web Scraper
Goutte is a screen scraping and web crawling library for PHP.
Goutte provides a nice API to crawl websites and extract data from the HTML/XML responses.
Requirements
Goutte works with PHP 5.3.3 or later.
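A script can verify these requirements before it loads Goutte. The following is a minimal sketch that uses only built-in PHP functions. The helper name goutte_requirements_met is hypothetical and not part of Goutte, and the cURL check is an assumption based on the ext-curl dependency listed for this package.

```php
<?php
// Hypothetical helper (not part of Goutte): reports whether a PHP version
// and a list of loaded extensions satisfy the documented requirements.
function goutte_requirements_met($phpVersion, array $loadedExtensions)
{
    // Goutte works with PHP 5.3.3 or later.
    $phpOk = version_compare($phpVersion, '5.3.3', '>=');
    // Assumption: the curl extension is needed because the package
    // lists ext-curl among its dependencies.
    $curlOk = in_array('curl', $loadedExtensions, true);

    return $phpOk && $curlOk;
}

// Check the interpreter that is running this script.
if (!goutte_requirements_met(PHP_VERSION, get_loaded_extensions())) {
    error_log('Goutte needs PHP 5.3.3 or later with the curl extension.');
}
```

Passing the version and extension list as arguments keeps the check easy to exercise with values other than those of the running interpreter.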
Installation
Installing Goutte is as easy as it can get. Download the Goutte.phar file and you're done!
Usage
Require the Goutte phar file to use Goutte in a script:
require_once '/path/to/goutte.phar';
Create a Goutte Client instance (which extends Symfony\Component\BrowserKit\Client):
use Goutte\Client;
$client = new Client();
Make requests with the request() method:
// Go to the symfony.com website
$crawler = $client->request('GET', 'http://www.symfony.com/blog/');
The method returns a Crawler object (Symfony\Component\DomCrawler\Crawler).
Click on links:
// Click on the "Security Advisories" link
$link = $crawler->selectLink('Security Advisories')->link();
$crawler = $client->click($link);
Extract data:
// Get the latest posts in this category and display their titles
$crawler->filter('h2.post > a')->each(function ($node) {
    print $node->text()."\n";
});
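To show what the selector h2.post > a matches, here is a rough equivalent written with PHP's built-in DOM extension instead of Goutte. It is only an illustrative sketch and not a description of how Goutte works internally: the sample HTML, the function name extract_post_titles, and the hand-written XPath expression are all assumptions made for this example.

```php
<?php
// Hypothetical helper: returns the text of every <a> that is a direct child
// of an <h2> carrying the class "post" (the CSS selector "h2.post > a").
function extract_post_titles($html)
{
    $doc = new DOMDocument();
    // Suppress the warnings libxml raises on imperfect real-world HTML.
    @$doc->loadHTML($html);

    $xpath = new DOMXPath($doc);
    // Hand-written XPath for "h2.post > a"; the class test tolerates
    // additional class names on the same element.
    $query = "//h2[contains(concat(' ', normalize-space(@class), ' '), ' post ')]/a";

    $titles = array();
    foreach ($xpath->query($query) as $node) {
        $titles[] = trim($node->textContent);
    }

    return $titles;
}

// Made-up sample markup, for illustration only.
$html = '<html><body>'
      . '<h2 class="post"><a href="/a">First post</a></h2>'
      . '<h2 class="post featured"><a href="/b">Second post</a></h2>'
      . '<h2 class="other"><a href="/c">Not a post</a></h2>'
      . '</body></html>';

foreach (extract_post_titles($html) as $title) {
    print $title."\n";
}
```

With the sample markup, only the first two link texts are printed, because the third heading does not carry the post class.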
Submit forms:
// Open GitHub and follow the "Sign in" link
$crawler = $client->request('GET', 'http://github.com/');
$crawler = $client->click($crawler->selectLink('Sign in')->link());
// Fill in and submit the login form
$form = $crawler->selectButton('Sign in')->form();
$crawler = $client->submit($form, array('login' => 'fabpot', 'password' => 'xxxxxx'));
// Print any error messages shown after the submission
$crawler->filter('.flash-error')->each(function ($node) {
    print $node->text()."\n";
});
More Information
Read the documentation of the BrowserKit and DomCrawler Symfony Components for more information about what you can do with Goutte.
Technical Information
Goutte is a thin wrapper around the following fine PHP libraries:
- Symfony Components: BrowserKit, ClassLoader, CssSelector, DomCrawler, Finder, and Process;
- Guzzle HTTP Component.
License
Goutte is licensed under the MIT license.
All versions of goutte with dependencies
ext-curl: *
symfony/browser-kit: ~2.1
symfony/css-selector: ~2.1
symfony/dom-crawler: ~2.1
symfony/finder: ~2.1
symfony/process: ~2.1
guzzle/http: ~3.1
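The tilde constraints in this list follow Composer's notation, where a two-part constraint such as ~2.1 accepts version 2.1 and later but excludes 3.0. The sketch below illustrates that range check in plain PHP. The function name satisfies_tilde_constraint is hypothetical, it handles only the two-part form used in the list, and it ignores pre-release and stability rules, so it is not a substitute for Composer's own resolver.

```php
<?php
// Hypothetical helper: checks a version against a two-part tilde
// constraint such as "~2.1" (at least 2.1, below 3.0).
function satisfies_tilde_constraint($version, $constraint)
{
    // Only the "~major.minor" form is supported in this sketch.
    if (!preg_match('/^~(\d+)\.(\d+)$/', $constraint, $m)) {
        throw new InvalidArgumentException("Unsupported constraint: $constraint");
    }

    $lower = $m[1].'.'.$m[2];        // inclusive lower bound, e.g. "2.1"
    $upper = ((int) $m[1] + 1).'.0'; // exclusive upper bound, e.g. "3.0"

    return version_compare($version, $lower, '>=')
        && version_compare($version, $upper, '<');
}

// Example: would symfony/dom-crawler 2.8.52 satisfy "~2.1"?
var_dump(satisfies_tilde_constraint('2.8.52', '~2.1'));
```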