phpsymfonyweb-scrapinggoutte

Scrape Goutte and waiting for full loading


I try to scrape via Goutte list from popular auction web service, but pieces of they code are rendered by javascript and problem is that Goutte return me only DOM without JS job. So if symphony are headless browser is it posiibile to wait with scraping until full content is rendered? I checked phantomjs where is normal, but I've never found that ability in Goute..


Solution

  • https://github.com/FriendsOfPHP/Goutte

    Goutte does not parse Javascript, it is not a headless browser it just downloads the content and parse it without running javascript.

    Technical Information

    Goutte is a thin wrapper around the following fine PHP libraries:

    Symfony Components: BrowserKit, CssSelector and DomCrawler; Guzzle HTTP Component.