javascriptphpweb-scrapingcurlcookies

How to scrape page requiring cookies and javascript in PHP


Is there an easy way to emulate cookies and javascript with a php script scraping a web page requiring it?

The current response shows:

<body><noscript>This site requires JavaScript and Cookies to be enabled. Please change your browser settings or upgrade your browser.</noscript></body>

I put this in the code and it made no difference:

$strCookie = 'PHPSESSID=' . $_COOKIE['PHPSESSID'] . '; path=/';
curl_setopt( $ch, CURLOPT_COOKIE, $strCookie );

Solution

  • HTML inside the <noscript> </noscript> will only get displayed if Javascript is disabled. So this is the test for whether Javascript is enabled.

    There is jQuery-cookie-plugin for reading, writing and deleting cookies. But it will not work if Javascript is disabled.

    For checking if cookie is enabled with PHP code, you need to set the cookie and read the cookie with another request. This post have some guides on it.