Blocking php curl from scraping website content

  • knkk
    New Member
    • Jun 2007
    • 49

    Blocking php curl from scraping website content

    There is this function:

    Code:
    function disguise_curl($url) 
    { 
    	$curl = curl_init(); 
    
    	// setup headers - used the same headers from Firefox version 2.0.0.6
    	// below was split up because php.net said the line was too long. :/
    	$header[0] = "Accept: text/xml,application/xml,application/xhtml+xml,"; 
    	$header[0] .= "text/html;q=0.9,text/plain;q=0.8,image/png,*/*;q=0.5"; 
    	$header[] = "Cache-Control: max-age=0"; 
    	$header[] = "Connection: keep-alive"; 
    	$header[] = "Keep-Alive: 300"; 
    	$header[] = "Accept-Charset: ISO-8859-1,utf-8;q=0.7,*;q=0.7"; 
    	$header[] = "Accept-Language: en-us,en;q=0.5"; 
    	$header[] = "Pragma: "; //browsers keep this blank. 
    
    	curl_setopt($curl, CURLOPT_URL, $url); 
    	curl_setopt($curl, CURLOPT_USERAGENT, 'Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.9.2.3) Gecko/20100401 Firefox/3.6.3'); 
    	curl_setopt($curl, CURLOPT_HTTPHEADER, $header); 
    	curl_setopt($curl, CURLOPT_REFERER, 'http://www.google.com'); 
    	curl_setopt($curl, CURLOPT_ENCODING, 'gzip,deflate'); 
    	curl_setopt($curl, CURLOPT_AUTOREFERER, true); 
    	curl_setopt($curl, CURLOPT_RETURNTRANSFER, 1); 
    	curl_setopt($curl, CURLOPT_TIMEOUT, 10); 
    
    	$html = curl_exec($curl); //execute the curl command 
    	if (!$html) 
    	{
    		// report what went wrong and stop
    		echo "cURL error number: " . curl_errno($curl);
    		echo "cURL error: " . curl_error($curl);
    		curl_close($curl);
    		exit;
    	}
      
    	curl_close($curl); //close the connection 
    
    	return $html; //and finally, return $html 
    }
    ...that several people seem to use to scrape content off a website (to state the obvious, you would do "echo disguise_curl($url)").

    Is there any way to detect when someone is doing that to my site, and either block their access or show them a page with a specific message?
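
    The sort of server-side check I have in mind would be something like this (a rough, untested sketch; the header heuristics and the 403 response are just guesses at an approach, and a spoofed request like the one above would beat the User-Agent test):

    Code:
    <?php 
    // rough sketch: reject requests that look like bare cURL 
    // a client that fakes browser headers (like disguise_curl above) 
    // will pass these checks, so this only stops lazy scrapers 
    function looks_like_curl() 
    { 
    	$ua = isset($_SERVER['HTTP_USER_AGENT']) ? $_SERVER['HTTP_USER_AGENT'] : ''; 
    
    	// default cURL either sends no User-Agent or announces itself 
    	if ($ua === '' || stripos($ua, 'curl') !== false) 
    	{ 
    		return true; 
    	} 
    
    	// real browsers normally send Accept-Language; bare cURL does not 
    	if (empty($_SERVER['HTTP_ACCEPT_LANGUAGE'])) 
    	{ 
    		return true; 
    	} 
    
    	return false; 
    } 
    
    if (looks_like_curl()) 
    { 
    	header('HTTP/1.1 403 Forbidden'); 
    	echo 'Automated access is not permitted.'; 
    	exit; 
    } 
    ?>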

    I've experimented with some sites to see whether they manage to block this kind of access, and found that http://london.vivastreet.co.uk does. I haven't been able to figure out how, but maybe someone can.

    A second query: why would someone write a complicated function like that when file_get_contents($url) does the same thing? Is it to avoid suspicion?
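
    As far as I understand, a plain file_get_contents($url) sends no browser-like headers at all, which may be why people bother with cURL. You can get part of the way there with a stream context; a rough, untested sketch with example header values:

    Code:
    <?php 
    // untested sketch: file_get_contents with browser-like headers 
    // via a stream context (header values here are just examples) 
    $context = stream_context_create(array( 
    	'http' => array( 
    		'method'  => 'GET', 
    		'header'  => "User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.1)\r\n" . 
    		             "Referer: http://www.google.com\r\n", 
    		'timeout' => 10, 
    	), 
    )); 
    
    $html = file_get_contents('http://www.example.com/', false, $context); 
    if ($html === false) 
    { 
    	exit('request failed'); 
    } 
    echo $html; 
    ?>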

    Thank you very much for your time.
  • knkk
    New Member
    • Jun 2007
    • 49

    #2
    I just realized that the specific URL I'm not able to access with that function is http://london.vivastreet.co.uk/cars+london. Since that link wasn't accessible, I assumed the entire site wasn't, and so posted the home page URL, which does appear to be accessible through this function. Any idea why this is happening? Is the "+" in that URL doing something, or is there a way to block specific URLs from cURL?

  • knkk
    New Member
    • Jun 2007
    • 49

    #3
    I found the issue. I was running the URL through urldecode() before sending it to the disguise_curl() function, so the "cars+london" in the URL was becoming "cars london", which produced the error page I was seeing. So yes, cURL works for this page too.
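
    For anyone else who trips over this, a quick illustration (urldecode() treats "+" as an encoded space, while rawurldecode() leaves it alone):

    Code:
    <?php 
    echo urldecode('cars+london');    // prints: cars london 
    echo "\n"; 
    echo rawurldecode('cars+london'); // prints: cars+london 
    ?>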
