screen scraping

**DJ Majestik** · Jul 17 '05, 01:48 PM

Re: screen scraping

How about reg ex'ing for particular error codes? You can look for
specific ones like 404, 500, should be the normal ones you would be
getting. If you see that in your return code, you know you have an
error.

HTH

JJ

**Clintster** · Jul 17 '05, 01:48 PM

Re: screen scraping

I discovered if I set the URL check in a variable and check the
variable, the error will not be output.

i.e.

function urlcheck($url, $sitelink) {
$urlup = @file($url);
// grab code from web site
if ($urlup){
$html = file_get_conten ts($url);
//REGEX to pull the link code out of the array
$relink = "/<a.+?href=[\"\'](.*?)[\"\'].+?\>/i";

// Put the matching link code into an array called links
preg_match_all( $relink, $html, $links);

// loop through links on the page and look for a match
for ($i=0; $i< count($links[0]); $i++) {
if ( strpos($links[1][$i], $sitelink) != false ||
strpos($links[1][$i], $sitelink) === 0 ) {
return $links[0][$i];
break;
}
}
}
else {
print "Doesn't exist";
}
}

screen scraping

screen scraping

Comment

Comment