09-13-2007, 06:56 PM
Boss Cart consultant
Quote:
Originally Posted by temi
Am I correct?
Thanks

Absolutely correct. I was the one who added the Yahoo Slurp validation.
Please test your site after installing.

09-13-2007, 07:00 PM
Super Moderator
Thanks, Bagi.
I will copy the code into a file named reversedns.php, upload it,
and add <?php include("reversedns.php"); ?>
Thanks

09-13-2007, 07:04 PM
Super Moderator
Quote:
Originally Posted by Bagi Zoltán
Thanks, Imran. On my SEO directory the Yahoo Slurp validation is not in the code, because the script has some problem with it there. I use this reversedns.php file:
PHP Code:
<?php
// Get the user agent (empty string if the header is missing).
$ua = isset($_SERVER['HTTP_USER_AGENT']) ? $_SERVER['HTTP_USER_AGENT'] : '';
// Check whether the user agent identifies itself as a search engine bot.
if (stristr($ua, 'msnbot') || stristr($ua, 'googlebot')) {
    // The user agent is purporting to be MSN's bot or Google's bot.
    // If the user agent string is spoofed, the host name will not end in
    // googlebot.com or search.live.com.
    // Get the IP address requesting the page.
    $ip = $_SERVER['REMOTE_ADDR'];
    // Reverse DNS lookup the IP address to get a hostname.
    $hostname = gethostbyaddr($ip);
    // Check for '.googlebot.com' and 'search.live.com' at the end of the hostname.
    if (!preg_match("/\.googlebot\.com$/", $hostname) && !preg_match("/search\.live\.com$/", $hostname)) {
        // The host name belongs to neither live.com nor googlebot.com.
        // Remember the UA already said it is either MSNBot or Googlebot.
        $block = TRUE;
        header("HTTP/1.0 403 Forbidden");
        exit;
    } else {
        // Now we have a hit that half-passes the check. One last go:
        // Forward DNS lookup the hostname to get an IP address.
        $real_ip = gethostbyname($hostname);
        if ($ip != $real_ip) {
            // The hostname does not resolve back to the requesting IP.
            $block = TRUE;
            header("HTTP/1.0 403 Forbidden");
            exit;
        } else {
            // Real bot.
            $block = FALSE;
        }
    }
}
?>
which validates only Googlebot and MSNBot.
Imran, don't forget that I am not the person who developed this solution; I am only sharing it.

If the Yahoo Slurp validation did not work for you, does that mean it is not guaranteed to work for others?

09-13-2007, 07:16 PM
Boss Cart consultant
No, Imran. It means that this version won't validate Yahoo!'s spider, but it does validate MSNBot and Googlebot.

09-13-2007, 07:20 PM
Facilitator
If it does not validate the Yahoo spider, can Yahoo still crawl the site, and it just will not be able to cache it?

09-13-2007, 07:21 PM
Super Moderator
Thanks Bagi 

09-13-2007, 07:36 PM
Boss Cart consultant
Quote:
Originally Posted by temi
If it does not validate the Yahoo spider, can Yahoo still crawl the site, and it just will not be able to cache it?

It means that when Yahoo! Slurp comes in through a proxy link, it will see the page as duplicate content. The code in the first post validates the Yahoo spider as well, but my older indexscript site doesn't like it for some reason, so there I use the "non-Yahoo-validating" version. Today I tested the first code, which validates the three major spiders, on my other site, and it seemed to work well.
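
For anyone following along, here is a minimal sketch of how a check covering all three spiders could look. It is not the code from the first post. The names is_spoofed_bot and BOT_HOST_PATTERNS are hypothetical, and the 'slurp' user agent token and the crawl.yahoo.net hostname suffix are my assumptions about Yahoo!'s crawler, so please verify them before relying on this. The two optional resolver parameters exist only so the logic can be tried without real DNS lookups.

```php
<?php
// Hypothetical helper: a sketch, not the script from the first post.
// Maps a user agent token to the pattern its reverse-DNS hostname must match.
// The Google and MSN patterns are the ones from reversedns.php;
// the Yahoo! entry is an ASSUMPTION and should be verified.
const BOT_HOST_PATTERNS = [
    'googlebot' => '/\.googlebot\.com$/i',
    'msnbot'    => '/search\.live\.com$/i',
    'slurp'     => '/\.crawl\.yahoo\.net$/i',
];

// Returns true when the user agent claims to be a listed spider
// but the IP address fails the reverse + forward DNS check.
function is_spoofed_bot(string $ua, string $ip, ?callable $reverse = null, ?callable $forward = null): bool
{
    $reverse = $reverse ?? 'gethostbyaddr';
    $forward = $forward ?? 'gethostbyname';

    foreach (BOT_HOST_PATTERNS as $token => $pattern) {
        if (stripos($ua, $token) === false) {
            continue; // The UA does not claim to be this spider.
        }
        // Reverse lookup: the hostname must belong to the spider's domain.
        $hostname = $reverse($ip);
        if (!is_string($hostname) || !preg_match($pattern, $hostname)) {
            return true;
        }
        // Forward lookup: the hostname must resolve back to the same IP.
        return $forward($hostname) !== $ip;
    }
    return false; // Ordinary visitor, no DNS lookup needed.
}

// Usage: run this before any output is sent, since header() needs that.
if (is_spoofed_bot($_SERVER['HTTP_USER_AGENT'] ?? '', $_SERVER['REMOTE_ADDR'] ?? '')) {
    header('HTTP/1.0 403 Forbidden');
    exit;
}
```

Keeping the spiders in one table means a site that has trouble with the Yahoo! check can simply remove the 'slurp' entry and still validate the other two.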

09-13-2007, 08:32 PM
Facilitator
Thanks for that clarification, Bagi.

09-14-2007, 08:09 AM
Super Moderator
I set this up yesterday with help from Bagi, and it seems to work fine.
I have not yet set up the Yahoo Slurp validation, but I will be doing that tonight.

09-14-2007, 10:59 AM
Boss Cart consultant
The WordPress plugin is on the way. If everything goes well, it will be released tomorrow.