 |
|

03-30-2006, 06:55 PM
|
 |
Facilitator
5304 posts this year. Platinum VIP!Trusted Member - This user is a Master!
Last months UKWW Tokens: 270
|
|
|
Join Date: Jun 2003
Location: London, England.
Posts: 11,764
Thanks: 3
Thanked 22 Times in 15 Posts
Nominated 0 Times in 0 Posts
TOTW/F/M Award(s): 0
|
|
sound more like the definition of jealousy 
|

03-31-2006, 02:46 AM
|
|
|
Just out of curiosity, how exactly is this going to work anyway? If I take dmoz as an example, they have plenty of links to dead sites as do many directories I've run across. With the ever changing dynamic of the www., how can a human edited Search Engine hope to stay current with every site in its listing?
|

03-31-2006, 07:53 AM
|
 |
Super Moderator
992 posts this year. Platinum VIP!Trusted Member - This user is a Master!
Last months UKWW Tokens: 8
|
|
|
Join Date: Oct 2004
Posts: 2,272
Thanks: 2
Thanked 11 Times in 11 Posts
Nominated 0 Times in 0 Posts
TOTW/F/M Award(s): 0
|
|
Duke, it isn't human edited, it is human seeded, there is a difference. The initial seeding will be done manually. This is wjhat Zoltan Gyongyi said in the original study named
Combating Web Spam with TrustRank. Technical Report, Stanford University, 2004
Web spam pages use various techniques to achieve higher-than-deserved rankings in a search engine's results. While human experts can identify spam, it is too expensive to manually evaluate a large number of pages. Instead, we propose techniques to semi-automatically separate reputable, good pages from spam. We first select a small set of seed pages to be evaluated by an expert. Once we manually identify the reputable seed pages, we use the link structure of the web to discover other pages that are likely to be good. In this paper we discuss possible ways to implement the seed selection and the discovery of good pages. We present results of experiments run on the World Wide Web indexed by AltaVista and evaluate the performance of our techniques. Our results show that we can effectively filter out spam from a significant fraction of the web, based on a good seed set of less than 200 sites.
Original doc here http://dbpubs.stanford.edu:8090/pub/2004-17
|

03-31-2006, 12:24 PM
|
|
|
I'm missing the point somehow OWG so I sincerely apologize for my ignorance. I think I'll take some time this weekend and give the link a good long read and maybe post back here with further questions.
I guess what doesn't add up for me is by the sounds of things, at some point a human will have to be involved in quantifying either a page, search result or some other info., and with the billions of sites online and more added every minute of each day, I just don't see how this can work.
Anyway, I was never a math genious so if I'm rambling just tell me to stfu.
|

03-31-2006, 06:04 PM
|
 |
Super Moderator
992 posts this year. Platinum VIP!Trusted Member - This user is a Master!
Last months UKWW Tokens: 8
|
|
|
Join Date: Oct 2004
Posts: 2,272
Thanks: 2
Thanked 11 Times in 11 Posts
Nominated 0 Times in 0 Posts
TOTW/F/M Award(s): 0
|
|
Ok Duke,
Just for you I have knocked this flow chart up to better explain it.
Please not that this image is copyrighted 
|

03-31-2006, 06:59 PM
|
 |
Facilitator
5304 posts this year. Platinum VIP!Trusted Member - This user is a Master!
Last months UKWW Tokens: 270
|
|
|
Join Date: Jun 2003
Location: London, England.
Posts: 11,764
Thanks: 3
Thanked 22 Times in 15 Posts
Nominated 0 Times in 0 Posts
TOTW/F/M Award(s): 0
|
|
This illustration helps explain trust ranking better, questions, what sites apart from the one below will fall within the category of trusted site? Are trusted sites the same site link vendors call authority site?
I can think of the following sites as Trusted site, which other sites fits the bill or which of the one I listed should be removed?
dmoz
yahoo directory
institution like BBC
Government website
Major newspaper sites
Banks
what other sites?
|

03-31-2006, 08:06 PM
|
 |
Super Moderator
992 posts this year. Platinum VIP!Trusted Member - This user is a Master!
Last months UKWW Tokens: 8
|
|
|
Join Date: Oct 2004
Posts: 2,272
Thanks: 2
Thanked 11 Times in 11 Posts
Nominated 0 Times in 0 Posts
TOTW/F/M Award(s): 0
|
|
Temi, My understanding is that they will not just say 'major newspapers' after all in the UK the main newspapers are the sun and the news of the world, both of which will link to any old crap. Certainly they might look at the newspapers, and decide that the Timnes, FT, Guardian, Observer are ok, but the daily sport, daily mirror sun etc are junk.
I would also say that google might split Dmoz into areas that are trusted, and areas that are not. It would not surprise me to see DMOZ devalued to almost nothing. It would also not surprise me to see google make an offer to Netscape for DMOZ to use as their main seed.
|

03-31-2006, 08:24 PM
|
 |
Facilitator
5304 posts this year. Platinum VIP!Trusted Member - This user is a Master!
Last months UKWW Tokens: 270
|
|
|
Join Date: Jun 2003
Location: London, England.
Posts: 11,764
Thanks: 3
Thanked 22 Times in 15 Posts
Nominated 0 Times in 0 Posts
TOTW/F/M Award(s): 0
|
|
Thanks very much for that OWG,
Yes, the though of Sun being regarded as a trusted authority site is frightening  When I think of newspapers, Sun, Mirrors and other tabloid don't cross my mind at all.
I some how cannot see Google making and offer for DMOZ , they can already get the contents free of charge, starting their own directory is a much feasible/better proposition that buying DMOZ
|

03-31-2006, 09:47 PM
|
 |
Super Moderator
992 posts this year. Platinum VIP!Trusted Member - This user is a Master!
Last months UKWW Tokens: 8
|
|
|
Join Date: Oct 2004
Posts: 2,272
Thanks: 2
Thanked 11 Times in 11 Posts
Nominated 0 Times in 0 Posts
TOTW/F/M Award(s): 0
|
|
If they buy DMOZ, then remove all the editors, employ their own, they can control the content of it.
|

04-01-2006, 06:58 AM
|
 |
Facilitator
5304 posts this year. Platinum VIP!Trusted Member - This user is a Master!
Last months UKWW Tokens: 270
|
|
|
Join Date: Jun 2003
Location: London, England.
Posts: 11,764
Thanks: 3
Thanked 22 Times in 15 Posts
Nominated 0 Times in 0 Posts
TOTW/F/M Award(s): 0
|
|
Yes, but why bother doing that? it will be unpopular, give them some bad name saying they are throwing their weight about etc etc when is VERY easy to start their own directory from scratch an use a better system from the ground up. Like they created Google talk instead of buying the over priced Skype
|
|
Currently Active Users Viewing This Thread: 1 (0 members and 1 guests)
|
|
|
|
|