http://www.google.com/search?q=http

Justin Mason jm-fork at jmason.org
Mon Oct 20 12:45:49 PDT 2003


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1


Lucas Gonze writes:
>On Monday, Oct 20, 2003, at 15:26 America/New_York, Justin Mason wrote:
>> old bits I'm afraid!
>> Try also cgi-bin (NetSol), https (grc.com!), www (Yahoo).
>
>So what do you think its train of thought is?  The string "http" has no 
>semantic association with Yahoo et al that I can see.  That string 
>should point at the W3C, IETF, misc protocol hackers.  Getting back the 
>web A list seems like an error.

It's an artifact of google's tokenizer -- it's splitting 

	http://www.yahoo.com/
	
into

	http www yahoo com

or similar -- and is now indexing link URLs, as well as link text and
linked-page text.   So Yahoo! must be the most-linked page on the web...
or possibly most-linked with a bit of good PageRank thrown in as well ;)

- --j.
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.2 (GNU/Linux)
Comment: Exmh CVS

iD8DBQE/lDttQTcbUG5Y7woRAl/JAJ4oS7r66RUgfhkcYo1X6T1ePLTm6ACglBu+
VzCw3KH2N4GUpJ0CqRypzok=
=GHVm
-----END PGP SIGNATURE-----



More information about the FoRK mailing list