Comments on: On Bots
http://www.metafilter.com/52060/On-Bots/
Comments on MetaFilter post On Bots
Sat, 03 Jun 2006 00:06:45 -0800
Sat, 03 Jun 2006 00:06:45 -0800
en-us
http://blogs.law.harvard.edu/tech/rss
60
On Bots
http://www.metafilter.com/52060/On-Bots
<a href="http://drunkmenworkhere.org/219">On Bots</a> - results of a year long experiment on search engine bot behaviour
post:www.metafilter.com,2006:site.52060
Fri, 02 Jun 2006 23:48:52 -0800
MetaMonkey
searchbotbinarysearchtreevisualisation
By: IronLizard
http://www.metafilter.com/52060/On-Bots#1328617
"PLEASE GOD, WHAT DOES IT ALL MEAN?!?!?!"
comment:www.metafilter.com,2006:site.52060-1328617
Sat, 03 Jun 2006 00:06:45 -0800
IronLizard
By: Tuwa
http://www.metafilter.com/52060/On-Bots#1328620
boo to cliffhangers.
comment:www.metafilter.com,2006:site.52060-1328620
Sat, 03 Jun 2006 00:14:11 -0800
Tuwa
By: hincandenza
http://www.metafilter.com/52060/On-Bots#1328632
That page almost seems to suggest the efficacy of the bots would be ranked MSN, Google, and Yahoo; the Yahoo bot seems <em>especially</em> stupid to not realize the pages it is crawling are valueless, whereas the MSN and Google bots figure this out much sooner, and basically stop crawling. There is a lot of work done on bot technology precisely to avoid getting trapped in autolink farms and spam pages, and to recognize valuable content. It's not perfect, but those trees seem to suggest MSN and Google are much further on in "smart" bots than Yahoo.
comment:www.metafilter.com,2006:site.52060-1328632
Sat, 03 Jun 2006 00:54:28 -0800
hincandenza
By: elpapacito
http://www.metafilter.com/52060/On-Bots#1328648
<i>"PLEASE GOD, WHAT DOES IT ALL MEAN?!?!?!"</i>
It means you are stupid son, but don't worry ! Even teh stupid can become prez !
comment:www.metafilter.com,2006:site.52060-1328648
Sat, 03 Jun 2006 03:10:05 -0800
elpapacito
By: IronLizard
http://www.metafilter.com/52060/On-Bots#1328650
<em>note: Help maintain a healthy, respectful discussion by focusing comments on the
issues, topics, and facts at hand -- not at other members of the site.</em>
Well now, at least I can read. Jackass.
comment:www.metafilter.com,2006:site.52060-1328650
Sat, 03 Jun 2006 03:29:27 -0800
IronLizard
By: Chuckles
http://www.metafilter.com/52060/On-Bots#1328658
<em>Yahoo bot seems especially stupid to not realize the pages it is crawling are valueless, whereas the MSN and Google bots figure this out much sooner, and basically stop crawling.</em>
The cumulative number of pageviews by month keeps going up for all of them, so there isn't any 'sooner' about it. MSN and google increase their attention in fits and starts, where yahoo's attention increases consistently, and you could argue that MSN eventually capped its attention level.
MSN and google both avoid the deep nodes, but can we call that spam avoiding behaviour (or link farm avoiding, whatever..)?
comment:www.metafilter.com,2006:site.52060-1328658
Sat, 03 Jun 2006 04:32:15 -0800
Chuckles
By: tellurian
http://www.metafilter.com/52060/On-Bots#1328660
I can't assimilate all the information but it sure draws pretty pictures. I like the Yahoo Slurp Tree most.
comment:www.metafilter.com,2006:site.52060-1328660
Sat, 03 Jun 2006 04:40:12 -0800
tellurian
By: econous
http://www.metafilter.com/52060/On-Bots#1328662
IronLizard, I have the feeling that elpapacito's tongue was in his cheek. In any case at least the diagrams of trees are sort of purty, can't you just enjoy those?
comment:www.metafilter.com,2006:site.52060-1328662
Sat, 03 Jun 2006 04:42:33 -0800
econous
By: Chuckles
http://www.metafilter.com/52060/On-Bots#1328663
I guess I am misreading that (on second review, in more ways than one. ARGH!). For one, the graphs aren't attention per month, they are cumulative.. And I am mixing up which ones have the 'fits and starts' behaviour..
So, MSN really does stop paying attention to the site, but you can't say the same about google. google only goes so deep, but it keeps coming back to view pages.
comment:www.metafilter.com,2006:site.52060-1328663
Sat, 03 Jun 2006 04:43:30 -0800
Chuckles
By: loquacious
http://www.metafilter.com/52060/On-Bots#1328664
<em>"PLEASE GOD, WHAT DOES IT ALL MEAN?!?!?!"</em>
"I SEE THEM EVERYWHERE, EVERYWHERE. YELLOW, BLACK AND, ERR, RECTANGULAR. EVERYWHERE! EVERYWHERE! DO YOU HEAR ME?"
"THERE, THERE..."
"EVERYWHERE, EVERYWHERE. YELLOW, BLACK, AND, ERR, RECTANGULAR! WITH... WEDGE SHAPES INSIDE."
"THERE THERE, JUST LIE BACK ON THE COUCH, MRS. ERR... RECTANGULAR."
comment:www.metafilter.com,2006:site.52060-1328664
Sat, 03 Jun 2006 05:03:05 -0800
loquacious
By: GuyZero
http://www.metafilter.com/52060/On-Bots#1328666
It's a pretty clever experimental design.
comment:www.metafilter.com,2006:site.52060-1328666
Sat, 03 Jun 2006 05:13:19 -0800
GuyZero
By: sswiller
http://www.metafilter.com/52060/On-Bots#1328673
<blockquote><i>On 2005-06-30 Googlebot visited node 1, the leftmost node. It did not crawl the path from the root to this node, so how did it find the page? Did it guess the URL or did it follow some external link?</i></blockquote>
I think the Googlebot is haunted.
comment:www.metafilter.com,2006:site.52060-1328673
Sat, 03 Jun 2006 05:57:46 -0800
sswiller
By: 3.2.3
http://www.metafilter.com/52060/On-Bots#1328681
it would be illuminating to see the experiment with more, shall we say, <em>interesting</em> valueless content. maybe grab some unique gutenberg-ish text, markov it up a bit, and place something more distinguishing on each page to see if search engine penetration isn't more thorough than it appears, or at least more so than spam bots. if this is an accurate picture of search engine penetration, it sure explains why search results for things i know are around turn up more build logs than anything else. google is reported to be very shallow and this experiment seems to confirm it, but with questionable seeding.
better seeding would probably not be in keeping with the spirit of the zero content symposium, though.
comment:www.metafilter.com,2006:site.52060-1328681
Sat, 03 Jun 2006 06:41:16 -0800
3.2.3
By: Tryptophan-5ht
http://www.metafilter.com/52060/On-Bots#1328686
<strong>"google is reported to be very shallow"</strong>
well really, how often is a page only accessible from a page which is only accessible from a page which is only ... past 12 levels? that just seems like bad organization design.
comment:www.metafilter.com,2006:site.52060-1328686
Sat, 03 Jun 2006 07:03:08 -0800
Tryptophan-5ht
By: 3.2.3
http://www.metafilter.com/52060/On-Bots#1329348
a) apparently, it's a lot more shallow than 12 levels. from the looks of this experiment, it may even be arbitrary.
b) people performing searches are hardly in a position to rectify the organizational design of the results they are trying to find, especially prior to finding the results.comment:www.metafilter.com,2006:site.52060-1329348Sat, 03 Jun 2006 21:21:41 -08003.2.3