{"id":66,"date":"2006-10-13T01:51:59","date_gmt":"2006-10-13T00:51:59","guid":{"rendered":"http:\/\/www.kiberpipa.org\/~gandalf\/blog\/?p=66"},"modified":"2006-10-13T01:53:07","modified_gmt":"2006-10-13T00:53:07","slug":"seeing-lots-of-wikipedia-in-your-google-searches","status":"publish","type":"post","link":"https:\/\/www.jurecuhalev.com\/blog\/seeing-lots-of-wikipedia-in-your-google-searches\/","title":{"rendered":"Seeing lots of Wikipedia in your Google searches?"},"content":{"rendered":"<p>In August and September 2006 various bloggers (<a href=\"http:\/\/www.roughtype.com\/archives\/2006\/08\/the_oracle_of_w.php\">Nicholas G. Carr<\/a>, <a href=\"http:\/\/www.micropersuasion.com\/2006\/09\/study_wikipedia.html\">Steve Rubel<\/a>, <a href=\"http:\/\/www.tbray.org\/ongoing\/When\/200x\/2006\/09\/15\/Wikipedia\">Tim Bray<\/a>, and others) started to notice that Wikipedia often shows up on Google for their searches.<\/p>\n<p>To research this recent phenomena more throughly I decided to try to do a simple random sampling on whole Wikipedia (together with redirects makes it to ~2.7 million titles) and then try to Google, Yahoo and MSN those articles.<\/p>\n<p>So, how likely is it? It turns out that it is <b>very<\/b> likely actually. You have about <i>81 %<\/i> chance to get Wikipedia link in top 10 results.<\/p>\n<p>(pictures follow, so if you don&#8217;t see them in your RSS feeds go to my blog page)<\/p>\n<p>Here is a nice pie for Google Wikipedia results count for top 10 results:<\/p>\n<p><a href=\"http:\/\/www.kiberpipa.org\/~gandalf\/blog-files\/wikistatus\/google-count.png\"><img decoding=\"async\" src=\"http:\/\/www.kiberpipa.org\/~gandalf\/blog-files\/wikistatus\/small_google-count.png\"\/><\/a><\/p>\n<p>and we can do this of course also for other search engines like Yahoo! or MSN. This gives us  nice combined trend lines:<\/p>\n<p><a href=\"http:\/\/www.kiberpipa.org\/~gandalf\/blog-files\/wikistatus\/combined-resultcount.png\"><img decoding=\"async\" src=\"http:\/\/www.kiberpipa.org\/~gandalf\/blog-files\/wikistatus\/small_combined-resultcount.png\"\/><\/a><\/p>\n<p>But then comes the question, how high do those Wikipedia articles rank? Well it turns out that if you are Yahoo it&#8217;s probably #1 result in ~47% of cases and in top 3 in ~76% cases. It&#8217;s in top 3 for Google only in ~66% of cases.<\/p>\n<p><a href=\"http:\/\/www.kiberpipa.org\/~gandalf\/blog-files\/wikistatus\/highest-rank-combined.png\"><img decoding=\"async\" src=\"http:\/\/www.kiberpipa.org\/~gandalf\/blog-files\/wikistatus\/small_highest-rank-combined.png\"\/><\/a><\/p>\n<p>If you want to read more about it you can download <a href=\"http:\/\/www.kiberpipa.org\/~gandalf\/blog-files\/wikistatus\/wikistatus.pdf\">Full report<\/a> (PDF, 7 pages) and also check out <a href=\"http:\/\/www.kiberpipa.org\/~gandalf\/blog-files\/wikistatus\/appendix.pdf\">Appendix<\/a> that has more pictures and also statistical outputs if you feel like doubting my interpretations or would just like to see more detailed numbers behind it.<\/p>\n<p>Special thanks go to Matej for giving me helpful hints about sampling methodology and professor <a href=\"http:\/\/www.dsv.su.se\/~hercules\/\">Hercules Dalianis<\/a> who approved this subject as my assignment and thus forced me to actually finish it.<\/p>\n<p>Please post any suggestions or comments about this research in comments. If there is enough interest for further analysis I will be extend it with time.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>In August and September 2006 various bloggers (Nicholas G. Carr, Steve Rubel, Tim Bray, and others) started to notice that Wikipedia often shows up on Google for their searches. To research this recent phenomena more throughly I decided to try to do a simple random sampling on whole Wikipedia (together with redirects makes it to [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[14],"tags":[],"class_list":["post-66","post","type-post","status-publish","format-standard","hentry","category-tech"],"acf":[],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/www.jurecuhalev.com\/blog\/wp-json\/wp\/v2\/posts\/66","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.jurecuhalev.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.jurecuhalev.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.jurecuhalev.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.jurecuhalev.com\/blog\/wp-json\/wp\/v2\/comments?post=66"}],"version-history":[{"count":0,"href":"https:\/\/www.jurecuhalev.com\/blog\/wp-json\/wp\/v2\/posts\/66\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.jurecuhalev.com\/blog\/wp-json\/wp\/v2\/media?parent=66"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.jurecuhalev.com\/blog\/wp-json\/wp\/v2\/categories?post=66"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.jurecuhalev.com\/blog\/wp-json\/wp\/v2\/tags?post=66"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}