I am right now sitting in the city of Koblenz and I was typing in a search for weather (Wetter in German) Google returns me the weatherforcast for Koblenz. Afterwards I used a proxy tunnel to my webserver of www.rene-pickhardt.de which is located in the city of Stuttgart. After doing the exact same search on the same computer with the same browser Google returned the weather forcast of Stuttgart.
google search for weather. My location was Koblenz
Google search for weather when using a proxy tunnel to my Location Stuttgart
Since my blog is run on a shared hosting I logged in to my third webserver of www.metalcon.de which is a root server. Google coulden’t personalzie the search in this case because Google doesn’t know where the server is located. But Google asks me to enter the location where I am.
Googleing the weather is kind of boaring so I did some other queries for bars and nightclubs (also in German = “bars und kneipen”) Again I got different results for Koblenz and Stuttgart. The interesting thing is that for both locations the results have been very general inlcuding also city guides from Berlin / Hamburg / Köln Munich and so on. Just the snippets from google maps have been tailored to my location.
Uff I have to say after 57 minutes of brainstorming I am running out of ideas for the moment. But this might be because it is already one hour after midnight!
If you have some other ideas for signals or think some of my guesses are totally unreasonable, why don’t you tell me in the comments?
Disclaimer: this list of signals is a pure guess based on my knowledge and education on data mining. Not one signal I name might correspond to the 57 signals google is using. In future I might discuss why each of these signals could be interesting. But remember: as long as you have a high diversity in the distribution you are fine with any list of signals.
Eli is pointing out a thing some people might have already noticed. If two different people search for the same thing on Google it is very probable that the search results will be very different. Google is doing this without telling the user that it is acutally filtering the results based on what the algorithm thinks the user might like. According to Eli Pariser Google is using 57 signals to determine the interest of us. Among those we find:
Of course this kind of personalization has its good sides. When I am about to buy a new notebook computer y I definitely want to see different Websites if I live in Germany or in the US. This could be due to tax and shipping fees. Which means that I am most probably interested in local stores and not in oversea shops. Still this personalization and filtering is a huge potential for serious problems. Let me ask a few questions:
We might think we get all the information we need. But in reality we are becoming blinded by the filters Google is using. We have no chance to determine what other information is filtert and potentially available for a certain topic. On the other hand due to the amount of information we need filters and computers to help us. But the systems should be more transparent!
I have always been thinking Facebook’s huge success is strongly correlated to the fact that there is hardly Spam on Facebook and the information economy is very smart and user friendly. The attention of users to status updates is very high making facebook a great place for every company to do online and viral marketing. This of course contributes to Facebook’s reach. In fact the information architecture on Facebook is even so smart that your 20’000 followers on Facebook might not receive your status updates since Facebook’s EdgeRank algorithm decides it is not relevant to your fans or friends. Edgerank might not have 57 signals but it still takes into consideration:
Great news isn’t it? Just compare this with my statement in a recent blog post about creating newsletters as a musician in order to communicate with your fans and not solely rely on other services like Facebook or MySpace.
You don’t believe the Facebook thing? There is a video about the EdgeRank algorithm used by Facebook to determine which status updates should reach us and which shouldn’t. Feel free to have a look and thanks to the guys from Klurig Analytics for producing such a great video resource:
Again thanks a lot to Eli Pariser to start this discussion!
]]>