Text Mining

Rene on February 16th, 2011

Archived; click post to view. Excerpt: Wikipedia is an amazing data set to do all different kinds of research which will go far beyond text mining. The best thing about Wikipedia is that it is licensed under creative common license. So you are allowed to download Wikipedia and use it in any way you want. [...]

Continue reading about How to download Wikipedia

Rene on February 15th, 2011

Recently there was a lot of news on the Web about IBM’s natural language processing system Watson. As you might have heard right now Watson is challenging two of the best Jeopardy players in the US. A lot of news magazines compare Watson with Google which is the reason for this article. Even though the algorithms behind Watson and Google are not open source still a lot of estimates and guesses can be made about the algorithms both computer systems use in order to give intelligent answers to the questions people ask them. Based on this guesses I will explain the differences between Google and Watson.

Continue reading about IBM’s Watson & Google – What is the the difference?

Close

Subscribe to my newsletter

You don't like mail?