Text Mining

Archived; click post to view. Excerpt: Due to the interesting results we found by creating Typology, I am currently reading the related work on query prediction and auto-completion of sentences. There is quite a lot of interesting academic work available in this area of information retrieval. While reading these papers I realized that I am not [...]

Continue reading about Foundations of statistical natural language processing: Review of chapter 1

René Pickhardt on February 16th, 2011

Archived; click post to view. Excerpt: Wikipedia is an amazing data set for all kinds of research, going far beyond text mining. The best thing about Wikipedia is that it is licensed under a Creative Commons license. So you are allowed to download Wikipedia and use it in any way you want. [...]

Continue reading about How to download Wikipedia

René Pickhardt on February 15th, 2011

Recently there has been a lot of news on the Web about IBM’s natural language processing system Watson. As you might have heard, Watson is currently challenging two of the best Jeopardy players in the US. Many news magazines compare Watson with Google, which is the reason for this article. Even though the algorithms behind Watson and Google are not open source, a lot of estimates and guesses can still be made about the algorithms both systems use to give intelligent answers to the questions people ask them. Based on these guesses, I will explain the differences between Google and Watson.

Continue reading about IBM’s Watson & Google – What is the difference?
