Amazing! Today I had a discussion with a coworker about transparency and the way companies should be more open about what they are doing! And what happens on the same day? One of my favourite webcompanies has decided to publish a short video taken from the weekly search quality meeting!

The proposed change by Lars Hellsten is that instead of only checking the first 10 words for possible spelling corrections one could predict which two words are most likely spelled wrong and add an additional window of +-5 words around them. They discuss how this change has much better scores than the old one.

The entire video is interesting because they say that semantic context is usually given by using 3 grams. My students used up to 5 grams in order to make their scentence prediction and the machine learning already told them that 4grams would be sufficient to make syntactically and semantically correct predictions.

Anyway enjoy this great video by Google and thanks to Google for sharing this:

If you like this post, you might like these related posts:

  1. Look for love video: Did DJ Sammy steel the video story from jubilees Love Language video? Watch both clips! UPDATE: (Feb. 8th. 2012) the originial love language video is...
  2. Download Google n gram data set and neo4j source code for storing it In the end of September I discovered an amazing data...
  3. What are the 57 signals google uses to filter search results? Since my blog post on Eli Pariser’s Ted talk about...
  4. IBM’s Watson & Google – What is the the difference? Recently there was a lot of news on the Web...
  5. Related-work.net – Product Requirement Document released! Recently I visited my friend Heinrich Hartmann in Oxford. We...

Sharing:

Tags: , , , , , ,

Leave a Reply

*

Close

Subscribe to my newsletter

You don't like mail?