A glimpse at search engines of the future?

Text mining is a computer technique to extract useful information from unstructured text. And it’s a difficult task. But now, using a relatively new method named topic modeling, computer scientists from University of California, Irvine (UCI), have analyzed 330,000 stories published by the New York Times between 2000 and 2002 in just a few hours. They were able to automatically isolate topics such as the Tour de France, prices of apartments in Brooklyn or dinosaur bones. This technique could soon be used not only by homeland security experts or librarians, but also by physicians, lawyers, real estate people, and even by yourself.

Discovering topics in the NYT archives

Source: Text mining the New York Times

Do it now! Click here to subscribe to JimStroud 2.0.

Nothing says "Thanks for posting this Jim!" like Starbucks Coffee. Click here to buy me a cup (or two).

Send post as PDF to PDF Creator | PDF Converter | PDF Software | Create PDF

If you enjoyed this post, please consider to leave a comment or subscribe to the feed and get future articles delivered to your feed reader.

Comments

No comments yet.

Leave a comment

(required)

(required)