14. March 2010 22:56
A few years ago I worked on a project called Atrax which, among other things, included an implementation of the keyword extraction work of Yutaka Matsuo of the National Institute of Advanced Industrial Science and Technology in Tokyo and Mitsuru Ishizuka of the University of Tokyo.
I decided to revisit the keyword extraction algorithm, update it a bit, and isolate it from the rest of the Atrax code to make it easier for anyone to use. You can download the code here: Keyword.zip (17.87 KB).
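For readers who want a feel for how a co-occurrence approach to keyword extraction can work before opening the download, here is a rough sketch in Python. It is not the code in Keyword.zip and it is not a faithful reproduction of the Matsuo and Ishizuka method. It assumes a simplified scoring scheme: count how often each term shares a sentence with the most frequent terms, then score the term with a chi-square style measure of how far those counts deviate from what independence would predict. The function names, the tiny stopword list, and the default parameters are all made up for illustration.

```python
import re
from collections import Counter

# Deliberately tiny, illustrative stopword list (an assumption, not from the download).
STOPWORDS = frozenset(
    "a an and are as at be by for from in is it of on or that the this to was with".split()
)


def split_sentences(text):
    """Split on runs of '.', '!' or '?' and drop empty pieces."""
    return [s for s in re.split(r"[.!?]+", text) if s.strip()]


def tokenize(sentence):
    """Lowercase the sentence, keep alphabetic words, drop stopwords."""
    words = re.findall(r"[a-z]+(?:'[a-z]+)?", sentence.lower())
    return [w for w in words if w not in STOPWORDS]


def extract_keywords(text, top_frequent=10, top_keywords=10):
    """Rank terms by a chi-square style co-occurrence score (simplified sketch)."""
    # Each sentence becomes a set of terms; repeats inside a sentence are ignored.
    sentences = [set(tokenize(s)) for s in split_sentences(text)]
    sentences = [s for s in sentences if s]
    if not sentences:
        return []

    total = len(sentences)
    # Number of sentences each term appears in.
    sentence_count = Counter(term for s in sentences for term in s)
    frequent = [term for term, _ in sentence_count.most_common(top_frequent)]

    scores = {}
    for term, n_term in sentence_count.items():
        chi2 = 0.0
        for g in frequent:
            if g == term:
                continue
            # Expected co-occurrences if term and g were independent.
            expected = n_term * sentence_count[g] / total
            observed = sum(1 for s in sentences if term in s and g in s)
            chi2 += (observed - expected) ** 2 / expected
        scores[term] = chi2

    # Highest score first; ties broken alphabetically so output is deterministic.
    ranked = sorted(scores, key=lambda t: (-scores[t], t))
    return ranked[:top_keywords]
```

Calling `extract_keywords(some_text, top_keywords=10)` returns up to ten single-word terms. A sketch this simple returns single words rather than phrases, and it tends to over-reward words that appear only once, so treat its rankings as illustrative only.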
Here are the top ten keywords the code returns from the Gettysburg Address and from Scott Gu’s most recent blog post:
VS 2010 and Visual Web Developer
Web Developer 2008 Express
Improved Visual Studio
Let me know if you end up using this implementation of the algorithm, or if you happen to make improvements to it.