The tm-extractors library has a new release! v1.0. You can download it here:

http://text-mining.googlecode.com/files/tm-extractors-1.0.jar

The tm-extractors library is a pure java library for extracting text from Word documents. Notable improvements in this release:

The source is hosted by google project hosting. You can find info on how to access the svn repository at the url: http://code.google.com/p/text-mining/source/checkout. Watch this page for documentation and more helpful info in the coming weeks. I just wanted to get this out asap.

This latest release was brought to you by Benryan Software Inc.

Please note that the license has changed to LGPL beginning with this release and moving forward.