Open source toolkit for preprocessing and extracting information from unstructured data like PDFs, HTML, Word docs, etc.
OtherFree
Open source toolkit for preprocessing and extracting information from unstructured data like PDFs, HTML, Word docs, etc.