Support for MS WORD PDF and Excel documents

Coordinator
Jul 1, 2009 at 7:34 PM

After a significant amount of work, I have finally managed to put the indexing infrastructure up and running. Now that the core infrastructure components of the project -  the indeing infrastructure and the earch infrastructure are ready and seem to be working fine, I now plan to go ahead and add the following new features

1.) Support for reading MS Word, PDF and Excel documents - Whie this may seem simple at first I still need to research the dependencies on the COM components that should be used to perform these tasks.

2.) Optimize search and index creation echanism by filerting out noise words.

3.) Logging classes that can be used by developers of the client to understand what is going on whenever an API in this DLL is called.

 

 

That's all for now!!

 

Cheers

Prahalad