Search Engine Service

Posted by Admin on 01-Apr-2008 08:21

Search engines are powerful tools that have various applications beyond internet browsing. Open source engines such as Lucene make it easier to integrate searching capabilities into any system. This submission contains two Custom Java Services (an Index Builder service and a Query service). They work together: one processes files and incrementally builds/updates an index, the other queries that index and returns a list of search results as an XML document.
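To make the division of labor concrete, here is a minimal sketch of the two roles: one method adds or updates a document in an index, the other queries it and returns the hits as XML. It is a toy in-memory inverted index written with the standard library only. It does not use Lucene and is not the code in the attached zip; the class name ToyIndex, its methods, and the <results>/<result> element names are all assumptions made for illustration.

```java
import java.util.Collections;
import java.util.HashMap;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical stand-in for the Index Builder and Query services (not Lucene).
public class ToyIndex {
    // term -> sorted set of document names that contain the term
    private final Map<String, Set<String>> postings = new HashMap<>();

    // Index builder role: add a document, or update it if it was indexed before.
    public void addDocument(String name, String text) {
        // Remove stale postings so re-sending a file updates instead of duplicating.
        for (Set<String> docs : postings.values()) {
            docs.remove(name);
        }
        for (String term : text.toLowerCase().split("\\W+")) {
            if (!term.isEmpty()) {
                postings.computeIfAbsent(term, t -> new TreeSet<>()).add(name);
            }
        }
    }

    // Query role: look up a single term and render the hits as an XML string.
    public String query(String term) {
        Set<String> hits =
            postings.getOrDefault(term.toLowerCase(), Collections.<String>emptySet());
        StringBuilder xml = new StringBuilder("<results>");
        for (String doc : hits) {
            xml.append("<result>").append(escape(doc)).append("</result>");
        }
        return xml.append("</results>").toString();
    }

    // Escape the characters that would break element content.
    private static String escape(String s) {
        return s.replace("&", "&amp;").replace("<", "&lt;").replace(">", "&gt;");
    }
}
```

Re-sending a file under the same name replaces its earlier entry, which is the incremental update behavior described above. A real engine such as Lucene also handles tokenization, ranking, and on-disk storage, all of which this sketch leaves out.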

The index builder service indexes any file that arrives on its entry endpoint. In this demo (to keep things simple), a FilePickup is used to pick up files from a /tmp/docs directory and send them to the index builder. In a complete application, there can be many modules (implemented as either custom services or standalone clients) that discover files to be indexed and send them to the index builder. Typically, such components are called spiders, and they follow links on the World Wide Web, locating and indexing documents. However, several other types of spiders can be built:

*File system spider that recursively traverses a directory and sends all files it encounters to the index builder

*E-mail spider that reads attachments from every incoming message and sends them to the index builder

*A database record spider that queries the database for records containing long text fields

*Many more
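As an illustration of the first spider type in the list, the sketch below recursively walks a directory and hands every regular file to a callback. The class name FileSystemSpider, the crawl method, and the sink parameter are hypothetical. In a real deployment the sink would post the file to the index builder's endpoint (for example, a JMS destination); here it is just a Consumer so the example stays self-contained.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.List;
import java.util.function.Consumer;
import java.util.stream.Collectors;
import java.util.stream.Stream;

// Hypothetical file system spider: finds files and forwards them to a sink.
public class FileSystemSpider {

    // Walks root recursively, passes each regular file to sink in sorted
    // order, and returns the number of files forwarded.
    public static int crawl(Path root, Consumer<Path> sink) throws IOException {
        List<Path> files;
        // Files.walk returns a lazy stream that must be closed after use.
        try (Stream<Path> paths = Files.walk(root)) {
            files = paths.filter(Files::isRegularFile)
                         .sorted()
                         .collect(Collectors.toList());
        }
        for (Path file : files) {
            sink.accept(file); // stand-in for "send to the index builder"
        }
        return files.size();
    }
}
```

A call such as FileSystemSpider.crawl(Paths.get("/tmp/docs"), path -> System.out.println(path)) would list every file the spider would forward. Swapping the lambda for code that publishes a message is what turns the sketch into a working spider.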

Basically, any program that can locate documents and post messages to a JMS destination can become a spider and contribute files to an index. Placing these spiders close to key public sources of information in the company or department creates a single point of access to all documents. Almost every organization has a wealth of useful information that few people know about or can locate. This service makes that information searchable, putting it right at the employee's fingertips.

Search Engine Service.doc

SearchEngine_03_24_2008.zip

All Replies

Posted by adaltas on 02-Apr-2008 09:45

Thank you! Your entry has been received and recorded and your rules copy moved.

This thread is closed