How to index SharePoint sites with the dtSearch Engine

Last Reviewed: July 8, 2017

Article: DTS0240

 

Applies to: dtSearch Engine versions 7.87 and later

The dtSearch Engine can index SharePoint sites in two ways: using the dtSearch Spider, and using the DataSource API.

The dtSearch Spider has a .NET API that you can use to implement web site crawling in your application.  For API documentation, please see:  dtSearch Spider API.  Using this API, you can have your application crawl SharePoint sites using HTTP.

For a more direct connection with one of the SharePoint APIs, you can use the dtSearch Engine's DataSource API.  This API lets you pass binary documents (Word, PDF, etc.) directly to the dtSearch Engine along with a set of field-value pairs that will be indexed with the document as metadata.  For information on the DataSource API, please see:

How to index databases with the dtSearch Engine

API Overview -- Indexing Databases

.NET DataSource API documentation

For sample C# code demonstrating how to use the DataSource API to connect with the SharePoint client API, see the C:\Program Files\dtSearch Developer\examples\cs4\SharePointDemo folder.  Sample code demonstrating how to index using the SharePoint server API is also available, in the codeproject.com article "dtSearch's DataSource API for indexing SharePoint Site Collections".