dtSearch Text Retrieval Engine .NET interface

HttpDataSource Members

HttpDataSource overview

Public Static Methods

SetDebugLogging Create a debug log recording the spider's actions

Public Instance Constructors

HttpDataSource Constructor Initializes a new instance of the HttpDataSource class.

Public Instance Fields

CacheFolder Location for temporary files saved by the spider.

Public Instance Properties

DocBytes (inherited from DataSource) Use DocBytes to provide an array of bytes for dtSearch to use as the binary contents of this document.
DocCreatedDate (inherited from DataSource) The date that the document was originally created.
DocDisplayName (inherited from DataSource) The DocDisplayName is a user-friendly version of the filename, which the dtSearch end-user product displays in search results.
DocError (inherited from DataSource) If WasDocError is true, DocError will contain a string providing details on the nature of the error.
DocFields (inherited from DataSource) In DocFields, supply any fielded data you want the dtSearch Engine to index.
DocId (inherited from DataSource) Each time GetNextDoc() is called, DocId will contain the doc id of the previous document.
DocIsFile (inherited from DataSource) If True, DocName will be interpreted as the name of a file to be indexed, and dtSearch will index the contents of the file along with any data provided in DocText and DocFields.
DocModifiedDate (inherited from DataSource) The date that the document was last modified.
DocName (inherited from DataSource) The DocName is the name of the document, as you want it to appear in search results.
DocText (inherited from DataSource) In DocText, supply the text you want the dtSearch Engine to index.
DocTypeId (inherited from DataSource) Each time GetNextDoc() is called, DocTypeId will return an integer identifying the file type of the previous document.
DocWordCount (inherited from DataSource) Each time GetNextDoc() is called, DocWordCount will contain the number of words in the previous document.
HaveDocBytes (inherited from DataSource) Set HaveDocBytes to true if there is binary document data to process in DocBytes
WasDocError (inherited from DataSource) Each time GetNextDoc() is called, WasDocError will be true if there was an error processing the

Public Instance Methods

Add Requests that the spider stop crawling web sites
Dispose 
GetNextDoc Get the next document from the data source.
GetStatus Get the status of the spider
IsCrawling Returns true if the spider is still downloading data
Rewind Initializes the spider so the next GetNextDoc call will return the first document.
StartCrawl Starts crawling web sites on a different thread.
StopCrawl 

See Also

HttpDataSource Class | dtSearch.Spider Namespace