SearchReportJob Class

Generates a report showing each hit in one or more documents, with a specified amount of context

dtSearch.Engine.JobBase | dtSearch.Engine.OutputBase | dtSearch.Engine.SearchReportJob

public class SearchReportJob : OutputBase;

Remarks

To generate a search report,

(1) Start with a SearchResults object representing the results of a search.

(2) Call SearchJob.NewSearchReportJob to make a SearchReportJob

(3) Select the items to include in the search report using the Select*() methods in SearchReportJob

(4) Specify the amount of context to include using WordsOfContextExact, ParagraphsOfContext, or WordsOfContext

(5) Set the output format for the report using ContextFooter, ContextHeader, etc.

(6) Call Execute() to generate the report

Format

A search report lists the hits found in one or more documents, with each hit surrounded by a specified amount of context. Each block of context starts with a ContextHeader and ends with the ContextFooter. Contiguous or overlapping blocks of context will be combined. The amount of context included in the report can be specified by words or by paragraphs.

Each block of context is constructed as follows:

[ContextHeader] ...text... [BeforeHit] hit [AfterHit] ...text... [ContextFooter]

The report as a whole is constructed as follows:

[Header] [FileHeader] [ContextHeader] ...text... [BeforeHit] hit [AfterHit] ...text... [ContextFooter] [ContextSeparator] [ContextHeader] ...text... [BeforeHit] hit [AfterHit] ...text... [ContextFooter] ... more blocks of context, if present [FileFooter] ... more files ... [Footer]

Use the following symbols to insert file information into the FileHeader and FileFooter:

Symbol	Meaning
Filename	The name of the file (without path information). For PDF and HTML files, this will be the Title.
Location	The location of the file
Fullname	The path and filename of the file.
Size	File size in bytes
SizeK	File size in kilobytes
Date	Modification date of the file when indexed
Hits	Number of hits in the file
Title	The first 80 characters of the file
DocId	The docId of the file
Type	The file type (Microsoft Word, PDF, HTML, etc.)
Ordinal	The 1-based ordinal of this item in the SearchResults from which it was generated
IndexRetrievedFrom	The index where the file was found

Use %% around each symbol, like this: %%FullName%%

Use the following symbols to insert context information in the ContextHeader, which appears in front of each block of context:

Symbol	Meaning
Page	Page number where the hit occurs
Paragraph	Paragraph number where the hit occurs (relative to the start of the page)
Word	Word offset of the block of context from the beginning of the file.
FirstHit	Word offset of the first hit in the block of context.

Adding Hits in Context to Search Results

You can use SearchReportJob to add a brief snippet of text to each SearchResults item showing a few hits with a limited amount of context around each hit. This "synopsis" can then be included in the displayed search results to make it easier for end-users to see why each document was found.

To add a synopsis to SearchResults,

1. Use MaxContextBlocks to limit the number of blocks of context included in the report. For example, if MaxContextBlocks = 1, then only the first hit will be included.

2. Use WordsOfContextExact to specify the number of words of context to included.

3. Set the OutputFormat to itUnformattedHTML, so output characters will be correctly HTML-encoded and formatting from the original document will not appear in the search results list. (If you use itHTML as the output format, the output could contain paragraph breaks, color changes, etc., that would not look right in a search results table.)

4. Set the dtsReportStoreInResults flag in SearchReportJob, which causes the synopsis to be stored in each search results item, making it easier to access the individual synopsis items.

5. Set the BeforeHit and AfterHit marks to HTML tags like and to mark the hits.

6. Select a range of items to include in the search report that corresponds to the range items to be displayed. For example, if you are displaying the first ten items, select items 0 through 9. Generating a synopsis can be time-consuming, so it is important to generate it only when needed for display.

Caching Text to Optimize SearchReportJob

Generation of a synopsis is much faster if you index the documents with caching of text enabled, because the context can be extracted from the index without the need to access the original files, and because the cached text includes tables designed to make context extraction more efficient.

Example

reportJob.SelectRange(startAt, endAt); reportJob.WordsOfContextExact = 10; reportJob.BeforeHit = ""; reportJob.AfterHit = ""; reportJob.MaxContextBlocks = 3; reportJob.ContextSeparator = " "; reportJob.ContextHeader = "..."; reportJob.ContextFooter = "..."; reportJob.SetOutputToString(512000); reportJob.Flags = ReportFlags.dtsReportStoreInResults | ReportFlags.dtsReportLimitContiguousContext | ReportFlags.dtsReportGetFromCache; reportJob.OutputFormat = it_UnformattedHTML); reportJob.Execute();

IDisposable

SearchReportJob requires the IDisposable Pattern.

Topics

Topic	Description
JobBase Members	The following tables list the members exposed by JobBase.
JobBase Methods	The methods of the JobBase class are listed here.
JobBase Properties	The properties of the JobBase class are listed here.

OutputBase Class

Topic	Description
OutputBase Members	The following tables list the members exposed by OutputBase.
OutputBase Properties	The properties of the OutputBase class are listed here.

SearchReportJob Class

Topic	Description
SearchReportJob Members	The following tables list the members exposed by SearchReportJob.
SearchReportJob Methods	The methods of the SearchReportJob class are listed here.
SearchReportJob Properties	The properties of the SearchReportJob class are listed here.

JobBase Methods

Show:Inherited

No members matching the current filter

JobBase Methods	Description
Failed	True if any errors occurred during execution of the job. Check the JobErrorInfo Errors object for details. (Inherited from JobBase)

SearchReportJob Class

SearchReportJob Class	Description
ClearSelections	Select no items in the SearchResults.
Execute	Generate the report.
SelectAll	Select all items in the SearchResults.
SelectItems	Select a range of items in the SearchResults.
SetResults	The search results list that this SearchReportJob will use.

JobBase Properties

Show:Inherited

No members matching the current filter

JobBase Properties	Description
Errors	Contains any errors that occurred during execution of the job. (Inherited from JobBase)
TimeoutSeconds	Set to a non-zero value to make the job terminate automatically after a specified number of seconds. (Inherited from JobBase)

OutputBase Class

Show:Inherited

No members matching the current filter

OutputBase Class	Description
AfterHit	If an array of hit offsets has been provided in Hits, then the BeforeHit and AfterHit strings will be used to mark each hit in the document in the converted output (Inherited from OutputBase)
BaseHRef	For HTML output, an HREF for a BASE tag to be inserted in the header. (Inherited from OutputBase)
BeforeHit	If an array of hit offsets has been provided in Hits, then the BeforeHit and AfterHit strings will be used to mark each hit in the document in the converted output (Inherited from OutputBase)
DocTypeTag	For HTML output, a DocType tag such as <!DOCTYPE html>to go before the first tag in the output. (Inherited from OutputBase)
Footer	The Footer will be appended to the conversion output and can use tags in the output format, such as HTML tags in a document converted to HTML. (Inherited from OutputBase)
Header	The Header will appear at the top of the conversion output and can use tags in the output format, such as HTML tags in a document converted to HTML. (Inherited from OutputBase)
HtmlHead	Use HtmlHead to supply HTML data to appear inside the HEAD section of the output. (Inherited from OutputBase)
OutputFile	Name of the converted file to create. (Inherited from OutputBase)
OutputFormat	By default, a FileConverter converts the input file to HTML. Other supported options are: itRTF, itUTF8 (Unicode text), itAnsi, and itXML (for XML input data only). (Inherited from OutputBase)
OutputString	If OutputToString is true, output will be stored in OutputString rather than in a disk file. (Inherited from OutputBase)
OutputStringMaxSize	When output is directed to an in-memory string, you may wish to limit the maximum amount of memory used. To do this, set OutputStringMaxSize to the maximum size you want to allow. (Inherited from OutputBase)
OutputToString	If true, output will be stored in an in-memory string variable rather than a disk file. (OutputFile will be ignored.) After the Execute method is done, the output will be in the OutputString property. (Inherited from OutputBase)
WasTruncated	The output was truncated because of the OutputStringMaxSize setting. (Inherited from OutputBase)

SearchReportJob Class

SearchReportJob Class	Description
ContextFooter	Text to appear after each block of context in the report.
ContextHeader	Text to appear at the start of each block of context in the report.
ContextSeparator	Text to appear between blocks of context in the report (after one ContextFooter, before the next ContextHeader)
FileFooter	Text to appear after each document in the report.
FileHeader	Text to appear at the start of each document in the report.
Flags	Flags controlling generation of the report.
MaxContextBlocks	Number of blocks of context to include in the report for each document.
MaxWordsToRead	Number of words to scan in each document looking for blocks of context to include in the report.
ParagraphsOfContext	Number of paragraphs of context to include around each hit.
WordsOfContext	Approximate number of words of context to include around each hit.
WordsOfContextExact	Number of words of context to include around each hit.