dtSearch Text Retrieval Engine Programmer's Reference
Removing documents from an index

How to remove documents from an index

Remarks

There are two ways to remove documents from an index: 

(1) You can pass a list of documents to be removed from the index in an index job. The list is a plain text file with one item per line, where each item is either a filename or a DocId. A filename must exactly match the path and name associated with a document in the index. A DocId must be preceded by > to indicate that it is not a filename. Examples: 

c:\docs\filename.txt 

>45 

This list specifies that the document "c:\docs\filename.txt" and the document with DocId 45 should be removed from the index. 
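
The list format described above can be produced with a few lines of code. The following is a minimal sketch in Java, not part of the dtSearch API: the class RemoveListWriter and its methods are hypothetical names, and UTF-8 is an assumed encoding, since this topic does not state which encoding the list file should use.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper for illustration only; not a dtSearch class.
class RemoveListWriter {

    // Builds the lines of a remove list: filenames are written as-is,
    // DocIds are prefixed with '>' so they are not read as filenames.
    static List<String> buildLines(List<String> filenames, List<Integer> docIds) {
        List<String> lines = new ArrayList<>();
        for (String name : filenames) {
            lines.add(name);
        }
        for (int id : docIds) {
            lines.add(">" + id);
        }
        return lines;
    }

    // Writes the list as a plain text file, one item per line.
    // Assumption: UTF-8 encoding (not specified in this topic).
    static void write(Path target, List<String> filenames, List<Integer> docIds)
            throws IOException {
        Files.write(target, buildLines(filenames, docIds), StandardCharsets.UTF_8);
    }

    public static void main(String[] args) throws IOException {
        Path listFile = Files.createTempFile("remove-list", ".txt");
        write(listFile, List.of("c:\\docs\\filename.txt"), List.of(45));
        // Show the file contents that would be handed to the index job.
        System.out.print(Files.readString(listFile));
    }
}
```

The resulting file would then be supplied to an index job that has its "remove listed" action enabled (see the API table below). Because filenames must match the indexed path and name exactly, the sketch writes them unchanged rather than normalizing case or separators.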

(2) You can set the "Remove Deleted" flag in an index job. With this flag set, each document in the index is checked, and any document whose corresponding disk file no longer exists is removed from the index.
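
To preview which documents such a check would affect, the same test can be applied to a list of paths outside the engine. The sketch below is an illustration of the rule described above, not the engine's implementation: the class RemoveDeletedPreview is a hypothetical name, and the list of indexed paths is assumed to come from wherever the application tracks them.

```java
import java.io.File;
import java.io.IOException;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper for illustration only; not a dtSearch class.
class RemoveDeletedPreview {

    // Returns the paths whose corresponding disk file does not exist,
    // preserving the order of the input list.
    static List<String> findMissing(List<String> indexedPaths) {
        List<String> missing = new ArrayList<>();
        for (String path : indexedPaths) {
            if (!new File(path).exists()) {
                missing.add(path);
            }
        }
        return missing;
    }

    public static void main(String[] args) throws IOException {
        // One file that exists and one path that does not.
        File present = File.createTempFile("indexed-doc", ".txt");
        present.deleteOnExit();
        File absent = new File(present.getParentFile(), "no-such-document.txt");

        List<String> missing =
                findMissing(List.of(present.getPath(), absent.getPath()));
        System.out.println(missing);
    }
}
```

The returned paths could also be written to a remove list and passed to an index job as in method (1), which may be useful when deletions should be reviewed before they are applied.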

API
Language                   API
C/C++                      DIndexJob or dtsIndexJob, set action.removeListed or action.removeDeleted = true
.NET (C#, VB.NET)          dtSearch.Engine.IndexJob, set ActionRemoveListed or ActionRemoveDeleted = true
Java                       com.dtsearch.engine.IndexJob, setActionRemoveListed(true) or setActionRemoveDeleted(true)
COM (Visual Basic, ASP)    IIndexJob (IndexJob) object, set ActionRemoveListed or ActionRemoveDeleted = true
Copyright (c) 1995-2008 dtSearch Corp. All rights reserved.