3.1.1.3 Portal Content Project

The crawler application contains exactly one instance of the portal content project.  This object’s properties control the index server behavior and state during crawls for all types of content source except for the user profile content source.

extension: A list of file name extensions recognized by the index server. The index server parses all links discovered during the crawl and extracts the file extension from them. The file extension is checked against the extension list. Depending on the setting of isExtensionIncludeList property the link is discarded or added to the crawl queue.

isExtensionIncludeList: A Boolean value that specifies whether the list of file extensions in extension is an inclusion list or an exclusion list.  If true, extension is an inclusion list; otherwise, extension is an exclusion list.

contentSourcesMetadata: A custom metadata string associated with the portal content project. The value of this property is ignored by the protocol server, but can be interpreted by the protocol client to associate arbitrary metadata with the collection of content sources.

clientCertificateNames: The list of names of X.509 client certificates used by crawl rules with authentication type of 3.

activeDocuments: The list of URLs of items currently being crawled by the index server.

pauseReason: A 32-bit number that contains the set of reasons why the search catalog is paused. Each bit indicates a particular reason. If there is more than one reason why the search catalog was paused, multiple bits corresponding to the reasons are turned on. The search catalog is resumed when all bits are turned off. MUST only turn on the following bits:

Value

Meaning

0x00000001

Paused because of certain unspecified internal event not initiated by the protocol client.

0x00000002

Paused because of certain unspecified internal event not initiated by the protocol client.

0x00000004

Paused for backup.

0x00000008

Paused for query server initialization.

0x00000010

Paused because of an unresponsive query server.