Builder

class Builder

Properties

Configuration information required to connect to websites using authentication.

The 'depth' or number of levels from the seed level to crawl. For example, the seed URL page is depth 1 and any hyperlinks on this page that are also crawled are depth 2.
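
To make the depth numbering concrete, here is a minimal sketch of a breadth-first walk that labels the seed page as depth 1 and its hyperlinks as depth 2, as described above. It is an illustration only, not the service's crawler: the function `pagesWithinDepth`, the `links` map, and the `maxDepth` parameter are hypothetical names, and the link graph is held in memory rather than fetched.

```kotlin
// Hypothetical helper, not part of the SDK: labels each reachable page with
// its depth, where the seed page is depth 1.
fun pagesWithinDepth(
    seed: String,
    links: Map<String, List<String>>, // assumed in-memory link graph: page -> hyperlinks on it
    maxDepth: Int,
): Map<String, Int> {
    val depthOf = linkedMapOf<String, Int>()
    if (maxDepth < 1) return depthOf

    depthOf[seed] = 1
    val queue = ArrayDeque(listOf(seed))
    while (queue.isNotEmpty()) {
        val page = queue.removeFirst()
        val depth = depthOf.getValue(page)
        // Pages at the depth limit are kept, but their links are not followed.
        if (depth == maxDepth) continue
        for (next in links[page].orEmpty()) {
            if (next !in depthOf) {
                depthOf[next] = depth + 1
                queue.addLast(next)
            }
        }
    }
    return depthOf
}
```

With a seed that links to two pages, one of which links to a third, a `maxDepth` of 2 yields the seed at depth 1 and the two linked pages at depth 2; the third page would be depth 3 and is left out.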

The maximum size (in MB) of a web page or attachment to crawl.

The maximum number of URLs on a web page to include when crawling a website. This number is per web page.

The maximum number of URLs crawled per website host per minute.

Configuration information required to connect to your internal websites via a web proxy.

A list of regular expression patterns for URLs to exclude from the crawl. URLs that match the patterns are excluded from the index. URLs that don't match the patterns are included in the index. If a URL matches both an inclusion and an exclusion pattern, the exclusion pattern takes precedence and the URL isn't included in the index.

A list of regular expression patterns for URLs to include in the crawl. URLs that match the patterns are included in the index. URLs that don't match the patterns are excluded from the index. If a URL matches both an inclusion and an exclusion pattern, the exclusion pattern takes precedence and the URL isn't included in the index.
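
The precedence rule for inclusion and exclusion patterns can be sketched as a small predicate. This is a model of the documented behavior, not the service's implementation. The function name `isIncludedInIndex` is hypothetical, and two details are assumptions that the text above does not settle: a pattern counts as matching if it matches anywhere in the URL, and an empty inclusion list lets every non-excluded URL through.

```kotlin
// Hypothetical helper, not part of the SDK: models the documented precedence
// between inclusion and exclusion patterns.
fun isIncludedInIndex(
    url: String,
    inclusionPatterns: List<String>,
    exclusionPatterns: List<String>,
): Boolean {
    // Exclusion takes precedence: any matching exclusion pattern rejects the URL.
    // Assumption: a pattern matches if it is found anywhere in the URL.
    if (exclusionPatterns.any { Regex(it).containsMatchIn(url) }) return false

    // Assumption: with no inclusion patterns, every non-excluded URL is kept.
    if (inclusionPatterns.isEmpty()) return true

    // Otherwise the URL must match at least one inclusion pattern.
    return inclusionPatterns.any { Regex(it).containsMatchIn(url) }
}
```

For example, with the inclusion pattern `.*/docs/.*` and the exclusion pattern `.*\.pdf$`, the URL `https://example.com/docs/a.html` is kept, while `https://example.com/docs/a.pdf` matches both lists and is rejected because exclusion wins.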

var urls: Urls?

Specifies the seed or starting point URLs of the websites or the sitemap URLs of the websites you want to crawl.

Functions