Interface SeedUrlConfiguration.Builder
- All Superinterfaces:
Buildable,CopyableBuilder<SeedUrlConfiguration.Builder,,SeedUrlConfiguration> SdkBuilder<SeedUrlConfiguration.Builder,,SeedUrlConfiguration> SdkPojo
- Enclosing class:
SeedUrlConfiguration
-
Method Summary
Modifier and TypeMethodDescriptionThe list of seed or starting point URLs of the websites you want to crawl.seedUrls(Collection<String> seedUrls) The list of seed or starting point URLs of the websites you want to crawl.webCrawlerMode(String webCrawlerMode) You can choose one of the following modes:webCrawlerMode(WebCrawlerMode webCrawlerMode) You can choose one of the following modes:Methods inherited from interface software.amazon.awssdk.utils.builder.CopyableBuilder
copyMethods inherited from interface software.amazon.awssdk.utils.builder.SdkBuilder
applyMutation, buildMethods inherited from interface software.amazon.awssdk.core.SdkPojo
equalsBySdkFields, sdkFieldNameToField, sdkFields
-
Method Details
-
seedUrls
The list of seed or starting point URLs of the websites you want to crawl.
The list can include a maximum of 100 seed URLs.
- Parameters:
seedUrls- The list of seed or starting point URLs of the websites you want to crawl.The list can include a maximum of 100 seed URLs.
- Returns:
- Returns a reference to this object so that method calls can be chained together.
-
seedUrls
The list of seed or starting point URLs of the websites you want to crawl.
The list can include a maximum of 100 seed URLs.
- Parameters:
seedUrls- The list of seed or starting point URLs of the websites you want to crawl.The list can include a maximum of 100 seed URLs.
- Returns:
- Returns a reference to this object so that method calls can be chained together.
-
webCrawlerMode
You can choose one of the following modes:
-
HOST_ONLY—crawl only the website host names. For example, if the seed URL is "abc.example.com", then only URLs with host name "abc.example.com" are crawled. -
SUBDOMAINS—crawl the website host names with subdomains. For example, if the seed URL is "abc.example.com", then "a.abc.example.com" and "b.abc.example.com" are also crawled. -
EVERYTHING—crawl the website host names with subdomains and other domains that the web pages link to.
The default mode is set to
HOST_ONLY.- Parameters:
webCrawlerMode- You can choose one of the following modes:-
HOST_ONLY—crawl only the website host names. For example, if the seed URL is "abc.example.com", then only URLs with host name "abc.example.com" are crawled. -
SUBDOMAINS—crawl the website host names with subdomains. For example, if the seed URL is "abc.example.com", then "a.abc.example.com" and "b.abc.example.com" are also crawled. -
EVERYTHING—crawl the website host names with subdomains and other domains that the web pages link to.
The default mode is set to
HOST_ONLY.-
- Returns:
- Returns a reference to this object so that method calls can be chained together.
- See Also:
-
-
webCrawlerMode
You can choose one of the following modes:
-
HOST_ONLY—crawl only the website host names. For example, if the seed URL is "abc.example.com", then only URLs with host name "abc.example.com" are crawled. -
SUBDOMAINS—crawl the website host names with subdomains. For example, if the seed URL is "abc.example.com", then "a.abc.example.com" and "b.abc.example.com" are also crawled. -
EVERYTHING—crawl the website host names with subdomains and other domains that the web pages link to.
The default mode is set to
HOST_ONLY.- Parameters:
webCrawlerMode- You can choose one of the following modes:-
HOST_ONLY—crawl only the website host names. For example, if the seed URL is "abc.example.com", then only URLs with host name "abc.example.com" are crawled. -
SUBDOMAINS—crawl the website host names with subdomains. For example, if the seed URL is "abc.example.com", then "a.abc.example.com" and "b.abc.example.com" are also crawled. -
EVERYTHING—crawl the website host names with subdomains and other domains that the web pages link to.
The default mode is set to
HOST_ONLY.-
- Returns:
- Returns a reference to this object so that method calls can be chained together.
- See Also:
-
-