Interface S3ExcelSource.Builder
- All Superinterfaces:
Buildable
,CopyableBuilder<S3ExcelSource.Builder,
,S3ExcelSource> SdkBuilder<S3ExcelSource.Builder,
,S3ExcelSource> SdkPojo
- Enclosing class:
S3ExcelSource
-
Method Summary
Modifier and TypeMethodDescriptiondefault S3ExcelSource.Builder
additionalOptions
(Consumer<S3DirectSourceAdditionalOptions.Builder> additionalOptions) Additional configuration options for S3 direct source processing.additionalOptions
(S3DirectSourceAdditionalOptions additionalOptions) Additional configuration options for S3 direct source processing.compressionType
(String compressionType) The compression format used for the Excel files.compressionType
(ParquetCompressionType compressionType) The compression format used for the Excel files.exclusions
(String... exclusions) Patterns to exclude specific files or paths from processing.exclusions
(Collection<String> exclusions) Patterns to exclude specific files or paths from processing.groupFiles
(String groupFiles) Specifies how files should be grouped for processing.Defines the size of file groups for batch processing.The maximum number of processing bands to use.maxFilesInBand
(Integer maxFilesInBand) The maximum number of files to process in each band.The name of the S3 Excel data source.numberRows
(Long numberRows) The number of rows to process from each Excel file.outputSchemas
(Collection<GlueSchema> outputSchemas) The AWS Glue schemas to apply to the processed data.outputSchemas
(Consumer<GlueSchema.Builder>... outputSchemas) The AWS Glue schemas to apply to the processed data.outputSchemas
(GlueSchema... outputSchemas) The AWS Glue schemas to apply to the processed data.The S3 paths where the Excel files are located.paths
(Collection<String> paths) The S3 paths where the Excel files are located.Indicates whether to recursively process subdirectories.skipFooter
(Integer skipFooter) The number of rows to skip at the end of each Excel file.Methods inherited from interface software.amazon.awssdk.utils.builder.CopyableBuilder
copy
Methods inherited from interface software.amazon.awssdk.utils.builder.SdkBuilder
applyMutation, build
Methods inherited from interface software.amazon.awssdk.core.SdkPojo
equalsBySdkFields, sdkFieldNameToField, sdkFields
-
Method Details
-
name
The name of the S3 Excel data source.
- Parameters:
name
- The name of the S3 Excel data source.- Returns:
- Returns a reference to this object so that method calls can be chained together.
-
paths
The S3 paths where the Excel files are located.
- Parameters:
paths
- The S3 paths where the Excel files are located.- Returns:
- Returns a reference to this object so that method calls can be chained together.
-
paths
The S3 paths where the Excel files are located.
- Parameters:
paths
- The S3 paths where the Excel files are located.- Returns:
- Returns a reference to this object so that method calls can be chained together.
-
compressionType
The compression format used for the Excel files.
- Parameters:
compressionType
- The compression format used for the Excel files.- Returns:
- Returns a reference to this object so that method calls can be chained together.
- See Also:
-
compressionType
The compression format used for the Excel files.
- Parameters:
compressionType
- The compression format used for the Excel files.- Returns:
- Returns a reference to this object so that method calls can be chained together.
- See Also:
-
exclusions
Patterns to exclude specific files or paths from processing.
- Parameters:
exclusions
- Patterns to exclude specific files or paths from processing.- Returns:
- Returns a reference to this object so that method calls can be chained together.
-
exclusions
Patterns to exclude specific files or paths from processing.
- Parameters:
exclusions
- Patterns to exclude specific files or paths from processing.- Returns:
- Returns a reference to this object so that method calls can be chained together.
-
groupSize
Defines the size of file groups for batch processing.
- Parameters:
groupSize
- Defines the size of file groups for batch processing.- Returns:
- Returns a reference to this object so that method calls can be chained together.
-
groupFiles
Specifies how files should be grouped for processing.
- Parameters:
groupFiles
- Specifies how files should be grouped for processing.- Returns:
- Returns a reference to this object so that method calls can be chained together.
-
recurse
Indicates whether to recursively process subdirectories.
- Parameters:
recurse
- Indicates whether to recursively process subdirectories.- Returns:
- Returns a reference to this object so that method calls can be chained together.
-
maxBand
The maximum number of processing bands to use.
- Parameters:
maxBand
- The maximum number of processing bands to use.- Returns:
- Returns a reference to this object so that method calls can be chained together.
-
maxFilesInBand
The maximum number of files to process in each band.
- Parameters:
maxFilesInBand
- The maximum number of files to process in each band.- Returns:
- Returns a reference to this object so that method calls can be chained together.
-
additionalOptions
Additional configuration options for S3 direct source processing.
- Parameters:
additionalOptions
- Additional configuration options for S3 direct source processing.- Returns:
- Returns a reference to this object so that method calls can be chained together.
-
additionalOptions
default S3ExcelSource.Builder additionalOptions(Consumer<S3DirectSourceAdditionalOptions.Builder> additionalOptions) Additional configuration options for S3 direct source processing.
This is a convenience method that creates an instance of theS3DirectSourceAdditionalOptions.Builder
avoiding the need to create one manually viaS3DirectSourceAdditionalOptions.builder()
.When the
Consumer
completes,SdkBuilder.build()
is called immediately and its result is passed toadditionalOptions(S3DirectSourceAdditionalOptions)
.- Parameters:
additionalOptions
- a consumer that will call methods onS3DirectSourceAdditionalOptions.Builder
- Returns:
- Returns a reference to this object so that method calls can be chained together.
- See Also:
-
numberRows
The number of rows to process from each Excel file.
- Parameters:
numberRows
- The number of rows to process from each Excel file.- Returns:
- Returns a reference to this object so that method calls can be chained together.
-
outputSchemas
The AWS Glue schemas to apply to the processed data.
- Parameters:
outputSchemas
- The AWS Glue schemas to apply to the processed data.- Returns:
- Returns a reference to this object so that method calls can be chained together.
-
outputSchemas
The AWS Glue schemas to apply to the processed data.
- Parameters:
outputSchemas
- The AWS Glue schemas to apply to the processed data.- Returns:
- Returns a reference to this object so that method calls can be chained together.
-
outputSchemas
The AWS Glue schemas to apply to the processed data.
This is a convenience method that creates an instance of theGlueSchema.Builder
avoiding the need to create one manually viaGlueSchema.builder()
.When the
Consumer
completes,SdkBuilder.build()
is called immediately and its result is passed tooutputSchemas(List<GlueSchema>)
.- Parameters:
outputSchemas
- a consumer that will call methods onGlueSchema.Builder
- Returns:
- Returns a reference to this object so that method calls can be chained together.
- See Also:
-