Interface S3ExcelSource.Builder

  • Method Details

    • name

      The name of the S3 Excel data source.

      Parameters:
      name - The name of the S3 Excel data source.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • paths

      The S3 paths where the Excel files are located.

      Parameters:
      paths - The S3 paths where the Excel files are located.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • paths

      S3ExcelSource.Builder paths(String... paths)

      The S3 paths where the Excel files are located.

      Parameters:
      paths - The S3 paths where the Excel files are located.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • compressionType

      S3ExcelSource.Builder compressionType(String compressionType)

      The compression format used for the Excel files.

      Parameters:
      compressionType - The compression format used for the Excel files.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
      See Also:
    • compressionType

      S3ExcelSource.Builder compressionType(ParquetCompressionType compressionType)

      The compression format used for the Excel files.

      Parameters:
      compressionType - The compression format used for the Excel files.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
      See Also:
    • exclusions

      S3ExcelSource.Builder exclusions(Collection<String> exclusions)

      Patterns to exclude specific files or paths from processing.

      Parameters:
      exclusions - Patterns to exclude specific files or paths from processing.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • exclusions

      S3ExcelSource.Builder exclusions(String... exclusions)

      Patterns to exclude specific files or paths from processing.

      Parameters:
      exclusions - Patterns to exclude specific files or paths from processing.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • groupSize

      S3ExcelSource.Builder groupSize(String groupSize)

      Defines the size of file groups for batch processing.

      Parameters:
      groupSize - Defines the size of file groups for batch processing.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • groupFiles

      S3ExcelSource.Builder groupFiles(String groupFiles)

      Specifies how files should be grouped for processing.

      Parameters:
      groupFiles - Specifies how files should be grouped for processing.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • recurse

      S3ExcelSource.Builder recurse(Boolean recurse)

      Indicates whether to recursively process subdirectories.

      Parameters:
      recurse - Indicates whether to recursively process subdirectories.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • maxBand

      S3ExcelSource.Builder maxBand(Integer maxBand)

      The maximum number of processing bands to use.

      Parameters:
      maxBand - The maximum number of processing bands to use.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • maxFilesInBand

      S3ExcelSource.Builder maxFilesInBand(Integer maxFilesInBand)

      The maximum number of files to process in each band.

      Parameters:
      maxFilesInBand - The maximum number of files to process in each band.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • additionalOptions

      S3ExcelSource.Builder additionalOptions(S3DirectSourceAdditionalOptions additionalOptions)

      Additional configuration options for S3 direct source processing.

      Parameters:
      additionalOptions - Additional configuration options for S3 direct source processing.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • additionalOptions

      default S3ExcelSource.Builder additionalOptions(Consumer<S3DirectSourceAdditionalOptions.Builder> additionalOptions)

      Additional configuration options for S3 direct source processing.

      This is a convenience method that creates an instance of the S3DirectSourceAdditionalOptions.Builder avoiding the need to create one manually via S3DirectSourceAdditionalOptions.builder().

      When the Consumer completes, SdkBuilder.build() is called immediately and its result is passed to additionalOptions(S3DirectSourceAdditionalOptions).

      Parameters:
      additionalOptions - a consumer that will call methods on S3DirectSourceAdditionalOptions.Builder
      Returns:
      Returns a reference to this object so that method calls can be chained together.
      See Also:
    • numberRows

      S3ExcelSource.Builder numberRows(Long numberRows)

      The number of rows to process from each Excel file.

      Parameters:
      numberRows - The number of rows to process from each Excel file.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • skipFooter

      S3ExcelSource.Builder skipFooter(Integer skipFooter)

      The number of rows to skip at the end of each Excel file.

      Parameters:
      skipFooter - The number of rows to skip at the end of each Excel file.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • outputSchemas

      S3ExcelSource.Builder outputSchemas(Collection<GlueSchema> outputSchemas)

      The AWS Glue schemas to apply to the processed data.

      Parameters:
      outputSchemas - The AWS Glue schemas to apply to the processed data.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • outputSchemas

      S3ExcelSource.Builder outputSchemas(GlueSchema... outputSchemas)

      The AWS Glue schemas to apply to the processed data.

      Parameters:
      outputSchemas - The AWS Glue schemas to apply to the processed data.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • outputSchemas

      S3ExcelSource.Builder outputSchemas(Consumer<GlueSchema.Builder>... outputSchemas)

      The AWS Glue schemas to apply to the processed data.

      This is a convenience method that creates an instance of the GlueSchema.Builder avoiding the need to create one manually via GlueSchema.builder().

      When the Consumer completes, SdkBuilder.build() is called immediately and its result is passed to outputSchemas(List<GlueSchema>).

      Parameters:
      outputSchemas - a consumer that will call methods on GlueSchema.Builder
      Returns:
      Returns a reference to this object so that method calls can be chained together.
      See Also: