Class ParquetSerDe
- All Implemented Interfaces:
Serializable
,SdkPojo
,ToCopyableBuilder<ParquetSerDe.Builder,
ParquetSerDe>
A serializer to use for converting data to the Parquet format before storing it in Amazon S3. For more information, see Apache Parquet.
- See Also:
-
Nested Class Summary
Nested Classes -
Method Summary
Modifier and TypeMethodDescriptionfinal Integer
The Hadoop Distributed File System (HDFS) block size.static ParquetSerDe.Builder
builder()
final ParquetCompression
The compression code to use over data blocks.final String
The compression code to use over data blocks.final Boolean
Indicates whether to enable dictionary compression.final boolean
final boolean
equalsBySdkFields
(Object obj) Indicates whether some other object is "equal to" this one by SDK fields.final <T> Optional
<T> getValueForField
(String fieldName, Class<T> clazz) final int
hashCode()
final Integer
The maximum amount of padding to apply.final Integer
The Parquet page size.static Class
<? extends ParquetSerDe.Builder> Take this object and create a builder that contains all of the current property values of this object.final String
toString()
Returns a string representation of this object.final ParquetWriterVersion
Indicates the version of row format to output.final String
Indicates the version of row format to output.Methods inherited from interface software.amazon.awssdk.utils.builder.ToCopyableBuilder
copy
-
Method Details
-
blockSizeBytes
The Hadoop Distributed File System (HDFS) block size. This is useful if you intend to copy the data from Amazon S3 to HDFS before querying. The default is 256 MiB and the minimum is 64 MiB. Firehose uses this value for padding calculations.
- Returns:
- The Hadoop Distributed File System (HDFS) block size. This is useful if you intend to copy the data from Amazon S3 to HDFS before querying. The default is 256 MiB and the minimum is 64 MiB. Firehose uses this value for padding calculations.
-
pageSizeBytes
The Parquet page size. Column chunks are divided into pages. A page is conceptually an indivisible unit (in terms of compression and encoding). The minimum value is 64 KiB and the default is 1 MiB.
- Returns:
- The Parquet page size. Column chunks are divided into pages. A page is conceptually an indivisible unit (in terms of compression and encoding). The minimum value is 64 KiB and the default is 1 MiB.
-
compression
The compression code to use over data blocks. The possible values are
UNCOMPRESSED
,SNAPPY
, andGZIP
, with the default beingSNAPPY
. UseSNAPPY
for higher decompression speed. UseGZIP
if the compression ratio is more important than speed.If the service returns an enum value that is not available in the current SDK version,
compression
will returnParquetCompression.UNKNOWN_TO_SDK_VERSION
. The raw value returned by the service is available fromcompressionAsString()
.- Returns:
- The compression code to use over data blocks. The possible values are
UNCOMPRESSED
,SNAPPY
, andGZIP
, with the default beingSNAPPY
. UseSNAPPY
for higher decompression speed. UseGZIP
if the compression ratio is more important than speed. - See Also:
-
compressionAsString
The compression code to use over data blocks. The possible values are
UNCOMPRESSED
,SNAPPY
, andGZIP
, with the default beingSNAPPY
. UseSNAPPY
for higher decompression speed. UseGZIP
if the compression ratio is more important than speed.If the service returns an enum value that is not available in the current SDK version,
compression
will returnParquetCompression.UNKNOWN_TO_SDK_VERSION
. The raw value returned by the service is available fromcompressionAsString()
.- Returns:
- The compression code to use over data blocks. The possible values are
UNCOMPRESSED
,SNAPPY
, andGZIP
, with the default beingSNAPPY
. UseSNAPPY
for higher decompression speed. UseGZIP
if the compression ratio is more important than speed. - See Also:
-
enableDictionaryCompression
Indicates whether to enable dictionary compression.
- Returns:
- Indicates whether to enable dictionary compression.
-
maxPaddingBytes
The maximum amount of padding to apply. This is useful if you intend to copy the data from Amazon S3 to HDFS before querying. The default is 0.
- Returns:
- The maximum amount of padding to apply. This is useful if you intend to copy the data from Amazon S3 to HDFS before querying. The default is 0.
-
writerVersion
Indicates the version of row format to output. The possible values are
V1
andV2
. The default isV1
.If the service returns an enum value that is not available in the current SDK version,
writerVersion
will returnParquetWriterVersion.UNKNOWN_TO_SDK_VERSION
. The raw value returned by the service is available fromwriterVersionAsString()
.- Returns:
- Indicates the version of row format to output. The possible values are
V1
andV2
. The default isV1
. - See Also:
-
writerVersionAsString
Indicates the version of row format to output. The possible values are
V1
andV2
. The default isV1
.If the service returns an enum value that is not available in the current SDK version,
writerVersion
will returnParquetWriterVersion.UNKNOWN_TO_SDK_VERSION
. The raw value returned by the service is available fromwriterVersionAsString()
.- Returns:
- Indicates the version of row format to output. The possible values are
V1
andV2
. The default isV1
. - See Also:
-
toBuilder
Description copied from interface:ToCopyableBuilder
Take this object and create a builder that contains all of the current property values of this object.- Specified by:
toBuilder
in interfaceToCopyableBuilder<ParquetSerDe.Builder,
ParquetSerDe> - Returns:
- a builder for type T
-
builder
-
serializableBuilderClass
-
hashCode
-
equals
-
equalsBySdkFields
Description copied from interface:SdkPojo
Indicates whether some other object is "equal to" this one by SDK fields. An SDK field is a modeled, non-inherited field in anSdkPojo
class, and is generated based on a service model.If an
SdkPojo
class does not have any inherited fields,equalsBySdkFields
andequals
are essentially the same.- Specified by:
equalsBySdkFields
in interfaceSdkPojo
- Parameters:
obj
- the object to be compared with- Returns:
- true if the other object equals to this object by sdk fields, false otherwise.
-
toString
-
getValueForField
-
sdkFields
-
sdkFieldNameToField
- Specified by:
sdkFieldNameToField
in interfaceSdkPojo
- Returns:
- The mapping between the field name and its corresponding field.
-