Class JobUpdate
- All Implemented Interfaces:
Serializable,SdkPojo,ToCopyableBuilder<JobUpdate.Builder,JobUpdate>
Specifies information used to update an existing job definition. The previous job definition is completely overwritten by this information.
- See Also:
-
Nested Class Summary
Nested Classes -
Method Summary
Modifier and TypeMethodDescriptionfinal IntegerDeprecated.This property is deprecated, use MaxCapacity instead.static JobUpdate.Builderbuilder()final Map<String, CodeGenConfigurationNode> The representation of a directed acyclic graph on which both the Glue Studio visual component and Glue Studio code generation is based.final JobCommandcommand()TheJobCommandthat runs this job (required).final ConnectionsListThe connections used for this job.The default arguments for every run of this job, specified as name-value pairs.final StringDescription of the job being defined.final booleanfinal booleanequalsBySdkFields(Object obj) Indicates whether some other object is "equal to" this one by SDK fields.final ExecutionClassIndicates whether the job is run with a standard or flexible execution class.final StringIndicates whether the job is run with a standard or flexible execution class.final ExecutionPropertyAnExecutionPropertyspecifying the maximum number of concurrent runs allowed for this job.final <T> Optional<T> getValueForField(String fieldName, Class<T> clazz) final StringIn Spark jobs,GlueVersiondetermines the versions of Apache Spark and Python that Glue available in a job.final booleanFor responses, this returns true if the service returned a value for the CodeGenConfigurationNodes property.final booleanFor responses, this returns true if the service returned a value for the DefaultArguments property.final inthashCode()final booleanFor responses, this returns true if the service returned a value for the NonOverridableArguments property.final JobModejobMode()A mode that describes how a job was created.final StringA mode that describes how a job was created.final BooleanSpecifies whether job run queuing is enabled for the job runs for this job.final StringlogUri()This field is reserved for future use.final StringThis field specifies a day of the week and hour for a maintenance window for streaming jobs.final DoubleFor Glue version 1.0 or earlier jobs, using the standard worker type, the number of Glue data processing units (DPUs) that can be allocated when this job runs.final IntegerThe maximum number of times to retry this job if it fails.Arguments for this job that are not overridden when providing job arguments in a job run, specified as name-value pairs.final NotificationPropertySpecifies the configuration properties of a job notification.final IntegerThe number of workers of a definedworkerTypethat are allocated when a job runs.final Stringrole()The name or Amazon Resource Name (ARN) of the IAM role associated with this job (required).final StringThe name of theSecurityConfigurationstructure to be used with this job.static Class<? extends JobUpdate.Builder> final SourceControlDetailsThe details for a source control configuration for a job, allowing synchronization of job artifacts to or from a remote repository.final Integertimeout()The job timeout in minutes.Take this object and create a builder that contains all of the current property values of this object.final StringtoString()Returns a string representation of this object.final WorkerTypeThe type of predefined worker that is allocated when a job runs.final StringThe type of predefined worker that is allocated when a job runs.Methods inherited from interface software.amazon.awssdk.utils.builder.ToCopyableBuilder
copy
-
Method Details
-
jobMode
A mode that describes how a job was created. Valid values are:
-
SCRIPT- The job was created using the Glue Studio script editor. -
VISUAL- The job was created using the Glue Studio visual editor. -
NOTEBOOK- The job was created using an interactive sessions notebook.
When the
JobModefield is missing or null,SCRIPTis assigned as the default value.If the service returns an enum value that is not available in the current SDK version,
jobModewill returnJobMode.UNKNOWN_TO_SDK_VERSION. The raw value returned by the service is available fromjobModeAsString().- Returns:
- A mode that describes how a job was created. Valid values are:
-
SCRIPT- The job was created using the Glue Studio script editor. -
VISUAL- The job was created using the Glue Studio visual editor. -
NOTEBOOK- The job was created using an interactive sessions notebook.
When the
JobModefield is missing or null,SCRIPTis assigned as the default value. -
- See Also:
-
-
jobModeAsString
A mode that describes how a job was created. Valid values are:
-
SCRIPT- The job was created using the Glue Studio script editor. -
VISUAL- The job was created using the Glue Studio visual editor. -
NOTEBOOK- The job was created using an interactive sessions notebook.
When the
JobModefield is missing or null,SCRIPTis assigned as the default value.If the service returns an enum value that is not available in the current SDK version,
jobModewill returnJobMode.UNKNOWN_TO_SDK_VERSION. The raw value returned by the service is available fromjobModeAsString().- Returns:
- A mode that describes how a job was created. Valid values are:
-
SCRIPT- The job was created using the Glue Studio script editor. -
VISUAL- The job was created using the Glue Studio visual editor. -
NOTEBOOK- The job was created using an interactive sessions notebook.
When the
JobModefield is missing or null,SCRIPTis assigned as the default value. -
- See Also:
-
-
jobRunQueuingEnabled
Specifies whether job run queuing is enabled for the job runs for this job.
A value of true means job run queuing is enabled for the job runs. If false or not populated, the job runs will not be considered for queueing.
If this field does not match the value set in the job run, then the value from the job run field will be used.
- Returns:
- Specifies whether job run queuing is enabled for the job runs for this job.
A value of true means job run queuing is enabled for the job runs. If false or not populated, the job runs will not be considered for queueing.
If this field does not match the value set in the job run, then the value from the job run field will be used.
-
description
Description of the job being defined.
- Returns:
- Description of the job being defined.
-
logUri
This field is reserved for future use.
- Returns:
- This field is reserved for future use.
-
role
The name or Amazon Resource Name (ARN) of the IAM role associated with this job (required).
- Returns:
- The name or Amazon Resource Name (ARN) of the IAM role associated with this job (required).
-
executionProperty
An
ExecutionPropertyspecifying the maximum number of concurrent runs allowed for this job.- Returns:
- An
ExecutionPropertyspecifying the maximum number of concurrent runs allowed for this job.
-
command
The
JobCommandthat runs this job (required).- Returns:
- The
JobCommandthat runs this job (required).
-
hasDefaultArguments
public final boolean hasDefaultArguments()For responses, this returns true if the service returned a value for the DefaultArguments property. This DOES NOT check that the value is non-empty (for which, you should check theisEmpty()method on the property). This is useful because the SDK will never return a null collection or map, but you may need to differentiate between the service returning nothing (or null) and the service returning an empty collection or map. For requests, this returns true if a value for the property was specified in the request builder, and false if a value was not specified. -
defaultArguments
The default arguments for every run of this job, specified as name-value pairs.
You can specify arguments here that your own job-execution script consumes, as well as arguments that Glue itself consumes.
Job arguments may be logged. Do not pass plaintext secrets as arguments. Retrieve secrets from a Glue Connection, Secrets Manager or other secret management mechanism if you intend to keep them within the Job.
For information about how to specify and consume your own Job arguments, see the Calling Glue APIs in Python topic in the developer guide.
For information about the arguments you can provide to this field when configuring Spark jobs, see the Special Parameters Used by Glue topic in the developer guide.
For information about the arguments you can provide to this field when configuring Ray jobs, see Using job parameters in Ray jobs in the developer guide.
Attempts to modify the collection returned by this method will result in an UnsupportedOperationException.
This method will never return null. If you would like to know whether the service returned this field (so that you can differentiate between null and empty), you can use the
hasDefaultArguments()method.- Returns:
- The default arguments for every run of this job, specified as name-value pairs.
You can specify arguments here that your own job-execution script consumes, as well as arguments that Glue itself consumes.
Job arguments may be logged. Do not pass plaintext secrets as arguments. Retrieve secrets from a Glue Connection, Secrets Manager or other secret management mechanism if you intend to keep them within the Job.
For information about how to specify and consume your own Job arguments, see the Calling Glue APIs in Python topic in the developer guide.
For information about the arguments you can provide to this field when configuring Spark jobs, see the Special Parameters Used by Glue topic in the developer guide.
For information about the arguments you can provide to this field when configuring Ray jobs, see Using job parameters in Ray jobs in the developer guide.
-
hasNonOverridableArguments
public final boolean hasNonOverridableArguments()For responses, this returns true if the service returned a value for the NonOverridableArguments property. This DOES NOT check that the value is non-empty (for which, you should check theisEmpty()method on the property). This is useful because the SDK will never return a null collection or map, but you may need to differentiate between the service returning nothing (or null) and the service returning an empty collection or map. For requests, this returns true if a value for the property was specified in the request builder, and false if a value was not specified. -
nonOverridableArguments
Arguments for this job that are not overridden when providing job arguments in a job run, specified as name-value pairs.
Attempts to modify the collection returned by this method will result in an UnsupportedOperationException.
This method will never return null. If you would like to know whether the service returned this field (so that you can differentiate between null and empty), you can use the
hasNonOverridableArguments()method.- Returns:
- Arguments for this job that are not overridden when providing job arguments in a job run, specified as name-value pairs.
-
connections
The connections used for this job.
- Returns:
- The connections used for this job.
-
maxRetries
The maximum number of times to retry this job if it fails.
- Returns:
- The maximum number of times to retry this job if it fails.
-
allocatedCapacity
Deprecated.This property is deprecated, use MaxCapacity instead.This field is deprecated. Use
MaxCapacityinstead.The number of Glue data processing units (DPUs) to allocate to this job. You can allocate a minimum of 2 DPUs; the default is 10. A DPU is a relative measure of processing power that consists of 4 vCPUs of compute capacity and 16 GB of memory. For more information, see the Glue pricing page.
- Returns:
- This field is deprecated. Use
MaxCapacityinstead.The number of Glue data processing units (DPUs) to allocate to this job. You can allocate a minimum of 2 DPUs; the default is 10. A DPU is a relative measure of processing power that consists of 4 vCPUs of compute capacity and 16 GB of memory. For more information, see the Glue pricing page.
-
timeout
The job timeout in minutes. This is the maximum time that a job run can consume resources before it is terminated and enters
TIMEOUTstatus.Jobs must have timeout values less than 7 days or 10080 minutes. Otherwise, the jobs will throw an exception.
When the value is left blank, the timeout is defaulted to 2880 minutes.
Any existing Glue jobs that had a timeout value greater than 7 days will be defaulted to 7 days. For instance if you have specified a timeout of 20 days for a batch job, it will be stopped on the 7th day.
For streaming jobs, if you have set up a maintenance window, it will be restarted during the maintenance window after 7 days.
- Returns:
- The job timeout in minutes. This is the maximum time that a job run can consume resources before it is
terminated and enters
TIMEOUTstatus.Jobs must have timeout values less than 7 days or 10080 minutes. Otherwise, the jobs will throw an exception.
When the value is left blank, the timeout is defaulted to 2880 minutes.
Any existing Glue jobs that had a timeout value greater than 7 days will be defaulted to 7 days. For instance if you have specified a timeout of 20 days for a batch job, it will be stopped on the 7th day.
For streaming jobs, if you have set up a maintenance window, it will be restarted during the maintenance window after 7 days.
-
maxCapacity
For Glue version 1.0 or earlier jobs, using the standard worker type, the number of Glue data processing units (DPUs) that can be allocated when this job runs. A DPU is a relative measure of processing power that consists of 4 vCPUs of compute capacity and 16 GB of memory. For more information, see the Glue pricing page.
For Glue version 2.0+ jobs, you cannot specify a
Maximum capacity. Instead, you should specify aWorker typeand theNumber of workers.Do not set
MaxCapacityif usingWorkerTypeandNumberOfWorkers.The value that can be allocated for
MaxCapacitydepends on whether you are running a Python shell job, an Apache Spark ETL job, or an Apache Spark streaming ETL job:-
When you specify a Python shell job (
JobCommand.Name="pythonshell"), you can allocate either 0.0625 or 1 DPU. The default is 0.0625 DPU. -
When you specify an Apache Spark ETL job (
JobCommand.Name="glueetl") or Apache Spark streaming ETL job (JobCommand.Name="gluestreaming"), you can allocate from 2 to 100 DPUs. The default is 10 DPUs. This job type cannot have a fractional DPU allocation.
- Returns:
- For Glue version 1.0 or earlier jobs, using the standard worker type, the number of Glue data processing
units (DPUs) that can be allocated when this job runs. A DPU is a relative measure of processing power
that consists of 4 vCPUs of compute capacity and 16 GB of memory. For more information, see the Glue pricing page.
For Glue version 2.0+ jobs, you cannot specify a
Maximum capacity. Instead, you should specify aWorker typeand theNumber of workers.Do not set
MaxCapacityif usingWorkerTypeandNumberOfWorkers.The value that can be allocated for
MaxCapacitydepends on whether you are running a Python shell job, an Apache Spark ETL job, or an Apache Spark streaming ETL job:-
When you specify a Python shell job (
JobCommand.Name="pythonshell"), you can allocate either 0.0625 or 1 DPU. The default is 0.0625 DPU. -
When you specify an Apache Spark ETL job (
JobCommand.Name="glueetl") or Apache Spark streaming ETL job (JobCommand.Name="gluestreaming"), you can allocate from 2 to 100 DPUs. The default is 10 DPUs. This job type cannot have a fractional DPU allocation.
-
-
-
workerType
The type of predefined worker that is allocated when a job runs. Accepts a value of G.1X, G.2X, G.4X, G.8X or G.025X for Spark jobs. Accepts the value Z.2X for Ray jobs. For more information, see Defining job properties for Spark jobs
If the service returns an enum value that is not available in the current SDK version,
workerTypewill returnWorkerType.UNKNOWN_TO_SDK_VERSION. The raw value returned by the service is available fromworkerTypeAsString().- Returns:
- The type of predefined worker that is allocated when a job runs. Accepts a value of G.1X, G.2X, G.4X, G.8X or G.025X for Spark jobs. Accepts the value Z.2X for Ray jobs. For more information, see Defining job properties for Spark jobs
- See Also:
-
workerTypeAsString
The type of predefined worker that is allocated when a job runs. Accepts a value of G.1X, G.2X, G.4X, G.8X or G.025X for Spark jobs. Accepts the value Z.2X for Ray jobs. For more information, see Defining job properties for Spark jobs
If the service returns an enum value that is not available in the current SDK version,
workerTypewill returnWorkerType.UNKNOWN_TO_SDK_VERSION. The raw value returned by the service is available fromworkerTypeAsString().- Returns:
- The type of predefined worker that is allocated when a job runs. Accepts a value of G.1X, G.2X, G.4X, G.8X or G.025X for Spark jobs. Accepts the value Z.2X for Ray jobs. For more information, see Defining job properties for Spark jobs
- See Also:
-
numberOfWorkers
The number of workers of a defined
workerTypethat are allocated when a job runs.- Returns:
- The number of workers of a defined
workerTypethat are allocated when a job runs.
-
securityConfiguration
The name of the
SecurityConfigurationstructure to be used with this job.- Returns:
- The name of the
SecurityConfigurationstructure to be used with this job.
-
notificationProperty
Specifies the configuration properties of a job notification.
- Returns:
- Specifies the configuration properties of a job notification.
-
glueVersion
In Spark jobs,
GlueVersiondetermines the versions of Apache Spark and Python that Glue available in a job. The Python version indicates the version supported for jobs of type Spark.Ray jobs should set
GlueVersionto4.0or greater. However, the versions of Ray, Python and additional libraries available in your Ray job are determined by theRuntimeparameter of the Job command.For more information about the available Glue versions and corresponding Spark and Python versions, see Glue version in the developer guide.
Jobs that are created without specifying a Glue version default to Glue 0.9.
- Returns:
- In Spark jobs,
GlueVersiondetermines the versions of Apache Spark and Python that Glue available in a job. The Python version indicates the version supported for jobs of type Spark.Ray jobs should set
GlueVersionto4.0or greater. However, the versions of Ray, Python and additional libraries available in your Ray job are determined by theRuntimeparameter of the Job command.For more information about the available Glue versions and corresponding Spark and Python versions, see Glue version in the developer guide.
Jobs that are created without specifying a Glue version default to Glue 0.9.
-
hasCodeGenConfigurationNodes
public final boolean hasCodeGenConfigurationNodes()For responses, this returns true if the service returned a value for the CodeGenConfigurationNodes property. This DOES NOT check that the value is non-empty (for which, you should check theisEmpty()method on the property). This is useful because the SDK will never return a null collection or map, but you may need to differentiate between the service returning nothing (or null) and the service returning an empty collection or map. For requests, this returns true if a value for the property was specified in the request builder, and false if a value was not specified. -
codeGenConfigurationNodes
The representation of a directed acyclic graph on which both the Glue Studio visual component and Glue Studio code generation is based.
Attempts to modify the collection returned by this method will result in an UnsupportedOperationException.
This method will never return null. If you would like to know whether the service returned this field (so that you can differentiate between null and empty), you can use the
hasCodeGenConfigurationNodes()method.- Returns:
- The representation of a directed acyclic graph on which both the Glue Studio visual component and Glue Studio code generation is based.
-
executionClass
Indicates whether the job is run with a standard or flexible execution class. The standard execution-class is ideal for time-sensitive workloads that require fast job startup and dedicated resources.
The flexible execution class is appropriate for time-insensitive jobs whose start and completion times may vary.
Only jobs with Glue version 3.0 and above and command type
glueetlwill be allowed to setExecutionClasstoFLEX. The flexible execution class is available for Spark jobs.If the service returns an enum value that is not available in the current SDK version,
executionClasswill returnExecutionClass.UNKNOWN_TO_SDK_VERSION. The raw value returned by the service is available fromexecutionClassAsString().- Returns:
- Indicates whether the job is run with a standard or flexible execution class. The standard
execution-class is ideal for time-sensitive workloads that require fast job startup and dedicated
resources.
The flexible execution class is appropriate for time-insensitive jobs whose start and completion times may vary.
Only jobs with Glue version 3.0 and above and command type
glueetlwill be allowed to setExecutionClasstoFLEX. The flexible execution class is available for Spark jobs. - See Also:
-
executionClassAsString
Indicates whether the job is run with a standard or flexible execution class. The standard execution-class is ideal for time-sensitive workloads that require fast job startup and dedicated resources.
The flexible execution class is appropriate for time-insensitive jobs whose start and completion times may vary.
Only jobs with Glue version 3.0 and above and command type
glueetlwill be allowed to setExecutionClasstoFLEX. The flexible execution class is available for Spark jobs.If the service returns an enum value that is not available in the current SDK version,
executionClasswill returnExecutionClass.UNKNOWN_TO_SDK_VERSION. The raw value returned by the service is available fromexecutionClassAsString().- Returns:
- Indicates whether the job is run with a standard or flexible execution class. The standard
execution-class is ideal for time-sensitive workloads that require fast job startup and dedicated
resources.
The flexible execution class is appropriate for time-insensitive jobs whose start and completion times may vary.
Only jobs with Glue version 3.0 and above and command type
glueetlwill be allowed to setExecutionClasstoFLEX. The flexible execution class is available for Spark jobs. - See Also:
-
sourceControlDetails
The details for a source control configuration for a job, allowing synchronization of job artifacts to or from a remote repository.
- Returns:
- The details for a source control configuration for a job, allowing synchronization of job artifacts to or from a remote repository.
-
maintenanceWindow
This field specifies a day of the week and hour for a maintenance window for streaming jobs. Glue periodically performs maintenance activities. During these maintenance windows, Glue will need to restart your streaming jobs.
Glue will restart the job within 3 hours of the specified maintenance window. For instance, if you set up the maintenance window for Monday at 10:00AM GMT, your jobs will be restarted between 10:00AM GMT to 1:00PM GMT.
- Returns:
- This field specifies a day of the week and hour for a maintenance window for streaming jobs. Glue
periodically performs maintenance activities. During these maintenance windows, Glue will need to restart
your streaming jobs.
Glue will restart the job within 3 hours of the specified maintenance window. For instance, if you set up the maintenance window for Monday at 10:00AM GMT, your jobs will be restarted between 10:00AM GMT to 1:00PM GMT.
-
toBuilder
Description copied from interface:ToCopyableBuilderTake this object and create a builder that contains all of the current property values of this object.- Specified by:
toBuilderin interfaceToCopyableBuilder<JobUpdate.Builder,JobUpdate> - Returns:
- a builder for type T
-
builder
-
serializableBuilderClass
-
hashCode
-
equals
-
equalsBySdkFields
Description copied from interface:SdkPojoIndicates whether some other object is "equal to" this one by SDK fields. An SDK field is a modeled, non-inherited field in anSdkPojoclass, and is generated based on a service model.If an
SdkPojoclass does not have any inherited fields,equalsBySdkFieldsandequalsare essentially the same.- Specified by:
equalsBySdkFieldsin interfaceSdkPojo- Parameters:
obj- the object to be compared with- Returns:
- true if the other object equals to this object by sdk fields, false otherwise.
-
toString
-
getValueForField
-
sdkFields
-
sdkFieldNameToField
- Specified by:
sdkFieldNameToFieldin interfaceSdkPojo- Returns:
- The mapping between the field name and its corresponding field.
-