Class InferenceComponentComputeResourceRequirements
- All Implemented Interfaces:
Serializable
,SdkPojo
,ToCopyableBuilder<InferenceComponentComputeResourceRequirements.Builder,
InferenceComponentComputeResourceRequirements>
Defines the compute resources to allocate to run a model, plus any adapter models, that you assign to an inference component. These resources include CPU cores, accelerators, and memory.
- See Also:
-
Nested Class Summary
Nested ClassesModifier and TypeClassDescriptionstatic interface
-
Method Summary
Modifier and TypeMethodDescriptionbuilder()
final boolean
final boolean
equalsBySdkFields
(Object obj) Indicates whether some other object is "equal to" this one by SDK fields.final <T> Optional
<T> getValueForField
(String fieldName, Class<T> clazz) final int
hashCode()
final Integer
The maximum MB of memory to allocate to run a model that you assign to an inference component.final Integer
The minimum MB of memory to allocate to run a model that you assign to an inference component.final Float
The number of accelerators to allocate to run a model that you assign to an inference component.final Float
The number of CPU cores to allocate to run a model that you assign to an inference component.static Class
<? extends InferenceComponentComputeResourceRequirements.Builder> Take this object and create a builder that contains all of the current property values of this object.final String
toString()
Returns a string representation of this object.Methods inherited from interface software.amazon.awssdk.utils.builder.ToCopyableBuilder
copy
-
Method Details
-
numberOfCpuCoresRequired
The number of CPU cores to allocate to run a model that you assign to an inference component.
- Returns:
- The number of CPU cores to allocate to run a model that you assign to an inference component.
-
numberOfAcceleratorDevicesRequired
The number of accelerators to allocate to run a model that you assign to an inference component. Accelerators include GPUs and Amazon Web Services Inferentia.
- Returns:
- The number of accelerators to allocate to run a model that you assign to an inference component. Accelerators include GPUs and Amazon Web Services Inferentia.
-
minMemoryRequiredInMb
The minimum MB of memory to allocate to run a model that you assign to an inference component.
- Returns:
- The minimum MB of memory to allocate to run a model that you assign to an inference component.
-
maxMemoryRequiredInMb
The maximum MB of memory to allocate to run a model that you assign to an inference component.
- Returns:
- The maximum MB of memory to allocate to run a model that you assign to an inference component.
-
toBuilder
Description copied from interface:ToCopyableBuilder
Take this object and create a builder that contains all of the current property values of this object.- Specified by:
toBuilder
in interfaceToCopyableBuilder<InferenceComponentComputeResourceRequirements.Builder,
InferenceComponentComputeResourceRequirements> - Returns:
- a builder for type T
-
builder
-
serializableBuilderClass
public static Class<? extends InferenceComponentComputeResourceRequirements.Builder> serializableBuilderClass() -
hashCode
-
equals
-
equalsBySdkFields
Description copied from interface:SdkPojo
Indicates whether some other object is "equal to" this one by SDK fields. An SDK field is a modeled, non-inherited field in anSdkPojo
class, and is generated based on a service model.If an
SdkPojo
class does not have any inherited fields,equalsBySdkFields
andequals
are essentially the same.- Specified by:
equalsBySdkFields
in interfaceSdkPojo
- Parameters:
obj
- the object to be compared with- Returns:
- true if the other object equals to this object by sdk fields, false otherwise.
-
toString
-
getValueForField
-
sdkFields
-
sdkFieldNameToField
- Specified by:
sdkFieldNameToField
in interfaceSdkPojo
- Returns:
- The mapping between the field name and its corresponding field.
-