Class RFTHyperParameters
- All Implemented Interfaces:
Serializable,SdkPojo,ToCopyableBuilder<RFTHyperParameters.Builder,RFTHyperParameters>
Hyperparameters for controlling the reinforcement fine-tuning training process, including learning settings and evaluation intervals.
- See Also:
-
Nested Class Summary
Nested Classes -
Method Summary
Modifier and TypeMethodDescriptionfinal IntegerNumber of training samples processed in each batch during reinforcement fine-tuning (RFT) training.static RFTHyperParameters.Builderbuilder()final IntegerNumber of training epochs to run during reinforcement fine-tuning.final booleanfinal booleanequalsBySdkFields(Object obj) Indicates whether some other object is "equal to" this one by SDK fields.final IntegerInterval between evaluation runs during RFT training, measured in training steps.final <T> Optional<T> getValueForField(String fieldName, Class<T> clazz) final inthashCode()final IntegerMaximum number of tokens the model can generate in response to each prompt during RFT training.final FloatLearning rate for the reinforcement fine-tuning.final IntegerMaximum length of input prompts during RFT training, measured in tokens.final ReasoningEffortLevel of reasoning effort applied during RFT training.final StringLevel of reasoning effort applied during RFT training.static Class<? extends RFTHyperParameters.Builder> Take this object and create a builder that contains all of the current property values of this object.final StringtoString()Returns a string representation of this object.final IntegerNumber of response samples generated per prompt during RFT training.Methods inherited from interface software.amazon.awssdk.utils.builder.ToCopyableBuilder
copy
-
Method Details
-
epochCount
Number of training epochs to run during reinforcement fine-tuning. Higher values may improve performance but increase training time.
- Returns:
- Number of training epochs to run during reinforcement fine-tuning. Higher values may improve performance but increase training time.
-
batchSize
Number of training samples processed in each batch during reinforcement fine-tuning (RFT) training. Larger batches may improve training stability.
- Returns:
- Number of training samples processed in each batch during reinforcement fine-tuning (RFT) training. Larger batches may improve training stability.
-
learningRate
Learning rate for the reinforcement fine-tuning. Controls how quickly the model adapts to reward signals.
- Returns:
- Learning rate for the reinforcement fine-tuning. Controls how quickly the model adapts to reward signals.
-
maxPromptLength
Maximum length of input prompts during RFT training, measured in tokens. Longer prompts allow more context but increase memory usage and training-time.
- Returns:
- Maximum length of input prompts during RFT training, measured in tokens. Longer prompts allow more context but increase memory usage and training-time.
-
trainingSamplePerPrompt
Number of response samples generated per prompt during RFT training. More samples provide better reward signal estimation.
- Returns:
- Number of response samples generated per prompt during RFT training. More samples provide better reward signal estimation.
-
inferenceMaxTokens
Maximum number of tokens the model can generate in response to each prompt during RFT training.
- Returns:
- Maximum number of tokens the model can generate in response to each prompt during RFT training.
-
reasoningEffort
Level of reasoning effort applied during RFT training. Higher values may improve response quality but increase training time.
If the service returns an enum value that is not available in the current SDK version,
reasoningEffortwill returnReasoningEffort.UNKNOWN_TO_SDK_VERSION. The raw value returned by the service is available fromreasoningEffortAsString().- Returns:
- Level of reasoning effort applied during RFT training. Higher values may improve response quality but increase training time.
- See Also:
-
reasoningEffortAsString
Level of reasoning effort applied during RFT training. Higher values may improve response quality but increase training time.
If the service returns an enum value that is not available in the current SDK version,
reasoningEffortwill returnReasoningEffort.UNKNOWN_TO_SDK_VERSION. The raw value returned by the service is available fromreasoningEffortAsString().- Returns:
- Level of reasoning effort applied during RFT training. Higher values may improve response quality but increase training time.
- See Also:
-
evalInterval
Interval between evaluation runs during RFT training, measured in training steps. More frequent evaluation provides better monitoring.
- Returns:
- Interval between evaluation runs during RFT training, measured in training steps. More frequent evaluation provides better monitoring.
-
toBuilder
Description copied from interface:ToCopyableBuilderTake this object and create a builder that contains all of the current property values of this object.- Specified by:
toBuilderin interfaceToCopyableBuilder<RFTHyperParameters.Builder,RFTHyperParameters> - Returns:
- a builder for type T
-
builder
-
serializableBuilderClass
-
hashCode
-
equals
-
equalsBySdkFields
Description copied from interface:SdkPojoIndicates whether some other object is "equal to" this one by SDK fields. An SDK field is a modeled, non-inherited field in anSdkPojoclass, and is generated based on a service model.If an
SdkPojoclass does not have any inherited fields,equalsBySdkFieldsandequalsare essentially the same.- Specified by:
equalsBySdkFieldsin interfaceSdkPojo- Parameters:
obj- the object to be compared with- Returns:
- true if the other object equals to this object by sdk fields, false otherwise.
-
toString
-
getValueForField
-
sdkFields
-
sdkFieldNameToField
- Specified by:
sdkFieldNameToFieldin interfaceSdkPojo- Returns:
- The mapping between the field name and its corresponding field.
-