Specifies the endpoint capacity type.
Defines the capacity size, either as a number of inference component copies or a capacity percentage.