Settings for distilling a foundation model into a smaller and more efficient model.
The teacher model configuration: settings for the larger model whose outputs the smaller (student) model is trained to imitate.
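As a rough illustration of how these two settings objects might nest, here is a minimal Python sketch. The source does not list any fields, so every class and field name below (`DistillationConfig`, `TeacherModelConfig`, `model_id`, `max_response_tokens`) is a hypothetical placeholder rather than the actual schema.

```python
from dataclasses import dataclass, asdict


@dataclass(frozen=True)
class TeacherModelConfig:
    """Hypothetical teacher model configuration (field names are assumptions)."""

    # Assumed field: identifier of the larger model to distill from.
    model_id: str
    # Assumed field: upper bound on tokens the teacher generates per response.
    max_response_tokens: int = 1000

    def __post_init__(self):
        # Reject obviously invalid values early.
        if not self.model_id:
            raise ValueError("model_id must be non-empty")
        if self.max_response_tokens <= 0:
            raise ValueError("max_response_tokens must be positive")


@dataclass(frozen=True)
class DistillationConfig:
    """Hypothetical distillation settings that wrap the teacher configuration."""

    teacher_model_config: TeacherModelConfig


def to_dict(config: DistillationConfig) -> dict:
    # asdict recurses into nested dataclasses, producing a plain nested dict.
    return asdict(config)


if __name__ == "__main__":
    cfg = DistillationConfig(TeacherModelConfig(model_id="example-teacher"))
    print(to_dict(cfg))
```

Keeping the teacher settings in their own object mirrors the nesting the descriptions imply: the distillation settings own a single teacher configuration, which can be validated separately.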