My Fine-Tuned Model

This model was fine-tuned with LoRA adapters and Direct Preference Optimization (DPO) on two datasets: dockerNLcommands and Human-Like-DPO-Dataset.
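For readers unfamiliar with DPO, the following is a minimal sketch of the standard per-example DPO loss, written in plain Python. It is not the training code used for this model. The function name `dpo_loss`, its argument names, and the default `beta=0.1` are illustrative assumptions, since the training hyperparameters are not stated here.

```python
import math

def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Per-example DPO loss: -log(sigmoid(beta * (chosen_gain - rejected_gain))).

    Each argument is the summed log-probability of a full response under
    either the policy being trained or the frozen reference model.
    beta=0.1 is an assumed value, not one taken from this model.
    """
    # How much more the policy likes each response than the reference does.
    chosen_gain = policy_chosen_logp - ref_chosen_logp
    rejected_gain = policy_rejected_logp - ref_rejected_logp
    margin = beta * (chosen_gain - rejected_gain)
    # Numerically stable form of -log(sigmoid(margin)).
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))

# Hypothetical log-probabilities, for illustration only.
loss = dpo_loss(policy_chosen_logp=-1.0, policy_rejected_logp=-3.0,
                ref_chosen_logp=-2.0, ref_rejected_logp=-2.0)
```

The loss falls below log(2) once the policy prefers the chosen response more strongly than the reference model does, which is the signal DPO optimizes. With LoRA, only the small low-rank adapter matrices receive these gradients while the base weights stay frozen.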

Generation Settings

Ranges (min to max): 64 to 512, 0 to 1.5, 0.1 to 1.
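The three numeric ranges in this section carry no labels. They resemble the bounds commonly used for maximum new tokens, temperature, and top-p, but that mapping is an assumption. Under that assumption, the sketch below shows how temperature and top-p (nucleus) sampling would pick one token from a list of logits. The function `sample_token` and its defaults are hypothetical and not part of this model's code.

```python
import math
import random

def sample_token(logits, temperature=0.7, top_p=0.9, rng=random):
    """Pick a token index using temperature scaling and top-p filtering.

    Assumed parameter meanings: temperature in [0, 1.5], top_p in [0.1, 1].
    """
    # Temperature 0 is treated as greedy decoding to avoid dividing by zero.
    if temperature <= 0:
        return max(range(len(logits)), key=lambda i: logits[i])

    # Softmax over temperature-scaled logits (shifted by the max for stability).
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    probs = [e / total for e in exps]

    # Keep the smallest set of most likely tokens whose mass reaches top_p.
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept, cumulative = [], 0.0
    for i in order:
        kept.append(i)
        cumulative += probs[i]
        if cumulative >= top_p:
            break

    # Sample among the kept tokens in proportion to their probabilities.
    weights = [probs[i] for i in kept]
    return rng.choices(kept, weights=weights, k=1)[0]
```

Lower temperature and lower top-p both narrow the choice toward the most likely tokens, while values near the upper ends of the ranges allow more varied output.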
Examples