tenseleyflow/dlm-vsc / b1d46a6


Replace preference section with instruction for SFT-only training

Authored by mfwolffe <wolffemf@dukes.jmu.edu>
SHA
b1d46a60e9162fd56ad68681907ea0bb10098984
Parents
3dc4225
Tree
b1f9f21

1 changed file

M  test/fixtures/sample.dlm  +5 −8
@@ -27,12 +27,9 @@ How do you train a DLM?
 ### A
 Run `dlm train your-file.dlm` and the adapter trains on the document content.
 
-::preference::
-### Prompt
-Explain LoRA in one sentence.
-
-### Chosen
-LoRA adds small trainable matrices to frozen model layers, enabling efficient fine-tuning.
+::instruction::
+### Q
+What is LoRA?
 
-### Rejected
-LoRA is a method for training language models that involves modifying the architecture of the model by introducing additional parameters in the form of low-rank decomposition matrices that are applied to the attention weight matrices, which allows for parameter-efficient fine-tuning while keeping the original pre-trained weights frozen.
+### A
+LoRA adds small trainable matrices to frozen model layers, enabling efficient fine-tuning without modifying the full model weights.
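For context, the `::instruction::` block this commit introduces pairs a `### Q` heading with a `### A` heading, which is the shape an SFT-only trainer would consume as a prompt/response pair. A minimal sketch of how such a block might be parsed (the actual `dlm train` parser is not part of this commit; the function name and regexes here are assumptions for illustration):

```python
import re

def parse_instruction_blocks(text):
    """Hypothetical parser: extract (prompt, response) pairs from
    ::instruction:: blocks like the one in test/fixtures/sample.dlm."""
    pairs = []
    # Everything after each ::instruction:: marker is one candidate block.
    for block in text.split("::instruction::")[1:]:
        # The question runs from "### Q" to the first blank line.
        q = re.search(r"### Q\n(.*?)\n\n", block, re.S)
        # The answer runs from "### A" to the next blank line or end of block.
        a = re.search(r"### A\n(.*?)(?:\n\n|\n?$)", block, re.S)
        if q and a:
            pairs.append((q.group(1).strip(), a.group(1).strip()))
    return pairs

sample = """::instruction::
### Q
What is LoRA?

### A
LoRA adds small trainable matrices to frozen model layers.
"""
print(parse_instruction_blocks(sample))
```

A preference-format parser would instead need three fields (`### Prompt`, `### Chosen`, `### Rejected`), which is exactly the structure this commit removes from the fixture.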