verl GRPO trained model at step 100
base_model: thejaminator/qwen-hook-layer-9-posneg-merged
library_name: peft
tags:
  - lora
  - peft
pipeline_tag: text-generation
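
The metadata above describes a LoRA adapter (`library_name: peft`) trained on top of `thejaminator/qwen-hook-layer-9-posneg-merged` for text generation. A minimal sketch of how such an adapter is typically loaded follows. It assumes the base model loads as a causal language model through `transformers`; `ADAPTER_PATH` is a hypothetical placeholder, since the adapter's own repo id or local path is not stated here.

```python
# Base model id taken from the `base_model` field in the metadata.
BASE_MODEL = "thejaminator/qwen-hook-layer-9-posneg-merged"
# Hypothetical placeholder: replace with this adapter's repo id or a local directory.
ADAPTER_PATH = "path/to/this-adapter"


def load_adapter(adapter_path: str = ADAPTER_PATH, base_model: str = BASE_MODEL):
    """Load the base model, then attach the LoRA adapter weights on top of it."""
    # Imported lazily so this module can be imported without the heavy dependencies.
    from peft import PeftModel
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(base_model)
    # Assumption: the base checkpoint is a causal LM, as `pipeline_tag` suggests.
    model = AutoModelForCausalLM.from_pretrained(base_model)
    model = PeftModel.from_pretrained(model, adapter_path)
    return model, tokenizer


def generate(prompt: str, max_new_tokens: int = 64) -> str:
    """Generate a completion for `prompt` using the adapted model."""
    model, tokenizer = load_adapter()
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate("Hello")` downloads the base model on first use, so it needs network access plus `torch`, `transformers`, and `peft` installed.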