Model save

Files changed (7)

README.md CHANGED

@@ -16,10 +16,10 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.3](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.3) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: -299806972635812416.0000
-- Ndcg: 0.9561
-- Ndcg@25: 0.5733
-- Precision@25: 0.4457
+- Loss: -240602318374661568.0000
+- Ndcg: 0.9563
+- Ndcg@25: 0.4422
+- Precision@25: 0.2214
 ## Model description
@@ -51,15 +51,15 @@ The following hyperparameters were used during training:
 ### Training results
-| Training Loss            | Epoch  | Step | Validation Loss          | Ndcg   | Ndcg@25 | Precision@25 |
-|:------------------------:|:------:|:----:|:------------------------:|:------:|:-------:|:------------:|
-| -937807332288285952.0000 | 1.0    | 44   | -74029561088243888.0000  | 0.9555 | 0.1932  | 0.0400       |
-| -92957537119449904.0000  | 1.9711 | 86   | -299806972635812416.0000 | 0.9561 | 0.5733  | 0.4457       |
+| Training Loss             | Epoch  | Step | Validation Loss          | Ndcg   | Ndcg@25 | Precision@25 |
+|:-------------------------:|:------:|:----:|:------------------------:|:------:|:-------:|:------------:|
+| -2330796205703744512.0000 | 1.0    | 44   | -92728505831320080.0000  | 0.9556 | 0.2014  | 0.0400       |
+| -98429772131152688.0000   | 1.9711 | 86   | -240602318374661568.0000 | 0.9563 | 0.4422  | 0.2214       |
 ### Framework versions
 - Transformers 4.49.0
-- Pytorch 2.6.0
+- Pytorch 2.7.1+cu126
 - Datasets 3.6.0
 - Tokenizers 0.21.1
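For readers comparing the Ndcg@25 and Precision@25 rows in this diff, the following is a minimal sketch of the textbook definitions of these metrics with binary relevance labels. It is an assumption that the run used these definitions; the diff does not show the metric code, and the function names here are hypothetical.

```python
import math


def precision_at_k(ranked_relevance, k):
    """Fraction of the top-k positions that hold a relevant item.

    `ranked_relevance` lists relevance labels in the order the model
    ranked the items (1 = relevant, 0 = not relevant).
    """
    top = ranked_relevance[:k]
    return sum(1 for rel in top if rel > 0) / k


def dcg_at_k(ranked_relevance, k):
    """Discounted cumulative gain over the top-k positions."""
    return sum(
        rel / math.log2(position + 2)  # position 0 gets discount log2(2) = 1
        for position, rel in enumerate(ranked_relevance[:k])
    )


def ndcg_at_k(ranked_relevance, k):
    """DCG normalized by the DCG of the ideal (sorted) ordering."""
    ideal = dcg_at_k(sorted(ranked_relevance, reverse=True), k)
    return dcg_at_k(ranked_relevance, k) / ideal if ideal > 0 else 0.0
```

Under these definitions, both metrics look only at the first k ranked items, which is one reason they can move much more between runs than an NDCG computed over the full ranking.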

config.json CHANGED

@@ -15905,6 +15905,7 @@
   "num_attention_heads": 32,
   "num_hidden_layers": 32,
   "num_key_value_heads": 8,
+  "num_labels": 7942,
   "quantization_config": {
     "_load_in_4bit": true,
     "_load_in_8bit": false,

eval_loss_plot.png CHANGED

eval_ndcg@25_plot.png CHANGED

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:71963d816ce7434c7169aa7fc2606cb22f0fda04a9c8f7a983637f02eca2adc9
+oid sha256:dd5f989c75988e88730d7c9804f1edf50eeb897b1ac5ae913d3e539f87b5beae
 size 4323010659

train_loss_plot.png CHANGED

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:89e3579564b24f58b16d5bc418fc04f168686996e1bff9fd0a1447ccece4634a
-size 5432
+oid sha256:4b213befbacaa13807e9aba070c28ef3a2e97ec3351f16dc389ae217c7e65988
+size 5905
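The binary files in this commit appear only as Git LFS pointer files, each with a `version`, `oid`, and `size` line. As a rough sketch, a pointer can be parsed and two revisions compared as below. The helper name `parse_lfs_pointer` is hypothetical; the pointer values are copied from the `model.safetensors` diff above.

```python
def parse_lfs_pointer(text):
    """Parse the key/value lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        if not line.strip():
            continue
        # Each line is "<key> <value>", separated by a single space.
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid line has the form "<hash-algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),  # size of the real file, in bytes
    }


old_ptr = parse_lfs_pointer("""\
version https://git-lfs.github.com/spec/v1
oid sha256:71963d816ce7434c7169aa7fc2606cb22f0fda04a9c8f7a983637f02eca2adc9
size 4323010659
""")

new_ptr = parse_lfs_pointer("""\
version https://git-lfs.github.com/spec/v1
oid sha256:dd5f989c75988e88730d7c9804f1edf50eeb897b1ac5ae913d3e539f87b5beae
size 4323010659
""")

# Same byte size, different content hash: the weights changed in place.
same_size = old_ptr["size"] == new_ptr["size"]
content_changed = old_ptr["digest"] != new_ptr["digest"]
```

For `model.safetensors` the size is identical across revisions while the digest differs, so the file was rewritten with the same tensor layout but different values. For `training_args.bin` both the digest and the size change (5432 to 5905 bytes).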