Model save

Files changed (6)

README.md CHANGED

@@ -16,10 +16,10 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.3](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.3) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: -289695770057323328.0000
-- Ndcg: 0.9560
-- Ndcg@25: 0.6207
-- Precision@25: 0.2464
+- Loss: -377379303853489216.0000
+- Ndcg: 0.9566
+- Ndcg@25: 0.5389
+- Precision@25: 0.2423
 ## Model description
@@ -51,10 +51,10 @@ The following hyperparameters were used during training:
 ### Training results
-| Training Loss            | Epoch  | Step | Validation Loss          | Ndcg   | Ndcg@25 | Precision@25 |
-|:------------------------:|:------:|:----:|:------------------------:|:------:|:-------:|:------------:|
-| -824219259923909888.0000 | 1.0    | 44   | -120189777058411312.0000 | 0.9555 | 0.2126  | 0.0          |
-| -91157196780129488.0000  | 1.9711 | 86   | -289695770057323328.0000 | 0.9560 | 0.6207  | 0.2464       |
+| Training Loss             | Epoch  | Step | Validation Loss          | Ndcg   | Ndcg@25 | Precision@25 |
+|:-------------------------:|:------:|:----:|:------------------------:|:------:|:-------:|:------------:|
+| -6819856770917832704.0000 | 1.0    | 44   | -289695770057323328.0000 | 0.9560 | 0.6207  | 0.2464       |
+| -160735776926492256.0000  | 1.9711 | 86   | -377379303853489216.0000 | 0.9566 | 0.5389  | 0.2423       |
 ### Framework versions
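
The README diff reports `Ndcg@25` and `Precision@25` without defining them. As a rough reference, the sketch below shows one common way these ranking metrics are computed. It assumes graded relevance labels listed in predicted rank order, linear gain, and a log2 discount; the helper names and the `threshold` parameter are hypothetical, and the training code behind this commit may use a different gain function or tie handling.

```python
import math
from typing import Sequence


def dcg_at_k(relevances: Sequence[float], k: int) -> float:
    """Discounted cumulative gain over the top-k items (linear gain, log2 discount)."""
    return sum(rel / math.log2(rank + 2) for rank, rel in enumerate(relevances[:k]))


def ndcg_at_k(relevances: Sequence[float], k: int) -> float:
    """DCG@k divided by the DCG@k of the ideal (descending) ordering."""
    ideal = dcg_at_k(sorted(relevances, reverse=True), k)
    return dcg_at_k(relevances, k) / ideal if ideal > 0 else 0.0


def precision_at_k(relevances: Sequence[float], k: int, threshold: float = 1.0) -> float:
    """Fraction of the top-k slots holding an item with relevance >= threshold.

    Divides by k even when fewer than k items are ranked (an assumption).
    """
    return sum(1 for rel in relevances[:k] if rel >= threshold) / k
```

With these definitions, `ndcg_at_k(labels, 25)` and `precision_at_k(labels, 25)` would correspond to the two `@25` columns, with `labels` being the true relevance of each item in the order the model ranked them.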

config.json CHANGED

@@ -1,7 +1,8 @@
 {
+  "_attn_implementation_autoset": true,
   "_name_or_path": "mistralai/Mistral-7B-Instruct-v0.3",
   "architectures": [
-    "LTRModel"
+    "MistralForCausalLM"
   ],
   "attention_dropout": 0.0,
   "bos_token_id": 1,
@@ -15925,7 +15926,7 @@
   "rope_theta": 1000000.0,
   "sliding_window": null,
   "tie_word_embeddings": false,
-  "torch_dtype": "float32",
+  "torch_dtype": "bfloat16",
   "transformers_version": "4.49.0",
   "use_cache": true,
   "vocab_size": 32768
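
The config.json diff touches three top-level keys. A small sketch of how such a change could be checked programmatically is shown below, using only the standard library. The `changed_keys` helper is hypothetical, and the two JSON strings contain only the keys visible in this diff, not the full config files.

```python
import json


def changed_keys(old: dict, new: dict) -> dict:
    """Map each top-level key whose value differs to an (old, new) pair.

    A key missing on one side is reported with None for that side.
    """
    keys = sorted(set(old) | set(new))
    return {k: (old.get(k), new.get(k)) for k in keys if old.get(k) != new.get(k)}


# Only the keys that appear as changed in the diff; everything else is omitted.
old_cfg = json.loads('{"architectures": ["LTRModel"], "torch_dtype": "float32"}')
new_cfg = json.loads(
    '{"_attn_implementation_autoset": true,'
    ' "architectures": ["MistralForCausalLM"],'
    ' "torch_dtype": "bfloat16"}'
)

diff = changed_keys(old_cfg, new_cfg)
for key, (before, after) in diff.items():
    print(f"{key}: {before!r} -> {after!r}")
```

Loading both full config files with `json.load` and passing them to the same helper would give a quick way to confirm that nothing beyond the architecture name, the dtype, and the autoset flag changed.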

eval_loss_plot.png CHANGED

eval_ndcg@25_plot.png CHANGED

train_loss_plot.png CHANGED

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f28aac9705c5796bc9414dbec1ffbbcacc5bed2ae18f36b3f5117eafd1bd7aa5
+oid sha256:9d8ecf06721ad87ab8121f2db0c7f9e06711bbd37d7a94b1f2e0165aeb84b036
 size 5432
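
The training_args.bin entry is a Git LFS pointer rather than the binary itself: the diff shows only the object hash changing while the size stays at 5432 bytes. A minimal sketch of reading such a pointer follows; `parse_lfs_pointer` is a hypothetical helper that assumes the simple `key value` line layout seen above and does no validation beyond that.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file into version, hash algorithm, digest and size."""
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line is "<key> <value>", split on the first space.
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# The new pointer contents, copied from the diff.
pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:9d8ecf06721ad87ab8121f2db0c7f9e06711bbd37d7a94b1f2e0165aeb84b036\n"
    "size 5432\n"
)
print(pointer["hash_algo"], pointer["size"])
```

Comparing the `digest` of the old and new pointers is enough to tell that the stored file changed, even though its size did not.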