---
license: apache-2.0
language:
- lv
base_model:
- meta-llama/Llama-3.1-8B-Instruct
tags:
- legal
---

This is a fine-tuned version of the **Llama3.1-8B-Instruct** model, adapted for answering questions about legislation in Latvia. The model was fine-tuned on a [dataset](http://hdl.handle.net/20.500.12574/130) of ~15 thousand question–answer pairs sourced from the [LVportals.lv](https://lvportals.lv/e-konsultacijas) archive.

A quantized version of the model is available for use with Ollama or other local LLM runtime environments that support the GGUF format.

The data preparation, fine-tuning process, and comprehensive evaluation are described in more detail in:

> Artis Pauniņš. Evaluation and Adaptation of Large Language Models for Question-Answering on Legislation. Master's Thesis. University of Latvia, 2025.

**Note**:

The model may occasionally generate overly long responses. To prevent this, it is recommended to set the `num_predict` parameter to limit the number of tokens generated, either in your Python code or in the `Modelfile`, depending on how the model is run.