barbaroo/gptsw3_translate_6.7B

Text Generation

barbaroo committed 21 days ago

Commit e3b5328 · verified · 1 Parent(s): 17ddc46

Update README.md

Files changed (1)

README.md +4 -5

README.md CHANGED

@@ -28,7 +28,7 @@ pipeline_tag: text-generation
 ### Model Sources
-- **Paper:** [COMING SOON]
+- **Paper:** Rethinking Low-Resource MT: The Surprising Effectiveness of Fine-Tuned Multilingual Models in the LLM Age (Scalvini et al., NoDaLiDa 2025)
 ---
 ## Uses
@@ -110,7 +110,7 @@ for sentence in sentences:
     # Generate the output
     outputs = model.generate(**inputs,
-                             max_new_tokens=2000,
+                             max_new_tokens=500,
                              eos_token_id=tokenizer.eos_token_id,  # Ensure EOS token is used
                              pad_token_id=tokenizer.pad_token_id,  # Ensure padding token is used
                              use_cache=True,
@@ -144,8 +144,7 @@ for sentence in sentences:
 ### Training Data
-We used the Sprotin parallel corpus for **English–Faroese** translation: [barbaroo/Sprotin_parallel](https://huggingface.co/datasets/barbaroo/Sprotin_parallel).
+We used the Sprotin parallel corpus for **English–Faroese** translation: [barbaroo/Sprotin_parallel](https://huggingface.co/datasets/barbaroo/Sprotin_parallel).
 ### Training Procedure
@@ -182,8 +181,8 @@ Human evaluation was also performed (see paper)
 ## Citation []
-[COMING SOON]
+Barbara Scalvini, Iben Nyholm Debess, Annika Simonsen, and Hafsteinn Einarsson. 2025. Rethinking Low-Resource MT: The Surprising Effectiveness of Fine-Tuned Multilingual Models in the LLM Age. In Proceedings of the Joint 25th Nordic Conference on Computational Linguistics and 11th Baltic Conference on Human Language Technologies (NoDaLiDa/Baltic-HLT 2025), pages 609–621, Tallinn, Estonia. University of Tartu Library.
 ---
 ## Framework versions
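
Note on the `max_new_tokens` change: this commit lowers the generation limit from 2000 to 500 tokens. The commit does not state a reason. For sentence-level translation, one possible way to choose such a limit is to scale it with the input length and keep a fixed upper cap. The sketch below is only an illustration of that idea. The helper name `translation_token_budget`, the `ratio` of 1.5, and the `slack` of 16 are assumptions and do not come from the model card. Only the cap of 500 matches the value in the diff.

```python
import math


def translation_token_budget(n_input_tokens: int,
                             ratio: float = 1.5,
                             slack: int = 16,
                             cap: int = 500) -> int:
    """Hypothetical helper: pick a max_new_tokens value for one input.

    ratio and slack are illustrative assumptions, not tuned values.
    cap mirrors the max_new_tokens=500 setting introduced in this commit.
    """
    if n_input_tokens < 0:
        raise ValueError("n_input_tokens must be non-negative")
    # Allow the output to be somewhat longer than the input, plus a small
    # fixed margin, but never exceed the hard cap.
    return min(cap, math.ceil(n_input_tokens * ratio) + slack)


# Possible use inside the README loop (assumes `inputs` is the tokenizer output):
#   budget = translation_token_budget(inputs["input_ids"].shape[1])
#   outputs = model.generate(**inputs, max_new_tokens=budget, ...)
```

A length-dependent limit like this keeps short sentences from generating long trailing output when the model fails to emit an EOS token, while the cap keeps the worst case equal to the fixed setting in the README.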