Julia287 commited on
Commit
9ec5904
·
verified ·
1 Parent(s): 645712e

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +49 -0
README.md ADDED
@@ -0,0 +1,49 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ datasets:
3
+ - togethercomputer/RedPajama-Data-V2
4
+ language:
5
+ - de
6
+ library_name: transformers
7
+ license: other
8
+ pipeline_tag: feature-extraction
9
+ tags:
10
+ - masked-lm
11
+ - long-context
12
+ base_model:
13
+ - LSX-UniWue/LLaMmlein_1B
14
+ ---
15
+
16
+ # LLäMmlein2Vec 1B
17
+
18
+ LLäMmlein2Vec 1B is a German encoder language model derived from our German decoder-only model [LLäMmlein 1B](https://huggingface.co/LSX-UniWue/LLaMmlein_1B) via [LLM2Vec](https://github.com/McGill-NLP/llm2vec).
19
+ Find more details in our [preprint](https://arxiv.org/abs/2505.13136)!
20
+
21
+
22
+ We provide three transformed models:
23
+
24
+
25
+ * [LLäMmlein 7B](https://huggingface.co/LSX-UniWue/LLaMmlein2Vec_7B)
26
+
27
+ * [LLäMmlein 1B](https://huggingface.co/LSX-UniWue/LLaMmlein2Vec_1B) ← You are here
28
+
29
+ * [LLäMmlein 120M](https://huggingface.co/LSX-UniWue/LLaMmlein2Vec_120M)
30
+
31
+
32
+ ### Usage
33
+ You can use LLäMmlein2Vec with the `llm2vec` library.
34
+
35
+ ```python
36
+ import torch
37
+ from llm2vec import LLM2Vec
38
+
39
+ model_id = "LSX-UniWue/LLaMmlein2Vec_1B"
40
+ l2v = LLM2Vec.from_pretrained(
41
+ model_id,
42
+ device_map="cuda" if torch.cuda.is_available() else "cpu",
43
+ torch_dtype=torch.bfloat16,
44
+ )
45
+ ```
46
+
47
+ ### License
48
+
49
+ We release the ModernGBERT models under a research-only RAIL-M license. See [license.md](./license.md) for details.