Update README.md

Files changed (1)

README.md

@@ -1,3 +1,24 @@
+---
+license: apache-2.0
+datasets:
+- bigcode/starcoderdata
+- bigcode/starcoder2data-extras
+language:
+- en
+tags:
+- code
+- python
+- java
+- javascript
+- typescript
+- go
+- rust
+- php
+- ruby
+- cpp
+- c
+- sql
+---
 # CodeModernBERT-Crow-v1-Pre
 ## Model Description
@@ -48,7 +69,7 @@ A custom **BPE tokenizer** was trained for code and docstrings.
 * **Number of layers**: 12
 * **Attention heads**: 12
 * **Intermediate size**: 3072
-* **Max sequence length**: 2048 tokens
+* **Max sequence length: 8192** (during training, inputs were limited to 1024 tokens)
 * **RoPE positional encoding**: supported
 ---
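
The updated line distinguishes two lengths: the 8192-token maximum and the 1024-token cap applied to training inputs. The sketch below shows one way a caller might record these values and choose a truncation length. The names `EncoderSpec` and `truncate_ids` are hypothetical and not part of the model repository. Defaulting to the training length is a cautious assumption, since the README does not say how the model behaves on inputs longer than 1024 tokens.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class EncoderSpec:
    """Architecture values copied from the README diff (hypothetical container)."""
    num_layers: int = 12
    attention_heads: int = 12
    intermediate_size: int = 3072
    max_sequence_length: int = 8192      # limit stated in the updated README
    trained_sequence_length: int = 1024  # cap applied to inputs during training


def truncate_ids(token_ids, spec=EncoderSpec(), stay_within_training_length=True):
    """Cut a list of token ids to a length chosen from the spec.

    By default the ids are cut to the training-time length (an assumption made
    here for caution); pass stay_within_training_length=False to allow inputs
    up to the stated maximum instead.
    """
    if stay_within_training_length:
        limit = spec.trained_sequence_length
    else:
        limit = spec.max_sequence_length
    return list(token_ids[:limit])


if __name__ == "__main__":
    ids = list(range(5000))  # stand-in for tokenizer output
    print(len(truncate_ids(ids)))
    print(len(truncate_ids(ids, stay_within_training_length=False)))
```

With a real tokenizer, the same limit would typically be passed as the tokenizer's own truncation length rather than applied afterwards.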