LargeWorldModel
/

LWM-Text-Chat-256K-Jax

Model card Files Files and versions

wilson1yan commited on Feb 12, 2024

Commit

59937e9

·

verified ·

1 Parent(s): 00f5240

Create README.md

Files changed (1) hide show

README.md +31 -0

README.md ADDED Viewed

	@@ -0,0 +1,31 @@

+---
+inference: false
+---
+<br>
+<br>
+# LWM-Text-Chat-256K-Jax Model Card
+## Model details
+**Model type:**
+LWM-Text-Chat-256K-Jax is an open-source model trained from LLaMA-2 on a subset of Books3 filtered data. It is an auto-regressive language model, based on the transformer architecture.
+The model is a Jax checkpoint. Inference code and instructions can be found at: https://github.com/LargeWorldModel/lwm
+**Model date:**
+LWM-Text-Chat-256K-Jax was trained in December 2023.
+**Paper or resources for more information:**
+https://largeworldmodel.github.io/
+## License
+Llama 2 is licensed under the LLAMA 2 Community License,
+Copyright (c) Meta Platforms, Inc. All Rights Reserved.
+**Where to send questions or comments about the model:**
+https://github.com/LargeWorldModel/lwm/issues
+## Training dataset
+- 92K subset of Books3 documents with 100K to 200K tokens