Update README.md
README.md CHANGED
@@ -10,4 +10,13 @@ This is just the encoder weights from "google/t5-v1_1-xl"
 It takes 11GB down to 4GB.
 
 The script to do the extraction is included here as
-[transform.py](transform.py)
+[transform.py](transform.py)
+
+Edit: Now that I have this in a convenient form...
+I got a chance to test t5-xxl projected down to 2048, vs this t5-xl.
+Surprisingly, even with an untrained projection layer, trivial embedding diversity scores rate
+the projected xxl version higher than native xl at 2048.
+
+So, while this model will continue to exist as a convenient way to compare.. and possibly as something
+to use if you are really, really REALLY tight on memory... you are probably best off
+using t5-xxl whenever you can.
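The projection experiment described in the added lines can be sketched roughly as follows. This is a minimal illustration, assuming NumPy: the random Gaussian projection, the batch of fake embeddings, and the `diversity` metric are all hypothetical stand-ins for "an untrained projection layer" and "trivial embedding diversity scores", not the author's actual code or evaluation.

```python
import numpy as np

# Hypothetical sketch: project 4096-dim t5-xxl token embeddings down to
# 2048 dims (t5-xl's hidden size) with an *untrained* linear projection,
# i.e. a random Gaussian matrix. Scaling by 1/sqrt(d_out) keeps vector
# norms roughly stable (Johnson-Lindenstrauss style).
rng = np.random.default_rng(0)
d_xxl, d_xl = 4096, 2048
W = rng.standard_normal((d_xxl, d_xl)) / np.sqrt(d_xl)

# Stand-in for a small batch of embeddings from the xxl encoder.
emb_xxl = rng.standard_normal((8, d_xxl))
emb_proj = emb_xxl @ W  # shape (8, 2048)

def diversity(e):
    """A trivial diversity score: mean pairwise cosine *distance*.

    Higher means the embeddings point in more different directions.
    """
    e = e / np.linalg.norm(e, axis=1, keepdims=True)
    sims = e @ e.T
    n = len(e)
    return 1.0 - (sims.sum() - n) / (n * (n - 1))

print(emb_proj.shape)                 # (8, 2048)
print(round(diversity(emb_proj), 3))  # near 1.0 for near-orthogonal vectors
```

A random projection approximately preserves pairwise angles, which is one plausible reason a down-projected xxl embedding can still score well on a metric like this even before the projection layer is trained.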