llm-jp/optimal-sparsity-math-d2048-E128-k4-52.2B-A2.3B Text Generation • 52B • Updated 12 days ago • 17
llm-jp/optimal-sparsity-math-d2048-E64-k4-26.4B-A2.3B Text Generation • 26B • Updated 12 days ago • 21
llm-jp/optimal-sparsity-math-d2048-E32-k4-13.6B-A2.3B Text Generation • 14B • Updated 12 days ago • 22
llm-jp/optimal-sparsity-math-d1024-E256-k4-26.0B-A670M Text Generation • 26B • Updated 12 days ago • 21
llm-jp/optimal-sparsity-math-d1024-E128-k4-13.2B-A670M Text Generation • 13B • Updated 12 days ago • 24
llm-jp/optimal-sparsity-math-d512-E32-k4-920M-A220M Text Generation • 0.9B • Updated 12 days ago • 21
llm-jp/optimal-sparsity-math-d512-E16-k4-520M-A220M Text Generation • 0.5B • Updated 12 days ago • 21
llm-jp/optimal-sparsity-math-d2048-E128-k2-52.2B-A1.5B Text Generation • 52B • Updated 12 days ago • 22
llm-jp/optimal-sparsity-math-d2048-E64-k2-26.4B-A1.5B Text Generation • 26B • Updated 12 days ago • 19
llm-jp/optimal-sparsity-math-d2048-E32-k2-13.6B-A1.5B Text Generation • 14B • Updated 12 days ago • 24
llm-jp/optimal-sparsity-math-d1024-E256-k2-26.0B-A470M Text Generation • 26B • Updated 12 days ago • 24
llm-jp/optimal-sparsity-math-d1024-E128-k2-13.2B-A470M Text Generation • 13B • Updated 12 days ago • 24
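
The repository names above follow a regular pattern, so the configuration can be read straight out of each identifier. The following is a minimal sketch, not an official tool: `ModelSpec` and `parse_repo_id` are hypothetical names, and the reading of the fields (`d` as model width, `E` as number of experts, `k` as experts selected per token, the last two fields as total and active parameter counts) is an assumption inferred from the names alone, not something the listing states.

```python
import re
from typing import NamedTuple


class ModelSpec(NamedTuple):
    """Fields recovered from a repository name (interpretation is assumed)."""
    width: int          # value after "d" (assumed: model width)
    experts: int        # value after "E" (assumed: number of experts)
    top_k: int          # value after "k" (assumed: experts used per token)
    total_params: str   # e.g. "52.2B" (assumed: total parameters)
    active_params: str  # e.g. "2.3B", taken from "A2.3B" (assumed: active parameters)


# Matches the trailing "-d<int>-E<int>-k<int>-<size>-A<size>" part of a name.
PATTERN = re.compile(r"-d(\d+)-E(\d+)-k(\d+)-([\d.]+[BM])-A([\d.]+[BM])$")


def parse_repo_id(repo_id: str) -> ModelSpec:
    """Split a repository identifier into its numeric fields."""
    match = PATTERN.search(repo_id)
    if match is None:
        raise ValueError(f"unrecognized repository name: {repo_id!r}")
    width, experts, top_k, total, active = match.groups()
    return ModelSpec(int(width), int(experts), int(top_k), total, active)


spec = parse_repo_id("llm-jp/optimal-sparsity-math-d2048-E128-k4-52.2B-A2.3B")
print(spec)
# Fraction of experts selected per token, under the assumed reading of k and E.
print(spec.top_k / spec.experts)
```

Keeping the parameter counts as strings avoids guessing at rounding; they can be converted to numbers if a comparison across the list is needed.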