no gpu utilization
#10 opened almost 2 years ago
by
Phanindra49
Convert weights.npz to safetensors
#9 opened almost 2 years ago
by
projectprogramamark
Long latency and low gpu utilization
#8 opened almost 2 years ago
by
Scott0612
Quantised model for mlx-community/Mistral-7B-Instruct-v0.2
1
#6 opened almost 2 years ago
by
akashicmarga
Script that converted this model?
👍
1
#5 opened almost 2 years ago
by
nheagy