Update README.md
README.md CHANGED
@@ -1,3 +1,14 @@
----
-license: mit
----
+---
+license: mit
+pipeline_tag: text-generation
+tags: [ONNX, ONNXRuntime, phi3.5, nlp, conversational, custom_code]
+inference: false
+---
+
+Based on https://huggingface.co/microsoft/Phi-4-mini-instruct
+
+The ONNX model was converted using https://github.com/microsoft/onnxruntime-genai
+
+Using the command: python -m onnxruntime_genai.models.builder -m microsoft/Phi-4-mini-instruct -o Phi-4-mini-instruct-onnx -e webgpu -c cache-dir -p int4 --extra_options int4_block_size=32 int4_accuracy_level=4
+
+The generated external data file (model.onnx.data) is larger than 2GB, which is not suitable for ORT-Web, so I use an additional Python script to move some of the data into model.onnx.
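
The repartitioning script itself is not included in the commit. Below is a minimal sketch of how such a script could look, assuming the `onnx` Python package; the file paths, output folder name, and size threshold are illustrative assumptions, not the exact values used.

```python
# repartition_external_data.py -- a minimal sketch, NOT the exact script used here.
# Assumes the `onnx` Python package; paths and the size threshold are assumptions
# and would need tuning for the real model.
import os
import onnx

SRC = "Phi-4-mini-instruct-onnx/model.onnx"   # output of the model builder (assumed path)
DST_DIR = "Phi-4-mini-instruct-onnx-web"      # hypothetical output folder
DST = os.path.join(DST_DIR, "model.onnx")

os.makedirs(DST_DIR, exist_ok=True)

# Load the graph together with its external weights (model.onnx.data).
model = onnx.load(SRC, load_external_data=True)

# Re-save: tensors below size_threshold are embedded in model.onnx, only larger
# ones stay in model.onnx.data. Raising the threshold moves more data into
# model.onnx; model.onnx itself must also stay under the 2GB protobuf limit,
# so the threshold has to balance the two files.
onnx.save_model(
    model,
    DST,
    save_as_external_data=True,
    all_tensors_to_one_file=True,
    location="model.onnx.data",
    size_threshold=4 * 1024 * 1024,  # 4 MB cutoff -- an assumed value, tune until both files fit
    convert_attribute=False,
)

# Report the resulting file sizes.
for name in ("model.onnx", "model.onnx.data"):
    path = os.path.join(DST_DIR, name)
    print(name, round(os.path.getsize(path) / 2**30, 2), "GiB")
```

The threshold is the only knob in this sketch: it trades model.onnx size against model.onnx.data size, with the goal stated above of keeping each file under 2GB for ORT-Web.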