Update README.md
Browse files
README.md
CHANGED
@@ -27,12 +27,12 @@ The ONNX models are tested on:
|
|
27 |
ONNX enables you to run your models on-device across CPU, GPU, NPU. With ONNX you can run your models on any machine across all silica Qualcomm, AMD, Intel, Nvidia. See table below for some key benchmarks for Windows GPU and CPU devices.
|
28 |
| **Model** | **Precisionl** | **Device Type** | **Execution Provider** | **Device** | **Token Generation Throughput** | **Speed up vs base model**|
|
29 |
| :------------: | :------------: | :------------: | :------------: | :------------: | :------------: | :------------:|
|
30 |
-
|deepseek-ai_DeepSeek-R1-Distill-Qwen-1.5B|
|
31 |
-
|deepseek-ai_DeepSeek-R1-Distill-Qwen-1.5B|
|
32 |
-
|deepseek-ai_DeepSeek-R1-Distill-Qwen-
|
33 |
-
|deepseek-ai_DeepSeek-R1-Distill-Qwen-7B|
|
34 |
-
|deepseek-ai_DeepSeek-R1-Distill-Qwen-7B|
|
35 |
-
|deepseek-ai_DeepSeek-R1-Distill-Qwen-
|
36 |
|
37 |
CPU build specs:
|
38 |
|
|
|
27 |
ONNX enables you to run your models on-device across CPU, GPU, NPU. With ONNX you can run your models on any machine across all silica Qualcomm, AMD, Intel, Nvidia. See table below for some key benchmarks for Windows GPU and CPU devices.
|
28 |
| **Model** | **Precisionl** | **Device Type** | **Execution Provider** | **Device** | **Token Generation Throughput** | **Speed up vs base model**|
|
29 |
| :------------: | :------------: | :------------: | :------------: | :------------: | :------------: | :------------:|
|
30 |
+
| deepseek-ai_DeepSeek-R1-Distill-Qwen-1.5B | ONNX | fp16 | CUDA | RTX 4090 | 197.195 | 4X |
|
31 |
+
| deepseek-ai_DeepSeek-R1-Distill-Qwen-1.5B | ONNX | int4 | CUDA | RTX 4090 | 313.32 | 6.3X |
|
32 |
+
| deepseek-ai_DeepSeek-R1-Distill-Qwen-1.5B | ONNX | int4 | CPU | Intel i9 | 11.749 | 1.4x |
|
33 |
+
| deepseek-ai_DeepSeek-R1-Distill-Qwen-7B | ONNX | fp16 | CUDA | RTX 4090 | 57.316 | 1.3X |
|
34 |
+
| deepseek-ai_DeepSeek-R1-Distill-Qwen-7B | ONNX | int4 | CUDA | RTX 4090 | 161.00 | 3.7X |
|
35 |
+
| deepseek-ai_DeepSeek-R1-Distill-Qwen-7B | ONNX | int4 | CPU | Intel i9 | 3.184 | 20X |
|
36 |
|
37 |
CPU build specs:
|
38 |
|