helenai
/

Qwen2.5-VL-7B-Instruct-ov-int4

Model card Files Files and versions Community

helenai commited on 8 days ago

Commit

fd1d860

·

verified ·

1 Parent(s): d5cbbf4

Update README.md

Files changed (1) hide show

README.md +40 -0

README.md CHANGED Viewed

@@ -5,11 +5,15 @@ base_model:
 This is the [Qwen/Qwen2.5-VL-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-VL-7B-Instruct) model, converted to OpenVINO, with int4 weights for the language model, int8 weights for the other models.
 To download the model, run `pip install huggingface-hub[cli]` and then:
 ```
 huggingface-cli download helenai/Qwen2.5-VL-7B-Instruct-ov-int4 --local-dir Qwen2.5-VL-7B-Instruct-ov-int4
 ```
 Use OpenVINO GenAI to run inference on this model. This model works with OpenVINO GenAI 2025.2 and later.
 - Install OpenVINO GenAI and pillow:
@@ -43,3 +47,39 @@ print(result.texts[0])
 ```
 See [OpenVINO GenAI repository](https://github.com/openvinotoolkit/openvino.genai?tab=readme-ov-file#performing-visual-language-text-generation)

 This is the [Qwen/Qwen2.5-VL-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-VL-7B-Instruct) model, converted to OpenVINO, with int4 weights for the language model, int8 weights for the other models.
+## Download Model
 To download the model, run `pip install huggingface-hub[cli]` and then:
 ```
 huggingface-cli download helenai/Qwen2.5-VL-7B-Instruct-ov-int4 --local-dir Qwen2.5-VL-7B-Instruct-ov-int4
 ```
+## Run inference with OpenVINO GenAI
 Use OpenVINO GenAI to run inference on this model. This model works with OpenVINO GenAI 2025.2 and later.
 - Install OpenVINO GenAI and pillow:
 ```
 See [OpenVINO GenAI repository](https://github.com/openvinotoolkit/openvino.genai?tab=readme-ov-file#performing-visual-language-text-generation)
+## Model export properties
+Model export command:
+```
+optimum-cli export openvino -m Qwen/Qwen2.5-VL-7B-Instruct --weight-format int4 Qwen2.5-VL-7B-Instruct-ov-int4
+```
+### Framework versions
+```
+openvino         : 2025.2.0-19140-c01cd93e24d-releases/2025/2
+nncf             : 2.17.0.dev0+c6296072
+optimum_intel    : 1.26.0.dev0+0e2ccef
+optimum          : 1.27.0
+pytorch          : 2.7.0+cpu
+transformers     : 4.51.3
+```
+### LLM export properties
+```
+all_layers               : False
+awq                      : False
+backup_mode              : int8_asym
+compression_format       : dequantize
+gptq                     : False
+group_size               : 128
+ignored_scope            : []
+lora_correction          : False
+mode                     : int4_asym
+ratio                    : 1.0
+scale_estimation         : False
+sensitivity_metric       : weight_quantization_error
+```