TsienDragon committed on
Commit
7525139
·
verified ·
0 Parent(s):

initial commit

Files changed (2)
  1. .gitattributes +55 -0
  2. README.md +96 -0
.gitattributes ADDED
@@ -0,0 +1,55 @@
+ *.7z filter=lfs diff=lfs merge=lfs -text
+ *.arrow filter=lfs diff=lfs merge=lfs -text
+ *.bin filter=lfs diff=lfs merge=lfs -text
+ *.bz2 filter=lfs diff=lfs merge=lfs -text
+ *.ckpt filter=lfs diff=lfs merge=lfs -text
+ *.ftz filter=lfs diff=lfs merge=lfs -text
+ *.gz filter=lfs diff=lfs merge=lfs -text
+ *.h5 filter=lfs diff=lfs merge=lfs -text
+ *.joblib filter=lfs diff=lfs merge=lfs -text
+ *.lfs.* filter=lfs diff=lfs merge=lfs -text
+ *.lz4 filter=lfs diff=lfs merge=lfs -text
+ *.mlmodel filter=lfs diff=lfs merge=lfs -text
+ *.model filter=lfs diff=lfs merge=lfs -text
+ *.msgpack filter=lfs diff=lfs merge=lfs -text
+ *.npy filter=lfs diff=lfs merge=lfs -text
+ *.npz filter=lfs diff=lfs merge=lfs -text
+ *.onnx filter=lfs diff=lfs merge=lfs -text
+ *.ot filter=lfs diff=lfs merge=lfs -text
+ *.parquet filter=lfs diff=lfs merge=lfs -text
+ *.pb filter=lfs diff=lfs merge=lfs -text
+ *.pickle filter=lfs diff=lfs merge=lfs -text
+ *.pkl filter=lfs diff=lfs merge=lfs -text
+ *.pt filter=lfs diff=lfs merge=lfs -text
+ *.pth filter=lfs diff=lfs merge=lfs -text
+ *.rar filter=lfs diff=lfs merge=lfs -text
+ *.safetensors filter=lfs diff=lfs merge=lfs -text
+ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
+ *.tar.* filter=lfs diff=lfs merge=lfs -text
+ *.tar filter=lfs diff=lfs merge=lfs -text
+ *.tflite filter=lfs diff=lfs merge=lfs -text
+ *.tgz filter=lfs diff=lfs merge=lfs -text
+ *.wasm filter=lfs diff=lfs merge=lfs -text
+ *.xz filter=lfs diff=lfs merge=lfs -text
+ *.zip filter=lfs diff=lfs merge=lfs -text
+ *.zst filter=lfs diff=lfs merge=lfs -text
+ *tfevents* filter=lfs diff=lfs merge=lfs -text
+ # Audio files - uncompressed
+ *.pcm filter=lfs diff=lfs merge=lfs -text
+ *.sam filter=lfs diff=lfs merge=lfs -text
+ *.raw filter=lfs diff=lfs merge=lfs -text
+ # Audio files - compressed
+ *.aac filter=lfs diff=lfs merge=lfs -text
+ *.flac filter=lfs diff=lfs merge=lfs -text
+ *.mp3 filter=lfs diff=lfs merge=lfs -text
+ *.ogg filter=lfs diff=lfs merge=lfs -text
+ *.wav filter=lfs diff=lfs merge=lfs -text
+ # Image files - uncompressed
+ *.bmp filter=lfs diff=lfs merge=lfs -text
+ *.gif filter=lfs diff=lfs merge=lfs -text
+ *.png filter=lfs diff=lfs merge=lfs -text
+ *.tiff filter=lfs diff=lfs merge=lfs -text
+ # Image files - compressed
+ *.jpg filter=lfs diff=lfs merge=lfs -text
+ *.jpeg filter=lfs diff=lfs merge=lfs -text
+ *.webp filter=lfs diff=lfs merge=lfs -text
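To see which files these rules would route through Git LFS, the patterns can be checked against file names. The sketch below is an approximation: it assumes Python's `fnmatch` is close enough to gitattributes matching for the simple basename patterns in this file, and it does not handle path patterns such as `saved_model/**/*`. The helper name `is_lfs_tracked`, the shortened pattern list, and the weight file name are illustrative, not taken from the repository.

```python
from fnmatch import fnmatchcase
from pathlib import PurePosixPath

# A subset of the basename patterns from the .gitattributes file.
LFS_PATTERNS = ["*.safetensors", "*.bin", "*.pt", "*.tar.*", "*tfevents*", "*.jpg", "*.png"]


def is_lfs_tracked(path: str) -> bool:
    """Return True if the file's basename matches any listed pattern."""
    name = PurePosixPath(path).name
    return any(fnmatchcase(name, pattern) for pattern in LFS_PATTERNS)


# "pytorch_lora_weights.safetensors" is a hypothetical file name.
for path in ["pytorch_lora_weights.safetensors", "images/input_image.jpg", "README.md"]:
    print(path, is_lfs_tracked(path))
```

Patterns without a slash match the file's basename at any depth, which is why only the last path component is compared here.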
README.md ADDED
@@ -0,0 +1,96 @@
+ ---
+ tags:
+ - text-to-image
+ - lora
+ - diffusers
+ - template:diffusion-lora
+ widget:
+ - output:
+     url: images/input_image.jpg
+   text: Original Image
+ - output:
+     url: images/result_base_model.jpg
+   text: change the face to face segmentation mask
+ - output:
+     url: images/result_lora_model.jpg
+   text: change the face to face segmentation mask
+ base_model: Qwen/Qwen-Image
+ instance_prompt: null
+ license: mit
+ ---
+ # Qwen-Image-Lora-Faceseg
+
+ <Gallery />
+
+ ## Model description
+
+ # Face Segmentation Model Description
+ ## Overview
+ This is a LoRA fine-tuned face segmentation model built on Qwen-Image-Edit, which in turn builds on the Qwen-VL (Qwen Vision-Language) foundation. It is designed to transform facial images into segmentation masks. The base model is adapted through Parameter-Efficient Fine-Tuning (PEFT) using LoRA (Low-Rank Adaptation).
+ ## Model Architecture
+ - Base Model: Qwen-Image-Edit (built on Qwen-VL foundation)
+ - Fine-tuning Method: LoRA (Low-Rank Adaptation)
+ - Task: Image-to-Image translation (Face → Segmentation Mask)
+ - Input: RGB facial images
+ - Output: Binary/grayscale segmentation masks highlighting facial regions
+ ## Training Configuration
+ - Dataset: 20 carefully curated face segmentation samples
+ - Training Steps: 900-1000 steps
+ - Prompt: "change the image from the face to the face segmentation mask"
+ - Precision Options:
+   - BF16 precision for high-quality results
+   - FP4 quantization for memory-efficient deployment
+ ## Key Features
+ 1. High Precision Segmentation: Accurately identifies and segments facial boundaries with fine detail preservation
+ 2. Memory Efficient: FP4 quantized version maintains competitive quality while significantly reducing memory footprint
+ 3. Fast Inference: Optimized for real-time applications with 20 inference steps
+ 4. Robust Performance: Handles various lighting conditions and facial orientations
+ 5. Parameter Efficient: Only trains LoRA adapters (~1M parameters) while keeping base model frozen
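The parameter-efficiency claim follows from how LoRA factorizes a weight update. For a linear layer of shape `d_out x d_in`, LoRA trains two matrices of rank `r`, adding `r * (d_in + d_out)` parameters instead of `d_in * d_out`. The sketch below works through that arithmetic; the layer width, rank, and helper names are hypothetical and are not taken from this model's training configuration.

```python
def lora_param_count(d_in: int, d_out: int, rank: int) -> int:
    """Parameters added by one LoRA adapter: A is (rank x d_in), B is (d_out x rank)."""
    return rank * (d_in + d_out)


def full_param_count(d_in: int, d_out: int) -> int:
    """Parameters in the frozen dense weight matrix (bias ignored)."""
    return d_in * d_out


# Hypothetical layer width and rank, chosen only for illustration.
d_in, d_out, rank = 3072, 3072, 16
adapter = lora_param_count(d_in, d_out, rank)
dense = full_param_count(d_in, d_out)
print(f"adapter: {adapter:,} params, dense: {dense:,} params, ratio: {adapter / dense:.4f}")
```

The total adapter size depends on how many layers are adapted and at what rank, so a figure like ~1M cannot be checked without the actual training configuration.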
+ ## Technical Specifications
+ - Inference Steps: 20
+ - CFG Scale: 2.5
+ - Input Resolution: Configurable (typically 512x512)
+ - Model Size: Base model + ~1M LoRA parameters
+ - Memory Usage:
+   - BF16 version: Higher memory, best quality
+   - FP4 version: 75% memory reduction, competitive quality
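The settings above map onto a diffusers call roughly as follows. This is a sketch under several assumptions, not the author's inference script: it assumes a recent diffusers release that ships `QwenImageEditPipeline`, that the LoRA weights in this repository are in a format `load_lora_weights` can read, and that "CFG Scale" corresponds to the pipeline's `true_cfg_scale` argument. The function name `run_face_segmentation` and the file paths are placeholders.

```python
# Settings taken from the Technical Specifications list.
SETTINGS = {
    "prompt": "change the face to face segmentation mask",
    "num_inference_steps": 20,
    "true_cfg_scale": 2.5,   # assumption: "CFG Scale" maps to true_cfg_scale
    "negative_prompt": " ",  # true CFG is only applied when a negative prompt is given
}


def run_face_segmentation(image_path: str, output_path: str) -> None:
    """Load the base model plus LoRA and write the predicted mask to output_path."""
    # Heavy dependencies are imported here so SETTINGS can be inspected without them.
    import torch
    from diffusers import QwenImageEditPipeline
    from PIL import Image

    pipe = QwenImageEditPipeline.from_pretrained(
        "Qwen/Qwen-Image-Edit", torch_dtype=torch.bfloat16
    )
    # Assumption: the repository's weights are in a format diffusers can load.
    pipe.load_lora_weights("TsienDragon/qwen-image-edit-lora-face-segmentation")
    pipe.to("cuda")

    image = Image.open(image_path).convert("RGB")
    with torch.inference_mode():
        result = pipe(image=image, generator=torch.manual_seed(0), **SETTINGS)
    result.images[0].save(output_path)
```

Calling `run_face_segmentation("input_image.jpg", "mask.png")` on a machine with a suitable GPU would run the BF16 path; the FP4 variant needs a separate quantization setup that is not shown here.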
+ ## Use Cases
+ - Identity Verification: KYC (Know Your Customer) applications
+ - Privacy Protection: Face anonymization while preserving facial structure
+ - Medical Applications: Facial analysis and dermatological assessments
+ - AR/VR Applications: Real-time face tracking and segmentation
+ - Content Creation: Automated face masking for video editing
+ ## Performance Highlights
+ - Accuracy: Significantly improved boundary detection compared to base model
+ - Detail Preservation: Maintains fine facial features in segmentation masks
+ - Consistency: Stable segmentation quality across different input conditions
+ - Efficiency: FP4 quantization achieves 4x memory savings with minimal quality loss
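The 75% reduction and 4x savings figures are the same ratio stated two ways, and both follow from bits per weight: 16 for BF16 against 4 for FP4. The sketch below checks that arithmetic for weight storage only. It ignores activations and the scale metadata that real 4-bit formats carry, and the parameter count is a hypothetical round number rather than the size of this model.

```python
def weight_bytes(num_params: int, bits_per_param: int) -> float:
    """Bytes needed to store num_params weights at the given bit width."""
    return num_params * bits_per_param / 8


# Hypothetical parameter count, used only to make the ratio concrete.
num_params = 1_000_000_000
bf16 = weight_bytes(num_params, 16)
fp4 = weight_bytes(num_params, 4)

savings_factor = bf16 / fp4   # how many times smaller FP4 storage is
reduction = 1 - fp4 / bf16    # fraction of BF16 storage saved
print(f"BF16: {bf16 / 1e9:.1f} GB, FP4: {fp4 / 1e9:.1f} GB")
print(f"{savings_factor:.0f}x smaller, {reduction:.0%} reduction")
```

In practice the measured saving is usually somewhat below this ideal because some layers stay in higher precision.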
+ ## Deployment Options
+ - High-Quality Mode: BF16 precision for maximum accuracy
+ - Efficient Mode: FP4 quantization for resource-constrained environments
+ - Real-time Applications: Optimized inference pipeline for low-latency requirements
+ This model is a practical solution for face segmentation tasks, balancing accuracy, efficiency, and deployability across various hardware configurations.
+
+ ## Example:
+ Control Image
+ ![input_image.jpg](https://cdn-uploads.huggingface.co/production/uploads/641af68ea5f876fe30c38508/sPFRuwzgdMjUNWkL84jLl.jpeg)
+
+ Image edited by the base Qwen-Image-Edit model with the prompt
+ `change the face to face segmentation mask`
+
+ ![result_base_model.jpg](https://cdn-uploads.huggingface.co/production/uploads/641af68ea5f876fe30c38508/v20z6hctGEY_DdP5WtFFv.jpeg)
+
+ After LoRA fine-tuning, with the same prompt
+
+ ![result_lora_model.jpg](https://cdn-uploads.huggingface.co/production/uploads/641af68ea5f876fe30c38508/pE6F_FSSfdxphfrfiZjeu.jpeg)
+
+ ## Code
+ LoRA fine-tuning code for Qwen-Image-Edit:
+ https://github.com/tsiendragon/qwen-image-finetune
+
+
+
+ ## Download model
+
+
+ [Download](/TsienDragon/qwen-image-edit-lora-face-segmentation/tree/main) the weights in the Files & versions tab.