|  | --- | 
					
						
						|  | base_model: | 
					
						
						|  | - deepseek-ai/DeepSeek-R1-Distill-Qwen-14B | 
					
						
						|  | - YOYO-AI/Qwen2.5-Coder-14B-YOYO-1010 | 
					
						
						|  | - qihoo360/Light-R1-14B-DS | 
					
						
						|  | - Qwen/Qwen2.5-14B-Instruct | 
					
						
						|  | - Qwen/Qwen2.5-14B-Instruct-1M | 
					
						
						|  | - arcee-ai/Virtuoso-Small-v2 | 
					
						
						|  | - tanliboy/lambda-qwen2.5-14b-dpo-test | 
					
						
						|  | library_name: transformers | 
					
						
						|  | tags: | 
					
						
						|  | - mergekit | 
					
						
						|  | - merge | 
					
						
						|  |  | 
					
						
						|  | --- | 
					
						
						|  | # merge | 
					
						
						|  |  | 
					
						
						|  | This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit). | 
					
						
						|  |  | 
					
						
						|  | ## Merge Details | 
					
						
						|  | ### Merge Method | 
					
						
						|  |  | 
					
						
						|  | This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method using [Qwen/Qwen2.5-14B-Instruct](https://huggingface.co/Qwen/Qwen2.5-14B-Instruct) as a base. | 
					
						
						|  |  | 
					
						
						|  | ### Models Merged | 
					
						
						|  |  | 
					
						
						|  | The following models were included in the merge: | 
					
						
						|  | * [deepseek-ai/DeepSeek-R1-Distill-Qwen-14B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-14B) | 
					
						
						|  | * [YOYO-AI/Qwen2.5-Coder-14B-YOYO-1010](https://huggingface.co/YOYO-AI/Qwen2.5-Coder-14B-YOYO-1010) | 
					
						
						|  | * [qihoo360/Light-R1-14B-DS](https://huggingface.co/qihoo360/Light-R1-14B-DS) | 
					
						
						|  | * [Qwen/Qwen2.5-14B-Instruct-1M](https://huggingface.co/Qwen/Qwen2.5-14B-Instruct-1M) | 
					
						
						|  | * [arcee-ai/Virtuoso-Small-v2](https://huggingface.co/arcee-ai/Virtuoso-Small-v2) | 
					
						
						|  | * [tanliboy/lambda-qwen2.5-14b-dpo-test](https://huggingface.co/tanliboy/lambda-qwen2.5-14b-dpo-test) | 
					
						
						|  |  | 
					
						
						|  | ### Configuration | 
					
						
						|  |  | 
					
						
						|  | The following YAML configuration was used to produce this model: | 
					
						
						|  |  | 
					
						
						|  | ```yaml | 
					
						
						|  | models: | 
					
						
						|  | - model: deepseek-ai/DeepSeek-R1-Distill-Qwen-14B | 
					
						
						|  | - model: qihoo360/Light-R1-14B-DS | 
					
						
						|  | - model: arcee-ai/Virtuoso-Small-v2 | 
					
						
						|  | - model: Qwen/Qwen2.5-14B-Instruct | 
					
						
						|  | - model: YOYO-AI/Qwen2.5-Coder-14B-YOYO-1010 | 
					
						
						|  | - model: Qwen/Qwen2.5-14B-Instruct-1M | 
					
						
						|  | - model: tanliboy/lambda-qwen2.5-14b-dpo-test | 
					
						
						|  | merge_method: model_stock | 
					
						
						|  | base_model: Qwen/Qwen2.5-14B-Instruct | 
					
						
						|  | tokenizer_source: base | 
					
						
						|  | normalize: true | 
					
						
						|  | int8_mask: true | 
					
						
						|  | dtype: bfloat16 | 
					
						
						|  | ``` | 
					
						
						|  |  |