Update README.md
Browse files
README.md
CHANGED
@@ -5,12 +5,12 @@ language:
|
|
5 |
- en
|
6 |
---
|
7 |
|
8 |
-
# Model Card for
|
9 |
|
10 |

|
11 |
|
12 |
## Model Details
|
13 |
-
**Model Name**:
|
14 |
**Base Model**: Qwen/Qwen1.5-1.8B
|
15 |
**Publisher**: M4-ai
|
16 |
**Model Type**: Question answering, conversational AI, code generation, medical text comprehension, mathematical reasoning, logical reasoning.
|
@@ -18,7 +18,7 @@ language:
|
|
18 |
**License**: Apache-2.0
|
19 |
|
20 |
## Model Description
|
21 |
-
`
|
22 |
|
23 |
## Intended Use
|
24 |
This model is intended for researchers and practitioners looking for a powerful tool to tackle challenging problems in scientific domains. It can be used in the following scenarios:
|
@@ -28,7 +28,7 @@ This model is intended for researchers and practitioners looking for a powerful
|
|
28 |
- Automation in code generation and understanding complex programming context.
|
29 |
|
30 |
## Training Data
|
31 |
-
The `
|
32 |
|
33 |
## Evaluation Results
|
34 |
Coming soon...
|
@@ -37,7 +37,7 @@ Coming soon...
|
|
37 |
```python
|
38 |
from transformers import AutoModelForCausalLM, AutoTokenizer
|
39 |
|
40 |
-
model_name = "
|
41 |
tokenizer = AutoTokenizer.from_pretrained(model_name)
|
42 |
model = AutoModelForCausalLM.from_pretrained(model_name)
|
43 |
|
@@ -57,7 +57,7 @@ The diversity of the dataset could lead to inconsistencies in the model's respon
|
|
57 |
This model is released under the Apache-2.0 license.
|
58 |
## Citation Information
|
59 |
|
60 |
-
If you use
|
61 |
|
62 |
```
|
63 |
@misc{sebastian_gabarain_2024,
|
|
|
5 |
- en
|
6 |
---
|
7 |
|
8 |
+
# Model Card for Locutusque/hyperion-small-preview
|
9 |
|
10 |

|
11 |
|
12 |
## Model Details
|
13 |
+
**Model Name**: Locutusque/hyperion-small-preview
|
14 |
**Base Model**: Qwen/Qwen1.5-1.8B
|
15 |
**Publisher**: M4-ai
|
16 |
**Model Type**: Question answering, conversational AI, code generation, medical text comprehension, mathematical reasoning, logical reasoning.
|
|
|
18 |
**License**: Apache-2.0
|
19 |
|
20 |
## Model Description
|
21 |
+
`Locutusque/hyperion-small-preview` is a state-of-the-art language model fine-tuned on the Hyperion dataset for advanced reasoning across scientific domains. This model is designed to handle complex inquiries and instructions, leveraging the diverse and rich information contained in the Hyperion dataset. Its primary use cases include but are not limited to complex question answering, conversational understanding, code generation, medical text comprehension, mathematical reasoning, and logical reasoning.
|
22 |
|
23 |
## Intended Use
|
24 |
This model is intended for researchers and practitioners looking for a powerful tool to tackle challenging problems in scientific domains. It can be used in the following scenarios:
|
|
|
28 |
- Automation in code generation and understanding complex programming context.
|
29 |
|
30 |
## Training Data
|
31 |
+
The `Locutusque/hyperion-small-preview` model was fine-tuned on the Hyperion dataset, which amalgamates various datasets rich in diversity and complexity, including programming, medical texts, mathematical problems, and reasoning tasks.
|
32 |
|
33 |
## Evaluation Results
|
34 |
Coming soon...
|
|
|
37 |
```python
|
38 |
from transformers import AutoModelForCausalLM, AutoTokenizer
|
39 |
|
40 |
+
model_name = "Locutusque/hyperion-small-preview"
|
41 |
tokenizer = AutoTokenizer.from_pretrained(model_name)
|
42 |
model = AutoModelForCausalLM.from_pretrained(model_name)
|
43 |
|
|
|
57 |
This model is released under the Apache-2.0 license.
|
58 |
## Citation Information
|
59 |
|
60 |
+
If you use Locutusque/hyperion-small-preview in your research, please cite the Hyperion dataset as follows:
|
61 |
|
62 |
```
|
63 |
@misc{sebastian_gabarain_2024,
|