license: apache-2.0 | |
language: | |
- de | |
- fr | |
- en | |
- ro | |
base_model: | |
- google/flan-t5-xxl | |
Original Model: https://huggingface.co/google/flan-t5-xxl/ | |
Original Readme: https://huggingface.co/google/flan-t5-xxl/blob/main/README.md | |
Disclaimer: I don't claim any rights on this modell. All rights go to google. | |
How to use: | |
./llama-cli -m /path/to/file.gguf --prompt "your prompt" --n-gpu-layers nn | |
nn --> numbers of layers to offload to gpu | |