| # Natural Don't Know Response Model | |
| Fine-tunned on [Google's T5](https://ai.googleblog.com/2020/02/exploring-transfer-learning-with-t5.html) using [Quora Question Pairs(QQP)](https://huggingface.co/nlp/viewer/?dataset=quora) dataset for **Don't Know Response Generation** task. | |
| Additional information about this model: | |
| - Paper : https://arxiv.org/pdf/2012.01873.pdf | |
| - Github Repo: https://github.com/kaustubhdhole/natural-dont-know | |
| #### How to use | |
| ```python | |
| from transformers import T5ForConditionalGeneration, T5Tokenizer | |
| model_name = "ashish-shrivastava/dont-know-response" | |
| model = T5ForConditionalGeneration.from_pretrained(model_name) | |
| tokenizer = T5Tokenizer.from_pretrained(model_name) | |
| input = "Where can I find good Italian food ?" | |
| input_ids = tokenizer.encode(input, return_tensors="pt") | |
| outputs = model.generate(input_ids) | |
| decoded_output = tokenizer.decode(outputs[0], skip_special_tokens=True) | |
| print(decoded_output) # I'm not sure where you can get good quality Italian food. | |
| ``` | |
| #### Hyperparameters | |
| ``` | |
| n_epochs = 2 | |
| base_LM_model = "T5-base" | |
| max_seq_len = 256 | |
| learning_rate = 3e-4 | |
| adam_epsilon = 1e-8 | |
| train_batch_size = 6 | |
| ``` | |
| #### BibTeX entry and citation info | |
| ```bibtex | |
| @misc{shrivastava2020saying, | |
| title={Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries}, | |
| author={Ashish Shrivastava and Kaustubh Dhole and Abhinav Bhatt and Sharvani Raghunath}, | |
| year={2020}, | |
| eprint={2012.01873}, | |
| archivePrefix={arXiv}, | |
| primaryClass={cs.CL} | |
| } | |
| ``` | |