license: apache-2.0
language:
  - code
  - en
datasets:
  - saridormi/commit-chronicle
tags:
  - code
  - commit_message_generation
pipeline_tag: text2text-generation
CMG/CMC: CodeT5 (without history)
This is the checkpoint for CodeT5 model, fine-tuned for the commit message generation (and/or completion) task as part of the paper "From Commit Message Generation to History-Aware Commit Message Completion", ASE 2023.
Details
🔍 For further details, please refer to:
- Paper: TODO
- Repository: https://github.com/JetBrains-Research/commit_message_generation
- This model is based on - Salesforce/codet5-basecheckpoint from 📜 CodeT5: Identifier-aware Unified Pre-trained Encoder-Decoder Models for Code Understanding and Generation.
- This model was trained with commit diffs, WITHOUT commit message history. 
- This model was trained on the CommitChronicle dataset introduced in our study. 
- Our hyperparameter setting is mostly based on 📜 RACE: Retrieval-augmented Commit Message Generation. The exact values are provided below: 
Available checkpoints
We also released checkpoints for other models fine-tuned as part of our study.
- Models trained with commit message history:- CodeT5: 🤗 JetBrains-Research/cmg-codet5-with-history
- CodeReviewer: 🤗 JetBrains-Research/cmg-codereviewer-with-history
- RACE: 🤗 JetBrains-Research/cmg-race-with-history
 
- CodeT5: 🤗 
- Models trained without commit message history:- CodeT5: 🤗 JetBrains-Research/cmg-codet5-without-history(this model)
- CodeReviewer: 🤗 JetBrains-Research/cmg-codereviewer-without-history
- RACE: 🤗 JetBrains-Research/cmg-race-without-history
 
- CodeT5: 🤗 
Citation
TODO

