Mardiyyah commited on
Commit
0d38692
·
verified ·
1 Parent(s): d04e397

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +13 -5
README.md CHANGED
@@ -16,6 +16,7 @@ model-index:
16
  datasets:
17
  - asr-nigerian-pidgin/nigerian-pidgin-1.0
18
  pipeline_tag: automatic-speech-recognition
 
19
  ---
20
 
21
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -23,18 +24,25 @@ should probably proofread and complete it, then remove this comment. -->
23
 
24
  # pidgin-wav2vec2-xlsr53
25
 
26
- This model is a fine-tuned version of [facebook/wav2vec2-large-xlsr-53](https://huggingface.co/facebook/wav2vec2-large-xlsr-53) on the [Nigerian Pidgin](https://huggingface.co/datasets/asr-nigerian-pidgin/nigerian-pidgin-1.0) dataset.
 
 
27
  It achieves the following results on the evaluation set:
28
  - Loss: 0.6907
29
  - Wer: 0.3161 (val)
30
 
31
- ## Model description
32
-
33
- *to be updated*
34
 
35
  ## Intended uses & limitations
36
 
37
- *to be updated*
 
 
 
 
 
 
 
 
38
 
39
  ## Training and evaluation data
40
 
 
16
  datasets:
17
  - asr-nigerian-pidgin/nigerian-pidgin-1.0
18
  pipeline_tag: automatic-speech-recognition
19
+ library_name: transformers
20
  ---
21
 
22
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
24
 
25
  # pidgin-wav2vec2-xlsr53
26
 
27
+ This model is a fine-tuned version of [facebook/wav2vec2-large-xlsr-53](https://huggingface.co/facebook/wav2vec2-large-xlsr-53), adapted for transcribing Nigerian Pidgin English. Building on the self-supervised, cross-lingual representations of XLSR-53, it has been trained using the [Nigerian Pidgin dataset](https://huggingface.co/datasets/asr-nigerian-pidgin/nigerian-pidgin-1.0) to handle the phonetic and lexical nuances unique to Nigerian Pidgin, offering significant improvements over zero-shot ASR baselines
28
+
29
+
30
  It achieves the following results on the evaluation set:
31
  - Loss: 0.6907
32
  - Wer: 0.3161 (val)
33
 
 
 
 
34
 
35
  ## Intended uses & limitations
36
 
37
+ **Intended Use**: Best suited for automatic speech recognition (ASR) tasks on Nigerian Pidgin audio, such as speech-to-text conversion and related downstream tasks. Best performance is achieved in a clean recording environments with limited background noise.
38
+
39
+ **Limitations/Caveats**:
40
+
41
+ - Trained exclusively on speech from limited demographic groups; may underperform on dialects or accents outside the training set.
42
+
43
+ - Struggles with numeric phrases and unusual phonetic variants, as noted in qualitative evaluations [see here]
44
+ - Struggles with noisy environment and fast-paced speech
45
+ - Not suited for critically high-accuracy domains (e.g., legal, medical domain) without further tuning.
46
 
47
  ## Training and evaluation data
48