machine-learningnlpspeech-recognitionspeech-to-textwindows-error-reporting

WER for wav2vec2-base model remains as 1 throughout the whole training process


I am trying to run the wav2vec2 speech recognition model as shared in https://huggingface.co/docs/transformers/tasks/asr

This is the loss and WER during the training process, whereby the validation loss is reducing significantly, whereas the WER remains as 1.

I tried to print out the predicted and label values and this is what I got for the last 3 outputs, which results in the WER = 1.

This is the set of parameters of the model. model param.

What may actually go wrong here? Please help.. Thanks!

I have tried tuning the hyperparameters and hoping to reduce the WER.


Solution

  • Thank you for providing some useful information for troubleshooting.