saattrupdan/wav2vec2-xls-r-300m-ftspeech

automatic speech recognitiontransformersdatransformerspytorchsafetensorswav2vec2automatic-speech-recognitiondaother
192.3K

XLS-R-300m-FTSpeech

Model description

This model is a fine-tuned version of facebook/wav2vec2-xls-r-300m on the FTSpeech dataset, being a dataset of 1,800 hours of transcribed speeches from the Danish parliament.

## Performance

The model achieves the following WER scores (lower is better):

DatasetWER without LMWER with 5-gram LM
Danish part of Common Voice 8.020.4817.91
Alvenir test set15.4613.84

License

The use of this model needs to adhere to this license from the Danish Parliament.

DEPLOY IN 60 SECONDS

Run wav2vec2-xls-r-300m-ftspeech on Runcrate

Deploy on H100, A100, or RTX GPUs. Pay only for what you use. No setup required.