saattrupdan/wav2vec2-xls-r-300m-ftspeech

Author: saattrupdan

Tags: automatic-speech-recognition, transformers, pytorch, safetensors, wav2vec2, da, other


# XLS-R-300m-FTSpeech

This model is a fine-tuned version of facebook/wav2vec2-xls-r-300m on the FTSpeech dataset, which contains 1,800 hours of transcribed speeches from the Danish parliament.
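As a rough illustration of how a fine-tuned wav2vec2 checkpoint like this one is typically used, the sketch below loads it through the `transformers` automatic-speech-recognition pipeline. The helper name `transcribe` and the file name `speech_da.wav` are hypothetical, and the sketch assumes that `transformers`, `torch`, and an audio decoder such as ffmpeg are installed. It does not assume anything about whether the 5-gram language model is bundled with the checkpoint.

```python
# Model identifier taken from this model card.
MODEL_ID = "saattrupdan/wav2vec2-xls-r-300m-ftspeech"


def transcribe(audio_path: str, model_id: str = MODEL_ID) -> str:
    """Transcribe a single audio file and return the recognised text.

    Hypothetical helper for illustration; not part of the model repository.
    """
    # Imported here so that merely defining the helper does not require
    # transformers to be installed.
    from transformers import pipeline

    # Builds the ASR pipeline; the weights are downloaded on first use.
    asr = pipeline("automatic-speech-recognition", model=model_id)

    # For a file path, the pipeline decodes and resamples the audio itself.
    result = asr(audio_path)
    return result["text"]


# Example call (assumes a local Danish recording with this hypothetical name):
# text = transcribe("speech_da.wav")
```

Building the pipeline inside the function keeps the sketch short; for repeated calls it would be cheaper to build it once and reuse it.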

## Performance

The model achieves the following WER scores (lower is better):

| Dataset | WER without LM | WER with 5-gram LM |
| --- | --- | --- |
| Danish part of Common Voice 8.0 | 20.48 | 17.91 |
| Alvenir test set | 15.46 | 13.84 |
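For readers unfamiliar with the metric, the following is a minimal sketch of the standard word error rate definition: the word-level edit distance (substitutions, deletions, and insertions) divided by the number of reference words. The function name `word_error_rate` is hypothetical, and the card does not state which tool or text normalisation produced the scores above, so this is not necessarily the exact procedure used. The reported figures appear to be this ratio expressed as a percentage.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return the word-level edit distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # aligning i reference words with nothing costs i deletions
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1      # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr

    return prev[-1] / len(ref)


# One of five reference words is missing from the hypothesis.
print(word_error_rate("det er en god dag", "det er en dag"))  # 0.2
```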

Use of this model must adhere to the license from the Danish Parliament.
