Sample generated audio
The Orpheus Urdu TTS model is a fine-tuned version of the Orpheus 3B text-to-speech model, adapted specifically for the Urdu language. It is experimental and not recommended for production use. It was trained on the mahwizzzz/UAT dataset, using the 20.4k audio samples in its train split. Fine-tuning ran for 10 epochs on a single RTX 4090.
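To give a rough sense of the training budget implied by 10 epochs over about 20.4k samples, the sketch below counts optimizer updates. The batch size and gradient accumulation values are hypothetical and are not stated anywhere in this card; only the sample count and epoch count come from the text above.

```python
import math

NUM_SAMPLES = 20_400  # approximate, from the "20.4k" figure in this card
NUM_EPOCHS = 10       # stated in this card


def optimizer_steps(num_samples, num_epochs, batch_size, grad_accum=1):
    """Count optimizer updates, keeping the final partial batch of each epoch."""
    effective_batch = batch_size * grad_accum
    steps_per_epoch = math.ceil(num_samples / effective_batch)
    return steps_per_epoch * num_epochs


if __name__ == "__main__":
    # Hypothetical settings (assumption): batch size 4 with 4 accumulation steps.
    print(optimizer_steps(NUM_SAMPLES, NUM_EPOCHS, batch_size=4, grad_accum=4))
```

Changing `batch_size` or `grad_accum` shows how the number of updates scales when memory on a single GPU limits the per-step batch.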
This model generates Urdu speech from text. It is suited to experimenting with TTS systems for Urdu, particularly for audiobooks, conversational AI, and other speech synthesis tasks.
mahwizzzz/UAT (20.4k audiobook audio samples)