Open Speech and Language Resources (OpenSLR) | User-friendly Deep Learning: Datasets

Speech datasets obtained from the OpenSRL project:

Javanese
- Source
- Original: jv_id_female.zip (48kHz, 967MB), jv_id_male.zip (48kHz, 923MB), convert.sh
- Festvox (48kHz, 1.9GB)
- Coqui STT (16kHz, 655MB)

Notes on the archives:

Festvox
- annotations.txt - the annotations
Coqui STT
- samples.csv - the annotations

Conversion into other formats can be achieved with the wai.annotations library.

License