Speech datasets obtained from the OpenSLR project:
- Javanese
  - Original: jv_id_female.zip (48kHz, 967MB), jv_id_male.zip (48kHz, 923MB), convert.sh
  - Festvox (48kHz, 1.9GB)
  - Coqui STT (16kHz, 655MB)
Notes on the archives:
- Festvox
  - annotations.txt: the annotations
- Coqui STT
  - samples.csv: the annotations
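As a quick way to inspect the Coqui STT annotations, the following is a minimal sketch. It assumes that samples.csv follows the usual Coqui STT CSV layout with the columns wav_filename, wav_filesize and transcript; this layout is not confirmed by the archive notes above. The helper name read_coqui_samples and the sample row are hypothetical and only serve as illustration.

```python
import csv
import io


def read_coqui_samples(csv_file):
    """Yield (wav_filename, transcript) pairs from a Coqui STT style CSV.

    Assumption: the file has a header row containing at least the
    columns 'wav_filename' and 'transcript'.
    """
    reader = csv.DictReader(csv_file)
    for row in reader:
        yield row["wav_filename"], row["transcript"]


# Illustrative in-memory data standing in for samples.csv (not real dataset content).
example = io.StringIO(
    "wav_filename,wav_filesize,transcript\n"
    "clip_0001.wav,123456,sugeng enjing\n"
)
pairs = list(read_coqui_samples(example))
```

For the extracted archive, the same helper could be applied to a file opened with open("samples.csv", newline="", encoding="utf-8").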
Conversion into other formats can be achieved with the wai.annotations library.
License