User-friendly Deep Learning: Datasets

    Speech datasets obtained from the OpenSLR project:

    • Javanese

      • Source

      • Original: jv_id_female.zip (48kHz, 967MB), jv_id_male.zip (48kHz, 923MB), convert.sh

      • Festvox (48kHz, 1.9GB)

      • Coqui STT (16kHz, 655MB)

    Notes on the archives:

    • Festvox

      • annotations.txt - the annotations

    • Coqui STT

      • samples.csv - the annotations
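The annotation files above could be read along the following lines. This is a minimal sketch, not part of the dataset or of wai.annotations. It assumes that annotations.txt uses the usual Festvox layout of one `( utterance_id "transcript" )` entry per line, and that samples.csv has the standard Coqui STT header `wav_filename,wav_filesize,transcript`. The function names and the sample rows are hypothetical placeholders, not taken from the archives.

```python
import csv
import io
import re

# Assumed Festvox line layout: ( utterance_id "transcript text" )
FESTVOX_LINE = re.compile(r'^\(\s*(\S+)\s+"(.*)"\s*\)\s*$')


def parse_festvox(lines):
    """Return (utterance_id, transcript) pairs from Festvox-style lines."""
    pairs = []
    for line in lines:
        match = FESTVOX_LINE.match(line.strip())
        if match:
            # Embedded double quotes are assumed to be backslash-escaped.
            pairs.append((match.group(1), match.group(2).replace('\\"', '"')))
    return pairs


def parse_coqui_stt(handle):
    """Return (wav_filename, transcript) pairs from a Coqui STT style CSV."""
    reader = csv.DictReader(handle)
    return [(row["wav_filename"], row["transcript"]) for row in reader]


if __name__ == "__main__":
    # Placeholder rows for illustration only; real files come from the archives.
    festvox_sample = ['( utt_0001 "placeholder transcript" )']
    coqui_sample = io.StringIO(
        "wav_filename,wav_filesize,transcript\n"
        "utt_0001.wav,12345,placeholder transcript\n"
    )
    print(parse_festvox(festvox_sample))
    print(parse_coqui_stt(coqui_sample))
```

For the real files, pass an open file handle instead of the in-memory samples, for example `open("samples.csv", newline="", encoding="utf-8")`. If the actual column names or line layout differ from these assumptions, adjust the regular expression and the dictionary keys accordingly.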

    Conversion into other formats can be achieved with the wai.annotations library.

    License

    CC BY-SA 4.0

    Contents © 2023 University of Waikato - Powered by Nikola