Datasets
Common Voice Spontaneous Speech 2.0 - Catalan
License: CC0-1.0
Locale: ca
Task: ASR
Format: MP3
Size: 11.78 MB
Common Voice Spontaneous Speech 2.0 - Bukusu
License: CC0-1.0
Locale: bxk
Task: ASR
Format: MP3
Size: 258.53 MB
Common Voice Spontaneous Speech 2.0 - Sabah Bisaya
License: CC0-1.0
Locale: bsy
Task: ASR
Format: MP3
Size: 219.99 MB
Common Voice Spontaneous Speech 2.0 - Bodo
License: CC0-1.0
Locale: brx
Task: ASR
Format: MP3
Size: 1.29 MB
Common Voice Spontaneous Speech 2.0 - Breton
License: CC0-1.0
Locale: br
Task: ASR
Format: MP3
Size: 13.57 MB
Common Voice Spontaneous Speech 2.0 - Betawi
License: CC0-1.0
Locale: bew
Task: ASR
Format: MP3
Size: 213.73 MB
Common Voice Spontaneous Speech 2.0 - Basaa
License: CC0-1.0
Locale: bas
Task: ASR
Format: MP3
Size: 109.37 MB
Common Voice Spontaneous Speech 2.0 - Bashkir
License: CC0-1.0
Locale: ba
Task: ASR
Format: MP3
Size: 5.08 MB
Common Voice Spontaneous Speech 2.0 - Aragonese
License: CC0-1.0
Locale: an
Task: ASR
Format: MP3
Size: 2.24 MB
Common Voice Spontaneous Speech 2.0 - Gheg Albanian
License: CC0-1.0
Locale: aln
Task: ASR
Format: MP3
Size: 200.85 MB
Common Voice Spontaneous Speech 2.0 - Adyghe
License: CC0-1.0
Locale: ady
Task: ASR
Format: MP3
Size: 107.44 MB
Common Voice Spontaneous Speech 2.0 - Arvanitika
License: CC0-1.0
Locale: aat
Task: ASR
Format: MP3
Size: 46.68 MB
Common Voice Scripted Speech 24.0 - Teutila Cuicatec
License: CC0-1.0
Locale: cut
Task: ASR
Format: MP3
Size: 209.52 MB
Common Voice Scripted Speech 24.0 - Norwegian Nynorsk
License: CC0-1.0
Locale: nn-NO
Task: ASR
Format: MP3
Size: 33.55 MB
rm-vallader test
License: BSD-3-Clause
Locale: rm-vallader
Task: NLP
Format: MP3
Size: 2.63 MB
checksum dataset
License: Apache-2.0
Locale: en-US
Task: N/A
Format: Not specified
Size: 914.69 KB
dawdad
License: Apache-2.0
Locale: awdad
Task: NLP
Format: awdawd
Size: 34.00 MB
Common Voice AZ DF
License: CC0-1.0
Locale: az
Task: ASR
Format: MP3
Size: 3.41 MB
test
License: Apache-2.0
Locale: en-US
Task: N/A
Format: WAV
Size: 2.63 MB
Dataset for API & Python SDK Tests [Do not remove] - Mock Spontaneous Speech English
License: CC-BY-4.0
Locale: en-US
Task: NLP
Format: CSV
Size: 119.84 KB
test with better name
License: Apache-2.0
Locale: en-US
Task: NLP
Format: Not specified
Size: 7.37 MB
Community Dataset
License: CC-BY-SA-4.0
Locale: en-US
Task: RAG
Format: MP3
Size: 2.76 MB
Community Dataset
License: BSD-3-Clause
Locale: en-US
Task: MT
Format: MP3
Size: 2.76 MB
Otro dataset bonito
License: CC-BY-ND-4.0
Locale: es-MX
Task: NLP
Format: WAV
Size: 180.78 MB