Datasets

Mozilla Foundation

test 3.0

testing stuff

License: CC-BY-NC-SA-4.0

Locale: en

Task: MT

Format: Not specified

Size: 914.69 KB

Common Voice

file upload edit2

test

License: Apache-2.0

Locale: en-CA

Task: TTS

Format: MP3

Size: 72.21 MB

Mozilla Foundation

Test 2.0

Test 2

License: CC-BY-4.0

Locale: nhi

Task: NLP

Format: TXT

Size: 914.69 KB

MoFo-BetaBugBash

JohannBetaBugBashDataset

My Beta Bug Bash Dataset

License: CC-0

Locale: en-US

Task: CALL

Format: MP3

Size: 7.57 MB

MDC

Antarctic Penguin Observation

A comprehensive collection of field observations of three Antarctic penguin species (Emperor, Adélie, Gentoo) gathered between 2015 and 2023.

License: BSD Zero Clause License

Locale: en-US

Task: ASR

Format: MP3

Size: 34.00 MB

Elotl

Otro bonito dataset

This dataset is for testing that I can upload datasets to MDC.

License: CC-BY-4.0

Locale: es-MX

Task: ASR

Format: MP3

Size: 3.15 MB

Elotl

My bonito dataset

This is a very fitting description for my dataset. Love you lots, Elotl

License: CC-BY-4.0

Locale: en-US

Task: NLP

Format: wav

Size: 3.15 MB

Common Voice

kostis-test-28oct

License: cc

Locale: en-US

Task: N/A

Format: Not specified

Size: 12.06 MB

Mozilla Foundation

ReRooted 1.0

A speech corpus of Syrian Armenian refugee testimonials

License: GPL-3.0

Locale: en-US

Task: OTH

Format: WAV, TSV

Size: 914.69 KB

Common Voice

md test

testing markdown

License: cc-0

Locale: en-US

Task: NLU

Format: mp3

Size: 2.76 MB

Common Voice

Example Dataset Upload - 2025 10 23

Example Dataset Upload - 2025 10 23

License: cc-0

Locale: en-US

Task: NLP

Format: mp3

Size: 72.21 MB

Community

Test dataset - random

This is a test dataset that I will search for on my computer.

License: CC-BY-4.0

Locale: nhi

Task: NLP

Format: wav,conllu

Size: 330.70 KB

Common Voice

newest test

test

License: CC0-1.0

Locale: en-US

Task: CV

Format: tar.gz

Size: 72.21 MB

Common Voice

Common Voice Spontaneous Speech 1.0 - Kuku

A collection of spontaneous spoken phrases in Kuku.

License: CC0-1.0

Locale: ukv

Task: ASR

Format: MP3

Size: 237.60 MB

Common Voice

Common Voice Spontaneous Speech 1.0 - Amba (Uganda)

A collection of spontaneous spoken phrases in Amba (Uganda).

License: CC0-1.0

Locale: rwm

Task: ASR

Format: MP3

Size: 265.80 MB

Common Voice

Common Voice Spontaneous Speech 1.0 - Sena

A collection of spontaneous spoken phrases in Sena.

License: CC0-1.0

Locale: seh

Task: ASR

Format: MP3

Size: 4.40 MB

Common Voice

Common Voice Spontaneous Speech 1.0 - Sabah Malay

A collection of spontaneous spoken phrases in Sabah Malay.

License: CC0-1.0

Locale: msi

Task: ASR

Format: MP3

Size: 277.20 MB

Common Voice

Common Voice Spontaneous Speech 1.0 - Western Penan

A collection of spontaneous spoken phrases in Western Penan.

License: CC0-1.0

Locale: pne

Task: ASR

Format: MP3

Size: 247.40 MB

Common Voice

Common Voice Spontaneous Speech 1.0 - Toba

A collection of spontaneous spoken phrases in Toba.

License: CC0-1.0

Locale: tob

Task: ASR

Format: MP3

Size: 172.50 MB

Common Voice

Common Voice Spontaneous Speech 1.0 - Scots

A collection of spontaneous spoken phrases in Scots.

License: CC0-1.0

Locale: sco

Task: ASR

Format: MP3

Size: 228 MB

Common Voice

Common Voice Spontaneous Speech 1.0 - Ruuli

A collection of spontaneous spoken phrases in Ruuli.

License: CC0-1.0

Locale: ruc

Task: ASR

Format: MP3

Size: 365.20 MB

Common Voice

Common Voice Spontaneous Speech 1.0 - Central Melanau

A collection of spontaneous spoken phrases in Central Melanau.

License: CC0-1.0

Locale: mel

Task: ASR

Format: MP3

Size: 208.60 MB

Common Voice

Common Voice Spontaneous Speech 1.0 - Tooro

A collection of spontaneous spoken phrases in Tooro.

License: CC0-1.0

Locale: ttj

Task: ASR

Format: MP3

Size: 272.80 MB

Common Voice

Common Voice Spontaneous Speech 1.0 - Bukar-Sadung Bidayuh

A collection of spontaneous spoken phrases in Bukar-Sadung Bidayuh.

License: CC0-1.0

Locale: sdo

Task: ASR

Format: MP3

Size: 200.80 MB