Polish speech corpus

The corpus consists of about 55 hours of annotated recordings, plus some recordings without annotations of time. The corpora includes speakers, whose recording lasts over an hour. The rest of the speakers recordings lasts for 3-20 minutes. The corpus includes majority of male voices, but there are also female ones. In total about 600 speakers. The recordings were done in various conditions and on various hardware, but all of them in 16-bit and 16 [kHz] standard.

Authors invite people interested in audio processing technologies to contact spin-off company techmo.pl

Copyright © Zespół Przetwarzania Sygnałów AGH 2011-2014