This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.
The paper introduces the ORTOFON corpus of spontaneous spoken Czech and the DIALEKT corpus of Czech dialects, describing their design principles and the practical solutions adopted during data collection.