Identification of Spontaneous Spoken Texts in Slovak

We propose a text classification method for the purpose of creating a language model for automatic recognition of spontaneous spoken speech. Transcripts from our departmental speech database served as spontaneous spoken texts. Using supervised machine learning methods, we have created multiple classification models (including neural networks), that were able to distinguish them from written texts with high accuracy. We subsequently verified the accuracy of our trained models on a database of texts containing direct speech extracted from newspaper articles.

Langue:: Anglais

Périodicité:: 2 fois par an
Sujets de la revue:: Linguistique et sémiotique, Cadres théoriques et disciplines, Linguistique, autres

RSS Feed de la revue

Identification of Spontaneous Spoken Texts in Slovak

Róbert Sabo

Peter Krammer

Ján Mojžiš

Marcel Kvassay

Publié en ligne: 21 déc. 2019

Pages: 481 - 490

DOI: https://doi.org/10.2478/jazcas-2019-0076

Mots clésspontaneous speech, text classification, supervised machine learning, neural networks, Slovak language

© 2019 Róbert Sabo et al., published by Sciendo

This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.

Mots clés
spontaneous speech, text classification, supervised machine learning, neural networks, Slovak language