Using of Open-Source Technologies for the Design and Development of a Speech Processing System Based on Stemming Methods

Andrey Tarasiev; Margarita Filippova; Konstantin Aksyonov; Olga Aksyonova; Anna Antonova

doi:10.1007/978-3-030-47240-5_10

Conference Papers Year : 2020

Using of Open-Source Technologies for the Design and Development of a Speech Processing System Based on Stemming Methods

(1) , (1) , (1) , (1) , (1)

Andrey Tarasiev

Function : Author

Ural Federal University [Ekaterinburg]

Margarita Filippova

Function : Author

Ural Federal University [Ekaterinburg]

Konstantin Aksyonov

Function : Author

Ural Federal University [Ekaterinburg]

Olga Aksyonova

Function : Author

Ural Federal University [Ekaterinburg]

Anna Antonova

Function : Author
PersonId : 1132718

Ural Federal University [Ekaterinburg]

Abstract

This article discusses the idea of developing an intelligent and customizable automated system for real-time text and voice dialogs with the user. This system can be used for almost any subject area, for example, to create an automated robot - a call center operator or smart chat bots, assistants, and so on. This article presents the developed flexible architecture of the proposed system. The system has many independent submodules. These modules work as interacting microservices and use several speech recognition schemes, including a decision support submodule, third-party speech recognition systems and a post-processing subsystem. In this paper, the post-processing module of the recognized text is presented in detail on the example of Russian and English dictionary models. The proposed submodule also uses several processing steps, including the use of various stemming methods, the use of word stop-lists or other lexical structures, the use of stochastic keyword ranking using a weight table, etc.

Keywords

Multi-agent Design Development System Decision-making Real-time Twin Stemming Postprocessing Open-source

Domains

Computer Science [cs]

Fichier principal

496591_1_En_10_Chapter.pdf (513.67 Ko)

Origin	Files produced by the author(s)

Hal Ifip : Connect in order to contact the contributor

https://inria.hal.science/hal-03647270

Submitted on : Wednesday, April 20, 2022-1:41:51 PM

Last modification on : Wednesday, April 20, 2022-3:40:44 PM

Long-term archiving on : Thursday, July 21, 2022-7:24:26 PM

Dates and versions

hal-03647270 , version 1 (20-04-2022)

Licence

Attribution

Identifiers

HAL Id : hal-03647270 , version 1
DOI : 10.1007/978-3-030-47240-5_10

Cite

Andrey Tarasiev, Margarita Filippova, Konstantin Aksyonov, Olga Aksyonova, Anna Antonova. Using of Open-Source Technologies for the Design and Development of a Speech Processing System Based on Stemming Methods. 16th IFIP International Conference on Open Source Systems (OSS), May 2020, Innopolis, Russia. pp.98-105, ⟨10.1007/978-3-030-47240-5_10⟩. ⟨hal-03647270⟩

Export

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

IFIP IFIP-AICT IFIP-TC IFIP-WG IFIP-OSS IFIP-TC2 IFIP-WG2-13 IFIP-AICT-582

29 View

103 Download

Using of Open-Source Technologies for the Design and Development of a Speech Processing System Based on Stemming Methods

Abstract

Keywords

Domains

Dates and versions

Licence

Identifiers

Cite

Export

Collections

Altmetric

Share