THE AUTOMATED VOICING SYSTEM WITH ELEMENTS OF ARTIFICIAL INTELLIGENCE

Authors

A. Dumyn

DOI:

https://doi.org/10.31891/2307-5732-2023-321-3-115-119

Keywords:

Automatic Speech Recognition, emotional recognition, voice, Text-to-Speech, Speech-to-Text, voice analysis

Abstract

Automated voicing of text has long been familiar to users. However, the emotional component is lost when literary texts are voiced automatically or when audio is dubbed from another language. Emotional transformation of the voice that takes into account the speaker's gender, speech features, and similar attributes aims to preserve both the linguistic meaning and the identity of the speaker. This work reviews recent research on automated voicing and on extracting metadata from audio, and proposes a general architecture for an automated voicing system with elements of artificial intelligence, such as a classifier that determines the emotional coloring of speech and models that determine the speaker's gender and speech features. The results will form the basis of further research on a group of classifiers for determining the emotional coloring of speech and the gender, age, and speech features of the speaker. The design and development of a corresponding system based on the proposed architecture are planned.
The proposed system is intended to make foreign-language content considerably easier to consume for users in different countries, regardless of their proficiency in a given language. The same need explains why automated translation and voiceover systems are already widespread. However, the speaker's emotional component and other features are lost during automated voicing of texts or dubbing of audio from other languages, and need to be recovered. Automated voicing of texts and dubbing of audio or video that takes into account the speaker's gender, age, emotional coloring, and other speech features is therefore relevant. Such a system will simplify adapting audio and video content for users in a given country and help make a large share of interesting content available to them.
In education, the system can serve as an auxiliary tool for viewing lectures, or parts of lectures, by foreign lecturers, significantly expanding students' access to educational materials.

Published

2023-06-29

How to Cite

DUMYN, A. (2023). THE AUTOMATED VOICING SYSTEM WITH ELEMENTS OF ARTIFICIAL INTELLIGENCE. Herald of Khmelnytskyi National University. Technical Sciences, 321(3), 115-119. https://doi.org/10.31891/2307-5732-2023-321-3-115-119