Preview

Speech Synthesis in the Kazakh Language and its Phonetic Features

https://doi.org/10.55491/2411-6076-2025-2-94-101

Abstract

The article examines the synthesis of speech in the Kazakh language from the phonetic, phonological and prosodic points of view. Word synthesis is the process of converting a written text into an oral form. It is aimed not only at voicing the text, but also at providing a natural and understandable sound of speech. To create a realistic and legible synthesizer, the text is systematized in accordance with orthoepical norms, and phonemic changes, vowel reduction, and sound phenomena are described based on linguistic data. Phonetic and phonological patterns play an important role in improving speech synthesis technology. The article also examines the relevance and methods of speech synthesis, identifies their features and differences. In addition, an analysis of modern speech synthesis programs has been carried out, their advantages and disadvantages have been identified. The article analyzes articulatory, formative, parametric and neural models used for speech synthesis. To improve the quality of speech synthesis in the Kazakh language, it is shown that it is necessary to decipher abbreviated words and numbers, observe orthoepical norms, and correctly model intonation and rhythmic groups. The results of the study can serve as a scientific basis for creating a qualitative synthesis of words based on the phonetic and phonological patterns of the Kazakh language. The creation of high-quality word synthesis will make it possible to develop speech interfaces in the Kazakh language.

About the Authors

Zh. Zhumabayeva
A. Baitursynuly Institute of Linguistics

Zhanar Zhumabayeva, Candidate of Philological Sciences

Almaty



G. Beissenbekova
Kazakh National Women's Teacher Training University
Kazakhstan

Gulnaz Beissenbekova, Gandidate of Pedagogical Sciences, Associate Professor

Almaty



References

1. Amanbaeva, A.Zh., Zhumabaeva, Zh.T., Bazarbaeva, Z.M., Fazylzhanova, A.M., Ospangazieva, N.B. (2025) Qazaq tilining ulttyq korpusy: pojetikalyq diskurstyng prosodikalyq erekshelikteri. Forum for Linguistic Studies | 2025, 7(1), В. 505519. https://doi.org/10.30564/fls.v7i1.7428 [Amanbayeva, A.Zh., Zhumabayeva, Zh.T., Bazarbayeva, Z.M., Fazylzhanova, A.M., Ospangaziyeva, N.B. (2025) National Corpus of the Kazakh Language: Prosodic Features of the Poetic Discourse. Forum for Linguistic Studies | 2025, 7(1), Р. 505-519. https://doi.org/10.30564/fls.v7i1.7428] (in English)

2. Bazarbaeva, Z.M. (2022) Intonologiya. Almaty: Jeverest, 440 b. [Bazarbayeva, Z.M. (2022) Intonology. Almaty: Everest, 440 p.] (in Kazakh)

3. Bazarbaeva, Z.M. (2022) Qazaq fonologijasynyng negіzderi. Almaty: Jeverest, 460 b. [Bazarbayeva, Z.M. (2022) Fundamentals of Kazakh Phonology. Almaty: Everest, 460 p.] (in Kazakh)

4. Bogdanova, N.V. (2001) Zhivye foneticheskie processy russkoj rechi. Sankt-Peterburg: Izd. filol. fakul. Sankt. Pet. univ., 186 b. [Bogdanova, N.V. (2001) Active Phonetic Processes in Russian Speech. Saint Petersburg: Ed. philol. faculty. St. Pet. univ., 186 p.] (in Russian)

5. Derkach, M.F., Gumeckij, R.Ya., Gura, B.M., Chaban, M.Ye. (1993) Dinamichnye spektry rechevyh signalоv. L`vov: Izd. pri L`vov univ., 168 s. [Derkach M.F., Gumetsky R.Ya., Gura B.M., Chaban M.E. (1993) Dynamic Spectra of Speech Signals. Lviv: Ed. at Lviv. university, 168 p.] (in Russian)

6. Fant, G. (1970) Analiz i sintez rechi. Novosibirsk: Nauka, 166 s. [Fant G. (1970) Analysis and Synthesis of Speech. Novosibirsk: Nauka, 166 p.] (in Russian)

7. Fazylzhan, A. (2022) Soz sazy zhane intonacija (jeksperimentti-fonetikalyq zertteu). Almaty: “Almaty Bolashaq” AQ baspasy, 208 b. [Fazylzhan, A. (2022) Melody of speech and intonation (an experimental phonetic study). Almaty: Publishing house of JSC “Almaty-Bolashaq”, 208 p.] (in Kazakh)

8. Kodzassov, S.V., Krivnova, O.F. (2001) Obshhchaja fonetika. Moskva: Ros. gosud. guman. univ., 591 s. [Kodzassov, S.V., Krivnova, O.F. (2001) General Phonetics. Moscow: Russian State Humanit. univ., 591 p.] (in Russian)

9. Kuderinova, Q. (2013) Qazaq jazuynyng tarihy men teorijasy. Almaty: Eltanym, 242 b. [Kuderinova, Q. (2013) The history and theory of kazakh writing. Almaty: Yeltanym, 242 p.] (in Kazakh)

10. Lobanov, B.M., Cirul`nik, L.I. (2008) Komp`juternyj sintez i klonirovanie rechi. Minsk: Belorusskaja nauka, 316 s. [Lobanov, B.M., Tsirulnik, L.I. (2008) Computer Speech Synthesis and Cloning. Minsk: Belorusskaya nauka, 316 p.] (in Russian) Orfojepijalyq sozdіk (2007) Red. alqasy: O. Aitbai, S. Abdrahmanov, A. Kekilbai t.b. Almaty: Arys, 800 b. [Orthoepic dictionary. Ed. board: O. Aitbai, S. Abdrakhmanov, A. Kekilbay, etc. Almaty: Arys, 2007. 800 p. ISBN 9965174733.] (in Kazakh)

11. Qaliev, G. (2005) Tіl bіlіmі terminderіnіng tusіndіrme sozdіgі. Almaty: Sozdik-Slovar`, 440 b. [Kaliyev, G. (2005) Explanatory Dictionary of Linguistics terms. Almaty: Sozdik-Slovar, 440 p.] (in Kazakh)

12. Uali, N. (2018) Grafika. Orfografija. Orfojepija. Almaty, 250 b. [Uali, N. (2018) Graphics. Spelling. Orthoepy. Almaty, 250 p.] (in Kazakh)

13. Zagoruiko, N.G. (2013) Kognitivnyj analiz dannyh. Novosibirsk: Akadem. izd. Geo, 186 s. [Zagoruyko, N.G. (2013) Cognitive Data Analysis. Novosibirsk: Academic Publ. House Geo, 186 p.] (in Russian)

14. Zhunisbek, A. (2009) Qazaq fonetikasy. Almaty: Arys, 312 b. [Zhunisbek, A. (2009) Kazakh phonetics. Almaty: Arys, 312 p.] (in Kazakh)

15. Zhunіsbek, A. (2018) Qazaq til bіlіmіnіng maselelerі. Almaty: Abzal-ai, 368 b. [Zhunisbek, A. (2018) Issues of Kazakh Linguistics. Almaty: Abzal-ai, 368 p.] (in Kazakh)

16.


Review

For citations:


Zhumabayeva Zh., Beissenbekova G. Speech Synthesis in the Kazakh Language and its Phonetic Features. Tiltanym. 2025;(2):94-101. (In Kazakh) https://doi.org/10.55491/2411-6076-2025-2-94-101

Views: 9


Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.


ISSN 2411-6076 (Print)
ISSN 2709-135X (Online)