Preview

Tiltanym

Advanced search

TECHNOLOGY AND PROBLEMS OF AUTOMATIC LINGUOANALYSIS OF THE MORPHOLOGICAL COMPOSITION OF WORDS

https://doi.org/10.55491/2411-6076-2022-4-15-25

Abstract

The article discusses the technology of the morphological analyzer, which implements the program of morphological markup in the National corpus of the Kazakh language. The introduction of morphological markup is the most basic and important linguistic analysis of the National Corpus. It is also considered how language units in the intermediate layer are differentiated in the dictionary of basic words and the dictionary of grammatical forms (word forms) that we are developing. It is difficult for a computer to automatically divide words into roots and affixes and describe the relation of words to parts of speech and grammatical characteristics of affixes. However, in agglutinative languages, such as Kazakh, it is easier to automatically separate words and automatically analyze the composition of words than in inflectional languages. This is due to the fact that in agglutinative languages affixes are added in a certain system. The formal model of word forms is more understandable. The article discusses some difficulties of automatic parsing and analysis of words in the Kazakh language. The article also discusses problematic issues of functional affixes in the modeling of the morphological system of the Kazakh language, as well as issues related to categories and conditionally accepted codes included in the grammatical dictionary.

About the Authors

A. Zhanabekova
A. Baitursynuly Institute of Linguistics
Kazakhstan


Zh. Alpysbay
15-school-gymnasium
Kazakhstan


References

1. Qazaq adebi tіlіnіn sozdіgі. (2001) – Almaty, 2011. [Dictionary of the Kazakh literary language. (2001) - Almaty, 2011.] (in Kazakh)

2. Qazaq teksіnіn statistikasy. (1973) Zhinaq. – Almaty: Gylym, 1973. – 731 b. [Statistics of the Kazakh text. (1973) Сollection. – Almaty: Science, 1973. – 731p.] (in Kazakh)

3. Sirazitdinov Z.A. (2006) Modelirovanie grammatiki bashkirskogo yazyka. – Ufa, 2006. [Sirazitdinov Z. A. (2006) Modeling the Grammar of the Bashkir Language. - Ufa, 2006.] (in Russian)

4. Zhanabekova A. (2001) Soz formalaryn zhasaudagy Kosymshalardyn funkciyalyk erekshelіkterі: filol.gyl.kand. diss. – Almaty, 2001. [Zhanabekova A. (2001) Functional Features of Applications in the Development of Word Forms: philol.science.cand. diss. – Almaty, 2001.] (in Kazakh)

5. Isaev S. (1998) Qazіrgі qazaq tіlіndegі sozderdіn grammatikalyk sipaty. Almaty: Rauan, 1998. – 303 b. [Isaev S. (1998) The Grammatical Nature of Words in the Modern Kazakh Language. Almaty: Rauan, 1998. – 303 p.] (in Kazakh)

6. Qazaq grammatikasy. (2002) – Astana: Astana poligrafiya, 2002. – 784 b. [Kazakh grammar. (2002) – Astana: Astana polygraphy, 2002. - 784 p.] (in Kazakh)

7. Shayakhmetov K. (1973) Ekі funkciyalyk affikster: filol. gyl. kand. diss. – Almaty, 1973. [Shayakhmetov K. (1973) Two-function Affixes: philol. science. cand. diss. - Almaty, 1973.] (in Kazakh)

8. Nasilov V.M. (1958) Affiksy vklyucheniya // Sb. Voprosy yazyka i literatury stran vostoka. – M. 1958. [Nasilov V.M. (1958) Affixes of inclusion // Collection of Questions of the language and literature of the countries of the East. – M. 1958] (in Russian)

9. Ganiev F.A. (1970) O sinteticheskih i analiticheskix padezhah v tatarskom yazyke. // Sb.Voprosy tyurkologii. – Kazan', 1970. [Ganiev F.A. (1970) On synthetic and analytical cases in the Tatar language. // Sat. Questions of Turkology. – Kazan, 1970.] (in Russian)

10. Baskakov N.A. (1979) Istoriko-tipologicheskaya morfologiya tyurkskih yazykov. – M.:Nauka, 1979. [Baskakov N.A. (1979) Historical and typological morphology of the Turkic languages. – M.:Nauka, 1979.] (in Russian)

11. Xabichev M.A. (1989) Imennoe slovoobrazovanie i formoobrazovanie kumanskih yazykax. – M.:Nauka, 1989. – 217 s. [Khabichev M.A. (1989) Nominal word formation and formation of Cuman languages. – M.:Nauka, 1989. – 217 p.] (in Russian)


Supplementary files

1. Неозаглавлен
Subject
Type Исследовательские инструменты
Download (40KB)    
Indexing metadata

Review

For citations:


Zhanabekova A., Alpysbay Zh. TECHNOLOGY AND PROBLEMS OF AUTOMATIC LINGUOANALYSIS OF THE MORPHOLOGICAL COMPOSITION OF WORDS. Tiltanym. 2022;(4):15-25. (In Kazakh) https://doi.org/10.55491/2411-6076-2022-4-15-25

Views: 332


Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.


ISSN 2411-6076 (Print)
ISSN 2709-135X (Online)