Frequency of grammatical forms in corpus texts (based on https://qazcorpora.kz)
DOI:
https://doi.org/10.31489/2026phi1(121)/48-58
Keywords:
national corpus, text, verb, grammatical form, frequency index
Abstract
This article examines the frequency of grammatical verb forms as part of a broader question: how effectively corpus materials can be used to study the grammar of the Kazakh language. Corpus texts are an important source for linguistic research, so the corpus base must be regularly updated and improved. Without such studies, the direction of language development becomes unclear. In international practice, corpus databases have long been used in linguistic research, with various mathematical methods applied to analyze the frequency of words and forms. The subcorpora of the National Corpus of the Kazakh Language (qazcorpora.kz), developed on the basis of international experience, offer convenient search mechanisms that are especially useful for studying the frequency of word forms. On the main page of the qazcorpora.kz website, the “Statistics” section provides a link that lets users obtain the necessary statistical information automatically. The article examines the frequency of grammatical forms of Kazakh verbs in journalistic and spoken texts, based on frequency statistics from the subcorpora of the National Corpus of the Kazakh Language (qazcorpora.kz). Special attention is paid to a comparative analysis of the most frequently used grammatical forms of converbs, participles, and verbs of state. In particular, the article considers the frequency of verb forms such as dep, degen, emes, edi, bolyp, eken, boldy, bolgan, alyp, bolady, kelgen, bolsa, zhatkan, otyr, deidi, keledi, dedi, bolatyn, zhatyr, algan, kelip, zhurgen, otyryp, tur, otyrgan, turgan, zhurip, aldy, turatyn, zhur, alatyn, and turyp. A semantic and contextual analysis is carried out to explain why these verb forms are so frequent in the corpus texts. The article analyzes the polysemy and versatility of the auxiliary verbs de, e, bol, al, kel, zhur, otyr, tur, and zhat, as well as their role in forming stable phrases and analytic forms in journalistic and colloquial style. It identifies how the verb forms dep, emes, bolyp, alyp, kelgen, zhatkan, otyr, tur, and zhurgen combine with other words in stable phrases and analyzes their functions in creating various syntactic relations (attributive, circumstantial, predicative). Their main functions in forming free and complex phrases, as well as analytic verb forms, are systematized. Conclusions are drawn about the frequency and functional significance of these verb forms in modern Kazakh texts.
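As a rough illustration of the frequency counting described above, the following minimal Python sketch tallies a handful of the listed verb forms in a tokenized text and normalizes the counts per million tokens. It is not the method used by qazcorpora.kz: the function name `form_frequencies`, the `TARGET_FORMS` watch list, the simple regex tokenizer, and the per-million normalization are all assumptions made for this example, and the sample string is a made-up toy input rather than corpus data.

```python
from collections import Counter
import re

# Hypothetical watch list: a few transliterated verb forms named in the abstract.
TARGET_FORMS = {"dep", "degen", "emes", "edi", "bolyp", "eken"}


def form_frequencies(text, targets=TARGET_FORMS, per=1_000_000):
    """Return {form: (raw count, count per `per` tokens)} for each target form."""
    # Naive tokenizer (an assumption): lowercase, then split on non-word characters.
    tokens = re.findall(r"\w+", text.lower())
    counts = Counter(t for t in tokens if t in targets)
    total = len(tokens)
    return {
        form: (counts[form], counts[form] * per / total if total else 0.0)
        for form in sorted(targets)
    }


# Toy input only; real counts would come from corpus texts.
sample = "dep edi dep emes bolyp x y z"
for form, (raw, per_million) in form_frequencies(sample).items():
    print(form, raw, round(per_million))
```

Normalizing by total token count is what makes figures comparable between subcorpora of different sizes, for example journalistic versus spoken texts.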







