KAZLLM NATIONAL LANGUAGE MODEL DEVELOPED
15.02.2025 00:09:36 1227
KAZLLM LARGE LANGUAGE MODEL DEVELOPED
As part of the task of the Head of State, a large-scale language model KazLLM has been developed, aimed at developing artificial intelligence in the Kazakh language.
Within the framework of the implementation of this task, the Ministry of Science and Higher Education of the Republic of Kazakhstan, involving the Institute of Intelligent Systems and Artificial Intelligence (ISSAI NU) at Nazarbayev University, scientific institutes and universities, carried out work to provide the Kazakh language corpus necessary for the KazLLM national language model.
This event is aimed at creating effective solutions for processing, translating, analyzing text information in the Kazakh language and integrating the Kazakh language into modern technologies. In the context of globalization and preserving the cultural identity of the country, the importance of the project is even greater.
More than 140 scientists and employees of 26 leading institutes and universities of the country, who participated in the development of the Kazakh-language corpus required for KazLLM, were engaged in the preparation of large volumes of data in 115 fields of economics, finance, mathematics, history, biology, chemistry, medicine, technology and others. For example, the Kazakh National University named after Al-Farabi was engaged in the preparation of data in the fields of philosophy, ethics, PR, astronomy, astrophysics, information technology, the Institute of Mathematics and Mathematical Modeling in the field of mathematics, the Sh. Ualikhanov Institute of History and Ethnology in the field of history, and medical universities were engaged in the preparation of data in the field of medicine. This cooperation with scientific and higher education institutions contributed to the creation of unique content in the Kazakh language, which ensured the high-quality and effective development of the model.
Today, the open-source version of KazLLM is available on the https://huggingface.co/issai platform.
This model, which is an important part of the digital infrastructure, will be used for non-commercial scientific and academic purposes, as well as in the development of chatbots, virtual assistants, and automatic translators similar to Google Translate.
Source : https://www.gov.kz/memleket/entities/sci/press/news/details/937206