dc.contributor.author: MINSAFINA, Alına
dc.contributor.author: Suleymanov, Dzhavdet
dc.contributor.author: Gilmullin, Rinat
dc.contributor.author: Kubedinova, Lenara
dc.contributor.author: Abdurakhmonova, Nilufar
dc.contributor.author: Khusainov, Aidar
dc.date.accessioned: 2021-12-10T12:14:18Z
dc.date.available: 2021-12-10T12:14:18Z
dc.identifier.citation: Khusainov A., Suleymanov D., Gilmullin R., MINSAFINA A., Kubedinova L., Abdurakhmonova N., "First results of the “TurkLang-7” project: Creating Russian-turkic parallel corpora and MT systems", 2020 Computational Models in Language and Speech Workshop, CMLS 2020, Kazan, Russia, 12-13 November 2020, vol. 2780, pp. 90-101
dc.identifier.other: av_b02cd09d-a1cc-4d93-8ba4-6423db38a70d
dc.identifier.other: vv_1032021
dc.identifier.uri: http://hdl.handle.net/20.500.12627/173497
dc.identifier.uri: https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85098169528&origin=inward
dc.description.abstract: Copyright © 2020 for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0). The idea of the “TurkLang-7” project is to create datasets and neural machine translation systems for a set of Russian-Turkic low-resource language pairs. It is planned to achieve this goal through a hybrid approach to the creation of a multilingual parallel corpus between Russian and Turkic languages, studying the applicability and effectiveness of neural network learning methods (transfer learning, multi-task learning, back-translation, dual learning) in the context of the selected language pairs, as well as the development of specialized methods for the unification of parallel data in different languages, based on the agglutinative nature of the selected Turkic languages (structural and functional model of the Turkic morpheme). In this paper, we describe the main stages of work on this project and the results of the first year: we developed a semiautomatic process for creating parallel corpora, collected data from several sources on 7 Turkic languages, and conducted the first experiments to create machine translation systems.
dc.language.iso: eng
dc.subject: General Computer Science
dc.subject: Physical Sciences
dc.subject: Engineering and Technology
dc.subject: Computer Sciences
dc.subject: Computer Science
dc.subject: Engineering, Computing and Technology (ENG)
dc.title: First results of the “TurkLang-7” project: Creating Russian-turkic parallel corpora and MT systems
dc.type: Conference paper
dc.contributor.department: Tatarstan Academy of Sciences
dc.identifier.volume: 2780
dc.contributor.firstauthorID: 2622437


