O‘zbek tili matnlarini qoidaga asoslangan nuqta va vergullarini tahlil qilish algoritmlari
O‘zbek tili matnlarini qoidaga asoslangan nuqta va vergullarini tahlil qilish algoritmlari
Abstract
Tinish belgilarini tahlil qilish bilan dunyoning ko‗pchilik olimlari shug‗ullangan, jumladan, yaqin
vaqtgacha tinish belgilari nazariy va hisoblash tilshunosligining aksariyat tadqiqotchilari tomonidan
e‘tibordan chetda edi. Bu mavhum muammo uchun ixcham, rasmiy asosning yo‗qligi bilan bog‗liq.
Biroq, tinish belgilari yozma tilning orfografik tarkibiy qismi ekanligini esga olsak, tinish belgilariga oid
tadqiqotlar oqilona ma‘noga ega ekanligini ko‗ramiz. Shunga ko‗ra, so‗nggi o‗n yillikda mavzuga
qiziqish ortdi, chunki tinish belgilarini hisobga olmasdan yozma tilni to‗liqroq tushunish v
References
M. Bayraktar, B. Say, and V. Akman, ―An analysis of English punctuation: the special case of comma,‖ International Journal of Corpus Linguistics, vol. 3, Jul. 1998, doi: 10.1075/ijcl.3.1.03bay.
D. Hardt, ―Comma checking in Danish,‖ 2001.
U. Salaev, E. Kuriyozov, and C. Gómez-Rodríguez, ―SimRelUz: Similarity and Relatedness scores as a Semantic Evaluation Dataset for Uzbek Language,‖ in 1st Annual Meeting of the ELRA/ISCA Special Interest Group on Under-Resourced Languages, SIGUL 2022 - held in conjunction with the International Conference on Language Resources and Evaluation, LREC 2022 - Proceedings, 2022, pp. 199 – 206.
[Online]. Available: https://www.scopus.com/inward/record.uri? eid=2-.38700420&partnerID=40&md5=bf476cd74317f06577dd0548c5c600d6
A. M. Abdurashetona and I. O. Ismailovich, ―Methods of Tagging Part of Speech of Uzbek Language,‖ in Proceedings - 6th International Conference on Computer Science and Engineering, UBMK 2021,2021, pp. 82 – 85. doi: 10.1109/UBMK52708.2021.9558900.
A. M. Abdurashetona and U. Mokhiyakon, ―Software Features and Linguistic Features of Uzbek Synonymizer,‖ in Proceedings - 7th International Conference on Computer Science and Engineering, UBMK 2022, 2022, pp. 171 – 175. doi: 10.1109/UBMK55850.2022.9919447.
B. Mengliyev, S. Shahabitdinova, S. Khamroeva, S. Gulyamova, and A. Botirova, ―The morphological analysis and synthesis of word forms in the linguistic analyzer,‖ Journal of Language and Linguistic Studies, vol. 17, no. 1, pp. 558 – 564, 2021, [Online]. Available:
K. Madatov, S. Bekchanov, and J. Vičič, ―Dataset of Karakalpak language stop words,‖ Data Brief, vol.48, 2023, doi: 10.1016/j.dib.2023.109111.
M. Sharipov and O. Sobirov, ―Development of a Rule-Based Lemmatization Algorithm Through Finite State Machine for Uzbek Language,‖ in CEUR Workshop Proceedings, 2022, pp. 154 – 159. [Online]. Available: https://www.scopus.com/inward/record.uri?eid=2-s2.0-85146112590&partnerID=40&md5=e1080c39d101c0e351cfed1a8228d391
M. Sharipov and O. Yuldashov, ―UzbekStemmer: Development of a Rule-Based Stemming Algorithm for Uzbek Language,‖ in CEUR Workshop Proceedings, 2022, pp. 137 – 144. [Online]