ADA Library Digital Repository

Development of Large-scale National Azerbaijan Corpus with UI and Functionality

Show simple item record

dc.contributor.author Alizada, Emin
dc.date.accessioned 2024-12-19T23:57:23Z
dc.date.available 2024-12-19T23:57:23Z
dc.date.issued 2023-04
dc.identifier.uri http://hdl.handle.net/20.500.12181/932
dc.description.abstract This paper presents the creation of a large-scale Azerbaijani language corpus with more than 50 million tokens, and the development of several functionalities for language analysis and corpus linguistics, including Word Frequency, Ngrams, Concordance, Thesaurus, and Word Sketch. The corpus was collected from various sources, including Azerbaijani books, articles, and websites, and was stored in a relational database. The paper provides a detailed description of the corpus creation process and the database schema used to store the corpus, as well as dives into the creation of each of the functionality of the corpus, and what kind of insights it is possible to get from the given functionality set. Afterwards, the paper analyzes different corpus applications and analyzes their interfaces and user experience provided by the application, before introducing the online application for the Azerbaijani language corpus to make the corpus and its functionalities available to the linguists, researchers and language learners. The functionalities were implemented using Python, and the user interface was created using Next.js. The final product is a web application that allows users to access all the functionalities of the corpus easily. en_US
dc.language.iso en en_US
dc.publisher ADA University en_US
dc.rights Attribution-NonCommercial-NoDerivs 3.0 United States *
dc.rights.uri http://creativecommons.org/licenses/by-nc-nd/3.0/us/ *
dc.subject Azerbaijani language -- Corpora en_US
dc.subject Computational linguistics -- Tools and techniques en_US
dc.subject Language and languages -- Computer-assisted analysis en_US
dc.title Development of Large-scale National Azerbaijan Corpus with UI and Functionality en_US
dc.type Thesis en_US


Files in this item

The following license files are associated with this item:

This item appears in the following Collection(s)

Show simple item record

Attribution-NonCommercial-NoDerivs 3.0 United States Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivs 3.0 United States

Search ADA LDR


Advanced Search

Browse

My Account