dc.contributor.author | Aliyeva, Farida | |
dc.date.accessioned | 2023-10-15T15:59:05Z | |
dc.date.available | 2023-10-15T15:59:05Z | |
dc.date.issued | 2022-04 | |
dc.identifier.uri | http://hdl.handle.net/20.500.12181/717 | |
dc.description.abstract | Our Speech comprises of paralinguistic features such as: identity, age, gender, accent etc. The objective of this paper is to identify, the age-group and gender of the speaker from Azerbaijani speech. The paper will be focused on both the adult and children speech identification. The identification of the age and gender of children speech is more complex than the adult’s speech as the voice of boys and girls before puberty coincide and additionally the puberty complicates the distinction between an adult and a teenager which results in possible errors with age-group identification. Moreover, the existence of numerous accents of Azerbaijani data an additional milestone. To identify age and gender from the speech the data should be pre-processed and then the features extracted. Next step is to classify according to results obtained. Various approaches are going to be tested such as x-vectors and i-vectors that are based on Deep Neural Network architecture. Then there is MFCC - a feature extraction technique a part of Automatic voice processing for unique feature extraction. On top of that the GMM-SVM model which is a Gaussian mixture model is run. KNN and MLP are another prominent approach to be used as a classifier for age and gender identification problem. Another feature extraction technique called SDC – shifted delta cepstral coefficient will be tested and compared with the MFCC results. The music and audio analysis package called Librosa and a PyAudio library are used to enable the record and play of an audio for demonstration purposes in the future. There outcome of the model is going to be classified into 4 age groups which are: Children (7-14), Young aged (15-24), Middle aged (25-54) and Seniors (55-80) and 2 genders: Male and Female. A proper identification of age and gender is sometimes a hard task for a human being as well which complicated the identification process for the machines. | en_US |
dc.language.iso | en | en_US |
dc.publisher | ADA University | en_US |
dc.rights | Attribution-NonCommercial-NoDerivs 3.0 United States | * |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/3.0/us/ | * |
dc.subject | GMM. | en_US |
dc.subject | MFCC. | en_US |
dc.subject | SDC. | en_US |
dc.subject | PyAudio. | en_US |
dc.subject | Librosa. | en_US |
dc.subject | DNN. | en_US |
dc.subject | KNN. | en_US |
dc.subject | MLP. | en_US |
dc.subject | Paralinguistics. | en_US |
dc.subject | X-vectors. | en_US |
dc.subject | I-vectors. | en_US |
dc.title | Age-group and gender identification from speech in Azerbaijani language | en_US |
dc.type | Thesis | en_US |
The following license files are associated with this item: