ADA Library Digital Repository

Age-group and gender identification from speech in Azerbaijani language

Show simple item record

dc.contributor.author Aliyeva, Farida
dc.date.accessioned 2023-10-15T15:59:05Z
dc.date.available 2023-10-15T15:59:05Z
dc.date.issued 2022-04
dc.identifier.uri http://hdl.handle.net/20.500.12181/717
dc.description.abstract Our Speech comprises of paralinguistic features such as: identity, age, gender, accent etc. The objective of this paper is to identify, the age-group and gender of the speaker from Azerbaijani speech. The paper will be focused on both the adult and children speech identification. The identification of the age and gender of children speech is more complex than the adult’s speech as the voice of boys and girls before puberty coincide and additionally the puberty complicates the distinction between an adult and a teenager which results in possible errors with age-group identification. Moreover, the existence of numerous accents of Azerbaijani data an additional milestone. To identify age and gender from the speech the data should be pre-processed and then the features extracted. Next step is to classify according to results obtained. Various approaches are going to be tested such as x-vectors and i-vectors that are based on Deep Neural Network architecture. Then there is MFCC - a feature extraction technique a part of Automatic voice processing for unique feature extraction. On top of that the GMM-SVM model which is a Gaussian mixture model is run. KNN and MLP are another prominent approach to be used as a classifier for age and gender identification problem. Another feature extraction technique called SDC – shifted delta cepstral coefficient will be tested and compared with the MFCC results. The music and audio analysis package called Librosa and a PyAudio library are used to enable the record and play of an audio for demonstration purposes in the future. There outcome of the model is going to be classified into 4 age groups which are: Children (7-14), Young aged (15-24), Middle aged (25-54) and Seniors (55-80) and 2 genders: Male and Female. A proper identification of age and gender is sometimes a hard task for a human being as well which complicated the identification process for the machines. en_US
dc.language.iso en en_US
dc.publisher ADA University en_US
dc.rights Attribution-NonCommercial-NoDerivs 3.0 United States *
dc.rights.uri http://creativecommons.org/licenses/by-nc-nd/3.0/us/ *
dc.subject GMM. en_US
dc.subject MFCC. en_US
dc.subject SDC. en_US
dc.subject PyAudio. en_US
dc.subject Librosa. en_US
dc.subject DNN. en_US
dc.subject KNN. en_US
dc.subject MLP. en_US
dc.subject Paralinguistics. en_US
dc.subject X-vectors. en_US
dc.subject I-vectors. en_US
dc.title Age-group and gender identification from speech in Azerbaijani language en_US
dc.type Thesis en_US


Files in this item

The following license files are associated with this item:

This item appears in the following Collection(s)

Show simple item record

Attribution-NonCommercial-NoDerivs 3.0 United States Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivs 3.0 United States

Search ADA LDR


Advanced Search

Browse

My Account