ADA Library Digital Repository

Monaural Multi-Speaker Speech Separation in Azerbaijani Language

Show simple item record

dc.contributor.author Rzayev, Kamran
dc.date.accessioned 2025-11-06T11:19:05Z
dc.date.available 2025-11-06T11:19:05Z
dc.date.issued 2022
dc.identifier.uri http://hdl.handle.net/20.500.12181/1534
dc.description.abstract The objective of speech separation is to distinguish and separate different speakers’ utterances from each other and often also from background noise. Speech separation is one of the fundamental problems of signal processing domain that has a big diversity of applications, including hearing prosthesis, mobile telecommunication, and robust automatic speech and speaker recognition. The capability of the human hearing to isolate one sound source from a combination of several sounds from multiple sources is exceptional. Humans appear to be capable of following the utterances of one speaker in the presence of other speakers and background noises with extremely low to no amount of efforts even in noisy environments such as parties. The problem of separation of multiple speakers from noise into separate utterances is referred as “cocktail party problem”. This problem has been investigated worldwide but there is no evidence of using Azerbaijani dataset to solve this kind of problem. This paper aims to investigate how different language affects the solution if this problem and proposes several solutions as well. The first approach will use included Support Vector Machine, Multi-Layer perceptron, Decision Tree Classifier, Random Forest Classifier and K-Nearest Neighbors. Additionally, pretrained models will be used to experiment on custom dataset as an alternative to proposed solution. All the models are going to be evaluated on several evaluation metrics such as accuracy, SNR, SDR, precision, recall. en_US
dc.language.iso en en_US
dc.publisher ADA University en_US
dc.rights Attribution-NonCommercial-NoDerivs 3.0 United States *
dc.rights.uri http://creativecommons.org/licenses/by-nc-nd/3.0/us/ *
dc.subject Speech separation. en_US
dc.subject Signal processing. en_US
dc.subject Machine learning. en_US
dc.subject Noise reduction (Sound) en_US
dc.subject Azerbaijani language -- Data processing. en_US
dc.subject Computational linguistics. en_US
dc.title Monaural Multi-Speaker Speech Separation in Azerbaijani Language en_US
dc.type Thesis en_US
dcterms.accessRights Absolute embargo: Only bibliographic record and abstract


Files in this item

Files Size Format View

There are no files associated with this item.

The following license files are associated with this item:

This item appears in the following Collection(s)

Show simple item record

Attribution-NonCommercial-NoDerivs 3.0 United States Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivs 3.0 United States

Search ADA LDR


Advanced Search

Browse

My Account