What is it about?
his paper presents a new method to evaluate the quality of speech signals through images generated from a psychoacoustic model to estimate PESQ (ITU-T P862) values using a first-order Fuzzy Sugeno approach implemented in the Adaptive Neuro-Fuzzy Inference System - ANFIS. The factors feeding the network were obtained using an image-processing technique from the perceptual model coefficients. All simulations were performed using a database containing clean and corrupted signals by eight types of noises found in everyday situations. The proposal uses the PESQ values of the signals to train the network. The analyses proved that the predictive performance will depend on the choice of a psychoacoustic model, the factor extraction technique, the combination of these factors, the fuzzification algorithm, and the type of membership function in the ANFIS input space.
Featured Image
Photo by Soundtrap on Unsplash
Read the Original
This page is a summary of: Model predictive PESQ-ANFIS/FUZZY C-MEANS for image-based speech signal evaluation, Speech Communication, October 2023, Elsevier,
DOI: 10.1016/j.specom.2023.102972.
You can read the full text:
Contributors
The following have contributed to this page