
A preliminary study on improving the recognition of esophageal speech using a hybrid system based on statistical voice conversion

Overview of attention for article published in SpringerPlus, October 2015

Mentioned by: 1 X user
Citations: 5 Dimensions
Readers on Mendeley: 17

Title: A preliminary study on improving the recognition of esophageal speech using a hybrid system based on statistical voice conversion
Published in: SpringerPlus, October 2015
DOI: 10.1186/s40064-015-1428-2
Authors: Othman Lachhab, Joseph Di Martino, Elhassane Ibn Elhaj, Ahmed Hammouch

Abstract

In this paper, we propose a hybrid system based on a modified statistical GMM voice conversion algorithm for improving the recognition of esophageal speech. This hybrid system aims to compensate for the distorted information present in the esophageal acoustic features by using a voice conversion method. The esophageal speech is converted into a "target" laryngeal speech using an iterative statistical estimation of a transformation function. We did not apply a speech synthesizer to reconstruct the converted speech signal, because the converted Mel cepstral vectors are used directly as input to our speech recognition system. Furthermore, the feature vectors are linearly transformed by the HLDA (heteroscedastic linear discriminant analysis) method to project them into a smaller space with good discriminative properties. The experimental results demonstrate that our proposed system improves phone recognition accuracy by an absolute 3.40% compared with the accuracy obtained with neither HLDA nor voice conversion.
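
As a rough illustration of the kind of feature-domain mapping the abstract refers to, the sketch below implements the classic joint-density GMM conversion function in Python with NumPy. It is not the authors' modified algorithm and does not cover their iterative estimation procedure. The function names (`gaussian_logpdf`, `convert_frame`) and all parameter names are assumptions made for this example, and the GMM parameters are taken as already trained. The function maps one source feature vector (for example, a Mel cepstral vector) to the target space without synthesizing a waveform.

```python
import numpy as np


def gaussian_logpdf(x, mean, cov):
    """Log density of a multivariate normal evaluated at one vector x."""
    d = x.shape[0]
    diff = x - mean
    _, logdet = np.linalg.slogdet(cov)
    maha = diff @ np.linalg.solve(cov, diff)
    return -0.5 * (d * np.log(2.0 * np.pi) + logdet + maha)


def convert_frame(x, weights, mu_x, mu_y, cov_xx, cov_yx):
    """Map a source vector x to the target space (classic joint-density GMM).

    F(x) = sum_m p(m | x) * (mu_y[m] + cov_yx[m] @ inv(cov_xx[m]) @ (x - mu_x[m]))

    All arguments are hypothetical, pre-trained parameters:
      weights: (M,) mixture weights
      mu_x, mu_y: (M, D) source and target means
      cov_xx, cov_yx: (M, D, D) source covariance and target/source cross-covariance
    """
    n_mix = len(weights)

    # Posterior probability of each mixture component given the source vector.
    log_post = np.array([
        np.log(weights[m]) + gaussian_logpdf(x, mu_x[m], cov_xx[m])
        for m in range(n_mix)
    ])
    log_post -= log_post.max()  # stabilise before exponentiating
    post = np.exp(log_post)
    post /= post.sum()

    # Posterior-weighted sum of the per-component linear regressions.
    y = np.zeros_like(mu_y[0], dtype=float)
    for m in range(n_mix):
        regression = cov_yx[m] @ np.linalg.solve(cov_xx[m], x - mu_x[m])
        y += post[m] * (mu_y[m] + regression)
    return y
```

In a pipeline like the one described, each converted vector would be passed on to the recognizer's front end (after a further linear transform such as HLDA) rather than to a vocoder.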

X Demographics

The data shown below were collected from the profile of 1 X user who shared this research output.
Mendeley readers

The data shown below were compiled from readership statistics for 17 Mendeley readers of this research output.

Geographical breakdown

Country | Count | As %
India | 1 | 6%
Unknown | 16 | 94%

Demographic breakdown

Readers by professional status | Count | As %
Researcher | 6 | 35%
Student > Doctoral Student | 2 | 12%
Student > Bachelor | 2 | 12%
Student > Ph. D. Student | 2 | 12%
Professor > Associate Professor | 2 | 12%
Other | 3 | 18%

Readers by discipline | Count | As %
Computer Science | 6 | 35%
Engineering | 3 | 18%
Nursing and Health Professions | 1 | 6%
Agricultural and Biological Sciences | 1 | 6%
Psychology | 1 | 6%
Other | 3 | 18%
Unknown | 2 | 12%

Attention Score in Context

This research output has an Altmetric Attention Score of 1. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 09 November 2015.
All research outputs: #18,836,331 of 23,344,526 outputs
Outputs from SpringerPlus: #1,274 of 1,856 outputs
Outputs of similar age: #206,423 of 285,682 outputs
Outputs of similar age from SpringerPlus: #82 of 129 outputs
Altmetric has tracked 23,344,526 research outputs across all sources so far. This one is in the 11th percentile – i.e., 11% of other outputs scored the same or lower than it.
So far Altmetric has tracked 1,856 research outputs from this source. They typically receive a little more attention than average, with a mean Attention Score of 5.8. This one is in the 20th percentile – i.e., 20% of its peers scored the same or lower than it.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 285,682 tracked outputs that were published within six weeks on either side of this one in any source. This one is in the 15th percentile – i.e., 15% of its contemporaries scored the same or lower than it.
We're also able to compare this research output to 129 others from the same source and published within six weeks on either side of this one. This one is in the 24th percentile – i.e., 24% of its contemporaries scored the same or lower than it.
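
The "scored the same or lower" wording used for the percentiles above can be sketched as a small calculation. This is only an illustration of that definition, not Altmetric's actual implementation; the function name `percentile_same_or_lower` and the list of scores are made up for the example.

```python
def percentile_same_or_lower(score, other_scores):
    """Percentage of the other outputs whose score is the same as or lower than `score`."""
    if not other_scores:
        raise ValueError("need at least one other output to compare against")
    same_or_lower = sum(1 for s in other_scores if s <= score)
    return 100.0 * same_or_lower / len(other_scores)


# Hypothetical Attention Scores for ten other outputs (invented for illustration).
others = [0, 0, 1, 3, 5, 8, 12, 20, 40, 75]
print(percentile_same_or_lower(1, others))
```

Because tied scores count as "the same or lower", a percentile computed this way need not match the figure implied by an output's rank alone.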