SlideShare uma empresa Scribd logo
1 de 7
Baixar para ler offline
IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308
_______________________________________________________________________________________
Volume: 04 Issue: 03 | Mar-2015, Available @ http://www.ijret.org 361
AN EFFECTIVE EVALUATION STUDY OF OBJECTIVE MEASURES
USING SPECTRAL SUBTRACTIVE ENHANCED SIGNAL
Sukhvinder Kaur1
, Anil Garg2
1
M.Tech Student MMEC, Mullana, M.M. University, Mullana-Ambala-Haryana, India
2
Associate Prof. in ECE department MMEC, Mullana, M.M. University, Mullana-Ambala-Haryana, India
Abstract
Unwanted noises have a negative influence over communication because it disturbs the conversation and make the
communication impossible. Speech enhancement algorithms are used for improving the quality and intelligibility or to reduce
listener fatigues. Assessment of speech quality can be done by using either subjective listening test or objective quality measure.
Evaluation of several objective measures with the speech processed by enhancement algorithms has been performed but these
having limitations to assess original speech signal. This paper represents the study of speech quality measures and compute the
values used for regression analyses of the objective measures evaluation study using spectral subtraction algorithm based
enhanced speech signal.
Keywords: MOS, ITU-T (P.835), SNRseg, log- likelihood ratio and itakura-saito.
---------------------------------------------------------------------***--------------------------------------------------------------------
1. INTRODUCTION
Speech Communication is an integral part of daily life.
Communication systems include various applications such
as, mobile phones, speech coding and compression, medical
devices such as hearing aids etc. All these are associated
with different types of noise or sounds. Unwanted sounds
are commonly referred as disturbances. It can be both noise
or interferences and which are not acceptable in the
communication process. Therefore, unwanted sounds are
having a negative influence over the communication
because they may disturb the conversation and make it
impossible to communicate at all. Signal Processing
methods are the effective way for aiding the speech
communication by reducing the disturbance levels with
respect to the level of the speech. Such speech processing
methods are known as speech enhancement methods.
Speech enhancement algorithms improve the quality and
intelligibility of speech or reduce listener fatigues.
Speech enhancement has several real world applications
which include: Telecommunications, Electronic hearing aids
and Automatic speaker recognition software. These all are
of important quality and intelligibility of speech which
vastly improves the user’s listening experience. Two criteria
are which generally used for measuring the performance of
speech: quality and intelligibility. It is quite hard to satisfy
both of them at the same time. Quality is the subjective
measure which indicates the amount of effort needed to
understand the speech material. Whereas Intelligibility, is an
objective measure which signifies the amount of speech
material correctly understood.
This paper is represented in such a manner that section 2
provides brief review of the different speech enhancement
techniques and spectral subtraction algorithm. The speech
quality measures are discussed in section 3. In the Section 4
evaluated results are shown and the last section 5 represents
the conclusion part. References are given in the last section
of the paper.
2. SPEECH ENHANCEMENT TECHNIQUES
Speech enhancement methods can be classified in two ways:
(1) Single channel enhancement technique (2) Multi channel
enhancement technique. Single channel enhancement
techniques are applied to situations in which only one
acquisition channel is available. Multi channel
enhancement techniques are employed in microphone arrays
and take advantage of availability of multiple signal
inputs to system, by making possible use of phase
alignment to reject the undesired noise components. Single
channel enhancement techniques are further divided again
into different categories: Short Time Spectral Amplitude
Estimation (i.e. Spectral subtraction and Non Causal Weiner
Filtering), Periodicity of Voiced Speech based, Adaptive
Comb Filtering based, Harmonic Selection based, adaptive
noise canceling techniques based, Speech model based (i.e.
Kalman Filtering Hidden Markov Modelling(HMM)).
Spectral-subtractive algorithms: Is the simplest speech
enhancement algorithms based on the principle of estimating
the clean signal spectrum by subtracting an estimated noise
spectrum from the noisy speech spectrum. Statistical model
based algorithm: are based on the estimation of linear and
non linear transform coefficients of clean signal by
measuring the Fourier transform coefficients of the noisy
signal. E.g., Wiener algorithm and minimum mean square
error (MMSE) algorithms. Subspace Algorithms: are based
on the principle that the clean signals estimation is done by
simply nulling the component of the noisy vector residing in
the noise subspace.
IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308
_______________________________________________________________________________________
Volume: 04 Issue: 03 | Mar-2015, Available @ http://www.ijret.org 362
2.1 Spectral-Subtractive Algorithms
The spectral subtraction algorithm is historically one of the
first algorithms proposed for noise reduction based on the
simple principle of assuming additive noise. Estimated clean
signal spectrum can be obtained by subtracting an average
noise spectrum from the noisy speech spectrum.
The noise spectrum can be estimated, and updated, during
the periods when the signal is absent or when only noise is
present. Subtraction process needs to be done carefully to
avoid speech distortions. If too much subtraction is done,
then some speech information might be removed while if the
subtraction is too little then much of the interfering noise
remains.
3. SPEECH QUALITY MEASURES
Speech processing algorithms usage is rapidly increasing,
which raises the need of speech quality evaluation. The
main evaluation method of speech enhancement system
affects the intelligibility of the speech signal and improves
the overall quality of the signal. The assessment of speech
quality can be done by either using subjective listening tests
or objective quality measures i.e. Subjective measures and
Objective measures.
3.1 Subjective Measures
Subjective evaluation involves comparisons of original and
processed speech signals by a group of listeners who are
asked to rate the quality of speech along a pre-determined
scale. This evaluation method requires the judgment of
human listeners. These can be further divided into two
categories: (1) Relative Preference Methods - multiple
signals are compared. (2)Absolute Category Rating Methods
- a single stimulus is tested.
Table 1.1 MOS rating scale
Rating Speech
quality
Level of distortion
5 Excellent Imperceptible
4 Good Just perceptible, but not annoying
3 Fair Perceptible and slightly annoying
2 Poor Annoying , but not objectionable
1 Bad Very annoying and objectionable
MOS (Mean Opinion score) is the most widely used
subjective evaluation method [8]. The MOS test is based on
five category rating scale of the speech quality given in
Table 1.1. ITU-T standard (P.835) is designed for
integrating the effect of signal and background distortions.
Subjective evaluation is the reliable method for assessment
of speech quality but these are time consuming and
expensive. This evaluation method is stringent in nature as it
needed a set of trained listeners for determining the quality
of speech. Therefore need of objective evaluation arises.
3.2 Objective Measures
Objective evaluation involves a mathematical comparison of
original and processed speech. In this evaluation method the
judgment is predicted with some analysis of the system. In
this method quality is quantified by measuring the numerical
distance between the original and processed signals. There
are different types of objective quality measures which are
further categorized as: (1) Time and frequency signal to
noise ratio measures, (2) Spectral distance measure based on
LPC, (3) Perceptually Motivated measures and (4)
Composite measures.
The objective measures accesses the quality of the processed
speech without accessing the original speech signal.
Objective measures of speech quality are implemented by
first segmenting the speech signal into 10-30 ms frames, and
then computing a distortion measure between the original
and processed signals. Distortion measure computation can
either be done in the time domain (e.g. signal to noise ratio
measures) or in the frequency domain (LPC spectral
distance measures). In the frequency domain measure
distortions and differences detected in the magnitude spectra
are correlated with speech quality.
In Time and Frequency Signal to Noise Ratio Measures, the
segmental signal to noise ratio can be evaluated either in
terms of time or frequency domain. The original and
processed signals both are aligned either in time domain or
in frequency domain for the calculation. The main
advantage of using frequency based segmental SNR over the
time domain (SNRseg) is the flexibility to place different
weights for different frequency bands of the spectrum.
In the Spectral Distance Measure Based on LPC several
LPC based objective measures are proposed on the basis of
all pole models of the clean and enhanced speech signals.
Three different LPC based objective measures include: The
LLR (log- likelihood ratio) measure, the IS ( itakura-saito)
measure and the Cepstrum distance measure. Cepstral
distance measure is derived from the LPC coefficients.
Perceptually Motivated Measures are used for the modeling
of auditory processes like normal listening frequency
selectivity and the perceived loudness. Weighted spectral
slope (WSS), Bark distortion measures (BSD) and
Perceptual evaluation of speech quality (PESQ) measures
are the perceptually motivated measures.
Composite Measures are formed by combining multiple
objective measures. Regression analysis is used to compute
the optimum combination of objective measure for
maximum correlation. The selection of objective measures
for composite measures depends basically on the
experimental evidence and intuition. MARS technique can
be used for composite measure evaluation.
Figure of Merits are obtained by statistical analysis. This
analysis is necessary for the validity of evaluation method of
objective measures. Statistical analysis is used to assess the
correlation between subjective score and the values of the
IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308
_______________________________________________________________________________________
Volume: 04 Issue: 03 | Mar-2015, Available @ http://www.ijret.org 363
objective measures. Two types of figure of merit are
calculated in statistical analysis: (1) Pearson’s correlation
coefficient. (2) Standard error of the estimate.
The first figure of merit Pearson’s correlation coefficient is
used to obtain the correlation between subjective listening
scores and objective measures. Second figure of merit is an
estimate of the standard deviation of the error obtained by
objective measures to predict subjective listening scores.
3.2.1 Time and Frequency Signal to Noise Ratio
Measures
The segmental signal to noise ratio is evaluated either in
terms of time or frequency domain. The time domain
measure is the simplest objective measure which is used to
evaluate speech enhancement and speech coding algorithms.
In this measure original and processed signal are aligned in
time and phase errors are present. The segmental signal to
noise ratio is given by eqn. (3.1)
(3.1)
Where is the original signal (clean), is the
enhanced signal, is the frame length and is the number
of frames in the signal. The segmental SNR in terms of
frequency domain produce the frequency weighted
segmental SNR is given in eqn. (3.2)
(3.2)
Where is the weight placed on the jth frequency band,
is the number of bands, is the total number of frames in
the signal. is the filter bank amplitude (excitation
spectrum) of the clean signal and is the filter bank
amplitude of enhanced signal in the same bank.
3.2.2 LPC based Objective Measures
LPC based objective measures including the log likelihood
ratio (LLR), Itakura Saito distance measure (IS), cepstrum
distance measure (CEP).The two most commonly used
measures are LLR and IS. Cepstral distance measure is
derived from the LPC coefficients.The log likelihood ratio
(LLR) measure is defined as in eqn.
(3.3)
Where the coefficients of the clean signal are, are
the coefficients of enhanced signal and is the
autocorrelation matrix of clean signal.The Itakura Saito(IS)
measure is defined as in eqn.(3.4)
(3.4)
Where and are the all pole gains of the clean and
enhanced signals.The cepstrum distance measure (CEP) is
defined as in eqn. (3.5)
(3.5)
Where and are the cepstrum coefficients of the
clean and enhanced signals.
4. EVALUATION RESULTS
Results evaluated for the Statistical Analysis of spectral
subtractive enhanced signal is discussed in this section. In
this paper, the clean and processed signal is used for
statistical analysis. Speech signal is first segmented into
frames and then standard deviation and mean values are
computed for the noisy and enhanced signals shown in table
1.2. The original time waveform for clean noise, noisy and
enhanced signals are shown in fig 1. These are used as the
primary tool for the result analysis.
IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308
_______________________________________________________________________________________
Volume: 04 Issue: 03 | Mar-2015, Available @ http://www.ijret.org 364
Fig 1: Time waveform representation of clean, noise, noisy and spectrally subtractive enhanced signal
The spectrogram of clean, noise, noisy and enhanced signal
shown in fig 2. Spectrogram is utilized for detecting the
differences in the magnitude spectra of waveform
information. The quality of the processed speech is
determined by without needing access to the original speech
signal.
IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308
_______________________________________________________________________________________
Volume: 04 Issue: 03 | Mar-2015, Available @ http://www.ijret.org 365
Fig 2: Spectrogram representation of clean, noise, noisy and spectrally subtractive enhanced signal
Computation of the LPC measure is done by estimating the
correlation coefficient by using the mean value and standard
deviation values. Table 1.2 shows the respective values of
mean and standard deviation for different objective
measures such as SNR, IS, LR, LLR, WSM. These values
are the LPC coefficients that can be used for computing the
objective measures.
Table 1.2 Table for mean and standard deviation values of
objective measure
Objective
measure
Mean value Standard
deviation
SNR 1.388178 1.727957
IS 30.170003 21.972701
LR 0.205720 0.189852
LLR 0.176183 0.139966
WSM 3.686912 2.256240
SNR plot of enhanced signal with respect to the noisy signal
is shown in fig 3. The plot shows the smaller values of the
signal energy during the interval of silence in the speech
signal results the larger negative value of (SNR).
Fig 3: SNR plot with respect to the noisy signal spectrum
IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308
_______________________________________________________________________________________
Volume: 04 Issue: 03 | Mar-2015, Available @ http://www.ijret.org 366
Itakura saito ratio (IS) plot of enhanced signal with respect
to the noisy signal is shown in fig 4. The plot shows the
smaller values of the signal energy during the interval when
the speech in the processed signal is of highest amplitude
i.e., speech is clearly understandable and distractive and
larger values of IS shown for the noise present in processed
signal.
Fig 4: IS (itakura- saito ratio) plot with respect to the noisy signal spectrum
LLR (log likelihood ratio) plot of enhanced signal with
respect to the noisy signal is shown in fig 5. The plot shows
the lower values of the signal energy during the interval of
speech signal and results the larger peak value of (LLR) for
the noise in the processed signal spectrum.
Fig 5: LLR (log likelihood ratio) plot with respect to the noisy signal spectrum.
5. CONCLUSION
Different speech enhancement algorithms have been
proposed to improve the performance of modern
communication devices in noisy environments. The key
conclusion drawn from this paper is that speech quality
assessment is done through subjective and objective
evaluation measures. In this paper study of objective
measure and computed values of the standard deviation and
mean for noisy and enhanced signal are given.
IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308
_______________________________________________________________________________________
Volume: 04 Issue: 03 | Mar-2015, Available @ http://www.ijret.org 367
It is very difficult to find reliable and fair comparison
among different algorithms due to lack of common speech
database used in algorithms evaluation, different types of
noise and differences in the testing methodology. The most
accurate method for evaluating speech quality is subjective
listening tests but these are time consuming and expensive.
This evaluation method is stringent in nature as it needed a
set of trained listeners for determining the quality of speech.
Therefore need of objective evaluation arises. As the frame
length increases SNR value decreases monotonically.
REFERENCES
[1]. Loizou, P., 2007. Speech Enhancement: Theory and
Practice. CRC Press, Boca Raton, FL.
[2]. Y. Hu and P. C. Loizou, “Subjective evaluation and
comparison of speech enhancement algorithms,” submitted
to Speech Communication.
[3]. Yi Hu and Philipos C. Loizou, “Evaluation of Objective
Quality Measures for Speech Enhancement”, IEEE Trans.
on Audio, Speech, and Language Processing, Vol. 16, No. 1,
January 2008.
[4]. P. Chu and D. Messerschmitt, “A frequency weighted
itakura-saito spectral distance measure”, IEEE Trans.
Acoustics, Speech and Signal Processing, vol. 30, pp. 545—
560, Aug. 1982.
[5]. Anuradha R. Fukane, “Enhancement of Noisy Speech
Signals for Hearing Aids”, 2011 International Conference
on Communication Systems and Network Technologies,
978-0-7695-4437-3/11© 2011 IEEE
[6]. Sunil D. Kamath,” A Multi-Band Spectral Subtraction
Method for Enhancing Speech Corrupted By Colored
Noise”, 2002
[7]. Jianfen Ma, Yi Hu and Philipos C. Loizou, “Objective
measures for predicting speech intelligibility in noisy
conditions based on new band-importance functions”, J.
Acoust. Soc. Am., Vol. 125, No. 5, May 2009.
[8]. Kuldip Paliwal, “Single-channel speech enhancement
using spectral subtraction in the short-time modulation
domain”, Elsevier, Speech Communication 52 450–475,
2010.

Mais conteúdo relacionado

Mais procurados

EFFECT OF DYNAMIC TIME WARPING ON ALIGNMENT OF PHRASES AND PHONEMES
EFFECT OF DYNAMIC TIME WARPING ON ALIGNMENT OF PHRASES AND PHONEMESEFFECT OF DYNAMIC TIME WARPING ON ALIGNMENT OF PHRASES AND PHONEMES
EFFECT OF DYNAMIC TIME WARPING ON ALIGNMENT OF PHRASES AND PHONEMES
kevig
 
Human Emotion Recognition From Speech
Human Emotion Recognition From SpeechHuman Emotion Recognition From Speech
Human Emotion Recognition From Speech
IJERA Editor
 
E0502 01 2327
E0502 01 2327E0502 01 2327
E0502 01 2327
IJMER
 

Mais procurados (20)

Noise reduction in speech processing using improved active noise control (anc...
Noise reduction in speech processing using improved active noise control (anc...Noise reduction in speech processing using improved active noise control (anc...
Noise reduction in speech processing using improved active noise control (anc...
 
Noise reduction in speech processing using improved active noise control (anc...
Noise reduction in speech processing using improved active noise control (anc...Noise reduction in speech processing using improved active noise control (anc...
Noise reduction in speech processing using improved active noise control (anc...
 
Comparative Study of Different Techniques in Speaker Recognition: Review
Comparative Study of Different Techniques in Speaker Recognition: ReviewComparative Study of Different Techniques in Speaker Recognition: Review
Comparative Study of Different Techniques in Speaker Recognition: Review
 
EFFECT OF DYNAMIC TIME WARPING ON ALIGNMENT OF PHRASES AND PHONEMES
EFFECT OF DYNAMIC TIME WARPING ON ALIGNMENT OF PHRASES AND PHONEMESEFFECT OF DYNAMIC TIME WARPING ON ALIGNMENT OF PHRASES AND PHONEMES
EFFECT OF DYNAMIC TIME WARPING ON ALIGNMENT OF PHRASES AND PHONEMES
 
H42045359
H42045359H42045359
H42045359
 
Human Emotion Recognition From Speech
Human Emotion Recognition From SpeechHuman Emotion Recognition From Speech
Human Emotion Recognition From Speech
 
IRJET- Study of Effect of PCA on Speech Emotion Recognition
IRJET- Study of Effect of PCA on Speech Emotion RecognitionIRJET- Study of Effect of PCA on Speech Emotion Recognition
IRJET- Study of Effect of PCA on Speech Emotion Recognition
 
M sc thesis_presentation_
M sc thesis_presentation_M sc thesis_presentation_
M sc thesis_presentation_
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
 
E0502 01 2327
E0502 01 2327E0502 01 2327
E0502 01 2327
 
05 comparative study of voice print based acoustic features mfcc and lpcc
05 comparative study of voice print based acoustic features mfcc and lpcc05 comparative study of voice print based acoustic features mfcc and lpcc
05 comparative study of voice print based acoustic features mfcc and lpcc
 
Wavelet Based Noise Robust Features for Speaker Recognition
Wavelet Based Noise Robust Features for Speaker RecognitionWavelet Based Noise Robust Features for Speaker Recognition
Wavelet Based Noise Robust Features for Speaker Recognition
 
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...
 
A Novel, Robust, Hierarchical, Text-Independent Speaker Recognition Technique
A Novel, Robust, Hierarchical, Text-Independent Speaker Recognition TechniqueA Novel, Robust, Hierarchical, Text-Independent Speaker Recognition Technique
A Novel, Robust, Hierarchical, Text-Independent Speaker Recognition Technique
 
Real-Time Non-Intrusive Speech Quality Estimation: A Signal-Based Model
Real-Time Non-Intrusive Speech Quality Estimation: A Signal-Based ModelReal-Time Non-Intrusive Speech Quality Estimation: A Signal-Based Model
Real-Time Non-Intrusive Speech Quality Estimation: A Signal-Based Model
 
International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)
 
Single Channel Speech Enhancement using Wiener Filter and Compressive Sensing
Single Channel Speech Enhancement using Wiener Filter and Compressive Sensing Single Channel Speech Enhancement using Wiener Filter and Compressive Sensing
Single Channel Speech Enhancement using Wiener Filter and Compressive Sensing
 
Dynamic Audio-Visual Client Recognition modelling
Dynamic Audio-Visual Client Recognition modellingDynamic Audio-Visual Client Recognition modelling
Dynamic Audio-Visual Client Recognition modelling
 
IRJET- Emotion recognition using Speech Signal: A Review
IRJET-  	  Emotion recognition using Speech Signal: A ReviewIRJET-  	  Emotion recognition using Speech Signal: A Review
IRJET- Emotion recognition using Speech Signal: A Review
 
LPC Models and Different Speech Enhancement Techniques- A Review
LPC Models and Different Speech Enhancement Techniques- A ReviewLPC Models and Different Speech Enhancement Techniques- A Review
LPC Models and Different Speech Enhancement Techniques- A Review
 

Semelhante a An effective evaluation study of objective measures using spectral subtractive enhanced signal

IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD Editor
 
Speech enhancement using spectral subtraction technique with minimized cross ...
Speech enhancement using spectral subtraction technique with minimized cross ...Speech enhancement using spectral subtraction technique with minimized cross ...
Speech enhancement using spectral subtraction technique with minimized cross ...
eSAT Journals
 
Raichlin et al_Mobile application for Speech Rate
Raichlin et al_Mobile application for Speech Rate Raichlin et al_Mobile application for Speech Rate
Raichlin et al_Mobile application for Speech Rate
Katia Raichlin-Levi
 
Voice Morphing System for People Suffering from Laryngectomy
Voice Morphing System for People Suffering from LaryngectomyVoice Morphing System for People Suffering from Laryngectomy
Voice Morphing System for People Suffering from Laryngectomy
International Journal of Science and Research (IJSR)
 
Mfcc based enlargement of the training set for emotion recognition in speech
Mfcc based enlargement of the training set for emotion recognition in speechMfcc based enlargement of the training set for emotion recognition in speech
Mfcc based enlargement of the training set for emotion recognition in speech
sipij
 

Semelhante a An effective evaluation study of objective measures using spectral subtractive enhanced signal (20)

IRJET- Survey on Efficient Signal Processing Techniques for Speech Enhancement
IRJET- Survey on Efficient Signal Processing Techniques for Speech EnhancementIRJET- Survey on Efficient Signal Processing Techniques for Speech Enhancement
IRJET- Survey on Efficient Signal Processing Techniques for Speech Enhancement
 
Review Paper on Noise Reduction Using Different Techniques
Review Paper on Noise Reduction Using Different TechniquesReview Paper on Noise Reduction Using Different Techniques
Review Paper on Noise Reduction Using Different Techniques
 
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
 
A novel speech enhancement technique
A novel speech enhancement techniqueA novel speech enhancement technique
A novel speech enhancement technique
 
Speech enhancement using spectral subtraction technique with minimized cross ...
Speech enhancement using spectral subtraction technique with minimized cross ...Speech enhancement using spectral subtraction technique with minimized cross ...
Speech enhancement using spectral subtraction technique with minimized cross ...
 
Kc3517481754
Kc3517481754Kc3517481754
Kc3517481754
 
Effect of MFCC Based Features for Speech Signal Alignments
Effect of MFCC Based Features for Speech Signal AlignmentsEffect of MFCC Based Features for Speech Signal Alignments
Effect of MFCC Based Features for Speech Signal Alignments
 
01 8445 speech enhancement
01 8445 speech enhancement01 8445 speech enhancement
01 8445 speech enhancement
 
Nd2421622165
Nd2421622165Nd2421622165
Nd2421622165
 
Comparative performance analysis of channel normalization techniques
Comparative performance analysis of channel normalization techniquesComparative performance analysis of channel normalization techniques
Comparative performance analysis of channel normalization techniques
 
Optimized audio classification and segmentation algorithm by using ensemble m...
Optimized audio classification and segmentation algorithm by using ensemble m...Optimized audio classification and segmentation algorithm by using ensemble m...
Optimized audio classification and segmentation algorithm by using ensemble m...
 
Ijetcas14 426
Ijetcas14 426Ijetcas14 426
Ijetcas14 426
 
Raichlin et al_Mobile application for Speech Rate
Raichlin et al_Mobile application for Speech Rate Raichlin et al_Mobile application for Speech Rate
Raichlin et al_Mobile application for Speech Rate
 
Voice Morphing System for People Suffering from Laryngectomy
Voice Morphing System for People Suffering from LaryngectomyVoice Morphing System for People Suffering from Laryngectomy
Voice Morphing System for People Suffering from Laryngectomy
 
Mfcc based enlargement of the training set for emotion recognition in speech
Mfcc based enlargement of the training set for emotion recognition in speechMfcc based enlargement of the training set for emotion recognition in speech
Mfcc based enlargement of the training set for emotion recognition in speech
 
Analysis of Suitable Extraction Methods and Classifiers For Speaker Identific...
Analysis of Suitable Extraction Methods and Classifiers For Speaker Identific...Analysis of Suitable Extraction Methods and Classifiers For Speaker Identific...
Analysis of Suitable Extraction Methods and Classifiers For Speaker Identific...
 
Performance enhancement of dct based speaker recognition using wavelet de noi...
Performance enhancement of dct based speaker recognition using wavelet de noi...Performance enhancement of dct based speaker recognition using wavelet de noi...
Performance enhancement of dct based speaker recognition using wavelet de noi...
 
Ijetcas14 390
Ijetcas14 390Ijetcas14 390
Ijetcas14 390
 
Dy36749754
Dy36749754Dy36749754
Dy36749754
 
Sentiment analysis by deep learning approaches
Sentiment analysis by deep learning approachesSentiment analysis by deep learning approaches
Sentiment analysis by deep learning approaches
 

Mais de eSAT Journals

Material management in construction – a case study
Material management in construction – a case studyMaterial management in construction – a case study
Material management in construction – a case study
eSAT Journals
 
Laboratory studies of dense bituminous mixes ii with reclaimed asphalt materials
Laboratory studies of dense bituminous mixes ii with reclaimed asphalt materialsLaboratory studies of dense bituminous mixes ii with reclaimed asphalt materials
Laboratory studies of dense bituminous mixes ii with reclaimed asphalt materials
eSAT Journals
 
Geographical information system (gis) for water resources management
Geographical information system (gis) for water resources managementGeographical information system (gis) for water resources management
Geographical information system (gis) for water resources management
eSAT Journals
 
Estimation of surface runoff in nallur amanikere watershed using scs cn method
Estimation of surface runoff in nallur amanikere watershed using scs cn methodEstimation of surface runoff in nallur amanikere watershed using scs cn method
Estimation of surface runoff in nallur amanikere watershed using scs cn method
eSAT Journals
 
Effect of variation of plastic hinge length on the results of non linear anal...
Effect of variation of plastic hinge length on the results of non linear anal...Effect of variation of plastic hinge length on the results of non linear anal...
Effect of variation of plastic hinge length on the results of non linear anal...
eSAT Journals
 

Mais de eSAT Journals (20)

Mechanical properties of hybrid fiber reinforced concrete for pavements
Mechanical properties of hybrid fiber reinforced concrete for pavementsMechanical properties of hybrid fiber reinforced concrete for pavements
Mechanical properties of hybrid fiber reinforced concrete for pavements
 
Material management in construction – a case study
Material management in construction – a case studyMaterial management in construction – a case study
Material management in construction – a case study
 
Managing drought short term strategies in semi arid regions a case study
Managing drought    short term strategies in semi arid regions  a case studyManaging drought    short term strategies in semi arid regions  a case study
Managing drought short term strategies in semi arid regions a case study
 
Life cycle cost analysis of overlay for an urban road in bangalore
Life cycle cost analysis of overlay for an urban road in bangaloreLife cycle cost analysis of overlay for an urban road in bangalore
Life cycle cost analysis of overlay for an urban road in bangalore
 
Laboratory studies of dense bituminous mixes ii with reclaimed asphalt materials
Laboratory studies of dense bituminous mixes ii with reclaimed asphalt materialsLaboratory studies of dense bituminous mixes ii with reclaimed asphalt materials
Laboratory studies of dense bituminous mixes ii with reclaimed asphalt materials
 
Laboratory investigation of expansive soil stabilized with natural inorganic ...
Laboratory investigation of expansive soil stabilized with natural inorganic ...Laboratory investigation of expansive soil stabilized with natural inorganic ...
Laboratory investigation of expansive soil stabilized with natural inorganic ...
 
Influence of reinforcement on the behavior of hollow concrete block masonry p...
Influence of reinforcement on the behavior of hollow concrete block masonry p...Influence of reinforcement on the behavior of hollow concrete block masonry p...
Influence of reinforcement on the behavior of hollow concrete block masonry p...
 
Influence of compaction energy on soil stabilized with chemical stabilizer
Influence of compaction energy on soil stabilized with chemical stabilizerInfluence of compaction energy on soil stabilized with chemical stabilizer
Influence of compaction energy on soil stabilized with chemical stabilizer
 
Geographical information system (gis) for water resources management
Geographical information system (gis) for water resources managementGeographical information system (gis) for water resources management
Geographical information system (gis) for water resources management
 
Forest type mapping of bidar forest division, karnataka using geoinformatics ...
Forest type mapping of bidar forest division, karnataka using geoinformatics ...Forest type mapping of bidar forest division, karnataka using geoinformatics ...
Forest type mapping of bidar forest division, karnataka using geoinformatics ...
 
Factors influencing compressive strength of geopolymer concrete
Factors influencing compressive strength of geopolymer concreteFactors influencing compressive strength of geopolymer concrete
Factors influencing compressive strength of geopolymer concrete
 
Experimental investigation on circular hollow steel columns in filled with li...
Experimental investigation on circular hollow steel columns in filled with li...Experimental investigation on circular hollow steel columns in filled with li...
Experimental investigation on circular hollow steel columns in filled with li...
 
Experimental behavior of circular hsscfrc filled steel tubular columns under ...
Experimental behavior of circular hsscfrc filled steel tubular columns under ...Experimental behavior of circular hsscfrc filled steel tubular columns under ...
Experimental behavior of circular hsscfrc filled steel tubular columns under ...
 
Evaluation of punching shear in flat slabs
Evaluation of punching shear in flat slabsEvaluation of punching shear in flat slabs
Evaluation of punching shear in flat slabs
 
Evaluation of performance of intake tower dam for recent earthquake in india
Evaluation of performance of intake tower dam for recent earthquake in indiaEvaluation of performance of intake tower dam for recent earthquake in india
Evaluation of performance of intake tower dam for recent earthquake in india
 
Evaluation of operational efficiency of urban road network using travel time ...
Evaluation of operational efficiency of urban road network using travel time ...Evaluation of operational efficiency of urban road network using travel time ...
Evaluation of operational efficiency of urban road network using travel time ...
 
Estimation of surface runoff in nallur amanikere watershed using scs cn method
Estimation of surface runoff in nallur amanikere watershed using scs cn methodEstimation of surface runoff in nallur amanikere watershed using scs cn method
Estimation of surface runoff in nallur amanikere watershed using scs cn method
 
Estimation of morphometric parameters and runoff using rs & gis techniques
Estimation of morphometric parameters and runoff using rs & gis techniquesEstimation of morphometric parameters and runoff using rs & gis techniques
Estimation of morphometric parameters and runoff using rs & gis techniques
 
Effect of variation of plastic hinge length on the results of non linear anal...
Effect of variation of plastic hinge length on the results of non linear anal...Effect of variation of plastic hinge length on the results of non linear anal...
Effect of variation of plastic hinge length on the results of non linear anal...
 
Effect of use of recycled materials on indirect tensile strength of asphalt c...
Effect of use of recycled materials on indirect tensile strength of asphalt c...Effect of use of recycled materials on indirect tensile strength of asphalt c...
Effect of use of recycled materials on indirect tensile strength of asphalt c...
 

Último

Hospital management system project report.pdf
Hospital management system project report.pdfHospital management system project report.pdf
Hospital management system project report.pdf
Kamal Acharya
 
Digital Communication Essentials: DPCM, DM, and ADM .pptx
Digital Communication Essentials: DPCM, DM, and ADM .pptxDigital Communication Essentials: DPCM, DM, and ADM .pptx
Digital Communication Essentials: DPCM, DM, and ADM .pptx
pritamlangde
 

Último (20)

Hospital management system project report.pdf
Hospital management system project report.pdfHospital management system project report.pdf
Hospital management system project report.pdf
 
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
 
Computer Networks Basics of Network Devices
Computer Networks  Basics of Network DevicesComputer Networks  Basics of Network Devices
Computer Networks Basics of Network Devices
 
💚Trustworthy Call Girls Pune Call Girls Service Just Call 🍑👄6378878445 🍑👄 Top...
💚Trustworthy Call Girls Pune Call Girls Service Just Call 🍑👄6378878445 🍑👄 Top...💚Trustworthy Call Girls Pune Call Girls Service Just Call 🍑👄6378878445 🍑👄 Top...
💚Trustworthy Call Girls Pune Call Girls Service Just Call 🍑👄6378878445 🍑👄 Top...
 
data_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfdata_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdf
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . ppt
 
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKARHAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
HAND TOOLS USED AT ELECTRONICS WORK PRESENTED BY KOUSTAV SARKAR
 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - V
 
Unleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leapUnleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leap
 
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptxS1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
 
Online electricity billing project report..pdf
Online electricity billing project report..pdfOnline electricity billing project report..pdf
Online electricity billing project report..pdf
 
Design For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the startDesign For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the start
 
Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Service
Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best ServiceTamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Service
Tamil Call Girls Bhayandar WhatsApp +91-9930687706, Best Service
 
Digital Communication Essentials: DPCM, DM, and ADM .pptx
Digital Communication Essentials: DPCM, DM, and ADM .pptxDigital Communication Essentials: DPCM, DM, and ADM .pptx
Digital Communication Essentials: DPCM, DM, and ADM .pptx
 
Employee leave management system project.
Employee leave management system project.Employee leave management system project.
Employee leave management system project.
 
Double Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueDouble Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torque
 
PE 459 LECTURE 2- natural gas basic concepts and properties
PE 459 LECTURE 2- natural gas basic concepts and propertiesPE 459 LECTURE 2- natural gas basic concepts and properties
PE 459 LECTURE 2- natural gas basic concepts and properties
 
A Study of Urban Area Plan for Pabna Municipality
A Study of Urban Area Plan for Pabna MunicipalityA Study of Urban Area Plan for Pabna Municipality
A Study of Urban Area Plan for Pabna Municipality
 
AIRCANVAS[1].pdf mini project for btech students
AIRCANVAS[1].pdf mini project for btech studentsAIRCANVAS[1].pdf mini project for btech students
AIRCANVAS[1].pdf mini project for btech students
 
Jaipur ❤CALL GIRL 0000000000❤CALL GIRLS IN Jaipur ESCORT SERVICE❤CALL GIRL IN...
Jaipur ❤CALL GIRL 0000000000❤CALL GIRLS IN Jaipur ESCORT SERVICE❤CALL GIRL IN...Jaipur ❤CALL GIRL 0000000000❤CALL GIRLS IN Jaipur ESCORT SERVICE❤CALL GIRL IN...
Jaipur ❤CALL GIRL 0000000000❤CALL GIRLS IN Jaipur ESCORT SERVICE❤CALL GIRL IN...
 

An effective evaluation study of objective measures using spectral subtractive enhanced signal

  • 1. IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308 _______________________________________________________________________________________ Volume: 04 Issue: 03 | Mar-2015, Available @ http://www.ijret.org 361 AN EFFECTIVE EVALUATION STUDY OF OBJECTIVE MEASURES USING SPECTRAL SUBTRACTIVE ENHANCED SIGNAL Sukhvinder Kaur1 , Anil Garg2 1 M.Tech Student MMEC, Mullana, M.M. University, Mullana-Ambala-Haryana, India 2 Associate Prof. in ECE department MMEC, Mullana, M.M. University, Mullana-Ambala-Haryana, India Abstract Unwanted noises have a negative influence over communication because it disturbs the conversation and make the communication impossible. Speech enhancement algorithms are used for improving the quality and intelligibility or to reduce listener fatigues. Assessment of speech quality can be done by using either subjective listening test or objective quality measure. Evaluation of several objective measures with the speech processed by enhancement algorithms has been performed but these having limitations to assess original speech signal. This paper represents the study of speech quality measures and compute the values used for regression analyses of the objective measures evaluation study using spectral subtraction algorithm based enhanced speech signal. Keywords: MOS, ITU-T (P.835), SNRseg, log- likelihood ratio and itakura-saito. ---------------------------------------------------------------------***-------------------------------------------------------------------- 1. INTRODUCTION Speech Communication is an integral part of daily life. Communication systems include various applications such as, mobile phones, speech coding and compression, medical devices such as hearing aids etc. All these are associated with different types of noise or sounds. Unwanted sounds are commonly referred as disturbances. It can be both noise or interferences and which are not acceptable in the communication process. Therefore, unwanted sounds are having a negative influence over the communication because they may disturb the conversation and make it impossible to communicate at all. Signal Processing methods are the effective way for aiding the speech communication by reducing the disturbance levels with respect to the level of the speech. Such speech processing methods are known as speech enhancement methods. Speech enhancement algorithms improve the quality and intelligibility of speech or reduce listener fatigues. Speech enhancement has several real world applications which include: Telecommunications, Electronic hearing aids and Automatic speaker recognition software. These all are of important quality and intelligibility of speech which vastly improves the user’s listening experience. Two criteria are which generally used for measuring the performance of speech: quality and intelligibility. It is quite hard to satisfy both of them at the same time. Quality is the subjective measure which indicates the amount of effort needed to understand the speech material. Whereas Intelligibility, is an objective measure which signifies the amount of speech material correctly understood. This paper is represented in such a manner that section 2 provides brief review of the different speech enhancement techniques and spectral subtraction algorithm. The speech quality measures are discussed in section 3. In the Section 4 evaluated results are shown and the last section 5 represents the conclusion part. References are given in the last section of the paper. 2. SPEECH ENHANCEMENT TECHNIQUES Speech enhancement methods can be classified in two ways: (1) Single channel enhancement technique (2) Multi channel enhancement technique. Single channel enhancement techniques are applied to situations in which only one acquisition channel is available. Multi channel enhancement techniques are employed in microphone arrays and take advantage of availability of multiple signal inputs to system, by making possible use of phase alignment to reject the undesired noise components. Single channel enhancement techniques are further divided again into different categories: Short Time Spectral Amplitude Estimation (i.e. Spectral subtraction and Non Causal Weiner Filtering), Periodicity of Voiced Speech based, Adaptive Comb Filtering based, Harmonic Selection based, adaptive noise canceling techniques based, Speech model based (i.e. Kalman Filtering Hidden Markov Modelling(HMM)). Spectral-subtractive algorithms: Is the simplest speech enhancement algorithms based on the principle of estimating the clean signal spectrum by subtracting an estimated noise spectrum from the noisy speech spectrum. Statistical model based algorithm: are based on the estimation of linear and non linear transform coefficients of clean signal by measuring the Fourier transform coefficients of the noisy signal. E.g., Wiener algorithm and minimum mean square error (MMSE) algorithms. Subspace Algorithms: are based on the principle that the clean signals estimation is done by simply nulling the component of the noisy vector residing in the noise subspace.
  • 2. IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308 _______________________________________________________________________________________ Volume: 04 Issue: 03 | Mar-2015, Available @ http://www.ijret.org 362 2.1 Spectral-Subtractive Algorithms The spectral subtraction algorithm is historically one of the first algorithms proposed for noise reduction based on the simple principle of assuming additive noise. Estimated clean signal spectrum can be obtained by subtracting an average noise spectrum from the noisy speech spectrum. The noise spectrum can be estimated, and updated, during the periods when the signal is absent or when only noise is present. Subtraction process needs to be done carefully to avoid speech distortions. If too much subtraction is done, then some speech information might be removed while if the subtraction is too little then much of the interfering noise remains. 3. SPEECH QUALITY MEASURES Speech processing algorithms usage is rapidly increasing, which raises the need of speech quality evaluation. The main evaluation method of speech enhancement system affects the intelligibility of the speech signal and improves the overall quality of the signal. The assessment of speech quality can be done by either using subjective listening tests or objective quality measures i.e. Subjective measures and Objective measures. 3.1 Subjective Measures Subjective evaluation involves comparisons of original and processed speech signals by a group of listeners who are asked to rate the quality of speech along a pre-determined scale. This evaluation method requires the judgment of human listeners. These can be further divided into two categories: (1) Relative Preference Methods - multiple signals are compared. (2)Absolute Category Rating Methods - a single stimulus is tested. Table 1.1 MOS rating scale Rating Speech quality Level of distortion 5 Excellent Imperceptible 4 Good Just perceptible, but not annoying 3 Fair Perceptible and slightly annoying 2 Poor Annoying , but not objectionable 1 Bad Very annoying and objectionable MOS (Mean Opinion score) is the most widely used subjective evaluation method [8]. The MOS test is based on five category rating scale of the speech quality given in Table 1.1. ITU-T standard (P.835) is designed for integrating the effect of signal and background distortions. Subjective evaluation is the reliable method for assessment of speech quality but these are time consuming and expensive. This evaluation method is stringent in nature as it needed a set of trained listeners for determining the quality of speech. Therefore need of objective evaluation arises. 3.2 Objective Measures Objective evaluation involves a mathematical comparison of original and processed speech. In this evaluation method the judgment is predicted with some analysis of the system. In this method quality is quantified by measuring the numerical distance between the original and processed signals. There are different types of objective quality measures which are further categorized as: (1) Time and frequency signal to noise ratio measures, (2) Spectral distance measure based on LPC, (3) Perceptually Motivated measures and (4) Composite measures. The objective measures accesses the quality of the processed speech without accessing the original speech signal. Objective measures of speech quality are implemented by first segmenting the speech signal into 10-30 ms frames, and then computing a distortion measure between the original and processed signals. Distortion measure computation can either be done in the time domain (e.g. signal to noise ratio measures) or in the frequency domain (LPC spectral distance measures). In the frequency domain measure distortions and differences detected in the magnitude spectra are correlated with speech quality. In Time and Frequency Signal to Noise Ratio Measures, the segmental signal to noise ratio can be evaluated either in terms of time or frequency domain. The original and processed signals both are aligned either in time domain or in frequency domain for the calculation. The main advantage of using frequency based segmental SNR over the time domain (SNRseg) is the flexibility to place different weights for different frequency bands of the spectrum. In the Spectral Distance Measure Based on LPC several LPC based objective measures are proposed on the basis of all pole models of the clean and enhanced speech signals. Three different LPC based objective measures include: The LLR (log- likelihood ratio) measure, the IS ( itakura-saito) measure and the Cepstrum distance measure. Cepstral distance measure is derived from the LPC coefficients. Perceptually Motivated Measures are used for the modeling of auditory processes like normal listening frequency selectivity and the perceived loudness. Weighted spectral slope (WSS), Bark distortion measures (BSD) and Perceptual evaluation of speech quality (PESQ) measures are the perceptually motivated measures. Composite Measures are formed by combining multiple objective measures. Regression analysis is used to compute the optimum combination of objective measure for maximum correlation. The selection of objective measures for composite measures depends basically on the experimental evidence and intuition. MARS technique can be used for composite measure evaluation. Figure of Merits are obtained by statistical analysis. This analysis is necessary for the validity of evaluation method of objective measures. Statistical analysis is used to assess the correlation between subjective score and the values of the
  • 3. IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308 _______________________________________________________________________________________ Volume: 04 Issue: 03 | Mar-2015, Available @ http://www.ijret.org 363 objective measures. Two types of figure of merit are calculated in statistical analysis: (1) Pearson’s correlation coefficient. (2) Standard error of the estimate. The first figure of merit Pearson’s correlation coefficient is used to obtain the correlation between subjective listening scores and objective measures. Second figure of merit is an estimate of the standard deviation of the error obtained by objective measures to predict subjective listening scores. 3.2.1 Time and Frequency Signal to Noise Ratio Measures The segmental signal to noise ratio is evaluated either in terms of time or frequency domain. The time domain measure is the simplest objective measure which is used to evaluate speech enhancement and speech coding algorithms. In this measure original and processed signal are aligned in time and phase errors are present. The segmental signal to noise ratio is given by eqn. (3.1) (3.1) Where is the original signal (clean), is the enhanced signal, is the frame length and is the number of frames in the signal. The segmental SNR in terms of frequency domain produce the frequency weighted segmental SNR is given in eqn. (3.2) (3.2) Where is the weight placed on the jth frequency band, is the number of bands, is the total number of frames in the signal. is the filter bank amplitude (excitation spectrum) of the clean signal and is the filter bank amplitude of enhanced signal in the same bank. 3.2.2 LPC based Objective Measures LPC based objective measures including the log likelihood ratio (LLR), Itakura Saito distance measure (IS), cepstrum distance measure (CEP).The two most commonly used measures are LLR and IS. Cepstral distance measure is derived from the LPC coefficients.The log likelihood ratio (LLR) measure is defined as in eqn. (3.3) Where the coefficients of the clean signal are, are the coefficients of enhanced signal and is the autocorrelation matrix of clean signal.The Itakura Saito(IS) measure is defined as in eqn.(3.4) (3.4) Where and are the all pole gains of the clean and enhanced signals.The cepstrum distance measure (CEP) is defined as in eqn. (3.5) (3.5) Where and are the cepstrum coefficients of the clean and enhanced signals. 4. EVALUATION RESULTS Results evaluated for the Statistical Analysis of spectral subtractive enhanced signal is discussed in this section. In this paper, the clean and processed signal is used for statistical analysis. Speech signal is first segmented into frames and then standard deviation and mean values are computed for the noisy and enhanced signals shown in table 1.2. The original time waveform for clean noise, noisy and enhanced signals are shown in fig 1. These are used as the primary tool for the result analysis.
  • 4. IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308 _______________________________________________________________________________________ Volume: 04 Issue: 03 | Mar-2015, Available @ http://www.ijret.org 364 Fig 1: Time waveform representation of clean, noise, noisy and spectrally subtractive enhanced signal The spectrogram of clean, noise, noisy and enhanced signal shown in fig 2. Spectrogram is utilized for detecting the differences in the magnitude spectra of waveform information. The quality of the processed speech is determined by without needing access to the original speech signal.
  • 5. IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308 _______________________________________________________________________________________ Volume: 04 Issue: 03 | Mar-2015, Available @ http://www.ijret.org 365 Fig 2: Spectrogram representation of clean, noise, noisy and spectrally subtractive enhanced signal Computation of the LPC measure is done by estimating the correlation coefficient by using the mean value and standard deviation values. Table 1.2 shows the respective values of mean and standard deviation for different objective measures such as SNR, IS, LR, LLR, WSM. These values are the LPC coefficients that can be used for computing the objective measures. Table 1.2 Table for mean and standard deviation values of objective measure Objective measure Mean value Standard deviation SNR 1.388178 1.727957 IS 30.170003 21.972701 LR 0.205720 0.189852 LLR 0.176183 0.139966 WSM 3.686912 2.256240 SNR plot of enhanced signal with respect to the noisy signal is shown in fig 3. The plot shows the smaller values of the signal energy during the interval of silence in the speech signal results the larger negative value of (SNR). Fig 3: SNR plot with respect to the noisy signal spectrum
  • 6. IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308 _______________________________________________________________________________________ Volume: 04 Issue: 03 | Mar-2015, Available @ http://www.ijret.org 366 Itakura saito ratio (IS) plot of enhanced signal with respect to the noisy signal is shown in fig 4. The plot shows the smaller values of the signal energy during the interval when the speech in the processed signal is of highest amplitude i.e., speech is clearly understandable and distractive and larger values of IS shown for the noise present in processed signal. Fig 4: IS (itakura- saito ratio) plot with respect to the noisy signal spectrum LLR (log likelihood ratio) plot of enhanced signal with respect to the noisy signal is shown in fig 5. The plot shows the lower values of the signal energy during the interval of speech signal and results the larger peak value of (LLR) for the noise in the processed signal spectrum. Fig 5: LLR (log likelihood ratio) plot with respect to the noisy signal spectrum. 5. CONCLUSION Different speech enhancement algorithms have been proposed to improve the performance of modern communication devices in noisy environments. The key conclusion drawn from this paper is that speech quality assessment is done through subjective and objective evaluation measures. In this paper study of objective measure and computed values of the standard deviation and mean for noisy and enhanced signal are given.
  • 7. IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308 _______________________________________________________________________________________ Volume: 04 Issue: 03 | Mar-2015, Available @ http://www.ijret.org 367 It is very difficult to find reliable and fair comparison among different algorithms due to lack of common speech database used in algorithms evaluation, different types of noise and differences in the testing methodology. The most accurate method for evaluating speech quality is subjective listening tests but these are time consuming and expensive. This evaluation method is stringent in nature as it needed a set of trained listeners for determining the quality of speech. Therefore need of objective evaluation arises. As the frame length increases SNR value decreases monotonically. REFERENCES [1]. Loizou, P., 2007. Speech Enhancement: Theory and Practice. CRC Press, Boca Raton, FL. [2]. Y. Hu and P. C. Loizou, “Subjective evaluation and comparison of speech enhancement algorithms,” submitted to Speech Communication. [3]. Yi Hu and Philipos C. Loizou, “Evaluation of Objective Quality Measures for Speech Enhancement”, IEEE Trans. on Audio, Speech, and Language Processing, Vol. 16, No. 1, January 2008. [4]. P. Chu and D. Messerschmitt, “A frequency weighted itakura-saito spectral distance measure”, IEEE Trans. Acoustics, Speech and Signal Processing, vol. 30, pp. 545— 560, Aug. 1982. [5]. Anuradha R. Fukane, “Enhancement of Noisy Speech Signals for Hearing Aids”, 2011 International Conference on Communication Systems and Network Technologies, 978-0-7695-4437-3/11© 2011 IEEE [6]. Sunil D. Kamath,” A Multi-Band Spectral Subtraction Method for Enhancing Speech Corrupted By Colored Noise”, 2002 [7]. Jianfen Ma, Yi Hu and Philipos C. Loizou, “Objective measures for predicting speech intelligibility in noisy conditions based on new band-importance functions”, J. Acoust. Soc. Am., Vol. 125, No. 5, May 2009. [8]. Kuldip Paliwal, “Single-channel speech enhancement using spectral subtraction in the short-time modulation domain”, Elsevier, Speech Communication 52 450–475, 2010.