Open Access Open Access  Restricted Access Subscription Access

Identification of English Dialects and Emotions using Spectral and Prosodic Features of Speech Signal Processing

M. Chinna Rao, A. V. S. N. Murthy, Ch. Satyanarayana

Abstract


Abstract
In this paper, the authors have explored speech features to identify English dialects and emotions. A dialect is any distinguishable variety of a language spoken by a group of people. Emotions provide naturalness to speech. Speech database considered for dialect identification task consists of spontaneous speech spoken by male and female speakers. The emotions considered in this study are anger, disgust, fear, happy, neutral and sad. Prosodic and spectral features extracted from speech are used for discriminating the dialects and emotions. Spectral features are represented by Mel frequency cepstral coefficients (MFCC) and prosodic features are represented by durations of syllables, pitch and energy contours. Auto-associative neural network (AANN) models and support vector machines (SVM) are explored for capturing the dialect-specific and emotionspecific information from the above specified features. AANN models are expected to capture the nonlinear relations specific to dialects or emotions through the distributions of feature vectors. SVMs perform dialect or emotion classification based on discriminative characteristics present among the dialects or emotions. Classification systems are developed separately for dialect classification and emotion classification.

Keywords: emotion recognition, English dialect, prosodic features, spectral features, support vector machines


Full Text:

PDF

Refbacks

  • There are currently no refbacks.


Copyright (c) 2019 Journal of Computer Technology & Applications