Speech Recognition Using Articulatory and Excitation Source Features
by
Rao, K. Sreenivasa. author.
Title
:
Speech Recognition Using Articulatory and Excitation Source Features
Author
:
Rao, K. Sreenivasa. author.
ISBN
:
9783319492209
Physical Description
:
XI, 92 p. 23 illus., 4 illus. in color. online resource.
Series
:
SpringerBriefs in Speech Technology, Studies in Speech Signal Processing, Natural Language Understanding, and Machine Learning
Contents
:
Introduction -- Literature Review -- Articulatory Features for Phone Recognition -- Excitation Source Features for Phone Recognition -- Articulatory and Excitation Source Features for Speech Recognition in Read, Extempore and Conversation Modes -- Conclusion -- Appendix A: MFCC Features -- Appendix B: Pattern Recognition Models.
Abstract
:
This book discusses the contribution of articulatory and excitation source information in discriminating sound units. The authors focus on the excitation source component of speech, and on the dynamics of various articulators during speech production, to enhance speech recognition (SR) performance. Speech recognition is analyzed for read, extempore, and conversation modes of speech. Five groups of articulatory features (AFs) are explored for speech recognition, in addition to conventional spectral features. Each chapter provides the motivation for exploring a specific feature for the SR task, discusses the methods to extract that feature, and finally suggests appropriate models to capture the sound-unit-specific knowledge from the proposed features. The authors close by discussing various combinations of spectral, articulatory, and source features, and the models best suited to enhancing the performance of SR systems.
Subject Heading
:
Natural language processing (Computer science).
Computational linguistics.
Signal, Image and Speech Processing. http://scigraph.springernature.com/things/product-market-codes/T24051
Natural Language Processing (NLP). http://scigraph.springernature.com/things/product-market-codes/I21040
Computational Linguistics. http://scigraph.springernature.com/things/product-market-codes/N22000
Added Author
:
K E, Manjunath.
Added Corporate Author
:
SpringerLink (Online service)
Electronic Access
:
Material Type | Barcode | Call Number | Status/Due Date |
---|---|---|---|
Electronic Book | 224910-1001 | TK5102.9 | Springer E-Book Collection |