Atjaunināt sīkdatņu piekrišanu

E-grāmata: Language Identification Using Excitation Source Features

  • Formāts: PDF+DRM
  • Sērija : SpringerBriefs in Speech Technology
  • Izdošanas datums: 15-Apr-2015
  • Izdevniecība: Springer International Publishing AG
  • Valoda: eng
  • ISBN-13: 9783319177250
  • Formāts - PDF+DRM
  • Cena: 53,52 €*
  • * ši ir gala cena, t.i., netiek piemērotas nekādas papildus atlaides
  • Ielikt grozā
  • Pievienot vēlmju sarakstam
  • Šī e-grāmata paredzēta tikai personīgai lietošanai. E-grāmatas nav iespējams atgriezt un nauda par iegādātajām e-grāmatām netiek atmaksāta.
  • Formāts: PDF+DRM
  • Sērija : SpringerBriefs in Speech Technology
  • Izdošanas datums: 15-Apr-2015
  • Izdevniecība: Springer International Publishing AG
  • Valoda: eng
  • ISBN-13: 9783319177250

DRM restrictions

  • Kopēšana (kopēt/ievietot):

    nav atļauts

  • Drukāšana:

    nav atļauts

  • Lietošana:

    Digitālo tiesību pārvaldība (Digital Rights Management (DRM))
    Izdevējs ir piegādājis šo grāmatu šifrētā veidā, kas nozīmē, ka jums ir jāinstalē bezmaksas programmatūra, lai to atbloķētu un lasītu. Lai lasītu šo e-grāmatu, jums ir jāizveido Adobe ID. Vairāk informācijas šeit. E-grāmatu var lasīt un lejupielādēt līdz 6 ierīcēm (vienam lietotājam ar vienu un to pašu Adobe ID).

    Nepieciešamā programmatūra
    Lai lasītu šo e-grāmatu mobilajā ierīcē (tālrunī vai planšetdatorā), jums būs jāinstalē šī bezmaksas lietotne: PocketBook Reader (iOS / Android)

    Lai lejupielādētu un lasītu šo e-grāmatu datorā vai Mac datorā, jums ir nepieciešamid Adobe Digital Editions (šī ir bezmaksas lietotne, kas īpaši izstrādāta e-grāmatām. Tā nav tas pats, kas Adobe Reader, kas, iespējams, jau ir jūsu datorā.)

    Jūs nevarat lasīt šo e-grāmatu, izmantojot Amazon Kindle.

This book discusses the contribution of excitation source information in discriminating language. The authors focus on the excitation source component of speech for enhancement of language identification (LID) performance. Language specific features are extracted using two different modes: (i) Implicit processing of linear prediction (LP) residual and (ii) Explicit parameterization of linear prediction residual. The book discusses how in implicit processing approach, excitation source features are derived from LP residual, Hilbert envelope (magnitude) of LP residual and Phase of LP residual; and in explicit parameterization approach, LP residual signal is processed in spectral domain to extract the relevant language specific features. The authors further extract source features from these modes, which are combined for enhancing the performance of LID systems. The proposed excitation source features are also investigated for LID in background noisy environments. Each chapter of this book provides the motivation for exploring the specific feature for LID task, and subsequently discuss the methods to extract those features and finally suggest appropriate models to capture the language specific knowledge from the proposed features. Finally, the book discuss about various combinations of spectral and source features, and the desired models to enhance the performance of LID systems.
1 Introduction
1(10)
1.1 Introduction
1(1)
1.2 Types of Language Identification Systems
2(1)
1.2.1 Explicit Language Identification System
2(1)
1.2.2 Implicit Language Identification System
3(1)
1.3 Features Used for Developing Speech Systems
3(2)
1.4 Issues in Developing Language Identification Systems
5(1)
1.5 Objective and Scope of the Work
6(1)
1.6 Contributions of the Book
6(1)
1.7 Organization of the Book
7(4)
References
8(3)
2 Language Identification---A Brief Review
11(20)
2.1 Prior Works on Explicit Language Identification System
11(6)
2.2 Prior Works on Implicit Language Identification System
17(4)
2.3 Prior Works on Excitation Source Features
21(2)
2.4 Motivation for the Present Work
23(4)
2.5 Summary
27(4)
References
27(4)
3 Implicit Excitation Source Features for Language Identification
31(22)
3.1 Introduction
31(1)
3.2 Speech Corpus
32(2)
3.2.1 Indian Institute of Technology Kharagpur Multi-Lingual Indian Language Speech Corpus (BTKGP-MLILSC)
32(2)
3.2.2 Oregon Graduate Institute Multi-Language Telephone-Based Speech (OGI-MLTS) Database
34(1)
3.3 Extraction of Implicit Excitation Source Information from Linear Prediction Residual
34(5)
3.3.1 Analytic Signal Representation of Linear Prediction Residual
35(1)
3.3.2 Implicit Processing of Linear Prediction Residual Signal
36(3)
3.3.3 Implicit Processing of Magnitude and Phase Components of Linear Prediction Residual
39(1)
3.4 Development of Language Identification Systems Using Implicit Excitation Source Features
39(3)
3.5 Performance Evaluation of LID Systems Developed Using Implicit Excitation Source Features
42(8)
3.6 Evaluation of LID Systems Developed Using Implicit Excitation Source Features on OGI-MLTS Database
50(1)
3.7 Summary
50(3)
References
51(2)
4 Parametric Excitation Source Features for Language Identification
53(24)
4.1 Introduction
53(1)
4.2 Parametric Representation of Excitation Source Information
54(12)
4.2.1 Parametric Representation of Sub-segmental Level Excitation Source Information
54(7)
4.2.2 Parametric Representation of Segmental Level Excitation Source Information
61(3)
4.2.3 Parametric Representation of Supra-Segmental Level Excitation Source Information
64(2)
4.3 Development of LID Systems Using Parametric Features of Excitation Source
66(2)
4.4 Performance Evaluation of LID Systems Developed Using Parametric Features of Excitation Source
68(6)
4.5 Performance Evaluation of LID Systems Developed Using Parametric Features of Excitation Source on OGI-MLTS Database
74(1)
4.6 Summary
74(3)
References
74(3)
5 Complementary and Robust Nature of Excitation Source Features for Language Identification
77(20)
5.1 Introduction
77(1)
5.2 Vocal Tract Features
78(1)
5.3 Development of Language Identification Systems Using Excitation Source and Vocal Tract Features
79(2)
5.4 Performance Evaluation of Source and System Integrated LID Systems
81(5)
5.5 Performance Evaluation of Source and System Integrated LID Systems on OGI-MLTS Database
86(1)
5.6 Robustness of Excitation Source Features
87(9)
5.6.1 Motivation for the Use of Excitation Source Information for Robust Language Identification
87(2)
5.6.2 Processing of Robust Excitation Source Features for Language Identification
89(1)
5.6.3 Evaluation of Robustness of Excitation Source Features for Language Identification
90(6)
5.7 Summary
96(1)
References
96(1)
6 Summary and Conclusion
97(4)
6.1 Summary of the Book
97(2)
6.2 Contributions of the Book
99(1)
6.3 Future Scope of Work
99(2)
References
100(1)
Appendix A Gaussian Mixture Model 101(4)
Appendix B Mel-Frequency Cepstral Coefficient (MFCC) Features 105(4)
Appendix C Evaluation of Excitation Source Features in Different Noisy Conditions 109