|
|
1 | (4) |
|
|
5 | (28) |
|
|
5 | (3) |
|
|
8 | (21) |
|
|
9 | (2) |
|
|
11 | (3) |
|
|
14 | (15) |
|
|
29 | (4) |
|
Speech Analysis Techniques |
|
|
33 | (18) |
|
Sampling the Speech Waveform |
|
|
33 | (3) |
|
|
36 | (2) |
|
|
38 | (2) |
|
|
40 | (2) |
|
Discrete Fourier Transform |
|
|
42 | (3) |
|
|
43 | (2) |
|
Windowing Signal Segments |
|
|
45 | (6) |
|
Linear Prediction Vocal Tract Modeling |
|
|
51 | (14) |
|
Sound Propagation in the Vocal Tract |
|
|
51 | (6) |
|
|
55 | (2) |
|
Estimation of LP Parameters |
|
|
57 | (3) |
|
Autocorrelation Method of Parameter Estimation |
|
|
58 | (1) |
|
|
59 | (1) |
|
Transformations of LP Parameters Quantization |
|
|
60 | (1) |
|
|
60 | (1) |
|
Line Spectral Frequencies |
|
|
60 | (1) |
|
|
61 | (4) |
|
|
65 | (14) |
|
Autocorrelation Pitch Estimation |
|
|
66 | (6) |
|
Autocorrelation of Center-Clipped Speech |
|
|
68 | (1) |
|
|
69 | (3) |
|
Energy Normalized Correlation |
|
|
72 | (1) |
|
Cepstral Pitch Extraction |
|
|
72 | (4) |
|
Frequency-Domain Error Minimization |
|
|
76 | (1) |
|
|
77 | (2) |
|
|
77 | (1) |
|
Dynamic Programming Tracking |
|
|
78 | (1) |
|
Auditory Information Processing |
|
|
79 | (10) |
|
The Basilar Membrane: A Spectrum Analyzer |
|
|
79 | (1) |
|
|
80 | (3) |
|
Thresholds of Audibility and Detectability |
|
|
83 | (2) |
|
|
85 | (4) |
|
Simultaneous Masking in Frequency |
|
|
85 | (2) |
|
|
87 | (2) |
|
Quantization and Waveform Coders |
|
|
89 | (24) |
|
|
90 | (3) |
|
Uniform Pulse Code Modulation (PCM) |
|
|
90 | (3) |
|
|
93 | (1) |
|
Nonuniform Pulse Code Modulation |
|
|
94 | (1) |
|
Differential Waveform Coding |
|
|
94 | (5) |
|
Predictive Differential Coding |
|
|
96 | (1) |
|
|
97 | (2) |
|
|
99 | (4) |
|
Adaptive Delta Modulation |
|
|
99 | (1) |
|
Adaptive Differential Pulse Code Modulation (AD-PCM) |
|
|
99 | (4) |
|
|
103 | (10) |
|
|
105 | (2) |
|
|
107 | (1) |
|
Complexity Reduction Approaches |
|
|
108 | (2) |
|
Predictive Vector Quantization |
|
|
110 | (3) |
|
|
113 | (10) |
|
|
114 | (1) |
|
|
114 | (1) |
|
|
115 | (1) |
|
|
115 | (5) |
|
|
116 | (1) |
|
|
117 | (2) |
|
Background Noise and Channel Conditions |
|
|
119 | (1) |
|
Perceptual Objective Measures |
|
|
120 | (3) |
|
|
123 | (16) |
|
|
125 | (3) |
|
Implementations of the Channel Vocoder |
|
|
126 | (2) |
|
|
128 | (2) |
|
The Sinusoidal Speech Coder |
|
|
130 | (3) |
|
|
130 | (1) |
|
Sinusoidal Parameter Analysis |
|
|
131 | (2) |
|
Linear Prediction Vocoder |
|
|
133 | (6) |
|
Federal Standard 1015, LPC-10e at 2.4 kbit/s |
|
|
137 | (2) |
|
Linear Prediction Analysis by Synthesis |
|
|
139 | (18) |
|
Analysis by Synthesis Estimation of Excitation |
|
|
140 | (1) |
|
Multi-Pulse Linear Prediction Coder |
|
|
141 | (1) |
|
Regular Pulse Excited LP Coder |
|
|
142 | (1) |
|
ETSI GSM Full Rate RPE-LTP |
|
|
143 | (1) |
|
Code Excited Linear Prediction Coder |
|
|
143 | (14) |
|
|
145 | (1) |
|
CELP Computational Efficiency Improvements |
|
|
146 | (2) |
|
|
148 | (1) |
|
Federal Standard 1016, CELP at 4.8 kbits/sec |
|
|
149 | (1) |
|
ITU-T G.728 Low Delay CELP at 16 kbit/s |
|
|
149 | (1) |
|
ITU G.723.1 Algebraic CELP/Multi-Pulse Coder at 5.3/6.3 kbit/s |
|
|
150 | (2) |
|
ETSI GSM Enhanced Full Rate Algebraic CELP at 12.2 kbit/s |
|
|
152 | (1) |
|
IS-641 EFR 7.4 kbit/s Algebraic CELP for IS-136 North American Digital Cellular |
|
|
153 | (1) |
|
ETSI GSM Adaptive Multi-Rate Algebraic CELP from 4.75 to 12.2 kbit/s |
|
|
154 | (3) |
|
|
157 | (36) |
|
Multi-Band Excitation Vocoder |
|
|
157 | (8) |
|
Multi-Band Excitation Analysis |
|
|
158 | (3) |
|
Multi-Band Excitation Synthesis |
|
|
161 | (2) |
|
Implementations of the MBE Vocoder |
|
|
163 | (2) |
|
Mixed Excitation Linear Prediction Coder |
|
|
165 | (11) |
|
Federal Standard MELP Coder at 2.4 kbit/s |
|
|
168 | (6) |
|
Improvements to MELP Coder |
|
|
174 | (2) |
|
|
176 | (3) |
|
Bit Allocations and Quality Results |
|
|
177 | (2) |
|
Harmonic Vector Excitation Coder |
|
|
179 | (6) |
|
|
179 | (3) |
|
|
182 | (3) |
|
|
185 | (1) |
|
Waveform Interpolation Coding |
|
|
185 | (8) |
|
|
186 | (2) |
|
Quantization of SEW and REW |
|
|
188 | (1) |
|
Performance and Enhancements |
|
|
189 | (4) |
|
|
193 | (12) |
|
Auditory Processing of Speech |
|
|
193 | (6) |
|
General Perceptual Speech Coder |
|
|
194 | (1) |
|
Frequency and Temporal Masking |
|
|
195 | (2) |
|
Determining Masking Levels |
|
|
197 | (2) |
|
Perceptual Coding Considerations |
|
|
199 | (3) |
|
Limits on Time/Frequency Resolution |
|
|
200 | (1) |
|
Sound Quality of Signal Components |
|
|
200 | (1) |
|
MBE Model for Perceptual Coding |
|
|
201 | (1) |
|
Research in Perceptual Speech Coding |
|
|
202 | (3) |
A Related Internet Sites |
|
205 | (4) |
|
A.1 Information on Coding Standards |
|
|
205 | (2) |
|
A.2 Technical Conferences |
|
|
207 | (2) |
References |
|
209 | (16) |
Index |
|
225 | |