Preface |
|
ix | |
1 Introduction to Dialogue Systems |
|
1 | (15) |
|
1.1 Human-Computer Interaction and Speech Processing |
|
|
1 | (1) |
|
1.2 Spoken Dialogue Systems |
|
|
2 | (2) |
|
1.2.1 Technological Precedents |
|
|
3 | (1) |
|
1.3 Multimodal Dialogue Systems |
|
|
4 | (3) |
|
1.4 Multilingual Dialogue Systems |
|
|
7 | (1) |
|
1.5 Dialogue Systems Referenced in This Book |
|
|
7 | (4) |
|
1.6 Area Organisation and Research Directions |
|
|
11 | (2) |
|
|
13 | (2) |
|
|
15 | (1) |
2 Technologies Employed to Set Up Dialogue Systems |
|
16 | (38) |
|
|
16 | (18) |
|
2.1.1 Automatic Speech Recognition |
|
|
17 | (5) |
|
2.1.2 Natural Language Processing |
|
|
22 | (2) |
|
2.1.3 Face Localisation and Tracking |
|
|
24 | (2) |
|
|
26 | (2) |
|
2.1.5 Lip-reading Recognition |
|
|
28 | (2) |
|
2.1.6 Gesture Recognition |
|
|
30 | (3) |
|
2.1.7 Handwriting Recognition |
|
|
33 | (1) |
|
2.2 Multimodal Processing |
|
|
34 | (10) |
|
2.2.1 Multimodal Data Fusion |
|
|
34 | (2) |
|
2.2.2 Multimodal Data Storage |
|
|
36 | (5) |
|
2.2.3 Dialogue Management |
|
|
41 | (1) |
|
|
41 | (1) |
|
|
42 | (1) |
|
2.2.6 Response Generation |
|
|
43 | (1) |
|
|
44 | (7) |
|
|
44 | (3) |
|
2.3.2 Natural Language Generation |
|
|
47 | (1) |
|
|
48 | (3) |
|
|
51 | (1) |
|
2.3.5 Tactile/Haptic Generation |
|
|
51 | (1) |
|
|
51 | (2) |
|
|
53 | (1) |
3 Multimodal Dialogue Systems |
|
54 | (32) |
|
3.1 Benefits of Multimodal Interaction |
|
|
54 | (5) |
|
3.1.1 In Terms of System Input |
|
|
54 | (2) |
|
3.1.2 In Terms of System Processing |
|
|
56 | (2) |
|
3.1.3 In Terms of System Output |
|
|
58 | (1) |
|
3.2 Development of Multimodal Dialogue Systems |
|
|
59 | (25) |
|
3.2.1 Development Techniques |
|
|
59 | (4) |
|
|
63 | (4) |
|
3.2.3 Architectures of Multimodal Systems |
|
|
67 | (3) |
|
|
70 | (9) |
|
|
79 | (5) |
|
|
84 | (1) |
|
|
85 | (1) |
4 Multilingual Dialogue Systems |
|
86 | (32) |
|
4.1 Implications of Multilinguality in the Architecture of Dialogue Systems |
|
|
86 | (9) |
|
4.1.1 Consideration of Alternatives in Multilingual Dialogue Systems |
|
|
86 | (5) |
|
4.1.2 Interlingua Approach |
|
|
91 | (1) |
|
4.1.3 Semantic Frame Conversion Approach |
|
|
92 | (2) |
|
4.1.4 Dialogue-Control Centred Approach |
|
|
94 | (1) |
|
4.2 Multilingual Dialogue Systems Based on Interlingua |
|
|
95 | (12) |
|
|
95 | (3) |
|
|
98 | (2) |
|
|
100 | (7) |
|
4.3 Multilingual Dialogue Systems Based on Web Applications |
|
|
107 | (10) |
|
4.3.1 Requirements for Practical Multilingual Dialogue Systems |
|
|
108 | (1) |
|
4.3.2 Dialogue Systems Based on Web Applications |
|
|
108 | (3) |
|
4.3.3 Multilingual Dialogue Systems Based on the MVC Framework |
|
|
111 | (3) |
|
4.3.4 Implementation of Multilingual Voice Portals |
|
|
114 | (3) |
|
|
117 | (1) |
|
|
117 | (1) |
5 Dialogue Annotation, Modelling and Management |
|
118 | (33) |
|
|
118 | (6) |
|
5.1.1 Annotation of Spoken Dialogue Corpora |
|
|
118 | (3) |
|
5.1.2 Annotation of Multimodal Dialogue Corpora |
|
|
121 | (3) |
|
|
124 | (3) |
|
5.2.1 State-Transition Networks |
|
|
125 | (1) |
|
|
126 | (1) |
|
|
127 | (4) |
|
5.3.1 Interaction Strategies |
|
|
127 | (1) |
|
5.3.2 Confirmation Strategies |
|
|
128 | (3) |
|
5.4 Implications of Multimodality in the Dialogue Management |
|
|
131 | (10) |
|
5.4.1 Interaction Complexity |
|
|
131 | (2) |
|
|
133 | (1) |
|
5.4.3 Social and Emotional Dialogue |
|
|
134 | (1) |
|
5.4.4 Contextual Information |
|
|
135 | (2) |
|
|
137 | (3) |
|
5.4.6 Response Generation |
|
|
140 | (1) |
|
5.5 Implications of Multilinguality in the Dialogue Management |
|
|
141 | (3) |
|
5.5.1 Reference Resolution in Multilingual Dialogue Systems |
|
|
141 | (1) |
|
5.5.2 Ambiguity of Speech Acts in Multilingual Dialogue Systems |
|
|
142 | (1) |
|
5.5.3 Differences in the Interactive Behaviour of Multilingual Dialogue Systems |
|
|
143 | (1) |
|
5.6 Implications of Task Independency in the Dialogue Management |
|
|
144 | (5) |
|
5.6.1 Dialogue Task Classification |
|
|
144 | (2) |
|
5.6.2 Task Modification in Each Task Class |
|
|
146 | (3) |
|
|
149 | (1) |
|
|
150 | (1) |
6 Development Tools |
|
151 | (38) |
|
6.1 Tools for Spoken and Multilingual Dialogue Systems |
|
|
151 | (25) |
|
6.1.1 Tools to Develop System Modules |
|
|
151 | (8) |
|
6.1.2 Web-Oriented Standards and Tools for Spoken Dialogue Systems |
|
|
159 | (11) |
|
|
170 | (6) |
|
6.2 Standards and Tools for Multimodal Dialogue Systems |
|
|
176 | (11) |
|
6.2.1 Web-Oriented Multimodal Dialogue |
|
|
176 | (3) |
|
6.2.2 Face and Body Animation |
|
|
179 | (2) |
|
6.2.3 System Development Tools |
|
|
181 | (4) |
|
6.2.4 Multimodal Annotation Tools |
|
|
185 | (2) |
|
|
187 | (1) |
|
|
187 | (2) |
7 Assessment |
|
189 | (30) |
|
7.1 Overview of Evaluation Techniques |
|
|
189 | (3) |
|
7.1.1 Classification of Evaluation Techniques |
|
|
190 | (2) |
|
7.2 Evaluation of Spoken and Multilingual Dialogue Systems |
|
|
192 | (10) |
|
7.2.1 Subsystem-Level Evaluation |
|
|
192 | (4) |
|
7.2.2 End-to-End Evaluation |
|
|
196 | (1) |
|
7.2.3 Dialogue Processing Evaluation |
|
|
197 | (2) |
|
7.2.4 System-to-System Automatic Evaluation |
|
|
199 | (3) |
|
7.3 Evaluation of Multimodal Dialogue Systems |
|
|
202 | (15) |
|
7.3.1 System-Level Evaluation |
|
|
203 | (5) |
|
7.3.2 Subsystem-Level Evaluation |
|
|
208 | (2) |
|
7.3.3 Evaluation of Multimodal Data Fusion |
|
|
210 | (2) |
|
7.3.4 Evaluation of Animated Agents |
|
|
212 | (5) |
|
|
217 | (1) |
|
|
218 | (1) |
Appendix A Basic Tutorial on VoiceXML |
|
219 | (10) |
Appendix B Multimodal Databases |
|
229 | (4) |
Appendix C Coding Schemes for Multimodal Resources |
|
233 | (2) |
Appendix D URLs of Interest |
|
235 | (2) |
Appendix E List of Abbreviations |
|
237 | (2) |
References |
|
239 | (14) |
Index |
|
253 | |