| 1 | (46) |
| 2 | (1) |
1.2 Classical Pattern Recognition Paradigm | 3 | (6) |
1.2.1 Decision Theory and Pattern Recognition | 7 | (2) |
1.3 Interactive Pattern Recognition and Multimodal Interaction | 9 | (12) |
1.3.1 Using the Human Feedback Directly | 11 | (1) |
1.3.2 Explicitly Taking Interaction History into Account | 12 | (1) |
1.3.3 Interaction with Deterministic Feedback | 12 | (3) |
1.3.4 Interactive Pattern Recognition and Decision Theory | 15 | (1) |
1.3.5 Multimodal Interaction | 16 | (4) |
1.3.6 Feedback Decoding and Adaptive Learning | 20 | (1) |
1.4 Interaction Protocols and Assessment | 21 | (6) |
1.4.1 General Types of Interaction Protocols | 22 | (2) |
1.4.2 Left-to-Right Interactive-Predictive Processing | 24 | (1) |
| 24 | (1) |
1.4.4 Interaction with Weaker Feedback | 25 | (1) |
1.4.5 Interaction Without Input Data | 25 | (1) |
1.4.6 Assessing IPR Systems | 26 | (1) |
1.4.7 User Effort Estimation | 26 | (1) |
1.5 IPR Search and Confidence Estimation | 27 | (8) |
| 28 | (5) |
1.5.2 Confidence Estimation | 33 | (2) |
1.6 Machine Learning Paradigms for IPR | 35 | (12) |
| 36 | (4) |
| 40 | (1) |
1.6.3 Semi-Supervised Learning | 41 | (1) |
1.6.4 Reinforcement Learning | 41 | (2) |
| 43 | (4) |
2 Computer Assisted Transcription: General Framework | 47 | (14) |
| 47 | (1) |
2.2 Common Statistical Framework for HTR and ASR | 48 | (2) |
2.3 Common Statistical Framework for CATTI and CATS | 50 | (2) |
2.4 Adapting the Language Model | 52 | (1) |
2.5 Search and Decoding Methods | 52 | (6) |
2.5.1 Viterbi-Based Implementation | 53 | (1) |
2.5.2 Word-Graph Based Implementation | 54 | (4) |
| 58 | (3) |
| 58 | (3) |
3 Computer Assisted Transcription of Text Images | 61 | (38) |
3.1 Computer Assisted Transcription of Text Images: CATTI | 62 | (1) |
| 63 | (3) |
3.2.1 Word-Graph-Based Search Approach | 64 | (1) |
3.2.2 Word Graph Error-Correcting Parsing | 64 | (2) |
3.3 Increasing Interaction Ergonomics in CATTI: PA-CATTI | 66 | (4) |
3.3.1 Language Model and Search | 68 | (2) |
3.4 Multimodal Computer Assisted Transcription of Text Images: MM-CATTI | 70 | (5) |
3.4.1 Language Model and Search for MM-CATTI | 73 | (2) |
3.5 Non-interactive HTR Systems | 75 | (6) |
3.5.1 Main Off-Line HTR System Overview | 75 | (4) |
3.5.2 On-Line HTR Subsystem Overview | 79 | (2) |
3.6 Tasks, Experiments and Results | 81 | (13) |
| 82 | (6) |
| 88 | (6) |
| 94 | (5) |
| 96 | (3) |
4 Computer Assisted Transcription of Speech Signals | 99 | (20) |
4.1 Computer Assisted Transcription of Audio Streams | 100 | (1) |
| 100 | (1) |
4.3 Introduction to Automatic Speech Recognition | 101 | (2) |
| 101 | (1) |
4.3.2 Pre-process and Feature Extraction | 102 | (1) |
4.3.3 Statistical Speech Recognition | 102 | (1) |
| 103 | (1) |
4.5 Word-Graph-Based CATS | 103 | (4) |
4.5.1 Error Correcting Prefix Parsing | 104 | (1) |
4.5.2 A General Model for Probabilistic Prefix Parsing | 105 | (2) |
| 107 | (6) |
| 108 | (1) |
| 109 | (1) |
| 109 | (1) |
| 110 | (3) |
4.7 Multimodality in CATS | 113 | (2) |
| 115 | (1) |
| 115 | (1) |
| 116 | (1) |
| 116 | (3) |
| 117 | (2) |
5 Active Interaction and Learning in Handwritten Text Transcription | 119 | (16) |
| 119 | (2) |
| 121 | (1) |
5.3 Adaptation from Partially Supervised Transcriptions | 122 | (1) |
5.4 Active Interaction and Active Learning | 122 | (2) |
5.5 Balancing Error and Supervision Effort | 124 | (2) |
| 126 | (6) |
5.6.1 User Interaction Model | 126 | (1) |
5.6.2 Sequential Transcription Tasks | 127 | (1) |
5.6.3 Adaptation from Partially Supervised Transcriptions | 128 | (1) |
5.6.4 Active Interaction and Learning | 129 | (1) |
5.6.5 Balancing User Effort and Recognition Error | 130 | (2) |
| 132 | (3) |
| 132 | (3) |
6 Interactive Machine Translation | 135 | (18) |
| 136 | (2) |
6.1.1 Statistical Machine Translation | 136 | (2) |
6.2 Interactive Machine Translation | 138 | (3) |
6.2.1 Interactive Machine Translation with Confidence Estimation | 140 | (1) |
6.3 Search in Interactive Machine Translation | 141 | (3) |
6.3.1 Word-Graph Generation | 141 | (1) |
6.3.2 Error-Correcting Parsing | 142 | (1) |
6.3.3 Search for n-Best Completions | 143 | (1) |
6.4 Tasks, Experiments and Results | 144 | (5) |
6.4.1 Pre- and Post-processing | 145 | (1) |
| 145 | (1) |
6.4.3 Evaluation Measures | 145 | (1) |
| 146 | (2) |
6.4.5 Results Using Confidence Information | 148 | (1) |
| 149 | (4) |
| 150 | (3) |
7 Multi-Modality for Interactive Machine Translation | 153 | (16) |
| 153 | (1) |
7.2 Making Use of Weaker Feedback | 154 | (3) |
7.2.1 Non-explicit Positioning Pointer Actions | 154 | (2) |
7.2.2 Interaction-Explicit Pointer Actions | 156 | (1) |
7.3 Correcting Errors with Speech Recognition | 157 | (3) |
7.3.1 Unconstrained Speech Decoding (DEC) | 158 | (1) |
7.3.2 Prefix-Conditioned Speech Decoding (DEC-PREF) | 159 | (1) |
7.3.3 Prefix-Conditioned Speech Decoding (IMT-PREF) | 159 | (1) |
7.3.4 Prefix Selection (IMT-SEL) | 160 | (1) |
7.4 Correcting Errors with Handwritten Text Recognition | 160 | (2) |
7.5 Tasks, Experiments and Results | 162 | (4) |
7.5.1 Results when Incorporating Weaker Feedback | 162 | (1) |
7.5.2 Results for Speech as Input Feedback | 163 | (2) |
7.5.3 Results for Handwritten Text as Input Feedback | 165 | (1) |
| 166 | (3) |
| 167 | (2) |
8 Incremental and Adaptive Learning for Interactive Machine Translation | 169 | (10) |
| 169 | (1) |
| 170 | (4) |
8.2.1 Concept of On-Line Learning | 170 | (1) |
| 171 | (1) |
| 172 | (2) |
| 174 | (1) |
8.3.1 Active Learning on IMT via Confidence Measures | 174 | (1) |
8.3.2 Bayesian Adaptation | 174 | (1) |
| 175 | (1) |
| 176 | (3) |
| 176 | (3) |
| 179 | (16) |
| 180 | (2) |
9.2 Interactive Parsing Framework | 182 | (2) |
9.3 Confidence Measures in IP | 184 | (2) |
9.4 IP in Left-to-Right Depth-First Order | 186 | (2) |
9.4.1 Efficient Calculation of the Next Best Tree | 187 | (1) |
| 188 | (3) |
9.5.1 User Simulation Subsystem | 188 | (1) |
| 189 | (1) |
9.5.3 Experimental Results | 190 | (1) |
| 191 | (4) |
| 192 | (3) |
10 Interactive Text Generation | 195 | (14) |
| 195 | (2) |
10.1.1 Interactive Text Generation and Interactive Pattern Recognition | 196 | (1) |
10.2 Interactive Text Generation at the Word Level | 197 | (8) |
10.2.1 N-Gram Language Modeling | 198 | (1) |
10.2.2 Searching for a Suffix | 199 | (1) |
10.2.3 Optimal Greedy Prediction of Suffixes | 199 | (4) |
10.2.4 Dealing with Sentence Length | 203 | (1) |
10.2.5 Word-Level Experiments | 204 | (1) |
10.3 Predicting at Character Level | 205 | (2) |
10.3.1 Character-Level Experiments | 205 | (2) |
| 207 | (2) |
| 207 | (2) |
11 Interactive Image Retrieval | 209 | (18) |
| 209 | (1) |
11.2 Relevance Feedback for Image Retrieval | 210 | (8) |
11.2.1 Probabilistic Interaction Model | 210 | (3) |
11.2.2 Greedy Approximation Relevance Feedback Algorithm | 213 | (1) |
11.2.3 A Simplified Version of GARF | 214 | (1) |
| 214 | (1) |
11.2.5 Image Feature Extraction | 215 | (1) |
| 216 | (2) |
| 218 | (1) |
11.3 Multimodal Relevance Feedback | 218 | (9) |
11.3.1 Fusion by Refining | 219 | (1) |
| 219 | (1) |
| 220 | (2) |
11.3.4 Proposed Approach: Dynamic Linear Fusion | 222 | (1) |
| 223 | (2) |
| 225 | (1) |
| 225 | (2) |
12 Prototypes and Demonstrators | 227 | (40) |
| 228 | (3) |
12.1.1 Passive, Left-to-Right Protocol | 228 | (2) |
12.1.2 Passive, Desultory Protocol | 230 | (1) |
| 231 | (1) |
12.1.4 Prototype Evaluation | 231 | (1) |
12.2 MM-IHT: Multimodal Interactive Handwritten Transcription | 231 | (8) |
12.2.1 Prototype Description | 232 | (1) |
| 233 | (2) |
| 235 | (4) |
12.3 IST: Interactive Speech Transcription | 239 | (3) |
12.3.1 Prototype Description | 240 | (1) |
| 241 | (1) |
| 242 | (1) |
12.4 IMT: Interactive Machine Translation | 242 | (4) |
12.4.1 Prototype Description | 243 | (1) |
| 244 | (2) |
| 246 | (1) |
12.5 ITG: Interactive Text Generation | 246 | (5) |
12.5.1 Prototype Description | 247 | (2) |
| 249 | (1) |
| 250 | (1) |
12.6 MM-IP: Multimodal Interactive Parsing | 251 | (4) |
12.6.1 Prototype Description | 251 | (3) |
| 254 | (1) |
| 255 | (1) |
12.7 GIDOC: GIMP-Based Interactive Document Transcription | 255 | (6) |
12.7.1 Prototype Description | 255 | (5) |
| 260 | (1) |
| 260 | (1) |
12.8 RISE: Relevant Image Search Engine | 261 | (3) |
12.8.1 Prototype Description | 261 | (1) |
| 262 | (2) |
| 264 | (1) |
| 264 | (3) |
| 265 | (2) |
Glossary | 267 | (4) |
Index | 271 |