Section | Start page | (Pages)
1 Introduction to Video Text Detection | 1 | (18)
1.1 Introduction to the Research of Video Text Detection | 1 | (4)
1.2 Characteristics and Difficulties of Video Text Detection | 5 | (2)
1.3 Relationship Between Video Text Detection and Other Fields | 7 | (1)
1.4 A Brief History of Video Text Detection | 8 | (6)
1.5 Potential Applications | 14 | (5)
| 16 | (3)
| 19 | (30)
2.1 Preprocessing Operators | 20 | (6)
2.1.1 Image Cropping and Local Operators | 21 | (1)
2.1.2 Neighborhood Operators | 22 | (3)
2.1.3 Morphology Operators | 25 | (1)
2.2 Color-Based Preprocessing | 26 | (3)
| 29 | (5)
| 34 | (5)
| 39 | (5)
| 44 | (5)
| 46 | (3)
3 Video Caption Detection | 49 | (32)
3.1 Introduction to Video Caption Detection | 49 | (2)
3.2 Feature-Based Methods | 51 | (21)
| 51 | (5)
3.2.2 Texture-Based Methods | 56 | (7)
3.2.3 Connected Component-Based Methods | 63 | (4)
3.2.4 Frequency Domain Methods | 67 | (5)
3.3 Machine Learning-Based Methods | 72 | (6)
3.3.1 Support Vector Machine-Based Methods | 72 | (1)
3.3.2 Neural Network Model-Based Methods | 72 | (1)
3.3.3 Bayes Classification-Based Methods | 73 | (5)
| 78 | (3)
| 78 | (3)
4 Text Detection from Video Scenes | 81 | (46)
4.1 Visual Saliency of Scene Texts | 82 | (8)
4.2 Natural Scene Text Detection Methods | 90 | (23)
| 91 | (5)
| 96 | (4)
4.2.3 Statistical and Machine Learning Approach | 100 | (6)
4.2.4 Temporal Analysis Approach | 106 | (4)
| 110 | (3)
4.3 Scene Character/Text Recognition | 113 | (3)
| 116 | (6)
| 122 | (5)
| 124 | (3)
5 Post-processing of Video Text Detection | 127 | (18)
5.1 Text Line Binarization | 127 | (6)
5.1.1 Wavelet-Gradient-Fusion Method (WGF) | 128 | (1)
| 129 | (2)
| 131 | (1)
5.1.4 Foreground and Background Separation | 132 | (1)
| 132 | (1)
5.2 Character Reconstruction | 133 | (9)
5.2.1 Ring Radius Transform | 135 | (1)
5.2.2 Horizontal and Vertical Medial Axes | 136 | (2)
5.2.3 Horizontal and Vertical Gap Filling | 138 | (1)
| 139 | (1)
| 140 | (1)
| 141 | (1)
| 142 | (1)
| 142 | (3)
| 143 | (2)
6 Character Segmentation and Recognition | 145 | (24)
6.1 Introduction to OCR and Its Usage in Video Text Recognition | 145 | (2)
6.2 Word and Character Segmentation | 147 | (7)
6.2.1 Fourier Transform-Based Method for Word and Character Segmentation | 149 | (1)
6.2.2 Bresenham's Line Algorithm | 149 | (1)
6.2.3 Fourier-Moments Features | 150 | (2)
| 152 | (1)
6.2.5 Character Extraction | 153 | (1)
| 153 | (1)
6.3 Character Segmentation Without Word Segmentation | 154 | (5)
6.3.1 GVF for Character Segmentation | 155 | (1)
6.3.2 Cut Candidate Identification | 155 | (2)
6.3.3 Minimum-Cost Pathfinding | 157 | (1)
6.3.4 False-Positive Elimination | 158 | (1)
| 159 | (1)
6.4 Video Text Recognition | 159 | (7)
6.4.1 Character Recognition | 160 | (1)
6.4.2 Hierarchical Classification Based on Voting Method | 160 | (4)
6.4.3 Structural Features for Recognition | 164 | (2)
| 166 | (1)
| 166 | (3)
| 167 | (2)
7 Video Text Detection Systems | 169 | (26)
7.1 License Plate Recognition Systems | 170 | (11)
7.1.1 Preprocessing of LPR Systems | 172 | (3)
7.1.2 License Plate Detection | 175 | (1)
| 176 | (1)
7.1.4 Character Segmentation | 177 | (1)
7.1.5 Character Recognition | 178 | (3)
7.2 Navigation Assistant Systems | 181 | (2)
7.3 Sport Video Analysis Systems | 183 | (5)
7.4 Video Advertising Systems | 188 | (3)
| 191 | (4)
| 191 | (4)
| 195 | (26)
8.1 Language-Dependent Text Detection | 196 | (4)
8.1.1 Method for Bangla and Devanagari (Indian Scripts) Text Detection | 197 | (1)
8.1.2 Headline-Based Method for Text Detection | 197 | (2)
8.1.3 Sample Experimental Results | 199 | (1)
| 199 | (1)
8.2 Methods for Language-Independent Text Detection | 200 | (7)
8.2.1 Run Lengths for Multi-oriented Text Detection | 201 | (1)
8.2.2 Selecting Potential Run Lengths | 201 | (1)
8.2.3 Boundary Growing Method for Traversing | 202 | (2)
8.2.4 Zero Crossing for Separating Text Lines from Touching | 204 | (1)
| 205 | (1)
| 206 | (1)
8.3 Script Identification | 207 | (11)
8.3.1 Spatial-Gradient-Features for Video Script Identification | 210 | (1)
8.3.2 Text Components Based on Gradient Histogram Method | 210 | (2)
8.3.3 Candidate Text Components Selection | 212 | (1)
8.3.4 Features Based on Spatial Information | 213 | (1)
8.3.5 Template Formation for Script Identification | 214 | (3)
| 217 | (1)
| 218 | (3)
| 218 | (3)
9 Text Detection in Multimodal Video Analysis | 221 | (26)
9.1 Relevance of Video Text and Other Modalities in Video Analysis | 223 | (4)
9.2 Video Text-Related Multimodality Analysis Models | 227 | (2)
9.3 Text Detection for Multimodal Video Content Analysis | 229 | (14)
9.3.1 Text Detection and Multimodal Analysis in Broadcast Videos | 229 | (3)
9.3.2 Lyrics Analysis for Interpreting Karaoke Music Video | 232 | (4)
9.3.3 Multimodal Video Summarization | 236 | (4)
9.3.4 Web Video Category/Search Through Text Modality | 240 | (3)
| 243 | (4)
| 244 | (3)
10 Performance Evaluation | 247 | (8)
10.1 Performance Evaluation Protocols | 248 | (1)
10.2 Benchmark Databases for Video Text Detection | 248 | (1)
10.3 Matching Detected Text Boxes with Ground Truths | 249 | (1)
10.4 Performance Metrics for Video Text Detection | 250 | (2)
10.5 Dataset and Evaluation of Video Text Recognition | 252 | (1)
| 253 | (2)
| 253 | (2)
Index | 255 |