Contributors |
|
xv | |
Foreword |
|
xvii | |
Preface |
|
xix | |
The Editors |
|
xxix | |
|
PART I Strategies For Success In The Digital-Data Revolution |
|
|
1 | (122) |
|
1 The Digital-Data Challenge |
|
|
5 | (10) |
|
|
|
1.1 The Digital Revolution |
|
|
5 | (1) |
|
1.2 Changing How We Think and Behave |
|
|
6 | (2) |
|
1.3 Moving Adroitly in this Fast-Changing Field |
|
|
8 | (1) |
|
1.4 Digital-Data Challenges Exist Everywhere |
|
|
8 | (1) |
|
1.5 Changing-, How We Work |
|
|
9 | (1) |
|
1.6 Divide and Conquer Offers the Solution |
|
|
10 | (2) |
|
1.7 Engineering Data-to-Knowledge Highways |
|
|
12 | (3) |
|
|
13 | (2) |
|
2 The Digital-Data Revolution |
|
|
15 | (22) |
|
|
2.1 Data, Information, and Knowledge |
|
|
16 | (2) |
|
2.2 Increasing Volumes and Diversity of Data |
|
|
18 | (10) |
|
2.3 Changing the Ways We Work with Data |
|
|
28 | (9) |
|
|
33 | (4) |
|
3 The Data-Intensive Survival Guide |
|
|
37 | (24) |
|
|
3.1 Introduction: Challenges and Strategy |
|
|
38 | (1) |
|
3.2 Three Categories of Expert |
|
|
39 | (2) |
|
3.3 The Data-Intensive Architecture |
|
|
41 | (1) |
|
3.4 An Operational Data-Intensive System |
|
|
42 | (2) |
|
|
44 | (1) |
|
3.6 A Simple DISPEL Example |
|
|
45 | (2) |
|
3.7 Supporting Data-Intensive Experts |
|
|
47 | (1) |
|
3.8 DISPEL in the Context of Contemporary Systems |
|
|
48 | (3) |
|
|
51 | (3) |
|
3.10 Ramps for Incremental Engagement |
|
|
54 | (2) |
|
3.11 Readers' Guide to the Rest of This Book |
|
|
56 | (5) |
|
|
58 | (3) |
|
4 Data-Intensive Thinking with DISPEL |
|
|
61 | (62) |
|
|
|
62 | (2) |
|
|
64 | (1) |
|
4.3 Data Streams and Structure |
|
|
65 | (1) |
|
|
66 | (6) |
|
4.5 The Three-Level Type System |
|
|
72 | (9) |
|
4.6 Registry, Libraries, and Descriptions |
|
|
81 | (5) |
|
4.7 Achieving Data-Intensive Performance |
|
|
86 | (22) |
|
4.8 Reliability and Control |
|
|
108 | (8) |
|
4.9 The Data-to-Knowledge Highway |
|
|
116 | (7) |
|
|
121 | (2) |
|
PART II Data-Intensive Knowledge Discovery |
|
|
123 | (70) |
|
5 Data-Intensive Analysis |
|
|
127 | (20) |
|
|
|
5.1 Knowledge Discovery in Telco Inc. |
|
|
128 | (2) |
|
5.2 Understanding Customers to Prevent Churn |
|
|
130 | (4) |
|
5.3 Preventing Churn Across Multiple Companies |
|
|
134 | (3) |
|
5.4 Understanding Customers by Combining Heterogeneous Public and Private Data |
|
|
137 | (7) |
|
|
144 | (3) |
|
|
145 | (2) |
|
6 Problem Solving in Data-Intensive Knowledge Discovery |
|
|
147 | (18) |
|
|
|
6.1 The Conventional Life Cycle of Knowledge Discovery |
|
|
148 | (7) |
|
6.2 Knowledge Discovery Over Heterogeneous Data Sources |
|
|
155 | (3) |
|
6.3 Knowledge Discovery from Private and Public, Structured and Nonstructured Data |
|
|
158 | (4) |
|
|
162 | (3) |
|
|
162 | (3) |
|
7 Data-Intensive Components and Usage Patterns |
|
|
165 | (16) |
|
|
7.1 Data Source Access and Transformation Components |
|
|
166 | (6) |
|
7.2 Data Integration Components |
|
|
172 | (1) |
|
7.3 Data Preparation and Processing Components |
|
|
173 | (1) |
|
7.4 Data-Mining Components |
|
|
174 | (2) |
|
7.5 Visualization and Knowledge Delivery Components |
|
|
176 | (5) |
|
|
178 | (3) |
|
8 Sharing and Reuse in Knowledge Discovery |
|
|
181 | (12) |
|
|
8.1 Strategies for Sharing and Reuse |
|
|
182 | (3) |
|
8.2 Data Analysis Ontologies for Data Analysis Experts |
|
|
185 | (3) |
|
8.3 Generic Ontologies for Metadata Generation |
|
|
188 | (1) |
|
8.4 Domain Ontologies for Domain Experts |
|
|
189 | (1) |
|
|
190 | (3) |
|
|
191 | (2) |
|
PART III Data-Intensive Engineering |
|
|
193 | (82) |
|
9 Platforms for Data-Intensive Analysis |
|
|
197 | (6) |
|
|
9.1 The Hourglass Reprise |
|
|
198 | (2) |
|
9.2 The Motivation for a Platform |
|
|
200 | (1) |
|
|
201 | (2) |
|
|
201 | (2) |
|
10 Definition of the DISPEL Language |
|
|
203 | (34) |
|
|
|
|
204 | (1) |
|
|
205 | (8) |
|
|
213 | (4) |
|
|
217 | (5) |
|
|
222 | (2) |
|
|
224 | (1) |
|
|
225 | (2) |
|
|
227 | (8) |
|
|
235 | (2) |
|
|
236 | (1) |
|
|
237 | (14) |
|
|
|
11.1 The Development Landscape |
|
|
237 | (2) |
|
11.2 Data-Intensive Workbenches |
|
|
239 | (8) |
|
11.3 Data-Intensive Component Libraries |
|
|
247 | (1) |
|
|
248 | (3) |
|
|
248 | (3) |
|
|
251 | (24) |
|
|
|
|
12.1 Overview of DISPEL Enactment |
|
|
251 | (2) |
|
12.2 DISPEL Language Processing |
|
|
253 | (2) |
|
|
255 | (11) |
|
|
266 | (2) |
|
12.5 DISPEL Execution and Control |
|
|
268 | (7) |
|
|
273 | (2) |
|
PART IV Data-Intensive Application Experience |
|
|
275 | (102) |
|
13 The Application Foundations of DISPEL |
|
|
277 | (10) |
|
|
13.1 Characteristics of Data-Intensive Applications |
|
|
277 | (3) |
|
13.2 Evaluating Application Performance |
|
|
280 | (3) |
|
13.3 Reviewing the Data-Intensive Strategy |
|
|
283 | (4) |
|
14 Analytical Platform for Customer Relationship Management |
|
|
287 | (14) |
|
|
|
14.1 Data Analysis in the Telecoms Business |
|
|
288 | (1) |
|
14.2 Analytical Customer Relationship Management |
|
|
289 | (2) |
|
14.3 Scenario 1: Churn Prediction |
|
|
291 | (2) |
|
14.4 Scenario 2: Cross Selling |
|
|
293 | (3) |
|
14.5 Exploiting the Models and Rules |
|
|
296 | (3) |
|
14.6 Summary: Lessons Learned |
|
|
299 | (2) |
|
|
299 | (2) |
|
15 Environmental Risk Management |
|
|
301 | (26) |
|
|
|
|
|
15.1 Environmental Modeling |
|
|
302 | (1) |
|
15.2 Cascading Simulation Models |
|
|
303 | (2) |
|
15.3 Environmental Data Sources and Their Management |
|
|
305 | (4) |
|
|
309 | (4) |
|
|
313 | (5) |
|
|
318 | (3) |
|
15.7 New Technologies for Environmental Data Mining |
|
|
321 | (2) |
|
15.8 Summary: Lessons Learned |
|
|
323 | (4) |
|
|
325 | (2) |
|
16 Analyzing Gene Expression Imaging Data in Developmental Biology |
|
|
327 | (26) |
|
|
|
|
|
|
16.1 Understanding Biological Function |
|
|
328 | (2) |
|
16.2 Gene Image Annotation |
|
|
330 | (1) |
|
16.3 Automated Annotation of Gene Expression Images |
|
|
331 | (10) |
|
16.4 Exploitation and Future Work |
|
|
341 | (4) |
|
|
345 | (8) |
|
|
346 | (7) |
|
17 Data-Intensive Seismology: Research Horizons |
|
|
353 | (24) |
|
|
|
|
|
|
354 | (2) |
|
17.2 Seismic Ambient Noise Processing |
|
|
356 | (2) |
|
17.3 Solution Implementation |
|
|
358 | (11) |
|
|
369 | (3) |
|
|
372 | (1) |
|
|
373 | (4) |
|
|
375 | (2) |
|
PART V Data-Intensive Beacons Of Success |
|
|
377 | (82) |
|
18 Data-Intensive Methods in Astronomy |
|
|
381 | (14) |
|
|
|
|
|
|
|
|
381 | (1) |
|
18.2 The Virtual Observatory |
|
|
382 | (1) |
|
18.3 Data-Intensive Photometric Classification of Quasars |
|
|
383 | (4) |
|
18.4 Probing the Dark Universe with Weak Gravitational Lensing |
|
|
387 | (5) |
|
18.5 Future Research Issues |
|
|
392 | (1) |
|
|
392 | (3) |
|
|
393 | (2) |
|
19 The World at One's Fingertips: Interactive Interpretation of Environmental Data |
|
|
395 | (22) |
|
|
|
|
|
395 | (2) |
|
19.2 The Current State of the Art |
|
|
397 | (4) |
|
19.3 The Technical Landscape |
|
|
401 | (2) |
|
19.4 Interactive Visualization |
|
|
403 | (3) |
|
19.5 From Visualization to Intercomparison |
|
|
406 | (3) |
|
19.6 Future Development: The Environmental Cloud |
|
|
409 | (2) |
|
|
411 | (6) |
|
|
412 | (5) |
|
20 Data-Driven Research in the Humanities-the DARIAH Research Infrastructure |
|
|
417 | (14) |
|
|
|
|
|
|
417 | (3) |
|
20.2 The Tradition of Digital Humanities |
|
|
420 | (2) |
|
20.3 Humanities Research Data |
|
|
422 | (4) |
|
|
426 | (3) |
|
20.5 Conclusion and Future Development |
|
|
429 | (2) |
|
|
430 | (1) |
|
21 Analysis of Large and Complex Engineering and Transport Data |
|
|
431 | (10) |
|
|
|
431 | (1) |
|
21.2 Applications and Challenges |
|
|
432 | (2) |
|
|
434 | (4) |
|
|
438 | (1) |
|
|
439 | (2) |
|
|
440 | (1) |
|
22 Estimating Species Distributions-Across Space, Through Time, and with Features of the Environment |
|
|
441 | (18) |
|
|
|
|
|
|
|
|
|
|
442 | (1) |
|
22.2 Data Discovery, Access, and Synthesis |
|
|
443 | (5) |
|
|
448 | (1) |
|
22.4 Managing Computational Requirements |
|
|
449 | (1) |
|
22.5 Exploring and Visualizing Model Results |
|
|
450 | (2) |
|
|
452 | (2) |
|
|
454 | (5) |
|
|
456 | (3) |
|
PART VI The Data-Intensive Future |
|
|
459 | (40) |
|
|
461 | (16) |
|
|
|
|
461 | (8) |
|
23.2 Data-Intensive Applications |
|
|
469 | (8) |
|
|
476 | (1) |
|
|
477 | (22) |
|
|
24.1 Future Data Infrastructure |
|
|
478 | (7) |
|
|
485 | (4) |
|
24.3 Future Data Society and Professionalism |
|
|
489 | (10) |
|
|
494 | (5) |
Appendix A: Glossary |
|
499 | (8) |
|
|
Appendix B: DISPEL Reference Manual |
|
507 | (24) |
|
Appendix C: Component Definitions |
|
531 | (6) |
|
|
Index |
|
537 | |