Preface |
|
xvii | |
Contributors |
|
xix | |
PART I Introduction to the Concepts of Bioinformatics and Evolutionary Computation |
|
1 | (38) |
|
An Introduction to Bioinformatics for Computer Scientists |
|
|
3 | (16) |
|
|
|
|
3 | (2) |
|
Biology---The Science of Life |
|
|
5 | (1) |
|
The Central Dogma of Molecular Biology |
|
|
6 | (9) |
|
Anatomy of a DNA Sequence |
|
|
10 | (1) |
|
Transcription and Translation |
|
|
11 | (1) |
|
|
12 | (3) |
|
|
15 | (1) |
|
|
16 | (1) |
|
|
17 | (2) |
|
An Introduction to Evolutionary Computation for Biologists |
|
|
19 | (20) |
|
|
|
|
19 | (1) |
|
Evolutionary Computation: History and Terminology |
|
|
20 | (6) |
|
|
22 | (1) |
|
|
22 | (2) |
|
|
24 | (1) |
|
Additional Parameters and Testing |
|
|
25 | (1) |
|
|
25 | (1) |
|
|
26 | (1) |
|
Evolutionary Computation within the Context of Computer Science |
|
|
26 | (9) |
|
|
29 | (3) |
|
|
32 | (1) |
|
|
33 | (2) |
|
|
35 | (4) |
PART II Sequence and Structure Alignment |
|
39 | (74) |
|
Determining Genome Sequences from Experimental Data Using Evolutionary Computation |
|
|
41 | (18) |
|
|
|
|
41 | (4) |
|
Sequencing by Hybridization |
|
|
41 | (2) |
|
Example: Reconstruction of Sequence from an Ideal Spectrum |
|
|
43 | (1) |
|
Experimental Errors in the Spectrum |
|
|
43 | (2) |
|
Formulation of the Sequence Reconstruction Problem |
|
|
45 | (2) |
|
An Integer Programming Formulation |
|
|
45 | (1) |
|
Example: Reconstruction of Sequence from a Spectrum Containing Errors |
|
|
46 | (1) |
|
A Hybrid Genetic Algorithm for Sequence Reconstruction |
|
|
47 | (2) |
|
Results from Computational Experiments |
|
|
49 | (6) |
|
|
55 | (4) |
|
Protein Structure Alignment Using Evolutionary Computation |
|
|
59 | (28) |
|
|
|
|
59 | (3) |
|
Structure Alignment Algorithms |
|
|
60 | (1) |
|
|
61 | (1) |
|
|
62 | (9) |
|
|
63 | (2) |
|
Stage 2: Detailed Alignments Using a Genetic Algorithm |
|
|
65 | (4) |
|
Stage 3: Three-Dimensional Superposition |
|
|
69 | (1) |
|
Evaluation of Statistical Significance |
|
|
69 | (2) |
|
|
71 | (13) |
|
|
72 | (1) |
|
GA Performance Characteristics |
|
|
73 | (6) |
|
|
79 | (5) |
|
|
84 | (3) |
|
Using Genetic Algorithms for Pairwise and Multiple Sequence Alignments |
|
|
87 | (26) |
|
|
|
87 | (4) |
|
Standard Optimization Algorithms |
|
|
88 | (1) |
|
|
89 | (2) |
|
Evolutionary Algorithms and Simulated Annealing |
|
|
91 | (1) |
|
SAGA: A Genetic Algorithm Dedicated to Sequence Alignment |
|
|
92 | (7) |
|
|
92 | (1) |
|
|
93 | (1) |
|
Reproduction, Variation, and Termination |
|
|
94 | (1) |
|
Designing the Variation Operators |
|
|
94 | (1) |
|
|
95 | (1) |
|
Mutation Operators: A Gap Insertion Operator |
|
|
95 | (1) |
|
Dynamic Scheduling of Operators |
|
|
95 | (3) |
|
|
98 | (1) |
|
Applications: Choice of an Appropriate Objective Function |
|
|
99 | (6) |
|
|
100 | (1) |
|
Consistency-Based Objective Functions: The COFFEE Score |
|
|
101 | (2) |
|
Taking Nonlocal Interactions into Account: RAGA |
|
|
103 | (2) |
|
|
105 | (8) |
PART III Protein Folding |
|
113 | (80) |
|
On the Evolutionary Search for Solutions to the Protein Folding Problem |
|
|
115 | (22) |
|
|
|
|
115 | (1) |
|
|
116 | (2) |
|
|
118 | (10) |
|
|
118 | (4) |
|
|
122 | (4) |
|
|
126 | (2) |
|
|
128 | (3) |
|
Chemistry at HARvard Molecular Mechanics (CHARMm) |
|
|
129 | (1) |
|
Assisted Model Building with Energy Refinement (AMBER) |
|
|
129 | (1) |
|
On Force Fields and Evolutionary Algorithms |
|
|
130 | (1) |
|
|
131 | (6) |
|
Virtual Backbone Modeling |
|
|
131 | (1) |
|
Full Modeling of the Side Chains |
|
|
132 | (1) |
|
Development of an Empirical Energy Function to Measure Fitness |
|
|
132 | (1) |
|
A Realistic Assessment of a Search Strategy |
|
|
133 | (4) |
|
Toward Effective Polypeptide Structure Prediction with Parallel Fast Messy Genetic Algorithms |
|
|
137 | (26) |
|
|
|
|
137 | (2) |
|
Fast Messy Genetic Algorithms |
|
|
139 | (3) |
|
|
139 | (2) |
|
|
141 | (1) |
|
|
142 | (9) |
|
|
142 | (3) |
|
|
145 | (1) |
|
The Computing Environment |
|
|
145 | (1) |
|
|
145 | (1) |
|
|
145 | (1) |
|
Handling the Steric Constraints |
|
|
146 | (4) |
|
|
150 | (1) |
|
Protein Structure Prediction with Secondary Structure Computation |
|
|
151 | (4) |
|
Effects of Seeding the Population |
|
|
155 | (1) |
|
|
156 | (7) |
|
Application of Evolutionary Computation to Protein Folding with Specialized Operators |
|
|
163 | (30) |
|
|
|
163 | (14) |
|
|
164 | (2) |
|
|
166 | (1) |
|
|
167 | (1) |
|
|
168 | (3) |
|
|
171 | (4) |
|
|
175 | (2) |
|
Multiple-Criteria Optimization of Protein Conformations |
|
|
177 | (3) |
|
Specialized Variation Operators |
|
|
180 | (3) |
|
Preferred Backbone Conformations |
|
|
181 | (1) |
|
|
182 | (1) |
|
|
183 | (5) |
|
|
188 | (5) |
PART IV Machine Learning and Inference |
|
193 | (102) |
|
Identification of Coding Regions in DNA Sequences Using Evolved Neural Networks |
|
|
195 | (24) |
|
|
|
|
|
195 | (6) |
|
Artificial Neural Networks for Pattern Recognition |
|
|
197 | (2) |
|
Evolutionary Computation and Neural Networks |
|
|
199 | (2) |
|
Evolved Artificial Neural Networks for Gene Identification |
|
|
201 | (14) |
|
|
202 | (3) |
|
Classification and Postprocessing |
|
|
205 | (2) |
|
Performance on Training Data |
|
|
207 | (1) |
|
|
208 | (7) |
|
|
215 | (4) |
|
Clustering Microarray Data with Evolutionary Algorithms |
|
|
219 | (12) |
|
|
|
|
219 | (1) |
|
|
220 | (4) |
|
|
220 | (1) |
|
|
221 | (3) |
|
|
224 | (5) |
|
The Grouping Genetic Algorithm |
|
|
224 | (2) |
|
The Use of GGA within ArrayMiner |
|
|
226 | (1) |
|
Why Does It Matter in Practice? |
|
|
226 | (1) |
|
ArrayMiner's Performance: Solution Quality and Speed |
|
|
227 | (2) |
|
|
229 | (2) |
|
Evolutionary Computation and Fractal Visualization of Sequence Data |
|
|
231 | (24) |
|
|
|
|
231 | (1) |
|
|
232 | (6) |
|
|
238 | (5) |
|
|
239 | (2) |
|
Designing the Evolutionary Algorithm |
|
|
241 | (2) |
|
|
243 | (1) |
|
Chaos Automata: Adding Memory |
|
|
243 | (7) |
|
The Data Structure and Its Variation Operators |
|
|
245 | (1) |
|
|
246 | (1) |
|
The Evolutionary Algorithm |
|
|
247 | (1) |
|
|
248 | (1) |
|
|
248 | (2) |
|
|
250 | (1) |
|
|
250 | (5) |
|
|
251 | (1) |
|
|
252 | (1) |
|
Other Fractal Chromosomes |
|
|
252 | (1) |
|
More General Contraction Maps |
|
|
253 | (2) |
|
Identifying Metabolic Pathways and Gene Regulation Networks with Evolutionary Algorithms |
|
|
255 | (24) |
|
|
|
|
255 | (3) |
|
The Importance of Inferring Biological Networks |
|
|
256 | (1) |
|
Representing Biological Networks with Petri Nets |
|
|
256 | (2) |
|
Reaction Kinetics, Petri Nets, and Functional Petri Nets |
|
|
258 | (1) |
|
|
258 | (1) |
|
|
259 | (1) |
|
The Inverse Problem: Inferring Pathways from Data |
|
|
259 | (3) |
|
|
260 | (2) |
|
|
262 | (1) |
|
Evolving Pathways: Sample Results |
|
|
262 | (6) |
|
A Simple Metabolic Pathway |
|
|
262 | (1) |
|
|
263 | (1) |
|
Inferring the Phospholipid Pathway |
|
|
264 | (4) |
|
Related Work Using Evolutionary Computation for the Inference of Biological Networks |
|
|
268 | (8) |
|
Biological Network Inference with Genetic Programming |
|
|
268 | (4) |
|
Inference of Gene Regulatory Networks |
|
|
272 | (3) |
|
Interactive Simulation Systems for Biological Network Inference |
|
|
275 | (1) |
|
|
276 | (3) |
|
Evolutionary Computational Support for the Characterization of Biological Systems |
|
|
279 | (16) |
|
|
|
|
279 | (2) |
|
Characterization of Biological Systems with EPR Spectroscopy |
|
|
281 | (5) |
|
EPR Spectrum-Simulation Model |
|
|
282 | (2) |
|
The Role of Spectral Parameters |
|
|
284 | (2) |
|
Optimization of Spectral Parameters |
|
|
286 | (1) |
|
|
287 | (6) |
|
|
293 | (2) |
PART V Feature Selection |
|
295 | (72) |
|
Discovery of Genetic and Environmental Interactions in Disease Data Using Evolutionary Computation |
|
|
297 | (20) |
|
|
|
|
|
297 | (2) |
|
Biological Background and Definitions |
|
|
299 | (2) |
|
Mathematical Background and Definitions |
|
|
301 | (1) |
|
The Feature Selection Phase |
|
|
302 | (6) |
|
|
302 | (2) |
|
Candidate Solution Encoding and a Distance Measure |
|
|
304 | (1) |
|
|
305 | (1) |
|
|
306 | (1) |
|
|
307 | (1) |
|
|
308 | (1) |
|
|
308 | (2) |
|
Objective of the Clustering Phase |
|
|
309 | (1) |
|
|
309 | (1) |
|
|
310 | (5) |
|
Experiments on Artificial Data |
|
|
310 | (2) |
|
|
312 | (3) |
|
|
315 | (2) |
|
Feature Selection Methods Based on Genetic Algorithms for in Silico Drug Design |
|
|
317 | (24) |
|
|
|
|
|
|
|
|
|
317 | (2) |
|
The Feature Selection Problem |
|
|
319 | (2) |
|
|
321 | (2) |
|
|
321 | (1) |
|
|
322 | (1) |
|
Feature Selection Methods |
|
|
323 | (2) |
|
GA-Supervised Scaled Regression Clustering (GARC) |
|
|
324 | (1) |
|
|
324 | (1) |
|
|
325 | (3) |
|
GA-Driven Supervised Clustering |
|
|
326 | (1) |
|
Supervised Regression Clustering with Local Learning |
|
|
326 | (1) |
|
Supervised Scaled Regression Clustering |
|
|
327 | (1) |
|
Parameterization and Implementation of GAFEAT |
|
|
328 | (3) |
|
|
329 | (1) |
|
Correlation Matrix-Based Evaluation Function |
|
|
330 | (1) |
|
Rank-Based Selection Scheme |
|
|
330 | (1) |
|
Comparative Results and Discussion |
|
|
331 | (3) |
|
|
334 | (7) |
|
Interpreting Analytical Spectra with Evolutionary Computation |
|
|
341 | (26) |
|
|
Analytical Spectra in Bioinformatics |
|
|
341 | (1) |
|
Some Instrumentation Issues |
|
|
342 | (4) |
|
Unsupervised and Supervised Learning in Spectral Interpretation |
|
|
346 | (1) |
|
Some General Issues of Model Validation |
|
|
347 | (3) |
|
Selecting Spectral Variables for Modeling |
|
|
350 | (2) |
|
|
352 | (1) |
|
|
353 | (2) |
|
Making Use of Domain Knowledge |
|
|
355 | (3) |
|
Intelligibility of Models |
|
|
358 | (1) |
|
Model Validation with Evolutionary Algorithms |
|
|
359 | (1) |
|
Applications of Evolutionary Computation in Transcriptomics and Proteomics |
|
|
360 | (2) |
|
|
362 | (5) |
Appendix: Internet Resources for Bioinformatics Data and Tools |
|
367 | (6) |
|
|
367 | (1) |
|
|
367 | (1) |
|
|
368 | (1) |
|
A.4 Expressed Sequence Tags (ESTs) |
|
|
369 | (1) |
|
A.5 Single Nucleotide Polymorphisms (SNPs) |
|
|
369 | (1) |
|
|
369 | (1) |
|
|
370 | (1) |
|
|
371 | (1) |
|
A.9 Educational Resources |
|
|
371 | (1) |
|
|
371 | (2) |
Index |
|
373 | |