Acknowledgments |
|
ix | |
Introduction |
|
xi | |
1.1 Prerequisites and notations |
|
xi | |
1.2 The case under study is a genomic monster |
|
xvii | |
|
Chapter 1 Introduction to Correspondence Analysis |
|
|
1 | (24) |
|
|
1 | (1) |
|
|
2 | (10) |
|
1.2.1 A small data set example |
|
|
2 | (5) |
|
|
7 | (1) |
|
1.2.3 Euclidean distance on row profiles |
|
|
8 | (1) |
|
1.2.4 Euclidean distance on double profiles |
|
|
9 | (1) |
|
|
10 | (2) |
|
|
12 | (13) |
|
1.3.1 Scree plot of eigenvalues |
|
|
12 | (3) |
|
1.3.2 Factor orientations are meaningless |
|
|
15 | (2) |
|
|
17 | (1) |
|
1.3.4 Nonlinearity in CA is not a bug but a feature |
|
|
17 | (3) |
|
1.3.5 Distributional equivalence principle |
|
|
20 | (1) |
|
1.3.6 So, why the χ² metric? |
|
|
20 | (5) |
|
Chapter 2 Global Correspondence Analysis |
|
|
25 | (34) |
|
|
25 | (9) |
|
2.1.1 Queries in database |
|
|
25 | (6) |
|
|
31 | (3) |
|
2.2 Running global correspondence analysis |
|
|
34 | (4) |
|
2.3 The missing factor F0 |
|
|
38 | (2) |
|
|
40 | (4) |
|
2.4.1 Coding sequence point of view |
|
|
40 | (2) |
|
2.4.2 Codon point of view |
|
|
42 | (2) |
|
2.4.3 Biological interpretation |
|
|
44 | (1) |
|
2.5 Second and third factors |
|
|
44 | (9) |
|
2.5.1 Coding sequence point of view |
|
|
45 | (3) |
|
2.5.2 Codon point of view |
|
|
48 | (3) |
|
2.5.3 Biological interpretation |
|
|
51 | (2) |
|
2.6 Fourth and fifth factors |
|
|
53 | (6) |
|
2.6.1 Coding sequence point of view |
|
|
55 | (1) |
|
2.6.2 Codon point of view |
|
|
55 | (4) |
|
Chapter 3 Within and Between Correspondence Analysis |
|
|
59 | (12) |
|
|
59 | (4) |
|
3.2 Synonymous codon usage (WCA) |
|
|
63 | (3) |
|
3.2.1 The first and unique factor F1 |
|
|
63 | (1) |
|
3.2.1.1 Coding sequences point of view |
|
|
63 | (1) |
|
3.2.1.2 Codon point of view |
|
|
63 | (2) |
|
3.2.1.3 Biological interpretation |
|
|
65 | (1) |
|
3.3 Amino acid usage (BCA) |
|
|
66 | (5) |
|
3.3.1 The missing factor F0 |
|
|
66 | (1) |
|
3.3.2 The ugly (F1, F2, F3) ménage à trois |
|
|
66 | (1) |
|
3.3.2.1 Coding sequences point of view |
|
|
66 | (3) |
|
3.3.2.2 Amino acid point of view |
|
|
69 | (1) |
|
3.3.2.3 Methodological point of view |
|
|
69 | (2) |
|
Chapter 4 Internal Correspondence Analysis |
|
|
71 | (18) |
|
|
71 | (4) |
|
4.2 Synonymous codon usage |
|
|
75 | (1) |
|
4.3 Non-synonymous codon usage |
|
|
75 | (12) |
|
4.3.1 Between-group analysis |
|
|
75 | (5) |
|
4.3.2 Within-group analysis |
|
|
80 | (1) |
|
|
80 | (2) |
|
|
82 | (5) |
|
|
87 | (2) |
|
|
89 | (44) |
|
|
91 | (40) |
|
|
91 | (5) |
|
|
96 | (9) |
|
|
105 | (11) |
|
|
116 | (5) |
|
|
121 | (10) |
|
|
131 | (2) |
|
|
131 | (2) |
References |
|
133 | (14) |
Index |
|
147 | |