This textbook provides a step-by-step introduction to the tools and principles of high-dimensional statistics. Each chapter is complemented by numerous exercises, many of them with detailed solutions, and computer labs in R that convey valuable practical insights. The book covers the theory and practice of high-dimensional linear regression, graphical models, and inference, ensuring readers have a smooth start in the field. It also offers suggestions for further reading. Given its scope, the textbook is intended for beginning graduate and advanced undergraduate students in statistics, biostatistics, and bioinformatics, though it will be equally useful to a broader audience.
|
|
1 | (36) |
|
1.1 Embracing High-Dimensionality |
|
|
2 | (3) |
|
1.2 Statistical Limitations of Classical Estimators |
|
|
5 | (5) |
|
1.3 Incorporating Prior Information |
|
|
10 | (3) |
|
1.4 Regularization for Increasing the Numerical Stability* |
|
|
13 | (5) |
|
|
18 | (1) |
|
|
19 | (6) |
|
1.7 R Lab: Least-Squares vs. Ridge Estimation |
|
|
25 | (9) |
|
|
34 | (3) |
|
|
37 | (44) |
|
|
38 | (3) |
|
2.2 Sparsity-Inducing Prior Functions |
|
|
41 | (4) |
|
2.3 Post-Processing Methods |
|
|
45 | (3) |
|
|
48 | (5) |
|
2.5 Optimality Conditions* |
|
|
53 | (10) |
|
|
63 | (8) |
|
|
71 | (6) |
|
|
77 | (4) |
|
|
81 | (28) |
|
|
82 | (3) |
|
3.2 Gaussian Graphical Models |
|
|
85 | (2) |
|
3.3 Maximum Regularized Likelihood Estimation |
|
|
87 | (4) |
|
3.4 Neighborhood Selection |
|
|
91 | (4) |
|
|
95 | (3) |
|
3.6 R Lab: Estimating a Gene-Gene Coactivation Network |
|
|
98 | (9) |
|
|
107 | (2) |
|
4 Tuning-Parameter Calibration |
|
|
109 | (30) |
|
|
110 | (3) |
|
4.2 Bounds on the Lasso's Effective Noise |
|
|
113 | (3) |
|
|
116 | (5) |
|
|
121 | (6) |
|
|
127 | (4) |
|
4.6 R Lab: Cross-Validation |
|
|
131 | (4) |
|
|
135 | (4) |
|
|
139 | (30) |
|
|
140 | (6) |
|
|
146 | (7) |
|
|
153 | (4) |
|
5.4 R Lab: Confidence Intervals in Low and High Dimensions |
|
|
157 | (9) |
|
|
166 | (3) |
|
|
169 | (42) |
|
|
170 | (5) |
|
|
175 | (7) |
|
6.3 Prediction Guarantees |
|
|
182 | (10) |
|
6.4 Prediction Guarantees for Sparse and Weakly Correlated Models |
|
|
192 | (12) |
|
|
204 | (4) |
|
|
208 | (3) |
|
7 Theory II: Estimation and Support Recovery |
|
|
211 | (28) |
|
|
212 | (1) |
|
7.2 Estimation Guarantees in -loss |
|
|
213 | (5) |
|
7.3 Estimation Guarantees in -loss |
|
|
218 | (13) |
|
7.4 Support Recovery Guarantees |
|
|
231 | (4) |
|
|
235 | (2) |
|
|
237 | (2) |
|
Supplementary Information |
|
|
239 | (99) |
|
|
240 | (69) |
|
A.1 Solutions for Chap. 1 |
|
|
240 | (17) |
|
A.2 Solutions for Chap. 2 |
|
|
257 | (14) |
|
A.3 Solutions for Chap. 3 |
|
|
271 | (14) |
|
A.4 Solutions for Chap. 4 |
|
|
285 | (3) |
|
A.5 Solutions for Chap. 5 |
|
|
288 | (7) |
|
A.6 Solutions for Chap. 6 |
|
|
295 | (5) |
|
A.7 Solutions for Chap. 7 |
|
|
300 | (9) |
|
B Mathematical Background |
|
|
309 | (29) |
|
|
309 | (3) |
|
|
312 | (26) |
Bibliography |
|
338 | (7) |
Index |
|
345 | |
Johannes Lederer is a Professor of Statistics at the Ruhr-University Bochum, Germany. He received his PhD in mathematics from the ETH Zürich and subsequently held positions at UC Berkeley, Cornell University, and the University of Washington. He has taught high-dimensional statistics to applied and mathematical audiences alike, e.g. as a Visiting Professor at the Institute of Statistics, Biostatistics, and Actuarial Sciences at UC Louvain, and at the University of Hong Kong Business School.