Preface to the second edition, |
|
xv | |
Preface to the first edition, |
|
xvii | |
Abbreviations, |
|
xxi | |
1 Basic ideas in clinical trial design, |
|
1 | (22) |
|
1.1 Historical perspective, |
|
|
1 | (1) |
|
|
2 | (1) |
|
1.3 Placebos and blinding, |
|
|
3 | (1) |
|
|
3 | (6) |
|
1.4.1 Unrestricted randomisation, |
|
|
4 | (1) |
|
1.4.2 Block randomisation, |
|
|
4 | (1) |
|
1.4.3 Unequal randomisation, |
|
|
5 | (1) |
|
1.4.4 Stratified randomisation, |
|
|
6 | (1) |
|
1.4.5 Central randomisation, |
|
|
7 | (1) |
|
1.4.6 Dynamic allocation and minimisation, |
|
|
8 | (1) |
|
1.4.7 Cluster randomisation, |
|
|
9 | (1) |
|
|
9 | (2) |
|
1.6 Between- and within-patient designs, |
|
|
11 | (1) |
|
|
12 | (1) |
|
1.8 Signal, noise and evidence, |
|
|
13 | (2) |
|
|
13 | (1) |
|
|
13 | (1) |
|
1.8.3 Signal-to-noise ratio, |
|
|
14 | (1) |
|
1.9 Confirmatory and exploratory trials, |
|
|
15 | (1) |
|
1.10 Superiority, equivalence and non-inferiority trials, |
|
|
16 | (1) |
|
1.11 Data and endpoint types, |
|
|
17 | (1) |
|
|
18 | (5) |
|
1.12.1 Primary variables, |
|
|
18 | (1) |
|
1.12.2 Secondary variables, |
|
|
19 | (1) |
|
1.12.3 Surrogate variables, |
|
|
20 | (1) |
|
1.12.4 Global assessment variables, |
|
|
21 | (1) |
|
1.12.5 Composite variables, |
|
|
21 | (1) |
|
|
21 | (2) |
2 Sampling and inferential statistics, |
|
23 | (15) |
|
2.1 Sample and population, |
|
|
23 | (1) |
|
2.2 Sample statistics and population parameters, |
|
|
24 | (4) |
|
2.2.1 Sample and population distribution, |
|
|
24 | (1) |
|
|
25 | (1) |
|
2.2.3 Standard deviation, |
|
|
25 | (1) |
|
|
26 | (1) |
|
|
27 | (1) |
|
2.3 The normal distribution, |
|
|
28 | (3) |
|
2.4 Sampling and the standard error of the mean, |
|
|
31 | (3) |
|
2.5 Standard errors more generally, |
|
|
34 | (4) |
|
2.5.1 The standard error for the difference between two means, |
|
|
34 | (3) |
|
2.5.2 Standard errors for proportions, |
|
|
37 | (1) |
|
2.5.3 The general setting, |
|
|
37 | (1) |
3 Confidence intervals and p-values, |
|
38 | (18) |
|
3.1 Confidence intervals for a single mean, |
|
|
38 | (4) |
|
3.1.1 The 95 per cent Confidence interval, |
|
|
38 | (2) |
|
3.1.2 Changing the confidence coefficient, |
|
|
40 | (1) |
|
3.1.3 Changing the multiplying constant, |
|
|
40 | (1) |
|
3.1.4 The role of the standard error, |
|
|
41 | (1) |
|
3.2 Confidence interval for other parameters, |
|
|
42 | (3) |
|
3.2.1 Difference between two means, |
|
|
42 | (1) |
|
3.2.2 Confidence interval for proportions, |
|
|
43 | (1) |
|
|
44 | (1) |
|
3.2.4 Bootstrap Confidence interval, |
|
|
45 | (1) |
|
|
45 | (11) |
|
3.3.1 Interpreting the p-value, |
|
|
46 | (1) |
|
3.3.2 Calculating the p-value, |
|
|
47 | (3) |
|
|
50 | (3) |
|
3.3.4 The language of statistical significance, |
|
|
53 | (1) |
|
3.3.5 One-sided and two-sided tests, |
|
|
54 | (2) |
4 Tests for simple treatment comparisons, |
|
56 | (22) |
|
|
56 | (1) |
|
|
57 | (3) |
|
4.3 Interpreting the t-tests, |
|
|
60 | (1) |
|
4.4 The chi-square test for binary data, |
|
|
61 | (3) |
|
4.4.1 Pearson chi-square, |
|
|
61 | (3) |
|
4.4.2 The link to a ratio of the signal to the standard error, |
|
|
64 | (1) |
|
4.5 Measures of treatment benefit, |
|
|
64 | (5) |
|
|
65 | (1) |
|
|
65 | (1) |
|
4.5.3 Relative risk reduction, |
|
|
66 | (1) |
|
4.5.4 Number needed to treat, |
|
|
66 | (1) |
|
4.5.5 Confidence intervals, |
|
|
67 | (1) |
|
|
68 | (1) |
|
|
69 | (2) |
|
4.7 Tests for categorical and ordinal data, |
|
|
71 | (4) |
|
|
71 | (2) |
|
4.7.2 Ordered categorical (ordinal) data, |
|
|
73 | (1) |
|
4.7.3 Measures of treatment benefit, |
|
|
74 | (1) |
|
4.8 Extensions for multiple treatment groups, |
|
|
75 | (3) |
|
4.8.1 Between-patient designs and continuous data, |
|
|
75 | (1) |
|
4.8.2 Within-patient designs and continuous data, |
|
|
76 | (1) |
|
4.8.3 Binary, categorical and ordinal data, |
|
|
76 | (1) |
|
4.8.4 Dose-ranging studies, |
|
|
77 | (1) |
|
4.8.5 Further discussion, |
|
|
77 | (1) |
5 Adjusting the analysis, |
|
78 | (11) |
|
5.1 Objectives for adjusted analysis, |
|
|
78 | (1) |
|
5.2 Comparing treatments for continuous data, |
|
|
78 | (4) |
|
|
82 | (1) |
|
5.4 Evaluating the homogeneity of the treatment effect, |
|
|
83 | (3) |
|
5.4.1 Treatment-by-factor interactions, |
|
|
83 | (2) |
|
5.4.2 Quantitative and qualitative interactions, |
|
|
85 | (1) |
|
5.5 Methods for binary, categorical and ordinal data, |
|
|
86 | (1) |
|
|
87 | (2) |
|
5.6.1 Adjusting for centre, |
|
|
87 | (1) |
|
5.6.2 Significant treatment-by-centre interactions, |
|
|
87 | (1) |
|
|
88 | (1) |
6 Regression and analysis of covariance, |
|
89 | (18) |
|
6.1 Adjusting for baseline factors, |
|
|
89 | (1) |
|
6.2 Simple linear regression, |
|
|
89 | (2) |
|
|
91 | (3) |
|
|
94 | (1) |
|
6.5 Analysis of covariance for continuous data, |
|
|
94 | (7) |
|
6.5.1 Main effect of treatment, |
|
|
94 | (2) |
|
6.5.2 Treatment-by-covariate interactions, |
|
|
96 | (2) |
|
|
98 | (1) |
|
6.5.4 Connection with adjusted analyses, |
|
|
98 | (1) |
|
6.5.5 Advantages of ANCOVA, |
|
|
99 | (1) |
|
6.5.6 Least squares means, |
|
|
100 | (1) |
|
6.6 Binary, categorical and ordinal data, |
|
|
101 | (2) |
|
6.7 Regulatory aspects of the use of covariates, |
|
|
103 | (2) |
|
|
105 | (2) |
7 Intention-to-treat and analysis sets, |
|
107 | (16) |
|
7.1 The principle of intention-to-treat, |
|
|
107 | (3) |
|
7.2 The practice of intention-to-treat, |
|
|
110 | (3) |
|
|
110 | (2) |
|
|
112 | (1) |
|
|
112 | (1) |
|
|
113 | (5) |
|
|
113 | (1) |
|
7.3.2 Complete cases analysis, |
|
|
114 | (1) |
|
7.3.3 Last observation carried forward, |
|
|
114 | (1) |
|
7.3.4 Success/failure classification, |
|
|
114 | (1) |
|
7.3.5 Worst-case/best-case classification, |
|
|
115 | (1) |
|
|
115 | (1) |
|
7.3.7 Avoidance of missing data, |
|
|
116 | (1) |
|
7.3.8 Multiple imputation, |
|
|
117 | (1) |
|
7.4 Intention-to-treat and time-to-event data, |
|
|
118 | (2) |
|
7.5 General questions and considerations, |
|
|
120 | (3) |
8 Power and sample size, |
|
123 | (13) |
|
8.1 Type I and type II errors, |
|
|
123 | (1) |
|
|
124 | (3) |
|
8.3 Calculating sample size, |
|
|
127 | (3) |
|
8.4 Impact of changing the parameters, |
|
|
130 | (2) |
|
8.4.1 Standard deviation, |
|
|
130 | (1) |
|
8.4.2 Event rate in the control group, |
|
|
130 | (1) |
|
8.4.3 Clinically relevant difference, |
|
|
131 | (1) |
|
|
132 | (2) |
|
8.5.1 Power > 80 per cent, |
|
|
132 | (1) |
|
8.5.2 Powering on the per-protocol set, |
|
|
132 | (1) |
|
8.5.3 Sample size adjustment, |
|
|
133 | (1) |
|
8.6 Reporting the sample size calculation, |
|
|
134 | (2) |
9 Statistical significance and clinical importance, |
|
136 | (6) |
|
9.1 Link between p-values and Confidence intervals, |
|
|
136 | (1) |
|
9.2 Confidence intervals for clinical importance, |
|
|
137 | (2) |
|
9.3 Misinterpretation of the p-value, |
|
|
139 | (1) |
|
9.3.1 Conclusions of similarity, |
|
|
139 | (1) |
|
9.3.2 The problem with 0.05, |
|
|
140 | (1) |
|
9.4 Single pivotal trial and 0.05, |
|
|
140 | (2) |
10 Multiple testing, |
|
142 | (16) |
|
10.1 Inflation of the type I error, |
|
|
142 | (1) |
|
|
142 | (1) |
|
10.1.2 A simulated trial, |
|
|
142 | (1) |
|
10.2 How does multiplicity arise?, |
|
|
143 | (1) |
|
|
144 | (1) |
|
10.4 Multiple primary endpoints, |
|
|
145 | (4) |
|
10.4.1 Avoiding adjustment, |
|
|
145 | (1) |
|
10.4.2 Significance needed on all endpoints, |
|
|
145 | (1) |
|
10.4.3 Composite endpoints, |
|
|
146 | (1) |
|
10.4.4 Variables ranked according to clinical importance: Hierarchical testing, |
|
|
146 | (3) |
|
10.5 Methods for adjustment, |
|
|
149 | (3) |
|
10.5.1 Bonferroni correction, |
|
|
149 | (1) |
|
10.5.2 Hochberg correction, |
|
|
150 | (1) |
|
|
151 | (1) |
|
10.6 Multiple comparisons, |
|
|
152 | (1) |
|
10.7 Repeated evaluation over time, |
|
|
153 | (1) |
|
|
154 | (2) |
|
10.9 Other areas for multiplicity, |
|
|
156 | (2) |
|
10.9.1 Using different statistical tests, |
|
|
156 | (1) |
|
10.9.2 Different analysis sets, |
|
|
156 | (1) |
|
|
157 | (1) |
11 Non-parametric and related methods, |
|
158 | (12) |
|
11.1 Assumptions underlying the t-tests and their extensions, |
|
|
158 | (1) |
|
11.2 Homogeneity of variance, |
|
|
158 | (1) |
|
11.3 The assumption of normality, |
|
|
159 | (2) |
|
11.4 Non-normality and transformations, |
|
|
161 | (3) |
|
11.5 Non-parametric tests, |
|
|
164 | (4) |
|
11.5.1 The Mann-Whitney U-test, |
|
|
164 | (2) |
|
11.5.2 The Wilcoxon signed rank test, |
|
|
166 | (1) |
|
|
167 | (1) |
|
11.6 Advantages and disadvantages of non-parametric methods, |
|
|
168 | (1) |
|
|
169 | (1) |
12 Equivalence and non-inferiority, |
|
170 | (19) |
|
12.1 Demonstrating similarity, |
|
|
170 | (2) |
|
12.2 Confidence intervals for equivalence, |
|
|
172 | (1) |
|
12.3 Confidence intervals for non-inferiority, |
|
|
173 | (1) |
|
|
174 | (2) |
|
|
176 | (2) |
|
|
178 | (1) |
|
|
179 | (5) |
|
|
179 | (1) |
|
12.7.2 Therapeutic equivalence, |
|
|
180 | (1) |
|
|
180 | (2) |
|
12.7.4 The 10 per cent rule for cure rates, |
|
|
182 | (1) |
|
12.7.5 The synthesis method, |
|
|
183 | (1) |
|
12.8 Biocreep and constancy, |
|
|
184 | (1) |
|
12.9 Sample size calculations, |
|
|
184 | (2) |
|
12.10 Switching between non-inferiority and superiority, |
|
|
186 | (3) |
13 The analysis of survival data, |
|
189 | (16) |
|
13.1 Time-to-event data and censoring, |
|
|
189 | (1) |
|
13.2 Kaplan-Meier curves, |
|
|
190 | (3) |
|
13.2.1 Plotting Kaplan-Meier curves, |
|
|
190 | (2) |
|
13.2.2 Event rates and relative risk, |
|
|
192 | (1) |
|
13.2.3 Median event times, |
|
|
192 | (1) |
|
13.3 Treatment comparisons, |
|
|
193 | (3) |
|
|
196 | (3) |
|
|
196 | (1) |
|
13.4.2 Constant hazard ratio, |
|
|
197 | (1) |
|
13.4.3 Non-constant hazard ratio, |
|
|
197 | (1) |
|
13.4.4 Link to survival curves, |
|
|
198 | (1) |
|
13.4.5 Calculating Kaplan-Meier curves, |
|
|
199 | (1) |
|
|
199 | (3) |
|
13.5.1 Stratified methods, |
|
|
200 | (1) |
|
13.5.2 Proportional hazards regression, |
|
|
200 | (1) |
|
13.5.3 Accelerated failure time model, |
|
|
201 | (1) |
|
13.6 Independent censoring, |
|
|
202 | (1) |
|
13.7 Sample size calculations, |
|
|
203 | (2) |
14 Interim analysis and data monitoring committees, |
|
205 | (10) |
|
14.1 Stopping rules for interim analysis, |
|
|
205 | (1) |
|
14.2 Stopping for efficacy and futility, |
|
|
206 | (4) |
|
|
206 | (1) |
|
14.2.2 Futility and conditional power, |
|
|
207 | (1) |
|
14.2.3 Some practical issues, |
|
|
208 | (1) |
|
14.2.4 Analyses following completion of recruitment, |
|
|
209 | (1) |
|
|
210 | (1) |
|
14.4 Data monitoring committees, |
|
|
211 | (4) |
|
14.4.1 Introduction and responsibilities, |
|
|
211 | (1) |
|
14.4.2 Structure and process, |
|
|
212 | (2) |
|
14.4.3 Meetings and recommendations, |
|
|
214 | (1) |
15 Bayesian statistics, |
|
215 | (10) |
|
|
215 | (1) |
|
15.2 Prior and posterior distributions, |
|
|
215 | (4) |
|
|
215 | (2) |
|
15.2.2 Prior to posterior, |
|
|
217 | (1) |
|
|
217 | (2) |
|
|
219 | (2) |
|
15.3.1 Frequentist methods, |
|
|
219 | (1) |
|
15.3.2 Posterior probabilities, |
|
|
219 | (1) |
|
15.3.3 Credible intervals, |
|
|
220 | (1) |
|
|
221 | (1) |
|
15.5 History and regulatory acceptance, |
|
|
222 | (2) |
|
|
224 | (1) |
16 Adaptive designs, |
|
225 | (16) |
|
16.1 What are adaptive designs?, |
|
|
225 | (3) |
|
16.1.1 Advantages and drawbacks, |
|
|
225 | (1) |
|
16.1.2 Restricted adaptations, |
|
|
226 | (1) |
|
16.1.3 Flexible adaptations, |
|
|
227 | (1) |
|
|
228 | (4) |
|
16.2.1 Control of type I error, |
|
|
228 | (1) |
|
|
229 | (1) |
|
16.2.3 Behavioural issues, |
|
|
230 | (2) |
|
16.2.4 Exploratory trials, |
|
|
232 | (1) |
|
16.3 Unblinded sample size re-estimation, |
|
|
232 | (2) |
|
16.3.1 Product of p-values, |
|
|
232 | (1) |
|
16.3.2 Weighting the two parts of the trial, |
|
|
233 | (1) |
|
|
234 | (1) |
|
16.4 Seamless phase II/III studies, |
|
|
234 | (2) |
|
16.4.1 Standard framework, |
|
|
234 | (1) |
|
16.4.2 Aspects of the p-value calculation, |
|
|
235 | (1) |
|
16.4.3 Logistical challenges, |
|
|
236 | (1) |
|
16.5 Other types of adaptation, |
|
|
236 | (2) |
|
16.5.1 Changing the primary endpoint, |
|
|
236 | (1) |
|
16.5.2 Focusing on a sub-population, |
|
|
237 | (1) |
|
16.5.3 Dropping the placebo arm in a non-inferiority trial, |
|
|
237 | (1) |
|
16.6 Further regulatory considerations, |
|
|
238 | (3) |
|
|
238 | (1) |
|
16.6.2 Non-standard experimental settings, |
|
|
239 | (2) |
17 Observational studies, |
|
241 | (20) |
|
|
241 | (6) |
|
17.1.1 Non-randomised comparisons, |
|
|
241 | (1) |
|
|
241 | (2) |
|
|
243 | (1) |
|
17.1.4 An empirical investigation, |
|
|
244 | (1) |
|
17.1.5 Selection bias in concurrently controlled studies: An empirical evaluation, |
|
|
245 | (1) |
|
17.1.6 Selection bias in historically controlled studies: An empirical evaluation, |
|
|
246 | (1) |
|
|
246 | (1) |
|
17.2 Guidance on design, conduct and analysis, |
|
|
247 | (2) |
|
17.2.1 Regulatory guidance, |
|
|
247 | (1) |
|
17.2.2 Strengthening the Reporting of Observational Studies in Epidemiology, |
|
|
248 | (1) |
|
17.3 Evaluating and adjusting for selection bias, |
|
|
249 | (8) |
|
|
249 | (1) |
|
17.3.2 Adjusting for imbalances using stratification and analysis of covariance, |
|
|
250 | (1) |
|
17.3.3 Propensity scores, |
|
|
250 | (3) |
|
17.3.4 Different methods for adjustment: An empirical evaluation, |
|
|
253 | (3) |
|
|
256 | (1) |
|
17.4 Case-control studies, |
|
|
257 | (4) |
|
|
257 | (2) |
|
17.4.2 Odds ratio and Relative risk, |
|
|
259 | (2) |
18 Meta-analysis, |
|
261 | (16) |
|
|
261 | (2) |
|
|
263 | (1) |
|
18.3 Statistical methodology, |
|
|
264 | (6) |
|
18.3.1 Methods for combination, |
|
|
264 | (1) |
|
18.3.2 Confidence intervals, |
|
|
265 | (1) |
|
18.3.3 Fixed and random effects, |
|
|
265 | (1) |
|
18.3.4 Graphical methods, |
|
|
266 | (1) |
|
18.3.5 Detecting heterogeneity, |
|
|
266 | (3) |
|
|
269 | (1) |
|
|
269 | (1) |
|
18.3.8 Individual patient data, |
|
|
269 | (1) |
|
|
270 | (1) |
|
18.5 Ensuring scientific validity, |
|
|
271 | (4) |
|
|
271 | (2) |
|
18.5.2 Assessing the risk of bias, |
|
|
273 | (1) |
|
18.5.3 Publication bias and funnel plots, |
|
|
273 | (2) |
|
18.5.4 Preferred Reporting Items for Systematic Reviews and Meta-Analyses, |
|
|
275 | (1) |
|
18.6 Further regulatory aspects, |
|
|
275 | (2) |
19 Methods for the safety analysis and safety monitoring, |
|
277 | (27) |
|
|
277 | (2) |
|
19.1.1 Methods for safety data, |
|
|
277 | (1) |
|
19.1.2 The rule of three, |
|
|
278 | (1) |
|
19.2 Routine evaluation in clinical studies, |
|
|
279 | (10) |
|
|
280 | (1) |
|
|
281 | (3) |
|
|
284 | (3) |
|
|
287 | (1) |
|
|
288 | (1) |
|
19.2.6 Safety summary across trials, |
|
|
288 | (1) |
|
19.2.7 Specific safety studies, |
|
|
289 | (1) |
|
19.3 Data monitoring committees, |
|
|
289 | (1) |
|
19.4 Assessing benefit-risk, |
|
|
290 | (9) |
|
19.4.1 Current approaches, |
|
|
290 | (1) |
|
19.4.2 Multi-criteria decision analysis, |
|
|
291 | (6) |
|
19.4.3 Quality-Adjusted Time without Symptoms or Toxicity, |
|
|
297 | (2) |
|
|
299 | (5) |
|
19.5.1 Post-approval safety monitoring, |
|
|
299 | (1) |
|
19.5.2 Proportional reporting ratios, |
|
|
300 | (2) |
|
19.5.3 Bayesian shrinkage, |
|
|
302 | (2) |
20 Diagnosis, |
|
304 | (12) |
|
|
304 | (1) |
|
20.2 Measures of diagnostic performance, |
|
|
304 | (4) |
|
20.2.1 Sensitivity and specificity, |
|
|
304 | (1) |
|
20.2.2 Positive and negative predictive value, |
|
|
305 | (1) |
|
20.2.3 False positive and false negative rates, |
|
|
306 | (1) |
|
|
306 | (1) |
|
|
307 | (1) |
|
20.2.6 Predictive accuracy, |
|
|
307 | (1) |
|
20.2.7 Choosing the correct cut-point, |
|
|
307 | (1) |
|
20.3 Receiver operating characteristic curves, |
|
|
308 | (2) |
|
20.3.1 Receiver operating characteristic, |
|
|
308 | (1) |
|
20.3.2 Comparing ROC curves, |
|
|
309 | (1) |
|
20.4 Diagnostic performance using regression models, |
|
|
310 | (2) |
|
20.5 Aspects of trial design for diagnostic agents, |
|
|
312 | (1) |
|
20.6 Assessing agreement, |
|
|
313 | (3) |
|
20.6.1 The kappa statistic, |
|
|
313 | (1) |
|
20.6.2 Other applications for kappa, |
|
|
314 | (2) |
21 The role of statistics and statisticians, |
|
316 | (15) |
|
21.1 The importance of statistical thinking at the design stage, |
|
|
316 | (1) |
|
21.2 Regulatory guidelines, |
|
|
317 | (4) |
|
21.3 The statistics process, |
|
|
321 | (6) |
|
21.3.1 The statistical methods section of the protocol, |
|
|
321 | (1) |
|
21.3.2 The statistical analysis plan, |
|
|
322 | (1) |
|
21.3.3 The data validation plan, |
|
|
322 | (1) |
|
|
322 | (1) |
|
21.3.5 Statistical analysis, |
|
|
323 | (1) |
|
21.3.6 Reporting the analysis, |
|
|
323 | (1) |
|
|
324 | (2) |
|
21.3.8 Sensitivity and robustness, |
|
|
326 | (1) |
|
21.4 The regulatory submission, |
|
|
327 | (1) |
|
21.5 Publications and presentations, |
|
|
328 | (3) |
References |
|
331 | (8) |
Index |
|
339 | |