-
Notifications
You must be signed in to change notification settings - Fork 3
/
Copy path02-scenario-II.Rmd
422 lines (313 loc) · 11.2 KB
/
02-scenario-II.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
# Scenario II: large effect, Gaussian prior {#scenarioII}
## Details
In this scenario, we revisit the case from [Scenario I](#scenarioI), but
are not assuming a point prior any more.
Instead, a Gaussian prior with mean $\vartheta = 0.4$ and
variance $\tau^2 = 0.2^2$ on the effect size is assumed, i.e.
$\delta \sim \mathcal{N} (0.4, 0.2^2)$.
In order to fulfill regulatory considerations, the type one error rate
is still protected under the point prior $\delta = 0$ at the level
of significance $\alpha = 0.025$.
The power constraint, however, needs to be modified.
It is not senseful to compute the power as rejection probability under
the full prior, because effect sizes less than a minimal clinically relevant
effect do not show (sufficient) evidence againt the null hypothesis.
Therefore, we assume a minimal clinically relevant effect size
$\delta = 0.0$ and condition the prior on values $\delta > 0$
to compute expected power.
In the following, the expected power should be at least $0.8$.
```{r}
# data distribution and priors
datadist <- Normal(two_armed = TRUE)
H_0 <- PointMassPrior(.0, 1)
prior <- ContinuousPrior(function(delta) dnorm(delta, mean = .4, sd = .2),
support = c(-5, 5),
tighten_support = TRUE)
# define constraints on type one error rate and expected power
alpha <- 0.025
min_epower <- 0.8
toer_cnstr <- Power(datadist, H_0) <= alpha
epow_cnstr <- Power(datadist, condition(prior, c(0.0, prior@support[2]))) >= min_epower
```
## Variant II-1: Minimizing Expected Sample Size under Point Prior {#variantII_1}
### Objective
Expected sample size under the full prior is minimized, i.e.,
$\boldsymbol{E}\big[n(\mathcal{D})\big]$.
```{r}
ess <- ExpectedSampleSize(datadist, prior)
```
### Constraints
No additional constraints are considered in this variant.
### Initial Design
For this example, the optimal one-stage, group-sequential, and generic
two-stage designs are computed.
While the initial design for the one-stage case is determined heuristically,
both the group sequential and the generic two-stage designs are
optimized starting from the a group-sequential design that is computed by
the `rpact` package to fulfill the type one error rate constraint and
that fulfills the power constraint at an effect size of $\delta = 0.3$.
```{r}
order <- 5L
# data frame of initial designs
tbl_designs <- tibble(
type = c("one-stage", "group-sequential", "two-stage"),
initial = list(
OneStageDesign(250, 2.0),
rpact_design(datadist, 0.3, 0.025, 0.8, TRUE, order),
TwoStageDesign(rpact_design(datadist, 0.3, 0.025, 0.8, TRUE, order))) )
```
The order of integration is set to `r order`.
### Optimization
For all these three initial designs, the resulting optimal designs are
computed.
```{r}
tbl_designs <- tbl_designs %>%
mutate(
optimal = purrr::map(initial, ~minimize(
ess,
subject_to(
toer_cnstr,
epow_cnstr
),
initial_design = .,
opts = opts)) )
```
### Test Cases
Firstly, it is checked that the maximum number of iterations was not reached
in all these cases.
```{r}
tbl_designs %>%
transmute(
type,
iterations = purrr::map_int(tbl_designs$optimal,
~.$nloptr_return$iterations) ) %>%
{print(.); .} %>%
{testthat::expect_true(all(.$iterations < opts$maxeval))}
```
Since type one error rate is defined under the point effect size $\delta=0$,
the type one error rate constraint can be tested for all three optimal designs.
```{r}
tbl_designs %>%
transmute(
type,
toer = purrr::map(tbl_designs$optimal,
~sim_pr_reject(.[[1]], .0, datadist)$prob) ) %>%
unnest(., cols = c(toer)) %>%
{print(.); .} %>% {
testthat::expect_true(all(.$toer <= alpha * (1 + tol))) }
```
Since the optimal two-stage design is more flexible than the optimal
group-sequential design (constant $n_2$ function) and this is
more flexible than the optimal one-stage design (no second stage),
the expected sample sizes under the prior should be ordered in the opposite way.
Additionally, expected sample sizes under the null hypothesis
are computed both via `evaluate()` and simulation-based.
```{r}
essh0 <- ExpectedSampleSize(datadist, H_0)
tbl_designs %>%
mutate(
ess = map_dbl(optimal,
~evaluate(ess, .$design) ),
essh0 = map_dbl(optimal,
~evaluate(essh0, .$design) ),
essh0_sim = map_dbl(optimal,
~sim_n(.$design, .0, datadist)$n ) ) %>%
{print(.); .} %>% {
# sim/evaluate same under null?
testthat::expect_equal(.$essh0, .$essh0_sim,
tolerance = tol_n,
scale = 1)
# monotonicity with respect to degrees of freedom
testthat::expect_true(all(diff(.$ess) < 0)) }
```
## Variant II-2: Minimizing Expected Sample Size under Null Hypothesis {#variantII_2}
### Objective
Expected sample size conditioned on negative effect sizes is minimized, i.e.,
```{r}
ess_0 <- ExpectedSampleSize(datadist, condition(prior, c(bounds(prior)[1], 0)))
```
### Constraints
No additional constraints besides type one error rate and expected power
are considered in this variant.
### Initial Design
As in [Variant I.2](#variantI_2) another initial design is more appropriate
for optimization under the null hypothesis.
In this situation, one may expect a different (increasing) sample size function,
and thus also a different shape of the $c_2$ function.
Therefore, the `rpact` initial design is a suboptimal starting point.
Instead, we start with a constant $c_2$ function by heuristically
setting it to $2$ on the continuation area.
Since optimization under the null hypothesis favours extremely conservative
boundaries for early efficacy stopping we impose as quite liberal upper bound
of $3$ for early efficacy stopping.
```{r}
init_design_h0 <- tbl_designs %>%
filter(type == "two-stage") %>%
pull(initial) %>%
.[[1]]
init_design_h0@c2_pivots <- rep(2, order)
ub_design <- TwoStageDesign(
3 * init_design_h0@n1,
2,
3,
rep(600, order),
rep(3.0, order)
)
```
### Optimization
```{r}
opt_neg <- minimize(
ess_0,
subject_to(
toer_cnstr,
epow_cnstr
),
initial_design = init_design_h0,
upper_boundary_design = ub_design,
opts = opts
)
```
### Test Cases
First of all, check if the optimization algorithm converged.
To avoid improper solutions, it is first verified that the maximum
number of iterations was not exceeded in any of the three cases.
```{r}
testthat::expect_true(opt_neg$nloptr_return$iterations < opts$maxeval)
print(opt_neg$nloptr_return$iterations)
```
Again, the type one error rate under the point null hypothesis $\delta = 0$
can be tested by simulation.
```{r}
tbl_toer <- tibble(
toer = evaluate(Power(datadist, H_0), opt_neg$design),
toer_sim = sim_pr_reject(opt_neg$design, .0, datadist)$prob
)
print(tbl_toer)
testthat::expect_true(tbl_toer$toer <= alpha * (1 + tol))
testthat::expect_true(tbl_toer$toer_sim <= alpha * (1 + tol))
```
Furthermore, the expected sample size under the prior conditioned on negative
effect sizes ($\delta \leq 0$) should be lower for the optimal design derived
in this variant than for the optimal design from [Variant II.1](#variantII_1)
where expected sample size under the full prior was minimized.
```{r}
testthat::expect_lte(
evaluate(ess_0, opt_neg$design),
evaluate(
ess_0,
tbl_designs %>%
filter(type == "two-stage") %>%
pull(optimal) %>%
.[[1]] %>%
.$design )
)
```
## Variant II-3: Conditional Power Constraint {#variantII_3}
### Objective
As in [Variant II-1](#variantII_1), expected sample size under the full prior
is minimized.
### Constraints
In addition to the constraints on type one error rate and expected power,
a constraint on conditional power to be larger than $0.7$ is included.
```{r}
cp <- ConditionalPower(datadist, condition(prior, c(0, prior@support[2])))
cp_cnstr <- cp >= 0.7
```
### Initial Design
The previous initial design can still be applied.
### Optimization
```{r}
opt_cp <- minimize(
ess,
subject_to(
toer_cnstr,
epow_cnstr,
cp_cnstr
),
initial_design = tbl_designs$initial[[3]],
opts = opts
)
```
### Test Cases
We start checking whether the maximum number of iterations was not reached.
```{r}
print(opt_cp$nloptr_return$iterations)
testthat::expect_true(opt_cp$nloptr_return$iterations < opts$maxeval)
```
The type one error rate is tested via simulation and compared
to the value obtained by `evaluate()`.
```{r}
tbl_toer <- tibble(
toer = evaluate(Power(datadist, H_0), opt_cp$design),
toer_sim = sim_pr_reject(opt_cp$design, .0, datadist)$prob
)
print(tbl_toer)
testthat::expect_true(tbl_toer$toer <= alpha * (1 + tol))
testthat::expect_true(tbl_toer$toer_sim <= alpha * (1 + tol))
```
The conditional power is evaluated via numerical integration on several points
inside the continuation region and it is tested whether the constraint is
fulfilled on all these points.
```{r}
tibble(
x1 = seq(opt_cp$design@c1f, opt_cp$design@c1e, length.out = 25),
cp = map_dbl(x1, ~evaluate(cp, opt_cp$design, .)) ) %>%
{print(.); .} %>% {
testthat::expect_true(all(.$cp >= 0.7 * (1 - tol))) }
```
Due to the additional constraint in comparison to [Variant II.1](#variantII_1),
Variant II.3 should show a larger expected sample size under the prior than
Variant II.1
```{r}
testthat::expect_gte(
evaluate(ess, opt_cp$design),
evaluate(
ess,
tbl_designs %>%
filter(type == "two-stage") %>%
pull(optimal) %>%
.[[1]] %>%
.$design )
)
```
## Plot Two-Stage Designs
The optimal two-stage designs stemming from the different variants
are plotted together.
```{r, echo=FALSE}
x1 <- seq(0, 4, by = .01)
tibble(
type = c(
"ESS under prior",
"ESS under null",
"ESS under prior + CP constraint" ),
design = list(
tbl_designs %>%
filter(type == "two-stage") %>%
pull(optimal) %>%
.[[1]] %>%
.$design,
opt_neg$design,
opt_cp$design ) ) %>%
group_by(type) %>%
do(
x1 = x1,
n = adoptr::n(.$design[[1]], x1),
c2 = c2(.$design[[1]], x1),
CP = evaluate(cp, .$design[[1]], x1) ) %>%
unnest(., cols = c(x1, n, c2, CP)) %>%
mutate(
section = ifelse(
is.finite(c2),
"continuation",
ifelse(c2 == -Inf, "efficacy", "futility") ) ) %>%
gather(variable, value, n, c2, CP) %>%
ggplot(aes(x1, value, color = type)) +
geom_line(aes(group = interaction(section, type))) +
facet_wrap(~variable, scales = "free_y") +
labs(y = "", x = expression(x[1])) +
scale_color_discrete("") +
theme_bw() +
theme(
panel.grid = element_blank(),
legend.position = "bottom" )
```