-
Notifications
You must be signed in to change notification settings - Fork 8
/
Copy pathREADME.Rmd
147 lines (109 loc) · 7.52 KB
/
README.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
---
output: github_document
---
```{r, include = FALSE}
knitr::opts_chunk$set(
collapse = TRUE,
comment = "#>"
)
```
```{r setup, echo=FALSE, message = FALSE, warning = FALSE}
library(tfrmt)
library(gt)
```
# tfrmt <a href='https://gsk-biostatistics.github.io/tfrmt/'><img src="man/figures/tfrmt.png" align="right" alt = "tfrmt logo" style="height:139px;"/></a>
<!-- badges: start -->
[![R-CMD-check](https://github.com/GSK-Biostatistics/tfrmt/actions/workflows/R-CMD-check.yaml/badge.svg)](https://github.com/GSK-Biostatistics/tfrmt/actions/workflows/R-CMD-check.yaml)
[![Codecov test coverage](https://codecov.io/gh/GSK-Biostatistics/tfrmt/branch/main/graph/badge.svg)](https://app.codecov.io/gh/GSK-Biostatistics/tfrmt?branch=main)
[![status: experimental](https://github.com/GIScience/badges/raw/master/status/experimental.svg)](https://github.com/GIScience/badges#experimental)
[![CRAN status](https://www.r-pkg.org/badges/version/tfrmt)](https://CRAN.R-project.org/package=tfrmt)
<!-- badges: end -->
The tfrmt package provides a language for defining display-related metadata, which can then be used to automate and easily update output formats.
In clinical trials, displays are generally quite standard, but frequent, highly specific formatting tweaks (e.g., rounding, footnotes, headers) are very common. Prior to data analysis, study teams often generate mock displays to represent the desired end product for sponsors to approve or programmers to replicate. This process is typically highly manual and separate from the programming itself. There is also a high importance placed on verifying the accuracy of the results via a QC (Quality Control) process such as double programming. Finally, there is a movement toward an industry standard data structure for Analysis Results Data "ARD", which means analysis results datasets will have consistent structures and column names. Specifically, the ARD is long, with 1 record per computed value. For more information about ARDs click [here](https://pharmasug.org/download/sde/rtp2021/PharmaSUG-NCSDE_2021-08.pdf).
tfrmt supports a vision where:
- Mock displays are integrated with the programming workflow
- Results are QC'ed prior to formatting to reduce rework
- Standard formatting styles can be applied in as little as one line of code
- The ARD structure can be leveraged to accommodate a variety of tables
By reducing the amount of repetitive tasks, study teams can focus on the quality and interpretation of the results themselves.
# Why tfrmt?
While there are many existing table-making packages in the R ecosystem, they typically fall into one of two categories:
- Table packages that perform analyses and format the results
- Table packages that format and output existing data
By design, tfrmt is more of the latter, as it is intended to be used after the results have been computed. What makes tfrmt unique, however, is that it offers an intuitive interface for defining and layering standard or custom formats that are often specific to clinical trials. It also offers the novel ability to easily generate mock displays using metadata that will be used for the actual displays. tfrmt is built on top of the powerful gt package, which is intended to support a variety of output formats in the future.
# Installation
The development version of tfrmt can be installed with:
```{r eval =FALSE}
devtools::install_github("GSK-Biostatistics/tfrmt")
```
# Input data structure
We expect an input dataset that is long, with 1 record per computed value. Required columns include:
- [Optional] 1 or more **group** columns, containing grouping values
- A single **label** column, containing row label values
- 1 or more **column** columns, containing column values
- A single **param** column, which provides a label for distinct types of values
- A single **value** column, containing the computed, raw data values
- [Optional] 1 or more **sorting_cols** columns, containing numeric values to be used in the row ordering
# Functionality
Here is an overview of what is possible with tfrmt:
- Create a "tfrmt" metadata object containing all formatting and labelling for the display
- Create mock displays based on existing sample data or no prior data
- ARD-standard compliant facilitates reuse and automation
Other benefits of tfrmt:
- Provides a tidyverse-friendly, pipeable interface
- Leverages gt as output engine, which allows for further customizations within gt itself
# More Info
For more information about how to build your own tfrmt mocks/tables (like the one below!), please explore the [vignettes](https://gsk-biostatistics.github.io/tfrmt/articles/examples.html).
```{r echo = FALSE, message = FALSE, warning = FALSE}
library(dplyr)
data_demog <- data_demog %>%
select(-value)
tfrmt <- tfrmt(
# specify columns in the data
group = vars(rowlbl1,grp),
label = rowlbl2,
column = column,
param = param,
value = value,
sorting_cols = vars(ord1, ord2),
# Specify body plan
body_plan = body_plan(
frmt_structure(group_val = ".default", label_val = ".default", frmt_combine("{n} {pct}",
n = frmt("xxx"),
pct = frmt_when("==100" ~ "",
"==0" ~ "",
TRUE ~ frmt("(xx.x %)")))),
frmt_structure(group_val = ".default", label_val = "n", frmt("xxx")),
frmt_structure(group_val = ".default", label_val = c("Mean", "Median", "Min","Max"), frmt("xxx.x")),
frmt_structure(group_val = ".default", label_val = "SD", frmt("xxx.xx")),
frmt_structure(group_val = ".default", label_val = ".default", p = frmt("")),
frmt_structure(group_val = ".default", label_val = c("n","<65 yrs","<12 months","<25"), p = frmt_when(">0.99" ~ ">0.99",
"<0.001" ~ "<0.001",
TRUE ~ frmt("x.xxx", missing = "")))
),
# Specify row group plan
row_grp_plan = row_grp_plan(
row_grp_structure(group_val = ".default", element_block(post_space = " ")),
label_loc = element_row_grp_loc(location = "column")
),
# Specify column style plan
col_style = col_style_plan(
col_style_structure(align = c(".",","," "), col = vars(everything()))
),
# remove extra cols
col_plan = col_plan(Placebo,
starts_with("Xanom"),
Total,
-grp,
-starts_with("ord")
)
)
gt_out <- print_mock_gt(tfrmt, data_demog)
invisible(gtsave(gt_out, "man/figures/gt_readme.png", vwidth = 999))
```
![Example GT Demography Table](man/figures/gt_readme.png)
# Recorded Presentations and Shared Materials
[R/Pharma 2022 Day 1: Christina Fillmore - Why do I spend all my life formatting tables?](https://www.youtube.com/watch?v=00lGhuANUJw)
[R/Pharma 2023: Becca Krouse - Everyone's Invited: A Case Study on Bridging the Usability Gap](https://www.youtube.com/watch?v=Zg1LPJSO0kQ)
[R/Pharma 2022 Workshop: Christina Fillmore, Ellis Hughes, Thomas Neitmann - Clinical Reporting in R (Day 2)](https://www.youtube.com/watch?v=rYflZhFDSZQ)
[R/Pharma 2023 Workshop: Thomas Neitmann, Pawel Rucki, Ellis Hughes - Leveraging and contributing to the the pharmaverse for clinical trial reporting in R](https://github.com/posit-conf-2023/r-pharma)