title

subtitle

author

date

output

Analytical Plan for Time-adjusted effect of socioeconomic status in mortality rates after brain injury: cohort study

DOCUMENT: SAP-2023-016-BH-v01

**From:** Felipe Figueiredo **To:** Brennan Hickson

2023-05-26

html_document

pdf_document

word_document

fig_caption	fig_height	fig_width	keep_md	number_sections	toc
true	6	6	true	true	true

number_sections	toc
true	true

fig_caption	fig_height	fig_width	reference_docx	toc
true	6	6	misc/style_SAP_en.docx	true

\newpage

Analytical Plan for Time-adjusted effect of socioeconomic status in mortality rates after brain injury: cohort study

Document version

Version	Alterations
01	Initial version

Abbreviations

FIM: Functional Independence Measure
HR: hazards ratio
SD: standard deviation
SES: socioeconomic status

Context

Objectives

To determine the effect of socioeconomic status of the neighborhood on mortality of patients with brain injury, accounting for time-dependent covariates.

Hypotheses

The neighborhood to which an individual is discharged from acute care affects the mortality rates of individuals that suffered a brain injury.

Data

Raw data

The raw data was received in three distinct databases, one containing clinical and epidemiological data, a second one with follow up on the status of individuals and a third one containing SES information of each Zip code. The clinical and followup tables were merged by study ID, and this was joined with the SES table by the Zip code of the neighborhood of discharge, resulting in the original raw data base.

The original data base had 711 variables collected on 76665 observations from 19303 individuals.

Analytical dataset

The date of death of participants was available, and its presence was used as an indicator of the living status of the participant at each follow up. All variables in the raw dataset had varying missing data codes in the data dictionary made available by the researcher, which were used to attribute missingness status to each datum. Most of the categorical variables were measured with many levels that were condensed into fewer levels for analysis. The quartiles for each of the both FIM scores were calculated for each individual and saved as two separate variables as covariates for the analysis.

After the cleaning process 23 variables were included in the analysis. The total number of observations excluded due to incompleteness and exclusion criteria will be reported in the analysis.

All variables in the analytical set were labeled according to the raw data provided and values were labeled according to the data dictionary for the preparation of production-quality results tables and figures.

Study parameters

Study design

This is a retrospective analysis of a prospective cohort study.

Inclusion and exclusion criteria

Inclusion criteria

Participants with at most 10 years of follow up;
Participants included in the cohort between 2010-01-01 and 2018-12-31.
Participants with non-negative time under observation.

Exclusion criteria

Observations after 2019-12-31 will be excluded in order to mitigate risk of confounding by COVID-19 related deaths. Observations prior to this date will still be considered for participants where such data is available.

Exposures

SES of the neighborhood to which the participant was discharged. The SES measure was stratified into its quintiles, and labelled according to the data dictionary to facilitate interpretation of the results.

Outcomes

Specification of outcome measures (Zarin, 2011):

(Domain) Mortality
(Specific measurement) Death
(Specific metric) Time-to-event
(Method of aggregation) Hazard ratio

Primary outcome

Death after a brain injury.

Covariates

Covariates will be included in the models in sets (section 5.1.3).

Demographic covariates:

Sex
Race
Age at injury
Education
Employment status

Clinical covariates:

Cause of injury
Substance Problem Use
Previous seizure disorder diagnosis
Spinal cord injury
Primary rehabilitation payer
Days From Injury to Rehab Discharge
FIM Motor at Discharge quartiles
FIM Cognitive at Discharge quartiles

Geographical covariates:

Rural area
Residence after rehab discharge

Statistical methods

Statistical analyses

Descriptive analyses

The epidemiological profile of the study participants will be described. Demographic and clinical variables will be described as mean (SD) or as counts and proportions (%), as appropriate. The distributions of participants' characteristics will be summarized in tables and visualized in exploratory plots.

Inferential analyses

All inferential analyses will be performed in the statistical models (described in the next section).

Statistical modeling

The hazard of mortality will be assessed with multivariate Cox regression models. In order to assess if there is an effect of the SES of the neighborhood to which the participant was discharged on mortality will be assessed with four nested models.

A crude estimate of the HR between each SES quintiles and mortality will be calculated as the basis of interpretation of the effect.
Model 2 will include demographic covariates on top of the crude estimate.
Model 3 will add clinical covariates on top of model 2.
Model 4 will add geographical covariates to model 3, i.e., it will be calculated adjusting for all covariates described in section 4.5.

The estimates on the association between SES quintiles and mortality will be compared across models in a sensitivity analysis to gauge the cumulative effect of each set of covariates.

The proportional hazards assumption will be tested by assessing the the distribution of Schoenfeld residuals against time, the Schoenfeld test and the distribution of Martingale residuals of time-dependent covariates against the null model. In order to keep the maximum number of time-dependent covariates in the final model, we plan to perform the residual analysis combining the following approaches: stratification by categorical variables, adding explicit interactions with time, time splitting and non-linear fitting of covariates using polynomials or splines. Time-dependent covariates that fail these approaches will be removed before the final model is evaluated and described.

Missing data

No missing data imputation will be performed. All evaluations will be performed as complete case analyses. Missing data counts and proportions will be reported in tables.

Significance and Confidence Intervals

All analyses will be performed using the significance level of 5%. All significance hypothesis tests and confidence intervals computed will be two-tailed.

Study size and Power

N/A

Statistical packages

This analysis will be performed using statistical software R version 4.3.0.

Observations and limitations

Recommended reporting guideline

The adoption of the EQUATOR network (http://www.equator-network.org/) reporting guidelines have seen increasing adoption by scientific journals. All observational studies are recommended to be reported following the STROBE guideline (von Elm et al, 2014).

References

SAR-2023-016-BH-v01 -- Time-adjusted effect of socioeconomic status in mortality rates after brain injury: cohort study

Zarin DA, et al. The ClinicalTrials.gov results database -- update and key issues. N Engl J Med 2011;364:852-60 (https://doi.org/10.1056/NEJMsa1012065).
Gamble C, et al. Guidelines for the Content of Statistical Analysis Plans in Clinical Trials. JAMA. 2017;318(23):2337–2343 (https://doi.org/10.1001/jama.2017.18556).
von Elm E, Altman DG, Egger M, Pocock SJ, Gøtzsche PC, Vandenbroucke JP; STROBE Initiative. The Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) Statement: guidelines for reporting observational studies. Int J Surg. 2014 Dec;12(12):1495-9 (https://doi.org/10.1016/j.ijsu.2014.07.013).

Appendix

This document was elaborated following recommendations on the structure for Statistical Analysis Plans (Gamble, 2017) for better transparency and clarity.

Availability

All documents from this consultation were included in the consultant's Portfolio.

The portfolio is available at:

https://philsf-biostat.github.io/SAR-2023-004-BH/

Associated analyses

This analysis is part of a larger project and is supported by other analyses, linked below.

Effect of socioeconomic status in mortality rates after brain injury: cohort study

https://philsf-biostat.github.io/SAR-2023-004-BH/

Sensitivity of mortality rates to the imputation of missing socioeconomic data: cohort study

https://philsf-biostat.github.io/SAR-2023-017-BH/

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SAP-2023-016-BH-v01.md

SAP-2023-016-BH-v01.md

Abbreviations

Context

Objectives

Hypotheses

Data

Raw data

Analytical dataset

Study parameters

Study design

Inclusion and exclusion criteria

Exposures

Outcomes

Covariates

Statistical methods

Statistical analyses

Descriptive analyses

Inferential analyses

Statistical modeling

Missing data

Significance and Confidence Intervals

Study size and Power

Statistical packages

Observations and limitations

References

Appendix

Availability

Associated analyses

Files

SAP-2023-016-BH-v01.md

Latest commit

History

SAP-2023-016-BH-v01.md

File metadata and controls

Abbreviations

Context

Objectives

Hypotheses

Data

Raw data

Analytical dataset

Study parameters

Study design

Inclusion and exclusion criteria

Exposures

Outcomes

Covariates

Statistical methods

Statistical analyses

Descriptive analyses

Inferential analyses

Statistical modeling

Missing data

Significance and Confidence Intervals

Study size and Power

Statistical packages

Observations and limitations

References

Appendix

Availability

Associated analyses