Abstract
Background
Risk-based quality management is a regulatory-recommended approach to manage risk in a clinical trial. A key element of this strategy is to conduct risk-based monitoring to detect potential risks to critical data and processes earlier. However, there are limited publicly available tools to perform the analytics required for this purpose. Good Statistical Monitoring is a new open-source solution developed to help address this need.
Methods
A team of statisticians, data scientists, clinicians, data managers, clinical operations, regulatory, and quality compliance staff collaborated to design Good Statistical Monitoring, an R package, to flexibly and efficiently implement end-to-end analyses of key risks. The package currently supports the mapping of clinical trial data from a variety of formats, evaluation of 12 key risk indicators, interactive visualization of analysis results, and creation of standardized reports.
Results
The Good Statistical Monitoring package is freely available on GitHub and empowers clinical study teams to proactively monitor key risks. It employs a modular workflow to perform risk assessments that can be customized by replacing any workflow component with a study-specific alternative. Results can be exported to other clinical systems or can be viewed as an interactive report to facilitate follow-up risk mitigation. Rigorous testing and qualification are performed as part of each release to ensure package quality.
Conclusions
Good Statistical Monitoring is an open-source solution designed to enable clinical study teams to implement statistical monitoring of critical risks, as part of a comprehensive risk-based quality management strategy.
Introduction
Clinical trials aim to evaluate the safety and efficacy of promising therapeutic candidates while protecting patients’ welfare and rights. To reliably achieve this objective, it is essential that both critical data and critical processes are of high quality. Traditional monitoring approaches, such as 100% source data verification and frequent site visits, have been shown to be less efficient than a risk-focused strategy [1,2,3]. Regulatory authorities recommend risk-based monitoring (RBM) as a superior alternative, given that it is a more adaptive and targeted approach to identifying, assessing, and mitigating study risks [4,5,6].
RBM is defined by five functional components – key risk indicators (KRIs), centralized monitoring, off-site/remote-site monitoring, reduced source data review, and reduced source data verification – and is part of a broader risk-based quality management framework that also includes initial cross-functional risk assessment, ongoing cross-functional risk assessment, and quality tolerance limits [7]. These components collectively enhance the effectiveness of monitoring, with proven benefits to trial quality, efficiency, patient safety, and overall value [8, 9]. Despite these advantages, adoption has been slower for risk detection components (32–35%) than for risk assessment components (78–80%), even with the increased need for remote risk detection during the recent COVID-19 pandemic [10].
One possible driver of slower adoption is the lack of effective, easy-to-use, and inexpensive tools for risk detection, relative to those available for risk assessment. Recent reviews have found a breadth of tools for assessing potential risks at the trial start-up stage, but only limited information on how to develop or implement published methods for detecting risk while a trial is ongoing [11,12,13]. In contrast, commercial and home-grown CRO solutions tend to be more sophisticated and include technical support, but are substantially more expensive to implement [14]. Given the proprietary nature of these systems, it is often difficult to share analysis findings or details of how the underlying risk detection algorithms work. This trade-off between quality and cost can leave trial sponsors in a difficult position, especially when there are limited trial resources to support RBM.
To address this gap, we introduce a new open-source R package, Good Statistical Monitoring ({gsm}), as a free, flexible, and reliable tool for risk detection in RBM. R was chosen because it is freely available and widely used by the clinical trial community. {gsm} provides an end-to-end framework for risk detection, from data ingestion and risk analysis through visualization and reporting. It includes a flexible mapping process capable of handling multiple data standards and leverages a modular workflow structure that can easily be adjusted for study-specific customizations. It is also thoroughly tested and qualified prior to each release.
Methods
{gsm} was designed based on a series of extensive discussions with clinicians, statisticians, data scientists, data managers, clinical operations, regulatory, and quality compliance staff, including reviews of existing tools and literature. The goal was to create a scalable and customizable analytics engine that could support an end-to-end workflow for risk detection including data ingestion, analysis, visualization, and reporting. Technical details, vignettes and example reports can be found at: https://gilead-biostats.github.io/gsm/index.html.
Development and testing of the functions in {gsm} relied primarily on two repositories of anonymized clinical trial data: {clindata} and {safetyData}. {safetyData} is an R package that reformats PHUSE’s sample ADaM and SDTM trial datasets [15]. {clindata} is a repository of anonymized and simulated clinical trial datasets from a variety of different sources and data formats [16].
Statistical analysis of KRIs in {gsm} relies on defining a numerator and a denominator for each metric (Table 1). Then, depending on whether the metric is a percentage or a rate, the user can select different statistical methods to be applied. The default method is to use a normal approximation for percentages and rates, with an adjustment for overdispersion, to calculate z-scores for flagging at-risk sites [17]. For a trial with \(m\) sites, where \(m > 2\), the adjusted z-score for site \(i\) can be defined as:

$$z_{i}^{{\prime}}=\frac{{y}_{i}-{\theta }_{0}}{\sqrt{V{\prime }\left(Y|{\theta }_{0}\right)}}$$

where \({y}_{i}\) is the KRI metric for site \(i\), \({\theta }_{0}\) is the overall mean, and \(V{\prime }\left(Y|{\theta }_{0}\right)\) is the over-dispersion adjusted variance. The over-dispersion parameter \(\varphi\) is calculated as the average of unadjusted squared z-scores: \(\varphi =\frac{1}{m}\sum _{i=1}^{m}{z}_{i}^{2}\). For percentages, the over-dispersion adjusted variance is \(V{\prime }\left(Y|{\theta }_{0}=p\right)=\varphi \frac{\widehat{p}\left(1-\widehat{p}\right)}{n_i}\), where \(\widehat{p}\) is the observed overall proportion of events and \(n_i\) is the total number of study participants at site \(i\). For rates, the over-dispersion adjusted variance is \(V{\prime }\left(Y|{\theta }_{0}=\lambda\right)=\varphi \frac{\widehat{\lambda }}{{T}_{i}}\), where \(\widehat{\lambda }\) is the observed exposure-adjusted incidence rate, defined as the total number of events divided by the total study exposure time, and \({T}_{i}\) is the total exposure time for participants at site \(i\). Alternatively, users can choose to perform Fisher’s exact tests for percentages and Poisson regression analyses for rates. More details can be found at https://gilead-biostats.github.io/gsm/articles/KRI%20Method.html.
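As a concrete illustration of the default method, the following base-R sketch computes over-dispersion adjusted z-scores for a site-level percentage KRI. The data and the ±2 flagging threshold are hypothetical, and this is a sketch of the statistical calculation only, not the {gsm} API:

```r
# Hypothetical site-level data: participants with an event (numerator)
# and enrolled participants (denominator) at each of m = 5 sites.
events <- c(4, 7, 1, 10, 3)
n      <- c(20, 25, 15, 30, 18)

p_hat   <- sum(events) / sum(n)          # overall proportion, theta_0
metric  <- events / n                    # site-level KRI metric, y_i
z_unadj <- (metric - p_hat) /
  sqrt(p_hat * (1 - p_hat) / n)          # unadjusted z-scores
phi     <- mean(z_unadj^2)               # over-dispersion parameter
z_adj   <- (metric - p_hat) /
  sqrt(phi * p_hat * (1 - p_hat) / n)    # over-dispersion adjusted z-scores

flagged <- abs(z_adj) > 2                # flag sites beyond a +/-2 cutoff
```

Note that the adjustment simply rescales each unadjusted z-score by \(1/\sqrt{\varphi }\), shrinking the scores toward zero when the data are more variable across sites than the binomial model assumes (\(\varphi > 1\)).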
Visualizations are built with R and JavaScript to create custom plots to depict analysis results. Interactive reports are produced as HTML documents using R Markdown. A detailed qualification report is automatically generated for each release using a set of machine-readable specifications and test cases to evaluate the expected performance of critical functions.
Results
The analysis of each KRI in {gsm} is defined as an assessment following a standard model: data are first inputted at the trial participant level, transformed into a site-level summary, analyzed to generate test statistics and p-values, flagged to identify sites that cross user-specified thresholds, and then summarized (Fig. 1). Optional customizable mapping functions are provided to convert trial data from a variety of possible sources and formats (ADaM, SDTM, raw, etc.) into the input data required for each assessment. Workflows expand upon assessments by adding further capabilities – support for country- or region-level analyses, analyses of data subsets, and automated data checking – and enable users to run a set of workflows at scale through a single function (Fig. 2). For example, a user can easily expand an analysis of AE reporting rates for all enrolled patients by adding filter functions to the same workflow, repeating the analysis for only the subset of participants who were randomized and treated, or for participants with a specific category of adverse events.
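The modular idea above can be sketched in a few lines of base R. All function, step, and column names here are hypothetical illustrations of the concept, not the actual {gsm} API:

```r
# A workflow is an ordered list of replaceable steps; swapping any
# component for a study-specific alternative leaves the rest untouched.
run_workflow <- function(data, steps) {
  Reduce(function(d, step) step(d), steps, init = data)
}

default_steps <- list(
  # participant-level input -> site-level numerator/denominator summary
  transform = function(d) {
    d$one <- 1
    aggregate(cbind(numerator = event, denominator = one) ~ site,
              data = d, FUN = sum)
  },
  # site-level metric (here a simple percentage)
  analyze = function(d) { d$metric <- d$numerator / d$denominator; d },
  # flag sites crossing a user-specified threshold
  flag = function(d) { d$flagged <- d$metric > 0.3; d }
)

set.seed(1)
participants <- data.frame(
  site       = rep(c("S01", "S02", "S03"), each = 20),
  randomized = rep(c(TRUE, FALSE), length.out = 60),
  event      = rbinom(60, 1, 0.2)
)

result <- run_workflow(participants, default_steps)

# Study-specific customization: prepend a filter step to repeat the same
# workflow for randomized participants only.
custom_steps <- c(list(filter = function(d) d[d$randomized, , drop = FALSE]),
                  default_steps)
result_randomized <- run_workflow(participants, custom_steps)
```

Because each step is just a function of a data frame, replacing the flagging rule, adding a subset analysis, or swapping the transform requires changing only that one element of the list.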
{gsm} supports the creation of multiple interactive visualizations leading to a better understanding of analysis results. For individual assessments, results can be depicted as a scatter plot or bar plot on different scales (Fig. 3). For an overview of results, a site-by-assessment heatmap can be generated to highlight the commonly flagged KRIs across sites or the sites with the most flagged KRIs. For assessments of a given site over time, longitudinal plots can be created to show changes in results over multiple analyses. To easily capture and share the analysis results and visualizations, users can create a standard report with supportive trial information and the ability to search, filter, or examine specific data points of interest in more detail.
The {gsm} R package has undergone extensive testing and qualification. As of v1.8.1, over 1,450 unit tests have been written, with 87.3% code coverage. A qualification report is automatically attached to each release, ensuring the package meets the expected standards and requirements for detecting study risks (Fig. 4). Qualification testing currently covers 24 core functions, evaluating 88 use cases across 171 total tests.
Discussion
An effective RBM approach requires the ability to accurately detect study risks in a timely manner. {gsm} is a free, open-source, qualified solution developed for that purpose. It covers every step from data ingestion to reporting and allows R users to perform these analyses in a few lines of code. The modular structure of assessments and workflows facilitates study-specific customizations, and interactive visualizations help users better understand analysis results. Early efforts implementing {gsm} at Gilead have proven successful: we detected risks similar to those found by other proprietary systems, and more easily performed fit-for-purpose analyses for study-specific nuances across a diverse set of pilot studies.
Compared to alternative tools for detecting risks as part of RBM, {gsm} offers a robust and effective solution for free. Among publicly available options, code to implement the proposed statistical methods may not exist and, when available, is usually provided in a piecemeal fashion or limited to a much narrower scope [18,19,20], making it difficult to detect all potential critical risks in a study and impractical to apply systematically across a portfolio of studies. Commercial options typically offer a software-as-a-service approach [14] with more thorough and customizable analytics, but are substantially more expensive. Thus, {gsm} helps to fill an existing gap in risk detection tools and, we hope, will support increased adoption of RBM.
Improvements planned for future releases, in order of priority, include expanding the number of KRIs that can be analyzed, supporting qualified QTL analyses, conducting unsupervised statistical monitoring, and incorporating more options for statistical testing. Current KRIs focus on critical areas related to study population, safety, deviations, and data quality, but do not yet cover other important areas such as primary and secondary endpoints, as these may require more complex study-specific derivations and analyses. Although users can choose from more than one statistical method, some commonly used models, such as beta-binomial models for binary outcomes [21] and linear mixed-effect models for continuous outcomes [22], have not been implemented. These methods may perform better in different situations; for example, the default method relying on the normal approximation will tend to perform better when there are more sites, while an exact method may perform better when there are only a few sites. Adding unsupervised approaches will further allow users to agnostically survey the entirety of available trial data to find unknown risk signals. The {gsm} workflow can also easily be extended to perform QTL analyses, and experimental QTL functions, which need further refinement and validation, are in development. Another interesting use case to explore is using {gsm} to analyze real-world data to detect potential risks across regions, data sources, or other groupings. Adding these features will take time; fortunately, {gsm} was purposely designed with a modular framework suited to quickly incorporating improvements, and releasing it as an open-source package will allow more R developers to contribute to its development.
Two of the primary drivers for releasing {gsm} as an open-source, publicly available solution were to encourage collaboration with external partners and to benefit from the diverse experience of the broader R community. We believe this will result in much quicker integration of the latest statistical methods, expansion of the library of KRIs and QTLs that can be analyzed, creation of more visualizations, and faster discovery and resolution of bugs and pain points. A new PHUSE project called OpenRBQM was recently announced; it will combine an RBQM Working Group focused on information sharing with an RBQM Development Team that will co-develop new open-source RBQM tools, including {gsm}. We hope continued open collaboration will spur increased knowledge sharing on how best to perform risk-based monitoring for the benefit of patients.
Conclusion
{gsm} is an open-source qualified R package to ingest, analyze, identify, visualize, and report critical study risks with robust support for study-specific customizations. It is free for use under the Apache v2.0 license and has been successfully implemented on multiple clinical trials. We hope {gsm} will encourage more open collaboration to build better RBQM tools and achieve better outcomes for our trials and our patients. Full technical specifications, user guides, package details and more examples are available at https://gilead-biostats.github.io/gsm/.
References
Reith C, Landray M, Devereaux PJ, et al. Randomized clinical trials – removing unnecessary obstacles. N Engl J Med. 2013;369:1061–5.
Duley L, Antman K, Arena J, et al. Specific barriers to the conduct of randomized trials. Clin Trials. 2008;5(1):40–8.
Bakobaki JM, Rauchenberger M, Joffe N, et al. The potential for central monitoring techniques to replace on-site monitoring: findings from an international multi-centre clinical trial. Clin Trials. 2012;9(2):257–64.
MRC/DH/MHRA Joint Project. Risk-adapted approaches to the management of clinical trials of investigational medicinal products. 2011.
Food and Drug Administration. Guidance for industry: oversight of clinical investigations – a risk-based approach to monitoring. 2013.
European Medicines Agency. Reflection paper on risk based quality management in clinical trials. 2013.
Barnes B, Stansbury N, Brown D, et al. Risk-based monitoring in clinical trials: past, present, and future. Ther Innov Regul Sci. 2021;55(4):899–906.
Macefield RC, Beswick AD, Blazeby JM, et al. A systematic review of on-site monitoring methods for healthcare randomised controlled trials. Clin Trials. 2013;10(1):104–24.
Brosteanu O, Houben P, Ihrig K, et al. Risk analysis and risk adapted on-site monitoring in noncommercial clinical trials. Clin Trials. 2009;6(6):585–96.
Adams A, Adelfio A, Barnes B, et al. Risk-based monitoring in clinical trials: 2021 update. Ther Innov Regul Sci. 2023;57:529–37.
Hurley C, Shiely F, Power J, et al. Risk based monitoring (RBM) tools for clinical trials: a systematic review. Contemp Clin Trials. 2016;51:15–27.
Hurley C, Sinnott C, Clarke M, et al. Perceived barriers and facilitators to risk based monitoring in academic-led clinical trials: a mixed methods study. Trials. 2017;18:423.
Cragg WJ, Hurley C, Yorke-Edwards V, et al. Dynamic methods for ongoing assessment of site-level risk in risk-based monitoring of clinical trials: a scoping review. Clin Trials. 2021;18(2):245–59.
Agrafiotis DK, Lobanov VS, Farnum MA, et al. Risk-based monitoring of clinical trials: an integrative approach. Clin Ther. 2018;40(7):1204–12.
Wildfire J, Escalante Chong R. safetyData: clinical trial data. R package version 1.0.0. 2022.
Wildfire J, Childress S, Wang Z, et al. clindata: synthetic clinical data for testing and development. R package version 1.0.2. 2023. https://gilead-biostats.github.io/clindata/.
Spiegelhalter DJ. Funnel plots for comparing institutional performance. Stat Med. 2005;24(8):1185–202.
Kirkwood AA, Cox T, Hackshaw A. Application of methods for central statistical monitoring in clinical trials. Clin Trials. 2013;10(5):783–806.
Koneswarakantha B, Barmaz Y, Ménard T, et al. Follow-up on the use of advanced analytics for clinical quality assurance: bootstrap resampling to enhance detection of adverse event under-reporting. Drug Saf. 2021;44(1):121–3.
Kirkpatrick J. rbqmR: risk-based quality management in R. R package version 0.0.0.9001. 2023.
Desmet L, Venet D, Doffagne E, et al. Use of the beta-binomial model for central statistical monitoring of multicenter clinical trials. Stat Biopharm Res. 2017;9(1):1–11.
Desmet L, Venet D, Doffagne E, et al. Linear mixed-effects models for central statistical monitoring of multicenter clinical trials. Stat Med. 2014;33(30):5265–79.
Acknowledgements
The authors gratefully acknowledge the following individuals for consultation and support: Joanne Benedict, Matt Southwick, Andy Grannell, Li Ge, Maya Gans, Doug Sanders, Ajay Bansal, Shravan Thangellapelly, Vinh Nguyen, Dexter Aguila, Victor Chen, Shaji Parayil, Senthilkumar Krishnan, Juan Aristy, Kipp Spanbauer, Fiamma Giger, Randall Holzberger, Goshia Szczodrak, David Tesarowski, Kimberly Lockwood, Ron Yu, Catherine Jia, Zhishen Ye, Patrick Loerch.
Funding
Funded by Gilead Sciences, Inc.
Author information
Contributions
All authors contributed to the design and development of the work. The first draft of the manuscript was written by GW and SC, ZW, MR, and JW commented on previous versions of the manuscript. All authors read and approved the final manuscript.
Ethics declarations
Conflict of interest
The authors declare no conflicts of interest. All authors are employees and/or stockholders of the companies with which they are affiliated.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
Cite this article
Wu, G., Childress, S., Wang, Z. et al. Good Statistical Monitoring: A Flexible Open-Source Tool to Detect Risks in Clinical Trials. Ther Innov Regul Sci 58, 838–844 (2024). https://doi.org/10.1007/s43441-024-00651-4