Alexander Pate1*, Bowen Jiang1, Yun-Ting Huang12, Sophie Griffiths3, David Stables4, Brian McMillan5 and Matthew Sperrin1
1Division of Informatics, Imaging, and Data Science, Faculty of Biology Medicine and Health, University of Manchester
2Centre for Pharmacoepidemiology and Drug Safety, Division of Pharmacy and Optometry, University of Manchester
3Centre for Health Psychology, Faculty of Biology Medicine and Health, University of Manchester
4Endeavour Health
5Centre for Primary Care and Health Services Research, Faculty of Biology Medicine and Health, University of Manchester
*Corresponding author; email: alexander.pate{at}manchester.ac.uk
medRxiv preprint DOI: https://doi.org/10.1101/2025.09.18.25336064
Posted: September 19, 2025, Version 1
Copyright: This pre-print is available under a Creative Commons License (Attribution 4.0 International), CC BY 4.0, as described at http://creativecommons.org/licenses/by/4.0/
Abstract
Objective Clinical prediction models are used across the world to guide treatment for the primary prevention of cardiovascular disease. Such models are appropriate for estimating an individual’s risk of developing cardiovascular disease; however, they are sometimes used inappropriately to estimate risk under some intervention that results in changes to their risk factors, such as weight loss, or stopping smoking. The objective of this study is to develop a model that correctly predicts 10-year risk of cardiovascular disease under a wide range of interventions.
Design Retrospective cohort study, prediction under intervention using causal inference.
Setting English national primary care linked with secondary care, office for national statistics and index of multiple deprivation.
Participants Adults (aged 18 – 86) free from cardiovascular disease between 2005 – 2020.
Main outcome measure Incidence of cardiovascular disease
Results The resulting model is designed to be used across multiple follow-up visits. We illustrate how a 70 year old woman with a 23.15% 10-year risk of cardiovascular disease could reduce this to 17.71% (statins), 18.82% (antihypertensives leading to 10mmHg reduction in systolic blood pressure), 19.42% (smoking cessation), 21.15% (weight loss of 5kg), 19.27 (weight loss of 10kg), or 14.78% through a combination of therapies. Under internal validation, the model has good calibration in the entire cohort and within subgroups defined by protected characteristics (sex, age and ethnicity) and 10 major English regions. The model had discrimination (c-statistic) of 0.874 (female) and 0.859 (male). Within subgroups defined by ethnic group, discrimination did not drop below 0.858 (Black Caribbean, Female) or 0.850 (White, male). Within subgroups defined by region, discrimination did not drop below 0.859 (West Midlands, Female) or 0.842 (North East, Male). There was a large drop in discrimination within subgroups defined by age given its strong predictive properties.
Conclusions Alongside the novel prediction under intervention aspects, the model preserves good predictive performance with respect to traditional metrics – it is well calibrated and has discrimination meeting or exceeding the discrimination of models currently used to guide clinical care around cardiovascular disease. Further evaluation and piloting is required to support its clinical use.
1. Introduction
Cardiovascular diseases (CVD) is the leading cause of death worldwide, and causes a quarter (>170,000) of all deaths in the UK every year, and costs the NHS and the UK economy an estimated £12 billion and £28 billion per year respectively.1 Models have been developed to predict incident CVD in various locations: QRISK2 (UK); ASSIGN3 (Scotland); SCORE2 and SCORE2-OP4 (Europe), Pooled Cohort Equations5,6 (United States); PREDICT-CVD7 (New Zealand), Globorisk8 (worldwide). These models estimate cardiovascular risk so that high risk individuals can be identified for intervention, often statins, and several are recommended in guidelines for primary prevention of CVD.9–11. In England, CVD risk is evaluated as part of the NHS Health Check, which is offered to individuals between ages 40 and 74 years that meet certain criteria.12 CVD risk is calculated using a prognostic model (QRISK), and those with a 10-year CVD risk of 10% or more are recommended to be offered a cholesterol lowering medication (statin) and given lifestyle advice on reducing their risk.11,13 The goal of this activity is prevention: to reduce the number of individuals developing CVD (and other chronic conditions), and to spur people on to make healthy lifestyle changes.14
To aid discussions around CVD risk reduction, some clinical systems provide visual representations of the potential impact that taking a statin may have on a patient’s 10-year risk score.15 Whilst there are anecdotal reports of clinicians also using these models to demonstrate the potential impact of other lifestyle changes, they are not designed to calculate how much (e.g.,) losing weight, reducing blood pressure, or stopping smoking, would reduce someone’s risk of developing CVD.16 Nevertheless, such demonstrations are intuitively and theoretically useful. Research suggests that people are more likely to change their behaviour when they perceive a personal risk and believe that specific actions will be effective in reducing that risk, a belief known as response efficacy.17 Tools that allow users to manipulate risk factors and observe the potential reduction in disease risk, such as YourDiseaseRisk for cancer, visually demonstrate the risk-reduction benefits of behaviour change.18 This interactive functionality has been shown to improve response efficacy.19
Our aim was therefore to develop a clinical prediction model that allows prediction of CVD risk under a range of interventions such as initiating statin or antihypertensive therapy, smoking cessation, changes in lifestyle such as diet or exercise, or ‘doing nothing’.
The intended target population is UK adults free from CVD. The first intended setting for model use is when a patient meets with a general practitioner in primary care, for example during an NHS health check (although not restricted to this setting). The second intended setting for the model is through patient-facing online health records platforms.20 The goal of the model is to provide better information to support individuals and clinicians to make informed choices. If successful, this could help motivate behaviour change in order to prevent incident CVD events. The objectives of this study are to show that we can develop a model which performs well in the classical clinical prediction sense (i.e. is well calibrated and discriminates well), whilst also having the functionality to ‘predict under intervention’. A comparison with existing models developed for this purpose is provided in the discussion.
2. Methods
2.1. Study design
We conduct a retrospective cohort study using primary care data from Clinical Practice Research Datalink (CPRD) Aurum June 2021 extract. CPRD is a large resource of electronic health records (EHRs) from the UK containing information on demography, medical history, test results and drug use of individuals registered with a general practice. CPRD Aurum contains data from general practices using the EMIS Web computer system. As of June 2024, CPRD Aurum contained data on 47 million (16 million currently registered) individuals.21 Linkage was provided to secondary care data (Hospital Episode Statistics), death data (Office for National Statistics) and index of multiple deprivation. Individuals were only included in the cohort if they were eligible for linkage, which is only available for individuals registered in England.
Individuals enter the cohort at the latest of: the start of the study period (1/1/2005), attaining at least one year of registration with a contributing practice to CPRD Aurum, attaining age 18 (referred to as ‘baseline index date’ or ‘visit 0’). Individuals exit the cohort at the earliest of: the date of diagnosis of first CVD event, death, deregistration with practice, last data upload from practice, or end of follow-up in HES (01/03/2020). Individuals were excluded if their cohort exit date was smaller than or equal to their cohort entry date, or they had a history of intracerebral haemorrhage prior to their cohort entry date. After applying all exclusion criteria, this resulted in individuals from 1478 general practices. The cohort comprises all individuals aged 18 or over registered during the study period, who have not yet been diagnosed with CVD. This study design is chosen to match the proposed target population for primary prevention of CVD – of all CVD free adults in the UK. Given the lack of linked data in Wales, Scotland and Northern Ireland, recalibration and validation of the model in these Nations will be required. Access to all data was approved via CPRD’s Research Data Governance process (protocol ID 22_002333).
2.2. Outcome
The outcome is incident CVD, where CVD is defined to be a composite outcome of coronary heart disease, ischaemic stroke or transient ischaemic attack. CVD was identified through the primary care, secondary care, and office for national statistics death data. CVD events as a consequence of developing a different condition in hospital were not considered. Code lists and motivation for the definition of the outcome and the exclusion criteria (which includes intracerebral haemorrhage) are given in the supplementary material. The model will allow prediction over any time period up to 10-years, and will be validated at 5 and 10-years.
2.3. Exposures
Exposures are split into two groups, modifiable risk factors and interventions. Modifiable risk factors are systolic blood pressure, body mass index, non-high-density-lipoproteins (non-HDL) cholesterol and smoking status. We consider any interventions which we expect to act through modification of these risk factors, e.g. smoking cessation, statin use, antihypertensive use and changes to diet and exercise. For statins and antihypertensives, an individual was considered to be “off treatment” 180 days after their last prescription, if they did not receive another prescription in that 180-day time period.
2.4. Baseline predictors
This is the set of variables which are used for estimation of the risk prior to intervention. The same set of predictor variables (with some minor differences) that were used in QR42 were selected: age, sex, ethnicity, index of multiple deprivation (vigintile)), systolic blood pressure, body mass index, smoking status, history of hypertension, atrial fibrillation, chronic kidney disease stage 3, 4 or 5, diabetes, serious mental illness, family history of CVD, migraine, systemic lupus erythematosus, corticosteroid use, atypical antipsychotic use, chronic obstructive pulmonary disease, intellectual disability, downs syndrome, oral cancer, brain cancer, lung cancer, blood cancer, pre-eclampsia (females only), postnatal depression (females only) and impotence (males only). These variables have been shown to result in a strong prediction model for CVD in a similar population and therefore no variables selection was applied. Systolic blood pressure variability was not considered given the role of systolic blood pressure as a modifiable risk factor which will be treated causally (see section 2.3). Non-HDL cholesterol was included rather than cholesterol/HDL ratio due to the availability of causal estimates for changes in this modifiable risk factor (see supplementary data file 1).
We also included a calendar time variable given the secular trend in CVD that was observed (see supplementary data file 2). Code lists and operational definitions for each variable are provided in supplementary data file 1.
2.5. Model development
Models were developed separately for male and female cohorts, and therefore all the steps in sections 2.5 and 2.6 were done separately for both cohorts. We used a 70:30 split sample approach to model development and validation. Cross validation, or optimism adjusted bootstrap approaches, were not deemed necessary given the sample size. The model development process has two layers, a risk-estimation layer (see section 2.5.2) and an intervention layer (2.5.3).
2.5.1. Missing data
Missing data on systolic blood pressure, body mass index, non-HDL cholesterol, smoking status, ethnicity and index of multiple deprivation were imputed using multiple imputation by chained equations.22,23 All the other baseline predictors, exposure status at cohort entry date, an event indicator, and the Nelson-Aalen estimate of survival at the time of event/censoring, were used as predictors in the imputation model for each variable. Predictive mean matching was used as the imputation model for all missing variables. There were 10 imputation chains each of length 20. Performance of the imputation algorithm is evaluated and presented in supplementary material data file 2. After the imputation procedure, data was split into development (70%) and validation (30%) datasets. Imputing the entire dataset before splitting into development and validation dataset will result in an of estimate of ideal model performance.24
2.5.2. Initial risk-estimation layer
The initial risk-estimation layer fits a model which estimates the risk of CVD under the strategy of ‘continue on current intervention strategy’ or ‘do nothing’. The model itself is a Cox proportional hazard model. Age, calendar time, systolic blood pressure, body mass index and non-HDL cholesterol were fitted as restricted cubic splines25 with 4 knots (age and calendar time) or 3 knots (all others) respectively. Knot locations were chosen manually for age at {25, 40, 57.5, 75}. Knot locations for the other predictors were data driven, by combining all imputed datasets, and calculating the 10th, 50th and 90th percentiles. All predictors except calendar time were interacted with the spline of age.
To estimate risk of ‘continue on current intervention strategy’, changes in treatments during follow-up need to be addressed (treatment drop-in). We used an approach inspired by Xu et al.26 We create interval censored data at all time points where an individual starts or stops taking statins or antihypertensives, or changes smoking status. Time varying versions of these variables are then created at each of these intervals, defined by their value relative to their value at baseline. The coefficients of these time varying variables are fixed to their causal average total effect, becoming offset terms. These effects were taken from literature27–29, converted to hazard ratios for model fitting,30 and are presented in Table 1. The effects of statins and antihypertensives are taken from an overview of systematic reviews of randomised controlled trials closely aligned with those used in the Millions Hearts Risk Assessment Tool.16 The effects of these interventions were assumed to be independent because no evidence was found to support any interactions. More details on the methodology for deriving the time-varying offset terms and estimating the values in Table 1 are provided in supplementary material data file 1.

Table 1:
Treatment effects used in the baseline and intervention layers
2.5.3. Intervention layer using memory
The intervention layer does not involve fitting any more models. We assume that the effect of an intervention acts through changes in the modifiable risk factors: systolic blood pressure, body mass index, non-HDL cholesterol and smoking status. This intervention can be any intervention (i.e. not just those mentioned above) or combination of interventions, provided we can estimate the overall effect on each of the modifiable risk factors. We create a tool which allows the modifiable risk factors to be changed from an individual’s current value of those variables. Based on these changes, we apply odds ratios to the risk (odds) estimated from the risk-estimation layer. The (causal) odds ratios are also contained in Table 1. Note that odds ratios are used here, compared to hazard ratios which were required for the model fitting process in the initial risk estimation layer. These were estimated through the use of a Causal Directed Acyclic Graph (DAG31; Figure 1) and effect estimates from a variety of sources in the literature.28,29,32–39 This process is detailed in supplementary material data file 1. Through this process we also estimated the effect that starting statins or antihypertensives, or a change in one of the modifiable risk factors, would have on the other modifiable risk factors (e.g. stopping smoking will on average result in an increase in body mass index and reduction in systolic blood pressure). This allows flexibility to estimate risk under a specific intervention, or under specified changes in the modifiable risk factors. The process is detailed in supplementary data file 3.

FIGURE 1:
Directed Acyclic Graph of the causal structure of the modifiable risk factors
The model is designed to be used over multiple visits following a similar process to that used in the Million Hearts Tool.16 The first time an individual interacts with the model (we refer to this as visit 0), the individuals risk is estimated using the initial risk estimation layer, but the values of the modifiable risk factors at visit 0 are remembered. Hypothetical risk scores under different interventions can then be estimated, i.e., what is my risk is I stop smoking? Initiate statins? Or get my weight and blood pressure down to healthy levels? This can either be done using the total effect of an intervention, or based on the effect estimates from the DAG, and the expected or targeted change in each the modifiable risk factors. At each of the follow-up visits, the following process is applied: A) Remember the values of the modifiable risk factors from visit 0, B) Update the other predictors (e.g. age, any other changes to medical history), C) Estimate risk if we had not intervened and modifiable risk factors remained at visit 0 levels, D) Estimate current risk based on achieved levels of the modifiable risk factors, E) Estimate hypothetical future risks based on further interventions and modification of the risk factors. An exemplar of how to use to model is given in section 3.1.
2.5.4. Sample size
A minimum required sample size was estimated using the criteria of Riley et al.,40 based on the mean follow up time and outcome incidence in the cohort, a conservative estimate of, 162 and 158 predictor parameters in the female and male models respectively, and a desired shrinkage of 0.99. The minimum required sample size was 437,350 in the female cohort, and 347,541 in the male cohort.
2.6. Model evaluation
2.6.1. Evaluation of the risk-estimation layer
The chosen outputs for model evaluation were 10-year probabilities of CVD. We first evaluated the risk-estimation layer. The 10-year risk of CVD for each individual in the validation cohort was estimated under intervention strategy of “continue on current interventions”. These are counterfactual risk predictions as many individuals do not follow this intervention strategy, i.e. they start or stop taking statins or antihypertensives or change smoking status during follow-up. There are a number of approaches to validating a prediction model which makes counterfactual predictions,41–45 however we again follow the work of Xu et al.26 This involves estimating counterfactual event times that would have been observed under the desired intervention strategy. The process for doing so is detailed in supplementary data file 1. Once the counterfactual survival times are estimated, model evaluation can proceed as normal.
Moderate calibration was assessed using graphical calibration curves, the integrated calibration index, E50 and E90,46 and also the Kaplan-Meier observed risk vs mean predicted risk within 100 subgroups ranked by predicted risk.47 Discrimination was assessed using Harrell’s C-index.25 Decision curve analysis was not considered as further work is required to understand how to assess net benefit in the prediction under intervention context. Fairness was evaluated by estimating calibration and discrimination in subgroups defined by protected characteristics age, sex and ethnicity. We also evaluate performance by UK region. Instability plots for individual risks were estimated.48
2.6.2. Evaluation of the intervention layer (temporal validation)
We evaluate the interventional part of the model by emulating the way in which the model is designed to be used in practice (section 2.5.3). We designate the index date defined in section 2.1 to be visit 0. We then extract cohorts at 1, 2, 3, 4 and 5 years after visit 0, referring to these as 1st, 2nd, 3rd, 4th and 5th follow-up visits, or ‘follow-up index dates’. All variables (predictors and exposures) are extracted in the same way as described in sections 2.3 and 2.4, but relative to the follow-up date, as opposed to the baseline index date. Individuals who have had a CVD event, or have been censored, prior to their follow-up index dates are excluded from the respective cohorts. When evaluating the interventional layer, we choose both 5-year and 10-year risk because few individuals have sufficient follow-up at their fifth follow-up visit to allow us to evaluate 10-year risk predictions.
Next, for each modifiable risk factor we applied the following process, using systolic blood pressure as an example. Select patients who had a non-missing value for systolic blood pressure at visit 0, and calculate the change in that modifiable risk factor between the first visit and follow-up visits. If there is no systolic blood pressure value recorded between visit 0 and the follow-up visit, we assume it is unchanged. We then estimate the risk of CVD using the risk-estimation layer, where all variables (both exposures and predictors) except systolic blood pressure take the value extracted at the follow-up visit. For systolic blood pressure, we use the value from visit 0, and apply a relative risk based on its change between visit 0 and the follow-up visit. Calibration and discrimination were then estimated as described in section 2.6.1, including the estimation of counterfactual survival times. This process was also applied to all four modifiable risk factors simultaneously.
We also evaluated 5-year risk predictions estimated at these follow-up visits using only the risk-estimation layer. This will help us understand whether any drop in performance discovered in the above analysis is driven by the interventional part of the model, or the fact there are temporal trends which we are not picking up. In these analyses, missing data in the follow-up visit cohorts was imputed using the value from the visit 0 cohort.
2.7. Patient and public involvement (PPI)
This study build upon earlier PPI work, including a workshop with 19 members of the public, facilitated by 4 members of the Primary Care Research in Manchester Engagement Resource (PRIMER) and led by one of the current study’s co-authors (BM).20 Workshop members felt that a platform which encouraged a greater degree of interactivity with their NHS Health Check results using an online records access platform, could provide benefits to the NHS (by encouraging a healthy lifestyle, reducing NHS costs, saving GP time), and benefits to patients (e.g. through improved communication, by serving an educational role, providing additional motivation, and being less confrontational).20
2.8. Open Research
All analysis were conducted using R version 4.4.2. All code is available on the Manchester Predictive Healthcare Group GitHub page.49
3. Results
Table 2 contains the medical history and demographic information of our cohort at their baseline index dates. There was 301,383 (female) and 395060 (male) outcome events respectively, and 210543 (female) and 276,806 (male) events in the development cohorts. More information on total follow-up and event rates is available in supplementary data file 2.
Table 2:
Baseline information
Given the complexity of the fitted model, the coefficients cannot be presented in Tables. The model is provided as a freely available Rshiny application.50 The fitted model and cumulative baseline hazard are provided as R workspace objects (.rds) to support this Rshiny application, and can be downloaded from the CHARIOT GitHub repository.49
3.1. Evaluation of the risk estimation layer
Figure 2 contains calibration plots of the models in the female and male cohorts. Both models are well calibrated. Calibration plots of the model within subgroups defined by geographical region, and protected characteristics ethnicity and age, as well as plots for the male cohort, and calibration metrics ICI, E50 and E90 are all provided in the supplementary material. The model is predominately well calibrated within subgroups defined by ethnicity, age and region (see data supplementary data file 2). The model has some calibration issues in the 17 – 30 age group. This is because a large number of individuals are under the age of 25. This is the position of our lowest knot, and below this point the effect of age is assumed to be linear (by the definition of restricted cubic splines25). However, given that individuals in this group are very low risk, this miscalibration is very small on the absolute scale. This is evident from the calibration metrics, where the E90 for this age group is smaller than many of the other age groups. There is also some under-prediction of risk among higher risk individuals in the North West. Results for the male cohort are broadly similar.

Figure 2:
Calibration plots. Smoothed calibration curve, and binned calibration plot with centiles or predicted risk.
The discrimination in the entire validation cohort (Harrel’s C-index), and subgroups defined by protected characteristics are presented in Figure 3. Confidence intervals have been estimated but are small enough that they are indistinguishable in the plots. The C-index of the model in the entire female and male cohorts is 0.87 and 0.85 respectively. When assessing discrimination within subgroups defined by ethnicity or region performance remains similar. There is a considerable drop in performance when assessing discrimination within age groups, as expected because age is a strong prognostic factor interacted with all other predictors. Instability plots of the model,48 plotted for 3,000 randomly selected individuals, are also presented in supplementary data file 2.

Figure 3:
Discrimination of the model in the entire cohort and within subgroups defined by protected characteristics.
3.2. Evaluation of the intervention layer (temporal validation)
Figure 4 contains the calibration plots of the initial risk estimation layer predicting 5-year risk at 1, 3 and 5 year follow-up visits among females. It is evident that the further away from visit 0 the index date is defined, the model begins to over-predict in the high-risk individuals. This could be due to selection bias, as unhealthy individuals will leave the cohort (either through having CVD events or dying), and no new individuals are added to the cohort. Plots for the male cohort and evaluating 10-year predictions, and discrimination, are provided in supplementary data file 2. Conclusions are broadly similar.

Figure 4:
Calibration plots of the initial risk estimation layer predicting 5-year risk at 1, 3 and 5-year follow-up visits, female cohort
Figure 5 contains the calibration plots of the intervention layer predicting 5-year risk at the 1-year follow-up visit, for modification of each of the modifiable risk factors. The calibration of the intervention layer when modifying non-HDL cholesterol, body mass index and smoking status remains quite strong. There is overprediction of higher risk individuals for systolic blood pressure which could be driven by measurement error in systolic blood pressure. Plots for the male cohort, evaluating 10-year predictions, and making predictions at the 2, 3, 4 and 5-year follow-up visits, and discrimination, are provided in supplementary data file 2. Conclusions are broadly similar.

Figure 5:
Calibration plots of the intervention layer for each of the modifiable risk factors at the 1-year follow-up visit, female cohort
3.3. An exemplar use-case
Setting
An as exemplar we replicate a similar patient scenario and process used to showcase the Million Hearts Tool.16 We consider a 70-year old female of black Caribbean ethnicity, a current smoker who has elevated modifiable risk factors, systolic blood pressure = 160mmHg, body mass index = 30 (165cm and 81.68kg) and non-HDL cholesterol = 10 mmol/L, but no other comorbidities or risk factors. The individual is not on statins or antihypertensive treatment, but has a relatively poor diet and does not exercise regularly.
Visit 0
The individual attends an NHS health check (which we refer to as visit 0) and the clinician engages in a discussion regarding her risks of CVD.
Using the CHARIOT model, we estimate the individual’s 10-year risk of CVD if they continue as normal as 23.15%. The discussion then moves onto possible ways to reduce this risk. The clinician suggests a number of different routes: initiation of statins, initiation of antihypertensives, attempting smoking cessation, or making changes to the individuals diet and increasing physical activity. These are not mutually exclusive and may be attempted in combination. To help inform this discussion, we estimate the 10-year cardiovascular risk under these different interventions, and under different hypothetical reduction in the modifiable risk factors as a result of these interventions. We consider the expected 10-year risk under statin therapy, antihypertensive therapy achieving a reduction in SBP of 5, 10 or 15 mmHg, smoking cessation, weight loss of 5kg or 10kg, or a ‘best case scenario’ that through a combination of changes to diet and exercise and stopping smoking, the individual may reduce their weight by 10kg, non-HDL cholesterol by 3mmol/L and systolic blood pressure by 20mmHg. The predicted risks are contained in Table 3.

Table 3:
Predicted 10-year CVD risk under a range of interventions, and hypothetical reductions in modifiable risk factors.
The individual expresses that they do not want to start medication unless it is absolutely necessary, and that they are interested in enrolling on a smoking cessation programme given it could have a broad range of positive effects beyond CVD. They also state they are willing to try and make changes to their diet and exercise but are unsure if this is realistic. The predicted risks highlight that stopping smoking is broadly in line with the effect of initiating statins or antihypertensives. Given the individual has already expressed an interest in a smoking cessation programme, a decision is made to enrol on one. A plan is also made to make some improvements to diet and an increase physical activity (directing the individuals towards the NHS weight loss plan51) hoping to push the risk lower than 19.42%, and reassess whether medication will be necessary in a year’s time.
Follow-up visit 1
The patient returns at their first follow-up visit 1 year later. The individual has successfully stopped smoking and made some changes to their diet, resulting in a reduction their non-HDL cholesterol by 2 mmol/L and a 5kg weight loss. The individual has however struggled to increase their physical activity levels, and their blood pressure remains the same. We follow the process given in section 2.5.3.
First, we re-estimate the baseline risk one year on, updating the non-modifiable risk factors to their most recent values. For example, age has increased from 70 to 71, but the individual could also have developed new comorbidities such chronic obstructive pulmonary disease, or started on medication such as oral corticosteroids, which are both predictors in the model. In contrast, we remember the values of their risk factors we have been aiming to modify from visit 0. This gives us an estimate of the 10-year CVD risk if we had not intervened (24.38%).
Next, we estimate a 10-year risk based on the levels of the modifiable risk factors that have been achieved. Given the reduction in body mass index and systolic blood pressure, and successfully stopping smoking, the individuals current 10-year risk is now down to 14.78%. A new discussion can now begin around further treatment options. Given the successful reduction in non-HDL cholesterol and the individual’s willingness to continue on their current dietary regime, statins are not deemed necessary. Given systolic blood pressure remains high, we can hypothesise new risks if the individual was to initiate antihypertensives and achieve a reduction of 5mmHg (13.21%), 10mmHg (11.77%), 15mmHg (10.48%) or 20mmHg (9.31%). The individual agrees to being prescribed antihypertensives. At follow-up visit 2, the same process from section 2.5.3 will be applied.
4. Discussion
4.1. Interpretation
We have developed a clinical prediction model for CVD that can be used to predict future risk under different interventions. The model provides evidence that directly informs decisions concerning preventative intervention – answering questions such as ‘what would my risk be if I started a statin’ compared with ‘what would my risk be if I continued with my current treatment’ compared with ‘what would my risk be if I lost weight’. As such, it is a step forward compared with existing models, that mostly focus on observed risk agnostic to intervention.
Alongside the novel prediction under intervention aspects, the model preserves good predictive performance with respect to traditional metrics – it is well calibrated and has discrimination meeting or exceeding the discrimination of models currently used to guide clinical care around CVD.
With regards to fairness, both the female and male models are well calibrated and discriminate well in subgroups defined by ethnicity and region in England. There is a considerable drop in discrimination within all subgroups defined by age. We would deem it unfair if the model discriminated well in some subgroups but not others. However, in this case, all subgroups (defined by age) experience a similar drop in performance.
4.2. Comparison with existing literature
We are aware of three other models developed for prediction of CVD under intervention. The Million Hearts Risk Assessment Tool16 considers statin therapy, blood pressure lowering medication, aspirin, smoking cessation and combinations of these interventions. The PEER Simplified Cardiovascular Decision Aid Tool52 considers statins, blood pressure lowering, Ezetimibe, proprotein convertase subtilisin/kexin type inhibitors and Fibrates medications, mediterranean diet and physical activity. Qintervention considers statins, reducing body mass index to 25, reducing systolic blood pressure to 140mmHg, and stopping smoking.
Our approach is most similar to the Millions Hearts Risk Assessment Tool, which conducted a systematic review of the literature to obtain effect estimates of risk reducing therapies and applied these to baseline risks derived from the 2013 Pooled Cohort Equations.5,6 The Million Hearts Tool has the same memory property and is implemented in the same way at follow-up visits. CHARIOT has developed the a new model for the initial risk estimation layer, as opposed to using an existing model. This has enabled the model to be developed in a large cohort representative of the target population (with noted limitations for implementation in Wales, Scotland and Northern Ireland due to available linked data), with strong discriminative performance. CHARIOT has also used causal inference techniques to deal with ‘treatment drop-in’26,53–55) during the development of the initial risk estimation layer, although the impact of this in a situation where individuals are both on and off treatment at baseline needs to be explored in more detail. The effect estimates for the interventions and changes in modifiable risk factors are similar, with one notable exception. CHARIOT has assumed a relative risk reduction of 0.8 per 10mmHg reduction in systolic blood pressure (taken from a 2016 meta-analysis56), compared to the 0.65 (taken from a 2009 systematic review57) used in the Million Hearts Tool.16 This is by no means an exact science, and inconsistencies were found when trying to validate this choice (see supplementary data file 3). Another important difference is that CHARIOT decomposed the effect of interventions into direct and indirect effects, allowing the prediction of risk under arbitrary interventions, with hypothesised effects on each of the modifiable risk factors (supplementary data file 3). While this flexibility is an advantage, this process required utilising a range of data sources including causal mediation and mendelian randomisation studies, which should be considered weaker evidence. Finally, to avoid over-estimating treatment effects, the Million Hearts Tool implemented floor values, below which the predictions under intervention couldn’t go, which were based on the predicted risk of untreated optimal levels of all risk factors at the same age, sex and race. This approach will be considered in future iterations of CHARIOT.
The PEER Simplified Cardiovascular Decision Aid52,58 estimates baseline risk from a range of different models depending on your location, and Qintervention from QRISK2.59 Both combine these with effect estimates of risk reducing therapies, however do not consider combinations of interventions, are restricted to a pre-defined list of interventions, do not account for changes in treatment or treatment drop-in26,53–55 during model development, or have a well-defined way to be used at follow-up visits.
4.3. Strengths and Limitations
The CHARIOT model has been developed in a large cohort representative of the target population and has strong performance. It has been designed with a specific clinical interaction in mind,60 and has a memory property to allow it to be used at follow-up visits for ongoing treatment. The approach combines the strengths of different data sources, routinely collected electronic health record data for estimating the baseline risk, and effect estimates from a mix of trials and mendelian randomisation studies for intervention layer.
On the contrary, the combination of different data sources could be considered weakness as the effect estimates come from studies with different cohort definitions, outcome definitions and modelling approaches. While it would be preferable to estimate these in a consistent manner, this is not be possible given the desire to use the strength of trials carried out on statin and antihypertensive treatment, and the ethical implications of carrying out trials on smoking, diet or exercise. Instead, we must aim to combine this information in the best way possible.61
A second limitation is that lifestyle interventions may reduce CVD beyond reductions in the modifiable risk factors we have considered, for example through heart muscle, mental state or hormones. One study showed that exercise could operate through improvements to insulin sensitivity, improved lining of blood vessels, reduced heart rate and increased antioxidants, none of which are routinely collected or recorded in the primary care electronic health record.62 We may therefore be underestimating the benefit for some interventions. A third limitation is that we assume that reducing one of the modifiable risk factors (i.e. systolic blood pressure from 140 to 120) has the same effect on CVD, irrespective of how that reduction was achieved. The impact of this assumption or whether it holds is unclear. A fourth limitation is that while the target population is the entire UK, due to data restrictions, the model could only be developed in English individuals. Validation and possibly model updating may therefore be necessary in the other UK nations. Also, while using CPRD has many advantages, the nature of electronic health records means there is likely measurement error and the analysis is ultimately dependent on the choice of code lists. Further validation may help assess the impact of this.
4.4. Usability of model in context of current care
Missing data should be collected on all predictors and modifiable risk factors. This is imperative for the modifiable risk factors (systolic blood pressure, body mass index, non-HDL cholesterol and smoking status) if wanting to predict under interventions which we expect to modify these. Otherwise, predictors and modifiable risk factors can be imputed using mean imputation, although performance cannot be guaranteed given the model has not been evaluated in this manner.63 The model has a ‘memory property’, requiring us to remember values from the individuals visit 0 when we first started to discuss intervening. The user will be required to understand how to operate the model with respect to this memory component and update the inputs at each visit. The back-end of this algorithm is available as an Rshiny application.50 A front-end application is also in development.64
4.5. Future work
A key area for future research will be to externally validate the model separately in Welsh, Scottish, Northern Irish and English cohorts, recalibrating the model if required. Another key area for future work is to extend the methodology to the competing risk setting. This would allow us to predict lifetime risk, and the risk of multiple outcomes. Being able to estimate the risk reduction of a lifestyle change on a number of potential outcomes (e.g. CVD, type 2 diabetes and chronic kidney disease) may be more powerful in motivating behaviour change. It also allows the end user to make decisions based on a more holistic view of their health. Similarly, being able to estimate the reduction in lifetime risk under an intervention may be able to help motivate behaviour change in younger individuals, whose 10-year risk will be very small. Understanding how to predict under intervention in a competing risk setting is essential to achieve both estimation of lifetime risk and multiple outcomes. We plan to explore the feasibility of incorporating CHARIOT into clinical EHR systems and patient-facing records access platforms to facilitate discussions around CVD risk reduction, and help motivate health behaviour change.
4.6. Conclusion
CHARIOT is clinical prediction model for future CVD with strong performance compared to existing models, and also allows for prediction under interventions which modify systolic blood pressure, body mass index, non-HDL cholesterol and smoking status.
Supporting information
supplementary material[supplements/336064_file02.zip]
Data Availability
Operational definitions for all variables and the process for extraction of the CPRD data are provided in supplementary data file 1. All code and codelists are available on the CHARIOT GitHub repository. The code supporting the extraction of the CPRD data has been written into a freely available R package, rcprd.66 CPRD data are not publicly available. Details of the application process and conditions of access are available at: https://www.cprd.com/data-access. The model developed in this paper has been made publicly available as an Rshiny app: https://alexpate30.shinyapps.io/rshiny_chariot_prototype3/
6. Supporting statements
6.1. Supplementary data files
Supplementary data file 1: methods and technical appendix
Supplementary data file 2: supplementary tables and figures
Supplementary data file 3: Derivation of DAG for the intervention layer
Supplementary data file 4: TRIPOD+AI checklist
All supplementary material files can be found on the CHARIOT GitHub repository.49
6.2. Availability of code and materials
Operational definitions for all variables and the process for extraction of the CPRD data are provided in supplementary data file 1. All code and codelists are available on the CHARIOT GitHub repository.65 The code supporting the extraction of the CPRD data has been written into a freely available R package, rcprd.66 CPRD data are not publicly available. Details of the application process and conditions of access are available at: https://www.cprd.com/data-access. The model developed in this paper has been made publicly available as an Rshiny app: https://alexpate30.shinyapps.io/rshiny_chariot_prototype3/
6.3. Funding
This research was funded by The National Institute for Health Research (NIHR) School for Primary Care Research (SPCR) (reference: NIHR SPCR-2021-2026, grant number 648) and Endeavour Health Charitable Trust. We acknowledge support of the UKRI AI programme, and the Engineering and Physical Sciences Research Council, for CHAI – Causality in Healthcare AI Hub [grant number EP/Y028856/1]. The views expressed are those of the authors and not necessarily those of the NIHR, the Department of Health and Social Care, or Endeavour Health.
6.4. Ethics
CPRD has ethics approval from the Health Research Authority to support research using anonymised patient data. CHARIOTS application (protocol 22_002333) was reviewed via the CPRD Research Data Governance (RDG) Process to ensure that the proposed research is of benefit to patients and public health.
6.5. Competing Interests
No authors had conflicts of interest to declare.
5. References
Pate A, Parisi R, Kontopantelis E, et al. rcprd: An R package to simplify the extraction and processing of Clinical Practice Research Datalink (CPRD) data, and create analysis-ready datasets. PLoS One 2025; 20: 1–25.
British Heart Foundation. UK Factsheet, https://www.bhf.org.uk/-/media/files/for-professionals/research/heart-statistics/bhf-cvd-statistics-uk-factsheet.pdf (2025).
Hippisley-Cox J, Coupland CAC, Bafadhel M, et al. Development and validation of a new algorithm for improved cardiovascular risk prediction. Nat Med 2024; 30: 1440–1447.
Woodward M, Brindle P, Tunsfall-Pedoe H. Adding social deprivation and family history to cardiovascular risk assessment: The ASSIGN score from the Scottish Heart Health Extended Cohort (SHHEC). Heart 2007; 93: 172–176.
SCORE2 working group and ESC Cardiovascular risk collaboration. SCORE2 risk prediction algorithms: new models to estimate 10-year risk of cardiovascular disease in Europe. Eur Heart J 2021; 42: 2439–2454.
Goff DC, Lloyd-jones DM, Bennett G, et al. 2013 ACC/AHA guideline on the assessment of cardiovascular risk: A Report of the American College of Cardiology/American Heart Association Task Force on Practice Guidelines. 2014. Epub ahead of print 2014. DOI: 10.1016/j.jacc.2014.02.606.
Karmali KN, Goff DC, Ning H, et al. A systematic examination of the 2013 ACC/AHA pooled cohort risk assessment tool for atherosclerotic cardiovascular disease. J Am Coll Cardiol 2014; 64: 959–968.
Pylypchuk R, Wells S, Kerr A, et al. Cardiovascular disease risk prediction equations in 400□ 000 primary care patients in New Zealand: a derivation and validation study. Lancet 2018; 391: 1897–1907.
Hajifathalian K, Ueda P, Lu Y, et al. A novel risk score to predict cardiovascular disease risk in national populations (Globorisk): A pooled analysis of prospective cohorts and health examination surveys. Lancet Diabetes Endocrinol 2015; 3: 339–355.
Arnett DK, Blumenthal RS, Albert MA, et al. 2019 ACC/AHA Guideline on the Primary Prevention of Cardiovascular Disease: A Report of the American College of Cardiology/American Heart Association Task Force on Clinical Practice Guidelines. 2019. Epub ahead of print 2019. DOI: 10.1161/CIR.0000000000000678.
Mach F, Baigent C, Catapano AL, et al. 2019 ESC/EAS Guidelines for the management of dyslipidaemias: lipid modification to reduce cardiovascular risk. Eur Heart J 2020; 41: 111–188.
National Institute for Health and Care Excellence. NICE guideline [NG238]: Cardiovascular disease: risk assessment and reduction, including lipid modification, https://www.nice.org.uk/guidance/ng238 (2023, accessed 1 August 2025).
National Health Service. NHS Health Check, https://www.nhs.uk/conditions/nhs-health-check/ (2023, accessed 16 July 2024).
National Institute for Health and Care Excellence. CVD risk assessment and management, https://cks.nice.org.uk/topics/cvd-risk-assessment-management/ (2025, accessed 1 August 2025).
Department for Health and Social Care. NHS Health Checks: applying All Our Health, https://www.gov.uk/government/publications/nhs-health-checks-applying-all-our-health/nhs-health-checks-applying-all-our-health (2022, accessed 30 June 2025).
TPP. systmone, https://tpp-uk.com/products/ (2025, accessed 4 August 2025).
Lloyd-Jones DM, Huffman MD, Karmali KN, et al. Estimating Longitudinal Risks and Benefits From Cardiovascular Preventive Therapies Among Medicare Patients: The Million Hearts Longitudinal ASCVD Risk Assessment Tool: A Special Report From the American Heart Association and American College of Cardiolog. J Am Coll Cardiol 2017; 69: 1617–1636.
Sheeran P, Harris PR, Epton T. Does heightening risk appraisals change people’s intentions and behavior? A meta-analysis of experimental studies. Psychol Bull 2014; 140: 511–543.
Colditz GA, Dart H. Commentary: 20 years online with “Your Disease Risk”. Cancer Causes Control 2021; 32: 5–11.
Fowler SL, Klein WMP, Ball L, et al. Using an Internet-based Breast Cancer Risk Assessment Tool to Improve Social-Cognitive Precursors of Physical Activity. Med Decis Mak 2017; 37: 657–669.
McMillan B, Fox S, Lyons M, et al. Using patient and public involvement to improve the research design and funding application for a project aimed at fostering a more collaborative approach to the nhs health check: The caviar project (better care via improved access to records). Res Involv Engagem 2018; 4: 1–9.
CPRD. CPRD Aurum September 2024 Dataset, https://www.cprd.com/doi/cprd-aurum-march-2024-dataset (2024, accessed 12 June 2024).
van Buuren S, Groothuis-oudshoorn K. mice: Multivariate Imputation by Chained Equations. J Stat Softw; 45.
van Buuren S. Flexible Imputation of Missing Data. 2nd ed. New York: Chapman and Hall/CRC. Epub ahead of print 2018. DOI: 10.1201/9780429492259.
Wood AM, Royston P, White IR. The estimation and use of predictions for the assessment of model performance using large samples with multiply imputed data. Biometrical J 2015; 57: 614–632.
Harrell FE. Regression Modeling Strategies. Springer S. Cham, Switzerland: Springer, 2015.
Xu Z, Arnold M, Stevens D, et al. Prediction of Cardiovascular Disease Risk Accounting for Future Initiation of Statin Treatment. Am J Epidemiol 2021; 190: 2000–2014.
Karmali KN, Lloyd-Jones DM, Berendsen M, et al. Drugs for Primary Prevention of Atherosclerotic Cardiovascular Disease: An Overview of Systematic Reviews. JAMA Cardiol 2016; 1: 341–349.
Levin MG, Klarin D, Assimes TL, et al. Genetics of Smoking and Risk of Atherosclerotic Cardiovascular Diseases: A Mendelian Randomization Study. JAMA Netw Open 2021; 4: E2034461.
Lu Y, Xu Z, Georgakis MK, et al. Smoking and heart failure: a Mendelian randomization and mediation analysis. ESC Hear Fail 2021; 8: 1954–1965.
VanderWeele TJ. Optimal approximate conversions of odds ratios and hazard ratios to risk ratios. Biometrics 2020; 76: 746–752.
Tennant PWG, Murray EJ, Arnold KF, et al. Use of directed acyclic graphs (DAGs) to identify confounders in applied health research: review and recommendations. Int J Epidemiol 2021; 50: 620–632.
Canoy D, Copland E, Nazarzadeh M, et al. Antihypertensive drug effects on long-term blood pressure: an individual-level data meta-analysis of randomised clinical trials. Heart 2022; 108: 1281–1289.
Briasoulis A, Agarwal V, Valachis A, et al. Antihypertensive Effects of Statins: A Meta-Analysis of Prospective Controlled Studies. J Clin Hypertens 2013; 15: 310–320.
Beck J, Hasenböhler F, Werlen L, et al. Blood pressure changes during smoking cessation in a randomized, double-blind, placebo-controlled trial of dulaglutide treatment. Eur J Prev Cardiol 2025; 1–9.
Li S, Schooling CM. Investigating the effects of statins on ischemic heart disease allowing for effects on body mass index: a Mendelian randomization study. Sci Rep 2022; 12: 1–10.
Xu L, Borges MC, Hemani G, et al. The role of glycaemic and lipid risk factors in mediating the effect of BMI on coronary heart disease: a two-step, two-sample Mendelian randomisation study. Diabetologia 2017; 60: 2210–2220.
Piirtola M, Jelenkovic A, Latvala A, et al. Association of current and former smoking with body mass index: A study of smoking discordant twin pairs from 21 twin cohorts. PLoS One 2018; 13: 1–17.
Holmes M V., Lange LA, Palmer T, et al. Causal effects of body mass index on cardiometabolic traits and events: A Mendelian randomization analysis. Am J Hum Genet 2014; 94: 198–208.
Liu W, Yang C, Lei F, et al. Major lipids and lipoprotein levels and risk of blood pressure elevation: a Mendelian Randomisation study. eBioMedicine 2024; 100: 104964.
Riley RD, Snell KIE, Ensor J, et al. Minimum sample size for developing a multivariable prediction model: PART II – binary and time-to-event outcomes. Stat Med 2019; 38: 1276–1296.
Keogh RH, van Geloven N. Prediction under hypothetical interventions: evaluation of performance using longitudinal observational data. arXiv 2023; 1–51.
Boyer CB, Dahabreh IJ, Steingrimsson JA. Assessing model performance for counterfactual predictions.
Hoogland J, Efthimiou O, Nguyen T, et al. Evaluating individualized treatment effect predictions: a new perspective on discrimination and calibration assessment, http://arxiv.org/abs/2209.06101 (2022).
Maas CCHM, Kent DM, Hughes MC, et al. Performance metrics for models designed to predict treatment effect. BMC Med Res Methodol 2023; 23: 1–12.
Klaveren D Van, Steyerberg EW, Serruys PW, et al. The proposed ‘concordance-statistic for benefit’ provided a useful metric when modeling heterogeneous treatment effects. J Clin Epidemiol 2018; February: 59–68.
Austin PC, Harrell FE, van Klaveren D. Graphical calibration curves and the integrated calibration index (ICI) for survival models. Stat Med 2020; 39: 2714–2742.
Royston P, Altman DG. External validation of a Cox prognostic model: principles and methods. BMC Med Res Methodol; 13.
Riley RD, Collins GS. Stability of clinical prediction models developed using statistical or machine learning methods. Biometrical J 2023; 65: 1–22.
Pate A. GitHub repository. Manchester Predictive Healthcare Group. CHI-CHARIOT-project-2-pui-applied, https://github.com/manchester-predictive-healthcare-group/CHI-CHARIOT/tree/main/project-2-pui-applied (2025, accessed 11 July 2025).
CHARIOT. CHARIOT prototype Rshiny, https://alexpate30.shinyapps.io/rshiny_chariot_prototype3/ (2025, accessed 3 July 2025).
NHS. NHS Weight Loss Plan, https://www.nhs.uk/better-health/lose-weight/ (2025, accessed 10 September 2025).
Kolber MR, Klarenbach S, Cauchon M, et al. PEER simplified lipid guideline 2023 update. Can Fam Physician 2023; 69: 675.
Sperrin M, Martin GP, Pate A, et al. Using marginal structural models to adjust for treatment drop-in when developing clinical prediction models. Stat Med 2018; 37: 4142–4154.
Lin L, Sperrin M, Jenkins DA, et al. A scoping review of causal methods enabling predictions under hypothetical interventions. Diagnostic Progn Res; 5. Epub ahead of print 2021. DOI: 10.1186/s41512-021-00092-9.
van Geloven N, Swanson SA, Ramspek CL, et al. Prediction meets causal inference: the role of treatment in clinical prediction models. Eur J Epidemiol 2020; 35: 619–630.
Ettehad D, Emdin CA, Kiran A, et al. Blood pressure lowering for prevention of cardiovascular disease and death: A systematic review and meta-analysis. Lancet 2016; 387: 957–967.
Law MR, Morris JK, Wald NJ. Use of blood pressure lowering drugs in the prevention of cardiovascular disease: Meta-analysis of 147 randomised trials in the context of expectations from prospective epidemiological studies. BMJ 2009; 338: 1245.
PEER Simplified Lipid Guideline Group. PEER Simplified Cardiovascular Decision Aid, https://decisionaid.ca/cvd/ (2023, accessed 4 August 2025).
Hippisley-Cox J, Coupland C, Robson J, et al. Derivation, validation, and evaluation of a new QRISK model to estimate lifetime risk of cardiovascular disease: Cohort study using QResearch database. Bmj 2010; 342: 93.
Markowetz F. All models are wrong and yours are useless: making clinical prediction models impactful for patients. npj Precis Oncol 2024; 8: 6–8.
Colnet B, Mayer I, Chen G, et al. Causal Inference Methods for Combining Randomized Trials and Observational Studies: A Review. Stat Sci 2024; 39: 165–191.
Vaccarino V, Bremner JD. Stress and cardiovascular disease: an update. Nat Rev Cardiol 2024; 21: 603–616.
Tsvetanova A, Sperrin M, Jenkins D, et al. Compatibility in Missing Data Handling Across the Prediction Model Pipeline: A Simulation Study. Stud Health Technol Inform 2024; 310: 1476–1477.
Griffiths S, Bartlett YK, French D, et al. Co-designing cardiovascular disease risk information and support for behaviour change. In: Paper presented at the 20th UK Society for Behavioural Medicine Annual Scientific Meeting. Mercure Bristol Grand Hotel, 2025.
Pate A. GitHub repository. Manchester Predictive Healthcare Group. CHI-CHARIOT-generic-data-extraction., https://github.com/manchester-predictive-healthcare-group/CHI-CHARIOT/tree/main/generic-data-extraction (2025, accessed 10 July 2025).
