Predicting future hospital antimicrobial resistance prevalence using machine learning.
Vihta K-D., Pritchard E., Pouwels KB., Hopkins S., Guy RL., Henderson K., Chudasama D., Hope R., Muller-Pebody B., Walker AS., Clifton D., Eyre DW.
BACKGROUND: Predicting antimicrobial resistance (AMR), a top global health threat, nationwide at an aggregate hospital level could help target interventions. Using machine learning, we exploit historical AMR and antimicrobial usage to predict future AMR. METHODS: Antimicrobial use and AMR prevalence in bloodstream infections in hospitals in England were obtained per hospital group (Trust) and financial year (FY, April-March) for 22 pathogen-antibiotic combinations (FY2016-2017 to FY2021-2022). Extreme Gradient Boosting (XGBoost) model predictions were compared to the previous value taken forwards, the difference between the previous two years taken forwards and linear trend forecasting (LTF). XGBoost feature importances were calculated to aid interpretability. RESULTS: Here we show that XGBoost models achieve the best predictive performance. Relatively limited year-to-year variability in AMR prevalence within Trust-pathogen-antibiotic combinations means previous value taken forwards also achieves a low mean absolute error (MAE), similar to or slightly higher than XGBoost. Using the difference between the previous two years taken forward or LTF performs consistently worse. XGBoost considerably outperforms all other methods in Trusts with a larger change in AMR prevalence from FY2020-2021 (last training year) to FY2021-2022 (held-out test set). Feature importance values indicate that besides historical resistance to the same pathogen-antibiotic combination as the outcome, complex relationships between resistance in different pathogens to the same antibiotic/antibiotic class and usage are exploited for predictions. These are generally among the top ten features ranked according to their mean absolute SHAP values. CONCLUSIONS: Year-to-year resistance has generally changed little within Trust-pathogen-antibiotic combinations. In those with larger changes, XGBoost models can improve predictions, enabling informed decisions, efficient resource allocation, and targeted interventions.