Simulating realistic patient profiles from pharmacokinetic models by a machine learning postprocessing correction of residual variability

Christos Kaikousidis; Robert R Bies; Aristides Dokoumetzidis

doi:10.1002/psp4.13182

Simulating realistic patient profiles from pharmacokinetic models by a machine learning postprocessing correction of residual variability

CPT Pharmacometrics Syst Pharmacol. 2024 Sep;13(9):1476-1487. doi: 10.1002/psp4.13182. Epub 2024 Jun 14.

Authors

Christos Kaikousidis¹, Robert R Bies², Aristides Dokoumetzidis¹

Affiliations

¹ Department of Pharmacy, National and Kapodistrian University of Athens, Athens, Greece.
² Department of Pharmaceutical Sciences, State University of New York at Buffalo, Buffalo, New York, USA.

Abstract

We address the problem of model misspecification in population pharmacokinetics (PopPK), by modeling residual unexplained variability (RUV) by machine learning (ML) methods in a postprocessing step after conventional model building. The practical purpose of the method is the generation of realistic virtual patient profiles and the quantification of the extent of model misspecification, by introducing an appropriate metric, to be used as an additional diagnostic of model quality. The proposed methodology consists of the following steps: After developing a PopPK model, the individual residual errors IRES = DV-IPRED, are computed, where DV are the observations and IPRED the individual predictions and are modeled by ML to obtain IRES_ML. Correction of the IPREDs can then be carried out as DV_ML = IPRED + IRES_ML. The methodology was tested in a PK study of ropinirole, for which a PopPK model was developed while a second deliberately misspecified model was also considered. Various supervised ML algorithms were tested, among which Random Forest gave the best results. The ML model was able to correct individual predictions as inspected in diagnostic plots and most importantly it simulated realistic profiles that resembled the real data and canceled out the artifacts introduced by the elevated RUV, even in the case of the heavily misspecified model. Furthermore, a metric to quantify the extent of model misspecification was introduced based on the R² between IRES and IRES_ML, following the rationale that the greater the extent of variability explained by the ML model, the higher the degree of model misspecification present in the original model.

MeSH terms

Algorithms
Computer Simulation*
Humans
Machine Learning*
Models, Biological*
Pharmacokinetics