Accurate modeling of crude oil and brine interfacial tension via robust machine learning approaches

Sci Rep. 2024 Nov 20;14(1):28800. doi: 10.1038/s41598-024-80217-4.

Abstract

Interfacial tension (IFT) between water and crude oil is a crucial variable that enhanced oil recovery (EOR) techniques can adjust to increase oil extraction from depleted fields. Most of the developed intelligent models in the literature are based on synthetic oil samples rather than real crude oil samples or brine total salinity rather than salinity of each salt type. Hence, this study applies various machine learning approaches, such as Convolutional Neural Networks (CNN), Adaptive Boosting (AdaBoost), Decision Trees (DT), Random Forest (RF), K-Nearest Neighbors (KNN), Ensemble Learning, Support Vector Machines (SVM), and Multi-Layer Perceptron Artificial Neural Networks (MLP-ANN) to develop advanced models for predicting the IFT between brine and crude oil considering real crude oil samples and taking the account of each salt type prevalent within the brine phase, which represent the realistic circumstances encountered in the oil reservoirs. These predictions are based on factors like the type and concentration of salt, the API of the crude oil, and the properties of the system (pressure and temperature) using previously published experimental data. A sensitivity analysis, incorporating a relevancy factor, is performed to highlight the influence of various input parameters on the IFT. Among these models, the Decision Tree is highlighted for its high accuracy and low training cost compared to ANN-based models, as evidenced by its emerged evaluation metrics (R-squared of 0.9796 and mean square error of 5e-4). It is noted that the AdaBoost model is the least accurate with an R2 of 0.6696. Furthermore, the sensitivity analysis indicates that the molecular weight of the salt has the smallest impact on the IFT, whereas temperature has the most significant effect. The developed smart model may be used to accurately estimate crude oil/brine IFT without needing tedious, time-consuming and expensive experimental workflows.

Keywords: Intelligent modeling; Machine learning; Relevancy factor; Water-crude oil IFT.