TY - JOUR
T1 - Machine learning prediction of methane, ethane, and propane solubility in pure water and electrolyte solutions
T2 - Implications for stray gas migration modeling
AU - Kooti, Ghazal
AU - Taherdangkoo, Reza
AU - Chen, Chaofan
AU - Sergeev, Nikita
AU - Doulati Ardejani, Faramarz
AU - Meng, Tao
AU - Butscher, Christoph
N1 - Publisher Copyright: © The Author(s) 2024.
PY - 2024/3/22
Y1 - 2024/3/22
N2 - Hydraulic fracturing is an effective technology for hydrocarbon extraction from unconventional shale and tight gas reservoirs. A potential risk of hydraulic fracturing is the upward migration of stray gas from the deep subsurface to shallow aquifers. The stray gas can dissolve in groundwater leading to chemical and biological reactions, which could negatively affect groundwater quality and contribute to atmospheric emissions. The knowledge of light hydrocarbon solubility in the aqueous environment is essential for the numerical modelling of flow and transport in the subsurface. Herein, we compiled a database containing 2129 experimental data of methane, ethane, and propane solubility in pure water and various electrolyte solutions over wide ranges of operating temperature and pressure. Two machine learning algorithms, namely regression tree (RT) and boosted regression tree (BRT) tuned with a Bayesian optimization algorithm (BO) were employed to determine the solubility of gases. The predictions were compared with the experimental data as well as four well-established thermodynamic models. Our analysis shows that the BRT-BO is sufficiently accurate, and the predicted values agree well with those obtained from the thermodynamic models. The coefficient of determination (R2) between experimental and predicted values is 0.99 and the mean squared error (MSE) is 9.97 × 10−8. The leverage statistical approach further confirmed the validity of the model developed.
AB - Hydraulic fracturing is an effective technology for hydrocarbon extraction from unconventional shale and tight gas reservoirs. A potential risk of hydraulic fracturing is the upward migration of stray gas from the deep subsurface to shallow aquifers. The stray gas can dissolve in groundwater leading to chemical and biological reactions, which could negatively affect groundwater quality and contribute to atmospheric emissions. The knowledge of light hydrocarbon solubility in the aqueous environment is essential for the numerical modelling of flow and transport in the subsurface. Herein, we compiled a database containing 2129 experimental data of methane, ethane, and propane solubility in pure water and various electrolyte solutions over wide ranges of operating temperature and pressure. Two machine learning algorithms, namely regression tree (RT) and boosted regression tree (BRT) tuned with a Bayesian optimization algorithm (BO) were employed to determine the solubility of gases. The predictions were compared with the experimental data as well as four well-established thermodynamic models. Our analysis shows that the BRT-BO is sufficiently accurate, and the predicted values agree well with those obtained from the thermodynamic models. The coefficient of determination (R2) between experimental and predicted values is 0.99 and the mean squared error (MSE) is 9.97 × 10−8. The leverage statistical approach further confirmed the validity of the model developed.
KW - Boosted regression tree
KW - Gas solubility
KW - Groundwater contamination
KW - Hydraulic fracturing
KW - Regression tree
KW - Thermodynamic models
UR - http://www.scopus.com/inward/record.url?scp=85188311947&partnerID=8YFLogxK
U2 - 10.1007/s11631-024-00680-8
DO - 10.1007/s11631-024-00680-8
M3 - Article
AN - SCOPUS:85188311947
SN - 2096-0956
VL - 43.2024
SP - 971
EP - 984
JO - Acta Geochimica
JF - Acta Geochimica
IS - 5
ER -