Prediction of Geothermal Heat Flow Using XGBoost with the Applications in Coupling Coalbed Methane and Geothermal Resource

  • 0School of Energy Resources, China University of Geosciences (Beijing), 29 Xueyuan Road, Beijing 100083, China.

|

|

Summary

This summary is machine-generated.

An XGB model predicts geothermal heat flow (GHF) using geological data, improving coalbed methane (CBM) exploration. The model accurately maps thermal conditions, revealing higher GHF in coal-rich areas.

Area Of Science

  • Geophysics
  • Geochemistry
  • Machine Learning

Background

  • Reservoir temperature critically impacts coal, coalbed methane (CBM), and geothermal heat flow (GHF) formation and production.
  • GHF is a key indicator of subsurface thermal conditions, vital for energy resource assessment.

Purpose Of The Study

  • To develop and evaluate an extreme gradient boosting (XGB) model for predicting GHF in China.
  • To assess the influence of geological and geophysical features on GHF and its relationship with CBM potential.

Main Methods

  • An XGB model was trained using 12 geological/geophysical features on global and China-specific datasets.
  • Model performance was evaluated using MAE, RMSE, and R²; feature importance was analyzed using SHAP values.
  • Predicted heat flow maps were generated and compared with kriging-interpolated maps for consistency and geological coherence.

Main Results

  • The China-based XGB model showed superior performance in low-heat-flow areas (R² = 0.73) compared to the global model (R² = 0.65).
  • Generated heat flow maps displayed smoother spatial trends and stronger correlations with measured data.
  • Significantly higher GHF was observed in medium- to high-rank coal regions, indicating enhanced CBM potential.

Conclusions

  • The XGB model accurately predicts GHF, providing valuable insights for geothermal resource exploration and CBM development.
  • Crustal structure and volcanic activity are dominant controls on GHF, guiding future exploration efforts.
  • The model's predictive capability is particularly beneficial in data-scarce regions for energy resource assessment.

Related Concept Videos

Regression Analysis 01:11

6.1K

Regression analysis is a statistical tool that describes a mathematical relationship between a dependent variable and one or more independent variables.
In regression analysis, a regression equation is determined based on the line of best fit– a line that best fits the data points plotted in a graph. This line is also called the regression line. The algebraic equation for the regression line is called the regression equation. It is represented as:

In the equation,  is the dependent...

Multiple Regression 01:25

3.2K

Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...

Heating and Cooling Curves 02:44

24.2K

When a substance—isolated from its environment—is subjected to heat changes, corresponding changes in temperature and phase of the substance is observed; this is graphically represented by heating and cooling curves.
For instance, the addition of heat raises the temperature of a solid; the amount of heat absorbed depends on the heat capacity of the solid (q = mcsolidΔT). According to thermochemistry, the relation between the amount of heat absorbed or released by a substance, q, and its...

Energy Line and Hydraulic Gradient Line 01:27

1.4K

Based on Bernoulli's equation, the energy line (EL) and hydraulic grade line (HGL) provide graphical representations of energy distribution in a fluid flow system. For steady, incompressible, inviscid flows, Bernoulli's equation is expressed as:

Here, H is the total head, p/γ is the pressure head, V2/2g is the velocity head, and z is the elevation head.
The energy line (EL) represents the total head available in the flow. It is located above the datum and includes all three energy...

Residual Plots 01:07

5.1K

A residual plot is a statistical representation of data used to analyze correlation and regression results. It helps verify the requirements for drawing specific conclusions about correlation and regression. To obtain the residual plot, first, the residual for each data value is calculated, which is simply the vertical distance between the observed and the predicted value obtained from the regression equation.
When the residual values are plotted against the variable x, it is called a residual...

Levels of Use of a GIS 01:29

109

Geographic Information Systems (GIS) operate across three levels of application, each representing an increasing degree of complexity: data management, analysis, and prediction. These levels reflect the expanding functionality and versatility of GIS technology in handling spatial data for diverse purposes.Data ManagementAt its foundational level, GIS serves as a tool for data management, enabling the input, storage, retrieval, and organization of spatial data. This level is often employed in...