Научная статья на тему 'A Supervised Workflow for Predicting Lithofacies in Complex and Heterogeneous Tight Sandstone Reservoirs: A Data-Driven Approach Using Clustering and Classification Models'

A Supervised Workflow for Predicting Lithofacies in Complex and Heterogeneous Tight Sandstone Reservoirs: A Data-Driven Approach Using Clustering and Classification Models Текст научной статьи по специальности «Медицинские технологии»

CC BY
0
0
i Надоели баннеры? Вы всегда можете отключить рекламу.
Ключевые слова
lithofacies prediction / complex sandstone reservoirs / synthetic gamma-ray logging / machine learning / прогнозирование литофаций / сложные песчаные коллекторы / синтетический гаммакаротаж / машинное обучение

Аннотация научной статьи по медицинским технологиям, автор научной работы — Muhammad Ali

This study introduces a novel supervised workflow for predicting lithofacies in complex, heterogeneous tight sandstone reservoirs with intercalated facies. Using a two-information criteria clustering method, six distinct facies are identified, providing an unbiased, data-driven alternative to manual approaches. Among classification models, Gaussian Process Classification (GPC) outperforms others, including Support Vector Machine (SVM) and Artificial Neural Network (ANN), with Random Forest (RF) performing less effectively. GPC accurately predicts lithofacies in testing data and is assessed for similarity accuracy. Predicted lithofacies are integrated into acoustic impedance versus velocity ratio cross plots, resulting in 2D probability density functions. These, combined with depth data, feed a neural network to forecast synthetic gamma-ray log responses. Results show strong agreement between measured and predicted gamma-ray logs (R2 = 0.978) and nearly identical log trends. Additionally, the predicted lithofacies are classified using inverted impedance and velocity ratio volumes, yielding a facies prediction volume that aligns well with well site lithofacies classification, even without core data.

i Надоели баннеры? Вы всегда можете отключить рекламу.
iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.
i Надоели баннеры? Вы всегда можете отключить рекламу.

Контролируемый техпроцесс для прогнозирования литофаций в сложных неоднородных плотных песчаных коллекторах: Основанный на данных подход с использованием моделей кластеризации и классификации

В этом исследовании представлен новый контролируемый техпроцесс (методика) для прогнозирования литофаций в сложных, неоднородных коллекторах из плотного песчаника с промежуточными фациями. Используя метод кластеризации с двумя информационными критериями, идентифицируются шесть различных фаций, что обеспечивает объективную, основанную на данных, альтернативу ручным подходам. Среди моделей классификации именно классификация гауссовских процессов (КГП) превосходит другие, в том числе машину опорных векторов (МОВ) и искусственную нейронную сеть (ИНС), при этом случайный лес (СЛ) работает менее эффективно. КГП точно предсказывает литофации в данных тестирования и оценивается на предмет точности сходства. Прогнозируемые литофации интегрируются в кросс-графики зависимости акустического импеданса от отношения скоростей, в результате чего получаются двумерные функции плотности вероятностей. В сочетании с данными о глубине они поступают в нейронную сеть для прогнозирования результатов синтетического гамма-каротажа. Результаты демонстрируют близкое согласование между измеренными и прогнозируемыми данными гамма-каротажа (R2 = 0,978) и почти идентичные тенденции (формы аномалий) каротажных диаграмм. Кроме того, прогнозируемые литофации классифицируются с использованием инвертированных объемов импеданса и отношения скоростей, что дает прогнозируемый объем фаций, который хорошо согласуется с классификацией литофаций скважины, даже без керновых данных.

Текст научной работы на тему «A Supervised Workflow for Predicting Lithofacies in Complex and Heterogeneous Tight Sandstone Reservoirs: A Data-Driven Approach Using Clustering and Classification Models»

2023

ВЕСТНИК ПЕРМСКОГО УНИВЕРСИТЕТА

Геология

Том 22, № 4

ГЕОФИЗИКА

УДК 550.3

A Supervised Workflow for Predicting Lithofacies in Complex and Heterogeneous Tight Sandstone Reservoirs: A Data-Driven Approach Using Clustering and Classification Models

Muhammad Ali

Institute of Geophysics & Geomatics, China University of Geosciences

Lumo Road No.388, Hongsan district, Wuhan City, Hubei province, P.R.China.

Postal Code: 430074

Email: muhammad_ali@cug.edu.cn

(Статья поступила в редакцию 2 ноября 2023 г.)

This study introduces a novel supervised workflow for predicting lithofacies in complex, heterogeneous tight sandstone reservoirs with intercalated facies. Using a two-information criteria clustering method, six distinct facies are identified, providing an unbiased, data-driven alternative to manual approaches. Among classification models, Gaussian Process Classification (GPC) outperforms others, including Support Vector Machine (SVM) and Artificial Neural Network (ANN), with Random Forest (RF) performing less effectively. GPC accurately predicts lithofacies in testing data and is assessed for similarity accuracy. Predicted lithofacies are integrated into acoustic impedance versus velocity ratio cross plots, resulting in 2D probability density functions. These, combined with depth data, feed a neural network to forecast synthetic gamma-ray log responses. Results show strong agreement between measured and predicted gamma-ray logs (R2 = 0.978) and nearly identical log trends. Additionally, the predicted lithofacies are classified using inverted impedance and velocity ratio volumes, yielding a facies prediction volume that aligns well with well site lithofacies classification, even without core data. Keywords: lithofacies prediction, complex sandstone reservoirs, synthetic gamma-ray logging, machine learning

DOI: 10.17072/psu.geol.22.4.342

Introduction

Machine learning (ML) is a branch of artificial intelligence (AI) that utilizes data analysis techniques such as classification, regression, and clustering to make predictions and identify patterns in large datasets. The ML approach can be classified into two groups: supervised. Supervised ML involves using input parameters and desired outputs to train a model, while ML identifies patterns without predefined outputs. In the oil and natural gas industry, machine learning has become a popular tool for solving geoscientific problems related to exploration, development, and production.Wire-line logs have become a commonly used tool for geosci-entists in the oil and gas industry. With the development of machine learning, various neural

© Muhammad Ali, 2023

networks have been widely used in oil exploration (Antariksa et al., 2022; Song et al., 2021; Valentín et al., 2019). Chawshin et al. (2021) designed a convolutional neural network (CNN) that automatically predicted lithofacies from 2D core CT scan image slices. Alzubaidi et al. (2019) introduced a CNN-based method that used core images to predict lithology automatically, although it did not perform well in the subdivision of rock types. Zhang et al.(2021) used convolutional neural networks to identify lithofacies from core images. While these methods significantly reduced the identification time, they still required many core sample images for network training and labeling, which can be a challenge.To overcome this challenge, Zhang et al. (2021) used relatively low-cost well logs instead of core samples for lithofacies identifica-

tion. This approach effectively provided a first-glance analysis of core data, although the model's generalization required improvements. Despite the limitations, machine-led applications hold great promise in the oil and natural gas industry, enabling more efficient and accurate exploration and production.

The goal of this research is to investigate how supervised classification can improve the recognition of lithological facies in a dataset. The dataset consists of complex geometrically pro-gradational sequence environments that were formed during a significant tectonic event. This event not only affects the distribution of different facies but also leads to the presence of a large volume of volcaniclastic debris, which can negatively impact the quality of the reservoir. Accurately predicting facies is important because the Lower Goru formation in the Ka-danwari gas field has the potential to produce significant amounts of natural gas. The process of facies categorization involves creating new logs, generating synthetic Gamma ray logs using machine learning regression models, and creating artificial data to fill in gaps in the dataset. The final step is to train the data samples using four different classification algorithms and select the most accurate facies classifier from the validation dataset. This study uses "ensemble learning," which combines multiple models to improve accuracy.

Area of Geology:

The research area, Kadanwari, is located in the Southern Indus Basin, also known as the "Lower Indus Basin (LIB)." The Kadanwari gas field reservoir is comprised of typical Lower Goru formation sands in the D, E, and F layers, as well as the tight G layer (LB) (Figure 1) (Ali et al., 2019, 2020; Ashraf et al., 2020; Valzania et al., 2011). The development of the G sand member of the Lower Goru formation occurs during a significant tectonic event, resulting in a geometrically complex progradational sequence. This tectonic event not only impacts the distribution of different facies but also leads to an abundance of volcaniclastic material, which can negatively affect the reservoir quality. The Ka-danwari LB clastic deltaic system is dominated by rivers from the C to H layers and is only slightly modified by tides and waves. The authors suggest that river dynamics influence both the sand-prone "proximal" facies (mainly cross-bedded medium and coarse sandstones) and the fine-grained "distal" facies, characterized by extensive hummocky cross lamination. This phenomenon is associated with the creation of shelf hyperpycnal (density) flows during seasonal flooding stages and significant storms (Afzal et al., 2009; Berger et al., 2009; Kazmi & Jan, 1997). The delta lobes deposited at different stages vary in size and form (Figure 1).

Figure 1: Sedimentology Model and lithology of the study area

Data and Methodology

This study utilized a dataset comprising wireline log measurements of six parameters, namely gamma ray (GR), laterolog deep resistivity (LLD), neutron porosity (NPHI), compressional wave velocity (DT), and bulk density (RHOB), as well as two petrophysical parameters estimated from the data: volume of shale and porosity. The wells sampled were located in the Lower Goru formation of the Kadanwari gas field block, which is situated in the central Indus Basin (Figure 2).

K-15 and K-14 are wells that provide facies log and facies descriptions from geological data, which we utilized as the targeted output for the machine learning (ML) algorithms in our study. The lithology of the research area is classified into six categories, and each category is identi-

fied through meticulous petrophysical analysis and core investigation. Table lcontains the data and serves as a useful representation of the fluvial-based depositional system, using the facies nomenclature mentioned.

Clusters selection criteria

In the study, due to limited core facies data in the K-15 well (Figure 3), it's vital to select the right number of clusters for accurate facies representation before applying machine learning algorithms. To avoid model bias from overfit-ting or underfitting, two criteria are used. The Akaike Information Criterion (AIC) assesses a model's ability to predict future values based on in-sample fit, favoring lower AIC values and aiding in choosing between Holt-Winters models. The Bayesian Information

D.DE 2.DE JCE S.DE H.DE 1D.DE 1Z.DE

um: idi: 4.m: <lie k.gi: loir

Figure 2: Geographical distribution of wells in the study area Table 1: facies classification of Lower Goru

Facies Description

Sh Shale to silty Shale (Shelf deposits)

Slst Siltstone to silty-shaly sandstone (prodelta shales with turbiditic layers)

Css Low-porosity, low permeability cemented sandstone (very distal mouth bar fringe)

Lss Low-medium porosity, low permeability sideritic/chamositic sandstone (shoreface to distal mouth bar)

Ss High-porosity, high permeability sandstone (mouth bar)

Hs Highly chamosite/siderite affected lithologies (chamositized mouth bar)

Figure 3: Visualize core facies data in the small section of well K-15

Criterion (BIC) balances model complexity and fit, with lower values indicating a better fit. The study provides equations to compute AIC and BIC values for model selection, ensuring the optimal cluster number is chosen for machine learning in cases of limited core facies data in the K-15 well.

One-Class Support Vector Machine (SVM):

Support Vector Machine (SVM) is a supervised machine learning algorithm used for classification and regression tasks(Vapnik, 1995). It finds the optimal hyperplane that best separates data points into different classes while maximizing the margin between them. SVM is effective in handling high-dimensional data and can also be used in non-linear scenarios by employing kernel functions. Neural Network:

A neural network is a computational model inspired by the structure and function of the human brain. It consists of interconnected nodes, or artificial neurons, organized in layers, which process and transform input data to produce output. Neural networks are used in various machine learning and deep learning tasks, such as image recognition, natural language processing,

and more, by learning complex patterns and relationships within data (Guresen & Kayakutlu, 2011; Haykin, 2011). Random Forest:

A random forest is an ensemble machine learning method that combines multiple decision trees to make more accurate predictions or classifications (Akkurt et al., 2018). It mitigates overfitting and increases the model's robustness by aggregating the results of individual trees, often resulting in improved performance and generalization.

Data Split and Cross-Validation:

In machine learning, data partitioning is critical to ensure effective model evaluation on new data. Typically, data is split into a training set for model training and a testing set for final evaluation (Alghazal & Krinis, 2021). For small datasets, the risk of overfitting to the training data is a concern. To address this, cross-validation is used. It involves breaking the training set into multiple folds. The model is evaluated on each fold while being trained on the others, reducing overfitting. After cross-validation, the model with the best score is selected and used to predict the testing set for the final evaluation (Figure 4).

1 III II D— 1 IUI II

70% 30%

Training Data | Test Data

Split I Split 2 Split 3 Split 4

Validation

Validation

Validation

Validation

Eralute ML models

fit |1JII'J Mil' LüJ'S >

Best vafiditioii score

Best ML Model

Figure 4: The visual representation of the dataset is divided into two sets. The first one is training and the second one is testing the dataset for the machine learning model

Feature Selection:

Feature selection is a crucial step in reducing dataset complexity by removing irrelevant features. It benefits machine learning in several ways (Ali et al., 2021; Guyon & Elisseeff, 2003; Li et al., 2017), including simplifying models, saving time and costs, reducing overfitting risk, and mitigating data dimensionality issues. This study employed two feature selection methods: univariate and Pearson's Correlation (r). Uni-variate selection individually tests each feature's relationship with the target variable. Pearson's Correlation measures the strength of linear relationships between pairs of variables, with values ranging from -1 to +1, where 0 implies no correlation, negative values indicate a negative correlation, and positive values indicate a positive correlation.

Criteria for verifying model performance:

The classification performance of the models was verified using a standardized confusion matrix since its proposals detailed statistical results for both correctly and incorrectly classified lithofacies (Alghazal & Krinis, 2021). Precision, recall, and F-1 scores were all employed as verification metrics (Eqs. (1) - (3)).

True positive (1)

Precision = -

True positive + False positive

True positive (2)

Recall = ----- (2)

True positive + False negative

1= 1 ( 1 i 1 ) (3)

F1 2 VPrecision Recall) Result and discussion:

This section has multiple parts. First, we determine the cluster count in the study area using our novel algorithm from previous section. Then, we reduce dataset complexity with feature selection techniques, normalizing features for better machine learning convergence. In the third part, we compare lithology identification results for each model. Next, we evaluate algorithm efficiency. Finally, we apply the final model to predict well facies and check the prediction accuracy based on synchronization measures for synthetic logs.

Cluster Analysis:

The pivotal step in clustering analysis is visually assessing the two-information criteria plot to pinpoint the ideal number of clusters. This is done by identifying inflection points in the Euclidean distance plot, which indicate that further clusters do not significantly enhance data characterization. In Figure 5, the AIC/BIC curves level off at six clusters, signifying an effective model for describing the study area well logs' facies.

Figure 5. Determining the optimal number of clusters using the two-information criteria plot

Feature selection:

Before model training, selecting the most relevant feature with a strong target correlation is vital. In this study, eight logging curves were chosen, excluding TVD, LLM, and LLS. Results in Figure 6 show Gamma (GR) had the highest impact (Influence Factor 36.21), followed by Deep Resistivity (LLD, 16.04), Com-pressional Sonic (DT, 13.17), Density (RHOB, 9.85), and Neutron (NPHI, 7.70). Photoelectric factor (PEF), Spontaneous potential (SP), and Caliper (CAL) were least impactful (Influence Factors <6). Thus, the top four inputs (GR, LLD,

DT, RHOB) are most crucial for facies evaluation.

After estimating the cluster count from the two-information criteria plot, core facies and selected logs are added to the training data, leaving out the blind/testing well. One well's data is reserved for algorithm evaluation. Figure 7 shows the facies distribution in the training dataset, highlighting the underrepresentation of High-porosity, high-permeability sandstone (Ss) facies compared to others. To enhance prediction models, more Ss facies samples should be obtained.

Figure 6. Feature importance scores of input features versus core facies

Figure 7. Facies (class) distribution of the training data

Facies predictions utilizing ML models and comparison:

In our study, we evaluated ML models for facies prediction. Initially, default models were tested on the training dataset using confusion matrices to measure accuracy. This tool helps assess model effectiveness and estimate mis-classification losses. For identification performance, we used metrics like precision, recall, and F1-score. Figure 8 displays model performance. GPC excelled, followed by SVM and ANN, while RF performed the poorest. In li-thology identification, ensemble models outperformed RF, with GPC leading. Different models had varying success with different facies types. While all performed well with certain facies, RF

struggled with Ss. GPC performed the best overall, slightly outperforming ANN. SVM was weaker in identifying Hs, LSs, CSs, and Sh compared to ANN, highlighting ensemble models' advantage with non-uniform facies. Figure 8 shows confusion matrices, illustrating how predicted facies compare to actual facies for each lithofacies. For instance, in the GPC model's matrix, common misclassifications included 3% of samples as Hs, CSs, and Sh, and 3% of CSs samples as Sh. These errors result from overlapping logging features in the misclassified lithofacies. Nevertheless, the GPC model outperformed others, achieving high lithofacies prediction accuracy, as shown in Figure 8.

Figure 8. Confusion Matrix of SVM, ANN, GPC, and RF facies prediction outcomes and Specific facies are identified by their labels and explained in the table

Figure 9. The prediction results of final models for K-14 well (blind well)

In the last step, we used the GPC model to predict lithofacies in the testing dataset and checked the accuracy based on synchronization measures for synthetic logs. Utilizing core data from K-15 wells, we extended facies predictions to wells without core data. Figure 9 shows the facies distribution in a blind well using the GPC model. The first four plots display measured logs with depth (in meters), and the last track shows the predicted facies track by GPC.

To assess facies prediction accuracy without ground truth data in Figure 9 (blind well K-14), a novel technique was used. It confirmed predictions by leveraging elastic parameters such as acoustic impedance and velocity ratio, which are linked to reservoir quality and lithofacies. An acoustic impedance vs velocity ratio cross plot was generated for the blind well based on predicted lithofacies, producing 2D probability density functions (PDFs) (Figure 10a and b). These PDFs represent each sample's similarity to data points within its cluster and dissimilarity to those outside its cluster. They were then used as input with depth in a neural network to predict synthetic gamma-ray log responses (Figure 10c). Since the gamma-ray log is crucial for rock type differentiation, its prediction accuracy was evaluated using synchronization measures. Figure 10d compares actual gamma logs (GR)

with predicted synthetic logs (Syn_GR) from the neural network, showing visually satisfactory results with nearly identical average log trends. Cross plots between measured GR and predicted Syn_GR from the machine learning algorithm for blind wells are displayed in Figure 10e. These plots quantitatively measure the algorithm's predictive ability, yielding a satisfactory R2 value of 0.978.

After predicting lithofacies using well log data, the inverted acoustic impedance and Vp/Vs volumes were used to classify the predicted lithofacies. This information was then used to construct a facies prediction volume. The inverted facies volume is strongly correlated with the predicted lithofacies at the well locations (Figure 11a). To ensure quality control, the Vp log was compared to the Vp modeled from the predicted porosity volume generated from inversion utilizing the facies model, and the Vp modeled from the original porosity log. The modeled Vp calculated based on the porosity log correlates well with the Vp log, and the predicted Vp from inversion shows good correlation with the measured log (Figure 11b), especially in the upper section of the well where the reservoir facies are present. Overall, the model performs very well and effectively differentiates the rock types.

Figure 10. Result of the blind well (a) predicted lithofacies (b) based on lithofacies extract PDFs (c) utilized neural network to predicted synthetic log (d) actual and predicted gamma-ray log response comparison trend (e) check the prediction accuracy result by the least square method

Figure 11. Displays two panels: (a) the facies prediction volume generated by classifying the predicted lithofacies using the inverted acoustic impedance and Vp/Vs volumes, and (b) a comparison of the original Vp log, Vp predicted from the facies model applied on the original porosity log, and modeled Vp obtained from the predicted porosity volume derived from inversion using the facies model

Conclusion:

A new workflow for lithofacies prediction using supervised machine learning algorithms was developed. The novel two-information criteria clustering approach generated facies that were strictly data-driven. Four supervised machine learning classifiers were employed, with the GPC model achieving the highest identification performance. The GPC model was used to predict lithofacies in the testing dataset and the accuracy of facies similarity was assessed by synchronization measures to predict synthetic logs. A 2D probability density function was generated from an acoustic impedance vs velocity ratio cross plot of a blind well based on the predicted lithofacies. This was used as input with depth in a neural network to predict synthetic gamma-ray log response. The results obtained from the neural network were visually satisfactory, with a high correlation between the measured GR and predicted Syn_GR from the machine learning algorithm at blind wells (R2 of 0.978). Finally, the predicted lithofacies were used to create a facies prediction volume, which correlated well with the predicted litho-facies classification. The workflow provides consistent, reliable, and efficient results, saving time and effort in data processing and interpretation.

References

Afzal, J., Kuffner, T., Rahman, A., & Ibrahim, M. (2009). Seismic and Well-log Based Sequence Stratigraphy of The Early Cretaceous, Lower Goru "C" Sand of The Sawan Gas Field, Middle Indus Platform, Pakistan. Proceedings, Society of Petroleum Engineers (SPE)/Pakistan Association of Petroleum Geoscientists (PAPG) Annual Technical Conference, Islamabad, Pakistan.

Akkurt, R., Conroy, T. T., Psaila, D., Paxton, A., Low, J., & Spaans, P. (2018). Accelerating and Enhancing Petrophysical Analysis With Machine Learning: A Case Study of an Automated System for Well Log Outlier Detection and Reconstruction. SPWLA 59th Annual Logging Symposium, 2-6 June, London, UK.

Alghazal, M., & Krinis, D. (2021). A novel approach of using feature-based machine learning models to expand coverage of oil saturation from dielectric logs. Society of Petroleum Engineers -

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

SPE Europec Featured at 82nd EAGE Conference and Exhibition, EURO 2021, 2, 10. https://doi.org/ 10.2118/205162-ms

Ali, M., Jiang, R., Ma, H., Pan, H., Abbas, K., Ashraf, U., & Ullah, J. (2021). Machine learning -A novel approach of well logs similarity based on synchronization measures to predict shear sonic logs. Journal of Petroleum Science and Engineering, 203, 108602. https://doi.org/ https://doi.org/10.1016/ j.petrol.2021.108602

Ali, M., Khan, M. J., Ali, M., & Iftikhar, S.

(2019). Petrophysical analysis of well logs for reservoir evaluation: a case study of "Kadanwari" gas field, middle Indus basin, Pakistan. Arabian Journal of Geosciences, 12(6). https://doi.org/10. 1007/s12517-019-4389-x

Ali, M., Ma, H., Pan, H., Ashraf, U., & Jiang, R.

(2020). Building a rock physics model for the formation evaluation of the Lower Goru sand reservoir of the Southern Indus Basin in Pakistan. Journal of Petroleum Science and Engineering, 194, 107461. https://doi.org/10.1016Aj.petrol.2020.107 461

Antariksa, G., Muammar, R., & Lee, J. (2022). Performance evaluation of machine learning-based classification with rock-physics analysis of geological lithofacies in Tarakan Basin, Indonesia. Journal of Petroleum Science and Engineering, 208, 109250. https://doi.org/https: //doi.org/10.1016/ j.petrol.2021.109250

Ashraf, U., Zhang, H., Anees, A., Ali, M., Zhang, X., Shakeel Abbasi, S., & Nasir Mangi, H. (2020). Controls on Reservoir Heterogeneity of a Shallow-Marine Reservoir in Sawan Gas Field, SE Pakistan: Implications for Reservoir Quality Prediction Using Acoustic Impedance Inversion. Water, 12(11). https://doi.org/10.3390/w12112972

Berger, A., Gier, S., & Krois, P. (2009). Porosity-preserving chlorite cements in shallow-marine volcaniclastic sandstones: Evidence from cretaceous sandstones of the sawan gas field, Pakistan. AAPG Bulletin. https://doi.org/10.1306/ 01300908096

Chawshin, K., Gonzalez, A., Berg, C. F., Varagnolo, D., Heidari, Z., & Lopez, O. (2021). Classifying Lithofacies from Textural Features in Whole Core CT-Scan Images. SPE Reservoir Evaluation & Engineering, 24(02), 341-357. https://doi.org/10.2118/205354-PA

Guresen, E., & Kayakutlu, G. (2011). Definition of Artificial Neural Networks with comparison to other networks. Procedia Computer Science, 3, 426-433. https://doi.org/10.10161j.procs.2010.12. 071

Guyon, I., & Elisseeff, A. (2003). An Introduction to Variable and Feature Selection. J. Mach. Learn. Res., 3(null), 1157-1182.

Haykin, S. O. (2011). Neural Networks and Learning Machines. Pearson Education. https://books.google.com/books?id=faouAAAAQB AJ

Kazmi, A. H., & Jan, M. Q. (1997). Geology and tectonics of Pakistan. Graphic publishers.

Li, Y., Li, T., & Liu, H. (2017). Recent advances in feature selection and its applications. Knowledge and Information Systems, 53(3), 551577. https://doi.org/10.1007/s10115-017-1059-8

Song, L., Liu, Z., Li, C., Ning, C., Hu, Y., Wang, Y., Hong, F., Tang, W., Zhuang, Y., Zhang, R., & others. (2021). Prediction and Analysis of Geomechanical Properties of Jimusaer Shale Using a Machine Learning Approach. SPWLA 62nd Annual Logging Symposium.

Valentín, M. B., Bom, C. R., Coelho, J. M., Correia, M. D., de Albuquerque, M. P., de Albuquerque, M. P., & Faria, E. L. (2019). A deep residual convolutional neural network for automatic lithological facies identification in Brazilian pre-salt oilfield wellbore image logs.

Journal of Petroleum Science and Engineering, 179, 474-503. https: //doi .org/https: //doi .org/10.1016/ j.petrol.2019. 04.030

Valzania, S., Kfoury, M., Grandis, M., Valdisturlo, A., Fanello, G., Guerra, L., Heikal, S., Kashif, A., & Sultan, A. (2011). Kadanwari field: A tight gas reservoir study and a successful pilot well give new life to an exploited field. 73rd European Association of Geoscientists and Engineers Conference and Exhibition 2011: Unconventional Resources and the Role of Technology. Incorporating SPE EUROPEC 2011, 4, 2715-2744. https://doi.org/10.2118/143001-ms

Vapnik, V. N. (1995). The Nature of Statistical Learning Theory. Springer-Verlag. https://doi.org/ https://doi.org/10.1007/978-1-4757-3264-1

Zhang, J., Ambrose, W., & Xie, W. (2021). Applying convolutional neural networks to identify lithofacies of large-n cores from the Permian Basin and Gulf of Mexico: The importance of the quantity and quality of training data. Marine and Petroleum Geology, 133, 105307. https: //doi. org/https: //doi.org/10. 1016/ j.marpetgeo.2021. 105307

Контролируемый техпроцесс для прогнозирования литофаций в сложных неоднородных плотных песчаных коллекторах: Основанный на данных подход с использованием моделей кластеризации и классификации

А. Мухаммед

Институт геофизики и геоматики Китайского университета геонаук Ухань, КНР, г. Ухань, улица Лумо № 388, район Хунсань. E-mail: muhammad_ali@cug.edu.cn

В этом исследовании представлен новый контролируемый техпроцесс (методика) для прогнозирования литофаций в сложных, неоднородных коллекторах из плотного песчаника с промежуточными фациями. Используя метод кластеризации с двумя информационными критериями, идентифицируются шесть различных фаций, что обеспечивает объективную, основанную на данных, альтернативу ручным подходам. Среди моделей классификации именно классификация гауссовских процессов (КГП) превосходит другие, в том числе машину опорных векторов (МОВ) и искусственную нейронную сеть (ИНС), при этом случайный лес (СЛ) работает менее эффективно. КГП точно предсказывает литофации в данных тестирования и оценивается на предмет точности сходства. Прогнозируемые литофации интегрируются в кросс-графики зависимости акустического импеданса от отношения скоростей, в результате чего получаются двумерные функции плотности вероятностей. В сочетании с данными о глубине они поступают в нейронную сеть для прогнозирования результатов синтетического гамма-каротажа. Результаты демонстрируют близкое согласование между измеренными и прогнозируемыми данными гамма-каротажа (R2 = 0,978) и почти идентичные тенденции (формы ано-

малий) каротажных диаграмм. Кроме того, прогнозируемые литофации классифицируются с использованием инвертированных объемов импеданса и отношения скоростей, что дает прогнозируемый объем фаций, который хорошо согласуется с классификацией литофаций скважины, даже без керно-вых данных.

Ключевые слова: прогнозирование литофаций, сложные песчаные коллекторы, синтетический гамма-каротаж, машинное обучение.

i Надоели баннеры? Вы всегда можете отключить рекламу.