Spatial hidden Markov chain models for estimation of petroleum reservoir categorical variables

Similar documents
Transiogram: A spatial relationship measure for categorical data

To link to this article:

Bayesian Markov Chain Random Field Cosimulation for Improving Land Cover Classification Accuracy

Statistical Rock Physics

Bayesian lithology/fluid inversion comparison of two algorithms

Markov Chain Random Fields for Estimation of Categorical Variables

Optimizing Thresholds in Truncated Pluri-Gaussian Simulation

Reservoir connectivity uncertainty from stochastic seismic inversion Rémi Moyen* and Philippe M. Doyen (CGGVeritas)

The Seismic-Geological Comprehensive Prediction Method of the Low Permeability Calcareous Sandstone Reservoir

Geophysical methods for the study of sedimentary cycles

LettertotheEditor. Comments on An efficient maximum entropy approach for categorical variable prediction by D. Allard, D. D Or & R.

Earth models for early exploration stages

Inverting hydraulic heads in an alluvial aquifer constrained with ERT data through MPS and PPM: a case study

Teacher s Aide Geologic Characteristics of Hole-Effect Variograms Calculated from Lithology-Indicator Variables 1

Reliability of Seismic Data for Hydrocarbon Reservoir Characterization

Study on the Couple of 3D Geological Model and Reservoir Numerical Simulation Results

COLLOCATED CO-SIMULATION USING PROBABILITY AGGREGATION

NORGES TEKNISK-NATURVITENSKAPELIGE UNIVERSITET. Directional Metropolis Hastings updates for posteriors with nonlinear likelihoods

Facies Modeling in Presence of High Resolution Surface-based Reservoir Models

Computational Challenges in Reservoir Modeling. Sanjay Srinivasan The Pennsylvania State University

Geostatistics for Seismic Data Integration in Earth Models

Optimizing the reservoir model of delta front sandstone using Seismic to Simulation workflow: A case study in the South China Sea

IJMGE Int. J. Min. & Geo-Eng. Vol.49, No.1, June 2015, pp

Multiple-Point Geostatistics: from Theory to Practice Sebastien Strebelle 1

Net-to-gross from Seismic P and S Impedances: Estimation and Uncertainty Analysis using Bayesian Statistics

Downloaded 10/02/18 to Redistribution subject to SEG license or copyright; see Terms of Use at

Study on Prediction Method of Fluvial Facies Sandbody in Fluvial Shallow Water Delta

We LHR3 04 Realistic Uncertainty Quantification in Geostatistical Seismic Reservoir Characterization

Integration of Rock Physics Models in a Geostatistical Seismic Inversion for Reservoir Rock Properties

Contents 1 Introduction 2 Statistical Tools and Concepts

Abstract. Introduction. G.C. Bohling and M.K. Dubois Kansas Geological Survey Lawrence, Kansas, USA

USING GEOSTATISTICS TO DESCRIBE COMPLEX A PRIORI INFORMATION FOR INVERSE PROBLEMS THOMAS M. HANSEN 1,2, KLAUS MOSEGAARD 2 and KNUD S.

Quantitative Seismic Interpretation An Earth Modeling Perspective

Thomas Bayes versus the wedge model: An example inference using a geostatistical prior function

Probabilistic seismic inversion using pseudo-wells

Open Access Study on Reservoir-caprock Assemblage by Dual Logging Parameter Method

Application of Transition Probability Geostatistics in a Detailed Stratigraphic Framework. Gary Weissmann University of New Mexico

Combining geological surface data and geostatistical model for Enhanced Subsurface geological model

Stochastic vs Deterministic Pre-stack Inversion Methods. Brian Russell

Agricultural University, Wuhan, China b Department of Geography, University of Connecticut, Storrs, CT, Available online: 11 Nov 2011

The SPE Foundation through member donations and a contribution from Offshore Europe

COLLOCATED CO-SIMULATION USING PROBABILITY AGGREGATION

Geostatistical History Matching coupled with Adaptive Stochastic Sampling

Exploration Significance of Unconformity Structure on Subtle Pools. 1 Vertical structure characteristics of unconformity

Downloaded 10/25/16 to Redistribution subject to SEG license or copyright; see Terms of Use at

Simple closed form formulas for predicting groundwater flow model uncertainty in complex, heterogeneous trending media

Regional-scale modelling of the spatial distribution of surface and subsurface textural classes in alluvial soils using Markov chain geostatistics

DATA ANALYSIS AND INTERPRETATION

A Markov Chain Model for Subsurface Characterization: Theory and Applications 1

Statistical Inference for Stochastic Epidemic Models

Quantitative Relation of the Point BarWidth and Meander Belt Width of Subsurface Reservoir

Consistent Downscaling of Seismic Inversions to Cornerpoint Flow Models SPE

Application of seismic hydrocarbon detection technique to natural gas exploration-take Yingshan rift volcanic in the Yingcheng Groups as an instance

The GIG consortium Geophysical Inversion to Geology Per Røe, Ragnar Hauge, Petter Abrahamsen FORCE, Stavanger

Integration of seismic interpretation in to geostatistical acoustic impedance Results presentation and discussion

Comparing the gradual deformation with the probability perturbation method

Scaling Neighbourhood Methods

Bayesian Lithology-Fluid Prediction and Simulation based. on a Markov Chain Prior Model

Application of the Combination of Well and Earthquake in Reservoir Prediction of AoNan Area

Facies Classifications for Seismic Inversion

Geostatistical History Matching coupled with Adaptive Stochastic Sampling: A zonation-based approach using Direct Sequential Simulation

Best Practice Reservoir Characterization for the Alberta Oil Sands

Automatic Drawing of the Complicated Geological Faults

Geochemical characterization of Lucaogou Formation and its correlation of tight oil accumulation in Jimsar Sag of Junggar Basin, Northwestern China

The Open Petroleum Engineering Journal

Seismic Response and Wave Group Characteristics of Reef Carbonate Formation of Karloff-Oxford Group in Asser Block

Markov Chain Monte Carlo methods

Assessing uncertainty on Net-to-gross at the Appraisal Stage: Application to a West Africa Deep-Water Reservoir

Large Scale Modeling by Bayesian Updating Techniques

Porosity prediction using cokriging with multiple secondary datasets

Bayesian reservoir characterization

A008 THE PROBABILITY PERTURBATION METHOD AN ALTERNATIVE TO A TRADITIONAL BAYESIAN APPROACH FOR SOLVING INVERSE PROBLEMS

STA 414/2104: Machine Learning

Machine Learning Overview

Reservoir characterization

Article Scale Effect and Anisotropy Analyzed for Neutrosophic Numbers of Rock Joint Roughness Coefficient Based on Neutrosophic Statistics

Traps for the Unwary Subsurface Geoscientist

Time-lapse geophysical technology-based study on overburden strata changes induced by modern coal mining

Downloaded 09/15/16 to Redistribution subject to SEG license or copyright; see Terms of Use at

Applications of Randomized Methods for Decomposing and Simulating from Large Covariance Matrices

Sequential Monte Carlo Methods for Bayesian Computation

Quantitative Interpretation

Sequential Simulations of Mixed Discrete-Continuous Properties: Sequential Gaussian Mixture Simulation

Applied Geostatisitcs Analysis for Reservoir Characterization Based on the SGeMS (Stanford

Pattern Recognition and Machine Learning

An Improved Spin Echo Train De-noising Algorithm in NMRL

QUANTITATIVE INTERPRETATION

Visualizing Spatial Uncertainty of Multinomial Classes in Area-class Mapping

Model updating mechanism of concept drift detection in data stream based on classifier pool

Deterministic and stochastic inversion techniques used to predict porosity: A case study from F3-Block

We apply a rock physics analysis to well log data from the North-East Gulf of Mexico

Geostatistical applications in petroleum reservoir modelling

Bivariate Degradation Modeling Based on Gamma Process

Assessment of uncertainty in computer experiments: from Universal Kriging to Bayesian Kriging. Céline Helbert, Delphine Dupuy and Laurent Carraro

CO 2 storage capacity and injectivity analysis through the integrated reservoir modelling

Bayesian Methods for Machine Learning

3D stochastic inversion of borehole and surface gravity data using Geostatistics

Seismic inversion for reservoir properties combining statistical rock physics and geostatistics: A review

Porosity Calculation of Tight Sand Gas Reservoirs with GA-CM Hybrid Optimization Log Interpretation Method

Downloaded 09/16/16 to Redistribution subject to SEG license or copyright; see Terms of Use at

Transcription:

J Petrol Explor Prod Technol (217) 7:11 22 DOI 1.17/s1322-16-251-9 ORIGINAL PAPER - EXPLORATION GEOLOGY Spatial hidden Markov chain models for estimation of petroleum reservoir categorical variables Xiang Huang 1 Peng Jiao 2 Jie Li 2 Yuru Liang 3 Zhizhong Wang 1 Jianhua Guo 2 Received: 17 October 215 / Accepted: 2 May 216 / Published online: 9 May 216 The Author(s) 216. This article is published with open access at Springerlink.com Abstract Indicator variograms and transition probabilities are used to measure spatial continuity of petroleum reservoir categorical variables. Variogram-based Kriging variants are symmetric geostatistical methods, which cannot completely capture the complex reservoir spatial heterogeneity structure. The asymmetric spatial Markov chain (SMC) approaches employ transition probabilities to incorporate proportion, length and juxtaposition relation information in subsurface reservoir structures. Secondary data in petroleum geology, however, cannot be reasonably aggregated. We propose a spatial hidden Markov chain (SHMC) model to tackle these issues. This method integrates well data and seismic data by using Viterbi algorithm for reservoir forecasting. The classified sonic impedance is used as auxiliary data, directly in some kind of Bayesian updating process via a hidden Markov model. The SMC embedded in SHMC has been redefined according to first-order neighborhood with different lag in three-dimensional space. Compared with traditional SMC in Markov chain random field theory, the SHMC method performs better in prediction accuracy and reflecting the geological sedimentation process by integrating auxiliary information. Keywords Hidden Markov model Posterior probability Reservoir simulation Sonic impedance Viterbi algorithm & Xiang Huang huangxiang@csu.edu.cn 1 2 3 Department of Statistics, Central South University, Changsha 4112, Hunan, China School of Geosciences and Info-Physics, Central South University, Changsha 4183, Hunan, China Institute for Basic Research, Shandong Yingcai University, Jinan 2514, Shandong, China Introduction Markov process was firstly introduced by Russian mathematician, Andrei Andreyevich Markov, in 197. Markovbased algorithm, like Markov chain Monte Carlo (MCMC) simulation, has recently been used to quantify uncertainty in infill well placement in the field of petroleum exploration and production (Arinkoola et al. 215). Spatial Markov chain (SMC) models have also been widely adopted in petroleum reservoir to characterize the spatial heterogeneity of categorical variables through the conditional probabilities (transition probabilities) from different directions (Carle and Fogg 1997; Weissmann and Fogg 1999). At present, there are two kinds of different independent assumptions to simplify the conditional probability of SMC models: one is full independence assumption; the other is conditional independence assumption. The full independence assumption is defined by Elfeki and Dekking (21), and the corresponding conditional probability formulas are proposed by Elfeki and Dekking (21) and developed by Li et al. (212). A spatial Markov chain with full independence assumption consists of several one-dimensional Markov chains, which are forced to move to the same location with equal states. The full independent assumption caused the small class underestimation problem. This method is feasible only for enough conditional data. The conditional independence assumption can be found in Pickard random field in cardinal directions (Pickard 198), and its general definition is suggested by Li (27), i.e., given a cell, its nearest neighboring states are conditionally independent. The general conditional probability formulas are given by Li (27), which do not have the small class underestimation problem. An SMC is actually a dimensionality reduction process where its multi-dimensional conditional probabilities are

12 J Petrol Explor Prod Technol (217) 7:11 22 expressed as multiple one-dimensional transition probabilities. The transition probabilities of reservoir categorical variables, such as lithofacies, can be estimated from well data. The vertical transition probabilities can be estimated by the vertical transition tallies from well logs. The transition probabilities in other directions can be estimated by the Walther s law (Li et al. 212). Most traditional geostatistical models, like Markov chain random field (MCRF), use well data only and make prediction based on SMC, which results in a relatively low prediction accuracy (Huang et al. 216a). Huang et al. (216b) introduced a beta-transformed Bayesian updating model to boost the classification accuracy of category random field. Auxiliary information, however, has not been taken into consideration. To make use of secondary data, such as geophysical well logs, Eidsvik et al. (24) used hidden Markov chains for estimation of geological attributes. The hidden Markov chain uses Dirichlet prior distributions for the Markov transition probabilities between rock types. Li et al. (21) developed the Markov chain models by integrating multiscale information, such as logging, core data and seismic data. In the remote-sensing area, Li et al. (215) introduced a Bayesian MCRF cosimulation method for improving land cover classification accuracy. We propose a single spatial hidden Markov chain (SHMC), which improves the accuracy of reservoir modeling by integrating geological conceptual data with well data. Review of Markov models Markov mesh model A petroleum reservoir grid is a finite, regular grid in one to three dimensions, and its gridding cells are indexed by a positive integer s, where s takes on values in S ¼f1; 2;...; ng. All cell states F ¼ ff 1 ; F 2 ;...; F n g can be regarded as a family of random category variables defined on the set S; each random variable F s takes a state value f s in the state set X ¼f1; 2;...; mg. If all cell states F 1 ; F 2 ;...; F n follow a sequential path, it is defined as a spatial stochastic sequence. A set of reservoir category variables F can be considered as a Markov random field or a Gibbs random field, and its joint probability (likelihood function) generally takes the following form (Tjelmeland and Besag 1998; Salomão and Remacre 21) n exp P n P o s¼1 j2g s Wðf s ; f j Þ Prðf Þ¼ n Pf exp P n P o ð1þ s¼1 j2g s Wðf s ; f j Þ where g s is a set of cells which is adjacent to s; W f s ; f j denotes the relationship between cell s and cell j; f ¼ ff 1 ; f 2 ;...; f n g is a configuration of F, corresponding to a realization of the field. The use of Eq. (1) for the simulation of reservoir category variables is theoretically feasible, but it is actually limited by the highly time consuming in computation. By the best-known classical approximation, Eq. (1) can be simplified as Blake et al. (211) suggested Prðf Þ¼ Yn Pr f s jf gs ð2þ s¼1 where f gs ¼ff r jr 2 g s g stands for the set of state values at the cells neighboring s. Markov mesh models (Stien and Kolbjørnsen 211) are fully specified through the conditional probabilities in RHS of Eq. (2) as Pr f s jf gs ¼ Pr ð fs jf s 1 ; f s1 ; f s2 ;...; f sl Þ ð3þ where s 1 ; s 2 ;...; s l is its nearest known locations of current cell s in different directions; s - 1 is always the start cell of the Markov chain to the unknown cell s, which is to be estimated. The probabilities in Eq. (3) are defined through logit link functions in generalized linear models, and Markov mesh can use larger cliques or neighborhood to capture complex interclass relationships. Recently, Stien and Kolbjørnsen (211) proposed the method of a fast estimation through iterated weighted least squares and fast simulation through a unilateral path. Kolbjørnsen et al. (214) recommended using multiple grids in Markov mesh facies modeling, which is typically ten times faster than that of creating one SNESIM realization. Although Markov mesh model is widely used in geoscience, the parameter estimation and iteration process are annoying. Spatial Markov chain models Spatial Markov chain models use the full independence assumption and the conditional independence assumption to define the conditional probability for simplifying the complex computation in Eq. (3). It is actually a dimensionality reduction process where its conditional probabilities Pr f s jf gs are expressed as multiple one-dimensional transition probabilities from different directions. The spatial Markov chain can be constructed by l? 1 one-dimensional Markov chains together, but these onedimensional chains are forced to move to the same location with equal states under the full independence assumption. Then, the conditional probabilities in Eq. (3) can be expressed as Prðf s jf s 1 ; f s1 ;...; f sl Þ¼ p f s 1 f s p 1 f s1 f s...p l f sl f s P f s p fs 1f s p 1 ð4þ f s1 f...pl s f sl f s where p r f sr f s denotes a transition probability in the rth direction from state f sr to f s and p fs 1 f s denotes a transition

J Petrol Explor Prod Technol (217) 7:11 22 13 probability along moving direction of the spatial Markov chain from state f s 1 to f s. We can derive the conditional probabilities of two-dimensional Markov chain model (Elfeki and Dekking 21) and three-dimensional Markov chain model (Li et al. 212) from Eq. (4). Using the conditional independence assumption, Li (27) gives the general expression of the conditional probability formula in Eq. (3) at any location s as Prðf s jf s 1 ; f s1 ;...; f sl Þ ¼ p f s 1 f s p 1 f s f s1...p l f s f sl Pf s p fs 1 f s p 1 f s f s1...p l ð5þ f s f sl where p r f s f sr denotes a transition probability in the rth direction from state f s to f sr and p fs 1 f s denotes a transition probability along moving direction of the spatial Markov chain from state f s 1 to f s. Generally speaking, the difference between spatial Markov chain model and Markov mesh model is that the latter uses directly the local conditional probabilities in Eq. (3) or the joint probability in Eq. (2), and spatial Markov chain models use multiple one-dimensional transition probabilities or simplified formulas of the local conditional probabilities in Eq. (3). A spatial Markov chain model may be viewed as a special case of Markov mesh models, whereas a Markov mesh model is an extension of spatial Markov chain models, called a generalized spatial Markov chain model. Spatial hidden Markov chain model A spatial hidden Markov chain (SHMC), a combination of SMC and hidden Markov model (HMM), is a double random sequence process consisting of a Markov chain and a spatial stochastic sequence. It can make good use of information from well data and auxiliary data. The SHMC is an extension of SMC. It is better able to capture interclass dependency relationships (neighboring relationships, cross-correlations, directional asymmetries) among hidden variables. A spatial Markov chain F ¼ ff 1 ; F 2 ;...; F n g of reservoir categorical variables is characterized by its states and conditional probabilities through Eqs. (4) or(5), and the model is particularly useful as a prior model. The states of the chain except the wells are unobservable, therefore hidden. A stochastic sequence W ¼ ðw 1 ; W 2 ;...; W n Þ of reservoir categorical variables is from auxiliary data, and its observed values are denoted by w ¼ðw 1 ; w 2 ;...; w n Þ. Definition based on Bayes theory A spatial hidden Markov model uses the posterior probability distribution for modeling reservoir categorical variables and the distribution of the possible states F. Given the observations w, the posterior probability is computed by using the following formula Prðf jwþ ¼ Prðwjf ÞPrðf Þ Pf Prðwjf ÞPrðf Þ ð6þ Using Eq. (6), the local conditional probabilities are written as Prðf s jf s 1 ;f s1 ;...;f sl ;w s Þ¼ Prðw sjf s ÞPrðf s jf s 1 ;f s1 ;...;f sl Þ P f s Prðw s jf s ÞPrðf s jf s 1 ;f s1 ;...;f sl Þ ð7þ where Prðf Þ is the prior probability, which is estimated from well data; Prðwjf Þ is conditional probabilities of the observations w for f fixed, i.e., a likelihood item; PrðwÞ ¼ P f Prðwjf ÞPrðf Þ is the probability of W, which is a normalization constant when w is given. We call unobservable f true states and w observed values. The right side of Eqs. (6) or(7) has been widely used since Thomas Bayes (1764) and Pierre Simon Laplace (1774) introduced Bayesian statistics, but it is not found in petroleum reservoir hidden Markov application. To simulate reservoir categorical variables using Prðf s jf s 1 ; f s1 ;...; f sl ; w s Þ, we need to estimate the conditional probability Prðw s jf s Þ and compute the local conditional probability Prðf s jf s 1 ; f s1 ;...; f sl Þ in the right of Eq. (7). Specifying the prior conditional probability We use Eq. (8) to define the prior conditional probabilities; the formula is given as follows Prðf s jf s 1 ; f s1 ;...; f sl Þ ¼ p ð f s 1 f s p 1 Þ f s f s1...p ðþ l f s f sl P ð8þ f s p fs 1f s p ð1þ f s f s1...p ðþ l where s 1 ; s 2 ;...; s l is its nearest known locations of current cell s in different directions; s - 1 is always the start cell of the Markov chain to the unknown cell s, which is to be estimated; the superscript ð1þ; ð2þ;...; ðlþ indicates the different lag h. Thus, we have redefined SMC illustrated in Fig. 1. The prior conditional probabilities can be computed with ðlþ Eq. (8), where p fs f sl is given in transition probability function. Obtaining the local conditional probabilities requires to calculate Prðw s jf s Þ. The state value f s is the true value and unobservable except the well data, and the state value w is regarded as observation value. It is noted that Prðw s jf s Þ is essentially the likelihood or the forward model relating facies to sonic impedance; it is not the prior geologic concept, though of course the geology helps to pick the right rock physics and seismic model to relate facies to impedance. f s f sl

14 J Petrol Explor Prod Technol (217) 7:11 22 Impedance partition Fig. 1 SMC defined by first-order 3-D neighborhood with different lag, cell s is to be studied Case study Data set The data we used for our research are gathered from Tahe area of the Tarim Basin in Xinjiang Uygur Autonomous Region, China. Tahe oil field, located in Xayar uplift, north of Tarim basin (Fig. 2), up to now is one of the greatest domestic discoveries in the Paleozoic carbonate rock series. There are two extensive unconformities developed in this area. The Carboniferous clastic rocks directly overlie on the carbonate rocks of Ordovician and underlie the Permian pyroclastic rocks or Triassic formation (Fig. 3). In view of achievements in the carbonate formation of Ordovician, as the seal of it, the Carboniferous (T5 T56), which belongs to the same petroleum system as Ordovician, also shows its exploration potential of lithological reservoir. The purpose layer is located at the depth between 52 and 53 m, developed as part of the second formation of the lower Carboniferous (Kalashayi formation). There are three major lithofacies in this work area: mudstone, sandstone and conglomerate. The conglomerate is relatively low in content. We have got four wells log data with 59 samples in the three-dimensional space, just as shown in Fig. 4. Three wells are located in the corners of this work area; another well is located inside. The distance in east west direction of the two wells is 9 and 12 m in south north direction, the simulated space is split into a 3 4 5 grid system, and each cell is a 3 m 3 m 2 m cuboid. Figure 5 illustrates some basic descriptive statistics of sonic impedance, such as sample size, mean, variance. The impedance will be regarded as observed value w and will be divided into two classes: strong and weak. By analyzing log and core data, we choose the impedance median 8315.48 as the threshold. The impedance is regarded as strong if it is greater than 8315.48 and weak when it is less than the threshold. By doing so, we derive the emission matrix (Table 1) and the emission probability (Table 2). The initial proportion of each reservoir categorical variable can be computed from Table 1. Mudstone is 62.87 %, sandstone is 31.24 %, and conglomerate is 5.89 %, respectively. The impedance of each class is depicted in Fig. 6. By analyzing Table 2 and Fig. 6, we may find that the impedance of mudstone tends to be stronger than sandstone and conglomerate. Transiogram models The magnitude of the transition probability depends on a sampling interval, i.e., the transition probability is a nonlinear function of the sampling intervals (Carle and Fogg 1997). By increasing sampling interval, the transition probability forms a transition probability function (also called transiogram ), which is regarded as a measure of spatial continuity (Li 26). Experimental transiograms are estimated from the 59 points and fitted by exponential models. The fitted transiogram models are used for simulations. Because raster data are used, the lag h represents as the number of pixels (i.e., grid units), not the exact distance. Figure 7 illustrates the experimental auto-/crosstransiograms and their fitting models. It can be seen that most of these experimental transiograms can be approximately fitted by an exponential model. We also find that some experimental transiograms have apparent fluctuations that are difficult to fit using the basic model, such as p 22 and p 23. This may be caused by the insufficiency of observed data and the non-markovian effect of the real data. Fitted transiogram models capture only part of the features of experimental transiograms, depending on the complexity of the mathematical models used (Li 27). Using composite hole-effect models (Ma and Jones 21) may capture more details, such as periodicities, of experimental transiograms. Simulation results The SHMC can be determined by initial probabilities (prior probabilities) C, transition matrix A and emission matrix B. The states sequence depends on C and A, while the

J Petrol Explor Prod Technol (217) 7:11 22 15 Fig. 2 Location map of the studied area Fig. 3 Stratigraphic chart of Tahe area

16 J Petrol Explor Prod Technol (217) 7:11 22 Fig. 4 3-D work area with four wells, x axis and y axis indicate east west direction and south north direction, respectively. z axis indicates vertical direction Fig. 5 Basic descriptive statistics of sonic impedance, blue curve is the cumulative distribution function Table 1 Emission matrix Table 2 Emission probability Strong Weak Strong Weak Mudstone 222 98 Sandstone 28 131 Conglomerate 5 25 observed sequence is determined by B. As a result, the SHMC can be expressed as k ¼ ða; B; CÞ. We consider a first-order neighborhood, which contains six neighbors in 3-D space. Transition matrix A can be computed by Eq. (8), where l = 6. By using the transition probabilities from Fig. 7, the final result of transition matrix A with lag h ¼ 1; 2; 3...; l is Mudstone.6938.363 Sandstone.1761.8239 Conglomerate.1667.8333 1 :7952 :211 :37 A ¼ @ :2534 :6967 :499 A :1271 :68 :8121 where the main diagonal elements indicate the probabilities transfer between same reservoir categorical variables;.211, for example, is the probability transfer from mudstone to sandstone. Emission matrix B is

J Petrol Explor Prod Technol (217) 7:11 22 17 1 9 mudstone sandstone conglomerate 1 :6938 :363 B ¼ @ :1761 :8239 A :1667 :8333 impedance(g/cm 3. m/s) 8 7 6 5 4 3 5 1 15 2 25 3 35 sample Fig. 6 Impedance of each reservoir categorical variable. Conglomerate: left, sandstone: middle, mudstone: right which is given in Table 2. Initial probabilities (prior probabilities) C are C ¼ ð:6287 :3124 :589 Þ T : By using Viterbi algorithm, four realizations are simulated for work area (Fig. 8). In order to compute the simulation accuracy of the SHMC method, another well, S75, is added in the middle of the section between wells 66 and 67 (Fig. 9). Note that S75 is a short notation for well 75. We have got 121 lithofacies samples in this well. The newly obtained log data can be used as validation sets. By comparing the estimated lithofacies in S75 1.5.5.95.9 Range:4 Sill:.7689.45.4.35 Range:4 Sill:.2213.45.4.35 Range:5 Sill:.15.3.3 p 11.85 p 12.25 p 13.25.2.2.8.15.15.75.1.1.5.5.7 2 4 6 8 1 12 14 16 18 2 Lag( 3 m) 2 4 6 8 1 12 14 16 18 2 Lag( 3 m) 2 4 6 8 1 12 14 16 18 2 Lag( 3 m).5 1.5.45.4.35 Range:4 Sill:.2876.95.9 Range:4 Sill:.6962.45.4.35 Range:5 Sill:.17.3.85.3 p 21.25 p 22.8 p 23.25.2.75.2.15.1.7.15.1.5.65.5 2 4 6 8 1 12 14 16 18 2 Lag( 3 m) 2 4 6 8 1 12 14 16 18 2 Lag( 3 m) 2 4 6 8 1 12 14 16 18 2 Lag( 3 m).5.5 1.45.4.35 Range:6 Sill:.2169.45.4.35 Range:5 Sill:.67.95 Range:5 Sill:.7782.3.3.9 p 31.25 p 32.25 p 33.2.15.1.5 2 4 6 8 1 12 14 16 18 2 Lag( 3 m).2.15.1.5 2 4 6 8 1 12 14 16 18 2 Lag( 3 m).85.8.75 2 4 6 8 1 12 14 16 18 2 Lag( 3 m) Fig. 7 Experimental transiograms and fitted models

18 J Petrol Explor Prod Technol (217) 7:11 22 Fig. 8 Four realizations implemented by SHMC. Mudstone: red, sandstone: yellow, conglomerate: blue Fig. 9 A section to be simulated across three wells 1 S67 S 75 S 66 9 8 7 z 6 5 4 3 2 1 2 4 6 8 1 x 12 14 16 18 2

J Petrol Explor Prod Technol (217) 7:11 22 19 Table 3 SHMC classification accuracy Simulation (a) Simulation (b) Simulation (c) Simulation (d) Average Mudstone 66/85 58/85 61/85 51/85 55/85 Sandstone 18/31 25/31 2/31 17/31 21/31 Conglomerate 2/5 /5 /5 1/5 1/5 Overall 88/121 83/121 81/121 69/121 77/121 (63.64 %) The value corresponding to the overall average classification accuracy is in bold location with the true facies, we obtain the classification accuracy, which can be defined as #correct classification Accuracy ¼ : #validation sets The average prediction accuracy is 63.64 % according to four stochastic simulation results (Table 3). Comparison analysis To better demonstrate the superiority of the SHMC method, a comparison study has also been conducted. We Table 4 Classification accuracy comparison Mudstone Sandstone Conglomerate Overall SMC Simulation 56/85 15/31 2/5 73/121 (a) Simulation 48/85 19/31 /5 67/121 (b) Simulation 63/85 11/31 3/5 77/121 (c) Simulation 51/85 21/31 /5 72/121 (d) Average 55/85 17/31 1/5 72/121 (59.5 %) SHMC Simulation 66/85 18/31 1/5 85/121 (a) Simulation 62/85 16/31 3/5 81/121 (b) Simulation 69/85 15/31 /5 84/121 (c) Simulation 61/85 22/31 2/5 85/121 (d) Average 58/85 19/31 1/5 78/121 (64.46 %) The values corresponding to the overall average classification accuracy are in bold partition this area into a 1 9 2 grid system, with each unit denoting a 1 m 9 4.5 m subsection (Liang 214). At first, there is no auxiliary information for integration. Thus, we use SMC defined by Eq. (8) for estimation of petroleum reservoir categorical variables. The simulation results obtained by three conditional wells have been shown in Fig. 1. It is obvious that conditional data have played a role in controlling the distribution of lithofacies near the wells. However, the further counterparts are fragmented and random in the grid. The average prediction accuracy is 59.5 % according to four stochastic simulation results (Table 4). For comparison, the SHMC method has been applied by adding seismic data. Through stratum calibration, time depth conversion, as well as wave impedance inversion, a seismic section across three wells can be obtained (Fig. 11). Using the impedance partition criterion, we can compute the emission matrix B. The entries in this matrix are Prðw s jf s Þ, which can be used in Eq. (7) to calculate the posterior conditional probability combining with Eq. (8). Simulation results have been shown in Fig. 12. The average prediction accuracy increases up to 64.46 % according to four stochastic simulation results (Table 4). Unlike Fig. 1, lithofacies distribution displays certain patterns in random results. More specifically, the distribution of sandstone is not continuous as a whole, with extension about 4 5 m in the horizontal direction. In addition, the section can be divided into three small layers from top to bottom. The middle layer, with the thickness of around 2 m, is twice as thick as the upper and lower ones. Each layer is stacked in space, not connected with each other. As the background lithofacies, mudstone exists widely in this area. Conglomerate, on the other hand, is not well developed due to petroleum geology condition. The simulation results and wave impedance inversion results have good correspondence, which demonstrates that the SHMC model is preferred in the estimation of petroleum reservoir categorical variables.

2 J Petrol Explor Prod Technol (217) 7:11 22 1 1 9 9 8 8 7 7 6 6 5 5 4 4 3 3 2 2 1 1 2 4 6 8 1 12 14 16 18 2 2 4 6 8 (a) 1 9 9 8 8 7 7 6 6 5 5 4 4 3 3 2 12 14 16 18 2 18 2 2 1 1 (b) 1 2 4 6 8 1 12 14 16 18 2 2 4 (c) 6 8 1 12 14 16 (d) Fig. 1 Four stochastic simulation results based on three wells Conclusions Fig. 11 A seismic section across three wells We have presented an SHMC model for geological facies modeling. This combines spatial Markov chain theory and Bayes estimation. We have adopted the specification of earlier published hidden Markov models. SHMC is based on neighborhood and cliques and has a solid theoretical foundation. Unlike SMC, SHMC integrates well data and geological conceptual data (sonic impedance) by using Viterbi algorithm. In our research, the sonic impedance is divided into two classes: strong and weak, which is regarded as observed variable. Experimental transiograms and fitted models are given according to 59 samples, which are used to compute prior conditional probabilities (transition probabilities). Compared with SMC based on

J Petrol Explor Prod Technol (217) 7:11 22 21 1 9 8 7 6 5 4 3 2 1 1 9 8 7 6 5 4 3 2 1 1 2 4 6 8 1 12 14 16 18 2 (a) 2 4 6 8 1 12 14 16 18 2 1 (b) 9 8 7 6 5 4 3 2 1 2 4 6 8 1 12 14 16 18 2 (c) 9 8 7 6 5 4 3 2 1 2 4 6 8 1 12 14 16 18 2 (d) Fig. 12 Four stochastic simulation results based on three wells and sonic impedance well data, the SHMC method performs superiority both in prediction accuracy and reflecting the geological sedimentation process by integrating auxiliary information. Acknowledgments This study is sponsored by the Fundamental Research Funds for the Central Universities of Central South University (No. 216zzts11) and National Science and Technology Major Project of China (No. 211ZX52-5-6). The authors are indebted to Dr. Kan Wu and Dr. Dongdong Chen for their valuable help on transiograms fitting and three-dimensional stochastic simulation. Finally, the authors gratefully thank the editor-in-chief and two anonymous reviewers for their constructive comments and suggestions, which have profoundly improved the composition of this manuscript. Open Access This article is distributed under the terms of the Creative Commons Attribution 4. International License (http:// creativecommons.org/licenses/by/4./), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. References Arinkoola AO, Onuh HM, Ogbe DO (215) Quantifying uncertainty in infill well placement using numerical simulation and experimental design: case study. J Petrol Explor Prod Technol 8:1 15 Blake A, Kohli P, Rother C (211) Markov random fields for vision and image processing. The MIT Press, Cambridge, pp 11 22 Carle SF, Fogg GE (1997) Modeling spatial variability with one and multidimensional continuous-lag Markov chains. Math Geol 29(7):891 918 Eidsvik J, Mukerji T, Switzer P (24) Estimation of geological attributes from a well log: an application of hidden Markov chains. Math Geol 36(3):379 397 Elfeki A, Dekking M (21) A Markov chain model for subsurface characterization: theory and applications. Math Geol 33(5):569 589 Huang X, Wang Z, Guo J (216a) Theoretical generalization of Markov chain random field from potential function perspective. J Cent South Univ 23(1):189 2 Huang X, Wang Z, Guo J (216b) Prediction of categorical spatial data via Bayesian updating. Int J Geogr Inf Sci 3(7):1426 1449 Kolbjørnsen O, Stien M, Kjønsberg H, Fjellvoll B, Abrahamsen P (214) Using multiple grids in Markov mesh facies modeling. Math Geosci 46(2):25 225 Li W (26) Transiogram: a spatial relationship measure for categorical data. Int J Geogr Inf Sci 2(6):693 699 Li W (27) Markov chain random fields for estimation of categorical variables. Math Geol 39(3):321 335 Li J, Xiong L, Fang S, Tang L, Huo H (21) Lithology stochastic simulation based on Markov chain models integrated with multiscale data. Acta Pet Sin 31(1):73 77 (in Chinese) Li J, Yang X, Zhang X, Xiong L (212) Lithologic stochastic simulation based on the three-dimensional Markov chain model. Acta Pet Sin 33(5):846 853 (in Chinese) Li W, Zhang C, Willig MR, Dey DK, Wang G, You L (215) Bayesian Markov chain random field cosimulation for improving land cover classification accuracy. Math Geosci 47(2): 148 Liang Y (214) Stochastic simulation of reservoir lithofacies based on the bidirectional Markov chain model. Master s Thesis, Central South University, Changsha, China

22 J Petrol Explor Prod Technol (217) 7:11 22 Ma Y, Jones TA (21) Teacher s aide: modeling hole-effect variograms of lithology-indicator variables. Math Geol 33(5):631 648 Pickard DK (198) Unilateral Markov fields. Adv Appl Probab 12(3):655 671 Salomão MC, Remacre AZ (21) The use of discrete Markov random fields in reservoir characterization. J Petrol Sci Eng 32(s 2 4):257 264 Stien M, Kolbjørnsen O (211) Facies modeling using a Markov mesh model specification. Math Geosci 43(43):611 624 Tjelmeland H, Besag J (1998) Markov random fields with higherorder interactions. Scand J Stat 25(25):415 433 Weissmann GS, Fogg GE (1999) Multi-scale alluvial fan heterogeneity modeled with transition probability geostatistics in a sequence stratigraphic framework. J Hydrol 226(1):48 65