EP4198258A1 - Multiperiod optimization of oil and/or gas production - Google Patents

Multiperiod optimization of oil and/or gas production

Info

Publication number
EP4198258A1
Authority
EP
European Patent Office
Prior art keywords
reservoir
state
controls
time
production
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
EP21306844.8A
Other languages
German (de)
French (fr)
Inventor
Cyrille VESSAIRE
Alejandro Rodriguez Martinez
Jean-Philippe CHANCELIER
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
TotalEnergies Onetech SAS
Original Assignee
TotalEnergies Onetech SAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by TotalEnergies Onetech SAS
Priority to EP21306844.8A (patent EP4198258A1)
Priority to US18/046,410 (patent US20230195145A1)
Publication of EP4198258A1

Classifications

    • G PHYSICS
    • G05 CONTROLLING; REGULATING
    • G05D SYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
    • G05D7/00 Control of flow
    • G05D7/06 Control of flow characterised by the use of electric means
    • G05D7/0617 Control of flow characterised by the use of electric means specially adapted for fluid materials
    • G05D7/0629 Control of flow characterised by the use of electric means specially adapted for fluid materials characterised by the type of regulator means
    • G05D7/0676 Control of flow characterised by the use of electric means specially adapted for fluid materials characterised by the type of regulator means by action on flow sources
    • E FIXED CONSTRUCTIONS
    • E21 EARTH DRILLING; MINING
    • E21B EARTH DRILLING, e.g. DEEP DRILLING; OBTAINING OIL, GAS, WATER, SOLUBLE OR MELTABLE MATERIALS OR A SLURRY OF MINERALS FROM WELLS
    • E21B43/00 Methods or apparatus for obtaining oil, gas, water, soluble or meltable materials or a slurry of minerals from wells
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F30/00 Computer-aided design [CAD]
    • G06F30/10 Geometric CAD
    • G06F30/17 Mechanical parametric or variational design
    • E FIXED CONSTRUCTIONS
    • E21 EARTH DRILLING; MINING
    • E21B EARTH DRILLING, e.g. DEEP DRILLING; OBTAINING OIL, GAS, WATER, SOLUBLE OR MELTABLE MATERIALS OR A SLURRY OF MINERALS FROM WELLS
    • E21B2200/00 Special features related to earth drilling for obtaining oil, gas or water
    • E21B2200/20 Computer models or simulations, e.g. for reservoirs under production, drill bits

Definitions

  • the disclosure relates to the field of computer programs and systems, and more specifically to a method, system and program for multiperiod optimization of oil and/or gas production.
  • Oil and gas production projects usually span over several decades and involve complex planning and decision making.
  • the lifetime of a hydrocarbon field is usually decomposed in five phases: exploration, where reservoirs containing hydrocarbon are found; appraisal, to give a value to a field; development, where infrastructure is planned and installed; production, where hydrocarbon is finally produced; abandonment, where the field stops producing and the infrastructures are decommissioned and removed.
  • An increasing concern is to improve the oil and/or gas production, and thus to optimize it.
  • the method comprises providing a controlled dynamical system.
  • the controlled dynamical system describes the evolution over time of a state of an oil and/or gas reservoir.
  • the method further comprises providing a time-dependent admissible set of controls.
  • the controls describe actions respecting constraints for controlling oil and/or gas flow and/or pressure.
  • the method further comprises providing time-dependent observations of the content of the reservoir.
  • the method further comprises optimizing, with respect to the state of the reservoir, the controls and the observations, an expected value over a given time span of an objective production function of the state, the controls and the observations.
  • the method may comprise one or more of the following:
  • a computer system comprising a processor coupled to a memory, the memory having recorded thereon the computer program.
  • the method forms an improved solution for oil and/or gas production optimization.
  • the method performs multiperiod optimization of oil and/or gas production, i.e. it allows optimizing the production of oil and/or gas over a given time span that comprises several time periods.
  • the method optimizes an expected value of an objective production function over a given time span (i.e. that encompasses several time periods, e.g. several years or months forming a production phase or at least a part thereof) with respect to time-evolving variables of the function which are the state of the underlying reservoir, observations of the content of the reservoir, and admissible controls that describe actions respecting constraints for controlling oil and/or gas flow and/or pressure.
  • the method describes the time-evolution of the state of the reservoir as a controlled dynamical system such that the time evolution of the state variable accounts for the controls and the observations. This improves robustness of the optimization and enables multiperiod optimization with high accuracy.
  • the output of the optimization is the expected value of the objective function optimized over the given time span with respect to the state of the reservoir, the controls and the observations.
  • This value represents an objective oil and/or gas production value over the given time span and allows taking real-time decisions and/or actions for oil and/or gas production by exploiting a real-world reservoir.
  • the method may further comprise displaying the optimized value.
  • the method may be performed several times, each execution of the method yielding a respective output optimized value, and the method may then further comprise displaying a graph representing the respective optimized values (e.g. for different reservoir configurations) and/or performing statistics on the optimized values, e.g. to take real-time decisions and/or actions for oil and/or gas production by exploiting a real-world reservoir.
  • the controls obtained as a result of the optimization are policies, i.e. functions of the observations that may be then used in real-time with real-world observations.
  • the method may be included in an oil and/or gas production process (e.g. for a single reservoir or for several reservoirs connected to one another) which may comprise:
  • the method is for multiperiod optimization of production of oil and/or gas from the reservoir.
  • the method thus optimizes the production over the given time span that encompasses several periods, e.g. several years or months of production, e.g. several decades of production.
  • the method comprises providing a controlled dynamical system.
  • the controlled dynamical system describes the evolution over time of a state of an oil and/or gas reservoir, where the time evolution of the state depends on the current state and controls.
  • the dynamical system describes how the state of the reservoir evolves over time given the controls.
  • the state may be a vector comprising one or more variables each representing a physical quantity describing a property of the reservoir.
  • the one or more variables are time-evolving variables, and may comprise any one or any combination of ( e.g.
  • the dynamical system is controlled, which means that the state variable at a given time depends on the time-dependent controls, e.g. at previous time.
  • the time-dependent state and/or controls may further depend on the time-dependent observations.
  • the providing of the controlled dynamical system may comprise establishing the controlled dynamical system, e.g. by deriving the equations thereof.
  • the controlled dynamical system may comprise evolution equations derived from material balance equations and/or black oil models.
  • deriving the controlled dynamical system may comprise deriving the controlled dynamical system from material balance equations and/or black oil models.
  • the component variables x (1) , x (2) , x (3) , x (4) , x (5) of the state x may respectively be the time-evolving amount of oil in the reservoir, the time-evolving amount of free gas in the reservoir, the time-evolving amount of water in the reservoir, the time-evolving total pore volume of the reservoir, and the time-evolving reservoir pressure.
  • $\Phi$ may be a general production function which may comprise, as coordinates, the production of oil $\Phi^{(1)}$, the production of free gas $\Phi^{(2)}$, and the production or injection of water $\Phi^{(3)}$.
  • $\Phi(x, u) = \tilde{\Phi}(h(x), u)$
  • $\tilde{\Phi}$ is a vector function with three coordinates representing respectively the production of oil as a function of $(h(x), u)$, the production of free gas as a function of $(h(x), u)$, and the production or injection of water as a function of $(h(x), u)$, where $h(x)$ is an observation function that takes as input $x$ and that outputs the observation of the content of the reservoir corresponding to $x$.
  • the method further comprises providing a time-dependent admissible set of controls.
  • the controls describe actions respecting constraints for controlling oil and/or gas flow and/or pressure.
  • the controls may include opening or closing a valve and/or a pipe, and/or choosing a well-head pressure and/or a bottom-hole pressure.
  • the time-dependent admissible set of controls may be a mapping which at each given time, takes as input the state of the reservoir at the given time and returns the set of controls that are allowable for this state.
  • the set of allowable controls may depend on the reservoir pressure, which constrains the different pressures in the production system. Additionally or alternatively, the set of allowable controls may depend on the production network: for example, some pipes can be controlled while others cannot, and/or maintenance forces facilities to be closed at different periods.
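As an illustration only, the admissible set can be sketched as a function that maps a time step and a reservoir pressure to a list of allowed controls. The names `Control`, `admissible_controls`, `candidate_pressures` and `maintenance_steps`, and the rule that a closed pipe is always allowed, are assumptions made for this sketch and do not come from the disclosure.

```python
from dataclasses import dataclass
from typing import Iterable, List, Set


@dataclass(frozen=True)
class Control:
    """Hypothetical control: a bottom-hole pressure and a pipe status."""
    bottom_hole_pressure: float
    pipe_open: bool


def admissible_controls(
    t: int,
    reservoir_pressure: float,
    candidate_pressures: Iterable[float],
    maintenance_steps: Set[int],
) -> List[Control]:
    """Return the controls allowed at time step t for the given reservoir pressure."""
    # Assumption: keeping the pipe closed is always an allowed action.
    closed = Control(bottom_hole_pressure=reservoir_pressure, pipe_open=False)
    if t in maintenance_steps:
        # Maintenance forces the facility to be closed at this time step.
        return [closed]
    allowed = [closed]
    for p in candidate_pressures:
        # The reservoir pressure bounds the pressures reachable in the system.
        if 0.0 <= p <= reservoir_pressure:
            allowed.append(Control(bottom_hole_pressure=p, pipe_open=True))
    return allowed


normal = admissible_controls(0, 100.0, [50.0, 150.0], maintenance_steps={3})
during_maintenance = admissible_controls(3, 100.0, [50.0, 150.0], maintenance_steps={3})
```

Here the candidate pressure 150.0 is rejected because it exceeds the reservoir pressure, and only the closed control remains at the maintenance step.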
  • the method further comprises providing time-dependent observations of the content of the reservoir.
  • the time-dependent observations consist of an observation function that takes as input, at each given time, the state of the reservoir at the given time, and in examples also the controls at the given time, and returns an observation at the given time of the content of the reservoir.
  • the observation may be a vector comprising ( e.g. consisting of) coordinates comprising any one or any combination of ( e.g. all of): reservoir pressure at the given time, the time-evolving water-cut at the given time ( e.g.
  • the gas-oil ratio at the given time e.g. as a function of the amount of free gas in the reservoir at the given time, of the total pore volume of the reservoir at the given time, and of the reservoir pressure at the given time).
  • the method may further comprise providing an initial value for the state of the reservoir (e.g. provided as a probability distribution).
  • the starting point may further comprise an initial value of the observations.
  • the method then comprises optimizing, with respect to the state of the reservoir, the controls and the observations, an expected value over a given time span of an objective production function of the state, the controls and the observations.
  • Optimizing with respect to the state of the reservoir, the controls and the observations means that the state, the controls and the observations are the free variables of the optimization.
  • the optimization thus searches for the values of these variables that tend to optimize ( e.g. minimize or maximize) the expected value over a given time span of the objective production function.
  • the optimization is constrained by constraints linking the state, the controls and the observations, the constraints being given by the controlled dynamical system (e.g. by the function f discussed above and the observation function).
  • the given time span may be a time-interval that encompasses several production periods, i.e.
  • Optimizing may comprise applying any suitable optimization algorithm. Optimizing may for example apply a multi-stage optimization method, e.g. using a Dynamic Programming algorithm, as discussed in implementations hereinbelow.
  • the objective production function may be any production function, such as any function that captures the oil and/or gas that can be produced, e.g. depending on material and/or cost constraints.
  • $\Phi$ is the general production function which has been previously discussed
  • $r_t^{\top}$ is a price vector for the production of each fluid (oil, gas and water)
  • c is a cost associated with the controls, such as a functioning cost of a pump which re-injects water in the reservoir
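  • the function f in the optimization problem may be the function $f : (x, u) \mapsto \left( x^{(1)} - \Phi^{(1)}(x, u),\; x^{(2)} - \Phi^{(2)}(x, u) + x^{(1)} R_s(x^{(5)}) - \left( x^{(1)} - \Phi^{(1)}(x, u) \right) R_s(\Psi(x, u)),\; x^{(3)} - \Phi^{(3)}(x, u),\; x^{(4)} \left( 1 + c_f \left( \Psi(x, u) - x^{(5)} \right) \right),\; \Psi(x, u) \right)$ given by equation (S) and which has been previously discussed.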
  • observations comprise partial observations.
  • the time-dependent observations represent time-dependent partial observations, i.e. the content of the reservoir is only partially observed by the observations. This allows performing the optimization even if the content of the reservoir is only partially observed, which is often the case in practice in oil and/or gas production.
  • the observations may depend only on the state of the reservoir, e.g. the mapping that yields the observations at a given time takes as input only the state at the given time, and thus does not directly account for the effects of the controls applied at the given time.
  • the component variables x (1) , x (2) , x (3) , x (4) , x (5) of the state x may respectively be the time-evolving amount of oil in the reservoir, the time-evolving amount of free gas in the reservoir, the time-evolving amount of water in the reservoir, the time-evolving total pore volume of the reservoir, and the time-evolving reservoir pressure, as previously-discussed.
  • the optimization may comprise solving an optimization problem that is a Deterministic Partially Observed Markov Decision Process (det-POMDP).
  • the optimization may comprise discretizing the optimization problem.
  • Discretizing the optimization problem may comprise providing a discrete control set and a discrete observation set and building a discrete state space by recursively applying the dynamics (i.e. the controlled dynamical system) on a given initial state with associated controls.
  • the discrete state space is the set of states reachable from the given initial state.
  • Discretizing the optimization problem may further comprise constructing a set of beliefs, which are probabilities on the discrete state space. A belief indicates a probability for a given state to be reached from the initial state.
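The construction above can be sketched as follows, under stated assumptions. The function names, the toy integer "stock" dynamics and the partial observation used in the usage lines are hypothetical. The belief update shown (push the belief through the deterministic dynamics, then condition on the observation) is the standard filtering step for a deterministic partially observed process, not a procedure quoted from the disclosure.

```python
from typing import Callable, Dict, Hashable, Iterable, Set

State = Hashable


def reachable_states(
    initial_state: State,
    controls: Iterable,
    dynamics: Callable,
    horizon: int,
) -> Set[State]:
    """Build the discrete state space by recursively applying the dynamics."""
    controls = list(controls)
    reached = {initial_state}
    frontier = {initial_state}
    for _ in range(horizon):
        # States reached for the first time at this step.
        frontier = {dynamics(x, u) for x in frontier for u in controls} - reached
        if not frontier:
            break
        reached |= frontier
    return reached


def update_belief(
    belief: Dict[State, float],
    control,
    observation,
    dynamics: Callable,
    observe: Callable,
) -> Dict[State, float]:
    """Push a belief through the dynamics, then condition on the observation."""
    pushed: Dict[State, float] = {}
    for x, p in belief.items():
        y = dynamics(x, control)
        pushed[y] = pushed.get(y, 0.0) + p
    consistent = {y: p for y, p in pushed.items() if observe(y) == observation}
    total = sum(consistent.values())
    if total == 0.0:
        raise ValueError("observation is incompatible with the belief")
    return {y: p / total for y, p in consistent.items()}


# Toy example (assumption): the state is an integer stock, the control a production.
def toy_dynamics(x, u):
    return max(x - u, 0)


states = reachable_states(3, [0, 1, 2], toy_dynamics, horizon=2)
new_belief = update_belief(
    {3: 0.5, 2: 0.5}, 1, True, toy_dynamics, observe=lambda x: x >= 2
)
```

In this toy case the observation only tells whether the stock is at least 2, so after one step the belief concentrates on the single compatible state.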
  • the Deterministic Partially Observed Markov Decision Process may in examples have monotonicity, such that the set of reachable beliefs is included in a subset of the probability space, for example a fan-like or comb-like subset.
  • Monotonicity means that the det-POMDP is such that, if two sequences of controls lead to the same state when starting in a given state, then applying the two sequences of controls to another state either leads to the same result (i.e. leads to the same state), or one sequence leads to a cemetery point.
  • the cemetery point is a point that may be added to the state space ( i.e. so that the discrete state space may comprise the cemetery point) and that represents an unreachable state when considering past and present observations.
  • Monotonicity of the det-POMDP thus saves computational time and computation resources for the optimization.
  • the controlled dynamical system that describes the reservoir's state (behavior) over time consists of a controlled dynamical system which gives the evolution over time of physical quantities which characterize the hydrocarbon field under exploitation.
  • the underlying equations are derived from material balance equations on the reservoir, under the hypothesis that the fluids contained in the reservoir follow a model known as the "black-oil model".
  • the optimization may solve an optimization problem over time for an oil and gas production system which may be formulated with a deterministic formulation, the optimization problem being governed by the controlled dynamical system.
  • the reservoir is part of a production system that consists of the reservoir and of production assets such as pipes, wells, chokes.
  • the different production assets are placed on a graph, with the pipes as the arcs and the rest of the assets, such as the well-heads, as the nodes.
  • FIG. 1 illustrates such a graph, where the wells' perforations are represented as nodes where the produced fluid enters the graph.
  • the implementations optimize the system over the whole production phase (i.e. over multiple years), so multiple time steps $t$ belonging to a finite set $\mathbb{T}$ are considered, where the final time $T$ is a positive integer.
  • the time steps may correspond to months or years.
  • the optimization problem may be formulated as follows
  • the variables are: the controls $u_t$, which are the decisions that can be taken at time step $t$ (in this case, the pressure $P_{v,t}$ at each vertex $v \in V$ of the graph, and the Boolean $o_{a,t}$ stating whether a pipe $a \in A$ of the graph is open or closed); and the state of the reservoir $x_t$, as the reservoir is defined as a controlled dynamical system with state $x_t \in \mathbb{X} \subset \mathbb{R}^n$ (with $\mathbb{X}$ the state space), control $u_t \in \mathbb{U}$, and an evolution function $f$.
  • In Equation (2.1a), the final state $x_T$ (the quantity of fluids remaining in the reservoir) is valued by a final-value function of $x_T$.
  • Equation (2.1b) defines the known initial state of the reservoir.
  • Equation (2.1c) gives the controlled dynamics of the reservoir.
  • Equation (2.1d) states that the allowed controls belong to an admissibility set, which is for each time step t a set-valued mapping which takes a given state x t of the reservoir and returns the set of allowed controls.
  • Admissibility notably depends on the reservoir pressure, which constrains the different pressures in the petroleum production system.
  • the admissibility set also depends on the production network itself: some pipes can be controlled, while others cannot; or maintenance forces facilities to be closed at different periods.
  • the implementations consider a deterministic controlled dynamical system. Note that, here, perfect knowledge of the content of the reservoir $x_t$ is assumed. Another formulation, with partial observation of the content of the reservoir, is discussed in later implementations. Since the state $x_t$ is known, the implementations may use dynamic programming to solve this problem.
  • the implementations compute the value functions by using a dynamic programming algorithm (Algorithm 1 below). For that purpose, the implementations discretize the controls, which now belong to a finite set, and the states, which also belong to a finite set. The implementations also consider that the value functions follow a multilinear interpolation between the states.
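A minimal sketch of such a backward induction is given below, assuming a one-dimensional state so that the multilinear interpolation reduces to linear interpolation. It is not Algorithm 1 itself: the function names, the toy dynamics, the toy production function and the prices are all assumptions made for illustration.

```python
from bisect import bisect_right
from typing import Callable, List, Sequence, Tuple


def interpolate(grid: Sequence[float], values: Sequence[float], x: float) -> float:
    """Linearly interpolate `values` given on the sorted `grid`, clamped at the ends."""
    if x <= grid[0]:
        return values[0]
    if x >= grid[-1]:
        return values[-1]
    i = bisect_right(grid, x) - 1
    w = (x - grid[i]) / (grid[i + 1] - grid[i])
    return (1.0 - w) * values[i] + w * values[i + 1]


def dynamic_programming(
    grid: Sequence[float],
    controls: Sequence[float],
    prices: Sequence[float],
    dynamics: Callable[[float, float], float],
    production: Callable[[float, float], float],
    final_value: Callable[[float], float],
) -> Tuple[List[List[float]], List[List[float]]]:
    """Backward induction on a discretized state grid (maximization)."""
    horizon = len(prices)
    value: List[List[float]] = [[] for _ in range(horizon + 1)]
    policy: List[List[float]] = [[] for _ in range(horizon)]
    value[horizon] = [final_value(x) for x in grid]
    for t in reversed(range(horizon)):
        for x in grid:
            best_u, best_v = controls[0], float("-inf")
            for u in controls:
                # Revenue at step t plus interpolated future value.
                v = prices[t] * production(x, u) + interpolate(
                    grid, value[t + 1], dynamics(x, u)
                )
                if v > best_v:
                    best_u, best_v = u, v
            value[t].append(best_v)
            policy[t].append(best_u)
    return value, policy


# Toy problem (assumption): stock in {0, 1, 2}, at most one unit produced per step.
def toy_production(x, u):
    return min(u, x)


def toy_dynamics(x, u):
    return x - min(u, x)


value, policy = dynamic_programming(
    grid=[0.0, 1.0, 2.0],
    controls=[0.0, 1.0],
    prices=[1.0, 3.0],
    dynamics=toy_dynamics,
    production=toy_production,
    final_value=lambda x: 0.0,
)
```

With a low price followed by a high price, the resulting policy keeps a single remaining unit for the second step and only produces at the first step when two units are available.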
  • the definition of the dynamical system is now discussed.
  • the first application is a gas reservoir that can be modeled with two tanks and with a connection of known transmissivity linking the two together. It illustrates how the formulation can be applied to complex cases with multiple tanks.
  • the second application considered is an oil reservoir where pressure is kept constant through water injection. This shows how injection may be taken into account to go beyond the primary recovery of oil and gas. All numerical applications were performed on a computer equipped with a Core i7-4700K and 16 GB of memory.
  • the first application considers a gas reservoir with production data that come from a field approaching abandonment. It is a subfield consisting of an isolated reservoir and one well, which is part of a larger field that is not considered here.
  • the good geology of this particular case makes it well suited to a tank model, as shown by many years of closely matched production. Also, the simplicity of the fluids, with a high methane purity, makes the black-oil model a very realistic assumption.
  • the reservoir can be modeled with either one or two tanks, while the well perforation is modeled with a known stationary inflow performance relationship, noted IPR.
  • the two tanks model is illustrated in FIG. 2 . The rest of the network is not considered, and only optimization at the bottom of the well is done, without considering any vertical lift performance necessary to lift oil to the surface.
  • the goal here is to show how simple cases can be tackled with the material balance formulation, and that the formulation can also be applied to cases with multiple reservoirs. The state reduction of this real case is presented first, followed by a model with one tank and then a model with two tanks.
  • $V_0^g = x_0$
  • $P_t^R = \Psi_{1T}(V_t^g), \quad \forall t \in \mathbb{T}$
  • In Equation (2.5c), the mapping $\Psi_{1T}$ is a function that can be algorithmically computed (as discussed hereinafter) and that takes the volume of gas in the reservoir and returns the reservoir pressure, which is used to compute the production.
  • the function $\Phi$ has been split into Equations (2.5c) and (2.5d). Since there is only one well and since the IPR is strictly monotonic, the production function of Equation (2.5d) is injective. In the models considered here (one tank or two tanks), one can thus pass from the controls to the production and from the production to the controls without any ambiguity. Moreover, one can define the admissibility set of this application.
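Passing from a production back to the control can be sketched with a bisection on a strictly decreasing function of the bottom-hole pressure. The linear inflow relationship used below (production proportional to the difference between reservoir and bottom-hole pressure) is an assumption chosen for illustration; the disclosure only states that the IPR is known, stationary and strictly monotonic.

```python
from typing import Callable


def invert_monotone(
    func: Callable[[float], float],
    target: float,
    low: float,
    high: float,
    tol: float = 1e-9,
) -> float:
    """Solve func(p) = target for a strictly decreasing func on [low, high]."""
    if not (func(high) <= target <= func(low)):
        raise ValueError("target is outside the range of func on [low, high]")
    while high - low > tol:
        mid = 0.5 * (low + high)
        if func(mid) > target:
            # Production still too high: the pressure must be raised.
            low = mid
        else:
            high = mid
    return 0.5 * (low + high)


# Hypothetical linear IPR: productivity index 2.0, reservoir pressure 100.0.
def toy_ipr(bottom_hole_pressure: float) -> float:
    return 2.0 * (100.0 - bottom_hole_pressure)


pressure = invert_monotone(toy_ipr, target=60.0, low=0.0, high=100.0)
```

Because the function is strictly monotonic, the bottom-hole pressure recovered for a given production is unique, which is what makes the controls and the production interchangeable in this application.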
  • the graph has only one point (one well), and the bottom-hole pressure P t is controlled.
  • the implementations use production data from a sector of a real gas field to check that the reservoir model described with the Constraints (2.5c) and (2.5e) after fitting accurately follows real measurements on the gas field. More precisely, the implementations apply a given real production schedule on a part of the field (only one well), and check that the pressure we simulate in the reservoir is close to the measured pressure of that reservoir.
  • the historical production spans 15 years with monthly values, which is why monthly time steps are considered for Problem (2.5).
  • FIG. 3 shows a comparison of the simulated one tank reservoir pressure to the historical measured pressure when applying the same (historical) production schedule.
  • the curve is the simulated pressure in the tank, whereas the dots are the measured pressures.
  • the one tank model fits the observations overall. However, there is a gap between the simulated and measured pressures of more than 10%. Since the simulated pressure tends to be higher in the first half of the production, the implementations start by underestimating the decline of the production. Then, during the second half of the production, the predicted pressure is lower than the measured pressure, which means the implementations overestimate the decline of the production. This elastic effect is most likely due to the simplification of removing the secondary tank from the model. Indeed, the secondary tank acts as a buffer which reacts slowly, explaining the extra pressure at the beginning and then sustaining a better value of the pressure later on.
  • the implementations use dynamic programming (Algorithm 1) to get an optimal production policy.
  • the implementations consider that the revenue per volume of gas is the historical gas spot price of TTF (Netherlands gas market) from 2006 to 2020, and the implementations do not consider any operational cost.
  • FIG. 4 illustrates the evolution of the content of the reservoir in the one tank model.
  • the dotted curve is the optimal trajectory of the amount of gas, while the full-line curve is the trajectory with the historical production.
  • FIG. 5 illustrates the trajectory of the production.
  • the dotted curve is the optimal production in the one tank model, the full-line curve is the historical production, whereas the dashed curve is the average monthly gas price.
  • FIG. 5 shows that the production stops when prices are low, as the optimal policy takes full advantage of the perfect knowledge of future prices. There is a massive increase in the total gains when using the optimal policy, compared to the real production.
  • this gas reservoir application still illustrates a best-case scenario for the dynamic programming approach, and shows how much could be gained from using a multistage material balance formulation.
  • the implementations generate the decline curve, g, in inequality (2.6b) of the formulation by computing the maximal production value for the same discrete states as the ones used in the dynamic programming approach. The implementations then interpolate the value of g between the different states.
  • the maximization problem (2.6) turns out to be a MIP (Mixed Integer Problem) with linear constraints and with 170829 binary variables when not using SOS2 variables.
  • the implementations solve that MIP by using a commercial solver, Gurobi 9.1.
  • The results are given in Table 2.3. Since the material balance formulation (2.5) uses a one-dimensional state, the implementations obtain similar results between the material balance formulation and the formulation using a decline curve, in accordance with Proposition 3.
  • Fitting data: the implementations check whether the fitted two tanks reservoir model accurately follows real measurements on the gas field. The implementations use the same data as in the one tank case.
  • the two tanks model more accurately fits the observations, as is depicted in FIG. 6 (we have a gap of less than 5% for each measured point).
  • FIG. 6 shows a comparison of the simulated two tanks reservoir pressure to the measured pressure when applying the same production schedule. The curve is the simulated pressure in the first tank, whereas the dots are the measured pressure at the bottom of the well. Since the two tanks model is closer to the observations, it is considered that it is the reference of truth when comparing results of the one tank approximation and the two tanks model.
  • FIG. 7 illustrates the evolution of the content of the reservoirs when applying the optimal policy in the two tanks model.
  • the dotted curve shows the content of the first tank (linked to the well) while the full-line curve shows the content of the second tank.
  • FIG. 8 illustrates the trajectory of the optimal production in the two tanks model.
  • the dotted curve is the optimal production, whereas the dashed curve is the monthly gas price
  • To create an admissible production planning from the one tank optimization, the implementations first consider that the control policy is static. One thus has a series of controls $u_t^{\sharp}$ computed with the one tank model. To make it admissible, the implementations project those controls on the admissibility set of the two tanks model, which depends on the state. Let $\tilde{u}_t^{\sharp}$ be the projection of the controls, and $\tilde{x}_t^{\sharp}$ the states associated with that projected series. Since $u^{\sharp}$ is admissible for the one tank model, those controls notably verify that $u_t^{\sharp} \geq 0$. One only needs to check that $u_t^{\sharp}$ is lower than the first tank pressure.
  • This transformation of the one tank model controls makes them admissible in the two tanks optimization problem, as one now has $0 \leq \tilde{u}_t^{\sharp} \leq \tilde{P}_t^{R,1}$.
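The projection can be sketched as a forward pass that clips each one tank control to the interval between zero and the current first tank pressure, and then advances the two tanks state with the clipped control. The function names and the toy pressure dynamics in the usage lines are assumptions; the actual two tanks dynamics are not reproduced here.

```python
from typing import Callable, List, Sequence, TypeVar

S = TypeVar("S")


def project_plan(
    controls: Sequence[float],
    initial_state: S,
    first_tank_pressure: Callable[[S], float],
    step: Callable[[S, float], S],
) -> List[float]:
    """Project a static series of controls onto the state-dependent admissible set."""
    state = initial_state
    projected: List[float] = []
    for u in controls:
        upper = first_tank_pressure(state)
        # Enforce 0 <= projected control <= first tank pressure.
        u_proj = min(max(u, 0.0), upper)
        projected.append(u_proj)
        # The next admissible set depends on the state reached with u_proj.
        state = step(state, u_proj)
    return projected


# Toy model (assumption): the state is the first tank pressure itself,
# which moves a tenth of the way toward the applied bottom-hole pressure.
def toy_step(pressure: float, u: float) -> float:
    return pressure - 0.1 * (pressure - u)


plan = project_plan(
    controls=[120.0, 50.0, 96.0],
    initial_state=100.0,
    first_tank_pressure=lambda pressure: pressure,
    step=toy_step,
)
```

The pass has to be sequential because the upper bound at each time step depends on the state produced by the previously projected controls.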
  • FIG. 9 illustrates the cumulated gains with the two tanks model as reference.
  • the dotted curve is the cumulated gains of the one tank planning in the one tank model
  • the full-line curve is cumulated gains of the two tanks planning in the two tanks model
  • the dashed curve is the cumulated gains of the one tank planning translated for the two tanks model.
  • FIG. 10 illustrates a comparison of the trajectory of the production with the two tanks model as reference.
  • the dotted curve is the production planning in the one tank model
  • the full-line curve is for the two tanks model.
  • the dashed curve is the production planning of the one tank model translated in the two tanks model. As depicted in FIGS. 9 and 10, the production planning given by the one tank optimization problem differs from the production planning given by the two tanks optimization problem.
  • the production planning of the one tank model gives lower gains than anticipated, and is worse than the optimal two tanks model planning.
  • the one tank optimization is thus optimistic on the optimal value of the problem when applied in the reference model.
  • there is a 5% difference in value between the one tank and two tanks model (a value of 703M € for the translated one tank production planning against a 736M € for the two tanks production planning). This discrepancy illustrates how having a more accurate model of the reservoir can have a substantial impact on the optimal planning, all other things being equal.
  • the second application is an oil reservoir with water injection.
  • the goal is to demonstrate how the formulation can be used beyond primary recovery cases, on a numerically simple case. It is considered that there is one reservoir which contains both oil and water, produced under pressure maintenance by water injection. Moreover, it is considered that the initial pressure is above the bubble point, which eliminates the possibility of having free gas in the reservoir. This once again yields a one-dimensional state: either the water (which is used for the numerical applications), or the oil in the reservoir.
  • the optimization problem (2.1) now reduces to s . t .
  • the optimal policy yields more production when prices are high, and stops producing when they are low.
  • the production also does not fully deplete the reservoir, which means that it is not advantageous to completely deplete the reservoir if one wants to maximize the profit over the optimization time frame. Indeed, production slowly diminishes with the "stock" of oil in the reservoir. It is more advantageous to wait for high prices before producing, as producing reduces the possible future production. This leads to letting the reservoir keep some residual oil, as it is preferred to wait for a higher price instead of producing when prices are low. As a side effect, numerical experiments reveal that the initial value function is almost linear. However, only simple constraints on the production are considered here. As more constraints are added to the problem, other behaviors will certainly appear. The CPU time was 1,575 s for a state discretization of 100,000 points, with a value of 3,376 M€. The impact of the discretization is discussed later.
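The qualitative behavior described above (holding production back when prices are low) can be reproduced on a toy problem. The sketch below is not the material balance formulation of the disclosure: it assumes a hypothetical integer stock, a hypothetical rate limit `max_rate(x)` that shrinks with the stock, prices known in advance and a zero final value, and solves the resulting deterministic problem by backward dynamic programming.

```python
def plan_production(prices, stock0, max_rate):
    """Backward dynamic programming on a toy integer stock (illustrative only)."""
    value = [0.0] * (stock0 + 1)          # value after the last step (assumed zero)
    policies = []
    for price in reversed(prices):
        new_value, policy = [], []
        for x in range(stock0 + 1):
            admissible = range(min(x, max_rate(x)) + 1)
            # Trade immediate revenue against the value of the remaining stock.
            u = max(admissible, key=lambda q: price * q + value[x - q])
            policy.append(u)
            new_value.append(price * u + value[x - u])
        value = new_value
        policies.append(policy)
    policies.reverse()                    # policies[t][x] is the decision at time t
    # Forward pass: apply the policy starting from the initial stock.
    plan, x = [], stock0
    for policy in policies:
        plan.append(policy[x])
        x -= policy[x]
    return value[stock0], plan

# Hypothetical data: three time steps, the middle one with a high price.
total, plan = plan_production(prices=[1, 5, 1], stock0=4,
                              max_rate=lambda x: (x + 1) // 2)
```

With these illustrative inputs, the plan produces one unit at the first step although two are allowed, keeping stock for the high-price step.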
  • this material balance formulation gets better results than formulations based on decline curves.
  • the dynamic programming algorithm can be used in a stochastic framework.
  • the material balance formulation is amenable to tackle uncertainties on the prices, instead of assuming that prices are known in advance. This will render the optimization process more realistic, as an optimal production policy is highly dependent on prices.
  • the black-oil model relies on the assumption that there are at most three fluids in the reservoir: oil, gas and water.
  • the fluids can be in up to two phases in the reservoir: a liquid phase, and a gaseous phase.
  • in the liquid phase, one can have a mix of oil with dissolved gas, and water.
  • in the gaseous phase, one can only have free gas. This can be seen in FIG. 11, which is a representation of a reservoir in the black-oil model.
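  • $V_{t+1}^{o} = V_t^{o} - F_t^{o}, \quad \forall t \in \mathcal{T}.$
  • $V_{t+1}^{w} = V_t^{w} - F_t^{w}, \quad \forall t \in \mathcal{T}.$
  • $V_{t+1}^{g} = V_t^{g} - F_t^{g} + V_t^{o} \left( R_s(P_t^{R}) - R_s(P_{t+1}^{R}) \right) + F_t^{o} \, R_s(P_{t+1}^{R}), \quad \forall t \in \mathcal{T}.$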
  • the saturations of the fluids are the proportions of the available volume taken by each fluid in the reservoir.
  • $S_o$, $S_g$ and $S_w$ are the saturations of oil, free gas and water.
  • Equation (2.17) is decreasing with the new reservoir pressure $P_{t+1}^{R}$.
  • the volume gained by the oil when the gas dissolves into oil due to an increase in pressure $\Delta P$ is lower than the aggregated decrease of volume of the free gas and the other fluids due to that same $\Delta P$.
  • the right-hand side is increasing with the reservoir pressure.
  • When the PVT functions ($B_o$, $B_g$, $B_w$, $R_s$) are considered piecewise linear, the function $\Xi$ can be computed efficiently (according to Algorithm 2 discussed below). Combining Equations (2.9), (2.10), (2.12), (2.13) and using function $\Xi$, the expression of function $f$ of Equation (2.3) follows.
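Since the text states that the pressure equation is monotone in the new reservoir pressure, its root is unique and can be bracketed. The sketch below illustrates that idea with plain bisection on a piecewise-linear table. It is not the Algorithm 2 of the disclosure: the helper names, the tabulated values and the residual are all hypothetical stand-ins, because the exact form of Equation (2.17) is not reproduced here.

```python
from bisect import bisect_right

def piecewise_linear(xs, ys):
    """Return a function interpolating linearly between the points (xs[i], ys[i])."""
    def f(x):
        # Clamp to the tabulated range, then interpolate on the enclosing segment.
        if x <= xs[0]:
            return ys[0]
        if x >= xs[-1]:
            return ys[-1]
        i = bisect_right(xs, x) - 1
        w = (x - xs[i]) / (xs[i + 1] - xs[i])
        return ys[i] + w * (ys[i + 1] - ys[i])
    return f

def solve_decreasing(residual, lo, hi, tol=1e-9):
    """Bisection for the root of a continuous, decreasing residual on [lo, hi]."""
    if residual(lo) < 0 or residual(hi) > 0:
        raise ValueError("root is not bracketed")
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if residual(mid) > 0:
            lo = mid   # residual still positive: the root lies at a higher pressure
        else:
            hi = mid
    return 0.5 * (lo + hi)

# Hypothetical tabulated quantity, increasing with pressure (illustrative values).
rhs = piecewise_linear([100.0, 200.0, 300.0], [1.0, 2.0, 4.0])
# Hypothetical residual: a fixed target minus the increasing term, hence decreasing.
new_pressure = solve_decreasing(lambda p: 3.0 - rhs(p), 100.0, 300.0)
```

With piecewise-linear tables the root could also be located segment by segment in closed form; bisection is used here only because it needs nothing beyond monotonicity.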
  • V t o ⁇ P R ⁇ P t B o 1 ⁇ w ct V t w
  • Equation (2.7c) By combining Equations (2.10), (2.23) and (2.24), one obtains the dynamics shown in Equation (2.7c).
  • Table 2.6 Summary of the impact of the discretization of the state space on the one tank formulation, with 50 possible controls
      State discretization | Value (M€) | CPU time (s)
      100                  | 602        | 1.25
      200                  | 689        | 1.45
      500                  | 725        | 2.5
      1000                 | 736        | 7.5
      2000                 | 740        | 25.2
      5000                 | 742        | 110
      10000                | 743        | 653
      20000                | 743        | 2288
      50000                | 743        | 8142
  • Table 2.7 Impact of the discretization of the state space on the two tanks model, with 50 possible controls
      State discretization | CPU time (s) | Value (M€)
      50 × 50              | 5.1          | 730
      100 × 100            | 28.3         | 735
      200 × 200            | 115.3        | 736
      400 × 400            | 706          | 736
      600 × 600            | 3893         | 736
      1000 × 1000          | 18089        | 736
  • Equation (2.29) defines the function g .
  • the equivalence exists when the state is reduced to one dimension (as similar reasoning can be applied to the other one-dimensional cases).
  • POMDP Partially Observed Markov Decision Process
  • det-POMDP Deterministic Partially Observed Markov Decision Process
  • uncertainties are only present in the initial state of the system. That is, the transitions from one state to another are deterministic, as are the observations mappings that give the observations knowing the current state and the current control of the system.
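  • Let $(\Omega, \mathcal{F}, \mathbb{P})$ be a probability space, where $\Omega$ is the set of possible outcomes, $\mathcal{F}$ is the associated $\sigma$-field and $\mathbb{P}$ is a probability measure.
  • $X_t : \Omega \to \mathbb{X}$, $U_t : \Omega \to \mathbb{U}$ and $O_t : \Omega \to \mathbb{O}$ are random variables representing respectively the state, the controls and the observation variables of the system at time $t$.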
  • the decision at time t is taken knowing the history of controls and observation up to time t.
  • the control $U_t$ is a function of the past controls and of the observations up to time $t$, which means that $U_t$ has to be measurable with respect to the $\sigma$-field generated by $(O_0, \dots, O_t, U_0, \dots, U_{t-1})$.
  • This non-anticipativity constraint is written as: $\sigma(U_t) \subset \sigma(O_0, \dots, O_t, U_0, \dots, U_{t-1}), \quad \forall t \in \mathcal{T}.$
  • the cost incurred at each time $t \in \mathcal{T} \setminus \{T\}$ is given by the function $L_t$ of the state and controls
  • the final cost (the cost at time $T$) is given by the function $K$ of the final state $X_T$.
  • a det-POMDP is a POMDP with the following characteristics:
  • det-POMDP are POMDP. This means that all the results and numerical methods that apply to POMDP can be carried over to det-POMDP.
  • a Dynamic Programming equation can be written on det-POMDP.
  • state dynamics and observations mappings are "deterministic". One derives bounds on the cardinality of reachable spaces, which leads to bounds on the number of operations needed to solve the Problem (3.1).
  • Equation (3.4) is well-defined. The issue would be a division by 0 if
  • a reachable belief b ⁇ B t R b 0 , t > 0, is thus an element of ⁇ X such that there exists u 0: t -1 , o 1: t which verify
  • Equation (3.4) Given y ⁇ X and using the definition of ⁇ t in Equation (3.4), we obtain that ⁇ t ( b , u , o )( y ) ⁇ 0 implies that y must be in ⁇ u , o t + 1 and that there must exist x ⁇ ⁇ t u ⁇ 1 y such that b ( x ) ⁇ 0 which gives Then, we successively have that Using the last inclusion in Equation (3.18), and the fact that the left-hand side of the inclusion is a union composed of disjoints sets (as given by the preliminary fact) we obtain that which gives Equation (3.15). Then, Equation (3.16) easily follows. Let u ⁇ U and let o ⁇ O . We have We hence get Equation (3.16).
  • Lemma 11 We have the following bounds on the union on the sets of reachable beliefs for det-POMDP:
  • the second bound can be obtained by recurrence thanks to Proposition 10.
  • Equation (3.15) we have: ⁇ b ⁇ B u + b 0 supp b ⁇ supp b 0 , and: ⁇ b ⁇ B u 0 : t + b 0 supp b ⁇ ⁇ b ′ ⁇ B u 0 : t ⁇ 1 b 0 + ⁇ b ⁇ ⁇ t b ′ , u t , O d supp b ⁇ ⁇ b ′ ⁇ B u 0 : t ⁇ 1 + b 0 supp b ′ .
  • det-POMDP mon-det-POMDP, which may also be referred to as "Well Separated det-POMDP”.
  • a set T satisfying the just described property is called a Monotonous Function Set.
  • a det-POMDP which is such that the set of functions defined in Equations (3.10) and (3.14) is a Monotonous Function Set is called a mon-det-POMDP.
  • the cardinality of the reachable belief space of a mon-det-POMDP is bounded by the cardinality of the states space and the support of the initial belief:
  • mappings T ⁇ T such that ⁇ ⁇ Im T .
  • the mappings that satisfy Equation (3.22) are derived from the family T by choosing an element T ⁇ T and choosing if one keeps the value T x for a given element x ⁇ X , or sending it to the cemetery.
  • One hence gets the bound U t 1 T B t R b 0 ⁇ 2 X X .
  • the reachable belief space is hence bounded by the number of mapping .
  • the set of mapping must verify Equation (3.22).
  • This example illustrates a situation with partial observations, which is to empty a bathtub while minimizing an associated cost.
  • the state x t is one dimensional and consists in the volume of water in the tub, and the control u t is also one dimensional and is the amount of water that the decision maker decides to remove during time step t.
  • the state is partially observed, and the decision maker has access at time t to o t which is smaller than the unobserved state x t .
  • Equation (3.25a) is the objective function of the bathtub problem, i.e. the implementation of Equation (3.1a) of Problem 3.1.
  • the bathtub problem as a mon-det-POMDP is clearly a mon-det-POMDP: Let x 1 be a state, u 1: t and u 1 : v ′ be two sequences of controls and let o 1: t and o 1 : v ′ , be two sequences of observations such that .
  • F w is the total amount of water that has been removed from the bathtub when applying the two sequences of controls.
  • the bathtub thus verifies Equation (3.22), and is thus a mon-det-POMDP.
  • the information vector, denoted by $I_t$, contains all the information the optimizer has access to at time $t$.
  • a policy $\pi$ is hence considered admissible if $\forall t \in \mathcal{T}, \ \forall I_t, \ \pi_t(I_t) \in U^{ad}(I_t)$.
  • the formulation using information vector may be solved with a Dynamic Programming algorithm, and has the same optimal value as Problem (3.1).
  • * be the optimal value functions
  • ⁇ * be the optimal policy obtained through Dynamic Programming algorithm.
  • the presently-discussed implementations consider a petroleum production system, with at least one reservoir from which the hydrocarbon resources (which are considered to be fluids which follow a black oil model) are extracted.
  • the production system is constituted of pipes, used to transport the fluids; wells, from which the fluids leave the reservoir and enter the network; valves, used to control the network; and pumps used to re-inject fluids in the reservoir.
  • the reservoir is modeled as a dynamical system thanks to the material balance equations and the black oil model.
  • the controls $u$ are the opening or closing of pipes $o_a$, $a \in \mathcal{A}$, and the choice of the well-head pressures $P_v$, $v \in \mathcal{V}_{in} \subset \mathcal{V}$.
  • Let $f$ be the evolution function of the reservoir, and $U^{ad}$ be the admissibility set of the controls of the production system.
  • the goal of the implementations is to optimize the production phase, i.e. to maximize an economic criterion such as the net present value over multiple time steps.
  • $\mathcal{T} = \{0, \dots, T-1\}$ is the finite set of the time steps, where $T$ is a positive integer.
  • the deterministic optimization problem is written as the problem (2.1).
  • the observations are the reservoir pressure $P^R$, the water-cut $w_{ct}$ (proportion of water produced when a volume of fluids is extracted), and the gas-oil ratio $g_{or}$ (proportion of gas produced when a volume of oil is extracted). Those observations allow the observation function to be properly defined.
  • The objective function of the problem (in Equation (4.1a)) is defined by the cost function $L_t$. It is defined as in Equation (2.1a). It depends on the production values, which are affected by the observations (reservoir pressure, water-cut and gas-oil ratio). The production values are obtained through the general production function $\Phi : \mathbb{X} \times \mathbb{U} \to \mathbb{R}^3$ (in the one tank case), and the implementations associate a price vector $r_t$ with the production of each fluid: oil, gas and water. Controls $u$ may also have an associated cost vector $c$, such as the operating cost of a pump which re-injects water in the reservoir. All those costs are condensed in the cost function $L_t : \mathbb{X} \times \mathbb{U} \to \mathbb{R}, \ (x, u) \mapsto r_t^{\top} \Phi(x, u) - c^{\top} u$.
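The stage function described above combines a price vector applied to the production values with a cost vector applied to the controls. A minimal sketch follows, assuming the value is price-weighted production minus control costs. The function name, argument layout and numbers are illustrative assumptions, not part of the disclosure.

```python
def stage_value(production, prices, controls, control_costs):
    """Price-weighted production minus control costs for one time step.

    production    -- produced volumes (oil, gas, water), i.e. the output of Phi(x, u)
    prices        -- price vector r_t, one entry per fluid (may be negative for water)
    controls      -- control vector u
    control_costs -- cost vector c, one entry per control
    """
    revenue = sum(r * q for r, q in zip(prices, production))
    operating = sum(c * v for c, v in zip(control_costs, controls))
    return revenue - operating

# Hypothetical numbers: water carries a negative price (treatment cost),
# and only the first control (e.g. a pump) is active.
example = stage_value(production=(10, 5, 2), prices=(50, 3, -1),
                      controls=(1, 0), control_costs=(20, 7))
```

Whether this quantity is maximized or its opposite is minimized depends on the sign convention of the optimization problem, which this sketch does not fix.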
  • The initialization of the state of the reservoir is represented in Equation (4.1b).
  • the implementations initialize the state with the distribution given by previous analysis on the reservoir.
  • For the dynamics of Equation (4.1c), the implementations use the function $f$ previously defined. The dynamics were defined using the general production function $\Phi$ in Equation (2.3).
  • the function $f$ is defined as: $$f : \mathbb{X} \times \mathbb{U} \to \mathbb{X}, \quad (x, u) \mapsto \begin{pmatrix} x^{(1)} - \Phi^{(1)}(x, u) \\ x^{(2)} - \Phi^{(2)}(x, u) + x^{(1)} R_s(x^{(5)}) - \left( x^{(1)} - \Phi^{(1)}(x, u) \right) R_s(\Xi(x, u)) \\ x^{(3)} - \Phi^{(3)}(x, u) \\ x^{(4)} \left( 1 + c_f \left( \Xi(x, u) - x^{(5)} \right) \right) \\ \Xi(x, u) \end{pmatrix},$$ where $\Xi$ is an easily computed function. It is now considered that the computation of $f$ is in $O(1)$, i.e. in constant time. Also, $f$ is stationary (not time dependent).
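The five-component dynamics can be written as a small function once the production function, the pressure function and the dissolved-gas function are supplied. The sketch below assumes the state ordering (oil, free gas, water, pore volume, pressure). The callables `phi`, `xi`, `rs` and the numeric values are hypothetical placeholders chosen only so that the example runs.

```python
def make_dynamics(phi, xi, rs, cf):
    """Build f(x, u) from a production function phi, a pressure function xi,
    a dissolved-gas function rs and a pore compressibility cf."""
    def f(x, u):
        oil, gas, water, pore, pressure = x
        q_oil, q_gas, q_water = phi(x, u)      # produced volumes for this step
        p_next = xi(x, u)                      # new reservoir pressure
        oil_next = oil - q_oil
        # Free gas: production, plus gas released from (or dissolved into) the oil.
        gas_next = gas - q_gas + oil * rs(pressure) - oil_next * rs(p_next)
        water_next = water - q_water
        # Pore volume reacts linearly to the pressure change.
        pore_next = pore * (1.0 + cf * (p_next - pressure))
        return (oil_next, gas_next, water_next, pore_next, p_next)
    return f

# Hypothetical closures, for illustration only.
f = make_dynamics(
    phi=lambda x, u: (u, 0.0, 0.0),   # the control is the produced oil volume
    xi=lambda x, u: x[4] - u,         # pressure drops with production
    rs=lambda p: 0.01 * p,            # dissolved gas grows with pressure
    cf=1e-4,
)
next_state = f((100.0, 10.0, 50.0, 1000.0, 200.0), 10.0)
```

Keeping `phi` and `xi` as injected callables mirrors the text, where both are defined separately from $f$.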
  • Equation (4.1d) defines the observations one has access to.
  • the observation function $h$ is known, as is how those observations depend on the components of the state.
  • $h : \mathbb{X} \to \mathbb{O}, \quad x \mapsto \left( x^{(5)}, \ w_{ct}(x^{(3)}, x^{(4)}, x^{(5)}), \ g_{or}(x^{(2)}, x^{(4)}, x^{(5)}) \right)$
  • the observation only depends on the state itself, not on the controls u . Moreover, the observation functions considered are stationary, whereas the observation functions of Problem (3.1) were time dependent and also depended on the previous controls.
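The structure of the observation function (pressure read directly from the state, water-cut and gas-oil ratio computed from selected state components, no dependence on the controls) can be sketched as follows. The two ratio functions used in the example are hypothetical placeholders, since their actual expressions are not given in this passage.

```python
def make_observation(water_cut, gas_oil_ratio):
    """Build h(x) = (pressure, water-cut, gas-oil ratio) from two given functions."""
    def h(x):
        _oil, gas, water, pore, pressure = x
        return (
            pressure,                             # observed directly
            water_cut(water, pore, pressure),     # depends on components 3, 4, 5
            gas_oil_ratio(gas, pore, pressure),   # depends on components 2, 4, 5
        )
    return h

# Hypothetical ratios, for illustration only.
h = make_observation(
    water_cut=lambda water, pore, pressure: water / pore,
    gas_oil_ratio=lambda gas, pore, pressure: gas / pore,
)
observation = h((100.0, 10.0, 50.0, 1000.0, 200.0))
```

Because `h` takes only the state, the same observation is returned whatever control is applied, which is the stationarity property noted above.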
  • Admissibility set of the controls states that for each time step t, the controls u t must belong to an admissibility set U t ad which depends on the current state.
  • this admissibility set contains all the constraints derived from the production system (capacity of the pipes, allowed pressure range of the different assets, which is translated to a pressure range at the different nodes, capacity of treatment of gas and water at the export point). Those constraints are such that they directly depend on the fluids production. There is thus a set defining all the current ranges of admissible values on the production network such that admissible controls are defined as ⁇ x u ⁇ X ⁇ U , ⁇ x u ⁇ U t .
  • the admissibility set is therefore defined as the set valued mapping U t ad : X ⁇ U , x ⁇ u
  • Equation (4.1f) is the non-anticipativity constraint. It states that to choose the controls at time t, one only has access to the history of controls and observation up to time t.
  • Problem (4.1) is a mon-det-POMDP.
  • T the set of the functions and their well defined compositions.
  • T is a Monotonous Function Set.
  • composition thus has the following form
  • T is thus a Monotonous Function Set.
  • T is a Monotonous Function Set.
  • Problem (4.1) is a mon-det-POMDP.
  • Implementations of the optimization where the optimization comprises solving the optimization problem (P), previously discussed, with f the function given by the previously-discussed formula (S), in the case where the observations are partial observations are now discussed.
  • P which corresponds to Problem (4.1)
  • P is a det-POMDP.
  • These implementations include discretizing the optimization problem (P), which includes the step of implementing the discretization framework discussed hereinafter, constructing the belief space as discussed hereinafter, constructing the reachable state space as discussed hereinafter, constructing the reachable belief space as discussed hereinafter.
  • the implementations may further include applying any suitable Dynamic Programming Algorithm to solve the discretized problem (P), for example by using Algorithm 1.
  • the implementations discretize the observations in $m$ values, and consider that there can be up to $d$ controls per observation $o$, and that those controls belong to $\mathbb{U}^{ad}(o)$. Hence, discretized sets $\mathbb{U}_d$ and $\mathbb{O}_d$ are now considered for the controls and observations, together with a discretized function $h : \mathbb{X} \to \mathbb{O}_d$ and controls $u \in \mathbb{U}_d \subset \bigcup_{o \in \mathbb{O}_d} \mathbb{U}^{ad}(o)$.
  • the implementations then build the state space by recursively applying the dynamics on the possible initial state with the relevant associated controls.
  • the implementations therefore yield a discretized dynamics f : X d ⁇ U d ⁇ X d .
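Building the state space by recursively applying the dynamics from the possible initial states can be sketched as a breadth-first exploration. This is a generic illustration, not the Algorithm 7 of the disclosure: the function name, the toy dynamics and the control set are assumptions, and no ordering by observation is maintained.

```python
from collections import deque

def reachable_states(initial_states, admissible_controls, dynamics, horizon):
    """Breadth-first construction of the states reachable within `horizon` steps."""
    states = list(dict.fromkeys(initial_states))   # insertion-ordered, no duplicates
    depth = {x: 0 for x in states}
    successors = {}
    queue = deque(states)
    while queue:
        x = queue.popleft()
        if depth[x] == horizon:
            continue                                # no decision left after the horizon
        successors[x] = {}
        for u in admissible_controls(x):
            y = dynamics(x, u)
            successors[x][u] = y
            if y not in depth:                      # first visit is the earliest one
                depth[y] = depth[x] + 1
                states.append(y)
                queue.append(y)
    return states, successors

# Toy example: an integer stock from which 0, 1 or 2 units may be removed.
states, successors = reachable_states(
    initial_states=[5],
    admissible_controls=lambda x: [u for u in (0, 1, 2) if u <= x],
    dynamics=lambda x, u: x - u,
    horizon=2,
)
```

The `successors` dictionary plays the role of the discretized dynamics restricted to reachable states: it maps each expanded state and control to the next state.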
  • Table 4.1 Notations of the spaces Symbol Definitions State space Discretized state space Control space Discretized control space Space of the observations Discretized space of the observations Space of the time steps Space of the beliefs Space of the reachable beliefs (discrete)
  • the implementations may construct an ordered reachable state space for each x 0 ⁇ supp b 0 thanks to Algorithm 7 (discussed below), which returns the reachable state space and the successors of each state.
  • Algorithm 7 complexity is in O d m T .
  • An underlying assumption of the algorithm is that we have an ordered observation set. The algorithm can be adapted to the case where there is only a partial order on the observation with some additional refinement to get an ordered reachable state space.
  • the observations create a number of separations of the state space .
  • applying the controls on a given state gives points that are on a deformed "discrete parallelepiped" (a set of points whose convex hull is a parallelepiped). The form of that parallelepiped depends on the observation.
  • the implementations may create the reachable state space by putting multiple "discrete parallelepipeds" next to each other, until reaching the frontier delimiting the changes to the observation. Crossing the frontier gives new points on another observation, where the implementations may apply that construction again. The implementations continue the algorithm until "discrete parallelepipeds" are put in each of the three directions of the controls.
  • the implementations compute the frontier of the "discrete parallelepiped" through the use of Algorithm 6 (discussed below). Finally, the implementations may get the successors of a given state x by the controls u by looking at the order of the other states in ListStates( o ). Getting a list of successors may hence be computed when using Algorithm 7 without changing its complexity.
  • implementations represent beliefs as tables D b of size
  • Each component of the table represents the state the system would be in if the initial state was $x_{0,i} \in \mathrm{supp}(b_0)$.
  • each component is in $\mathbb{X}_d \cup \{\partial\}$, where $\partial$ is an added element, the cemetery, which represents an empty state.
  • if the $i$-th component of $D_b$ is $\partial$, it means that the initial state could not have been $x_{0,i}$.
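The table representation of a belief can be sketched as a tuple with one entry per possible initial state, where a sentinel plays the role of the cemetery. The update rule below (apply the deterministic dynamics, then discard the hypotheses that contradict the new observation) and the renormalization of the initial weights are an illustrative reading of the text. Names, toy dynamics and toy observation function are assumptions.

```python
CEMETERY = None  # stands for the added "empty state" element

def update_belief(table, control, observation, dynamics, observe):
    """Push every surviving hypothesis through the dynamics and filter by observation."""
    new_table = []
    for x in table:
        y = CEMETERY if x is CEMETERY else dynamics(x, control)
        # A hypothesis inconsistent with the observation is sent to the cemetery.
        keep = y is not CEMETERY and observe(y) == observation
        new_table.append(y if keep else CEMETERY)
    return tuple(new_table)

def probabilities(table, prior):
    """Belief over states: prior weights of surviving hypotheses, renormalized."""
    total = sum(p for x, p in zip(table, prior) if x is not CEMETERY)
    belief = {}
    for x, p in zip(table, prior):
        if x is not CEMETERY:
            belief[x] = belief.get(x, 0.0) + p / total
    return belief

# Three possible initial states; the observation is a coarse level reading.
table = update_belief((5, 6, 9), control=1, observation=1,
                      dynamics=lambda x, u: x - u, observe=lambda x: x // 4)
belief = probabilities(table, prior=(0.5, 0.3, 0.2))
```

Storing states rather than probabilities keeps the table size equal to the size of the support of the initial belief, whatever the size of the state space.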
  • Algorithm 9 simply uses function Successors (defined in Algorithm 8) to find the successors of a given belief.
  • the beliefs added are ordered since the different state spaces are ordered due to how the discretization of the controls is chosen. Hence beliefs are always added after all their predecessors, which means that the implementations go through each belief only once.
  • After applying Algorithm 9, the implementations have the belief space and the different transitions between the different beliefs. The implementations may therefore apply Algorithm 5 to solve Problem (4.1).
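Once beliefs and their transitions are available, the problem can be solved by dynamic programming over beliefs. The sketch below is a generic memoized backward induction on belief tables, not the Algorithm 5 of the disclosure. It assumes a cost to minimize, assumes every control is admissible for every belief, and uses `None` as the cemetery; all names and the toy data are hypothetical.

```python
from functools import lru_cache

def solve_belief_dp(initial_states, prior, controls, dynamics, observe,
                    cost, final_cost, horizon):
    """Return (optimal expected cost, first control) for a toy det-POMDP."""

    def weights(table):
        # Conditional probability of each surviving initial-state hypothesis.
        total = sum(p for x, p in zip(table, prior) if x is not None)
        return [p / total if x is not None else 0.0 for x, p in zip(table, prior)]

    @lru_cache(maxsize=None)
    def value(table, t):
        w = weights(table)
        if t == horizon:
            final = sum(wi * final_cost(x) for x, wi in zip(table, w) if x is not None)
            return final, None
        best = (float("inf"), None)
        for u in controls:
            nxt = [dynamics(x, u) if x is not None else None for x in table]
            expected = sum(wi * cost(t, x, u)
                           for x, wi in zip(table, w) if x is not None)
            # Each possible observation splits the belief into a child belief.
            for o in {observe(y) for y in nxt if y is not None}:
                child = tuple(y if y is not None and observe(y) == o else None
                              for y in nxt)
                prob = sum(wi for y, wi in zip(child, w) if y is not None)
                expected += prob * value(child, t + 1)[0]
            if expected < best[0]:
                best = (expected, u)
        return best

    return value(tuple(initial_states), 0)

# Toy tank: pumping costs 0.6 and earns 1.0 only if water is present.
best_cost, first_control = solve_belief_dp(
    initial_states=(1, 2), prior=(0.5, 0.5), controls=(0, 1),
    dynamics=lambda x, u: max(x - u, 0),
    observe=lambda y: int(y > 0),
    cost=lambda t, x, u: u * (0.6 - (1.0 if x >= 1 else 0.0)),
    final_cost=lambda x: 0.0,
    horizon=2,
)
```

In this toy case, pumping first is optimal because the resulting observation reveals whether the tank is empty, so the second decision can avoid a useless pumping cost.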
  • the size of the problem is such that it can be solved in a reasonable time: the generation of the problem was made in 3200 seconds (applying both Algorithms 7 and 9), while the solving time was of 400 seconds (applying Algorithm 1).
  • the code may be parallelized.
  • the method is computer-implemented. This means that steps (or substantially all the steps) of the method are executed by at least one computer, or any system alike. Thus, steps of the method are performed by the computer, possibly fully automatically, or, semi-automatically. In examples, the triggering of at least some of the steps of the method may be performed through user-computer interaction.
  • the level of user-computer interaction required may depend on the level of automatism foreseen and put in balance with the need to implement user's wishes. In examples, this level may be user-defined and/or pre-defined.
  • a typical example of computer-implementation of a method is to perform the method with a system adapted for this purpose.
  • the system may comprise a processor coupled to a memory and a graphical user interface (GUI), the memory having recorded thereon a computer program comprising instructions for performing the method.
  • the memory may also store a database.
  • the memory is any hardware adapted for such storage, possibly comprising several physical distinct parts (e.g. one for the program, and possibly one for the database).
  • FIG. 13 shows an example of the system, wherein the system is a client computer system, e.g . a workstation of a user.
  • the client computer of the example comprises a central processing unit (CPU) 1010 connected to an internal communication BUS 1000, a random access memory (RAM) 1070 also connected to the BUS.
  • the client computer is further provided with a graphical processing unit (GPU) 1110 which is associated with a video random access memory 1100 connected to the BUS.
  • Video RAM 1100 is also known in the art as frame buffer.
  • a mass storage device controller 1020 manages accesses to a mass memory device, such as hard drive 1030.
  • Mass memory devices suitable for tangibly embodying computer program instructions and data include all forms of nonvolatile memory, including by way of example semiconductor memory devices, such as EPROM, EEPROM, and flash memory devices; magnetic disks such as internal hard disks and removable disks; magneto-optical disks; and CD-ROM disks 1040. Any of the foregoing may be supplemented by, or incorporated in, specially designed ASICs (application-specific integrated circuits).
  • a network adapter 1050 manages accesses to a network 1060.
  • the client computer may also include a haptic device 1090 such as cursor control device, a keyboard or the like.
  • a cursor control device is used in the client computer to permit the user to selectively position a cursor at any desired location on display 1080.
  • the cursor control device allows the user to select various commands, and input control signals.
  • the cursor control device includes a number of signal generation devices for input control signals to system.
  • a cursor control device may be a mouse, the button of the mouse being used to generate the signals.
  • the client computer system may comprise a sensitive pad, and/or a sensitive screen.
  • the computer program may comprise instructions executable by a computer, the instructions comprising means for causing the above system to perform the method.
  • the program may be recordable on any data storage medium, including the memory of the system.
  • the program may for example be implemented in digital electronic circuitry, or in computer hardware, firmware, software, or in combinations of them.
  • the program may be implemented as an apparatus, for example a product tangibly embodied in a machine-readable storage device for execution by a programmable processor. Method steps may be performed by a programmable processor executing a program of instructions to perform functions of the method by operating on input data and generating output.
  • the processor may thus be programmable and coupled to receive data and instructions from, and to transmit data and instructions to, a data storage system, at least one input device, and at least one output device.
  • the application program may be implemented in a high-level procedural or object-oriented programming language, or in assembly or machine language if desired. In any case, the language may be a compiled or interpreted language.
  • the program may be a full installation program or an update program. Application of the program on the system results in any case in instructions for performing the method.

Abstract

The disclosure notably relates to a computer-implemented method for multiperiod optimization of oil and/or gas production. The method comprises providing a controlled dynamical system. The controlled dynamical system describes the evolution over time of a state of an oil and/or gas reservoir. The method further comprises providing a time-dependent admissible set of controls. The controls describe actions respecting constraints for controlling oil and/or gas flow and/or pressure. The method further comprises providing time-dependent observations of the content of the reservoir. The method further comprises optimizing, with respect to the state of the reservoir, the controls and the observations, an expected value over a given time span of an objective production function of the state, the controls and the observations. This constitutes an improved solution for oil and/or gas production.

Description

    TECHNICAL FIELD
  • The disclosure relates to the field of computer programs and systems, and more specifically to a method, system and program for multiperiod optimization of oil and/or gas production.
  • BACKGROUND
  • Oil and gas production projects usually span over several decades and involve complex planning and decision making. The lifetime of a hydrocarbon field is usually decomposed in five phases: exploration, where reservoirs containing hydrocarbon are found; appraisal, to give a value to a field; development, where infrastructure is planned and installed; production, where hydrocarbon is finally produced; abandonment, where the field stops producing and the infrastructures are decommissioned and removed. An increasing concern is to improve the oil and/or gas production, and thus to optimize it.
  • However, there is still a need for improved solutions for oil and/or gas production optimization.
  • SUMMARY
  • It is therefore provided a computer-implemented method for multiperiod optimization of oil and/or gas production. The method comprises providing a controlled dynamical system. The controlled dynamical system describes the evolution over time of a state of an oil and/or gas reservoir. The method further comprises providing a time-dependent admissible set of controls. The controls describe actions respecting constraints for controlling oil and/or gas flow and/or pressure. The method further comprises providing time-dependent observations of the content of the reservoir. The method further comprises optimizing, with respect to the state of the reservoir, the controls and the observations, an expected value over a given time span of an objective production function of the state, the controls and the observations.
  • The method may comprise one or more of the following:
    • the controlled dynamical system comprises evolution equations derived from material balance equations and/or black oil models;
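    • the controlled dynamical system is of the type: $x_{t+1} = f(x_t, u_t)$,
      where $t$ represents the time, $x_t$ the state of the reservoir at time $t$, and $u_t$ the controls at time $t$, and where $f$ is of the type: $$f : (x, u) \mapsto \begin{pmatrix} x^{(1)} - \Phi^{(1)}(x, u) \\ x^{(2)} - \Phi^{(2)}(x, u) + x^{(1)} R_s(x^{(5)}) - \left( x^{(1)} - \Phi^{(1)}(x, u) \right) R_s(\Xi(x, u)) \\ x^{(3)} - \Phi^{(3)}(x, u) \\ x^{(4)} \left( 1 + c_f \left( \Xi(x, u) - x^{(5)} \right) \right) \\ \Xi(x, u) \end{pmatrix}$$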
      where:
      • x = (x (1), x (2), x (3), x (4), x (5)),
      • Rs represents dissolved gas,
      • cf represents the pore compressibility of the reservoir,
      • $\Phi : (x, u) \mapsto \Phi(x, u)$ represents production values as a function of $(x, u)$,
      • $\Xi$ is a function such that $P_{t+1}^{R} = \Xi(x_t, u_t)$, where $P^R$ represents a reservoir pressure;
    • the optimizing comprises solving an optimization problem of the type:
      $$\min_{X, O, U} \ \mathbb{E}\left[ \sum_{t=0}^{T-1} L_t(X_t, U_t) + K(X_T) \right]$$
      $$\text{s.t.} \quad \mathcal{L}(X_0) = \mu_0,$$
      $$X_{t+1} = f(X_t, U_t), \quad \forall t \in \mathcal{T},$$
      $$O_t = h(X_t), \quad \forall t \in \mathcal{T},$$
      $$U_t \in U_t^{ad}(X_t), \quad \forall t \in \mathcal{T},$$
      $$\sigma(U_t) \subset \sigma(O_0, \dots, O_t, U_0, \dots, U_{t-1}), \quad \forall t \in \mathcal{T},$$
      where:
      • X,O,U are respectively the state of the reservoir, the observations, and the controls,
      • $\mathcal{T} = \{0, \dots, T-1\}$ is a finite set of time steps, where $T$ is a positive integer,
      • $L_t$ is the objective production function at time $t$,
      • $K(X_T)$ is an objective final production function,
      • $\mu_0$ is a probability distribution representing an initial state of the reservoir,
      • $X_{t+1} = f(X_t, U_t)$ corresponds to the dynamical system,
      • $h$ is an observation function,
      • $U_t^{ad}$ represents a set of admissible controls at time $t$;
    • the observations comprise partial observations;
    • the observations depend only on the state of the reservoir;
    • the observations are observation functions of the form $O_t = h(X_t)$, where $X_t$, $O_t$ represent respectively the state of the reservoir and the observations at time $t$, and where $h$ is of the type $$h(x) = \begin{pmatrix} x^{(5)} \\ \omega_{ct}(x^{(3)}, x^{(4)}, x^{(5)}) \\ g_{or}(x^{(2)}, x^{(4)}, x^{(5)}) \end{pmatrix},$$ where $\omega_{ct}$ is a function representing a water-cut and $g_{or}$ is a function representing a gas-oil ratio, and where $x = (x^{(1)}, x^{(2)}, x^{(3)}, x^{(4)}, x^{(5)})$;
    • the optimization comprises solving an optimization problem that is a Deterministic Partially Observed Markov Decision Process (det-POMDP);
    • the optimization comprises discretizing the optimization problem;
    • discretizing the optimization problem comprises providing a discrete control set and a discrete observation set and building a discrete state space by recursively applying the dynamics on a given initial state with associated controls, the discrete state space being the set of the states reachable from the given initial state;
    • discretizing the optimization problem comprises constructing a space of beliefs, which are probabilities on the discrete state space; and/or
    • the Deterministic Partially Observed Markov Decision Process has monotonicity, such that the space of reachable beliefs is included in a subset of the probability space.
  • It is further provided a computer program comprising instructions for performing the method.
  • It is further provided a computer readable storage medium having recorded thereon the computer program.
  • It is further provided a computer system comprising a processor coupled to a memory, the memory having recorded thereon the computer program.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • Non-limiting examples will now be described in reference to the accompanying drawings, where:
    • FIG.s 1 to 12 illustrate the method; and
    • FIG. 13 shows an example of the system.
    DETAILED DESCRIPTION
  • It is proposed a computer-implemented method for multiperiod optimization of oil and/or gas production. The method comprises providing a controlled dynamical system. The controlled dynamical system describes the evolution over time of a state of an oil and/or gas reservoir. The method further comprises providing a time-dependent admissible set of controls. The controls describe actions respecting constraints for controlling oil and/or gas flow and/or pressure. The method further comprises providing time-dependent observations of the content of the reservoir. The method further comprises optimizing, with respect to the state of the reservoir, the controls and the observations, an expected value over a given time span of an objective production function of the state, the controls and the observations.
  • The method forms an improved solution for oil and/or gas production optimization.
  • Notably, the method performs multiperiod optimization of oil and/or gas production, i.e. it allows optimizing a production of oil and/or gas over a given time span that comprises several time periods. For that, the method optimizes an expected value of an objective production function over a given time span (i.e. that encompasses several time periods, e.g. several years or months forming a production phase or at least a part thereof) with respect to time-evolving variables of the function which are the state of the underlying reservoir, observations of the content of the reservoir, and admissible controls that describe actions respecting constraints for controlling oil and/or gas flow and/or pressure. Furthermore, the method describes the time-evolution of the state of the reservoir as a controlled dynamical system such that the time evolution of the state variable accounts for the controls and the observations. This improves robustness of the optimization and enables multiperiod optimization with high accuracy.
  • The output of the optimization is the expected value of the objective function optimized over the given time span with respect to the state of the reservoir, the controls and the observations. This value represents an objective oil and/or gas production value over the given time span and allows taking real-time decisions and/or actions for oil and/or gas production by exploiting a real-world reservoir. The method may further comprise displaying the optimized value. The method may be performed several times, each execution of the method yielding a respective output optimized value, and the method may then further comprise displaying a graph representing the respective optimized values (e.g. for different reservoir configurations) and/or performing statistics on the optimized values, e.g. to take real-time decisions and/or actions for oil and/or gas production by exploiting a real-world reservoir. The controls obtained as a result of the optimization are policies, i.e. functions of the observations that may then be used in real time with real-world observations.
  • The method may be included in an oil and/or gas production process (e.g. for a single reservoir or for several reservoirs connected to one another) which may comprise:
    • performing the method, thereby obtaining an optimized expected value, over a time span, of an objective function that represents an optimal production over the time span for a real-world reservoir or for several real-world reservoirs connected to one another, and thereby also obtaining optimal controls for the real-world reservoir(s) as functions of (e.g. partial) observations of the content of the real-world reservoir(s), the controls describing actions respecting constraints for controlling oil and/or gas flow and/or pressure;
    • taking production decisions based on the optimized value, such as drilling and/or positioning injection wells and/or production wells and/or positioning valves and/or pipes; and/or
    • performing physical actions based on the decisions, such as physically drilling and/or physically positioning injection wells and/or production wells and/or physically positioning valves and/or pipes.
  • The method is now further discussed.
  • The method is for multiperiod optimization of production of oil and/or gas from the reservoir. The method thus optimizes the production over the given time span that encompasses several periods, e.g. several years or months of production, e.g. several decades of production.
  • The method comprises providing a controlled dynamical system. The controlled dynamical system describes the evolution over time of a state of an oil and/or gas reservoir, where the time evolution of the state depends on the current state and controls. In other words, the dynamical system describes how the state of the reservoir evolves over time given the controls. The state may be a vector comprising one or more variables each representing a physical quantity describing a property of the reservoir. The one or more variables are time-evolving variables, and may comprise any one or any combination of (e.g. all of): the time-evolving amount of oil in the reservoir, the time-evolving amount of free gas in the reservoir, the time-evolving amount of water in the reservoir, the time-evolving total pore volume of the reservoir, and/or the time-evolving reservoir pressure. The dynamical system is controlled, which means that the state variable at a given time depends on the time-dependent controls, e.g. at a previous time. The time-dependent state and/or controls may further depend on the time-dependent observations. The providing of the controlled dynamical system may comprise establishing the controlled dynamical system, e.g. by deriving the equations thereof. The controlled dynamical system may comprise evolution equations derived from material balance equations and/or black oil models. In such a case, deriving the controlled dynamical system may comprise deriving the controlled dynamical system from material balance equations and/or black oil models. The controlled dynamical system may be of the type:
    $$x_{t+1} = f(x_t, u_t),$$
    where $t$ represents the time, $x_t$ the state of the reservoir at time $t$, and $u_t$ the controls at time $t$, and where $f$ is of the type:
    $$f : (x, u) \mapsto \begin{pmatrix} x^{(1)} - \Phi^{(1)}(x,u) \\ x^{(2)} - \Phi^{(2)}(x,u) + x^{(1)} R_s(x^{(5)}) - \big(x^{(1)} - \Phi^{(1)}(x,u)\big) R_s\big(\Xi(x,u)\big) \\ x^{(3)} - \Phi^{(3)}(x,u) \\ x^{(4)}\big(1 + c_f(\Xi(x,u) - x^{(5)})\big) \\ \Xi(x,u) \end{pmatrix}$$
    where:
    • $x = (x^{(1)}, x^{(2)}, x^{(3)}, x^{(4)}, x^{(5)})$,
    • $R_s$ represents dissolved gas,
    • $c_f$ represents the pore compressibility of the reservoir,
    • $\Phi : (x, u) \mapsto \Phi(x, u) = (\Phi^{(1)}(x, u), \Phi^{(2)}(x, u), \Phi^{(3)}(x, u))$ represents production values as a function of $(x, u)$,
    • $\Xi$ is a function such that $P^R_{t+1} = \Xi(x_t, u_t)$, where $P^R$ represents a reservoir pressure.
  • The component variables $x^{(1)}, x^{(2)}, x^{(3)}, x^{(4)}, x^{(5)}$ of the state $x$ may respectively be the time-evolving amount of oil in the reservoir, the time-evolving amount of free gas in the reservoir, the time-evolving amount of water in the reservoir, the time-evolving total pore volume of the reservoir, and the time-evolving reservoir pressure. $\Phi$ may be a general production function which may comprise, as coordinates, the production of oil $\Phi^{(1)}$, the production of free gas $\Phi^{(2)}$, and the production or injection of water $\Phi^{(3)}$. In examples,
    $$\Phi(x, u) = \tilde{\Phi}(h(x), u),$$
    where $\tilde{\Phi}$ is a vector function with three coordinates representing respectively the production of oil as a function of $(h(x), u)$, the production of free gas as a function of $(h(x), u)$, and the production or injection of water as a function of $(h(x), u)$, where $h(x)$ is an observation function that takes as input $x$ and that outputs the observation of the content of the reservoir corresponding to $x$.
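As an illustration only, the evolution function f described above can be sketched in Python as a closure over the problem-specific ingredients. The names `make_dynamics`, `phi`, `xi`, `rs` and `cf` are assumptions introduced for this sketch and do not come from the source; the sketch also assumes that the pore-volume update scales the fourth state component (the total pore volume) by the compressibility term.

```python
from typing import Callable, Tuple

# State layout assumed here: (oil, free gas, water, pore volume, reservoir pressure).
State = Tuple[float, float, float, float, float]


def make_dynamics(
    phi: Callable[[State, float], Tuple[float, float, float]],
    xi: Callable[[State, float], float],
    rs: Callable[[float], float],
    cf: float,
) -> Callable[[State, float], State]:
    """Build x_{t+1} = f(x_t, u_t) from hypothetical building blocks.

    phi(x, u): produced volumes (oil, free gas, water) over one period
    xi(x, u):  reservoir pressure at the next time step
    rs(p):     dissolved gas as a function of pressure
    cf:        pore compressibility
    """

    def f(x: State, u: float) -> State:
        oil, gas, water, pore, pressure = x
        prod_oil, prod_gas, prod_water = phi(x, u)
        next_pressure = xi(x, u)
        next_oil = oil - prod_oil
        # Free gas: previous amount, minus production, plus gas released
        # from solution (dissolved gas before minus dissolved gas after).
        next_gas = gas - prod_gas + oil * rs(pressure) - next_oil * rs(next_pressure)
        next_water = water - prod_water
        next_pore = pore * (1.0 + cf * (next_pressure - pressure))
        return (next_oil, next_gas, next_water, next_pore, next_pressure)

    return f
```

Passing `phi` and `xi` as callables keeps the sketch independent of any particular production network or pressure model, which the text leaves open at this point.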
  • The method further comprises providing a time-dependent admissible set of controls. The controls describe actions respecting constraints for controlling oil and/or gas flow and/or pressure. For example, the controls may include opening or closing a valve and/or a pipe, and/or choosing a well-head pressure and/or a bottom-hole pressure. The time-dependent admissible set of controls may be a mapping which, at each given time, takes as input the state of the reservoir at the given time and returns the set of controls that are allowable for this state. The set of allowable controls may depend on the reservoir pressure, which constrains the different pressures in the production system. Additionally or alternatively, the set of allowable controls may depend on the production network, for example some pipes can be controlled while others cannot and/or maintenance forces facilities to be closed at different periods.
  • The method further comprises providing time-dependent observations of the content of the reservoir. The time-dependent observations consist of an observation function that takes as input, at each given time, the state of the reservoir at the given time, and in examples also the controls at the given time, and returns an observation at the given time of the content of the reservoir. The observation may be a vector comprising (e.g. consisting of) coordinates comprising any one or any combination of (e.g. all of): the reservoir pressure at the given time, the water-cut at the given time (e.g. as a function of the amount of water in the reservoir at the given time, of the total pore volume of the reservoir at the given time, and of the reservoir pressure at the given time), and/or the gas-oil ratio at the given time (e.g. as a function of the amount of free gas in the reservoir at the given time, of the total pore volume of the reservoir at the given time, and of the reservoir pressure at the given time).
  • The method may further comprise providing an initial value for the state of the reservoir (e.g. provided as a probability distribution). The starting point may further comprise an initial value of the observations.
  • The method then comprises optimizing, with respect to the state of the reservoir, the controls and the observations, an expected value over a given time span of an objective production function of the state, the controls and the observations. Optimizing with respect to the state of the reservoir, the controls and the observations means that the state, the controls and the observations are the free variables of the optimization. The optimization thus searches for the values of these variables that tend to optimize (e.g. minimize or maximize) the expected value over a given time span of the objective production function. The optimization is constrained by constraints linking the state, controls and observations, the constraints being given by the controlled dynamical system (e.g. by the function f discussed above and the observation function). The given time span may be a time-interval that encompasses several production periods, i.e. several periods where parameters of the production and/or affecting the production vary from one period to another, e.g. several months or years or decades. Optimizing may comprise applying any suitable optimization algorithm. Optimizing may for example apply a multi-stage optimization method, e.g. using a Dynamic Programming algorithm, as discussed in implementations hereinbelow. The objective production function may be any production function, such as any function that captures the oil and/or gas that can be produced, e.g. depending on material and/or cost constraints. The objective production function may in examples be of the type:
    $$L_t(x, u) = r_t^{\top} \Phi(x, u) - c^{\top} u,$$
    where $\Phi$ is the general production function which has been previously discussed, $r_t^{\top}$ is a vector price for the production of each fluid (oil, gas and water) and $c$ is a cost associated with the controls, such as a functioning cost of a pump which re-injects water in the reservoir.
  • The optimizing may comprise solving, using any suitable optimization method such as a multi-stage optimization method using a Dynamic Programming algorithm, an optimization problem of the type:
    Figure imgb0022
    $$\begin{aligned} \text{s.t.}\quad & \mathcal{L}(X_0) = \mu_0, \\ & X_{t+1} = f(X_t, U_t), && \forall t \in \mathbb{T}, \\ & O_t = h(X_t), && \forall t \in \mathbb{T}, \\ & U_t \in \mathcal{U}_t^{ad}(X_t), && \forall t \in \mathbb{T}, \\ & \sigma(U_t) \subset \sigma(O_0, \dots, O_t, U_0, \dots, U_{t-1}), && \forall t \in \mathbb{T}, \end{aligned}$$
    where:
    • X, O, U are respectively the state of the reservoir, the observations, and the controls,
    • Figure imgb0028
      is a finite set of time steps, where
      Figure imgb0029
      is a positive integer,
    • Lt is the objective production function at time t,
    • K(
      Figure imgb0030
      ) is an objective final production function,
    • µ 0 is a probability distribution representing an initial state of the reservoir,
    • X t+1 = f(Xt, Ut ) corresponds to the dynamical system,
    • h is an observation function,
    • $\mathcal{U}_t^{ad}$ represents a set of admissible controls at time $t$.
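  • The function f in the optimization problem may be the function
    $$f : (x, u) \mapsto \begin{pmatrix} x^{(1)} - \Phi^{(1)}(x,u) \\ x^{(2)} - \Phi^{(2)}(x,u) + x^{(1)} R_s(x^{(5)}) - \big(x^{(1)} - \Phi^{(1)}(x,u)\big) R_s\big(\Xi(x,u)\big) \\ x^{(3)} - \Phi^{(3)}(x,u) \\ x^{(4)}\big(1 + c_f(\Xi(x,u) - x^{(5)})\big) \\ \Xi(x,u) \end{pmatrix}$$
    given by equation (S) and which has been previously discussed.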
  • In examples, observations comprise partial observations. In other words, the time-dependent observations represent time-dependent partial observations, i.e. the content of the reservoir is only partially observed by the observations. This makes it possible to perform the optimization even if the content of the reservoir is partially observed, which in practice, in oil and/or gas production, may often be the case. For example, the observations may depend only on the state of the reservoir, e.g. the mapping that yields the observations at a given time takes as input only the state at the given time, and thus does not directly account for the effects of the controls applied at the given time. In examples, the observations are observation functions of the form
    $$O_t = h(X_t),$$
    where $X_t$, $O_t$ represent respectively the state of the reservoir and the observations at time $t$, and where $h$ is of the type
    $$h(x) = \big(x^{(5)},\ \omega_{ct}(x^{(3)}, x^{(4)}, x^{(5)}),\ gor(x^{(2)}, x^{(4)}, x^{(5)})\big),$$
    where $\omega_{ct}$ is a function representing a water-cut and $gor$ is a function representing a gas-oil ratio, and where $x = (x^{(1)}, x^{(2)}, x^{(3)}, x^{(4)}, x^{(5)})$. The component variables $x^{(1)}, x^{(2)}, x^{(3)}, x^{(4)}, x^{(5)}$ of the state $x$ may respectively be the time-evolving amount of oil in the reservoir, the time-evolving amount of free gas in the reservoir, the time-evolving amount of water in the reservoir, the time-evolving total pore volume of the reservoir, and the time-evolving reservoir pressure, as previously discussed.
  • When the observations are partial observations, the optimization may comprise solving an optimization problem that is a Deterministic Partially Observed Markov Decision Process (det-POMDP). For example, the previously discussed optimization problem (P):
    Figure imgb0035
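    $$\begin{aligned} \text{s.t.}\quad & \mathcal{L}(X_0) = \mu_0, \\ & X_{t+1} = f(X_t, U_t), && \forall t \in \mathbb{T}, \\ & O_t = h(X_t), && \forall t \in \mathbb{T}, \\ & U_t \in \mathcal{U}_t^{ad}(X_t), && \forall t \in \mathbb{T}, \\ & \sigma(U_t) \subset \sigma(O_0, \dots, O_t, U_0, \dots, U_{t-1}), && \forall t \in \mathbb{T}, \end{aligned}$$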
    may be a Deterministic Partially Observed Markov Decision Process (det-POMDP). The concept of det-POMDP is known per se in the art and the method may use any suitable method for solving such problems for performing the optimization, such as performing a multi-stage optimization method using a Dynamic Programming algorithm, as discussed hereinafter.
  • The optimization may comprise discretizing the optimization problem. Discretizing the optimization problem may comprise providing a discrete control set and a discrete observation set and building a discrete state space by recursively applying the dynamics (i.e. the controlled dynamical system) on a given initial state with associated controls. The discrete state space is the set of states reachable from the given initial state. Discretizing the optimization problem may further comprise constructing a space of beliefs, which are probabilities on the discrete state space. A belief indicates a probability for a given state to be reached from the initial state. The Deterministic Partially Observed Markov Decision Process may in examples have monotonicity, such that the set of reachable beliefs is included in a subset of the probability space, for example a fan-like or comb-like subset. Monotonicity means that the det-POMDP is such that, if two sequences of controls lead to a same state when starting in a given state, then applying the two sequences of controls to another state either leads to a same result (i.e. leads to a same state), or one sequence leads to a cemetery point. The cemetery point is a point that may be added to the state space (i.e. so that the discrete state space may comprise the cemetery point) and that represents an unreachable state when considering past and present observations. Monotonicity of the det-POMDP thus saves computation time and computational resources for the optimization.
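The construction of the discrete state space by recursive application of the dynamics can be sketched as a breadth-first exploration. This is a minimal sketch under stated assumptions, not the patented procedure: the function name `reachable_states`, the convention that the dynamics returns a `cemetery` marker for an infeasible transition, and the requirement that states be hashable (e.g. already rounded onto a grid) are all hypothetical choices made for illustration. The beliefs themselves are not built here.

```python
def reachable_states(x0, controls, dynamics, horizon, cemetery=None):
    """Return the set of states reachable from x0 in at most `horizon` steps.

    x0:        initial state (must be hashable)
    controls:  finite iterable of discrete controls
    dynamics:  function (x, u) -> next state, or `cemetery` if infeasible
    horizon:   number of time steps to unroll
    """
    reachable = {x0}
    frontier = {x0}
    for _ in range(horizon):
        new_states = set()
        for x in frontier:
            for u in controls:
                y = dynamics(x, u)
                if y == cemetery:
                    # Infeasible transition: not expanded further.
                    continue
                if y not in reachable:
                    new_states.add(y)
        reachable |= new_states
        # Only newly discovered states need to be expanded at the next step.
        frontier = new_states
    return reachable
```

For instance, with an integer tank content, controls `(0, 1, 2)` and dynamics `x - u` (infeasible when negative), two steps from `x0 = 3` reach the contents 0 to 3.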
  • Implementations and aspects of the method, including mathematical concepts involved in the method, are now discussed.
  • In implementations, the controlled dynamical system that describes the reservoir's state (behavior) over time gives the evolution over time of physical quantities which characterize the hydrocarbon field under exploitation. In these implementations, the underlying equations are derived from material balance equations on the reservoir, under the hypothesis that the fluids contained in the reservoir follow a so-called "black-oil" model. Still in these implementations, the optimization may solve an optimization problem over time for an oil and gas production system which may be formulated with a deterministic formulation, the optimization problem being governed by the controlled dynamical system.
  • In the presently-discussed implementations, the reservoir is part of a production system that consists of the reservoir and of production assets such as pipes, wells, chokes. The topology of the production assets is represented as a graph $\mathcal{G} = (\mathcal{V}, \mathcal{A})$, where $\mathcal{V}$ is the set of vertices and $\mathcal{A} \subset \mathcal{V}^2$ is the set of arcs. Control variables are indexed by either nodes or edges. The different production assets are placed on the graph, with the pipes as the arcs and the rest of the assets, such as the well-heads, as the nodes. FIG. 1 illustrates such a graph, where the wells' perforations are represented as nodes (ωi) where the produced fluid enters the graph. On the other nodes, there are assets such as well-head chokes (ωhi), or joints between different pipes (i1). Although not shown on FIG. 1, such graphs may comprise valves to open or close pipes. There is also an export point e.
  • All the relevant operational constraints and features, such as pressure loss on the pipes, mass balance of the fluids at each node, allowed pressure and flow rate ranges in different assets or unavailability due to maintenance are modeled as constraints using variables defined on the arcs and nodes of the graph. Indeed, the graph allows to define different controls that can be applied on the system, such as opening or closing valves, or changing the well-head pressure.
  • The implementations optimize the system over the whole production phase (i.e. over multiple years), so multiple time steps belonging to a finite set
    Figure imgb0044
    are considered, where
    Figure imgb0045
    is a positive integer. The time steps may correspond to months or years. The optimization problem may be formulated as follows
    Figure imgb0046
    Figure imgb0047
    Figure imgb0048
    Figure imgb0049
  • The variables are: the controls $u_t$, which are the decisions that can be taken at time step $t$ (in this case, the pressure $P_{v,t}$ at the different vertices $v \in \mathcal{V}$ of the graph, and the Boolean $o_{a,t}$ stating whether a pipe $a \in \mathcal{A}$ of the graph is opened or closed); and the state of the reservoir $x_t$, as the reservoir is defined as a controlled dynamical system, with state $x_t \in \mathbb{X} \subset \mathbb{R}^n$ (with $\mathbb{X}$ the state space), control $u_t \in \mathbb{U}$ and an evolution function of the controlled dynamical system, $f$. At every time step $t$, when the decision maker takes decision $u_t$, an instantaneous gain denoted by $L_t(x_t, u_t)$ occurs. In the last stage, the final state $x_T$ (the quantity of fluids remaining in the reservoir) is valued as $\mathcal{K}(x_T)$. Let ρ be the discount factor. The objective function (2.1a) is finally obtained by adding all those terms. Equation (2.1b) defines the known initial state of the reservoir. Equation (2.1c) gives the controlled dynamics of the reservoir. Equation (2.1d) states that the allowed controls belong to an admissibility set, which is for each time step $t$ a set-valued mapping which takes a given state $x_t$ of the reservoir and returns the set of allowed controls. Admissibility notably depends on the reservoir pressure, which constrains the different pressures in the petroleum production system. The admissibility set also depends on the production network itself: some pipes can be controlled, while others cannot; or maintenance forces facilities to be closed at different periods.
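The formulation (2.1) itself is only available as figures in this text. Based solely on the description above, one plausible reading is the following; the direction of the optimization (a maximization of gains), the placement of the discount factor ρ, and the symbol $\mathcal{K}$ used for the final valuation of $x_T$ are assumptions of this sketch and may differ from the original figures.

$$\begin{aligned}
\max_{(x_t)_t,\,(u_t)_t}\quad & \sum_{t=0}^{T-1} \rho^t L_t(x_t, u_t) + \rho^T \mathcal{K}(x_T) && \text{(2.1a)} \\
\text{s.t.}\quad & x_0 \text{ given}, && \text{(2.1b)} \\
& x_{t+1} = f(x_t, u_t), \quad \forall t, && \text{(2.1c)} \\
& u_t \in \mathcal{U}_t^{ad}(x_t), \quad \forall t. && \text{(2.1d)}
\end{aligned}$$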
  • As can be seen in the general formulation (2.1), the implementations consider a deterministic controlled dynamical system. Note that, here, perfect knowledge of the content of the reservoir xt is assumed. Another formulation, with partial observation of the content of the reservoir, will be presented in implementations discussed later. Since the state xt is known, the implementations may use dynamic programming to solve this problem.
  • In order to solve problem (2.1), the implementations use a family of value functions $J_t : \mathbb{X} \to \mathbb{R}$, where $\mathbb{X}$ is the state space. A policy $\mu = \{\mu_0, \dots, \mu_{T-1}\}$ is a set of functions $\mu_t$ that map states $x_t$ into admissible controls $u_t$. The following proposition is known from the prior art:
  • Proposition 1. For every initial state $x_0 \in \mathbb{X}$, the optimal cost $J^*(x_0)$ of the problem (2.1) is equal to $J_0(x_0)$, given by the last step of the following algorithm, which proceeds backward in time from final time step $T$ to initial time step 0.
    Figure imgb0060
    Figure imgb0061
  • Furthermore, if
    Figure imgb0062
    minimizes the right-hand side of (2.2b) for each xt and t, then the policy
    Figure imgb0063
    is optimal.
  • To solve Problem 2.1, the implementations compute
    Figure imgb0064
    by using a dynamic programming algorithm (Algorithm 1 below). For that purpose, the implementations discretize the controls, that now belong to a finite set denoted by
    Figure imgb0065
    , and the states that belong to a finite set
    Figure imgb0066
    . The implementations also consider that the value functions follow a multilinear interpolation between the states.
    Figure imgb0067
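Algorithm 1 is only available as a figure here. The following is a minimal sketch of a backward dynamic programming recursion of the kind described, restricted to a one-dimensional state so that the multilinear interpolation of the value function reduces to `numpy.interp`. The function name `backward_induction`, its signature, the maximization of gains, and the per-step discounting are assumptions made for illustration, not the source's algorithm.

```python
import numpy as np


def backward_induction(states, controls, dynamics, gain, final_value,
                       horizon, admissible, discount=1.0):
    """Backward recursion on a 1-D state grid (sketch, maximizing gains).

    states:      increasing grid of discretized states
    controls:    finite list of discretized controls
    dynamics:    (x, u) -> next state
    gain:        (t, x, u) -> instantaneous gain
    final_value: x -> value of the final state
    admissible:  (t, x, u) -> bool; at least one control is assumed
                 admissible in every grid state
    """
    states = np.asarray(states, dtype=float)
    values = np.empty((horizon + 1, states.size))
    policy = np.full((horizon, states.size), np.nan)
    values[horizon] = [final_value(x) for x in states]
    for t in range(horizon - 1, -1, -1):
        for i, x in enumerate(states):
            best_value, best_control = -np.inf, np.nan
            for u in controls:
                if not admissible(t, x, u):
                    continue
                x_next = dynamics(x, u)
                # Linear interpolation of the next value function between grid states.
                future = np.interp(x_next, states, values[t + 1])
                candidate = gain(t, x, u) + discount * future
                if candidate > best_value:
                    best_value, best_control = candidate, u
            values[t, i] = best_value
            policy[t, i] = best_control
    return values, policy
```

The returned `policy[t, i]` stores, for each time step and grid state, a control attaining the best value, which corresponds to the notion of policy introduced before Proposition 1.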
  • The definition of the dynamical system according to implementations of the invention is now discussed. The dynamical system is defined with a state x and an evolution function f such that, for each time step t, $x_{t+1} = f(x_t, u_t)$. The state is given by the formula
    $$x_t = (V_t^o, V_t^g, V_t^w, V_t^p, P_t^R),$$
    where the components are defined in Table 2.1 below (where Sm³ stands for standard cubic meter, i.e. the volume taken by a fluid at standard pressure and temperature conditions (1.01325 bar and 15°C)):
    Table 2.1: Definition of the components of the state
    | Symbol | Definition |
    | --- | --- |
    | $V_t^o$ | Amount of oil in the reservoir (Sm³) at time t |
    | $V_t^g$ | Amount of free gas in the reservoir (Sm³) at time t |
    | $V_t^w$ | Amount of water in the reservoir (Sm³) at time t |
    | $V_t^p$ | Total pore volume of the reservoir (m³) at time t |
    | $P_t^R$ | Reservoir pressure (bara) at time t |
  • To define the evolution function f of the content of the reservoir between time t and t + 1, the amounts of fluids produced during the period [t, t + 1[ are used in these implementations. We denote them by $(F_t^o, F_t^g, F_t^w)$ and they are described in Table 2.2 below. We obtain those production values with the mapping $\Phi : \mathbb{X} \times \mathbb{U} \to \mathbb{R}^3$ such that $(F_t^o, F_t^g, F_t^w) = \Phi(x, u)$. The production mapping Φ depends on the form and specifications of the production network.
    Table 2.2: Definition of the productions
    | Symbol | Definition |
    | --- | --- |
    | $F_t^o$ | Volume of oil produced (Sm³) during [t, t + 1[ |
    | $F_t^g$ | Volume of gas produced (Sm³) during [t, t + 1[ |
    | $F_t^w$ | Volume of water produced (Sm³) during [t, t + 1[ |
  • The following assumptions on the reservoir are made: first, the fluids contained in the reservoir follow a black-oil model; second, the reservoir is considered to be a tank-like reservoir. Using these assumptions, the following result holds: Proposition 2. There exists a function $\Xi : \mathbb{X} \times \mathbb{U} \to \mathbb{R}$ such that the following function
    $$f : (x, u) \mapsto \begin{pmatrix} x^{(1)} - \Phi^{(1)}(x,u) \\ x^{(2)} - \Phi^{(2)}(x,u) + x^{(1)} R_s(x^{(5)}) - \big(x^{(1)} - \Phi^{(1)}(x,u)\big) R_s\big(\Xi(x,u)\big) \\ x^{(3)} - \Phi^{(3)}(x,u) \\ x^{(4)}\big(1 + c_f(\Xi(x,u) - x^{(5)})\big) \\ \Xi(x,u) \end{pmatrix}$$
    is the dynamics of the reservoir in (2.1c) (with $x = (x^{(1)}, \dots, x^{(5)})$, $R_s$ the solution gas, and $c_f$ the pore compressibility of the reservoir).
  • Two numerical applications illustrating the use of the material balance formulation are now discussed. The first application is a gas reservoir that can be modeled with two tanks and with a connection of known transmissivity linking the two together. It illustrates how the formulation can be applied to complex cases with multiple tanks. The second application is an oil reservoir where pressure is kept constant through water injection. This shows how injection may be taken into account to go beyond the first recovery of oil and gas. All numerical applications were performed on a computer equipped with a Core i7-4700K and 16 GB of memory.
  • First application: a gas reservoir with one well
  • The first application considers a gas reservoir, with production data that comes from a field approaching abandonment. It is a subfield consisting of an isolated reservoir and one well, which is part of a larger field that is not considered here. The good geology of this particular case makes it well suited to a tank model, as shown by many years of well-matched production. Also, the simplicity of the fluids, with a high methane purity, makes the black-oil model a very realistic assumption. The reservoir can be modeled with either one or two tanks, while the well perforation is modeled with a known stationary inflow performance relationship, noted IPR. The two-tank model is illustrated in FIG. 2. The rest of the network is not considered, and only optimization at the bottom of the well is done, without considering any vertical lift performance necessary to lift oil to the surface.
  • The goal here is to show how simple cases can be tackled with the material balance formulation, and that the formulation can also be applied to cases with multiple reservoirs. The state reduction of this real case is presented first, followed by a model with one tank and then a model with two tanks.
  • Formulation and state reduction: It is considered that the reservoir contains only gas and water. There is also no water production. The amount of water $V^w$ in the reservoir is therefore stationary, and considered to be a known parameter. Therefore one only needs to consider the evolution of the amount of gas, the pressure and the total pore volume as state variables. The general equation stating that the volume of the fluids in the reservoir must be equal to the total pore volume of the reservoir (Equation (2.17) in Appendix 2.A) thus simplifies here to:
    $$V^w \times B_w(P_{t+1}^R) + (V_t^g - F_t^g) \times B_g(P_{t+1}^R) = V_t^p \big(1 + c_f (P_{t+1}^R - P_t^R)\big) \tag{2.4}$$
  • Finally, since the pore compressibility $c_f$ may be considered to be a constant, the total pore volume $V^p$ can be expressed as a function of the pressure, i.e. $V_t^p = V_0 e^{c_f P_t^R}$, and Equation (2.4) can therefore be inverted thanks to the Lambert $W$ function (the inverse relation of $f(w) = w e^w$). One thus only needs to consider the amount of gas in the reservoir as the reduced state of the reservoir. The details regarding this reduction will be discussed hereinafter. Since the optimization is done at the bottom of the well, there is only one control to consider: the bottom-hole pressure, noted $P_t$. Therefore $x_t = V_t^g$ and $u_t = P_t$.
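To make the pore-volume balance concrete, the sketch below solves it numerically for the next reservoir pressure. It deliberately swaps the Lambert function inversion mentioned in the text for a plain bisection, which needs no closed form. The function name `solve_next_pressure`, the bracket `[p_low, p_high]`, and the callables `bg` and `bw` (formation volume factors) are hypothetical; the sketch also assumes that the fluid volume minus the pore volume decreases with pressure, so that the root in the bracket is unique.

```python
def solve_next_pressure(gas_after, pore_volume, pressure, cf, bg,
                        water_volume=0.0, bw=lambda p: 1.0,
                        p_low=1.0, p_high=1000.0, tol=1e-9):
    """Find P' such that fluid volume at P' equals pore volume at P'.

    gas_after:    gas remaining after production, V^g_t - F^g_t
    pore_volume:  pore volume V^p_t at the current pressure
    pressure:     current reservoir pressure P^R_t
    cf:           pore compressibility
    bg, bw:       hypothetical gas / water formation volume factors, p -> float
    """

    def residual(p_next):
        fluids = water_volume * bw(p_next) + gas_after * bg(p_next)
        pores = pore_volume * (1.0 + cf * (p_next - pressure))
        return fluids - pores

    # The residual is assumed decreasing: positive at p_low, negative at p_high.
    if residual(p_low) < 0.0 or residual(p_high) > 0.0:
        raise ValueError("bracket [p_low, p_high] does not contain a root")
    while p_high - p_low > tol:
        mid = 0.5 * (p_low + p_high)
        if residual(mid) > 0.0:
            p_low = mid
        else:
            p_high = mid
    return 0.5 * (p_low + p_high)
```

Bisection is slower than a closed-form inversion but only requires monotonicity, which makes it a convenient cross-check when an analytical expression is available.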
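  • The optimization problem (2.1) after state and control reduction when considering one tank is given by:
    $$\begin{aligned} \max_{(V_t^g,\, P_t,\, P_t^R,\, F_t^g)}\quad & \sum_{t=0}^{T-1} \rho^t r_t F_t^g && && \text{(2.5a)} \\ \text{s.t.}\quad & V_0^g = x_0, && && \text{(2.5b)} \\ & P_t^R = \Psi_{1T}(V_t^g), && \forall t \in \mathbb{T}, && \text{(2.5c)} \\ & F_t^g = \frac{IPR(P_t^R - P_t)}{B_g(P_t^R)}, && \forall t \in \mathbb{T}, && \text{(2.5d)} \\ & V_{t+1}^g = V_t^g - F_t^g, && \forall t \in \mathbb{T}, && \text{(2.5e)} \\ & 0 \le F_t^g \le F_{\max}^g, && \forall t \in \mathbb{T}, && \text{(2.5f)} \\ & V_t^g \ge 0, && \forall t \in \mathbb{T}, && \text{(2.5g)} \\ & P_t \ge 0, && \forall t \in \mathbb{T}. && \text{(2.5h)} \end{aligned}$$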
  • In Equation (2.5c), the mapping $\Psi_{1T}$ is a function that can be computed algorithmically (as detailed hereinafter): it takes the volume of gas in the reservoir and returns the reservoir pressure, which is used to compute the production. Equation (2.5d) is the specific implementation of the general production mapping Φ applied to the production network. In this application, there is only one well. Its production is defined thanks to IPR, the inflow performance relationship of the well, which is considered to be a known piecewise linear function. For the one-tank model, the expression of the production mapping Φ is
    $$\Phi_{1T}(x_t, u_t) = \frac{IPR\big(\Psi_{1T}(x_t) - u_t\big)}{B_g\big(\Psi_{1T}(x_t)\big)} = F_t^g.$$
    For the two-tank model, the expression is
    $$\Phi_{2T}\big(x_t^{(1)}, x_t^{(2)}, u_t\big) = \frac{IPR\big(\Psi_{2T,(1)}(x_t^{(1)}) - u_t\big)}{B_g\big(\Psi_{2T,(1)}(x_t^{(1)})\big)} = F_t^g,$$
    with $\Psi_{2T,(1)}$ the function returning the pressure in the first tank. The mapping Φ has thus been split between Equations (2.5c) and (2.5d). Since there is only one well and since the IPR is strictly monotonic, the production function of Equation (2.5d) is injective. In the models considered here (one tank or two tanks), one can thus pass from the controls to the production and from the production to the controls without any ambiguity. Moreover, one can define the admissibility set of this application. Here, the graph has only one point (one well), and the bottom-hole pressure $P_t$ is controlled. As the production must be non-negative (constraint (2.5f)), the one-tank model gives
    $$\mathcal{U}^{ad}(x_t) = [0, P_t^R] = [0, \Psi_{1T}(x_t)].$$
  • One tank gas reservoir model
  • Fitting model to real data. The implementations use production data from a sector of a real gas field to check that the reservoir model described by Constraints (2.5c) and (2.5e), after fitting, accurately follows real measurements on the gas field. More precisely, the implementations apply a given real production schedule on a part of the field (only one well), and check that the simulated reservoir pressure is close to the measured pressure of that reservoir. The historical production spans 15 years with monthly values, which is why monthly time steps are considered for Problem (2.5).
  • FIG. 3 shows a comparison of the simulated one tank reservoir pressure to the historical measured pressure when applying the same (historical) production schedule. The curve is the simulated pressure in the tank, whereas the dots are the measured pressures. As can be seen in FIG. 3, the one tank model fits the observation. However, there is a gap between the simulated and measured pressures of more than 10%. Since the pressure tends to be higher on the first half of the production, the implementations start by underestimating the decline of the production. Then, during the second half of the production, the predicted pressure is lower than the measured pressure, which means the implementations overestimate the decline of the production. This elastic effect is most likely due to the simplification of removing the secondary tank in the model. Indeed, the secondary tank acts as a buffer which reacts slowly, explaining the extra pressure at the beginning and then sustaining a better value of the pressure later on.
  • Optimization of the production on the one tank approximation. The implementations use dynamic programming (Algorithm 1) to get an optimal production policy. The implementations consider that the revenue per volume of gas is the historical gas spot price of TTF (Netherlands gas market) from 2006 to 2020, and the implementations do not consider any operational cost.
  • The results of the one tank model are now discussed. The results are illustrated in FIGs. 4 and 5, and summarized in Table 2.3 below. FIG. 4 illustrates the evolution of the content of the reservoir in the one tank model. The dotted curve is the optimal trajectory of the amount of gas, while the full-line curve is the trajectory with the historical production. FIG. 5 illustrates the trajectory of the production. The dotted curve is the optimal production in the one tank model, the full-line curve is the historical production, whereas the dashed curve is the average monthly gas price. FIG. 5 shows that the production stops when prices are low, as the optimization takes full advantage of the perfect knowledge of the future prices. There is a massive increase in the total gains when using the optimal policy, compared to the real production. One also produces far more over the optimization time period (2,850 Sm³ instead of 2,250 Sm³). However, those results are not truly comparable. Indeed, since the considered case is a small part of a much larger production network, one cannot compare the results to the actual production policy used for fitting the model, which was made with the rest of the network in mind. Moreover, the optimization is made at the bottom of the well (BHFP). The implementations account only for the inflow performance of the well, not for the vertical lift necessary to bring the gas to the surface. The resulting rates are therefore not realistic, reaching values closer to a multi-well development. Furthermore, the historical production was made without knowing future prices, and could also have been made with constraints to ensure a minimal production of the field, or a positive cashflow (constraints due to the contract for exploiting the field). While not directly comparable, this gas reservoir application still illustrates one of the best-case scenarios of the dynamic programming approach, and shows how much could be gained from using a multistage material balance formulation.
  • The numerical experiments also reveal that the value function is almost an affine function that grows with the initial volume of gas in place in the reservoir. Moreover, since the Dynamic Programming algorithm uses a discretization of the state space $\mathbb{X}$ and of the control space $\mathbb{U}$, different uniform discretizations were tried for the state and control spaces to rule out side effects of the chosen discretization. No notable change in the value function is observed beyond a uniform discretization of 10000 points for the state space and 20 points for the control space, which are the values used in this paragraph. Details on the effect of the discretization are discussed later.
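  • As an illustration only, the backward induction over a uniformly discretized one-dimensional state can be sketched as follows. This is a minimal sketch and not Algorithm 1 itself: the names `interp`, `solve_dp` and `max_rate`, the zero terminal value, and the choice of the produced volume as control (instead of the bottom-hole pressure) are assumptions made for the example.

```python
from bisect import bisect_right

def interp(grid, values, x):
    """Piecewise linear interpolation of `values` given on the sorted `grid`."""
    if x <= grid[0]:
        return values[0]
    if x >= grid[-1]:
        return values[-1]
    i = bisect_right(grid, x) - 1
    w = (x - grid[i]) / (grid[i + 1] - grid[i])
    return (1 - w) * values[i] + w * values[i + 1]

def solve_dp(prices, x_max, n_states, n_controls, max_rate):
    """Backward induction; returns the state grid and the value function at t = 0.

    max_rate(x) is a hypothetical bound on the production from state x.
    """
    grid = [x_max * i / (n_states - 1) for i in range(n_states)]
    value = [0.0] * n_states              # assumed terminal value: leftover gas is worth nothing
    for price in reversed(prices):
        new_value = []
        for x in grid:
            cap = max_rate(x)             # largest admissible production from state x
            best = float("-inf")
            for k in range(n_controls):
                q = cap * k / (n_controls - 1)           # candidate production
                best = max(best, price * q + interp(grid, value, x - q))
            new_value.append(best)
        value = new_value
    return grid, value

# Toy usage: two periods, at most half of the remaining gas can be produced per period.
grid, value = solve_dp([1.0, 3.0], x_max=100.0, n_states=11, n_controls=3,
                       max_rate=lambda x: 0.5 * x)
```

    The two nested loops over states and controls are where the discretization sizes discussed above enter the computation time.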
  • Comparison of the material balance formulation to those using decline curves or oil-deliverability curves is now discussed. Precision on the decline curves formulation and how decline curves are obtained will be discussed later.
  • First, the following result (for which a proof will be given later) compares the two approaches (decline curves and dynamic programming) on the one tank model:
  • Proposition 3. The formulation using decline curves, written
    Figure imgb0102
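    $$\text{s.t.} \quad F_t^o \le g\Big(\sum_{s=0}^{t-1} F_s^o\Big), \quad \forall t \in \mathbb{T} \setminus \{0\}, \qquad (2.6b)$$
    $$u_t \in \mathcal{U}_t^{ad}\Big(\sum_{s=0}^{t-1} F_s^o\Big), \quad \forall t \in \mathbb{T}, \qquad (2.6c)$$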
    is equivalent to the material balance formulation when the state of the reservoir is one-dimensional (as in the optimization problem (2.5)).
  • The implementations generate the decline curve, g, in inequality (2.6b) of the formulation by computing the maximal production value for the same discrete states as the ones used in the dynamic programming approach. The implementations then interpolate the value of g between the different states. When using a piecewise linear approximation for the decline curves, the maximization problem (2.6) turns out to be a MIP (Mixed Integer Problem) with linear constraints and with 170829 binary variables when not using SOS2 variables. The implementations solve that MIP by using a commercial solver, Gurobi 9.1. The results are given in Table 2.3. Since the material balance formulation (2.5) uses a one-dimensional state, the implementations obtain similar results between the material balance formulation and the formulation using a decline curve, in accordance with Proposition 3. The two approaches thus yield similar production policies. Note however that the dynamic programming approach has a lower computation time than a naive implementation of the decline curve formulation. One could decrease the precision of the decline curve formulation and use fewer points to describe the decline curve. This would improve its computation time, and could have a negligible impact on the value of the optimization if the remaining points are correctly chosen.
    Table 2.3: Comparison between the material balance and decline curve formulations for one tank
                     | Dynamic Programming | Decline Curves
    CPU time (s)     | 653                 | 3882
    Value (M€)       | 743                 | 743
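  • The generation of the decline curve described above (maximal production tabulated on discrete states, then interpolated) can be sketched as follows. This is a hedged illustration for a one-dimensional state where the state equals the initial volume minus the cumulated production; the names `build_decline_curve`, `decline_rate` and `max_production` are hypothetical, and the linear production law in the usage lines is a toy assumption.

```python
from bisect import bisect_right

def build_decline_curve(x0, n_points, max_production):
    """Tabulate g(c): maximal production once a cumulated volume c has been produced."""
    cumulated = [x0 * i / (n_points - 1) for i in range(n_points)]
    # One-dimensional state: remaining volume = initial volume - cumulated production.
    rates = [max_production(x0 - c) for c in cumulated]
    return cumulated, rates

def decline_rate(cumulated, rates, c):
    """Piecewise linear interpolation of the tabulated decline curve at c."""
    if c <= cumulated[0]:
        return rates[0]
    if c >= cumulated[-1]:
        return rates[-1]
    i = bisect_right(cumulated, c) - 1
    w = (c - cumulated[i]) / (cumulated[i + 1] - cumulated[i])
    return (1 - w) * rates[i] + w * rates[i + 1]

# Toy usage: the maximal production is assumed to be 10% of the remaining volume.
cumulated, rates = build_decline_curve(100.0, 5, lambda x: 0.1 * x)
bound_after_30 = decline_rate(cumulated, rates, 30.0)
```

    Using fewer tabulation points shortens the piecewise linear description, which is the lever mentioned above for reducing the size of the MIP.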
  • Two tanks gas reservoir model
  • Fitting data. The implementations check whether the fitted two tanks reservoir model accurately follows real measurements on the gas field. The implementations use the same data as in the one tank case. The two tanks model fits the observations more accurately, as depicted in FIG. 6 (the gap is less than 5% for each measured point). FIG. 6 shows a comparison of the simulated two tanks reservoir pressure to the measured pressure when applying the same production schedule. The curve is the simulated pressure in the first tank, whereas the dots are the measured pressure at the bottom of the well. Since the two tanks model is closer to the observations, it is taken as the reference when comparing results of the one tank approximation and the two tanks model.
  • Optimal production with two tanks. The results of the two tanks model are now discussed. The only changes compared to the one tank model are on the states and the dynamics of the reservoir. The same prices are used, and once again only an optimization at the bottom of the well (BHFP) is done. Details on the obtained optimal controls and states trajectory are given in FIG.s 7 and 8. FIG. 7 illustrates the evolution of the content of the reservoirs when applying the optimal policy in the two tanks model. The dotted curve shows the content of the first tank (linked to the well) while the full-line curve shows the content of the second tank. FIG. 8 illustrates the trajectory of the optimal production in the two tanks model. The dotted curve is the optimal production, whereas the dashed curve is the monthly gas price. Once again it is observed that production stops when prices are low, benefiting fully from knowing the future prices. It is also to be noted that more "pauses" are present in the production when compared to the one tank model (four instead of three). The "pauses" allow the second tank to replenish the first one. Indeed, production resumes at months 50 to 60, before stopping again for five months. One can then observe that the amount of gas in the first tank is replenished, before one resumes production at month 65, at the same date as in the one tank model. One ends up producing somewhat more gas than with the one tank model (2,900Sm3 instead of 2,850Sm3).
  • Numerical experiments also reveal that the initial value function is almost an affine function of the sum of the states. This seems to imply that the one tank and two tanks model should yield similar results. Indeed, if the value function truly depended exclusively on the sum of the states, the optimal control would also be a function of the sum of the states.
  • Different discretizations of the state space were tried. Notably, using more than 400 possible states per tank and 10 possible controls did not yield any significant improvement in the computed value function. Details on the impact of the discretization are given later.
  • Comparing the one tank formulation to the two tanks formulation. To compare the results between the two tanks and one tank formulations, the implementations consider that the two tanks material balance model is the reference. One cannot directly compare the productions, as the production functions $\Phi_{1T}$ and $\Phi_{2T}$ of the one tank and two tanks models differ and one cannot translate the state of the two tanks model into the one tank model. One must therefore return to the actual controls, the bottom-hole pressure $P_t$. However, the production planning given by the one tank optimization is not directly admissible in the two tanks model. Indeed, the admissibility set of the system is $\mathcal{U}^{ad}(x_t) = [0, \Psi_{1T}(x_t)]$ for the one tank model, and $[0, \Psi_{2T,(1)}(x_t^{(1)})]$ for the two tanks model. If one applies the production planning of the one tank optimization without any changes, one sometimes has $P_t > P_t^{R,1} = \Psi_{2T,(1)}(x_t^{(1)})$ (with $P_t^{R,1}$ the pressure in the first tank), which is outside the admissible set of the two tanks model.
  • To create an admissible production planning from the one tank optimization, the implementations first consider that the control policy is static. One thus has a series of controls $(u_t^{\#})_{t \in \mathbb{T}}$ computed with the one tank model. To make it admissible, the implementations project those controls on the admissibility set of the two tanks model, which depends on the state. Let $\tilde{u}_t^{\#}$ be the projection of the controls, and $\tilde{x}_t^{\#}$ the states associated with that projected series. Since $(u_t^{\#})_{t \in \mathbb{T}}$ is admissible for the one tank model, those controls notably verify $u_t^{\#} \ge 0$. One only needs to check that $u_t^{\#}$ is lower than the first tank pressure. The implementations use for the projection:
    $$\tilde{u}_t^{\#} = \min\big(u_t^{\#}, \Psi_{2T,(1)}(\tilde{x}_t^{\#,(1)})\big),$$
    where $\tilde{x}_t^{\#}$ is defined by
    $$\tilde{x}_{t+1}^{\#} = f_{2T}(\tilde{x}_t^{\#}, \tilde{u}_t^{\#}), \qquad \tilde{x}_0^{\#} = x_0.$$
    This transformation of the one tank model controls makes them admissible in the two tanks optimization problem, as one now has $0 \le \tilde{u}_t^{\#} \le \tilde{P}_t^{R,1}$. One can now compare the two models and the production plannings available in the two tanks model.
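  • The projection of the one tank controls along the two tanks trajectory can be sketched as follows. This is a minimal illustration under stated assumptions: `project_controls`, `dynamics` and `first_tank_pressure` are hypothetical names standing for the projection, the two tanks dynamics and the first tank pressure mapping, and the toy two tanks model in the usage lines (linear pressure law, fixed productivity and transfer coefficients) is invented for the example.

```python
def project_controls(controls, x0, dynamics, first_tank_pressure):
    """Clip each control to the first tank pressure along the projected trajectory."""
    x = x0
    projected, states = [], [x0]
    for u in controls:
        u_proj = min(u, first_tank_pressure(x))   # projection on the admissible set
        x = dynamics(x, u_proj)                   # state follows the projected control
        projected.append(u_proj)
        states.append(x)
    return projected, states

# Toy two tanks model (assumption): pressure proportional to the first tank content.
def toy_pressure(x):
    return 2.0 * x[0]

def toy_dynamics(x, u):
    produced = 0.1 * (toy_pressure(x) - u)        # simplified inflow law
    transfer = 0.05 * (x[1] - x[0])               # flow from second to first tank
    return (x[0] - produced + transfer, x[1] - transfer)

projected, states = project_controls([250, 150, 0], (100.0, 100.0),
                                     toy_dynamics, toy_pressure)
```

    Because the admissible set depends on the state, the clipping and the state update have to be interleaved in a single forward pass.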
  • FIG. 9 illustrates the cumulated gains with the two tanks model as reference. The dotted curve is the cumulated gains of the one tank planning in the one tank model, the full-line curve is the cumulated gains of the two tanks planning in the two tanks model, and the dashed curve is the cumulated gains of the one tank planning translated for the two tanks model. FIG. 10 illustrates a comparison of the trajectory of the production with the two tanks model as reference. The dotted curve is the production planning in the one tank model, the full-line curve is for the two tanks model. The dashed curve is the production planning of the one tank model translated in the two tanks model. As depicted in FIG.s 9 and 10, the production planning given by the one tank optimization problem differs from the production planning given by the two tanks optimization problem. Moreover, the production planning of the one tank model gives lower gains than anticipated, and is worse than the optimal two tanks model planning. The one tank optimization is thus optimistic on the optimal value of the problem when applied in the reference model. Moreover, there is a 5% difference in value between the one tank and two tanks models (a value of 703M€ for the translated one tank production planning against 736M€ for the two tanks production planning). This discrepancy illustrates how having a more accurate model of the reservoir can have a substantial impact on the optimal planning, all other things being equal. It also shows that, contrary to the assumption presented at the end of the previous paragraph (that the two models could yield similar results if the value function only depended on the sum of the states), the optimal value and control cannot be found with a one tank approximation, and the optimal controls and value functions are not functions of the sum of the states.
  • Comparison to decline curves with two tanks. The decline curve and the material balance formulations were numerically compared in a context where they are known to be equivalent, that is the one tank formulation. Numerical experiments are now discussed in a context where the equivalence is not assured: two tanks connected with a known transmissibility. Decline curves were generated for the two tanks formulation by following a procedure described later. As with the comparison between the one tank and the two tanks models, one considers that the two tanks model is the reference. The result returned by the decline curve formulation is an admissible production in the two tanks model, as it is constrained by an admissible production schedule. One can therefore directly compare the values between the two approaches. The results of the optimization of the two formulations are compiled in Table 2.4. One ends up having close results, with a difference in optimal values of 0.5%, but with a large difference in computing times. However, it appeared that such close results were due to the selected price scenario. Using different prices by randomizing the order in which the different prices appear, the gap between the two approaches widens from 0.5% up to 4%. This implies that the initial price sequence considered was an almost best-case scenario for the decline curves approach. It also shows that the decline curves approach is far less robust to changes in the price data, and that it cannot benefit as efficiently as the material balance formulation from some effects of the two tanks dynamical system, such as waiting for the second tank to empty itself into the first one.
    Table 2.4: Comparison between the material balance and decline curve formulations for two tanks with the initial prices sequence
                          | CPU time (s) | Value (M€)
    Dynamic Programming   | 706          | 736
    Decline Curves        | 7825         | 731
  • Overall, this application demonstrates that the material balance approach can work on complex cases, and that dynamic programming is well suited to optimize an oil production network. Moreover, there can be differences with results from the decline curves approach, which are likely to grow larger with the complexity of the system.
  • An oil reservoir with water injection
  • The second application is an oil reservoir with water injection. The goal is to demonstrate how the formulation can be used beyond primary recovery cases, on a numerically simple case. It is considered that one has one reservoir which contains both oil and water, produced under pressure maintenance by water injection. Moreover, it is considered that the initial pressure is above the bubble-point, which eliminates the possibility of having free gas in the reservoir. This once again allows a one-dimensional state: either the water (which is used for the numerical applications), or the oil in the reservoir. One has $x_t = V_t^w$ and $u_t = P_t$. The optimization problem (2.1) now reduces to
    Figure imgb0120
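    $$\text{s.t.} \quad w_t^{CT} = w_{ct}\Big(\frac{V_t^w B_w(P^R)}{V^p}\Big), \quad \forall t \in \mathbb{T}, \qquad (2.7b)$$
    $$V_{t+1}^w = V_t^w - \frac{\alpha (P^R - P_t)}{B_w(P^R)} \big(w_t^{CT} - 1\big), \quad \forall t \in \mathbb{T}, \qquad (2.7c)$$
    $$F_{\min}^w \le \frac{\alpha (P^R - P_t)}{B_w(P^R)} \big(w_t^{CT} - 1\big) \le F_{\max}^w, \quad \forall t \in \mathbb{T}, \qquad (2.7d)$$
    $$F_{\min}^o \le \frac{\alpha (P^R - P_t)}{B_o(P^R)} \big(1 - w_t^{CT}\big) \le F_{\max}^o, \quad \forall t \in \mathbb{T}, \qquad (2.7e)$$
    $$P_t \ge 0, \quad \forall t \in \mathbb{T}. \qquad (2.7f)$$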
  • It is assumed that the water-cut $w_{ct}$ (the amount of water produced when extracting one cubic meter) is given by a piecewise linear function. The water-cut depends on the water saturation $S^w$ (proportion of water in the reservoir). Since the reservoir pressure is kept constant, the total pore volume is constant and the water saturation is thus $S_t^w = \frac{V_t^w B_w(P^R)}{V^p}$. This gives constraint (2.7b). Since a constant pressure is to be kept in the reservoir, one needs to re-inject enough water to replace the extracted oil. Replacing the oil with water gives new dynamics for $V_t^w$, given in Equation (2.7c) (details are discussed later). Equations (2.7d), (2.7e) and (2.7f) are constraints on the controls. Indeed, it is considered that the production follows a simplified Darcy's law
    $$F_t = \alpha (P^R - P_t), \qquad (2.8)$$
    with $\alpha$ the productivity index of the well, $P_t$ the bottom-hole pressure of the well and $F_t$ the total production (mix of oil and water). Combining Equation (2.8), the water-cut and the dynamics (2.7c), one gets the constraints (2.7d) and (2.7e). A monthly optimization is done, with the 2000 to 2020 historical Brent prices for oil as the prices in the objective function (2.7a).
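  • One time step of the water dynamics (2.7c), built from the water saturation, a piecewise linear water-cut and the simplified Darcy's law (2.8), can be sketched as follows. This is a hedged illustration: the function names, the breakpoint representation of the water-cut and all numerical values are assumptions made for the example, not data from the application.

```python
def water_cut(saturation, breakpoints):
    """Piecewise linear water-cut; breakpoints is a sorted list of (saturation, water-cut)."""
    if saturation <= breakpoints[0][0]:
        return breakpoints[0][1]
    for (s0, w0), (s1, w1) in zip(breakpoints, breakpoints[1:]):
        if saturation <= s1:
            return w0 + (w1 - w0) * (saturation - s0) / (s1 - s0)
    return breakpoints[-1][1]

def step(v_w, p_bh, *, alpha, p_res, b_w, b_o, v_p, breakpoints):
    """Return the next water volume and the oil produced over one time step."""
    s_w = v_w * b_w / v_p                          # water saturation at constant pore volume
    w_ct = water_cut(s_w, breakpoints)             # water-cut, as in (2.7b)
    total = alpha * (p_res - p_bh)                 # total production, Darcy's law (2.8)
    oil = total * (1.0 - w_ct) / b_o               # oil produced, as bounded in (2.7e)
    next_v_w = v_w - total * (w_ct - 1.0) / b_w    # water dynamics (2.7c)
    return next_v_w, oil

# Toy usage with invented values.
params = dict(alpha=2.0, p_res=100.0, b_w=1.0, b_o=1.25, v_p=1000.0,
              breakpoints=[(0.2, 0.0), (0.8, 1.0)])
next_v_w, oil = step(500.0, 50.0, **params)
```

    In this sketch the water volume gained in the reservoir equals the reservoir volume of the oil produced, which is the pressure maintenance condition stated above.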
  • As previously discussed, the optimal policy yields more production when prices are high, and stops producing when they are low. The production goes from one bound to the other (zero production, with $P_t = P^R$, and full production, with $P_t = 0$).
  • The production also does not fully deplete the reservoir, which means that it is not advantageous to completely deplete the reservoir if one wants to maximize the profit over the optimization time frame. Indeed, production slowly diminishes with the "stock" of oil in the reservoir. It is more advantageous to wait for high prices before producing, as producing reduces the possible future production. This leads to leaving some residual oil in the reservoir, as it is preferred to wait for a higher price instead of producing when prices are low. As a side effect, numerical experiments reveal that the initial value function is almost linear. However, only simple constraints on the production are considered here. As more constraints are added to the problem, other behaviors will certainly appear. The CPU time was 1,575 s for a discretization of 100,000 points, with a value of 3,376 M€. The impact of the discretization is discussed later.
  • Overall, this application shows how one can apply the material balance approach beyond first recovery of oil and gas, and that it can be used on different kinds of reservoir.
  • A new formulation for the management of an oil production system, based on the classical material balance equations, has been presented. This formulation, where the reservoir is a controlled dynamical system, is amenable to a dynamic programming approach. As has been shown, this approach gives good results in different cases with either oil or gas. Moreover, the dynamic programming algorithm can naturally be parallelized; therefore, the approach can scale to more complex cases.
  • It has also been shown that this material balance formulation gets better results than formulations based on decline curves. First, one gets the same results between the material balance and decline curves formulations when considering the first recovery of a one tank system. Second, one can efficiently apply the material balance formulation when considering multiple connected tanks. This is not possible for decline curves, as they need a given production schedule to be computed. Third, one can apply the material balance formulation to cases which go beyond the first recovery of hydrocarbons. Indeed, as shown above, one can take into account water injection. Moreover, one does not need to assume that wells are independent, or that they are all bundled with the same cumulated production. Optimization done using the material balance formulation can account for interactions between wells and tanks.
  • Finally, the dynamic programming algorithm can be used in a stochastic framework. The material balance formulation is amenable to tackle uncertainties on the prices, instead of assuming that prices are known in advance. This will render the optimization process more realistic, as an optimal production policy is highly dependent on prices.
  • Detailed construction of the reservoir as a dynamical system
  • The construction of the reservoir as a dynamical system is now discussed. This serves as the proof of Proposition 2.
  • Constitutive equations assuming the Black-oil model for the fluids
  • The black-oil model relies on the assumption that there are at most three fluids in the reservoir: oil, gas and water. The fluids can be in up to two phases in the reservoir: a liquid phase, and a gaseous phase. In the liquid phase, one can have a mix of oil with dissolved gas, and water. In the gaseous phase, one can only have free gas. This can be seen in FIG. 11, which is a representation of a reservoir in the black-oil model.
  • The implementations use standard volume for each of the following components:
    • Vo for oil (in the liquid phase)
    • Vg for the free gas (in the gaseous phase)
    • Vdg for the dissolved gas (in the liquid phase)
    • Vω for the water (in the liquid phase)
  • It is considered that the temperature in the reservoir is stationary and uniform (this is a common assumption for a geological formation such as a reservoir). One needs to compute the volume taken by those fluids at different reservoir pressures (temperature is not needed, as it is stationary). To do so, the implementations use PVT functions (Pressure-Volume-Temperature) that have been measured on lab samples. Those functions are defined in Table 2.5 below. It is known that, for a given amount of fluids, the volume taken by that mix is decreasing with pressure. This is useful when proving the uniqueness of the pressure for a given production of the fluids.
    Table 2.5: Definition of the PVT functions
    Notation | Description
    $B_o$    | Oil formation volume factor. It is the volume in barrels occupied in the reservoir, at the prevailing pressure and temperature, by one stock tank barrel of oil plus its dissolved gas. (unit: rb/stb)
    $B_g$    | Gas formation volume factor. It is the volume in barrels that one standard cubic foot of gas will occupy as free gas in the reservoir at the prevailing reservoir pressure and temperature. (unit: rb/scf)
    $B_w$    | Water formation volume factor. It is the volume occupied in the reservoir by one stock tank barrel of water. (unit: rb/stb)
    $R_s$    | Solution (or dissolved) gas. It is the number of standard cubic feet of gas which will dissolve in one stock tank barrel of oil when both are taken down to the reservoir at the prevailing reservoir pressure and temperature. (unit: scf/stb)
  • It is considered that the reservoir acts like a tank. This means that its structural integrity is guaranteed, so there is no leakage of any fluids, and there will not be any in the future either. One can therefore write material balance equations (or mass conservation) for each fluid of the reservoir. Let $F^o$, $F^g$ and $F^w$ be the amounts of fluids produced (respectively oil, gas and water).
  • Using material balance for the oil, one gets
    $$V_{t+1}^o = V_t^o - F_t^o, \quad \forall t \in \mathbb{T}, \qquad (2.9)$$
    and, for the water,
    $$V_{t+1}^w = V_t^w - F_t^w, \quad \forall t \in \mathbb{T}. \qquad (2.10)$$
  • The gas phase requires some more development. At any time, the total amount of dissolved gas in the oil $V^{dg}$ is a function of the amount of oil and the pressure:
    $$V^{dg} = \delta(V^o, P^R) = V^o \times R_s(P^R).$$
  • Between time t and t + 1, the amount of dissolved gas thus evolves from $V_t^{dg} = \delta(V_t^o, P_t^R)$ to $V_{t+1}^{dg} = \delta(V_{t+1}^o, P_{t+1}^R)$.
  • Therefore, the quantity of liberated gas $V_t^{dg} - V_{t+1}^{dg}$ must be added to the gas mass conservation equation. Thus, one has a mass conservation equation for the free gas that can be written as
    $$V_{t+1}^g = V_t^g - F_t^g + \underbrace{V_t^{dg} - V_{t+1}^{dg}}_{\text{liberated gas}} = V_t^g - F_t^g + V_t^o R_s(P_t^R) - V_{t+1}^o R_s(P_{t+1}^R) = V_t^g - F_t^g + V_t^o R_s(P_t^R) - (V_t^o - F_t^o) R_s(P_{t+1}^R)$$
    (using Equation (2.9) to transform $V_{t+1}^o$ into an expression depending only on t).
  • Hence, one gets
    $$V_{t+1}^g = V_t^g - F_t^g + V_t^o \big(R_s(P_t^R) - R_s(P_{t+1}^R)\big) + F_t^o R_s(P_{t+1}^R), \quad \forall t \in \mathbb{T}. \qquad (2.12)$$
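  • The three balance equations above can be checked numerically with a short sketch. The function name `update_volumes` and the numerical values are assumptions made for the example; the solution gas ratios at the old and new pressures are passed directly as numbers, since computing the new pressure is a separate step.

```python
def update_volumes(v_o, v_g, v_w, f_o, f_g, f_w, rs_old, rs_new):
    """Apply the oil, water and free gas balances over one time step.

    rs_old and rs_new are the solution gas ratios at the pressures of
    time t and t + 1 respectively.
    """
    next_o = v_o - f_o                                        # oil balance (2.9)
    next_w = v_w - f_w                                        # water balance (2.10)
    next_g = v_g - f_g + v_o * (rs_old - rs_new) + f_o * rs_new   # free gas balance (2.12)
    return next_o, next_g, next_w

# Toy usage with invented values.
next_o, next_g, next_w = update_volumes(1000.0, 5000.0, 200.0,
                                        f_o=10.0, f_g=300.0, f_w=5.0,
                                        rs_old=0.5, rs_new=0.4)
```

    With these equations, the sum of free and dissolved gas at time t + 1 equals the sum at time t minus the gas production, which gives a simple consistency check on any implementation.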
  • All the fluids are kept in the pores of the reservoir rocks. Let $V^p$ be the total pore volume of the reservoir. As known from prior art, it is considered that the pore compressibility is stationary, hence the total pore volume follows:
    $$\frac{V_{t+1}^p - V_t^p}{V_t^p} = c_f \big(P_{t+1}^R - P_t^R\big), \quad \forall t \in \mathbb{T}. \qquad (2.13)$$
  • The saturations of the fluids are the proportions of the available volume taken by each fluid in the reservoir. Let $S^o$, $S^g$ and $S^w$ be the saturations of oil, free gas and water. In the reservoir, the saturations of all the fluids sum to one:
    $$S_t^o + S_t^g + S_t^w = 1, \quad \forall t \in \mathbb{T},$$
    rewritten as
    $$V_t^o \times B_o(P_t^R) + V_t^g \times B_g(P_t^R) + V_t^w \times B_w(P_t^R) = V_t^p, \quad \forall t \in \mathbb{T}. \qquad (2.15)$$
  • Reservoir dynamics
  • The state of the reservoir is defined in the implementations as $x = (V^o, V^g, V^w, V^p, V^R)$. The controls $u_t$ considered are decisions made upon the production network, such as opening or closing a pipe, or choosing the well-head or bottom-hole pressure. Since the conservation laws in the reservoir are written with the production values, the production function $\Phi$ is used to transform those controls into production values:
    $$\Phi(x, u) = (F^o, F^g, F^w). \qquad (2.16)$$
  • Thanks to Equations (2.9), (2.10), (2.12), (2.13), (2.15) and (2.16) one can define the dynamics f on the state x and controls u. Indeed, if one writes Equation (2.15) at time t + 1, and expresses those quantities as functions of the quantities at time t thanks to Equations (2.9), (2.10), (2.12) and (2.13), one gets:
    $$(V_t^o - F_t^o) \times B_o(P_{t+1}^R) + (V_t^w - F_t^w) \times B_w(P_{t+1}^R) + \Big(V_t^g - F_t^g + V_t^o \times \big(R_s(P_t^R) - R_s(P_{t+1}^R)\big) + F_t^o \times R_s(P_{t+1}^R)\Big) \times B_g(P_{t+1}^R) = V_t^p \Big(1 + c_f \big(P_{t+1}^R - P_t^R\big)\Big). \qquad (2.17)$$
  • According to prior art, the left-hand side of Equation (2.17) is decreasing with the new reservoir pressure $P_{t+1}^R$. The volume gained by the oil when the gas dissolves into oil due to an increase in pressure $\Delta P$ is lower than the aggregated decrease of volume of the free gas and the other fluids due to that same $\Delta P$. On the contrary, the right-hand side is increasing with the reservoir pressure. Hence, there is a function $\Xi$ defined on $\mathbb{X} \times \mathbb{U}$ such that, for all $t \in \mathbb{T}$, $P_{t+1}^R = \Xi(x_t, u_t)$. When the PVT functions ($B_o$, $B_g$, $B_w$, $R_s$) are considered piecewise linear, the function $\Xi$ can be computed efficiently (according to Algorithm 2 discussed below). Combining Equations (2.9), (2.10), (2.12), (2.13) and using function $\Xi$, the expression of function f of Equation (2.3) follows.
    Figure imgb0145
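  • Because one side of the balance decreases and the other increases with the new reservoir pressure, the pressure can also be found by a generic root search. The sketch below uses plain bisection, which is a swapped-in technique for illustration and not Algorithm 2 (which exploits the piecewise linear PVT functions); the name `solve_pressure`, the bracket and the toy volume functions are assumptions.

```python
def solve_pressure(lhs, rhs, p_low, p_high, tol=1e-9):
    """Find p with lhs(p) = rhs(p), assuming lhs is decreasing and rhs is increasing.

    lhs(p): volume occupied by the fluids at pressure p.
    rhs(p): pore volume at pressure p.
    """
    def gap(p):
        return lhs(p) - rhs(p)            # decreasing in p under the assumptions
    if gap(p_low) < 0 or gap(p_high) > 0:
        raise ValueError("no balance pressure inside the bracket")
    while p_high - p_low > tol:
        mid = 0.5 * (p_low + p_high)
        if gap(mid) > 0:
            p_low = mid                   # fluids still take too much room: raise pressure
        else:
            p_high = mid
    return 0.5 * (p_low + p_high)

# Toy usage: invented linear fluid volume and pore volume laws.
fluids = lambda p: 1000.0 - 2.0 * p
pores = lambda p: 500.0 * (1.0 + 0.001 * (p - 100.0))
pressure = solve_pressure(fluids, pores, 0.0, 1000.0)
```

    The monotonicity of both sides is what guarantees that the bracket contains at most one solution.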
  • Material on state reduction
  • How the general dynamics can be simplified in simpler cases is now discussed.
  • Gas reservoir state reduction
  • A gas reservoir where there is no water production is considered here. This is used for the first application previously discussed. In the case of a gas reservoir with a constant amount of water, one can reduce the state to $x_t = (V_t^g)$. Considering Equation (2.13), since the pore compressibility and the temperature are considered constant, one has
    $$\Big(\frac{\partial V^p}{\partial P^R}\Big)_T = c_f V^p. \qquad (2.18)$$
  • By integrating (2.18) along $P^R$, one can then express the total pore volume as a function of the pressure:
    $$V^p = V_0 e^{c_f P^R}. \qquad (2.19)$$
  • Now the equation stating that the volume of fluids must be equal to the total pore volume (Equation (2.4)) can be rewritten:
    $$V^w \times B_w(P_{t+1}^R) + (V_t^g - F_t^g) \times B_g(P_{t+1}^R) = V_0 e^{c_f P_{t+1}^R}. \qquad (2.20)$$
  • The left-hand side of Equation (2.20) is a decreasing continuous piecewise linear function of the pressure (the volume of gas and the production being known) whereas the right-hand side is an increasing and continuous function of the pressure. This implies that there is a unique reservoir pressure which verifies Equation (2.20). Moreover, since the left-hand side is piecewise linear, one can compute the reservoir pressure thanks to the Lambert $W$ function (the inverse relation of $f(w) = w e^w$), and since pressure is positive, the principal branch $W_0$ of the Lambert function is used. Finally, one obtains a mapping $\Psi$ such that
    $$P^R = \Psi(V^g).$$
    $\Psi$ can be computed by adapting Algorithm 2, and using the Lambert $W$ function instead of the root of a second order polynomial. Therefore, the state is reduced to $x_t = V_t^g$, which leads to the formulation (2.5).
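  • As a worked illustration of how the Lambert function enters, consider one linear piece of the left-hand side of Equation (2.20), written $a P + b$ with slope $a < 0$. The coefficients $a$ and $b$ and the auxiliary variable $y$ are notations introduced here for the example only; the derivation is a sketch under these assumptions, with $c_f > 0$ and $V_0 > 0$.

```latex
\begin{align*}
a P + b &= V_0 \, e^{c_f P}
  && \text{(balance on one linear piece)} \\
y &:= -c_f \Big(P + \tfrac{b}{a}\Big)
  \;\Longrightarrow\; P = -\tfrac{b}{a} - \tfrac{y}{c_f}
  && \text{(change of variable)} \\
-\tfrac{a}{c_f}\, y &= V_0 \, e^{-c_f b / a} \, e^{-y}
  && \text{(substitution)} \\
y \, e^{y} &= -\tfrac{c_f V_0}{a} \, e^{-c_f b / a} \;>\; 0
  && \text{(since } a < 0\text{)} \\
P &= -\tfrac{b}{a} - \tfrac{1}{c_f} \, W_0\!\Big(-\tfrac{c_f V_0}{a} \, e^{-c_f b / a}\Big)
  && \text{(principal branch)}
\end{align*}
```

    Since the argument of the Lambert function is positive, the principal branch gives the only real solution on that piece; the candidate pressure is retained only if it falls inside the pressure interval on which the piece $a P + b$ is valid.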
  • Oil reservoir with water injection state reduction
  • The case of an oil reservoir where water injection is used to keep the reservoir pressure constant is now considered. This is the focus of the previously-discussed second application. In this case, one can reduce the state to $x_t = V_t^w$. One can consider having two controls, the bottom-hole pressure $P_t$ and the water injection $F_t^{wi}$. However, the water injection is constrained by the production, hence by the bottom-hole pressure, and thus is not present as a control in the problem formulation. Since pressure is to be kept constant, one needs to re-inject enough water to replace the oil. Keeping the pressure constant means that the pore volume is constant. Moreover, it is considered that the reservoir pressure is higher than the bubble point pressure, which allows considering that the amount of free gas $V^g$ is null. By using Equation (2.15) on time steps t and t + 1, one obtains:
    Figure imgb0155
    which is simplified as:
    $$\big(F_t^{wi} - \Phi^w(V_t^w, P_t)\big) \times B_w(P^R) = \Phi^o(V_t^w, P_t) \times B_o(P^R).$$
  • The constraint on the net water production can therefore be rewritten:
    $$F_t^w = -F_t^o \, \frac{B_o}{B_w}. \qquad (2.23)$$
  • $\Phi$ can now be expressed as a function of Darcy's law (Equation (2.8)) and using the water-cut function to obtain the total oil produced:
    $$F_t^o = \frac{\alpha (P^R - P_t)}{B_o} \big(1 - w_{ct}(V_t^w)\big). \qquad (2.24)$$
  • By combining Equations (2.10), (2.23) and (2.24), one obtains the dynamics shown in Equation (2.7c).
  • Details on the impact of the discretization
  • One tank gas reservoir. In this application, different discretization values have been tried for the state and control spaces. Results get better each time the number of states or controls used in the loops of Algorithm 1 is increased. The optimal values and CPU times are compiled in Table 2.6. The discretization of the control space has less impact than the discretization of the state space (there is no improvement using more than 10 possible controls). 50 possible controls are used for the rest of the analysis to ensure there is no issue due to the control space. Moreover, the computation time grows linearly with the number of controls, hence one is only penalized by a factor of 5 for the computation time compared to the most efficient level for the discretization of the controls. One can also remark that going beyond 10000 points for the state discretization yields no discernible improvement (less than 0.2%). However, the computation time grows faster than linearly with the state discretization (see Table 2.6). Hence 10000 points for the states and 20 controls were used.
  • Two tanks gas reservoir. Different discretization values were tried for the two reservoirs: 200 × 200 (i.e. the two reservoirs are discretized with 200 points each), 400 × 400, 600 × 600 and 1000 × 1000. Results are summarized in Table 2.7, which shows the computation time of the optimization and the optimal value obtained. As can be seen, the computation time grows faster than linearly with the number of discretization points, as one needs to compare more and more values with a finer discretization. However, performance remains reasonable for the number of time steps considered. One can also remark that going past a 200 × 200 discretization of the states of the reservoir does not improve the optimal value. A very small impact is observed from the discretization of the controls. Indeed, almost no improvement is obtained above 10 possible controls (50 possible controls were hence used in Table 2.7 to ensure the discretization of the controls does not influence the analysis of the discretization of the state). All the results of §2.4.1.2 were therefore computed with the 400 × 400 discretization for the states, and 20 for the controls.
    Table 2.6: Summary of the impact of the discretization of the state space on the one tank formulation, with 50 possible controls
    State discretization | Value (M€) | CPU time (s)
    100                  | 602        | 1.25
    200                  | 689        | 1.45
    500                  | 725        | 2.5
    1000                 | 736        | 7.5
    2000                 | 740        | 25.2
    5000                 | 742        | 110
    10000                | 743        | 653
    20000                | 743        | 2288
    50000                | 743        | 8142
    Table 2.7: Impact of the discretization of the state space on the two tanks model, with 50 possible controls
    State discretization | CPU time (s) | Value (M€)
    50 × 50              | 5.1          | 730
    100 × 100            | 28.3         | 735
    200 × 200            | 115.3        | 736
    400 × 400            | 706          | 736
    600 × 600            | 3893         | 736
    1000 × 1000          | 18089        | 736
  • Oil reservoir with water injection. Different values for the discretization of the state space of this problem were tried. However, the discretization of the controls had no impact, as the controls only took two different values: either no production, or production at the maximal rate. Therefore 10 possible controls were chosen to be sure never to miss another behavior during the analysis of the impact of the discretization of the states. Table 2.8 compiles the time to solve and the associated results of the optimization depending on the number of points considered for the discretization of the state space. It is to be noted that little is gained by going from 10,000 discretization points to 100,000 (the value rises from 3358 M€ to 3376 M€), whereas the computation time grows by a factor of about 130.
    Table 2.8: Summary of the dynamic programming results for the oil reservoir with water injection
    Discretization | Time steps | CPU time (s) | Value (M€)
    1000           | 240        | 0.35         | 3182
    10000          | 240        | 12.05        | 3358
    100000         | 240        | 1575         | 3376
  • Additional material on the decline curves formulation
  • Usually, formulations using decline curves, as can be seen in the prior art, are of the form:
    Figure imgb0159
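    $\text{s.t.} \quad F_t^o \le g\Big(\sum_{s=0}^{t-1} F_s^o\Big), \quad \forall t \in \mathcal{T} \setminus \{0\},$
    Figure imgb0160
    $u_t \in \mathbb{U}_t^{ad}\Big(\sum_{s=0}^{t-1} F_s^o\Big), \quad \forall t \in \mathcal{T}.$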
    Figure imgb0161
  • Using decline curves, or oil deliverability curves, means using Equation (2.25b) to predict the reservoir's behavior. It states that the maximal rate at time t only depends on the cumulated production until time t. In the general case, there is no reason to believe that there is an equivalence between a material balance model for the reservoir and a decline curve represented with function g. However, when the state of the material balance formulation can be reduced to a one dimensional state (such as a reservoir which only contains gas), there can be an equivalence between the decline curve and the material balance formulations, as was stated in Proposition 3.
  • Proof of Proposition 3. Let us consider the component $\Phi^g$, defined on $\mathbb{X} \times \mathbb{U}$,
    Figure imgb0162
    of the production function $\Phi$ such that: $F_t^g = \Phi^g(x_t, u_t).$
    Figure imgb0163
    Therefore, we have: $F_t^g \le \max_{u} \Phi^g(x_t, u).$
    Figure imgb0164
    Moreover, having a one-dimensional state greatly simplifies the dynamics, as only one fluid needs to be considered. The dynamics thus simplify to: $x_{t+1} = f(x_t, u_t) = x_t - F_t^g.$
    Figure imgb0165
    By propagating the simplified dynamics (2.28) and by re-injecting it in Equation (2.27), we get:
    Figure imgb0166
    Hence, Equation (2.29) defines the function g. The equivalence exists when the state is reduced to one dimension (as similar reasoning can be applied to the other one-dimensional cases).
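  • The one-dimensional argument can be illustrated numerically. The sketch below is an assumption-laden toy: the linear `max_rate` function (standing in for the maximum of the production function over the controls), the control scaling in `simulate` and the initial stock are all hypothetical and are not taken from the source.

```python
def max_rate(x):
    """Hypothetical stand-in for max_u Phi^g(x, u): maximal rate for a gas stock x."""
    return 0.1 * x

def simulate(x0, controls):
    """Material balance x_{t+1} = x_t - F_t, with F_t = u_t * max_rate(x_t), u_t in [0, 1]."""
    x, rates = x0, []
    for u in controls:
        rate = u * max_rate(x)
        rates.append(rate)
        x -= rate
    return rates

def g(x0, cumulated):
    """Decline curve: maximal rate as a function of the cumulated production only."""
    return max_rate(x0 - cumulated)

rates = simulate(100.0, [1.0, 0.5, 0.0, 1.0])
# Bound given by the decline curve at each time step
bounds = [g(100.0, sum(rates[:t])) for t in range(len(rates))]
```

  • Whatever the controls, each produced rate stays below the value of `g` evaluated at the cumulated production, and the bound is attained when the control is maximal.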
  • However, in more complex cases, such as a reservoir with both oil and gas, or when there is water encroachment (influx of water in the reservoir from the aquifer), the state cannot be reduced to one dimension. Decline curves, or oil deliverability curves, will not be equivalent to the material balance system, as they can only represent a one-dimensional dynamical system, where the state is the cumulated production. If one has a state that cannot be reduced to one dimension, one can still propagate the dynamics in Equation (2.26): $F_t^g = \Phi^g(x_t, u_t) = \Phi^g\big(f(f(\dots f(x_0, u_0), \dots), u_{t-1}), u_t\big).$
    Figure imgb0167
  • However, there is no reason to believe that such a function g exists in the general case, contrary to the one-dimensional case. This is why oil-deliverability curves are generated for a given production planning, i.e. a series of controls applied to the reservoir. Given a series of admissible controls
    Figure imgb0168
    one can create an oil-deliverability curve, that takes as argument the total cumulated production and returns the maximal possible production. It however depends on the underlying production planning
    Figure imgb0169
    . One can create such function
    Figure imgb0170
    through the Algorithm 3 discussed below. Once one has a list of points of
    Figure imgb0171
    , one considers a linear interpolation between those points as the decline curve we use in the optimization problem (2.6). In prior art solutions, decline curves are used, i.e. oil-deliverability curves with natural depletion at the maximal rate. This means that there is no injection, and the production planning consists of maximal production rates. One can generate those decline curves with a tweaked version of the previous procedure (see Algorithm 4 below).
    Figure imgb0172
    Figure imgb0173
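  • Algorithms 3 and 4 are only available as images, so the following is not a transcription of them. It is a minimal sketch of the idea described above, under stated assumptions: the names `deliverability_points` and `interpolate`, the one-tank `step` and `max_rate` functions and the numerical values are all hypothetical.

```python
from bisect import bisect_right

def deliverability_points(x0, plan, step, max_rate):
    """Apply a production plan and record (cumulated production, maximal rate) pairs."""
    x, cumulated, points = x0, 0.0, []
    for u in plan:
        points.append((cumulated, max_rate(x)))
        x, produced = step(x, u)
        cumulated += produced
    points.append((cumulated, max_rate(x)))
    return points

def interpolate(points, c):
    """Piecewise-linear interpolation of the recorded points, clamped at both ends."""
    xs = [p[0] for p in points]
    if c <= xs[0]:
        return points[0][1]
    if c >= xs[-1]:
        return points[-1][1]
    i = bisect_right(xs, c)  # xs[i - 1] <= c < xs[i]
    (c0, r0), (c1, r1) = points[i - 1], points[i]
    return r0 + (r1 - r0) * (c - c0) / (c1 - c0)

# Hypothetical one-tank model: maximal rate proportional to the remaining stock.
max_rate = lambda x: 0.1 * x
step = lambda x, u: (x - u * max_rate(x), u * max_rate(x))

# Producing at the maximal rate at every step mimics natural depletion.
curve = deliverability_points(100.0, [1.0, 1.0, 1.0], step, max_rate)
```

  • Calling `interpolate(curve, c)` then returns the maximal rate associated with a cumulated production `c`. A different plan passed to `deliverability_points` generally yields a different curve when the state has more than one dimension.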
  • Deterministic Partially Observed Markov Decision Process
  • The mathematical tools and objects used when considering the optimization of a controlled dynamical system under partial observation are now discussed: Partially Observed Markov Decision Processes (POMDP). An extensive literature exists on POMDP, most of which focuses on the infinite horizon case. Indeed, POMDP can be applied to numerous fields, from medical models to robotics. Algorithms based on Dynamic Programming were designed to exploit specific structures in POMDP in order to solve this difficult class of problems. They do so by first reformulating the problem through the use of beliefs (distributions over the state space). One such algorithm is SARSOP. However, POMDP is still a very difficult class of problems, and is often intractable in the general case. Instead of focusing on the general POMDP, a subclass that is relevant for the oil & gas case is now presented: det-POMDP. That subclass of problems is well-known. Moreover, implementations of the method manipulate an even simpler class that is tractable: mon-det-POMDP. Indeed, that new class of problems uses a property of the dynamics and observations to push back the curse of dimensionality.
  • The det-POMDP class of problems and the main complexity results in the case of finite horizon problems are now discussed. Then, the mon-det-POMDP class will be discussed, i.e. the class where some conditions are added on the dynamics of the system, which leads to improved complexity bounds. Finally, mon-det-POMDP will be illustrated with a toy problem: emptying a bathtub when considering partial observations of the level of water.
  • Det-POMDP
  • det-POMDP stands for Deterministic Partially Observed Markov Decision Process, and corresponds to the subclass of POMDP where uncertainties are only present in the initial state of the system. That is, the transitions from one state to another are deterministic, as are the observations mappings that give the observations knowing the current state and the current control of the system.
  • Let $(\Omega, \mathcal{F}, \mathbb{P})$ be a probability space, where $\Omega$ is the set of possible outcomes, $\mathcal{F}$ is the associated σ-field and $\mathbb{P}$ is a probability measure. Let $\mathcal{T} = \{0, \dots, T\}$ be the discrete time span, on which three processes are defined: $X = (X_t)_{t \in \mathcal{T}}$, $U = (U_t)_{t \in \mathcal{T}}$ and $O = (O_t)_{t \in \mathcal{T}}$. For all $t$, $X_t : \Omega \to \mathbb{X}$, $U_t : \Omega \to \mathbb{U}$ and $O_t : \Omega \to \mathbb{O}$ are random variables representing respectively the state, the control and the observation variables of the system at time $t$.
  • It is assumed that the state process $X$ follows the deterministic dynamics $f_t$, i.e. $\mathcal{L}(X_0) = \mu_0$
    Figure imgb0185
    Figure imgb0186
    where µ 0 is the (known) distribution of the initial state.
  • It is assumed that the observations are given by the deterministic observation functions $h_t$: $O_0 = h_0(X_0)$
    Figure imgb0187
    Figure imgb0188
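  • It is assumed that for all times $t$, there is a set-valued mapping $\mathbb{U}_t^{ad} : \mathbb{X} \rightrightarrows \mathbb{U}$
    Figure imgb0189
    that defines the admissible controls given a state $x$. The controls must therefore verify: $U_t(\omega) \in \mathbb{U}_t^{ad}\big(X_t(\omega)\big), \quad \forall \omega \in \Omega, \ \forall t \in \mathcal{T}.$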
    Figure imgb0190
  • The following more compact notation will be used: $U_t \in \mathbb{U}_t^{ad}(X_t)$ a.s., $\forall t \in \mathcal{T}.$
    Figure imgb0191
  • Moreover, the decision at time $t$ is taken knowing the history of controls and observations up to time $t$. Accordingly, the control $U_t$ is a function of the observations up to time $t$ and of the controls up to time $t-1$, which means that $U_t$ has to be measurable with respect to the σ-field generated by $(O_0, \dots, O_t, U_0, \dots, U_{t-1})$. This non-anticipativity constraint is written as: $\sigma(U_t) \subset \sigma(O_0, \dots, O_t, U_0, \dots, U_{t-1}), \quad \forall t \in \mathcal{T}.$
    Figure imgb0192
  • Finally, the cost incurred at each time $t \in \mathcal{T} \setminus \{T\}$
    Figure imgb0193
    is given by the function of the state and controls
    Figure imgb0194
    , while the final cost (the cost at time
    Figure imgb0195
    ) is given by the function of the final state
    Figure imgb0196
    . As the implementations optimize over random variables, it is considered that one optimizes over an additive cost function: the expected value of the sum of instantaneous costs over the time set
    Figure imgb0197
    and the final cost:
    Figure imgb0198
  • The formulation of a finite-horizon det-POMDP is hence
    Figure imgb0199
    Figure imgb0200
    Figure imgb0201
    Figure imgb0202
    Figure imgb0203
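    $U_t \in \mathbb{U}_t^{ad}(X_t) \ \text{a.s.}, \quad \forall t \in \mathcal{T},$
    Figure imgb0204
    $\sigma(U_t) \subset \sigma(O_0, \dots, O_t, U_0, \dots, U_{t-1}), \quad \forall t \in \mathcal{T}.$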
    Figure imgb0205
  • To summarize, a det-POMDP is a POMDP with the following characteristics:
    • there are no exogenous uncertainties in the dynamics f and the observation functions h,
    • the only uncertainty is on the initial state x 0 of the dynamic system.
    Some reminders on det-POMDP
  • In this paragraph, some results on det-POMDP are discussed. They can be found in the literature. First, det-POMDP are POMDP. This means that all the results and numerical methods that apply to POMDP can be carried over to det-POMDP. Notably, a Dynamic Programming equation can be written on det-POMDP. Moreover, one can derive some complexity results by exploiting the fact that state dynamics and observations mappings are "deterministic". One derives bounds on the cardinality of reachable spaces, which leads to bounds on the number of operations needed to solve the Problem (3.1).
  • For any distribution $\mu \in \Delta(\mathbb{X})$, where $\Delta(\mathbb{X})$ is the set of probability distributions over the state space $\mathbb{X}$, let $\mathrm{supp}(\mu) \subset \mathbb{X}$ be the support of the distribution $\mu$: $\mathrm{supp}(\mu) = \{ x \in \mathbb{X} \mid \mu(x) > 0 \}.$
    Figure imgb0210
  • Solving det-POMDP. The usual way to solve a POMDP consists in reformulating the problem and using a new state, the beliefs. The whole process is known in the literature. In the case of det-POMDP, the following result is known from the literature:
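    Proposition 4. Let $\mathbb{B} = \Delta(\mathbb{X}) \cup \{0\}$, let $b_0 \in \mathbb{B}$ be the distribution of $X_0$, the initial state of Problem (3.1), and consider the sequence of value functions $(V_t)_{t \in \mathcal{T}}$ defined by the following backward induction: $V_T : \mathbb{B} \to \overline{\mathbb{R}}, \quad b \mapsto \sum_{x \in \mathbb{X}} b(x) K(x)$
    Figure imgb0213
    $V_t : \mathbb{B} \to \overline{\mathbb{R}}, \quad b \mapsto \min_{u \in \mathbb{U}_t^b(b)} \Big( C_t(b, u) + \sum_{o \in \mathbb{O}} Q_{t+1}(b, u, o) \, V_{t+1}\big(\tau_t(b, u, o)\big) \Big),$
    Figure imgb0214
    where for all $t$, the mappings $\tau_t : \mathbb{B} \times \mathbb{U} \times \mathbb{O} \to \mathbb{B}$, $Q_t : \mathbb{B} \times \mathbb{U} \times \mathbb{O} \to [0, 1]$ and $C_t : \mathbb{B} \times \mathbb{U} \to \overline{\mathbb{R}}$ are given by
    Figure imgb0218
    Figure imgb0219
    $C_t(b, u) = \sum_{x \in \mathbb{X}} b(x) L_t(x, u),$
    Figure imgb0220
    with $\Gamma^{t+1}_{u,o} = \{ y \in \mathbb{X} \mid \exists x \in \mathbb{X}, \ f_t(x, u) = y \text{ and } h_{t+1}(y, u) = o \},$
    Figure imgb0221
    $h_t^u := h_t(\cdot, u)$, $f_t^u := f_t(\cdot, u)$, $T^t_{u,o} = \big(h_{t+1}^u \circ f_t^u\big)^{-1}(o)$
    Figure imgb0223
    and $\mathbb{U}_t^b(b) = \bigcap_{x \in \mathrm{supp}(b)} \mathbb{U}_t^{ad}(x).$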
    Figure imgb0224
  • Then, $\nu^* = V_0(\mu_0)$
    Figure imgb0225
    i.e. the optimal value of Problem (3.1) and the value of the mapping V0 at the initial belief b0 = µ 0 are equal.
  • Moreover, a policy $\pi = (\pi_0, \dots, \pi_{T-1})$ (a set of mappings $\pi_t : \mathbb{B} \to \mathbb{U}$) which minimizes the right-hand side of Equation (3.3) for each $b$ and $t$ is an optimal policy of Problem (3.1): the controls given by $U_t = \pi_t(B_t)$ (where $B_t$ is computed thanks to the recursion $B_{t+1} = \tau_t(B_t, U_t, O_{t+1})$, with $B_0 = \mu_0$) are optimal controls of Problem (3.1).
  • Remark 5. It is possible that, when considering a given belief $b$, control $u$ and observation $o$, we have $\tau_t(b, u, o) = 0$, i.e. there is no successor of belief $b$ such that it is possible to observe $o$ after applying control $u$. This is why we consider that the belief space is $\mathbb{B} = \Delta(\mathbb{X}) \cup \{0\}$, to cover that case.
  • Moreover, we consider that $\forall t$, $V_t(0) = +\infty$, to represent the value of an impossible belief.
  • Finally, when we obtain $\tau_t(b, u, o) = 0$, we also have $Q_{t+1}(b, u, o) = 0$. Indeed, $Q_{t+1}(b, u, o)$ represents the probability of observing $o$ at time $t + 1$ when applying control $u$ while holding the belief $b$. By convention, we assume that the product of a probability of 0 with $+\infty$ is equal to 0. Hence, multiplying $Q_{t+1}$ by the value function $V_{t+1} \circ \tau_t$ always leads to a finite value. The right-hand side of Equation (3.3) is thus well-defined.
  • Remark 6. Equation (3.4) is well-defined. The issue would be a division by 0 if
    Figure imgb0228
  • However, if $y \in \Gamma^{t+1}_{u,o}$
    Figure imgb0229
    then
    Figure imgb0230
    The condition thus prevents a sum over an empty set.
  • Moreover, if
    Figure imgb0231
    then the sum is strictly positive. The conditions $y \in \Gamma^{t+1}_{u,o}$
    Figure imgb0232
    and
    Figure imgb0233
    thus prevent a division by 0.
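  • Equations (3.4) and (3.5) are only available as images. The sketch below therefore relies on an assumed reading of them: the next belief is the initial belief pushed through the dynamics, restricted to the states compatible with the observation, then renormalized. The function name `belief_update`, the time-invariant `f` and `h`, and the toy model are hypothetical.

```python
def belief_update(b, u, o, f, h):
    """Return (Q, tau): the probability of observing o after control u, and the next belief.

    b is a dict state -> probability, f(x, u) the next state, h(y, u) the observation.
    """
    pushed = {}
    for x, p in b.items():
        if p > 0:
            y = f(x, u)
            # keep only the successors that are compatible with the observation o
            if h(y, u) == o:
                pushed[y] = pushed.get(y, 0.0) + p
    q = sum(pushed.values())
    if q == 0:
        # impossible observation: plays the role of the belief "0" of Remark 5
        return 0.0, None
    # q > 0 here, so the renormalization never divides by 0
    return q, {y: p / q for y, p in pushed.items()}

# Hypothetical toy model: remove u units, observe only whether at least 2 units remain.
f = lambda x, u: x - u
h = lambda y, u: y >= 2
b = {2: 0.5, 3: 0.5}
q_high, b_high = belief_update(b, 1, True, f, h)
q_none, b_none = belief_update(b, 0, False, f, h)
```

  • The early return mirrors the conventions of Remarks 5 and 6: an observation with zero probability yields no successor belief, and the division is only performed when the probability is strictly positive.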
  • A proof of proposition 4 is discussed hereinafter and extends the known results (discussed in Dimitri P. Bertsekas. Dynamic Programming and Optimal Control, volume I. Athena Scientific, Belmont, MA, USA, 4th edition, 2017. ISBN 9781886529434, which is incorporated herein by reference), which are under the hypothesis that the admissibility set at time t does not depend on the state at time t, to the case where the admissibility set depends on the state.
  • It is still possible to transform the formulation of Problem (3.1) in order to remove the constraint (3.1f) by modifying the cost function. Indeed, let
    Figure imgb0234
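    be a modified cost such that $\tilde{L}_t(X_t, U_t) = L_t(X_t, U_t) + \chi_{\mathbb{U}_t^{ad}(X_t)}(U_t)$, where $\chi_{\mathbb{U}_t^{ad}(x)}$
    Figure imgb0235
    represents the characteristic function of the admissibility set: $\chi_{\mathbb{U}_t^{ad}(x)}(u) = \begin{cases} 0 & \text{if } u \in \mathbb{U}_t^{ad}(x), \\ +\infty & \text{otherwise.} \end{cases}$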
    Figure imgb0236
  • Applying a control $u$ that does not belong to the admissibility set $\mathbb{U}_t^{ad}(x)$
    Figure imgb0237
    will therefore lead to a cost of +∞. Since Problem (3.1) is a minimization problem, any solution with finite value when considering cost functions
    Figure imgb0238
    will therefore satisfy constraint (3.1f).
  • When using
    Figure imgb0239
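    , we therefore obtain the following Bellman value functions: $\tilde{V}_T : \mathbb{B} \to \overline{\mathbb{R}}, \quad b \mapsto \sum_{x \in \mathbb{X}} b(x) K(x)$
    Figure imgb0240
    $\tilde{V}_t : \mathbb{B} \to \overline{\mathbb{R}}, \quad b \mapsto \min_{u \in \mathbb{U}} \Big( \tilde{C}_t(b, u) + \sum_{o \in \mathbb{O}} Q_{t+1}(b, u, o) \, \tilde{V}_{t+1}\big(\tau_t(b, u, o)\big) \Big),$
    Figure imgb0241
    where $\tilde{C}_t(b, u) = \sum_{x \in \mathbb{X}} b(x) \tilde{L}_t(x, u)$. We also assume that if $b(x) = 0$, then $\forall u \in \mathbb{U}$, $b(x) \tilde{L}_t(x, u) = 0$.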
  • We now prove that the optimal solutions obtained with the costs $\tilde{L}_t$ are also optimal solutions of Problem (3.1), and that they lead to the value functions defined in Equations (3.2)-(3.3). For the sake of clarity, let $F_{t+1}$ be defined by
    Figure imgb0246
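    $\begin{aligned} \min_{u \in \mathbb{U}} \Big( \tilde{C}_t(b, u) + \sum_{o \in \mathbb{O}} F_{t+1}(b, u, o) \Big) &= \min_{u \in \mathbb{U}} \Big( \sum_{x \in \mathbb{X}} b(x) \tilde{L}_t(x, u) + \sum_{o \in \mathbb{O}} F_{t+1}(b, u, o) \Big) \\ &= \min_{u \in \mathbb{U}} \Big( \sum_{x \in \mathbb{X}} b(x) \big( L_t(x, u) + \chi_{\mathbb{U}_t^{ad}(x)}(u) \big) + \sum_{o \in \mathbb{O}} F_{t+1}(b, u, o) \Big) \\ &= \min_{u \in \mathbb{U}} \Big( \sum_{x \in \mathrm{supp}(b)} \chi_{\mathbb{U}_t^{ad}(x)}(u) + \sum_{x \in \mathbb{X}} b(x) L_t(x, u) + \sum_{o \in \mathbb{O}} F_{t+1}(b, u, o) \Big) \\ &= \min_{u \in \mathbb{U}} \Big( \chi_{\mathbb{U}_t^b(b)}(u) + \sum_{x \in \mathbb{X}} b(x) L_t(x, u) + \sum_{o \in \mathbb{O}} F_{t+1}(b, u, o) \Big) \\ &= \min_{u \in \mathbb{U}_t^b(b)} \Big( \sum_{x \in \mathbb{X}} b(x) L_t(x, u) + \sum_{o \in \mathbb{O}} F_{t+1}(b, u, o) \Big) \end{aligned}$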
    Figure imgb0247
    Therefore:
    Figure imgb0248
  • As $\tilde{V}_T = V_T$, we get $\forall t \in \mathcal{T}$, $\tilde{V}_t = V_t$ by backward induction. Moreover, the controls $u$ that minimize the right-hand side of Equation (3.8) also minimize the right-hand side of Equation (3.3).
  • A direct consequence of Dynamic Programming is that, in order to solve Problem (3.1), we only need to compute the Bellman functions Vt for all the beliefs reachable at time t when starting from b 0. The sets of reachable beliefs are hence formally defined before presenting some bounds on the size of those sets:
    Definition 7. Let b 0 B 0
    Figure imgb0250
    be given. Then, for any t T
    Figure imgb0251
    , we define, B t R b 0 B t
    Figure imgb0252
    , the set of reachable beliefs at time t starting from initial belief b 0, by the following induction:
    Figure imgb0253
  • We have the theorem:
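    Theorem 8. The cardinality of the reachable belief space is bounded: $\forall b_0 \in \Delta(\mathbb{X}), \ \forall t \ge 0, \quad \Big| \bigcup_{s=0}^{t} \mathbb{B}_s^R(b_0) \Big| \le \big(1 + |\mathbb{X}|\big)^{|\mathrm{supp}(b_0)|}.$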
    Figure imgb0254
  • Preliminaries.
    • Given (Ω,
      Figure imgb0255
      ,
      Figure imgb0256
      ) a probability space, two finite sets $A$ and $B$, and a measurable mapping $g : A \to B$, the push forward (or image measure) of a distribution $\mu$ on $A$ by $g$ is the distribution $g_* \mu$ on $B$ defined by $\forall b \in B, \quad (g_* \mu)(b) = \sum_{a \in g^{-1}(b)} \mu(a)$
      Figure imgb0257
    • Let two finite sets
      Figure imgb0258
      and
      Figure imgb0259
      be given and consider a family T of mappings from
      Figure imgb0260
      to
      Figure imgb0261
      . Then, for any probability distribution, µ over
      Figure imgb0262
      we have that
      Figure imgb0263
      Indeed, let μ Δ Y
      Figure imgb0264
      be given. For any
      Figure imgb0265
      we denote by
      Figure imgb0266
      the restriction of the mapping
      Figure imgb0267
      to the subset supp μ Y
      Figure imgb0268
      . For all z
      Figure imgb0269
      we have that
      Figure imgb0270
  • Thus, considering
    Figure imgb0271
    we have that
    Figure imgb0272
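  • The counting argument of the preliminaries can be checked on a tiny instance. The inequality tested below (the number of distinct image measures is at most the number of mappings restricted to the support) is our reading of the statement, whose formula is only available as an image. The names `push_forward`, `Y`, `Z` and `mu` are hypothetical.

```python
from itertools import product

def push_forward(mu, g):
    """Image measure of mu (dict point -> probability) by the mapping g (dict)."""
    out = {}
    for a, p in mu.items():
        if p > 0:
            out[g[a]] = out.get(g[a], 0) + p
    return out

Y, Z = [0, 1, 2], ["a", "b"]
mu = {0: 0.5, 1: 0.5, 2: 0.0}  # the support of mu is {0, 1}

# All mappings from Y to Z, each stored as a dict
mappings = [dict(zip(Y, image)) for image in product(Z, repeat=len(Y))]

# Distinct image measures obtained over the whole family of mappings
images = {tuple(sorted(push_forward(mu, g).items())) for g in mappings}
support_size = sum(1 for p in mu.values() if p > 0)
```

  • Two mappings that coincide on the support of `mu` give the same image measure, which is why only the restrictions to the support matter when counting.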
  • A sketch of proof of Theorem 8 is now discussed. Using the fact that state dynamics is deterministic one obtains that a reachable belief at time t is given as a push forward of the initial belief through a mapping which goes from
    Figure imgb0273
    to an augmented set $\overline{\mathbb{X}} = \mathbb{X} \cup \{\delta\}$
    Figure imgb0274
    . δ is an extra point added to
    Figure imgb0275
    , the cemetery point, which is used to represent the evolution of the system toward a point which is incompatible with the observations or controls applied. The number of reachable beliefs at time t is therefore bounded by the cardinality of
    Figure imgb0276
    , the set of mappings which goes from
    Figure imgb0277
    to $\overline{\mathbb{X}} = \mathbb{X} \cup \{\delta\}$
    Figure imgb0278
    .
  • Proof. Now, we give the details of the proof of Theorem 8. We consider an extended state space $\overline{\mathbb{X}} = \mathbb{X} \cup \{\delta\}$
    Figure imgb0279
    and a self-map on
    Figure imgb0280
    given as follows
    Figure imgb0281
    with Θ u , o t = x X | h t + 1 ƒ t x u , u = o u U t ad π x X
    Figure imgb0282
    (here, π x X
    Figure imgb0283
    is the projection of x on
    Figure imgb0284
    ).
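  • We consider a renormalization map $R : \Delta(\overline{\mathbb{X}}) \to \Delta(\mathbb{X}) \cup \{0\}$
    Figure imgb0285
    defined by $\forall x \in \mathbb{X}, \quad R(b)(x) = \begin{cases} 0 & \text{if } b(\delta) = 1, \\ \dfrac{b(x)}{1 - b(\delta)} & \text{if } b(\delta) \ne 1, \end{cases}$
    Figure imgb0286
    and the extension mapping $\varepsilon : \Delta(\mathbb{X}) \to \Delta(\overline{\mathbb{X}})$
    Figure imgb0287
    defined as follows: $\varepsilon : \Delta(\mathbb{X}) \to \Delta(\overline{\mathbb{X}}), \quad b \mapsto \overline{b} \ \text{such that} \ \begin{cases} \overline{b}(x) = b(x) & \forall x \in \mathbb{X}, \\ \overline{b}(\delta) = 0. \end{cases}$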
    Figure imgb0288
  • We have the following result. For any u and o
    Figure imgb0289
    That is, up to renormalization and extension τt (b,u,o) is the push forward of the belief b by the mapping
    Figure imgb0290
  • Moreover, if we extend the previous notation
    Figure imgb0291
    to a sequence of controls u t:t' U t t
    Figure imgb0292
    and a sequence of observations o t + 1 : t + 1 O t t
    Figure imgb0293
    , i.e.:
    Figure imgb0294
  • A reachable belief b B t R b 0
    Figure imgb0295
    , t > 0, is thus an element of Δ X
    Figure imgb0296
    such that there exists u 0:t-1, o 1:t which verify
    Figure imgb0297
  • Moreover, we have
    Figure imgb0298
    Using the preliminary, with Y = X
    Figure imgb0299
    and b 0 = µ, we hence get that for all b 0 in Δ X
    Figure imgb0300
    and for all $t \ge 0$: $\Big| \bigcup_{s=0}^{t} \mathbb{B}_s^R(b_0) \Big| \le \big(1 + |\mathbb{X}|\big)^{|\mathrm{supp}(b_0)|}.$
    Figure imgb0301
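  • Remark 9. Theorem 8 is an improvement of the bound $\forall b_0 \in \Delta(\mathbb{X}), \quad \Big| \bigcup_{t=0}^{\infty} \mathbb{B}_t^R(b_0) \Big| \le \big(1 + |\mathbb{X}|\big)^{|\mathbb{X}|}$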
    Figure imgb0302
  • Proposition 10 The cardinality of the support of belief decreases when applying the mapping τt . More precisely, we have
    Figure imgb0303
    We therefore have that
    Figure imgb0304
    Proof. Let u U
    Figure imgb0305
    and o O
    Figure imgb0306
    be given. As a preliminary fact, we note, using the definition of $\Gamma^{t+1}_{u,o}$ given in Proposition 4, that $\Gamma^{t+1}_{u,o'} \cap \Gamma^{t+1}_{u,o''} = \emptyset$ when $o' \ne o''$, as otherwise there would exist $y \in \mathbb{X}$ such that $h_{t+1}(y, u) = o'$ and $h_{t+1}(y, u) = o''$, which is not possible.
  • Now, given $y \in \mathbb{X}$ and using the definition of $\tau_t$ in Equation (3.4), we obtain that $\tau_t(b, u, o)(y) \ne 0$ implies that $y$ must be in $\Gamma^{t+1}_{u,o}$ and that there must exist $x \in (f_t^u)^{-1}(y)$ such that $b(x) \ne 0$, which gives
    Figure imgb0313
    Then, we successively have that
    Figure imgb0314
    Using the last inclusion in Equation (3.18), and the fact that the left-hand side of the inclusion is a union of disjoint sets (as given by the preliminary fact), we obtain that
    Figure imgb0315
    which gives Equation (3.15). Then, Equation (3.16) easily follows. Let u U
    Figure imgb0316
    and let o O
    Figure imgb0317
    . We have
    Figure imgb0318
    We hence get Equation (3.16).
  • Theorem 8 and Proposition 10 yield the following Lemma:
    Lemma 11. We have the following bounds on the union on the sets of reachable beliefs for det-POMDP:
    Figure imgb0319
  • Proof. Using Theorem 8, we have
    Figure imgb0320
  • The second bound can be obtained by recurrence thanks to Proposition 10. Let us consider the set of attainable beliefs when one applies a sequence of controls u 0:t-1 when starting in belief b 0, which we denote by B u 0 : t 1 + b 0
    Figure imgb0321
    :
    Figure imgb0322
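  • Using Equation (3.15), we have: $\sum_{b \in \mathbb{B}^+_{u}(b_0)} |\mathrm{supp}(b)| \le |\mathrm{supp}(b_0)|,$
    Figure imgb0323
    and: $\sum_{b \in \mathbb{B}^+_{u_{0:t}}(b_0)} |\mathrm{supp}(b)| \le \sum_{b' \in \mathbb{B}^+_{u_{0:t-1}}(b_0)} \ \sum_{b \in \tau_t(b', u_t, \mathbb{O}_d)} |\mathrm{supp}(b)| \le \sum_{b' \in \mathbb{B}^+_{u_{0:t-1}}(b_0)} |\mathrm{supp}(b')|.$
    Figure imgb0324
  • By induction, we thus have $\sum_{b \in \mathbb{B}^+_{u_{0:t}}(b_0)} |\mathrm{supp}(b)| \le |\mathrm{supp}(b_0)|$
    Figure imgb0325
    .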
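  • Moreover, since $\forall b \in \mathbb{B}^+_{u_{0:t}}(b_0)$, $|\mathrm{supp}(b)| \ge 1$ (as $0 \notin \mathbb{B}^+_{u_{0:t}}(b_0)$), we have $\big|\mathbb{B}^+_{u_{0:t}}(b_0)\big| \le \sum_{b \in \mathbb{B}^+_{u_{0:t}}(b_0)} |\mathrm{supp}(b)|.$
    Figure imgb0328
    Therefore $\forall u_{0:t} \in \mathbb{U}^{t+1}, \quad \big|\mathbb{B}^+_{u_{0:t}}(b_0)\big| \le |\mathrm{supp}(b_0)|.$
    Figure imgb0329
    Hence, we have: $\big|\mathbb{B}_t^R(b_0)\big| = \Big| \bigcup_{u_{0:t-1} \in \mathbb{U}^t} \mathbb{B}^+_{u_{0:t-1}}(b_0) \Big| \le \sum_{u_{0:t-1} \in \mathbb{U}^t} \big|\mathbb{B}^+_{u_{0:t-1}}(b_0)\big| \le \sum_{u_{0:t-1} \in \mathbb{U}^t} |\mathrm{supp}(b_0)| = |\mathrm{supp}(b_0)| \, |\mathbb{U}|^t.$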
    Figure imgb0330
  • Looking at the union of the reachable beliefs gives us:
    Figure imgb0331
  • As we have
    Figure imgb0332
    and
    Figure imgb0333
    which leads to Equation (3.21).
  • Det-POMDP with monotonicity (mon-det-POMDP)
  • A sub-class of det-POMDP is now discussed: mon-det-POMDP, which may also be referred to as "Well Separated det-POMDP".
  • Definition of mon-det-POMDP
  • Definition 12. Let T be a finite set of self-map of
    Figure imgb0334
    which satisfy the following property. For any given pair (
    Figure imgb0335
    ',
    Figure imgb0336
    ") in T 2, if there exists x X
    Figure imgb0337
    such that T x =
    Figure imgb0338
    Figure imgb0339
    "(x), then we must have for all x X
    Figure imgb0340
    one of the three following possibilities:
    Figure imgb0341
    A set T satisfying the property just described is called a Monotonous Function Set. Moreover, a det-POMDP for which the set of functions defined in Equations (3.10) and (3.14) is a Monotonous Function Set is called a mon-det-POMDP.
  • The property can also be stated as follows. Suppose that there exists x
    Figure imgb0342
    such that
    Figure imgb0343
    then
    Figure imgb0344
    and
    Figure imgb0345
    coincide on the set
    Figure imgb0346
  • Otherwise stated, for a mon-det-POMDP, if two sequences of controls lead to the same state when starting in state $x$, then applying the two sequences of controls to another state $x'$ either leads to the same state, or one of the sequences leads to the cemetery point. This main property is illustrated on FIG. 12, where the arrows represent two different sequences of controls $u_{1:t}$, $u'_{1:t}$ with their associated sequences of observations $o_{1:t}$, $o'_{1:t}$
    Figure imgb0348
    such that
    Figure imgb0349
    This leads to
    Figure imgb0350
    whereas
    Figure imgb0351
    as h f x 0,3 u 1 : t o t
    Figure imgb0352
    .
  • Proposition 13. The cardinality of the reachable belief space of a mon-det-POMDP is bounded in terms of the cardinality of the state space and of the support of the initial belief:
    Figure imgb0353
  • A sketch of the proof of Proposition 13 is now discussed. When considering a mon-det-POMDP, one can first follow the same reasoning as in the proof of Theorem 8. Indeed, one needs to consider sets of mappings from $\overline{\mathbb{X}}$ to $\overline{\mathbb{X}}$
    Figure imgb0354
    that verify equation (3.22).
  • There can be at most |
    Figure imgb0355
    | mappings T T
    Figure imgb0356
    such that δ Im T
    Figure imgb0357
    . The mappings that satisfy Equation (3.22) are derived from the family T by choosing an element T T
    Figure imgb0358
    and choosing if one keeps the value T x
    Figure imgb0359
    for a given element x X
    Figure imgb0360
    , or sending it to the cemetery. One hence gets the bound $\Big| \bigcup_{t=1}^{T} \mathbb{B}_t^R(b_0) \Big| \le 2^{|\mathbb{X}|} \, |\mathbb{X}|$. By then applying the preliminary of the proof of Theorem 8, one gets Equation (3.23).
  • Proof. Now we give the details. Using the definitions of
    Figure imgb0362
    ,
    Figure imgb0363
    and ε (Equations (3.10), (3.11), (3.12)), we have:
    Figure imgb0364
  • As there can be only one renormalized value per mapping applied to b 0, the reachable belief space is hence bounded by the number of mapping
    Figure imgb0365
    . However, the set of mapping
    Figure imgb0366
    must verify Equation (3.22).
  • We first consider the set of mappings T such that ∀
    Figure imgb0367
    ,
    Figure imgb0368
    If that set T verifies Equation (3.22), then
    Figure imgb0369
    if x X
    Figure imgb0370
    such that
    Figure imgb0371
    Figure imgb0372
    (since δ is not in
    Figure imgb0373
    or
    Figure imgb0374
    ). We can thus apply the preliminary, and we have
    Figure imgb0375
  • Let us return to the set of mappings
    Figure imgb0376
    . As the mappings verify Equation (3.22),
    Figure imgb0377
    Figure imgb0378
    such that:
    Figure imgb0379
  • There is therefore at most
    Figure imgb0380
    mappings
    Figure imgb0381
    per mapping
    Figure imgb0382
    .
  • Hence, we have $|\tilde{\mathbb{T}}| \le |\mathbb{X}| \, 2^{|\mathbb{X}|}$
    Figure imgb0383
    .
  • That reasoning holds for any belief $b \in \Delta(\mathbb{X})$
    Figure imgb0384
    , instead of the reachable beliefs. To get the reachable beliefs sets, we need to consider mapping from supp(b 0) to
    Figure imgb0385
    . Using the preliminary of Theorem 8, we therefore get: $\Big| \bigcup_{t=1}^{\infty} \mathbb{B}_t^R(b_0) \Big| \le 2^{|\mathrm{supp}(b_0)|} \, |\mathbb{X}|.$
    Figure imgb0386
  • One is therefore bounded by the size of the underlying MDPs. This seems to imply that the problem with partial information may be tractable if the problems with perfect information are tractable.
  • Dynamic Programming algorithm over reduced beliefs sets.
  • Forward computation of the Bellman value functions. A bound on the reachable belief space $\mathbb{B}^R(b_0)$
    Figure imgb0387
    has been defined, which is the space which contains all the beliefs reachable when starting in belief b 0. When computing the Bellman value function defined with Equation (3.3), one hence only needs to consider those beliefs when using a forward Dynamic Programming Algorithm.
    Figure imgb0388
    may thus be restricted to $\bigcup_{t=1}^{T} \mathbb{B}_t^R(b_0)$
    Figure imgb0389
    . Algorithm 5 below solves the problem in
    Figure imgb0390
    (at most
    Figure imgb0391
    computations per beliefs), i.e. in
    Figure imgb0392
    according to Proposition 13.
    Figure imgb0393
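  • Algorithm 5 is only available as an image, so the following is not a transcription of it. It is a minimal sketch of the same idea under stated assumptions: beliefs are enumerated forward from b 0, and the backward induction is run on those beliefs only. The function name `solve_det_pomdp`, the time-invariant `f` and `h`, and the toy instance are hypothetical.

```python
from fractions import Fraction

def solve_det_pomdp(b0, horizon, controls, f, h, cost, final_cost, admissible):
    """Enumerate reachable beliefs forward, then apply backward induction on them only."""
    key = lambda b: tuple(sorted(b.items()))
    layers = [{key(b0): b0}]   # layers[t]: reachable beliefs at time t
    succ = {}                  # (t, belief key, u) -> [(probability, next belief key)]
    for t in range(horizon):
        nxt = {}
        for k, b in layers[t].items():
            for u in controls:
                # u must be admissible for every state in the support of b
                if not all(admissible(x, u) for x in b):
                    continue
                groups = {}    # observation -> unnormalized next belief
                for x, p in b.items():
                    y = f(x, u)
                    g = groups.setdefault(h(y, u), {})
                    g[y] = g.get(y, 0) + p
                succ[(t, k, u)] = []
                for g in groups.values():
                    q = sum(g.values())
                    nb = {y: p / q for y, p in g.items()}
                    nxt[key(nb)] = nb
                    succ[(t, k, u)].append((q, key(nb)))
        layers.append(nxt)
    # Final cost on the beliefs reachable at the horizon
    V = {(horizon, k): sum(p * final_cost(x) for x, p in b.items())
         for k, b in layers[horizon].items()}
    policy = {}
    for t in reversed(range(horizon)):
        for k, b in layers[t].items():
            best_value, best_u = float("inf"), None
            for u in controls:
                if (t, k, u) not in succ:
                    continue
                value = sum(p * cost(x, u) for x, p in b.items())
                value += sum(q * V[(t + 1, nk)] for q, nk in succ[(t, k, u)])
                if value < best_value:
                    best_value, best_u = value, u
            V[(t, k)], policy[(t, k)] = best_value, best_u
    return V, policy, layers

# Hypothetical toy instance: a small tub observed only through "at least 2 units left".
b0 = {2: Fraction(1, 2), 3: Fraction(1, 2)}
V, policy, layers = solve_det_pomdp(
    b0, horizon=2, controls=[0, 1, 2],
    f=lambda x, u: x - u, h=lambda y, u: y >= 2,
    cost=lambda x, u: 0, final_cost=lambda x: x,
    admissible=lambda x, u: u <= x)
root = tuple(sorted(b0.items()))
```

  • In this toy instance, removing one unit first is better than removing two: the resulting observation reveals the state exactly, after which the tub can be emptied. Exact fractions are used so that identical beliefs are recognized as the same dictionary key.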
  • Online computation of the controls with the Bellman value functions.
  • Consider at time $t$ that the belief of the state of the system is $b \in \mathbb{B}_t^R(b_0)$
    Figure imgb0394
    . Consider that the value functions $V_t$ have all been computed. To find the optimal control $u_t^*$
    Figure imgb0395
    at time t under belief b, one needs to solve:
    Figure imgb0396
  • Example illustration of partial observations: emptying a partially observed bathtub
  • This example illustrates a situation with partial observations, which is to empty a bathtub while minimizing an associated cost. The state $x_t$ is one-dimensional and consists of the volume of water in the tub, and the control $u_t$ is also one-dimensional and is the amount of water that the decision maker decides to remove during time step $t$. The state is partially observed: at time $t$, the decision maker only has access to the observation $o_t$, which is no larger than the unobserved state $x_t$.
  • Optimization problem. We now make Problem (3.1) explicit for the bathtub:
    Figure imgb0397
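    $\text{s.t.} \quad X_0 \ \text{with known distribution},$
    Figure imgb0398
    $X_{t+1} = X_t - U_t, \quad \forall t \in \mathcal{T},$
    Figure imgb0399
    $U_t \in [0, O_t], \quad \forall t \in \mathcal{T},$
    Figure imgb0400
    $O_t = \max\{ o^{(i)} \mid X_t \ge o^{(i)} \}, \quad \forall t \in \mathcal{T},$
    Figure imgb0401
    $\sigma(U_t) \subset \sigma(O_0, \dots, O_t, U_0, \dots, U_{t-1}), \quad \forall t \in \mathcal{T}.$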
    Figure imgb0402
  • Equation (3.25a) is the objective function of the bathtub problem, i.e. the implementation of Equation (3.1a) of Problem (3.1). The instantaneous cost function at time $t$ is defined as $L_t(u_t) = c_t u_t$
    Figure imgb0403
    , and hence only depends on the controls.
  • The observation function $h$ is a piecewise constant function which does not depend on the controls $u$. We assume it has $m$ possible values, and we write $o^{(i)}$ for one of the values of $h$: $h(x) = \max\{ o^{(i)} \mid x \ge o^{(i)} \}$. We denote by $[\underline{o}_t, \overline{o}_t]$ the interval of states that are compatible with the observation $o_t$, i.e. the interval that represents $\{ x_t \mid h(x_t) = o_t \}$. This leads to Equation (3.25e), which is the implementation of (3.1e).
  • We assume that the admissibility set is $\mathbb{U}^{ad}(O_t) = [0, O_t]$. The states $x$ are therefore always positive, as the observation is always smaller than the state. This properly defines Equation (3.25d) as the implementation of Equation (3.1f). We can notably remark that in the general problem, the admissibility sets depend on the states and not on the observations. This is not an issue here, as the observations only depend on the states and not on the controls. Hence, we can rewrite the admissibility set as a mapping of the state without any issue: $\mathbb{U}^{ad}(X_t) = [0, h(X_t)] = [0, O_t]$.
  • We therefore cannot remove more water from the bathtub than it contains at any given time. Indeed, we have $O_t \le X_t$, hence $U_t \le X_t$. The controls we apply thus ensure that the states are kept positive (having a positive volume of water in the bathtub is thus a constraint that could be added without any impact).
  • Definition of the dynamics on the belief τ. Let $(b, u, o) \in \mathbb{B} \times \mathbb{U} \times \mathbb{O}$
    Figure imgb0406
    , and let supp(b) = {x1,...,xn }.
  • We can rewrite Equation (3.4):
    Figure imgb0407
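  • Moreover, we can write the function $Q$ as: $Q : \mathbb{B} \times \mathbb{U} \times \mathbb{O} \to [0, 1], \quad (b, u, o) \mapsto \sum_{x \in [\underline{o} + u, \, \overline{o} + u]} b(x)$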
    Figure imgb0408
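  • The bathtub observation function and the associated belief update can be sketched as follows. This is an illustration under assumptions: the threshold list `LEVELS`, the function names and the finite belief are hypothetical, and the update follows our reading of the formulas for τ and Q, which are only available as images.

```python
from bisect import bisect_right
from fractions import Fraction

LEVELS = [0, 2, 5]  # hypothetical sorted observation thresholds o^(i)

def observe(x):
    """h(x): the largest threshold that does not exceed the volume x."""
    if x < LEVELS[0]:
        raise ValueError("volume below the lowest threshold")
    return LEVELS[bisect_right(LEVELS, x) - 1]

def q_bathtub(b, u, o):
    """Probability of observing o after removing u, under belief b (dict volume -> probability)."""
    return sum(p for x, p in b.items() if observe(x - u) == o)

def tau_bathtub(b, u, o):
    """Next belief after removing u and observing o, or None if o cannot be observed."""
    q = q_bathtub(b, u, o)
    if q == 0:
        return None
    # x -> x - u is one-to-one, so no two states merge in the next belief
    return {x - u: p / q for x, p in b.items() if observe(x - u) == o}

b = {3: Fraction(1, 3), 4: Fraction(1, 3), 6: Fraction(1, 3)}
next_belief = tau_bathtub(b, 1, 2)
```

  • Because the dynamics only shift the volume by the removed amount, the states compatible with an observation after the control form a shifted copy of the interval attached to that observation.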
  • Bellman equations for the bathtub problem.
    Figure imgb0409
    Figure imgb0410
  • The bathtub problem as a mon-det-POMDP. The bathtub is clearly a mon-det-POMDP: let $x_1$ be a state, $u_{1:t}$ and $u'_{1:t'}$ be two sequences of controls, and let $o_{1:t}$ and $o'_{1:t'}$ be two sequences of observations such that
    Figure imgb0413
    .
  • There exists $F_w \in \mathbb{R}_+$
    Figure imgb0414
    such that:
    Figure imgb0415
    Fw is the total amount of water that has been removed from the bathtub when applying the two sequences of controls.
  • This leads to:
    Figure imgb0416
    and
    Figure imgb0417
  • The bathtub thus verifies Equation (3.22), and is thus a mon-det-POMDP.
  • Reformulation of POMDP in the belief space (following previously-discussed reference Dimitri P. Bertsekas. Dynamic Programming and Optimal Control, volume I. Athena Scientific, Belmont, MA, USA, 4th edition, 2017. ISBN 9781886529434 , which is incorporated herein by reference).
  • The usual way to solve a POMDP is to reformulate the problem, and use a new state, the beliefs:
    • First, reformulating the imperfect state information as a perfect information case where the state grows with time.
    • Second, defining the value functions in the new perfect information case.
    • Third, setting out the beliefs as sufficient statistics, which allow us to reformulate the value functions.
  • It is considered that the states of the reservoir, the observations and the controls are discretized (i.e. that $x \in \mathbb{X}_d$
    Figure imgb0418
    , $o \in \mathbb{O}_d$
    Figure imgb0419
    , $u \in \mathbb{U}_d$
    Figure imgb0420
    ).
  • Reformulation as a perfect information controlled dynamical system.
  • Definition 14. The information vector, denoted by $I_t$, contains all the information the optimizer has access to at time t: $I_t = (o_0, \dots, o_t, u_0, \dots, u_{t-1}),\ \forall t \in \mathcal{T} \setminus \{0\}$, and $I_0 = o_0$.
    Figure imgb0421
  • Proposition 15. The problem with imperfect state information can be reformulated as a problem with perfect state information, defined by a dynamical system where the state is the information vector I, the controls are the previous controls u, and the disturbances are the observations o: $J^{\pi} = \max \mathbb{E}_{x_0}\Big[\sum_{t \in \mathcal{T}} \tilde{L}(I_t, \pi_t(I_t))\Big]$
    Figure imgb0422
    Figure imgb0423
    $o_{t+1} \sim \mathbb{P}(\cdot \mid I_t, \pi_t(I_t)),\ \forall t \in \mathcal{T},$
    Figure imgb0424
  • Sketch of proof of Proposition 15:
  • A policy is a sequence of functions $\pi = \{\pi_0, \dots, \pi_{T-1}\}$, where each function $\pi_t$ takes an information vector $I_t$ as input and returns the corresponding control $u \in \mathbb{U}$
    Figure imgb0425
    . A policy is hence considered admissible if $\forall t \in \mathcal{T},\ \forall I_t,\ \pi_t(I_t) \in \mathcal{U}^{ad}(I_t)$.
    Figure imgb0426
  • Note that $\mathcal{U}^{ad}(I)$
    Figure imgb0427
    is defined as the admissibility set of the last observation of the information vector: $\mathcal{U}^{ad}(I_t) = \mathcal{U}^{ad}(o_t)$
    Figure imgb0428
    .
  • An admissible policy that maximizes $J^{\pi} = \mathbb{E}_{x_0}\Big[\sum_{t \in \mathcal{T}} L(x_t, \pi_t(I_t))\Big]$
    Figure imgb0429
    Figure imgb0430
    $o_t = h(x_t),\ \forall t \in \mathcal{T},$
    Figure imgb0431
    also maximizes Problem (3.1).
  • Let us now reformulate the problem from imperfect to perfect state information. To do so, we need to define a new dynamical system, whose state at time t is the information vector It . Indeed, that vector contains all the information available at time t.
  • Using the definition of the information vector (Equation (3.29)), we can define the dynamics on the information vector: $I_{t+1} = (I_t, o_{t+1}, u_t),\ \forall t \in \mathcal{T}$.
    Figure imgb0432
    Adding the initial condition $I_0 = o_0$,
    Figure imgb0433
    allows us to properly define a controlled dynamical system, where I is the state, u is the control and o is the disturbance. Moreover, we can specify the law of the disturbance process. Indeed, we have: $\mathbb{P}(o_{t+1} \mid I_t, u_t) = \mathbb{P}(o_{t+1} \mid I_t, u_t, o_0, \dots, o_t)$,
    Figure imgb0434
    as the disturbances $o_0, \dots, o_t$ are part of the information vector I. Hence, we have a disturbance process that only depends on the current state and controls, and not on the prior disturbances.
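  • As an illustration only, the dynamics on the information vector can be sketched in Python as follows. The flat-tuple encoding and the helper names (`initial_information`, `update_information`, `observations`, `controls`) are assumptions made for this sketch and are not taken from the implementations.

```python
def initial_information(o0):
    """I_0 = o_0, stored as a one-element tuple."""
    return (o0,)


def update_information(info, o_next, u):
    """I_{t+1} = (I_t, o_{t+1}, u_t), stored as a flat tuple."""
    return info + (o_next, u)


def observations(info):
    """Recover (o_0, ..., o_t) from the flat tuple."""
    return info[0:1] + info[1::2]


def controls(info):
    """Recover (u_0, ..., u_{t-1}) from the flat tuple."""
    return info[2::2]


if __name__ == "__main__":
    info = initial_information("o0")
    info = update_information(info, "o1", "u0")
    info = update_information(info, "o2", "u1")
    print(observations(info), controls(info))
```

    The length of the tuple grows by two at each time step, which is why the text goes on to replace the information vector by a sufficient statistic.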
  • Since we have: $\mathbb{E}[L_t(x_t, u_t)] = \mathbb{E}\big[\mathbb{E}_{x_t}[L(x_t, u_t) \mid I_t, u_t]\big]$,
    Figure imgb0435
    we can reformulate the objective function with the variables of the new dynamical system. Indeed, we can write the new cost-to-go function: $\tilde{L}_t(I_t, u_t) = \mathbb{E}_{x_t}[L_t(x_t, u_t) \mid I_t, u_t]$.
    Figure imgb0436
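  • The problem with imperfect state information can thus be reformulated as a problem with perfect state information: $J^{\pi} = \max \mathbb{E}_{x_0}\Big[\sum_{t \in \mathcal{T}} \tilde{L}(I_t, \pi_t(I_t))\Big]$
    Figure imgb0437
    $I_{t+1} = (I_t, o_{t+1}, \pi_t(I_t)),\ \forall t \in \mathcal{T} \setminus \{T\}$, and $o_{t+1} \sim \mathbb{P}(\cdot \mid I_t, \pi_t(I_t)),\ \forall t \in \mathcal{T},$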
    Figure imgb0438
  • Dynamic Programming for the formulation with perfect state information
  • The Dynamic Programming Algorithm relies on value functions computation. Their expression is:
    Figure imgb0439
    and
    Figure imgb0440
  • Hence, the formulation using information vector may be solved with a Dynamic Programming algorithm, and has the same optimal value as Problem (3.1). Let
    Figure imgb0441
    * be the optimal value functions and π* be the optimal policy obtained through Dynamic Programming algorithm.
  • Using beliefs as sufficient statistics
  • Sufficient statistics are known from the literature and are defined as quantities of smaller dimension than $I_t$ that summarize all the essential content of $I_t$ as far as the controls are concerned.
  • Sufficient statistics are functions St such that there exist functions Ht and
    Figure imgb0442
    that verify: J t I t = max u H t S t I t , u = J t S t I t ,
    Figure imgb0443
    and there exist π t such that: π t I t = π t S t I t
    Figure imgb0444
  • Proposition 16. The beliefs $b_t$, i.e. the conditional state distributions given the information vector, $b_t = \mathbb{P}(x_t \mid I_t)$, are sufficient statistics.
  • Proof. Here is a sketch of proof of Proposition 16.
  • We can define a function $G_t$ such that: $\mathbb{E}_{x_t}[L_t(x_t, u_t) \mid I_t, u_t] = G_t(b_t, u_t)$.
    Figure imgb0445
  • Let us consider that there exists a function τ such that $b_{t+1} = \tau(b_t, u_t, o_{t+1})$ (it is made explicit in the discrete case). Looking back at Equation (3.32), we can write:
    Figure imgb0446
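  • A minimal Python sketch of such a belief dynamics τ is given here for the deterministic discrete case. It assumes that beliefs are dictionaries mapping states to probabilities, that the dynamics `f` and the observation function `h` are deterministic, and that $o_{t+1} = h(x_{t+1})$. The function name `belief_update` and the toy model in the usage part are hypothetical; the code is plain Bayesian conditioning and is not claimed to be the exact expression used by the implementations.

```python
def belief_update(belief, u, o, f, h):
    """Return (tau(b, u, o), Q(b, u, o)) for deterministic f and h.

    belief: dict mapping each state in the support to its probability.
    Q is the probability of observing o after applying u from belief b.
    """
    mass = {}
    for x, p in belief.items():
        y = f(x, u)                # deterministic successor state
        if h(y) == o:              # keep only successors compatible with o
            mass[y] = mass.get(y, 0.0) + p
    q = sum(mass.values())
    if q == 0.0:
        return None, 0.0           # observation o is impossible from (b, u)
    return {y: p / q for y, p in mass.items()}, q


if __name__ == "__main__":
    # Toy model (assumption): integer water level, saturated observation.
    f = lambda x, u: x - u
    h = lambda x: min(x, 2)
    print(belief_update({3: 0.5, 5: 0.5}, 2, 2, f, h))
```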
  • We denote by $C_t$ the cost-to-go function associated with the belief. It is defined as: $C_t(b_t, u_t) = \mathbb{E}_{x_t}[L(x_t, u_t) \mid I_t]$.
    Figure imgb0447
  • In the discrete case, we hence write it: $C_t(b_t, u_t) = \sum_{x \in \mathbb{X}_d} b_t(x)\, L(x, u_t)$.
    Figure imgb0448
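  • The discrete expression of $C_t$ is a belief-weighted sum, which can be sketched as follows. The dictionary representation of the belief and the name `belief_cost` are assumptions made for this illustration.

```python
def belief_cost(belief, u, cost):
    """C(b, u) = sum over x of b(x) * L(x, u), with the belief given as a dict."""
    return sum(p * cost(x, u) for x, p in belief.items())


if __name__ == "__main__":
    # Toy instantaneous function L(x, u) = x * u (assumption for the example).
    print(belief_cost({0: 0.25, 2: 0.75}, 2, lambda x, u: x * u))
```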
  • We hence have:
    Figure imgb0449
  • Let us now write
    Figure imgb0450
    as functions of bt and J t + 1
    Figure imgb0451
    , which will cement b as sufficient statistics. Looking back at Equation (3.33), we have: $J_t(I_t) = \max_{u_t \in \mathcal{U}^{ad}(I_t)} \mathbb{E}_{o_{t+1}}\big[\tilde{L}_t(I_t, u_t) + J_{t+1}(I_t, o_{t+1}, u_t) \mid I_t, u_t\big] = \max_{u_t \in \mathcal{U}^{ad}(o_t)} \mathbb{E}_{x_t, o_{t+1}}\big[L_t(x_t, u_t) + J_{t+1}(I_t, o_{t+1}, u_t) \mid I_t, u_t\big] = \max_{u_t \in \mathcal{U}^{ad}(o_t)} \Big( C_t(b_t, u_t) + \mathbb{E}_{o_{t+1}}\big[J_{t+1}(\tau(b_t, u_t, o_{t+1}))\big] \Big) = J_t(b_t)$
    Figure imgb0452
  • The beliefs are therefore sufficient statistics.
  • Since the beliefs are sufficient statistics, they can be used to implement a Dynamic Programming algorithm that can solve Problem (3.1).
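  • The following Python sketch illustrates a backward recursion on beliefs of the kind described above, for a small deterministic model. It is a hedged illustration, not the algorithm of the implementations: the names `belief_update` and `belief_value`, the dictionary beliefs, the memoization scheme and the toy model are all assumptions. It also assumes that all states in the support of a belief share the same observation, so that the admissible controls can be read from any of them.

```python
def belief_update(belief, u, o, f, h):
    """Condition the belief on observing o after applying u (deterministic f, h)."""
    mass = {}
    for x, p in belief.items():
        y = f(x, u)
        if h(y) == o:
            mass[y] = mass.get(y, 0.0) + p
    q = sum(mass.values())
    if q == 0.0:
        return None, 0.0
    return {y: p / q for y, p in mass.items()}, q


def belief_value(belief, t, horizon, f, h, cost, admissible, obs_space, memo=None):
    """Return (value, best control) of a belief at time t by backward recursion."""
    if memo is None:
        memo = {}
    if t == horizon:
        return 0.0, None
    key = (t, tuple(sorted(belief.items())))
    if key in memo:
        return memo[key]
    # Assumption: every state of the support yields the same observation.
    o_now = h(next(iter(belief)))
    best_value, best_u = float("-inf"), None
    for u in admissible(o_now):
        # Expected instantaneous gain under the belief.
        total = sum(p * cost(x, u) for x, p in belief.items())
        # Expected future value over the possible next observations.
        for o in obs_space:
            nxt, q = belief_update(belief, u, o, f, h)
            if q > 0.0:
                total += q * belief_value(nxt, t + 1, horizon, f, h, cost,
                                          admissible, obs_space, memo)[0]
        if total > best_value:
            best_value, best_u = total, u
    memo[key] = (best_value, best_u)
    return memo[key]


if __name__ == "__main__":
    # Toy model (assumption): remove u units of water, gain u, observe min(x, 2).
    f = lambda x, u: x - u
    h = lambda x: min(x, 2)
    cost = lambda x, u: float(u)
    admissible = lambda o: range(o + 1)
    print(belief_value({2: 0.5, 3: 0.5}, 0, 2, f, h, cost, admissible, (0, 1, 2)))
```

    Memoizing on the pair (time, belief) is what keeps the recursion tractable when many control sequences lead to the same belief.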
  • Implementations of the method which reformulate the deterministic optimization Problem (2.1) of an oil and gas production network assuming only partial observation are now discussed.
  • Implementations of the formulation and numerical resolution of a deterministic optimization problem for the management of an oil and gas production system (see Problem (2.1)) according to implementations of the method have been discussed hereinabove. In that formulation, it was considered that oil prices were known (deterministic oil prices) and that the state of the dynamical system modeling the reservoir dynamics was fully observed (i.e. the optimization problem was formulated under a complete observation assumption). Relaxing the deterministic assumption for prices and assuming that prices are driven by a Markov process could easily be taken into account, as the deterministic problem was solved by dynamic programming and the extension to stochastic dynamic programming is straightforward.
  • However, assuming a complete observation of the state dynamics is too demanding an assumption. The state variables depend on the structure of an oil reservoir (which is a geological formation which contains some hydrocarbons) and are not perfectly known when starting to exploit the oil and gas production network. Implementations of the method therefore consider the optimization problem under partial observation, where it is assumed that the initial state of the reservoir is not known but that there is partial information in the form of an initial probability law for the initial state distribution.
  • Implementations of the method which reformulate the deterministic optimization Problem (2.1) of an oil and gas production network assuming only partial observation are now discussed. This formulation leads to the optimization problem (P) (referred to in the implementations as "Problem (4.1)") which is a mon-det-POMDP. The optimization problem (P), which uses the function f defined by Equation (S), is thus in implementations obtained by the reformulation of Problem (2.1) which is now discussed. Numerical applications are discussed hereinafter, and the creation of the relevant spaces to solve this problem is also discussed hereinafter.
  • Reminder on the deterministic problem. The presently-discussed implementations consider a petroleum production system, with at least one reservoir from which the hydrocarbon resources (which are considered to be fluids which follow a black oil model) are extracted. The production system is constituted of pipes, used to transport the fluids; wells, from which the fluids leave the reservoir and enter the network; valves, used to control the network; and pumps used to re-inject fluids into the reservoir. Meanwhile, the reservoir is modeled as a dynamical system thanks to the material balance equations and the black oil model.
  • It is considered that the topology of the petroleum production system is represented by a graph $\mathcal{G} = (\mathcal{V}, \mathcal{A})$
    Figure imgb0453
    .
    Figure imgb0454
    is the set of vertices, and $\mathcal{A} \subset \mathcal{V}^2$
    Figure imgb0455
    is the set of arcs. At least one reservoir, modeled as a controlled dynamical system, is also considered. The state of the reservoir is $x = (V_o, V_g, V_w, V_p, P_R)$. The controls u are the opening or closing of pipes $o_a$, $a \in \mathcal{A}$
    Figure imgb0456
    , and the choice of the well-head pressure $P_w$, $w \in \mathcal{V}_{in} \subset \mathcal{V}$
    Figure imgb0457
    . Let f be the evolution function of the reservoir, and let
    Figure imgb0458
    be the admissibility set of the controls of the production system.
  • The goal of the implementations is to optimize the production phase, i.e. to maximize an economic criterion such as the net present value over multiple time steps. Let $\mathcal{T} = \{0, \dots, T-1\}$
    Figure imgb0459
    be the finite set of the time steps, where
    Figure imgb0460
    is a positive integer. The deterministic optimization problem is written as the problem (2.1).
  • Adding partial observation. The implementations take into account the partial observation of the content of the reservoir. Indeed, it is not always possible to see the true content of the reservoir. Instead, it is considered that there is an observation o, and an observation function h such that $o_t = h(x_t)$. The observations are the reservoir pressure $P_R$, the water-cut $w_{ct}$ (proportion of water produced when a volume of fluids is extracted), and the gas-oil ratio $g_{or}$ (proportion of gas produced when a volume of oil is extracted). Those observations allow the observation function to be properly defined.
  • Links to problem (3.1). The variables of the problem are: $x = (V_o, V_g, V_w, V_p, P_R)$,
    Figure imgb0461
    $u = \big((o_a)_{a \in \mathcal{A}}, (P_w)_{w \in \mathcal{V}_{in}}\big)$,
    Figure imgb0462
    $o = (P_R, w_{ct}, g_{or})$.
    Figure imgb0463
  • The general formulation of the optimization of a petroleum production system under partial observation is:
    Figure imgb0464
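    s.t. $\mathcal{L}(X_0) = \mu_0$, (4.1b)
    Figure imgb0465
    $X_{t+1} = f(X_t, U_t),\ \forall t \in \mathcal{T}$, (4.1c)
    Figure imgb0466
    $O_t = h(X_t),\ \forall t \in \mathcal{T}$, (4.1d)
    Figure imgb0467
    $U_t \in \mathcal{U}_t^{ad}(X_t),\ \forall t \in \mathcal{T}$, (4.1e)
    Figure imgb0468
    $\sigma(U_t) \subset \sigma(O_0, \dots, O_t, U_0, \dots, U_{t-1}),\ \forall t \in \mathcal{T}$. (4.1f)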
    Figure imgb0469
  • To simplify the description of the problem, it is considered in this paragraph that there is only one tank reservoir in the production system. Extending the formulation to multiple reservoirs is done by expanding the state vector and the observation vector to accommodate each reservoir accordingly.
  • Objective function. The objective function of the problem (in Equation (4.1a)) is defined by the cost function
    Figure imgb0470
    . It is defined as in Equation (2.1a). It depends on the production values, which are affected by the observation (reservoir pressure, water-cut and gas-oil ratio). The production values are obtained through the general production function $\Phi : \mathbb{X} \times \mathbb{U} \to \mathbb{R}^3$
    Figure imgb0471
    (in the one tank case), and the implementations associate a price vector $r_t$ with the production of each fluid: oil, gas and water. Controls u may also have an associated cost vector c, such as the operating cost of a pump which re-injects water into the reservoir. All those costs are condensed in the cost function
    Figure imgb0472
    : $L_t : \mathbb{X} \times \mathbb{U} \to \mathbb{R},\ (x, u) \mapsto r_t^{\top} \Phi(x, u) - c^{\top} u$.
    Figure imgb0473
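  • As a hedged illustration, the instantaneous economic function can be sketched as a price-weighted production minus a control cost. The sign convention (control costs subtracted from revenues), the function name `stage_gain` and the numerical values are assumptions made for this example.

```python
def stage_gain(prices, production, unit_costs, controls):
    """r_t^T Phi(x, u) - c^T u for one time step.

    prices, production: sequences over (oil, gas, water).
    unit_costs, controls: sequences over the control components.
    Assumption: control costs are subtracted from the production revenue.
    """
    revenue = sum(r * q for r, q in zip(prices, production))
    expense = sum(c * v for c, v in zip(unit_costs, controls))
    return revenue - expense


if __name__ == "__main__":
    # Hypothetical numbers: water has a negative price (treatment cost).
    print(stage_gain((50, 3, -1), (10, 100, 20), (2,), (5,)))
```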
  • Note that while the general production function Φ is defined on the state space, it only depends on the observations. There is thus a function $\tilde{\Phi} : \mathbb{O} \times \mathbb{U} \to \mathbb{R}^3$
    Figure imgb0474
    such that $\forall (x, u) \in \mathbb{X} \times \mathbb{U},\ \Phi(x, u) = \tilde{\Phi}(h(x), u)$
    Figure imgb0475
  • Initialization. The initialization of the state of the reservoir is represented in Equation (4.1b). Here, the implementations initialize the state with the distribution given by previous analysis on the reservoir.
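  • Dynamics. For the dynamics of Equation (4.1c), the implementations use the function f previously defined. The dynamics were defined using the general production function Φ in Equation (2.3). The function f is defined as: $f : \mathbb{X} \times \mathbb{U} \to \mathbb{X},\ (x, u) \mapsto \begin{pmatrix} x_1 - \Phi^{(1)}(x, u) \\ x_2 - \Phi^{(2)}(x, u) + x_1 R_s(x_5) - \big(x_1 - \Phi^{(1)}(x, u)\big) R_s(\Xi(x, u)) \\ x_3 - \Phi^{(3)}(x, u) \\ x_4 \big(1 + c_f (\Xi(x, u) - x_5)\big) \\ \Xi(x, u) \end{pmatrix}$,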
    Figure imgb0476
    where Ξ is an easily computed function. It is now considered that the computation of f is in
    Figure imgb0477
    (1), i.e. in constant time. Also, f is stationary (not time dependent).
  • Observations. Equation (4.1d) defines the observations we have access to. In the management of an oil and gas production system, it is assumed that the observation function h is known, as is the way the observations depend on the components of the state. The reservoir pressure is directly observed, while the water-cut is a function of the water saturation $S_w = \frac{V_w B_w}{V_p}$
    Figure imgb0478
    and the gas-oil ratio is a function of the free gas saturation $S_g = \frac{V_g B_g}{V_p}$
    Figure imgb0479
    (with $B_w$ and $B_g$ functions of the reservoir pressure). $h : \mathbb{X} \to \mathbb{O},\ x \mapsto \big(x_5,\ w_{ct}(x_3, x_4, x_5),\ g_{or}(x_2, x_4, x_5)\big)$
    Figure imgb0480
  • Here, the observation only depends on the state itself, not on the controls u. Moreover, the observation functions considered are stationary, whereas the observation functions of Problem (3.1) were time dependent and also depended on the previous controls.
  • Admissibility set of the controls. Equation (4.1e) states that for each time step t, the controls $u_t$ must belong to an admissibility set $\mathcal{U}_t^{ad}$
    Figure imgb0481
    which depends on the current state. This admissibility set contains all the constraints derived from the production system (capacity of the pipes, allowed pressure range of the different assets, which translates to a pressure range at the different nodes, and gas and water treatment capacity at the export point). Those constraints are such that they directly depend on the fluids production. There is thus a set
    Figure imgb0482
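    defining all the current ranges of admissible values on the production network, such that admissible controls are defined by $(x, u) \in \mathbb{X} \times \mathbb{U},\ \Phi(x, u) \in \mathsf{U}_t$.
    Figure imgb0483
  • The admissibility set is therefore defined as the set-valued mapping $\mathcal{U}_t^{ad} : \mathbb{X} \rightrightarrows \mathbb{U},\ x \mapsto \{u \mid \Phi(x, u) \in \mathsf{U}_t\}$.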
    Figure imgb0484
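  • On a finite set of candidate controls, such a set-valued mapping amounts to a filter on the production values. The sketch below is only an illustration; the names `admissible_controls`, `production` and `within_limits`, as well as the toy limits, are hypothetical.

```python
def admissible_controls(x, candidate_controls, production, within_limits):
    """Return the controls u whose production Phi(x, u) satisfies the network limits."""
    return [u for u in candidate_controls if within_limits(production(x, u))]


if __name__ == "__main__":
    # Toy production (assumption): oil = u, gas = 2u, water = 0, independent of x.
    production = lambda x, u: (u, 2 * u, 0)
    # Toy limits (assumption): at most 3 units of oil and 4 units of gas.
    within_limits = lambda q: q[0] <= 3 and q[1] <= 4
    print(admissible_controls(None, (0, 1, 2, 3), production, within_limits))
```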
  • Note that all those constraints depend on the current observation: since the general production function Φ only depends on the observations, and not on other functions of the state of the reservoir, the fluids in the network are functions of the observations and the controls. Since the constraints on the production system are due to the production values and the pressure at the different nodes (which is derived from the controls and the observed reservoir pressure), the admissibility set only depends on the current observation. The implementations therefore also use the set-valued mapping $\tilde{\mathcal{U}}_t^{ad} : \mathbb{O} \rightrightarrows \mathbb{U}$
    Figure imgb0485
    to define the admissibility set which depends on the observation. This simplifies the admissibility of the controls and the policy.
  • Non-anticipativity. Finally, Equation (4.1f) is the non-anticipativity constraint. It states that to choose the controls at time t, one only has access to the history of controls and observation up to time t.
  • Only the discrete case is discussed, where it is considered that one has a discrete distribution for the initial state. This means one needs to discuss how the problem is discretized before solving it. The discretization process implemented by the implementations is discussed hereinafter.
  • Monotonicity of the management of an oil and gas production system
  • Assumption 1. We assume that there is an observation $o_0 \in \mathbb{O}$
    Figure imgb0486
    such that ∀x ∈ supp(µ 0), h(x) = o 0.
  • Proposition 17. Problem (4.1) is a mon-det-POMDP.
  • Proof. Let us check that f verifies Equation (3.22) for all states in a reachable belief.
  • First, the production function Φ depends on the observation, not directly on the state. Hence, for all states x and x' such that h(x) = h(x'), and for all $u \in \mathbb{U}$
    Figure imgb0487
    , Φ(x, u) = Φ(x', u). For convenience, we therefore define $\tilde{\Phi} : \mathbb{O} \times \mathbb{U} \to \mathbb{R}^3$
    Figure imgb0488
    the function that takes the observation instead of the state as input: $\forall x \in \mathbb{X}$
    Figure imgb0489
    , $\forall u \in \mathbb{U}$
    Figure imgb0490
    , Φ(x, u) = Φ̃(h(x), u).
  • We also use the mappings
    Figure imgb0491
    as defined in the proof of Theorem 8. We consider an extended state space X = X δ
    Figure imgb0492
    , and define
    Figure imgb0493
    as
    Figure imgb0494
  • We now detail T:
    Figure imgb0495
  • Indeed, we have $\forall x \in \mathbb{X},\ x_5 = h(x)^{(1)}$
    Figure imgb0496
    .
  • First, let x X
    Figure imgb0497
    . According to Equation (4.2), we have that
    Figure imgb0498
  • When considering a composition, we hence get
    Figure imgb0499
  • When considering a belief b, we hence have $\forall (u, o) \in \mathbb{U} \times \mathbb{O}$
    Figure imgb0500
    , ∀x ∈ supp(τ(b, u, ο)), h(x) = o.
  • Using Assumption 1, we can hence consider that b t = 0 B t R b 0
    Figure imgb0501
    , o O
    Figure imgb0502
    such that ∀x ∈ supp(b), h(x) = o.
  • To verify that Problem (4.1) is a mon-det-POMDP, we need to verify that
    Figure imgb0503
    such that there exists x X
    Figure imgb0504
    ,
    Figure imgb0505
    then $\forall x' \in h^{-1}(h(x))$,
    Figure imgb0506
  • To do so, we analyze the different components of
    Figure imgb0507
    . We denote by
    Figure imgb0508
    the restriction of
    Figure imgb0509
    to $h^{-1}(o)$, i.e.
    Figure imgb0510
    .
  • The composition of functions
    Figure imgb0511
    has restrictions on the observations involved in order to be well defined. Indeed, let $(o_i)_{i \in \{1,2,3,4\}} \in \mathbb{O}^4$
    Figure imgb0512
    , and let $(u, u') \in \mathbb{U}^2$
    Figure imgb0513
    .
    Figure imgb0514
    is well defined if and only if o 2 = o 3. Therefore, we can define without any ambiguity the composition of t functions T by giving a sequence of controls
    Figure imgb0515
    (ui ) i∈{0,...,t-1} and a sequence of observations (oi ) i∈{0,...,t}:
  • We denote by T the set of the functions
    Figure imgb0516
    and their well defined compositions.
  • First,
    Figure imgb0517
    Figure imgb0518
    such that
    Figure imgb0519
    . Such
    Figure imgb0520
    is associated to a sequence of controls (ui ) i∈{0,1,...,t-1} and a sequence of observations (oi ) i∈{1,...,t}. When considering
    Figure imgb0521
    with o 0 the observation given by Assumption 1, we have that ∀x ∈ supp(b 0),
    Figure imgb0522
    Hence,
    Figure imgb0523
    Figure imgb0524
    such that
    Figure imgb0525
    .
  • In order to prove that Problem (4.1) is a mon-det-POMDP, we need to prove that T is a Monotonous Function Set.
  • First, let us write a general form of the mapping
    Figure imgb0526
    :
    Figure imgb0527
  • The composition
    Figure imgb0528
    thus has the following form
    Figure imgb0529
  • Let us focus on the different components of
    Figure imgb0530
    . First, we can remark that the components i ∈ {1,3} are of the form
    Figure imgb0531
  • When considering a composition
    Figure imgb0532
    we hence have
    Figure imgb0533
    Let
    Figure imgb0534
    and
    Figure imgb0535
    two mappings of T. If there is a state $x \in \mathbb{X}$
    Figure imgb0536
    such that
    Figure imgb0537
    then we have that $o_0 = o'_0$
    Figure imgb0538
    and $o_t = o'_{t'}$
    Figure imgb0539
    . Finally, we have $x_i + \sum_{j=0}^{t-1} g_i(o_j, u_j) = x_i + \sum_{j=0}^{t'-1} g_i(o'_j, u'_j)$,
    Figure imgb0540
    i.e. $\sum_{j=0}^{t-1} g_i(o_j, u_j) = \sum_{j=0}^{t'-1} g_i(o'_j, u'_j)$.
    Figure imgb0541
  • Hence, x X
    Figure imgb0542
    , we have either
    Figure imgb0543
  • For components i ∈ {1,3}, T is thus a Monotonous Function Set.
  • Let us now focus on component 4. The fourth component of
    Figure imgb0544
    has the following form
    Figure imgb0545
  • When considering a composition
    Figure imgb0546
    we hence have
    Figure imgb0547
  • Let us once again consider
    Figure imgb0548
    and
    Figure imgb0549
    two mappings of T. If there is a state $x \in \mathbb{X}$
    Figure imgb0550
    such that
    Figure imgb0551
    we have $x_4 \prod_{i=0}^{t-1} \big(1 + a(o_{i+1}, o_i)\big) = x_4 \prod_{i=0}^{t'-1} \big(1 + a(o'_{i+1}, o'_i)\big)$.
    Figure imgb0552
  • As the fourth component of the state x must be strictly positive, we hence have $\prod_{i=0}^{t-1} \big(1 + a(o_{i+1}, o_i)\big) = \prod_{i=0}^{t'-1} \big(1 + a(o'_{i+1}, o'_i)\big)$.
    Figure imgb0553
  • We once again verify Equation (3.22).
  • Let us now look at the last component, 2. It is of the form
    Figure imgb0554
  • When considering a composition
    Figure imgb0555
    and by using the component {1}, we hence have
    Figure imgb0556
  • We can simplify it to
    Figure imgb0557
  • Let us once again consider
    Figure imgb0558
    and
    Figure imgb0559
    two mappings of T. If there is a state $x \in \mathbb{X}$
    Figure imgb0560
    such that
    Figure imgb0561
    We have
    Figure imgb0562
  • Using Equation (4.3) (which is also verified), and since $o_0 = o'_0$
    Figure imgb0563
    and $o_t = o'_{t'}$
    Figure imgb0564
    , we have
    Figure imgb0565
    which leads to $x_2 + \sum_{i=0}^{t-1} g_2(o_i, u_i) = x_2 + \sum_{i=0}^{t'-1} g_2(o'_i, u'_i)$,
    Figure imgb0566
    i.e. $\sum_{i=0}^{t-1} g_2(o_i, u_i) = \sum_{i=0}^{t'-1} g_2(o'_i, u'_i)$.
    Figure imgb0567
  • Hence, for x X
    Figure imgb0568
    , if T u 0 : t 1 , o 0 : t x δ
    Figure imgb0569
    and T u 0 : t 1 , o 0 : t x δ
    Figure imgb0570
    , we have
    Figure imgb0571
  • Hence, the second component also verifies Equation (3.22). As all the components of T verify Equation (3.22), T is a Monotonous Function Set. Moreover, since for all beliefs b t = 0 B t R b 0
    Figure imgb0572
    , there is
    Figure imgb0573
    such that
    Figure imgb0574
    Problem (4.1) is a mon-det-POMDP.
  • Implementations of the optimization where the optimization comprises solving the optimization problem (P), previously discussed, with f the function given by the previously-discussed formula (S), in the case where the observations are partial observations, are now discussed. In these implementations (P), which corresponds to Problem (4.1), is a det-POMDP. These implementations include discretizing the optimization problem (P), which includes implementing the discretization framework, constructing the belief space, constructing the reachable state space, and constructing the reachable belief space, each as discussed hereinafter. The implementations may further include applying any suitable Dynamic Programming Algorithm to solve the discretized problem (P), for example by using Algorithm 1.
  • Discretization framework
  • As Problem (4.1) is a mon-det-POMDP, discretization of Problem (4.1) is now detailed. Indeed, results on mon-det-POMDP presented in Chapter 3 were for finite state space, controls space and observation space. However, the functions presented in §4.2.1 (i.e. f and h) are continuous. One thus needs to discretize the functions and controls to get finite sets for the state, controls and observations.
  • The implementations discretize the observations into m values, and consider that there can be up to d controls per observation o, and that those controls belong to $\tilde{\mathcal{U}}^{ad}(o)$
    Figure imgb0575
    . Hence, discretized sets
    Figure imgb0576
    and
    Figure imgb0577
    are now considered for the controls and observations. A discretized function $h : \mathbb{X} \to \mathbb{O}_d$
    Figure imgb0578
    is hence considered, with controls $u \in \mathbb{U}_d \subset \bigcup_{o \in \mathbb{O}_d} \tilde{\mathcal{U}}^{ad}(o)$
    Figure imgb0579
    . The implementations then build the state space by recursively applying the dynamics to the possible initial states with the relevant associated controls. Indeed, for all $x_0 \in \operatorname{supp}(b_0)$ ($b_0$ being the belief on the initial state), the implementations compute the reachable state space $\bigcup_{t=0}^{T} \mathbb{X}_t^R(x_0)$
    Figure imgb0580
    , where $\mathbb{X}_t^R(x_0)$
    Figure imgb0581
    is recursively defined as $\mathbb{X}_0^R(x_0) = \{x_0\}$
    Figure imgb0582
    $\mathbb{X}_{t+1}^R(x_0) = \{ y \mid \exists x \in \mathbb{X}_t^R(x_0),\ \exists u \in \tilde{\mathcal{U}}^{ad}(h(x)),\ y = f(x, u) \}$
    Figure imgb0583
  • The implementations thus yield a discrete space: $\mathbb{X}_d = \bigcup_{x_0 \in \operatorname{supp}(b_0)} \mathbb{X}^R(x_0)$.
    Figure imgb0584
  • The implementations therefore yield a discretized dynamics $f : \mathbb{X}_d \times \mathbb{U}_d \to \mathbb{X}_d$
    Figure imgb0585
    .
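  • The recursive definition of the reachable state space can be illustrated by the following Python sketch, which propagates a set of states layer by layer. This is a naive enumeration under stated assumptions (hashable states, a finite list of admissible controls per observation, hypothetical names `reachable_states` and `admissible`); it is not the ordered construction of Algorithm 7.

```python
def reachable_states(x0, horizon, f, h, admissible):
    """Return (layers, union) where layers[t] is the set reachable from x0 in t steps."""
    layers = [{x0}]
    for _ in range(horizon):
        current = layers[-1]
        # Apply every admissible control (given the observation) to every state.
        nxt = {f(x, u) for x in current for u in admissible(h(x))}
        layers.append(nxt)
    return layers, set().union(*layers)


if __name__ == "__main__":
    # Toy model (assumption): integer level, fully observed, remove at most 2 units.
    f = lambda x, u: x - u
    h = lambda x: x
    admissible = lambda o: [u for u in (0, 1, 2) if u <= o]
    layers, union = reachable_states(3, 2, f, h, admissible)
    print(layers, union)
```

    Using sets merges states reached through different control sequences, which is the effect sought by choosing controls whose productions are multiples of each other.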
  • The implementations then properly define the dynamics τ on the beliefs. Problem (4.1) is a det-POMDP, and thus satisfies Proposition 4, which states that τ is entirely defined by f and h. By propagating the initial belief $b_0$ with τ, the implementations obtain the discrete reachable belief space
    Figure imgb0586
    : $\mathbb{B}^R = \bigcup_{t=1}^{T} \mathbb{B}_t^R(b_0)$.
    Figure imgb0587
  • The notations used in the presently-discussed implementations are presented in Table 4.1 below: Table 4.1: Notations of the spaces
    Symbol | Definition
    Figure imgb0588 | State space
    Figure imgb0589 | Discretized state space
    Figure imgb0590 | Control space
    Figure imgb0591 | Discretized control space
    Figure imgb0592 | Space of the observations
    Figure imgb0593 | Discretized space of the observations
    Figure imgb0594 | Space of the time steps
    Figure imgb0595 | Space of the beliefs
    Figure imgb0596 | Space of the reachable beliefs (discrete)
  • Construction of the belief space
  • Controls and observations. Consider a given discretization of the observer function $\tilde{h} : \mathbb{X} \to \mathbb{O}_d$
    Figure imgb0597
    . The discretized observer function is constant inside a closed connected part of the state space, and $\tilde{h}^{-1}(o)$ is a closed connected part of the state space. It is assumed that oil or gas cannot be injected into the reservoir. That condition implies certain values for the production functions. If there is only one reservoir, the implementations can therefore consider the following. First, the production values form a three dimensional vector: $\forall (o, u) \in \mathbb{O} \times \mathbb{U}_d,\ \tilde{\Phi}(o, u) = \big(\tilde{\Phi}^{(1)}(o, u), \tilde{\Phi}^{(2)}(o, u), \tilde{\Phi}^{(3)}(o, u)\big),\ \tilde{\Phi}^{(1)}(o, u) \geq 0,\ \tilde{\Phi}^{(2)}(o, u) \geq 0$.
    Figure imgb0598
  • Moreover, Φ̃(1), Φ̃(2) and Φ̃(3) impact the observation in a predictable manner:
    • Φ̃(1) (the production of oil) is such that it decreases the reservoir pressure PR (the component o (1) of o), and increases the water-cut wct (component o (2)) and gas-oil ratio gor (component o (3)).
    • Φ̃(2) (the production of free gas) is such that it decreases the reservoir pressure PR .
    • Φ̃(3) (the production or injection of water) is such that it decreases the reservoir pressure PR (but it can be negative and then increase the reservoir pressure), and it increases the water-cut wct when we re-inject some water.
  • Therefore, there is a reduction of the number of possible observations each time controls are applied, as the water-cut and gas-oil ratio can only increase with time. Moreover, it also means the observation set
    Figure imgb0599
    can be ordered. Indeed, when in a given state x, with observation o = h̃(x), applying any controls increases the water-cut $w_{ct}$ and the gas-oil ratio $g_{or}$: $\forall (x, u) \in \mathbb{X}_d \times \mathbb{U}_d,\ \tilde{h}(x)^{(2)} \leq \tilde{h}(f(x, u))^{(2)}$, and $\tilde{h}(x)^{(3)} \leq \tilde{h}(f(x, u))^{(3)}$
    Figure imgb0600
  • Moreover, when the reservoir pressure PR increases, so does the water-cut. Hence, by ordering the observation by increasing wct and gor, and decreasing PR , one obtains an ordered observation, where two components (wct and gor ) can only increase with time.
  • To reduce the size of
    Figure imgb0601
    , we also consider that the controls are such that the different resulting productions of each fluid are multiples of each other, which allows multiple paths from one state to another, for example: $\exists (u, u') \in \mathbb{U}_d^2,\ \forall x \in \mathbb{X}_d,\ f(x, u) = f(f(x, u'), u')$
    Figure imgb0602
  • For a given state x and observation o = h̃(x), one can therefore order the controls along three directions, such that $u^o_{i,j,k}$
    Figure imgb0603
    verifies: $f(x, u^o_{2i,2j,2k}) = f\big(f(x, u^o_{i,j,k}), u^o_{i,j,k}\big)$.
    Figure imgb0604
  • It is now considered that there are l, m and n controls in the three directions.
  • Note: we assume here that $\tilde{h}(f(x, u^o_{i,j,k})) = o$
    Figure imgb0605
    . Moreover, this is an approximation of the state space. Indeed, if we compute $x' = f(x, u^o_{2i,2j,2k})$
    Figure imgb0606
    and $x'' = f\big(f(x, u^o_{i,j,k}), u^o_{i,j,k}\big)$
    Figure imgb0607
    , there is a slight difference between ${x'}^{(5)}$ and ${x''}^{(5)}$. However, we consider that x' = x'' to reduce the number of states we need to consider.
  • Construction of the reachable state space. An algorithm to efficiently construct the reachable state space $\mathbb{X}^R(x_0)$
    Figure imgb0608
    , $x_0 \in \operatorname{supp}(b_0)$, is now discussed. Indeed, the previous assumptions on the observations and controls allow the state space to be computed more efficiently. The algorithm is presented for the case where there is an order on the observations which respects the topological order of the state space, i.e. when $\forall (o, o') \in \mathbb{O}_d^2,\ o < o' \Rightarrow \forall (x, x') \in \tilde{h}^{-1}(o) \times \tilde{h}^{-1}(o'),\ x \notin \mathbb{X}^R(x')$.
    Figure imgb0609
  • That condition implies that if $\tilde{h}(x) < \tilde{h}(x')$, then x' cannot be a predecessor of x.
  • The implementations may construct an ordered reachable state space for each x 0 ∈ supp b 0 thanks to Algorithm 7 (discussed below), which returns the reachable state space and the successors of each state. Algorithm 7 complexity is in O d m T
    Figure imgb0610
    . An underlying assumption of the algorithm is that we have an ordered observation set. The algorithm can be adapted to the case where there is only a partial order on the observation with some additional refinement to get an ordered reachable state space.
  • When considering the previous assumptions on the controls and observations, the observations create a number of separations of the state space
    Figure imgb0611
    . On the other hand, applying the controls on a given state gives points that are on a deformed "discrete parallelepiped" (a set of points whose convex hull is a parallelepiped). The form of that parallelepiped depends on the observation. The implementations may create the reachable state space by putting multiple "discrete parallelepipeds" next to each other, until reaching the frontier delimiting the changes to the observation. Crossing the frontier gives new points on another observation, where the implementations may apply that construction again. The implementations continue the algorithm until
    Figure imgb0612
    "discrete parallelepipeds" are put in each of the three directions of the controls. The implementations compute the frontier of the "discrete parallelepiped" through the use of Algorithm 6 (discussed below). Finally, the implementations may get the successors of a given state x by the controls u by looking at the order of the other states in ListStates(o). Getting a list of successors may hence be computed when using Algorithm 7 without changing its complexity.
    Figure imgb0613
    Figure imgb0614
  • Construction of the reachable belief space. Now that the state space is constructed, the construction of the reachable belief space is discussed. To do so, the implementations browse through the ordered state space. As the states and successors are known, one only needs to go through the successors of each belief to get the ordered belief state thanks to Algorithm 9, which is in O md supp b 0 B R
    Figure imgb0615
    . The algorithm uses a representation of the belief similar to the Tables presented in Michael L. Littman, Algorithms for Sequential Decision Making, PhD thesis, Brown University, 1996, which is incorporated herein by reference.
  • Indeed, the implementations represent beliefs as tables Db of size |supp b 0 |.
  • Each component of the table represents the state the system would be in if the initial state were $x_{0,i} \in \operatorname{supp}(b_0)$. Hence, each component is in $\mathbb{X}_d \cup \{\delta\}$
    Figure imgb0616
    , where δ is an added element, the cemetery, which represents an empty state. When the i-th component of Db is δ, it means that the initial state could not have been x 0,i .
  • With this representation, the implementations get the probability of each state x under the belief b:
    $$b(x) = \frac{\sum_{i \in \{1, \dots, |\mathrm{supp}(b_0)|\},\ D_b(i) = x} b_0(x_{0,i})}{\sum_{j \in \{1, \dots, |\mathrm{supp}(b_0)|\},\ D_b(j) \neq \delta} b_0(x_{0,j})}$$
  • Moreover, the probability of going from b to b' = τ(b, u, o) (i.e. for b' a successor of b) is given by:
    $$Q(b, u, o) = \frac{\sum_{i \in \{1, \dots, |\mathrm{supp}(b_0)|\},\ D_{b'}(i) \neq \delta} b_0(x_{0,i})}{\sum_{j \in \{1, \dots, |\mathrm{supp}(b_0)|\},\ D_b(j) \neq \delta} b_0(x_{0,j})}$$
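  • The table representation of beliefs described above can be sketched as follows. This is a minimal illustration and not the implementation of the application: a table is assumed to be a Python tuple with one entry per initial state of supp b0, the cemetery δ is assumed to be encoded as None, and the function names are hypothetical. The transition probability is taken here to be the initial probability mass still alive in the successor table divided by the mass alive in the current table.

```python
CEMETERY = None  # assumed encoding of the cemetery element (delta)


def alive_mass(table, b0):
    """Initial probability mass of the components that are not in the cemetery."""
    return sum(p for s, p in zip(table, b0) if s is not CEMETERY)


def belief_probability(table, b0, x):
    """Probability b(x) of a (non-cemetery) state x under the belief encoded by `table`."""
    hit = sum(p for s, p in zip(table, b0) if s == x)
    return hit / alive_mass(table, b0)


def transition_probability(table, next_table, b0):
    """Probability of moving from `table` to its successor `next_table`."""
    return alive_mass(next_table, b0) / alive_mass(table, b0)
```

    With b0 = (0.5, 0.25, 0.25) and the table ("a", "b", None), the third initial state has already been ruled out, so the belief puts probability 0.5 / 0.75 on state "a".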
  • Algorithm 9 simply uses the function Successors (defined in Algorithm 8) to find the successors of a given belief. The added beliefs are ordered, since the different state spaces are ordered by the way the discretization of the controls is chosen. Hence, beliefs are always added after all their predecessors, which means that the implementations go through
    Figure imgb0619
    only once.
    Figure imgb0620
    Figure imgb0621
  • After applying Algorithm 9, the implementations have the belief space and the different transitions between the different beliefs. The implementations may therefore apply Algorithm 5 to solve Problem (4.1).
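  • The forward construction of a reachable belief space can be illustrated with the sketch below. It is not Algorithm 8 or Algorithm 9: it explores beliefs layer by layer without relying on the ordering of the state space, it ignores the conditioning on the very first observation, and it identifies beliefs reached at different times. Beliefs are assumed to be tuples with one entry per initial state, with None standing for the cemetery δ; all function names are hypothetical.

```python
CEMETERY = None  # assumed encoding of the cemetery element (delta)


def successor_table(table, u, o, f, h):
    """Apply control u to every component, keeping only those consistent with observation o."""
    out = []
    for s in table:
        if s is CEMETERY:
            out.append(CEMETERY)
            continue
        y = f(s, u)
        out.append(y if h(y) == o else CEMETERY)
    return tuple(out)


def reachable_beliefs(initial_states, controls, observations, f, h, horizon):
    """Enumerate belief tables reachable in at most `horizon` steps and their transitions."""
    start = tuple(initial_states)
    beliefs = {start}
    transitions = {}  # (table, u, o) -> successor table
    layer = {start}
    for _ in range(horizon):
        next_layer = set()
        for table in layer:
            for u in controls:
                for o in observations:
                    nxt = successor_table(table, u, o, f, h)
                    if all(s is CEMETERY for s in nxt):
                        continue  # observation o cannot occur after u from this belief
                    transitions[(table, u, o)] = nxt
                    if nxt not in beliefs:
                        beliefs.add(nxt)
                        next_layer.add(nxt)
        layer = next_layer
    return beliefs, transitions
```

    With the toy dynamics f(x, u) = x + u and observation h(x) = x % 2, starting from the initial states (0, 1), a single step with controls (0, 1) separates the two initial states for every observation, so each successor table has exactly one surviving component.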
  • First application: oil reservoir with water injection
  • The case of an oil reservoir where the pressure is kept constant by reinjecting water into the reservoir is now discussed. The deterministic version of that problem was treated hereinabove. A partial observation of the content of the reservoir is now added.
  • The state is reduced to the vector $x_t = (V_t^w, V_t^p)$, whereas the control is the bottom-hole pressure $u_t = P_t$. Enough water is injected to keep the reservoir pressure constant; hence the amount of water injected is not a control itself, but is deduced from the bottom-hole pressure $P_t$. The observation is the water-cut $w_t^{ct}$.
  • Full formulation
    Figure imgb0623
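    $$\begin{aligned}
    \text{s.t.}\quad & \mathcal{L}(V_0^w, V_0^p) = \mu_0, \\
    & w_t^{ct} = \mathrm{WCT}\!\left(\frac{V_t^w \, B_w(P^R)}{V^p}\right), \quad \forall t \in \mathcal{T}, \\
    & V_{t+1}^w = V_t^w - \alpha \, \frac{P^R - P_t}{B_w(P^R)} \left(w_t^{ct} - 1\right), \quad \forall t \in \mathcal{T}, \\
    & F_{\min}^w \le \alpha \, \frac{P^R - P_t}{B_w(P^R)} \, w_t^{ct} \le F_{\max}^w, \quad \forall t \in \mathcal{T}, \\
    & F_{\min}^o \le \alpha \, \frac{P^R - P_t}{B_o(P^R)} \left(1 - w_t^{ct}\right) \le F_{\max}^o, \quad \forall t \in \mathcal{T}, \\
    & P_t \ge 0, \quad \forall t \in \mathcal{T}, \\
    & \sigma(P_t) \subset \sigma\!\left(w_0^{ct}, \dots, w_t^{ct}, P_0, \dots, P_{t-1}\right), \quad \forall t \in \mathcal{T}.
    \end{aligned}$$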
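  • One time step of the water-injection model can be simulated as in the sketch below. It is an illustration under stated assumptions, not the implementation of the application: the water saturation is assumed to be the water volume times the formation volume factor divided by the pore volume, the water balance is assumed to read V(t+1) = V(t) + α (PR − Pt) (1 − wct) / Bw, the formation volume factors are passed as constants because the reservoir pressure is constant, and the function names, parameter names and numerical values are hypothetical.

```python
def step(v_w, p_bh, *, alpha, p_res, b_w, b_o, v_p, wct):
    """One step of the (assumed) water-injection dynamics.

    v_w   : water volume in the reservoir (standard conditions)
    p_bh  : bottom-hole pressure, the control P_t
    wct   : callable mapping a water saturation to a water-cut in [0, 1]
    Returns (next water volume, observed water-cut, water rate, oil rate).
    """
    w_ct = wct(v_w * b_w / v_p)            # observation at time t
    total = alpha * (p_res - p_bh)         # total rate driven by the drawdown
    water_rate = total / b_w * w_ct
    oil_rate = total / b_o * (1.0 - w_ct)
    v_w_next = v_w - total / b_w * (w_ct - 1.0)
    return v_w_next, w_ct, water_rate, oil_rate


def admissible(p_bh, water_rate, oil_rate, water_bounds, oil_bounds):
    """Check the sign constraint on the control and the bounds on both rates."""
    return (
        p_bh >= 0
        and water_bounds[0] <= water_rate <= water_bounds[1]
        and oil_bounds[0] <= oil_rate <= oil_bounds[1]
    )


def linear_wct(saturation):
    """Hypothetical water-cut curve: the saturation itself, clipped to [0, 1]."""
    return min(1.0, max(0.0, saturation))
```

    Since the water-cut lies in [0, 1], the water volume cannot decrease along a trajectory as long as the bottom-hole pressure stays below the reservoir pressure: the injected water replaces the produced oil.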
  • The state space and the belief space are created thanks to Algorithms 7 and 9. When considering $|\mathbb{U}_d| = 10$, $|\mathbb{O}_d| = 10$, $|\mathrm{supp}(b_0)| = 10$ and $T = 100$, one obtains Table 4.2. The bounds obtained with Theorem 8 and Proposition 13 are, respectively, $2.9 \times 10^{47}$ and 57226240. The size of the computed belief space is therefore far lower than either of the two bounds (by a factor of $10^{41}$ for the general det-POMDP bound, and of around 50 for the mon-det-POMDP bound).
  • The size of the problem is such that it can be solved in a reasonable time: generating the problem took 3200 seconds (applying both Algorithms 7 and 9), while solving it took 400 seconds (applying Algorithm 1). The code may be parallelized.
    Table 4.2: Size of the sets computed thanks to Algorithms 7 and 9
    Set considered | Cardinal of the set
    State space (Figure imgb0636) | 55885
    Belief space (Figure imgb0637) | 809665
  • The implementation of the method on a computer is now discussed.
  • The method is computer-implemented. This means that steps (or substantially all the steps) of the method are executed by at least one computer, or any similar system. Thus, steps of the method are performed by the computer, possibly fully automatically or semi-automatically. In examples, the triggering of at least some of the steps of the method may be performed through user-computer interaction. The level of user-computer interaction required may depend on the level of automatism foreseen and be balanced against the need to implement the user's wishes. In examples, this level may be user-defined and/or pre-defined.
  • A typical example of computer-implementation of a method is to perform the method with a system adapted for this purpose. The system may comprise a processor coupled to a memory and a graphical user interface (GUI), the memory having recorded thereon a computer program comprising instructions for performing the method. The memory may also store a database. The memory is any hardware adapted for such storage, possibly comprising several physical distinct parts (e.g. one for the program, and possibly one for the database).
  • FIG. 13 shows an example of the system, wherein the system is a client computer system, e.g. a workstation of a user.
  • The client computer of the example comprises a central processing unit (CPU) 1010 connected to an internal communication BUS 1000, a random access memory (RAM) 1070 also connected to the BUS. The client computer is further provided with a graphical processing unit (GPU) 1110 which is associated with a video random access memory 1100 connected to the BUS. Video RAM 1100 is also known in the art as frame buffer. A mass storage device controller 1020 manages accesses to a mass memory device, such as hard drive 1030. Mass memory devices suitable for tangibly embodying computer program instructions and data include all forms of nonvolatile memory, including by way of example semiconductor memory devices, such as EPROM, EEPROM, and flash memory devices; magnetic disks such as internal hard disks and removable disks; magneto-optical disks; and CD-ROM disks 1040. Any of the foregoing may be supplemented by, or incorporated in, specially designed ASICs (application-specific integrated circuits). A network adapter 1050 manages accesses to a network 1060. The client computer may also include a haptic device 1090 such as cursor control device, a keyboard or the like. A cursor control device is used in the client computer to permit the user to selectively position a cursor at any desired location on display 1080. In addition, the cursor control device allows the user to select various commands, and input control signals. The cursor control device includes a number of signal generation devices for input control signals to system. Typically, a cursor control device may be a mouse, the button of the mouse being used to generate the signals. Alternatively, or additionally, the client computer system may comprise a sensitive pad, and/or a sensitive screen.
  • The computer program may comprise instructions executable by a computer, the instructions comprising means for causing the above system to perform the method. The program may be recordable on any data storage medium, including the memory of the system. The program may for example be implemented in digital electronic circuitry, or in computer hardware, firmware, software, or in combinations of them. The program may be implemented as an apparatus, for example a product tangibly embodied in a machine-readable storage device for execution by a programmable processor. Method steps may be performed by a programmable processor executing a program of instructions to perform functions of the method by operating on input data and generating output. The processor may thus be programmable and coupled to receive data and instructions from, and to transmit data and instructions to, a data storage system, at least one input device, and at least one output device. The application program may be implemented in a high-level procedural or object-oriented programming language, or in assembly or machine language if desired. In any case, the language may be a compiled or interpreted language. The program may be a full installation program or an update program. Application of the program on the system results in any case in instructions for performing the method.

Claims (15)

  1. A computer-implemented method for multiperiod optimization of oil and/or gas production, the method comprising:
    - providing:
    ∘ a controlled dynamical system describing the evolution over time of a state of an oil and/or gas reservoir,
    ∘ a time-dependent admissible set of controls, the controls describing actions respecting constraints for controlling oil and/or gas flow and/or pressure,
    ∘ time-dependent observations of the content of the reservoir,
    - optimizing, with respect to the state of the reservoir, the controls and the observations, an expected value over a given time span of an objective production function of the state, the controls and the observations.
  2. The method of claim 1, wherein the controlled dynamical system comprises evolution equations derived from material balance equations and/or black oil models.
  3. The method of claim 2, wherein the controlled dynamical system is of the type:
    $$x_{t+1} = f(x_t, u_t),$$
    where t represents the time, $x_t$ the state of the reservoir at time t, and $u_t$ the controls at time t, and where f is of the type:
    $$f : (x, u) \mapsto \begin{pmatrix} x^{(1)} - \Phi^{(1)}(x, u) \\ x^{(2)} - \Phi^{(2)}(x, u) + x^{(1)} R_s(x^{(5)}) - \left(x^{(1)} - \Phi^{(1)}(x, u)\right) R_s(\Xi(x, u)) \\ x^{(3)} - \Phi^{(3)}(x, u) \\ x^{(5)} \left(1 + c_f \left(\Xi(x, u) - x^{(5)}\right)\right) \\ \Xi(x, u) \end{pmatrix}$$
    where:
    $x = (x^{(1)}, x^{(2)}, x^{(3)}, x^{(4)}, x^{(5)})$,
    $R_s$ represents dissolved gas,
    $c_f$ represents the pore compressibility of the reservoir,
    $(x, u) \mapsto \Phi(x, u)$ represents production values as a function of (x, u),
    $\Xi$ is a function such that $P^R_{t+1} = \Xi(x_t, u_t)$, where $P^R$ represents a reservoir pressure.
  4. The method of any one of claims 1 to 3, wherein the optimizing comprises solving an optimization problem of the type:
    Figure imgb0641
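    $$\begin{aligned}
    \text{s.t.}\quad & \mathcal{L}(X_0) = \mu_0, \\
    & X_{t+1} = f(X_t, U_t), \quad \forall t \in \mathcal{T}, \\
    & O_t = h(X_t), \quad \forall t \in \mathcal{T}, \\
    & U_t \in \mathcal{U}_t^{ad}(X_t), \quad \forall t \in \mathcal{T}, \\
    & \sigma(U_t) \subset \sigma(O_0, \dots, O_t, U_0, \dots, U_{t-1}), \quad \forall t \in \mathcal{T},
    \end{aligned}$$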
    where:
    X, O, U are respectively the state of the reservoir, the observations, and the controls,
    Figure imgb0647
    is a finite set of time steps, where
    Figure imgb0648
    is a positive integer,
    Lt is the objective production function at time t,
    Figure imgb0649
    is an objective final production function,
    $\mu_0$ is a probability distribution representing an initial state of the reservoir,
    $X_{t+1} = f(X_t, U_t)$ corresponds to the dynamical system,
    h is an observation function,
    $\mathcal{U}_t^{ad}$ represents a set of admissible controls at time t.
  5. The method of any one of claims 1 to 4, wherein the observations comprise partial observations.
  6. The method of claim 5, wherein the observations depend only on the state of the reservoir.
  7. The method of claim 6, wherein the observations are observation functions of the form
    $$O_t = h(X_t),$$
    where $X_t$, $O_t$ represent respectively the state of the reservoir and the observations at time t, and where h is of the type
    $$h(x) = \left(x^{(5)},\ w^{ct}(x^{(3)}, x^{(4)}, x^{(5)}),\ g^{or}(x^{(2)}, x^{(4)}, x^{(5)})\right),$$
    where $w^{ct}$ is a function representing a water-cut and $g^{or}$ is a function representing a gas-oil ratio, and where $x = (x^{(1)}, x^{(2)}, x^{(3)}, x^{(4)}, x^{(5)})$.
  8. The method of any one of claims 5 to 7, wherein the optimization comprises solving an optimization problem that is a Deterministic Partially Observed Markov Decision Process (det-POMDP).
  9. The method of claim 8, wherein the optimization comprises discretizing the optimization problem.
  10. The method of claim 9, wherein discretizing the optimization problem comprises providing a discrete control set and a discrete observation set and building a discrete state space by recursively applying the dynamics on a given initial state with associated controls, the discrete state space being a set of the states reachable from the given initial state.
  11. The method of claim 10, wherein discretizing the optimization problem comprises constructing a state of beliefs, which are probabilities on the discrete state space.
  12. The method of claim 11, wherein the Deterministic Partially Observed Markov Decision Process has monotonicity, such that the state of reachable beliefs is included in a subset of the probability space.
  13. A computer program comprising instructions for performing the method of any one of claims 1 to 12.
  14. A computer-readable data storage medium having recorded thereon the computer program of claim 13.
  15. A computer system comprising a processor coupled to a memory, the memory having recorded thereon the computer program of claim 13.
EP21306844.8A 2021-12-17 2021-12-17 Multiperiod optimization of oil and/or gas production Pending EP4198258A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
EP21306844.8A EP4198258A1 (en) 2021-12-17 2021-12-17 Multiperiod optimization of oil and/or gas production
US18/046,410 US20230195145A1 (en) 2021-12-17 2022-10-13 Multiperiod Optimization Of Oil And/Or Gas Production

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
EP21306844.8A EP4198258A1 (en) 2021-12-17 2021-12-17 Multiperiod optimization of oil and/or gas production

Publications (1)

Publication Number Publication Date
EP4198258A1 true EP4198258A1 (en) 2023-06-21

Family

ID=80034747

Family Applications (1)

Application Number Title Priority Date Filing Date
EP21306844.8A Pending EP4198258A1 (en) 2021-12-17 2021-12-17 Multiperiod optimization of oil and/or gas production

Country Status (2)

Country Link
US (1) US20230195145A1 (en)
EP (1) EP4198258A1 (en)

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120130696A1 (en) * 2009-08-12 2012-05-24 Exxonmobil Upstream Research Company Optimizing Well Management Policy

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
AWASTHI UTSAV ET AL: "Multiperiod optimization model for oilfield production planning: bicriterion optimization and two-stage stochastic programming model", OPTIMIZATION AND ENGINEERING, SPRINGER NEW YORK LLC, US, vol. 20, no. 4, 19 July 2019 (2019-07-19), pages 1227 - 1248, XP036914101, ISSN: 1389-4420, [retrieved on 20190719], DOI: 10.1007/S11081-019-09455-0 *
BONET BLAI: "Deterministic POMDPs Revisited", 9 May 2012 (2012-05-09), XP055920685, Retrieved from the Internet <URL:https://arxiv.org/ftp/arxiv/papers/1205/1205.2659.pdf> [retrieved on 20220512] *
CYRILLE VESSAIRE ET AL: "Multistage Optimization of a Petroleum Production System with Material Balance Model", ARXIV.ORG, CORNELL UNIVERSITY LIBRARY, 201 OLIN LIBRARY CORNELL UNIVERSITY ITHACA, NY 14853, 4 January 2022 (2022-01-04), XP091130347 *

Also Published As

Publication number Publication date
US20230195145A1 (en) 2023-06-22

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION HAS BEEN PUBLISHED

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: REQUEST FOR EXAMINATION WAS MADE

17P Request for examination filed

Effective date: 20231221

RBV Designated contracting states (corrected)

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR