WO2020028298A1 - Method for geological steering control through reinforcement learning - Google Patents
Method for geological steering control through reinforcement learning Download PDFInfo
- Publication number
- WO2020028298A1 WO2020028298A1 PCT/US2019/044036 US2019044036W WO2020028298A1 WO 2020028298 A1 WO2020028298 A1 WO 2020028298A1 US 2019044036 W US2019044036 W US 2019044036W WO 2020028298 A1 WO2020028298 A1 WO 2020028298A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- well
- reward
- combinations
- geological
- model
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 67
- 230000002787 reinforcement Effects 0.000 title claims description 8
- 230000008569 process Effects 0.000 claims abstract description 26
- 238000005259 measurement Methods 0.000 claims abstract description 4
- 230000006870 function Effects 0.000 claims description 43
- 238000005553 drilling Methods 0.000 claims description 26
- 239000003795 chemical substances by application Substances 0.000 claims description 24
- 238000004088 simulation Methods 0.000 claims description 16
- 230000015572 biosynthetic process Effects 0.000 claims description 13
- 230000001667 episodic effect Effects 0.000 claims description 7
- 239000011159 matrix material Substances 0.000 claims description 3
- 238000012549 training Methods 0.000 claims description 3
- 238000012937 correction Methods 0.000 claims description 2
- 230000000704 physical effect Effects 0.000 claims description 2
- 238000012360 testing method Methods 0.000 description 6
- 230000005251 gamma ray Effects 0.000 description 4
- 239000011435 rock Substances 0.000 description 4
- XQCFHQBGMWUEMY-ZPUQHVIOSA-N Nitrovin Chemical compound C=1C=C([N+]([O-])=O)OC=1\C=C\C(=NNC(=N)N)\C=C\C1=CC=C([N+]([O-])=O)O1 XQCFHQBGMWUEMY-ZPUQHVIOSA-N 0.000 description 2
- 230000009471 action Effects 0.000 description 2
- 238000013528 artificial neural network Methods 0.000 description 2
- 239000012530 fluid Substances 0.000 description 2
- 239000003550 marker Substances 0.000 description 2
- 238000005457 optimization Methods 0.000 description 2
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 2
- MKYBYDHXWVHEJW-UHFFFAOYSA-N N-[1-oxo-1-(2,4,6,7-tetrahydrotriazolo[4,5-c]pyridin-5-yl)propan-2-yl]-2-[[3-(trifluoromethoxy)phenyl]methylamino]pyrimidine-5-carboxamide Chemical compound O=C(C(C)NC(=O)C=1C=NC(=NC=1)NCC1=CC(=CC=C1)OC(F)(F)F)N1CC2=C(CC1)NN=N2 MKYBYDHXWVHEJW-UHFFFAOYSA-N 0.000 description 1
- 230000006978 adaptation Effects 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- 238000013531 bayesian neural network Methods 0.000 description 1
- 235000009508 confectionery Nutrition 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 230000006378 damage Effects 0.000 description 1
- 238000013135 deep learning Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000005183 dynamical system Methods 0.000 description 1
- 230000010429 evolutionary process Effects 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 238000010801 machine learning Methods 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000035699 permeability Effects 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
Classifications
-
- E—FIXED CONSTRUCTIONS
- E21—EARTH OR ROCK DRILLING; MINING
- E21B—EARTH OR ROCK DRILLING; OBTAINING OIL, GAS, WATER, SOLUBLE OR MELTABLE MATERIALS OR A SLURRY OF MINERALS FROM WELLS
- E21B7/00—Special methods or apparatus for drilling
- E21B7/04—Directional drilling
Definitions
- the present invention relates to the field of geosteering and, in particular, to a method for autonomous geosteering for a well-boring process.
- rock destruction is guided by a drilling assembly.
- the drilling assembly includes sensors and actuators for biasing the trajectory and determining the heading in addition to properties of the surrounding borehole media.
- the intentional guiding of a trajectory to remain within the same rock or fluid and/or along a fluid boundary such as an oil/water contact or an oil/gas contact is known as geosteering.
- Geosteering is drilling a horizontal wellbore that ideally is located within or near preferred rock layers. As interpretive analysis is performed while or after drilling, geosteering determines and communicates a wellbore's stratigraphic depth location in part by estimating local geometric bedding structure. Modern geosteering normally incorporates more dimensions of information, including insight from downhole data and quantitative correlation methods. Ultimately, geosteering provides explicit approximation of the location of nearby geologic beds in relationship to a wellbore and coordinate system.
- Geosteering relies on mapping data acquired in the structural domain along the horizontal wellbore and into the stratigraphic depth domain.
- Relative Stratigraphic Depth means that the depth in question is oriented in the stratigraphic depth direction and is relative to a geologic marker. Such a marker is typically chosen from type log data to be the top of the pay zone/target layer.
- the actual drilling target or“sweet spot” is located at an onset stratigraphic distance from the top of the pay zone/target layer.
- US8,892,407B2 (ExxonMobil) relates to a process for well trajectory planning.
- the process involves receiving data relevant to drilling and completion of an oil or gas well, and to reservoir development.
- Well trajectory and drilling and completion decision parameters are simultaneously calculated using a Markov decision process-based model that accounts for an uncertain parameter to optimize an objective function that generates a plan for drilling and completion of one or more oil or gas wells.
- the objective function optimizes one or more performance metrics that include reservoir performance, well drilling performance, and financial performance, subject to satisfying constraints on the drilling.
- a method for autonomous geosteering for a well-boring process comprising the steps of: (a) providing a trained function approximating agent; (b) determining a geological objective; (c) determining a sequence of control inputs to steer a well-boring tool towards the geological objective, wherein the trained function approximating agent is adapted to enact the sequence of control inputs upon receiving a signal from a measurement from the well-boring process.
- Fig.1 illustrates a result of one embodiment of the present invention
- Fig.2 illustrates one embodiment of a reward function suitable for the method of the present invention
- Fig.3 is a graphical representation of the results of a first test of a simulation environment produced according to the method of the present invention
- Fig.4 is a graphical representation of the results of a second test of a simulation environment produced according to the method of the present invention.
- Fig.5 is a graphical representation of the results of a third test of a simulation environment produced according to the method of the present invention.
- Fig.6 is a graphical representation of the results of a fourth test of a simulation environment produced according to the method of the present invention.
- the present invention provides a method for autonomous geosteering using a trained function approximating agent.
- the method is a computer-implemented method.
- function approximating agent we mean a process for finding an underlying relationship from a given finite set of input-output data.
- function approximating agents include neural networks, such as backpropagation-enabled processes, including deep learning, machine learning, frequency neural networks, Bayesian neural networks, Gaussian processes, polynomials, and derivative-free processes, such as annealing processes, evolutionary processes and sampling processes.
- the function approximating agent is trained on a physical simulator approximating a real geological and drilling operation, for example, in the intended subterranean formation.
- the function approximating agent may be trained by (a) providing an earth model defining boundaries between formation layers and petrophysical properties of the formation layers in a subterranean formation comprising data selected from the group consisting of seismic data, data from an offset well and combinations thereof, and producing a set of model coefficients; (b) providing a toolface input corresponding to the set of model coefficients to a drilling attitude model for determining a drilling attitude state; (c) determining a drill bit position in the subterranean formation from the drilling attitude state; (d) feeding the drill bit position to the training earth model, and determining an updated set of model coefficients for a predetermined interval and a set of signals representing physical properties of the subterranean formation for the drill bit position; (e) inputting the set of signals to a sensor model for producing at least one sensor output and determining a sensor reward from the at least one sensor output;(f) correlating the toolface input and the corresponding drilling attitude state, drill bit position, set of model
- the drilling model for the simulation environment may be a kinematic model, a dynamical system model, a finite element model, a Markov decision process, and
- Preferred examples of function approximating agents include stochastic clustering and pattern matching, greedy Monte Carlo, differential dynamic programming, and combinations and derivatives thereof.
- the function approximating agent is trained by reinforcement learning, deep reinforcement learning, approximate dynamic programming, stochastic optimal control, and combinations thereof.
- a sequence of control inputs is determined to steer a well-boring tool towards a geological objective.
- the geological objective may, for example, without limitation, a relative 1D position, a relative 2D position, a relative 3D position, a dip angle, a strike angle, and combinations thereof.
- the sequence of control inputs includes, without limitation, curvature, roll angle, set points for inclination, set points for azimuth, Euler angle, rotation matrix quaternions, angle axis, position vector, position Cartesian, polar, and combinations thereof
- the trained function approximating agent is adapted to enact the sequence of control inputs upon receiving a signal from a measurement from the well-boring process.
- a reward function is used in the method of the present invention. More preferably, the reward function is based on a reward objective including, without limitation, shortest distance to the geological objective, lowest percentage of out-of-zone time, lowest deviation from targeted relative stratigraphic depth, lowest deviation from a well plan, reaching a target waypoint, consistency with target heading, lowest number of steering correction control signals, minimizing angular deviation, and combinations thereof. More preferably, the reward function further includes, without limitation, negative rewards for reduced drilling speed, increased wear on drill bit, proximity to region identified as being nearby a well, proximity to region having a geological feature that should be avoided, and combinations thereof. Preferably, the reward function includes negative rewards for angular deviation, tortuosity, excess curvature, and combinations thereof.
- Examples of a geological objective include an existing well, a target well path for a future well, simulations of an existing well, simulations of a target well path for a future well, and combinations thereof.
- a target well path avoids collision with an existing well.
- the reward function has a positive reward for colliding with the geological objective.
- the reward function includes a positive episodic reward for an episodic action including, without limitation, reaching a predetermined end depth, reaching a target zone, extending a predetermined number of feet in a target zone, and combinations thereof.
- the reward function may also include a negative reward for an episodic action including, without limitation, missing the target, deviating too far from a predetermined geological datum, entering into a no-go zone, and combinations thereof.
- Examples of a no-go zone include, without limitation, lease lines, permeability, porosity, petrophysical properties, nearby wells, and the like.
- Examples of a geological datum can be, for example, without limitation, a rock formation boundary, a geological feature, an offset well, an oil/water contact, an oil/gas contact, an oil/tar contact and combinations thereof.
- the well-boring process is modeled as a Markov decision process.
- Model Predictive Control which reframes the task of following a trajectory as an optimization problem.
- the solution to the optimization problem is the optimal trajectory.
- Model Predictive Control involves simulating different actuator inputs, predicting the resulting trajectory and selecting that trajectory with a minimum cost. Parameters involved are starting state, process model, reference trajectory, errors, length, duration, cost function and constraints.
- Fig.1 illustrates one embodiment of a reward function.
- the vertical dashed lines represent a user-defined tolerance.
- the shape of the curve can also be selected by the user, depending on the user’s objective.
- the reward function is selected to balance precision and speed, in this case with a coasting threshold of 0.60 m (2 ft) and a coasting bonus of 0.3.
- the coasting threshold is the distance from the well plan at which the user wants the bottom hole assembly to prioritize speed over accuracy.
- a synthetic well was generated based on an actual gamma ray log.
- the real data is identified by a type log gamma ray plot 62.
- a boundary 64 representing the top of a target formation was determined and a synthetic true well path 66 was generated.
- Region 72 represents a 1.5-m (5- foot) error about the true well path 66
- region 74 represents a 3-m (10-foot) error about the well path 66.
- the goal of the test was to match the true well path 66 as best as possible.
- Example 1– 4 the function approximating agent is described in co- pending application entitled“Process for Real Time Geological Localization with Bayesian Reinforcement Learning” filed in the USPTO on the same day as the present application, as provisional application US62/712,518 filed 31 July 2018, the entirety of which is incorporated by reference herein.
- the Bayesian Reinforcement Learning (BRL) function approximating agent was trained according to the method described in co-pending application entitled “Method for Simulating a Coupled Geological and Drilling Environment” filed in the USPTO on the same day as the present application, as provisional application US62/712,490 filed 31 July 2018, the entirety of which is incorporated by reference herein.
- Well log gamma ray data 76 was fed to the trained agent and a set of control inputs, in this case well inclination angle 78, was used to steer the well-boring along the true well path 66, according to the method described herein.
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Geology (AREA)
- Mining & Mineral Resources (AREA)
- Physics & Mathematics (AREA)
- Environmental & Geological Engineering (AREA)
- Fluid Mechanics (AREA)
- General Life Sciences & Earth Sciences (AREA)
- Geochemistry & Mineralogy (AREA)
- Geophysics And Detection Of Objects (AREA)
Abstract
A method for autonomous geosteering for a well-boring process uses a trained function approximating agent. A geological objective is determined. Then, using the trained function approximating agent, a sequence of control inputs is determined to steer a well-boring tool towards the geological objective. The trained function approximating agent is adapted to enact the sequence of control inputs upon receiving a signal from a measurement from the well-boring process.
Description
METHOD FOR GEOLOGICAL STEERING CONTROL
THROUGH REINFORCEMENT LEARNING FIELD OF THE INVENTION
[0001] The present invention relates to the field of geosteering and, in particular, to a method for autonomous geosteering for a well-boring process. BACKGROUND OF THE INVENTION
[0002] In a well construction process, rock destruction is guided by a drilling assembly. The drilling assembly includes sensors and actuators for biasing the trajectory and determining the heading in addition to properties of the surrounding borehole media. The intentional guiding of a trajectory to remain within the same rock or fluid and/or along a fluid boundary such as an oil/water contact or an oil/gas contact is known as geosteering.
[0003] Geosteering is drilling a horizontal wellbore that ideally is located within or near preferred rock layers. As interpretive analysis is performed while or after drilling, geosteering determines and communicates a wellbore's stratigraphic depth location in part by estimating local geometric bedding structure. Modern geosteering normally incorporates more dimensions of information, including insight from downhole data and quantitative correlation methods. Ultimately, geosteering provides explicit approximation of the location of nearby geologic beds in relationship to a wellbore and coordinate system.
[0004] Geosteering relies on mapping data acquired in the structural domain along the horizontal wellbore and into the stratigraphic depth domain. Relative Stratigraphic Depth (RSD) means that the depth in question is oriented in the stratigraphic depth direction and is relative to a geologic marker. Such a marker is typically chosen from type log data to be the top of the pay zone/target layer. The actual drilling target or“sweet spot” is located at an onset stratigraphic distance from the top of the pay zone/target layer.
[0005] US8,892,407B2 (ExxonMobil) relates to a process for well trajectory planning. The process involves receiving data relevant to drilling and completion of an oil or gas well, and to reservoir development. Well trajectory and drilling and completion decision parameters are simultaneously calculated using a Markov decision process-based model that accounts for an uncertain parameter to optimize an objective function that generates a plan for drilling and completion of one or more oil or gas wells. The objective function optimizes one
or more performance metrics that include reservoir performance, well drilling performance, and financial performance, subject to satisfying constraints on the drilling.
[0006] There is a need for autonomous geosteering that is trained by a function approximating agent. SUMMARY OF THE INVENTION
[0007] According to one aspect of the present invention, there is provided a method for autonomous geosteering for a well-boring process, comprising the steps of: (a) providing a trained function approximating agent; (b) determining a geological objective; (c) determining a sequence of control inputs to steer a well-boring tool towards the geological objective, wherein the trained function approximating agent is adapted to enact the sequence of control inputs upon receiving a signal from a measurement from the well-boring process. BRIEF DESCRIPTION OF THE DRAWINGS
[0008] The method of the present invention will be better understood by referring to the following detailed description of preferred embodiments and the drawings referenced therein, in which:
[0009] Fig.1 illustrates a result of one embodiment of the present invention;
[00010] Fig.2 illustrates one embodiment of a reward function suitable for the method of the present invention;
[00011] Fig.3 is a graphical representation of the results of a first test of a simulation environment produced according to the method of the present invention;
[00012] Fig.4 is a graphical representation of the results of a second test of a simulation environment produced according to the method of the present invention;
[00013] Fig.5 is a graphical representation of the results of a third test of a simulation environment produced according to the method of the present invention; and
[00014] Fig.6 is a graphical representation of the results of a fourth test of a simulation environment produced according to the method of the present invention.
DETAILED DESCRIPTION OF THE INVENTION
[00015] The present invention provides a method for autonomous geosteering using a trained function approximating agent. The method is a computer-implemented method.
[00016] By“function approximating agent” we mean a process for finding an underlying relationship from a given finite set of input-output data. Examples of function approximating agents include neural networks, such as backpropagation-enabled processes, including deep learning, machine learning, frequency neural networks, Bayesian neural networks, Gaussian processes, polynomials, and derivative-free processes, such as annealing processes, evolutionary processes and sampling processes.
[00017] Preferably, the function approximating agent is trained on a physical simulator approximating a real geological and drilling operation, for example, in the intended subterranean formation.
[00018] Preferably, the function approximating agent is trained according to the method described in co-pending application entitled“Method for Simulating a Coupled Geological and Drilling Environment” filed in the USPTO on the same day as the present application, as provisional application US62/712,490 filed 31 July 2018, the entirety of which is incorporated by reference herein.
[00019] In a preferred embodiment, the function approximating agent may be trained by (a) providing an earth model defining boundaries between formation layers and petrophysical properties of the formation layers in a subterranean formation comprising data selected from the group consisting of seismic data, data from an offset well and combinations thereof, and producing a set of model coefficients; (b) providing a toolface input corresponding to the set of model coefficients to a drilling attitude model for determining a drilling attitude state; (c) determining a drill bit position in the subterranean formation from the drilling attitude state; (d) feeding the drill bit position to the training earth model, and determining an updated set of model coefficients for a predetermined interval and a set of signals representing physical properties of the subterranean formation for the drill bit position; (e) inputting the set of signals to a sensor model for producing at least one sensor output and determining a sensor reward from the at least one sensor output;(f) correlating the toolface input and the corresponding drilling attitude state, drill bit position, set of model coefficients, and the at least one sensor output and sensor reward in the simulation environment; and (g) repeating steps b)– f) using the updated set of model coefficients from step d).
[00020] The drilling model for the simulation environment may be a kinematic model, a dynamical system model, a finite element model, a Markov decision process, and
combinations thereof.
[00021] Preferred examples of function approximating agents include stochastic clustering and pattern matching, greedy Monte Carlo, differential dynamic programming, and combinations and derivatives thereof.
[00022] Preferably, the function approximating agent is trained by reinforcement learning, deep reinforcement learning, approximate dynamic programming, stochastic optimal control, and combinations thereof.
[00023] According to the method of the present invention, a sequence of control inputs is determined to steer a well-boring tool towards a geological objective. The geological objective may, for example, without limitation, a relative 1D position, a relative 2D position, a relative 3D position, a dip angle, a strike angle, and combinations thereof. The sequence of control inputs includes, without limitation, curvature, roll angle, set points for inclination, set points for azimuth, Euler angle, rotation matrix quaternions, angle axis, position vector, position Cartesian, polar, and combinations thereof
[00024] The trained function approximating agent is adapted to enact the sequence of control inputs upon receiving a signal from a measurement from the well-boring process.
[00025] Preferably, a reward function is used in the method of the present invention. More preferably, the reward function is based on a reward objective including, without limitation, shortest distance to the geological objective, lowest percentage of out-of-zone time, lowest deviation from targeted relative stratigraphic depth, lowest deviation from a well plan, reaching a target waypoint, consistency with target heading, lowest number of steering correction control signals, minimizing angular deviation, and combinations thereof. More preferably, the reward function further includes, without limitation, negative rewards for reduced drilling speed, increased wear on drill bit, proximity to region identified as being nearby a well, proximity to region having a geological feature that should be avoided, and combinations thereof. Preferably, the reward function includes negative rewards for angular deviation, tortuosity, excess curvature, and combinations thereof.
[00026] Examples of a geological objective include an existing well, a target well path for a future well, simulations of an existing well, simulations of a target well path for a future well, and combinations thereof. Often, a target well path avoids collision with an existing well. However, there are times when collision with an existing well is the objective, for example, without limitation, when the objective is a relief well. In this case, the reward function has a positive reward for colliding with the geological objective.
[00027] In another embodiment, the reward function includes a positive episodic reward for an episodic action including, without limitation, reaching a predetermined end depth, reaching a target zone, extending a predetermined number of feet in a target zone, and combinations thereof. The reward function may also include a negative reward for an episodic action including, without limitation, missing the target, deviating too far from a predetermined geological datum, entering into a no-go zone, and combinations thereof.
Examples of a no-go zone include, without limitation, lease lines, permeability, porosity, petrophysical properties, nearby wells, and the like. Examples of a geological datum can be, for example, without limitation, a rock formation boundary, a geological feature, an offset well, an oil/water contact, an oil/gas contact, an oil/tar contact and combinations thereof.
[00028] The output action can be an action including, without limitation, curvature, roll angle, set points for inclination, set points for azimuth, Euler angle, rotation matrix quaternions, angle axis, position vector, position Cartesian, polar, and combinations thereof.
[00029] In a preferred embodiment, the well-boring process is modeled as a Markov decision process.
[00030] Preferably, the trained function approximating agent is solved by Model Predictive Control, which reframes the task of following a trajectory as an optimization problem. The solution to the optimization problem is the optimal trajectory. Model Predictive Control involves simulating different actuator inputs, predicting the resulting trajectory and selecting that trajectory with a minimum cost. Parameters involved are starting state, process model, reference trajectory, errors, length, duration, cost function and constraints.
[00031] Two embodiments are illustrated below:
[00032] Referring now to Fig.1, the accuracy of the method the present invention is illustrated by the solid trajectory lines and their proximation to the dashed well plan lines. The deviation from the well plans at the beginning of the tests is caused in large measure by controls to avoid curvature angles that are unrealistic for a drilling assembly. As shown in Fig.1, the sideforce is curvature.
[00033] Fig.2 illustrates one embodiment of a reward function. The vertical dashed lines represent a user-defined tolerance. The shape of the curve can also be selected by the user, depending on the user’s objective. As shown in Fig.2, the reward function is selected to balance precision and speed, in this case with a coasting threshold of 0.60 m (2 ft) and a coasting bonus of 0.3. The coasting threshold is the distance from the well plan at which the user wants the bottom hole assembly to prioritize speed over accuracy. EXAMPLES 1– 4
[00034] The accuracy of the simulation environment produced in accordance with the present invention was tested by training a function approximating agent.
[00035] Referring now to Figs.3– 6, a synthetic well was generated based on an actual gamma ray log. The real data is identified by a type log gamma ray plot 62. Based on the type log gamma ray plot 62, a boundary 64 representing the top of a target formation was determined and a synthetic true well path 66 was generated. Region 72 represents a 1.5-m (5- foot) error about the true well path 66, while region 74 represents a 3-m (10-foot) error about the well path 66. The goal of the test was to match the true well path 66 as best as possible.
[00036] In each of Example 1– 4, the function approximating agent is described in co- pending application entitled“Process for Real Time Geological Localization with Bayesian Reinforcement Learning” filed in the USPTO on the same day as the present application, as provisional application US62/712,518 filed 31 July 2018, the entirety of which is incorporated by reference herein. The Bayesian Reinforcement Learning (BRL) function approximating agent was trained according to the method described in co-pending application entitled “Method for Simulating a Coupled Geological and Drilling Environment” filed in the USPTO on the same day as the present application, as provisional application US62/712,490 filed 31 July 2018, the entirety of which is incorporated by reference herein.
[00037] Well log gamma ray data 76 was fed to the trained agent and a set of control inputs, in this case well inclination angle 78, was used to steer the well-boring along the true well path 66, according to the method described herein.
[00038] The well path 82 resulting from the BRL agent and the well path 84 resulting from the BRL agent with mean square error demonstrated good fit to the true well path 66. As shown in Figs.3– 6, the fit of well paths 82 and 84 improved over time with a reward function described in the autonomous geosteering method.
[00039] While preferred embodiments of the present disclosure have been described, it should be understood that various changes, adaptations and modifications can be made therein without departing from the spirit of the invention(s) as claimed below.
Claims
1. A method for autonomous geosteering for a well-boring process, comprising the steps of:
a) providing a trained function approximating agent;
b) determining a geological objective;
c) determining a sequence of control inputs to steer a well-boring tool towards the geological objective,
wherein the trained function approximating agent is adapted to enact the sequence of control inputs upon receiving a signal from a measurement from the well-boring process.
2. The method of claim 1, further comprising the step of providing a reward function.
3. The method of claim 2, wherein the reward function is based on a reward objective selected from the group consisting of shortest distance to the geological objective, lowest percentage of out-of-zone time, lowest deviation from targeted relative stratigraphic depth, lowest deviation from a well plan, reaching a target waypoint, consistency with target heading, lowest number of steering correction control signals, minimizing angular deviation and combinations thereof.
4. The method of claim 3, wherein the reward function comprises negative rewards for reduced drilling speed, increased wear on drill bit, proximity to region identified as being nearby a well, proximity to region having a geological feature that should be avoided, and combinations thereof.
5. The method of claim 2, wherein the reward function comprises negative rewards for angular deviation, tortuosity, excess curvature, and combinations thereof.
6. The method of claim 2, wherein the reward function comprises a positive episodic reward for an episodic action selected from the group consisting of reaching a predetermined end depth, reaching a target zone, extending a predetermined number of feet in a target zone, and combinations thereof.
7. The method of claim 2, wherein the reward function comprises a negative episodic reward for an episodic action selected from the group consisting of missing the target, deviating too far from a predetermined geological datum, entering into a no-go zone, and combinations thereof.
8. The method of claim 2, wherein the geological objective is selected from the group consisting of an existing well, a target well path for a future well, simulations of an existing well, simulations of a target well path for a future well, and combinations thereof, and wherein the reward function comprises a positive reward for colliding with the geological objective.
9. The method of claim 1, wherein the function approximating agent is trained by a function approximating process selected from the group consisting of reinforcement learning, deep reinforcement learning, approximate dynamic programming, stochastic optimal control, and combinations thereof.
10. The method of claim 1, wherein the well-boring process is modelled as a Markov decision process.
11. The method of claim 1, wherein the trained function approximating agent is solved by Model Predictive Control with respect to a simulation environment or a state space model.
12. The method of claim 1, wherein the sequence of control inputs is selected from the group consisting of curvature, roll angle, set points for inclination, set points for azimuth, Euler angle, rotation matrix quaternions, angle axis, position vector, position Cartesian, polar, and combinations thereof.
13. The method of claim 1, wherein the geological objective is selected from the group consisting of a relative 1D position, a relative 2D position, a relative 3D position, a dip angle, a strike angle, and combinations thereof.
14. The method of claim 1, wherein the function approximating agent is trained in a simulation environment.
15. The method of claim 14, wherein the simulation environment approximates a real geological and drilling operation.
16. The method of claim 14, wherein the simulation environment is produced by a training method comprising the steps of: a) providing an earth model defining boundaries between formation layers and petrophysical properties of the formation layers in a subterranean formation comprising data selected from the group consisting of seismic data, data from an offset well and combinations thereof, and producing a set of model coefficients; b) providing a toolface input corresponding to the set of model coefficients to a drilling attitude model for determining a drilling attitude state; c) determining a drill bit position in the subterranean formation from the drilling attitude state; d) feeding the drill bit position to the earth model, and determining an updated set of model coefficients for a predetermined interval and a set of signals representing physical properties of the subterranean formation for the drill bit position; e) inputting the set of signals to a sensor model for producing at least one sensor output and determining a sensor reward from the at least one sensor output; f) correlating the toolface input and the corresponding drilling attitude state, drill bit position, set of model coefficients, and the at least one sensor output and sensor reward in the simulation environment; and g) repeating steps b)– f) using the updated set of model coefficients from step d).
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US17/263,986 US20210310347A1 (en) | 2018-07-31 | 2019-07-30 | Method for geological steering control through reinforcement learning |
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201862712506P | 2018-07-31 | 2018-07-31 | |
US62/712,506 | 2018-07-31 | ||
EP18194442 | 2018-09-14 | ||
EP18194442.2 | 2018-09-14 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2020028298A1 true WO2020028298A1 (en) | 2020-02-06 |
Family
ID=67620533
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2019/044036 WO2020028298A1 (en) | 2018-07-31 | 2019-07-30 | Method for geological steering control through reinforcement learning |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2020028298A1 (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113464120A (en) * | 2021-09-06 | 2021-10-01 | 中国石油集团川庆钻探工程有限公司 | Tool face state prediction method and system, and sliding directional drilling method and system |
WO2022104324A1 (en) * | 2020-11-12 | 2022-05-19 | Schlumberger Technology Corporation | Multi-agent drilling decision system and method |
US11828155B2 (en) | 2019-05-21 | 2023-11-28 | Schlumberger Technology Corporation | Drilling control |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8892407B2 (en) | 2008-10-01 | 2014-11-18 | Exxonmobil Upstream Research Company | Robust well trajectory planning |
WO2018106748A1 (en) * | 2016-12-09 | 2018-06-14 | Schlumberger Technology Corporation | Field operations neural network heuristics |
-
2019
- 2019-07-30 WO PCT/US2019/044036 patent/WO2020028298A1/en active Application Filing
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8892407B2 (en) | 2008-10-01 | 2014-11-18 | Exxonmobil Upstream Research Company | Robust well trajectory planning |
WO2018106748A1 (en) * | 2016-12-09 | 2018-06-14 | Schlumberger Technology Corporation | Field operations neural network heuristics |
Non-Patent Citations (1)
Title |
---|
JACOB POLLOCK ET AL: "Machine Learning for Improved Directional Drilling (OTC-28633-MS)", OFFSHORE TECHNOLOGY CONFERENCE 2018, 30 April 2018 (2018-04-30), Houston, Texas, USA, pages 1 - 9, XP055564922, ISBN: 978-1-61399-571-6, DOI: 10.4043/28633-MS * |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11828155B2 (en) | 2019-05-21 | 2023-11-28 | Schlumberger Technology Corporation | Drilling control |
US12252975B2 (en) | 2019-05-21 | 2025-03-18 | Schlumberger Technology Corporation | Drilling control |
WO2022104324A1 (en) * | 2020-11-12 | 2022-05-19 | Schlumberger Technology Corporation | Multi-agent drilling decision system and method |
US12071844B2 (en) | 2020-11-12 | 2024-08-27 | Schlumberger Technology Corporation | Multi-agent drilling decision system and method |
CN113464120A (en) * | 2021-09-06 | 2021-10-01 | 中国石油集团川庆钻探工程有限公司 | Tool face state prediction method and system, and sliding directional drilling method and system |
CN113464120B (en) * | 2021-09-06 | 2021-12-03 | 中国石油集团川庆钻探工程有限公司 | Tool face state prediction method and system, and sliding directional drilling method and system |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20210310307A1 (en) | Process for real time geological localization with reinforcement learning | |
US11401798B2 (en) | Process for real time geological localization with stochastic clustering and pattern matching | |
US10947784B2 (en) | Sliding mode control techniques for steerable systems | |
US10801314B2 (en) | Real-time trajectory control during drilling operations | |
CA2842598C (en) | Apparatus and method of landing a well in a target zone | |
US20220349297A1 (en) | Generating drilling paths using a drill model | |
EP2895950B1 (en) | System, method and computer program product for a rug plot for geosteering applications | |
RU2660827C1 (en) | Continuous determination of location during drilling | |
NO322922B1 (en) | Method for determining position uncertainty when drilling wells | |
US11675938B2 (en) | Optimal path planning for directional drilling | |
US9182510B2 (en) | Methods and systems of incorporating pseudo-surface pick locations in seismic velocity models | |
WO2020028298A1 (en) | Method for geological steering control through reinforcement learning | |
CN110073246B (en) | Improved method relating to quality control | |
US20210164293A1 (en) | Curvature-based feedback control techniques for directional drilling | |
NO347038B1 (en) | Automatic geosteering and evolutionary algorithm for use with same | |
US20210293129A1 (en) | Process for real time geological localization with kalman filtering | |
US20210310347A1 (en) | Method for geological steering control through reinforcement learning | |
US20210312332A1 (en) | Method for simulating a coupled geological and drilling environment for training a function approximating agent | |
US20210293132A1 (en) | Process for real time geological localization with greedy monte carlo | |
CN112578445B (en) | Method and device for seismic point-by-point guided drilling | |
CN110088647A (en) | Improved structural modeling | |
NO346996B1 (en) | Integrated well survey management and planning tool | |
CN116025281A (en) | Landing track control method for thin oil layer horizontal well | |
CN116413827A (en) | Method and device for predicting target depth | |
Ajetunmobi | Depth Issues in the Oil patch |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 19753228 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 19753228 Country of ref document: EP Kind code of ref document: A1 |