WO2020110149A2 - System and method for operation optimization of an equipment - Google Patents

System and method for operation optimization of an equipment Download PDF

Info

Publication number
WO2020110149A2
WO2020110149A2 PCT/IN2019/050871 IN2019050871W WO2020110149A2 WO 2020110149 A2 WO2020110149 A2 WO 2020110149A2 IN 2019050871 W IN2019050871 W IN 2019050871W WO 2020110149 A2 WO2020110149 A2 WO 2020110149A2
Authority
WO
WIPO (PCT)
Prior art keywords
function
input parameters
equipment
reward
neural network
Prior art date
Application number
PCT/IN2019/050871
Other languages
French (fr)
Other versions
WO2020110149A3 (en
Inventor
Vishnu TANKASALA VEPARALA
Solomon Pushparaj MANUELRAJ
Ankit Bansal
Pankaj Malhotra
Lovekesh Vig
Gautam Shroff
Venkataramana Runkana
Sivakumar Subramanian
Aditya Pareek
Vishnu Swaroopji Masampally
Nishit RAJ
Original Assignee
Tata Consultancy Services Limited
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tata Consultancy Services Limited filed Critical Tata Consultancy Services Limited
Priority to JP2021530219A priority Critical patent/JP7179181B2/en
Priority to US17/309,432 priority patent/US20220036166A1/en
Priority to EP19888392.8A priority patent/EP3887913A4/en
Publication of WO2020110149A2 publication Critical patent/WO2020110149A2/en
Publication of WO2020110149A3 publication Critical patent/WO2020110149A3/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/004Artificial life, i.e. computing arrangements simulating life
    • G06N3/006Artificial life, i.e. computing arrangements simulating life based on simulated virtual individual or collective life forms, e.g. social simulations or particle swarm optimisation [PSO]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/06Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
    • G06N3/063Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons using electronic means
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/042Knowledge-based neural networks; Logical representations of neural networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/044Recurrent networks, e.g. Hopfield networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/10Interfaces, programming languages or software development kits, e.g. for simulating neural networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/04Inference or reasoning models
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N7/00Computing arrangements based on specific mathematical models
    • G06N7/01Probabilistic graphical models, e.g. probabilistic networks

Definitions

  • FIG. 5A and 5B illustrates an exemplary graphical representation of result of optimization by a RNN-opt versus conventional optimization algorithms for benchmark functions, according to embodiments of the present disclosure.
  • the multi-head input handler 206A includes multiple processing heads, one per equipment instance because number of external input parameters can vary across different instances of equipment.
  • the external input parameters (e t ) is reduced to a fixed dimensional vector before inputting to the Neural Network 208A.
  • the processing head for the j* equipment instance is used.
  • the fixed dimensional output of the Multi- Head Input Handler 212A, i t-i and r t-i are input to the Neural Network 208A to obtain the control parameter i t .
  • the control parameter i t is passed through the domain constraints handler 210A. In an embodiment, if certain values in the obtained control parameter i t do not fall in prescribed ranges according to the domain, then the control parameter i t is properly modified before further processing to obtain modified i t .
  • the neural network based optimizer is also simultaneously trained to minimize f p .
  • values in the control parameter i t are clipped to satisfy domain constraints. In another embodiment, only those control parameter i t which satisfy domain constraints are passed to the Multi-Head Output Handler 212A.
  • i t and r t are then fed back into the Neural Network based optimizer along with the vector of external input parameters at time‘t+1’, e t+i to obtain the set of control input parameters at time‘t+1’, i t+i .
  • the equipment instance chosen for fine-tuning the pre-trained optimizer for a domain is changed periodically (after some constant iterations of training) in a round robin fashion so that the optimizer is trained on all the instances.
  • the loss function used for training the Neural Network based optimizer can be a regret-based loss function (as given by equation 1).
  • the multi-head input handler include a processing head for each equipment instance.
  • number of external input parameters differs across different instances of equipments.
  • the function f j are generated by at least one of (i) a Gaussian processes, or (ii) Gaussian mixture models constrained on dimensionality of the vector i t and the vector of external input parameters e t , and combination thereof.

Abstract

This disclosure relates to optimizing an operation of an equipment by a neural network based optimizer is provided. The method include receiving, information associated with at least one equipment instance (j) as an input at a predefined sequence of timestamps; training, a plurality of simulation models for each equipment instance to obtain a function (fj); processing, the external input parameters (et) to obtain a fixed-dimensional vector and passed as an input to obtain an vector (it); generating, a modified (it) from the output vector (it) based on a domain constraint value; computing, a reward (rt) based on (i) the function (fj), (ii) the modified (it), (iii) the external input parameters (et), and (iv) a reward function (Rj); and iteratively performing the steps of processing, generating, and computing reward (rt) for a series of subsequent equipment instances after expiry of the predefined sequence of timestamps associated with a first equipment instance.

Description

SYSTEM AND METHOD FOR OPERATION OPTIMIZATION OF AN EQUIPMENT
CROSS-REFERENCE TO RELATED APPLICATIONS AND PRIORITY
[001] The present application claims priority from Indian provisional patent application no. 201821044941, filed on November 28, 2018. The entire contents of the aforementioned application are incorporated herein by reference.
TECHNICAL FIELD
[002] The disclosure herein generally relates to operation optimization, and, more particularly, to system and method for operation optimization of an equipment by a neural network based optimizer.
BACKGROUND
[003] Several practical optimization problems for complex dynamical systems pose a unique challenge owing to the restriction on the number of possible function evaluations. The optimization functions do not have a simple closed form but can be evaluated (queried) at any arbitrary query point on the domain. However, evaluation of real world complex processes is expensive and time consuming and has to be optimized while employing as real world function evaluations. Most practical optimization problems are constrained in nature, i.e. have one or more constraints on values of input parameters. At present scenario, finding one or more control input parameters for an equipment’s (e.g., boiler operations in a power plant) optimal operational efficiency is difficult.
[004] Human trials are conducted to obtain the optimal input parameters. These experimental trials are costly to conduct, time consuming, and require costly domain experts and manual resources. Highly non-linear and dynamic nature of complex equipment (such as in the power plant) makes it difficult to find optimal control input parameters especially given the lack of good simulation models for the complex operations. This makes current machine learning (ML) and optimization techniques ineffective. Also, in case of a completely new equipment type, lack of any historical data of trials makes it hard to build simulation models which are required to obtain optimal input parameters. In another scenario, the equipment at hand might be similar to other equipments whose historical data is available. In such cases, an optimizer trained on the similar set of equipment to the new equipment needs to be directly used for optimizing for the new equipment. SUMMARY
[005] Embodiments of the present disclosure present technological improvements as solutions to one or more of the above-mentioned technical problems recognized by the inventors in conventional systems. For example, in one aspect, a processor implemented method of optimizing an operation of an equipment by a neural network based optimizer is provided. The processor implemented method includes at least one of: receiving, information associated with at least one equipment instance (j) as an input at a predefined sequence of timestamps; training, by a simulation model trainer, a plurality of simulation models for each equipment instance using the information associated with the at least one equipment instance (j) to obtain a function (fj); generating, by a sequence generator, at least one sequence of the external input parameters (et); processing, by a multi-head input handler, the external input parameters (et) to obtain a fixed-dimensional vector that is passed as an input to a neural network to obtain an output vector (it); generating, by a domain constraints handler, a modified (it) from the vector (it) based on a domain constraint value; computing, by a multi-head output handler, a reward (rt) based on (i) the function (fj), (ii) the modified (it), (iii) the external input parameters (et), and (iv) a reward function (Rj); and iteratively performing the steps of processing, generating, and computing reward (rt) for a series of subsequent equipment instances after expiry of the predefined sequence of timestamps associated with a first equipment instance. In an embodiment, the information associated with the at least one equipment instance (j) corresponds to at least one of (i) a plurality of control parameters (it), (ii) external input parameters (et), (iii) output parameters (ot), and (iv) historical operational data. In an embodiment, the function (fj) corresponds to a set of simulation models of the equipment instance (j).
[006] In an embodiment, the multi-head input handler may include a processing head for each equipment instance. In an embodiment, number of external input parameters may differ across different instances of equipments. In an embodiment, the function fj may be generated by at least one of (i) a Gaussian processes, or (ii) Gaussian mixture models constrained on dimensionality of the output vector it and the vector of external input parameters et, and combination thereof. In an embodiment, one or more hardware processors may further configured to input, the vector (it) and reward (rt) into the Neural Network with the vector of external input parameters at time (‘t+1’), et+i for iteratively perform the steps of process, generate, and compute reward to obtain the set of control input parameters at time (t+1), it+i. In an embodiment, dimensionality of it and et may be determined by a target domain. In an embodiment, the step of training the neural network with a loss function may be performed based on a maximum attainable value of the reward function (Rj). In an embodiment, the neural network may be trained by including an additional loss function using a plurality of penalty values for violating domain constraints.
[007] In another aspect, there is provided a system for operation optimization of an equipment by a neural network based optimizer. The system comprises a memory storing instructions; one or more communication interfaces; and one or more hardware processors coupled to the memory via the one or more communication interfaces, wherein the one or more hardware processors are configured by the instructions to: receive, information associated with at least one equipment instance (j) as an input at a predefined sequence of timestamps; train, a plurality of simulation models for each equipment instance using the information associated with the at least one equipment instance (j) to obtain a function (fj); generate, at least one sequence of the external input parameters (et); process, the external input parameters (et) to obtain a fixed-dimensional vector that is passed as an input to a neural network to obtain an vector (it); generate, a modified (it) from the vector (it) based on a domain constraint value; compute, a reward (rt) based on (i) the function (fj), (ii) the modified (it), (iii) the external input parameters (et), and (iv) a reward function (Rj); and iteratively perform the steps of processing, generating, and computing reward (rt) for a series of subsequent equipment instances after expiry of the predefined sequence of timestamps associated with a first equipment instance. In an embodiment, the information associated with the at least one equipment instance (j) corresponds to at least one of (i) a plurality of control parameters (it), (ii) external input parameters (et), (iii) output parameters (ot), and (iv) historical operational data. In an embodiment, the function (fj) corresponds to a set of simulation models of the equipment instance (j).
[008] In an embodiment, the multi-head input handler may include a processing head for each equipment instance. In an embodiment, number of external input parameters may differ across different instances of equipments. In an embodiment, the function fj may be generated by at least one of (i) a Gaussian processes, or (ii) Gaussian mixture models constrained on dimensionality of the output vector it and the vector of external input parameters et, and combination thereof. In an embodiment, one or more hardware processors may further configured to input, the output vector (it) and reward (rt) into the Neural Network with the vector of external input parameters at time (‘t+G), et+i for iteratively perform the steps of process, generate, and compute reward to obtain the set of control input parameters at time (t+1), it+i. In an embodiment, dimensionality of it and et may be determined by a target domain. In an embodiment, the step of training the neural network with a loss function may be performed based on a maximum attainable value of the reward function (Rj). In an embodiment, the neural network may be trained by including an additional loss function using a plurality of penalty values for violating domain constraints.
[009] In yet another aspect, there are provided one or more non-transitory machine readable information storage mediums comprising one or more instructions which when executed by one or more hardware processors causes at least one of: receiving, information associated with at least one equipment instance (j) as an input at a predefined sequence of timestamps; training, a plurality of simulation models for each equipment instance using the information associated with the at least one equipment instance (j) to obtain a function (fj); generating, at least one sequence of the external input parameters (et); processing, the external input parameters (et) to obtain a fixed-dimensional vector that is passed as an input to a neural network to obtain an output vector (it); generating, a modified (it) from the vector (it) based on a domain constraint value; computing, a reward (rt) based on (i) the function (fj), (ii) the modified (it), (iii) the external input parameters (et), and (iv) a reward function (Rj); and iteratively performing the steps of processing, generating, and computing reward (rt) for a series of subsequent equipment instances after expiry of the predefined sequence of timestamps associated with a first equipment instance. In an embodiment, the information associated with the at least one equipment instance (j) corresponds to at least one of (i) a plurality of control parameters (it), (ii) external input parameters (et), (iii) output parameters (ot), and (iv) historical operational data. In an embodiment, the function (fj) corresponds to a set of simulation models of the equipment instance (j).
[010] In an embodiment, the multi-head input handler may include a processing head for each equipment instance. In an embodiment, number of external input parameters may differ across different instances of equipments. In an embodiment, the function fj may be generated by at least one of (i) a Gaussian processes, or (ii) Gaussian mixture models constrained on dimensionality of the output vector it and the vector of external input parameters et, and combination thereof. In an embodiment, one or more hardware processors may further configured to input, the output vector (it) and reward (rt) into the Neural Network with the vector of external input parameters at time (‘t+G), et+i for iteratively perform the steps of process, generate, and compute reward to obtain the set of control input parameters at time (t+1), it+i. In an embodiment, dimensionality of it and et may be determined by a target domain. In an embodiment, the step of training the neural network with a loss function may be performed based on a maximum attainable value of the reward function (Rj). In an embodiment, the neural network may be trained by including an additional loss function using a plurality of penalty values for violating domain constraints.
[Oi l] It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the invention, as claimed.
BRIEF DESCRIPTION OF THE DRAWINGS
[012] The accompanying drawings, which are incorporated in and constitute a part of this disclosure, illustrate exemplary embodiments and, together with the description, serve to explain the disclosed principles:
[013] FIG. 1 illustrates a block diagram of an operation optimization system for an equipment by using a neural network based optimizer according to embodiments of the present disclosure.
[014] FIG. 2A and FIG. 2B is an exemplary block diagram of an operation optimization system for the equipment by using the neural network based optimizer according to embodiments of the present disclosure.
[015] FIG. 3 is an exemplary block diagram illustrating a computation flow in RNN- Opt domain constraint according to embodiments of the present disclosure.
[016] FIG. 4A and 4B is an exemplary flow diagram illustrating a method of optimizing operation of an equipment by using the neural network based optimizer using the system of FIGS. 1, 2A and 2B, according to embodiments of the present disclosure.
[017] FIG. 5A and 5B illustrates an exemplary graphical representation of result of optimization by a RNN-opt versus conventional optimization algorithms for benchmark functions, according to embodiments of the present disclosure.
[018] FIG. 6 illustrates an exemplary graphical representation of result as a regret plot showing effect of incremental normalization in RNN-Opt, according to embodiments of the present disclosure.
[019] FIG. 7 illustrates an exemplary graphical representation of result as a regret plot comparing RNN-Opt-domain constraint and RNN-Opt-P, according to embodiments of the present disclosure.
[020] FIG. 8 is an exemplary table view with summary results of an operation optimization system for the equipment by using the neural network based optimizer according to embodiments of the present disclosure. DETAILED DESCRIPTION OF EMBODIMENTS
[021] Exemplary embodiments are described with reference to the accompanying drawings. In the figures, the left-most digit(s) of a reference number identifies the figure in which the reference number first appears. Wherever convenient, the same reference numbers are used throughout the drawings to refer to the same or like parts. While examples and features of disclosed principles are described herein, modifications, adaptations, and other implementations are possible without departing from the scope of the disclosed embodiments. It is intended that the following detailed description be considered as exemplary only, with the true scope being indicated by the following claims.
[022] Referring now to the drawings, and more particularly to FIGS. 1 through 8, where similar reference characters denote corresponding features consistently throughout the figures, there are shown preferred embodiments and these embodiments are described in the context of the following exemplary system and/or method.
[023] FIG. 1 illustrates a block diagram of the operation optimization system 100 for a process and an equipment by using the neural network based optimizer according to embodiments of the present disclosure. In an embodiment, the system 100 includes one or more processors 104, communication interface device(s) or input/output (I/O) interface(s) 106, and one or more data storage devices or memory 102 operatively coupled to the one or more processors 104. The memory 102 comprises a database 108. The one or more processors 104 that are hardware processors can be implemented as one or more microprocessors, microcomputers, microcontrollers, digital signal processors, central processing units, state machines, logic circuitries, and/or any devices that manipulate signals based on operational instructions. Among other capabilities, the processor(s) is configured to fetch and execute computer-readable instructions stored in the memory. In an embodiment, the system 100 can be implemented in a variety of computing systems, such as laptop computers, notebooks, hand held devices, workstations, mainframe computers, servers, a network cloud and the like.
[024] The I O interface device(s) 106 can include a variety of software and hardware interfaces, for example, a web interface, a graphical user interface, and the like and can facilitate multiple communications within a wide variety of networks N/W and protocol types, including wired networks, for example, LAN, cable, etc., and wireless networks, such as WLAN, cellular, or satellite. In an embodiment, the I/O interface device(s) can include one or more ports for connecting a number of devices to one another or to another server.
[025] The memory 102 may include any computer-readable medium known in the art including, for example, volatile memory, such as static random access memory (SRAM) and dynamic random access memory (DRAM), and/or non-volatile memory, such as read only memory (ROM), erasable programmable ROM, flash memories, hard disks, optical disks, and magnetic tapes.
[026] The database 108 may store information but are not limited to, a plurality of parameters obtained from one or more sensors, wherein the parameters are specific to an entity (e.g., user, machine, equipment, and the like). In an embodiment, the equipment can be industrial boilers, engines, health care devices, chemical reactions/processes etc. Parameters may comprise sensor data captured through the sensors either connected to the user and/or machine. Further, the database 108 stores information pertaining to inputs fed to the system 100 and/or outputs generated by the system (e.g., at each stage), specific to the methodology described herein. More specifically, the database 108 stores information being processed at each step of the proposed methodology.
[027] FIG. 2A and 2B is an exemplary block diagram of the operation optimization system 200 for the equipment by using the neural network based optimizer according to embodiments of the present disclosure. With reference to FIG. 2A, the operation optimization system 200 comprises a simulation model trainer 202A, a sequence generator 204A, a multi head input handler 206A, a neural network 208A, a domain constraints handler 210A, and a multi-head output handler 212A. In an embodiment, at any particular time during a training stage, information associated with one or more equipment instances (e.g., j*) is received as an input. In an embodiment, the information associated with the at least one equipment instance (j) corresponds to at least one of (i) a plurality of control parameters (it), (ii) external input parameters (et), (iii) output parameters (ot), and (iv) historical operational data. In an embodiment, for training the Neural Network based optimizer, a historical data from multiple instances of an equipment is used. For example, data of the j* equipment instance is chosen.
[028] The information associated with the j* equipment instance is used to train a plurality of simulation models by the simulation model trainer 202A for each equipment instance to obtain a function (fj). In an embodiment, the function (fj) corresponds to a set of simulation models of the equipment instance (j). In an embodiment, for pre-training of the Neural Network based optimizer, a large number of functions (fj) can be generated via Gaussian Processes or Gaussian Mixture models constrained on the dimensionality of it and et, and combination thereof. For adapting the Neural Network based optimizer to domain at hand and fine-tuning of the Neural Network based optimizer to the domain, historical data from multiple instances of an equipment or a process can be used. For example, the data of the jth equipment instance is chosen. [029] For example, the neural network can be a recurrent neural network (RNN) such as Long Short Term Memory (LSTM) based neural network or a temporal convolutional neural network with causal connections. In an embodiment, during a training phase, a diverse set of differentiable non convex functions with known global optima are synthetically generated for training a RNN optimizer. The RNN optimizer predicts next query in order to explore search space provided sequences of previous queries and functionality responses.
[030] Consider, ropt of fj given by ropt = fj(iopt ) is known for determining a regret roptmaxi<t ri after‘t’ iterations (queries) to the functions fj. For example. A regret based loss function are:
Figure imgf000010_0001
where ReLU (x) = max(x,0). Since the regret is expected to be high during initial iterations because of random initialization of x but desired to be low close to T, by providing exponentially increasing importance to regret terms via discount factor 0 < g < 1.
[031] In an embodiment, dealing any range of values that fj can take during training or that fb can take during interference, incremental normalization is considered while training such that rt is replaced by rt = -j== such that ht+1 = fo(ht it rt; 0), where mΐ =
Figure imgf000010_0002
[032] In an embodiment, the simulation models can be conditioned Gaussian Processes, regression models (e.g., as linear regression models), or deep regression models based on neural networks such as self-normalizing neural networks, long short term memory networks (LSTM), feed forward neural networks.
[033] The sequence generator 204 A is configured to generate one or more sequences of the external input parameters (et). In an embodiment, the external input parameters (et) can be sampled from the historical data of the equipment instance (j) at a predefined time instance. The multi-head input handler 206A is configured to process an external input parameter vector (et) to obtain a fixed dimensional vector which is inputted to the Neural Network 208A to obtain the control parameter it.
[034] In an embodiment, the multi-head input handler 206A includes multiple processing heads, one per equipment instance because number of external input parameters can vary across different instances of equipment. In an embodiment, since the Neural Network’s input dimension have to be fixed, the external input parameters (et) is reduced to a fixed dimensional vector before inputting to the Neural Network 208A. The processing head for the j* equipment instance is used. In an embodiment, the fixed dimensional output of the Multi- Head Input Handler 212A, it-i and rt-i are input to the Neural Network 208A to obtain the control parameter it. The control parameter it is passed through the domain constraints handler 210A. In an embodiment, if certain values in the obtained control parameter it do not fall in prescribed ranges according to the domain, then the control parameter it is properly modified before further processing to obtain modified it.
[035] In an embodiment, after obtaining the control parameter it a check according to pre-determined domain constraints is performed on the control parameter it. For example, if the control parameter it violates domain constraints, then the control parameter it is not passed through the simulation models (during training/fine-tuning) or a physical equipment (during testing). An additional input to the neural network based optimizer is included which denotes a penalty for out of domain values for the control parameter it as mentioned subsequently in equation 2. Also, a loss function using the penalty values over time is additionally included in the training/fine-tuning procedure of the neural network based optimizer. Accordingly, the neural network based optimizer additionally tries to minimize the number of out of domain suggestions for the it during testing as mentioned subsequently in equation 3.
[036] With reference to FIG. 3 is an exemplary block diagram illustrating a computation flow in RNN-Opt domain constraint according to embodiments of the present disclosure. Here fj is the function to be optimized, and fp is compute penalty pt. Further, if pt = 0, actual value of fj, i.e. rt is passed to the RNN, else rt is set to rt-·^
[037] For example, considering enhancements in RNN-opt in which input an explicit feedback pt via a penalty function, pt = fp(it ) to the RNN that captures the extent to which a proposed query it violates any of the C domain constraints.
Figure imgf000011_0001
[038] Further, whenever pt > 0, i.e. when one or more of the domain constraints are violated for the proposed query, by set rt is set to
Figure imgf000011_0002
rather than actually getting rt from fj.
[039] During training, an additional domain constraint loss LD is considered that penalizes the neural network based optimizer if it proposes a query that does not satisfy one or more of the domain constraints as in below equation (3)
Figure imgf000011_0003
The overall loss is then provided by as in below equation (4):
Figure imgf000011_0004
[040] In an embodiment, apart from optimizing fj , the neural network based optimizer is also simultaneously trained to minimize fp.
[041] The modified it, along with et are passed to the multi-head output handler 212A to obtain rt. In an embodiment, the multi -head output handler 212A comprises multiple processing heads, one per equipment instance. The processing head for the j* equipment instance is considered. In an embodiment, each processing head includes two components i.e., (i) a set of simulation models (fj), and (ii) a reward function (Rj), both specific to the equipment instance. In an embodiment, the reward function (Rj) is used to compute a reward rt using ot.
[042] In an embodiment, values in the control parameter it are clipped to satisfy domain constraints. In another embodiment, only those control parameter it which satisfy domain constraints are passed to the Multi-Head Output Handler 212A.
[043] In an embodiment, it and rt are then fed back into the Neural Network based optimizer along with the vector of external input parameters at time‘t+1’, et+i to obtain the set of control input parameters at time‘t+1’, it+i. In an embodiment, the equipment instance chosen for fine-tuning the pre-trained optimizer for a domain is changed periodically (after some constant iterations of training) in a round robin fashion so that the optimizer is trained on all the instances. In one embodiment, the loss function used for training the Neural Network based optimizer can be a regret-based loss function (as given by equation 1). Further, for domain constraint handling an additional loss function using the penalty values for violating domain constraints can be included to train the Neural Network based optimizer (as given by equation 3). In an embodiment, for an (N+l)th equipment instance, the Neural Network based optimizer can be fine-tuned using corresponding historical data, conditioned simulation model fN+i and corresponding reward function RN+I .
[044] With reference to FIG. 2B, in an embodiment, for testing, a trained Neural Network based optimizer is considered. The function fj is replaced by an actual equipment instance. For example, at any time‘f, it-i, rt-i and et are input to the Neural Network 204B to obtain the control parameter it. In an embodiment, control parameter it is then modified by the domain constraints handler 206B. In one embodiment, a penalty term can be input to the optimizer and also values of the control parameter it that violate domain constraints may not be passed through the equipment instance and be handled differently. The modified it and et are together input to an actual equipment 208B and output parameters, ot, are collected and the reward rt is computed. In an embodiment, it, rt along with et+i are then input to the Neural Network 204B to obtain it+i. This process is continued until a satisfactory result is obtained in terms of reward, output parameters or combination thereof.
[045] In an exemplary embodiment, illustrating an application of the neural network based optimizer by considering a scenario of optimizing the operation of an aircraft engine during a single flight.
Considering, control input parameters (it: 3 dimensional):
a) Engine speed: Rotational speed of the engine - Allowed range: 5000 to 10000 rpm;
b) Fuel air ratio: Ratio of air mixed with fuel before burning - Allowed range: 0.05 to 0.20;
c) Bypass ratio: Ratio of air entering fan to air entering compressor - Allowed range: 0.25 to 0.45.
Considering, corresponding output variables (ot: 2 dimensional):
a) Average aircraft speed: 900 to 1000 kmph (950 Ideal); b) Fuel consumed: As low as possible.
Considering corresponding external input parameters variables (et: 2 dimensional): a) Ambient air temperature at inlet of engine;
b) Ambient pressure at inlet of engine.
[046] The neural network based optimizer is configured to learn to optimize for it under the external conditions given by et to meet a specific criterion defined by ot. The allowed ranges for the control variables are used to define the domain constraints and a penalty function is used to obtain a penalty value in case of violations of the constraints when the neural network based optimizer suggests values for it.
[047] Fet us consider historical data for two aircraft engines El and E2 which have been operated for 50 flights each so that the data consists of 50 recorded values for the variables in it, et and ot. Using this data simulation models are learned for El and E2 that predict the values of ot using it and et.
[048] Also the reward functions R1 and R2 can be defined using the output variables for El and E2 respectively. The simulation models together with the reward functions form the Multi-Head Output Handler. The sequence generator can be used to generate different sequences of et. The multi-head input handler is used to map the external variables to the input of the neural network.
[049] FIG. 4 is an exemplary flow diagram illustrating a method of optimizing operation of an equipment by using the neural network based optimizer using the system of FIGS. 1, 2A and 2B, according to embodiments of the present disclosure. In an embodiment, the system 100 comprises one or more data storage devices or the memory 102 operatively coupled to the one or more hardware processors 104 and is configured to store instructions for execution of steps of the method by the one or more processors 104. The flow diagram depicted is better understood by way of following explanation/description. The steps of the method of the present disclosure will now be explained with reference to the components of the system as depicted in FIGS. 1, 2 A and 2B.
[050] At step 402, information associated with at least one equipment instance (j) as an input at a predefined sequence of timestamps is received. In an embodiment, the information associated with the at least one equipment instance (j) corresponds to at least one of (i) a plurality of control parameters (it), (ii) external input parameters (et), (iii) output parameters (ot), and (iv) historical operational data. At step 404, a plurality of simulation models for each equipment instance is trained by the simulation model trainer 202A using the information associated with the at least one equipment instance (j) to obtain a function (fj). In an embodiment, the function (fj) corresponds to a set of simulation models of the equipment instance (j). At step 406, at least one sequence of the external input parameters (et) is generated by the sequence generator 204A. At step 408, the external input parameters (et) is processed by, the multi-head input handler 206A to obtain a fixed-dimensional vector that is passed as an input to the neural network 208 A to obtain an output vector (it). At step 410, a modified (it) from the output vector (it) based on a domain constraint value is generated by the domain constraints handler 210A. At step 412, a reward (rt) is computed by the multi-head output handler 212A, based on (i) the function (fj), (ii) the modified (it), (iii) the external input parameters (et), and (iv) a reward function (Rj). At step 414, the steps of processing, generating, and computing reward (rt) is iteratively performed for a series of subsequent equipment instances after expiry of the predefined sequence of timestamps associated with a first equipment instance.
[051] In an embodiment, the multi-head input handler include a processing head for each equipment instance. In an embodiment, number of external input parameters differs across different instances of equipments. In an embodiment, the function fj are generated by at least one of (i) a Gaussian processes, or (ii) Gaussian mixture models constrained on dimensionality of the vector it and the vector of external input parameters et, and combination thereof. In an embodiment, one or more hardware processors further configured to input, the vector (it) and reward (rt) into the Neural Network with the vector of external input parameters at time (T+G), et+i for iteratively performing the steps of process, generate, and compute reward to obtain the set of control input parameters at time (t+1), it+i. In an embodiment, dimensionality of it and et are determined by a target domain. In an embodiment, the step of training the neural network with a loss function is performed based on a maximum attainable value of the reward function (Rj). In an embodiment, the neural network is trained by including an additional loss function using a plurality of penalty values for violating domain constraints.
[052] Experimental results:
[053] For example, for measuring performance of any optimizer after‘t’ iterations i.e., after proposing‘t’ queries, a regret given by ropt— maxi£tri is used. Lower values of regret indicate superior optimizer performance.
[054] FIG. 5A and 5B illustrates an exemplary graphical representation of result of optimization by RNN-opt versus conventional optimization algorithms for benchmark functions for d = 2 and d = 6, according to embodiments of the present disclosure. The RNN- Opt outperforms all the baselines considered on most functions considered while being at least good as the baselines when compared to conventional optimization algorithms such as CMA- ES and Nelder-Mead.
[055] FIG. 6 illustrates an exemplary graphical representation of result as a regret plot showing effect of incremental normalization in RNN-Opt, according to embodiments of the present disclosure. Considering, incremental normalization during training and inference to optimize functions with diverse range of values. For example, RNN-Opt-Basic uses incremental normalization only during testing phase whereas RNN-Opt uses incremental normalization of inputs during training as well as testing phase.
[056] FIG. 7 illustrates an exemplary graphical representation of result as a regret plot comparing RNN-Opt-domain constraint and RNN-Opt-P, according to embodiments of the present disclosure. Considering the RNN-Opt with minor variation during inference time (with no change in training procedure) where, instead of passing rt as input to the RNN, passing rt— pt to capture penalty feedback is referred to as RNN-Opt-P. While RNN-Opt-Domain constraint is explicitly trained to minimize penalty pt explicitly, RNN-Opt-P captures requirement of trying to maximize rt under a soft-constraint of minimizing pt only during inference time. The RNN-Opt with domain constraints, namely, RNN-Opt-DC is able to effectively use explicit penalty feedback.
[057] FIG. 8 is an exemplary table view with summary results of an operation optimization system for the equipment by using the neural network based optimizer according to embodiments of the present disclosure. The table provides the summary of results obtained by using one or more modes i.e. machine learning & domain based approach and the present approach automated neural network based optimizer. The machine learning & domain based approach results with a reward as 8.4 and are obtained in number of trails taken is 36. In contrary, with the claimed approach i.e., with the Neural Network based optimizer results with reward as 16.4 and are obtained in number of trails is 33.
[058] In an embodiment, objective of the Neural Network optimizer is to output a vector of control parameters, it, at any trial ‘f’ such that a reward r, is maximized (or minimized), or overall the number of trials is reduced. A method for training a neural network as an optimizer and then adapting for a target domain is proposed. Further, adapting the optimizer for a particular instance of the target domain is proposed. In one case, the neural network based optimizer is trained via meta-learning to solve the optimization problem on a large number of non-convex functions.
[059] The Neural Network consider a control parameter vector it-i and reward rt-i from time t-V as inputs along with a vector of external input parameters denoted by et, where dimensionality of it and et is determined by a target domain. The reward is calculated as a function (denoted by R,) of the output parameter vector, ot of the equipment. The function f, with inputs h and et, computes ot.
it = NN ^, G^, b
ot = fj (it< et)
rt = Rj(ot)
Here j = 1, 2, ..., N, where N is the number of equipment instances available for training.
[060] The parameters given by h can be an input to the equipment which can be controlled. The values taken by the parameters can also be constrained based on domain knowledge. The parameters in et are external or non-controllable inputs, for example coal quality, ambient temperature, etc. The function ^ can be generated from Gaussian processes which can, in some cases, be conditioned on the observed historical data. In other cases, the function^ can be a regression model trained on historical data via deep neural networks, linear regression models, etc. The function f is approximated by simulation models of the jth equipment instance during training and during testing, is replaced by the actual physical equipment.
[061] The embodiments of present disclosure herein addresses unresolved problem of number of trails required to obtain input parameter values. The embodiment, thus provides an automated approach to obtain the optimal control input parameter values by using historical sensor data and previous optimization trials. Moreover, the embodiments herein further provides a solution can significantly reduce the number of trials required to obtain the optimal control input parameter values, and reduce time taken and costs involved for conducting such trials. Moreover, the embodiments herein further can obtain an optimizer that is general- purpose in the sense that once trained, it can be easily fine-tuned and used for obtaining optimal parameters of a new equipment. The embodiments of present disclosure can be extended for use to other domains as well. For example, where historical data can be used to build simulation models for processes/equipment, and this present method can then be used to optimize such processes.
[062] The optimizer is trained to obtain optimal control input parameters on a large number of equipment simultaneously. The embodiments of present disclosure is generic in the sense that the number of external parameters across equipment need not be same. Further, once the neural network based optimizer is obtained, this approach allows to easily fine tune the optimization algorithm to perform better on the new equipment of interest. The embodiments of present disclosure is a data-driven approach that leverages sensor data from equipment to learn simulation models that are used for training the optimizer. Further, historical trials are leveraged to obtain new sequence of trials via the sequence generation module.
[063] The embodiments of present disclosure leverages external input parameters that provide an estimate of the environmental operating conditions of an equipment, e.g. via various sensors installed on the equipment. Given the external input parameters which can vary over time, the optimal values for the control input parameters are estimated by the neural network based optimizer. These values are then tried by the technicians on the actual equipment and the reward in terms of output parameters is obtained. This reward and the most recent external input parameters are then used by the neural network based optimizer to recommend the next set of optimal values for the control input parameters.
[064] The embodiments of present disclosure leverages the equipment operational data captured through various sensors to obtain simulation models and then use these models to learn data-driven optimization algorithms that can work across various models of the same equipment (e.g. different boiler types) or even across equipment types (e.g. across boilers and engines).
[065] The neural network receives input via a multi-head input handler that allows to input varying number of external parameters depending on specific equipment instance useful to estimate operating conditions. Similarly, the neural network output is processed by a multi head output handler that allows to use specific equipment instance simulation models. The core neural network based optimizer remains the same while it is trained simultaneously on various instances of same equipment with varying number of external parameters by changing the input and output heads.
[066] The embodiments of present disclosure leverages various simulation models obtained from the historical sensor data to estimate the rewards during offline training of the neural network optimizer. These simulation models can be learned via machine learning and statistical models from the historical sensor data using neural networks, linear regression models, MARS, stepwise second order methods, etc.
[067] The embodiments of present disclosure further simulates the sequence of trials by generating sequence of external input parameters for each equipment instance used for training the neural network based optimizer. The loss function used for training the neural network based optimizer is based on cumulative reward over trials. Therefore, present approach can be considered as learning an optimal policy in a reinforcement learning setting. The embodiments of present disclosure hypothesize that training an RNN optimizer using the loss function that minimizes regret observed for a given number of queries more closely resembles performance measures of an optimizer. The embodiments of present disclosure propose an incremental normalization that dynamically normalizes output from the function using one or more response values observed before passing as an input to the RNN optimizer. The embodiments of present disclosure overcome a problem of set of constraints in the input parameters by providing an additional feedback to the RNN optimizer while proposing a query that violates domain constraints.
[068] At test time, the simulation models can be replaced by the actual equipment, and actual rewards can be computed by setting the equipment control input parameters to those given by the already trained neural network optimizer based on the current external input parameters. The reward can then be input back to the neural network to obtain the next control input parameter recommendation, and so on to eventually obtain an optimal value of control input parameter with desired behavior in terms of output parameters and reward.
[069] The written description describes the subject matter herein to enable any person skilled in the art to make and use the embodiments. The scope of the subject matter embodiments is defined by the claims and may include other modifications that occur to those skilled in the art. Such other modifications are intended to be within the scope of the claims if they have similar elements that do not differ from the literal language of the claims or if they include equivalent elements with insubstantial differences from the literal language of the claims. [070] It is to be understood that the scope of the protection is extended to such a program and in addition to a computer-readable means having a message therein; such computer-readable storage means contain program-code means for implementation of one or more steps of the method, when the program runs on a server or mobile device or any suitable programmable device. The hardware device can be any kind of device which can be programmed including e.g. any kind of computer like a server or a personal computer, or the like, or any combination thereof. The device may also include means which could be e.g. hardware means like e.g. an application- specific integrated circuit (ASIC), a field- programmable gate array (FPGA), or a combination of hardware and software means, e.g. an ASIC and an FPGA, or at least one microprocessor and at least one memory with software processing components located therein. Thus, the means can include both hardware means and software means. The method embodiments described herein could be implemented in hardware and software. The device may also include software means. Alternatively, the embodiments may be implemented on different hardware devices, e.g. using a plurality of CPUs.
[071] The embodiments herein can comprise hardware and software elements. The embodiments that are implemented in software include but are not limited to, firmware, resident software, microcode, etc. The functions performed by various components described herein may be implemented in other components or combinations of other components. For the purposes of this description, a computer-usable or computer readable medium can be any apparatus that can comprise, store, communicate, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device.
[072] The illustrated steps are set out to explain the exemplary embodiments shown, and it should be anticipated that ongoing technological development will change the manner in which particular functions are performed. These examples are presented herein for purposes of illustration, and not limitation. Further, the boundaries of the functional building blocks have been arbitrarily defined herein for the convenience of the description. Alternative boundaries can be defined so long as the specified functions and relationships thereof are appropriately performed. Alternatives (including equivalents, extensions, variations, deviations, etc., of those described herein) will be apparent to persons skilled in the relevant art(s) based on the teachings contained herein. Such alternatives fall within the scope of the disclosed embodiments. Also, the words “comprising,” “having,” “containing,” and “including,” and other similar forms are intended to be equivalent in meaning and be open ended in that an item or items following any one of these words is not meant to be an exhaustive listing of such item or items, or meant to be limited to only the listed item or items. It must also be noted that as used herein and in the appended claims, the singular forms“a,”“an,” and“the” include plural references unless the context clearly dictates otherwise.
[073] Furthermore, one or more computer-readable storage media may be utilized in implementing embodiments consistent with the present disclosure. A computer-readable storage medium refers to any type of physical memory on which information or data readable by a processor may be stored. Thus, a computer-readable storage medium may store instructions for execution by one or more processors, including instructions for causing the processor(s) to perform steps or stages consistent with the embodiments described herein. The term“computer-readable medium” should be understood to include tangible items and exclude carrier waves and transient signals, i.e., be non-transitory. Examples include random access memory (RAM), read-only memory (ROM), volatile memory, nonvolatile memory, hard drives, CD ROMs, DVDs, flash drives, disks, and any other known physical storage media.
[074] It is intended that the disclosure and examples be considered as exemplary only, with a true scope of disclosed embodiments being indicated by the following claims.

Claims

CLAIMS:
1. A processor implemented method of optimizing an operation of an equipment by a neural network based optimizer, comprising:
receiving, information associated with at least one equipment instance (j) as an input at a predefined sequence of timestamps, wherein the information associated with the at least one equipment instance (j) corresponds to at least one of (i) a plurality of control parameters (it), (ii) external input parameters (et), (iii) output parameters (ot), and (iv) historical operational data;
training, by a simulation model trainer (202A), a plurality of simulation models for each equipment instance using the information associated with the at least one equipment instance (j) to obtain a function (fj), wherein the function (fj) corresponds to a set of simulation models of the equipment instance (j);
generating, by a sequence generator (204 A), at least one sequence of the external input parameters (et);
processing, by a multi-head input handler (206A), the external input parameters (et) to obtain a fixed-dimensional vector that is passed as an input to a neural network (208A) to obtain an vector (it);
generating, by a domain constraints handler (210A), a modified (it) from the output vector (it) based on a domain constraint value;
computing, by a multi-head output handler (212A), a reward (rt) based on (i) the function (fj), (ii) the modified (it), (iii) the external input parameters (et), and (iv) a reward function (Rj); and
iteratively performing the steps of processing, generating, and computing reward (rt) for a series of subsequent equipment instances after expiry of the predefined sequence of timestamps associated with a first equipment instance.
2. The method as claimed in claim 1, wherein the multi-head input handler (206 A) comprises a processing head for each equipment instance, wherein number of external input parameters differs across different instances of equipments.
3. The method as claimed in claim 1, wherein the function fj are generated by at least one of (i) a Gaussian processes, or (ii) Gaussian mixture models constrained on dimensionality of the output vector it and the vector of external input parameters et, and combination thereof.
4. The method as claimed in claim 1, further comprising, inputting, the output vector (it) and reward (rt) into the Neural Network (208A) with the vector of external input parameters at time (T+G), et+i for iteratively performing the steps of processing, generating, and computing reward to obtain the set of control input parameters at time (t+1), it+i, wherein dimensionality of it and et is determined by a target domain.
5. The method as claimed in claim 1, wherein the step of training the neural network (208A) with a loss function is performed based on a maximum attainable value of the reward function (Rj).
6. The method as claimed in claim 1, wherein the neural network (208 A) is trained by including an additional loss function using a plurality of penalty values for violating domain constraints.
7. A system (100) for operation optimization of an equipment by a neural network based optimizer, wherein the system comprising:
a memory (102) storing instructions;
one or more communication interfaces (106); and
one or more hardware processors (104) coupled to the memory (102) via the one or more communication interfaces (106), wherein the one or more hardware processors (104) are configured by the instructions to:
receive, information associated with at least one equipment instance (j) as an input at a predefined sequence of timestamps, wherein the information associated with the at least one equipment instance (j) corresponds to at least one of (i) a plurality of control parameters (it), (ii) external input parameters (et), (iii) output parameters (ot), and (iv) historical operational data;
train, a plurality of simulation models for each equipment instance using the information associated with the at least one equipment instance (j) to obtain a function (fj), wherein the function (fj) corresponds to a set of simulation models of the equipment instance (j);
generate, at least one sequence of the external input parameters (et);
process, the external input parameters (et) to obtain a fixed-dimensional vector that is passed as an input to a neural network (208 A) to obtain an output vector (it); generate, a modified (it) from the output vector (it) based on a domain constraint value;
compute, a reward (rt) based on (i) the function (fj), (ii) the modified (it), (iii) the external input parameters (et), and (iv) a reward function (Rj); and
iteratively perform the steps of process, generate, and compute reward (rt) for a series of subsequent equipment instances after expiry of the predefined sequence of timestamps associated with a first equipment instance.
8. The system as claimed in claim 7, wherein a multi -head input handler (206 A) comprises a processing head for each equipment instance, wherein number of external input parameters differs across different instances of equipments.
9. The system as claimed in claim 7, wherein the function fj are generated by at least one of (i) a Gaussian processes, or (ii) Gaussian mixture models constrained on dimensionality of the output vector it and the vector of external input parameters et, and combination thereof.
10. The system as claimed in claim 7, wherein the one or more hardware processors (104) are further configured by the instructions to input, the output vector (it) and reward (rt) into the Neural Network (208 A) with the vector of external input parameters at time (T+G), et+i to iteratively perform the steps of process, generate, and compute reward (rt) to obtain the set of control input parameters at time (t+1), it+i, wherein dimensionality of it and etis determined by a target domain.
11. The system as claimed in claim 7, wherein the step of training the neural network (208A) with a loss function is performed based on a maximum attainable value of the reward function (Rj).
12. The system as claimed in claim 7, wherein the neural network (208 A) is trained by including an additional loss function using a plurality of penalty values for violating domain constraints.
13. One or more non-transitory machine readable information storage mediums comprising one or more instructions which when executed by one or more hardware processors causes: receiving, information associated with at least one equipment instance (j) as an input at a predefined sequence of timestamps, wherein the information associated with the at least one equipment instance (j) corresponds to at least one of (i) a plurality of control parameters (it), (ii) external input parameters (et), (iii) output parameters (ot), and (iv) historical operational data;
training, a plurality of simulation models for each equipment instance using the information associated with the at least one equipment instance (j) to obtain a function (fj), wherein the function (fj) corresponds to a set of simulation models of the equipment instance
(j);
generating, at least one sequence of the external input parameters (et);
processing, the external input parameters (et) to obtain a fixed-dimensional vector that is passed as an input to a neural network (208A) to obtain an vector (it);
generating, a modified (it) from the output vector (it) based on a domain constraint value;
computing, a reward (rt) based on (i) the function (fj), (ii) the modified (it), (iii) the external input parameters (et), and (iv) a reward function (Rj); and
iteratively performing the steps of processing, generating, and computing reward (rt) for a series of subsequent equipment instances after expiry of the predefined sequence of timestamps associated with a first equipment instance.
14. The one or more non-transitory machine readable information storage mediums as claimed in claim 13, wherein the multi -head input handler (206 A) comprises a processing head for each equipment instance, wherein number of external input parameters differs across different instances of equipments.
15. The one or more non-transitory machine readable information storage mediums as claimed in claim 13, wherein the function fj are generated by at least one of (i) a Gaussian processes, or (ii) Gaussian mixture models constrained on dimensionality of the output vector it and the vector of external input parameters et, and combination thereof.
16. The one or more non-transitory machine readable information storage mediums as claimed in claim 13, wherein the one or more hardware processors (104) are further configured by the instructions for inputting, the output vector (it) and reward (rt) into the Neural Network (208 A) with the vector of external input parameters at time (T+G), et+i for iteratively performing the steps of processing, generating, and computing reward to obtain the set of control input parameters at time (t+1), it+i, wherein dimensionality of it and etis determined by a target domain.
17. The one or more non-transitory machine readable information storage mediums as claimed in claim 13, wherein the step of training the neural network (208A) with a loss function is performed based on a maximum attainable value of the reward function (Rj).
18. The one or more non-transitory machine readable information storage mediums as claimed in claim 13, wherein the neural network (208 A) is trained by including an additional loss function using a plurality of penalty values for violating domain constraints.
PCT/IN2019/050871 2018-11-28 2019-11-28 System and method for operation optimization of an equipment WO2020110149A2 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
JP2021530219A JP7179181B2 (en) 2018-11-28 2019-11-28 Systems and methods for device operation optimization
US17/309,432 US20220036166A1 (en) 2018-11-28 2019-11-28 System and method for operation optimization of an equipment
EP19888392.8A EP3887913A4 (en) 2018-11-28 2019-11-28 System and method for operation optimization of an equipment

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
IN201821044941 2018-11-28
IN201821044941 2018-11-28

Publications (2)

Publication Number Publication Date
WO2020110149A2 true WO2020110149A2 (en) 2020-06-04
WO2020110149A3 WO2020110149A3 (en) 2020-07-09

Family

ID=70852274

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IN2019/050871 WO2020110149A2 (en) 2018-11-28 2019-11-28 System and method for operation optimization of an equipment

Country Status (4)

Country Link
US (1) US20220036166A1 (en)
EP (1) EP3887913A4 (en)
JP (1) JP7179181B2 (en)
WO (1) WO2020110149A2 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2021019551A1 (en) * 2019-07-26 2021-02-04 Tata Consultancy Services Limited System and method for real-time self-optimization of manufacturing operations

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7058617B1 (en) * 1996-05-06 2006-06-06 Pavilion Technologies, Inc. Method and apparatus for training a system model with gain constraints
EP2223260A2 (en) * 2007-11-07 2010-09-01 EDSA Micro Corporation Systems and methods for real-time forecasting and predicting of electrical peaks and managing the energy, health, reliability, and performance of electrical power systems based on an artificial adaptive neural network
EP2553535B1 (en) * 2010-03-29 2014-03-05 Siemens Aktiengesellschaft Engineering tool and method for parameterizing a model-based predictive controller
US10074054B2 (en) * 2013-05-30 2018-09-11 President And Fellows Of Harvard College Systems and methods for Bayesian optimization using non-linear mapping of input
EP3133268B1 (en) * 2015-08-21 2020-09-30 Ansaldo Energia IP UK Limited Method for operating a power plant
US20180024508A1 (en) * 2016-07-25 2018-01-25 General Electric Company System modeling, control and optimization

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
JOST TOBIAS SPRINGENBERG ET AL., BAYESIAN OPTIMIZATION WITH ROBUST BAYESIAN NEURAL NETWORKS
KONSTANTINOS CHATZILYGEROUDIS ET AL., A SURVEY ON POLICY SEARCH ALGORITHMS FOR LEARNING ROBOT CONTROLLERS IN A HANDFUL OF TRIALS
See also references of EP3887913A4
YUTIAN CHEN ET AL., LEARNING TO LEARN WITHOUT GRADIENT DESCENT BY GRADIENT DESCENT

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2021019551A1 (en) * 2019-07-26 2021-02-04 Tata Consultancy Services Limited System and method for real-time self-optimization of manufacturing operations

Also Published As

Publication number Publication date
JP2022510223A (en) 2022-01-26
WO2020110149A3 (en) 2020-07-09
US20220036166A1 (en) 2022-02-03
EP3887913A4 (en) 2022-11-09
JP7179181B2 (en) 2022-11-28
EP3887913A2 (en) 2021-10-06

Similar Documents

Publication Publication Date Title
US11550329B2 (en) Neural network based vehicle dynamics model
US10810491B1 (en) Real-time visualization of machine learning models
Papakonstantinou et al. POMDP and MOMDP solutions for structural life-cycle cost minimization under partial and mixed observability
US20220187819A1 (en) Method for event-based failure prediction and remaining useful life estimation
WO2018157752A1 (en) Approximate random number generator by empirical cumulative distribution function
US11836257B2 (en) Method and system for defending universal adversarial attacks on time-series data
Moniz et al. Smoteboost for regression: Improving the prediction of extreme values
Ong et al. Predictive maintenance for edge-based sensor networks: A deep reinforcement learning approach
Wang et al. Cooling strategies for the moment-generating function in Bayesian global optimization
CN110909758A (en) Computer-readable recording medium, learning method, and learning apparatus
US20200167660A1 (en) Automated heuristic deep learning-based modelling
Asiain et al. Controller exploitation-exploration reinforcement learning architecture for computing near-optimal policies
US11915112B2 (en) Method for classification based diagnosis with partial system model information
Yue et al. Federated Gaussian process: Convergence, automatic personalization and multi-fidelity modeling
Quin et al. Reducing large adaptation spaces in self-adaptive systems using classical machine learning
WO2020110149A2 (en) System and method for operation optimization of an equipment
Belzile et al. A modeler’s guide to extreme value software
Gera et al. Dynamic adaptive threshold based learning for noisy annotations robust facial expression recognition
US20230069913A1 (en) Multi-objective machine learning with model and hyperparameter optimization fusion
US20230195607A1 (en) System For Automatic Identification and Selection of Optimization Metrics and Accompanying Models in Experiimentation Platforms
Lakra et al. Development of Efficient and Optimal Models for Software Maintainability Prediction using Feature Selection Techniques
Benavides-Prado et al. Generalised controller design using continual learning
US20240126943A1 (en) System and method for stabilizing and accelerating iterative numerical simulation
US11087085B2 (en) Method and system for inferential data mining
Pantiskas et al. Channel-Adaptive Early Exiting using Reinforcement Learning for Multivariate Time Series Classification

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19888392

Country of ref document: EP

Kind code of ref document: A2

ENP Entry into the national phase

Ref document number: 2021530219

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19888392

Country of ref document: EP

Kind code of ref document: A2

ENP Entry into the national phase

Ref document number: 2019888392

Country of ref document: EP

Effective date: 20210628