US20230023896A1 - Method of Transfer Learning for a Specific Production Process of an Industrial Plant - Google Patents

Method of Transfer Learning for a Specific Production Process of an Industrial Plant Download PDF

Info

Publication number
US20230023896A1
US20230023896A1 US17/957,592 US202217957592A US2023023896A1 US 20230023896 A1 US20230023896 A1 US 20230023896A1 US 202217957592 A US202217957592 A US 202217957592A US 2023023896 A1 US2023023896 A1 US 2023023896A1
Authority
US
United States
Prior art keywords
data
machine learning
learning model
training
historic
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
US17/957,592
Inventor
Benedikt Schmidt
Ido Amihai
Arzam Muzaffar Kotriwala
Moncef CHIOUA
Dennis JANKA
Felix Lenders
Jan Christoph SCHLAKE
Martin Hollender
Hadil Abukwaik
Benjamin Kloepper
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
ABB Schweiz AG
Original Assignee
ABB Schweiz AG
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by ABB Schweiz AG filed Critical ABB Schweiz AG
Assigned to ABB SCHWEIZ AG reassignment ABB SCHWEIZ AG ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: Amihai, Ido, HOLLENDER, MARTIN, KLOEPPER, Benjamin, SCHLAKE, Jan Christoph, Kotriwala, Arzam Muzaffar, CHIOUA, MONCEF, ABUKWAIK, Hadil, Janka, Dennis, Lenders, Felix, SCHMIDT, BENEDIKT
Publication of US20230023896A1 publication Critical patent/US20230023896A1/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/082Learning methods modifying the architecture, e.g. adding, deleting or silencing nodes or connections
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B19/00Programme-control systems
    • G05B19/02Programme-control systems electric
    • G05B19/418Total factory control, i.e. centrally controlling a plurality of machines, e.g. direct or distributed numerical control [DNC], flexible manufacturing systems [FMS], integrated manufacturing systems [IMS] or computer integrated manufacturing [CIM]
    • G05B19/41835Total factory control, i.e. centrally controlling a plurality of machines, e.g. direct or distributed numerical control [DNC], flexible manufacturing systems [FMS], integrated manufacturing systems [IMS] or computer integrated manufacturing [CIM] characterised by programme execution
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/02Knowledge representation; Symbolic representation
    • G06N5/022Knowledge engineering; Knowledge acquisition
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B13/00Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
    • G05B13/02Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
    • G05B13/0265Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B17/00Systems involving the use of models or simulators of said systems
    • G05B17/02Systems involving the use of models or simulators of said systems electric
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B19/00Programme-control systems
    • G05B19/02Programme-control systems electric
    • G05B19/418Total factory control, i.e. centrally controlling a plurality of machines, e.g. direct or distributed numerical control [DNC], flexible manufacturing systems [FMS], integrated manufacturing systems [IMS] or computer integrated manufacturing [CIM]
    • G05B19/41885Total factory control, i.e. centrally controlling a plurality of machines, e.g. direct or distributed numerical control [DNC], flexible manufacturing systems [FMS], integrated manufacturing systems [IMS] or computer integrated manufacturing [CIM] characterised by modeling, simulation of the manufacturing system
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B2219/00Program-control systems
    • G05B2219/30Nc systems
    • G05B2219/32Operator till task planning
    • G05B2219/32015Optimize, process management, optimize production line
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B2219/00Program-control systems
    • G05B2219/30Nc systems
    • G05B2219/32Operator till task planning
    • G05B2219/32352Modular modeling, decompose large system in smaller systems to simulate
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/088Non-supervised learning, e.g. competitive learning
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/80Management or planning

Definitions

  • the present disclosure relates to method of transfer learning for a specific production process of an industrial plant, a use of a new machine learning model, trained by such a method, a data processing system, and a computer program.
  • a method of transfer learning for a specific production process of an industrial plant comprises the following steps.
  • a step a plurality of data templates defining expected data for a production process are provided.
  • plant data of the industrial plant comprising data points of the specific production process, are provided, wherein the data points comprise information about input and output of the specific production process.
  • the data template defines a grouping for the expected data according to their relation in the industrial plant.
  • a process instance of the specific production process is determined, defining a mapping between the plant data to the expected data of the specific production process.
  • Historic process data being historic sensor data relating to the specific production process, is determined, using the determined process instance.
  • training data is determined using the determined process instance and the determined historic process data; wherein the training data comprises a structured data matrix, wherein columns of the data matrix represent the sensor data that are grouped in accordance with the data template and wherein rows of the data matrix represent timestamps of obtaining the sensor data.
  • a pre-trained machine learning model is provided using the determined process instance.
  • a new machine learning model is trained using the provided pre-trained model and the determined training data.
  • FIG. 1 is a schematic of a training process for transfer learning in accordance with the disclosure.
  • FIG. 2 is a diagram of a relation between the data template and the pre-trained machine learning model in accordance with the disclosure.
  • FIG. 3 is a schematic of an arrangement for reusing layers of a pre-trained machine learning model in accordance with the disclosure.
  • FIG. 4 is a flowchart for a method of transfer learning for a specific production process in accordance with the disclosure.
  • the functional modules and/or the configuration mechanisms are implemented as programmed software modules or procedures, respectively; however, one skilled in the art will understand that the functional modules and/or the configuration mechanisms can be implemented fully or assembly partially in hardware.
  • FIG. 1 shows a schematic view of a training process for transfer learning.
  • a process instance is created either manually by a human who defines the mapping between industrial plant data P, in particular inputs/outputs, I/Os, in the industrial plant to data templates T.
  • one template T is selected corresponding to the industrial plant data P of the current industrial plant.
  • this is done automatically using digital P&ID and I/O lists and eventually the C&E matrices of the plant by using pre-defined rules for mapping sensor locations to data points in the data template T.
  • historic process data H is extracted from a historian, in particular using I/Os' names.
  • the process instance reflects the current asset or production process of the current industrial plant on which the new machine learning model M should be used.
  • the process instance for example defines names of inputs and outputs of the current industrial plant for which historical production data H can be determined.
  • a standard data matrix is build, in which columns represent the data points of the historical production data H and the rows represent the timestamps of corresponding sensor readings.
  • the individual data points are subject to various data pre-processing steps as follows: Adapting the sampling frequencies to the standard matrix format, e.g., down sampling from seconds to minutes or up sampling from minutes to 30 seconds, Scaling the data to 0-1 domain, optionally fuse missing data points from available data points, e.g., estimate bottom section temperature based on top section temperature, and remove outliers.
  • a new model is trained starting from a pre-trained model Mp using weights obtain from previous trainings and allow the training process to adjust these weights according to loss generated from data samples of the current plant. This may involve using all or parts of the of the pre-trained model.
  • certain layers of the network can be excluded, e.g., freeze the layer, from the changing the weights, e.g., keep top layer as it is, or optionally choose different learning rates across the layered networks.
  • FIG. 2 shows a relation between the data template and the pre-trained machine learning model.
  • the data template T is a list of data point for example, I 1 : temperature values, I 2 : pressure values, I 3 : level alarms, and I 4 : valve positions with information on the location on the process or asset (e.g., temperature on top section of processing column).
  • I 1 temperature values
  • I 2 pressure values
  • I 3 level alarms
  • I 4 valve positions with information on the location on the process or asset (e.g., temperature on top section of processing column).
  • Each prediction, the order of the training data is maintained across all training runs of the new machine learning model M, or in other words the transferred learning model. In this way, the weights the pre-trained machine learning model Mp has obtained during training still can be mapped to the same meaningful features F 1 -F 5 across all training runs.
  • FIG. 3 shows a schematic view of reusing layers of a pre-trained machine learning model.
  • a new machine learning model M comprises a plurality of layers, in this case, a first layer L 1 , a second layer L 2 , a third layer L 3 and a fourth layer Ln.
  • the first layer L 1 , the second layer L 2 , the third layer L 3 and the fourth layer Ln are pre-trained layers that have been trained with plant data for a first plant A.
  • weights obtained by training the first layer L 1 , the second layer L 2 , the third layer L 3 and the fourth layer Ln are already known to the new machine learning model M.
  • the new machine learning model M when training the new machine learning model M with plant data of a second plant B, not all weights are adjusted.
  • the first layer L 1 , the second layer L 2 and the third layer L 3 are frozen. In other words, those weights are not adjusted during training with the plant data of the second plant B.
  • the new machine learning model M that has been trained with the data of the second plant B does not perform to a predetermined satisfaction, an iterative process is executed in which it is decided which parts of the pre-trained machine learning model Mp can be reused and which parts should be dropped and retrained.
  • the performance of the new machine learning model M is determined in an evaluation process using a score model, for example classification, regression values or anomaly scores. In other words, if the new machine learning model M does not perform satisfactory, an amount of frozen layers are iteratively unfrozen and retrained.
  • FIG. 4 shows a schematic view of a method of transfer learning for a specific production process.
  • a plurality of data templates T defining expected data for a production process are provided.
  • plant data of the industrial plant comprising data points of the specific production process, are provided, wherein the data points comprise information about input and output of the specific production process.
  • the data template defines a grouping for the expected data according to their relation in the industrial plant.
  • a process instance I of the specific production process is determined, defining a mapping between the plant data to the expected data of the specific production process.
  • Historic process data H being historic sensor data relating to the specific production process, is determined in a fourth step S 40 , using the determined process instance I.
  • training data is determined using the determined process instance I and the determined historic process data H; wherein the training data comprises a structured data matrix, wherein columns of the data matrix represent the sensor data that are grouped in accordance with the data template T and wherein rows of the data matrix represent timestamps of obtaining the sensor data.
  • a pre-trained machine learning model Mp is provided using the determined process instance I.
  • a new machine learning model Mn is trained using the provided pre-trained model Mp and the determined training data.
  • the data points comprise information the specific production process, in particular an asset of the production process, with basic semantic information, for example sensor positions and/or sensor types.
  • data templates comprises a list of the typical data points or measurements that are typically available from an asset (e.g. a drive train (pump, motor, drive) or distillation columns (temperature, levels, pressures and flows on different height levels). Furthermore, the data template places measurements that are related in proximity in the list. e.g. the speed setpoint of the drive, the voltage/current of the motor and the vibration of pump and motor are subsequent elements of the list.
  • asset e.g. a drive train (pump, motor, drive) or distillation columns (temperature, levels, pressures and flows on different height levels).
  • the data template places measurements that are related in proximity in the list. e.g. the speed setpoint of the drive, the voltage/current of the motor and the vibration of pump and motor are subsequent elements of the list.
  • ANN artificial neural network
  • typical signal, A&E, combination e.g. 2 ⁇ level, 2 ⁇ pressure, temperature, inflow, outflow of a processing columns.
  • These signals are always grouped together in the plant data, e.g., neighboring columns, so that an artificial neural network processes the data together, e.g. by convolutions, or control the network architecture, e.g. which data is convoluted. This helps the performance of the machine learning model. It can be also used to facilitate transfer learning. If a new model is trained and also data is used from a process column, the network architecture and weights from previously learnt models can be partially extracted.
  • Digital libraries of data templates that define what data is expected from production processes are provided as inputs. Additionally, plant data, comprising a list of data points of a specific asset or processes with basic semantic information, e.g., sensors position and their types, are provided. Further, historic process data from the current process that are tried to transfer the machine learning model to are provided.
  • a new working machine learning model is achieved by tuning the pre-trained model to the current industrial plant.
  • the new model is used to present the production process or asset status to the human user or to trigger automated actions, e.g., closing a valve.
  • the data templates comprise digital libraries that define what data are expected from a production process.
  • the data points comprise temperature values, pressure values, level alarms, valve positions.
  • the pre-trained machine learning model has been trained from at least one asset or production process of an industrial plant.
  • the method provides working machine learning model by tuning a pre-trained machine learning model to the current industrial plant or in particular a component of the current industrial plant.
  • the described method allows for providing transfer learning for industrial applications based on data templates of industrial plant signals.
  • an improved method for transfer learning for a specific production process of an industrial plant is provided.
  • determining the training data comprises pre-processing the historic process data, thereby standardizing a format of the training data.
  • the pre-processing steps format the historic process data so that a data matrix is determined that is semantically identical to what the pre-trained model has been trained on.
  • the determined data matrix is used as input for new machine learning model for training to obtain predictions from the new machine learning model that are either displayed to a human user or used to trigger automatic actions.
  • pre-processing the historic process data comprises adapting a sampling frequency to a standardized data matrix format.
  • pre-processing the historic process data comprises scaling the historic process data to a 0-1 domain.
  • pre-processing the historic process data comprises fusing missing data points of the historic process data from available data points of the historic process data.
  • pre-processing the historic process data comprises removing outliers of the historic process data.
  • the pre-trained model comprises weights wherein training the new machine learning model comprises adjusting the weights
  • the weights are obtained from previous trainings of the pre-trained model.
  • the weights are adjusted according to loss generated from data samples of new machine learning model, in other words the current industrial plant.
  • the pre-trained machine learning model comprises at least one layer wherein training the new machine learning model comprises the following steps.
  • each layer is categorised, using the determined process instance, in one of the categories frozen or non-frozen.
  • the frozen layers of the pre-trained machine learning model are reused and the non-frozen layers of the pre-trained machine learning model are retrained.
  • each layer it is determined if the layer is a frozen layer that is not retrained or a non-frozen layer that is retrained, using the corresponding data template.
  • reusing the frozen layers allows to use a network architecture and/or weights from the pre-trained machine learning model to train the new machine learning model.
  • the determination of the layer is a frozen layer or a non-frozen layer is automatically optimized using hyper-parameter optimization.
  • the retraining is performed in an iterative way where additional layers are retrained until a satisfactory level of performance is achieved.
  • determining, which layer is a frozen layer and which layer is a non-frozen layer is done based on the type of the layer.
  • the aim is to retrain mainly the decision logic of the machine learning network.
  • these layer have a different type of architecture (densely connected) then previous layers (e.g. convolutional and pooling layers or Recurrent Layers).
  • the determination is done by trying out reusing different layers and selecting the configuration that yield the best results (best performance on a test data set, e.g. measured as root-mean-square error for regression or accuracy for classification).
  • the pre-trained machine learning model comprises at least one layer, wherein training the new machine learning model comprises the following steps: In a step, each layer is categorised, using the determined process instance, in one of the categories frozen or non-frozen. In another step, different learning rates are applied on the at least one layer depending on the determination if the layer is a frozen layer or a non-frozen layer.
  • different learning rates can be chosen across the layers of the pre-trained machine learning model.
  • the determination of the layer is a frozen layer or a non-frozen layer is automatically optimized using hyper-parameter optimization.
  • the retraining is performed in an iterative way where additional layers are retrained until a satisfactory level of performance is achieved.
  • the data points comprise input/output names of the specific production process, wherein the historic process data is determined using the input/output names.
  • training the new machine learning model comprises using the data matrix as input for the new machine learning model to obtain a prediction as output from the new machine learning model.
  • the prediction comprises a classification, regression values and/or an anomaly score.
  • the new machine learning model trained by a method, as described herein, is used to provide status data of the industrial plant.
  • the working new machine learning model allows presenting a process status or an asset status of the industrial plant to a human user or to trigger an automated action, for example closing a valve of the industrial plant.
  • a data processing system comprising means for carrying out the steps of a method, as described herein, is provided.
  • a computer program comprising instructions, which, when the program is executed by a computer, cause the computer to carry out the steps of a method, as used herein, is provided.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Software Systems (AREA)
  • Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • Computing Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Mathematical Physics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Medical Informatics (AREA)
  • Automation & Control Theory (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Manufacturing & Machinery (AREA)
  • Quality & Reliability (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Testing And Monitoring For Control Systems (AREA)
  • Feedback Control In General (AREA)

Abstract

A method of transfer learning for a specific production process of an industrial plant includes providing data templates defining expected data for a production process, and providing plant data, wherein the data templates define groupings for the expected data according to their relation in the industrial plant; determining a process instance and defining a mapping with the plant data; determining historic process data; determining training data using the determined process instance and the determined historic process data, wherein the training data comprises a structured data matrix, wherein columns of the data matrix represent the sensor data that are grouped in accordance with the data template and wherein rows of the data matrix represent timestamps of obtaining the sensor data; providing a pre-trained machine learning model using the determined process instance; and training a new machine learning model using the provided pre-trained model and the determined training data.

Description

    CROSS-REFERENCE TO RELATED APPLICATIONS
  • This patent application claims priority to International Patent Application No. PCT/EP2021/058477, filed on Mar. 31, 2021, which claims priority to International Application No. PCT/EP2020/059169, filed on Mar. 31, 2020, each of which is incorporated herein in its entirety by reference.
  • FIELD OF THE DISCLOSURE
  • The present disclosure relates to method of transfer learning for a specific production process of an industrial plant, a use of a new machine learning model, trained by such a method, a data processing system, and a computer program.
  • BACKGROUND OF THE INVENTION
  • Looking at the current state of machine learning in industry, there is a growing interest in utilizing it for different useful applications. The machine learning-based industrial applications play a role in different tasks like predictive maintenance, process monitoring, and quality control. In these different tasks of problems, certain signals, such as temperature, pressure, flow, etc., can be shared across different tasks, and thus enable knowledge transfer among tasks. However, building a machine learning model for a specific problem of an industrial plant then transfer its learning by reusing it to solve similar problem of another plant is not trivial. This is due to the fact even similar tasks and plants are still having different space of signals.
  • Each time a new problem in industrial plants and their processes needs to be addressed using machine learning, it is required to go through the tedious and time-consuming tasks of training and validating the model. To decrease this effort and its cost, it would be of advantage to reuse prior learning and knowledge acquired on industrial plant and processes and incorporate them when training new models for similar problems. However, reusing machine learning models or parts of them is a complex task itself and it requires better organization of analyzed input signals. This challenge can be even harder when it is applied to the industrial applications that may involve several signals related to one process or plant.
  • BRIEF SUMMARY OF THE INVENTION
  • According to an aspect of the disclosure, a method of transfer learning for a specific production process of an industrial plant comprises the following steps. In a step, a plurality of data templates defining expected data for a production process are provided. In another step, plant data of the industrial plant, comprising data points of the specific production process, are provided, wherein the data points comprise information about input and output of the specific production process. The data template defines a grouping for the expected data according to their relation in the industrial plant. In another step, a process instance of the specific production process is determined, defining a mapping between the plant data to the expected data of the specific production process.
  • Historic process data, being historic sensor data relating to the specific production process, is determined, using the determined process instance. In another step, training data is determined using the determined process instance and the determined historic process data; wherein the training data comprises a structured data matrix, wherein columns of the data matrix represent the sensor data that are grouped in accordance with the data template and wherein rows of the data matrix represent timestamps of obtaining the sensor data. In another step, a pre-trained machine learning model is provided using the determined process instance. In another step, a new machine learning model is trained using the provided pre-trained model and the determined training data.
  • BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWING(S)
  • FIG. 1 is a schematic of a training process for transfer learning in accordance with the disclosure.
  • FIG. 2 is a diagram of a relation between the data template and the pre-trained machine learning model in accordance with the disclosure.
  • FIG. 3 is a schematic of an arrangement for reusing layers of a pre-trained machine learning model in accordance with the disclosure.
  • FIG. 4 is a flowchart for a method of transfer learning for a specific production process in accordance with the disclosure.
  • DETAILED DESCRIPTION OF THE INVENTION
  • The reference symbols used in the drawings, and their meanings, are listed in summary form in the list of reference symbols. In principle, identical assembly parts are provided with the same reference symbols in the figures.
  • Preferably, the functional modules and/or the configuration mechanisms are implemented as programmed software modules or procedures, respectively; however, one skilled in the art will understand that the functional modules and/or the configuration mechanisms can be implemented fully or assembly partially in hardware.
  • FIG. 1 shows a schematic view of a training process for transfer learning. In one step S30, a process instance is created either manually by a human who defines the mapping between industrial plant data P, in particular inputs/outputs, I/Os, in the industrial plant to data templates T. In other words, from a plurality of generic templates T, comprising expected data of specific assets or production processes, one template T is selected corresponding to the industrial plant data P of the current industrial plant. Alternatively, this is done automatically using digital P&ID and I/O lists and eventually the C&E matrices of the plant by using pre-defined rules for mapping sensor locations to data points in the data template T.
  • In another step S40, historic process data H is extracted from a historian, in particular using I/Os' names. In other words, the process instance reflects the current asset or production process of the current industrial plant on which the new machine learning model M should be used. Thus, the process instance for example defines names of inputs and outputs of the current industrial plant for which historical production data H can be determined.
  • In another step S50, a standard data matrix is build, in which columns represent the data points of the historical production data H and the rows represent the timestamps of corresponding sensor readings. The individual data points are subject to various data pre-processing steps as follows: Adapting the sampling frequencies to the standard matrix format, e.g., down sampling from seconds to minutes or up sampling from minutes to 30 seconds, Scaling the data to 0-1 domain, optionally fuse missing data points from available data points, e.g., estimate bottom section temperature based on top section temperature, and remove outliers.
  • In another step S60, a new model is trained starting from a pre-trained model Mp using weights obtain from previous trainings and allow the training process to adjust these weights according to loss generated from data samples of the current plant. This may involve using all or parts of the of the pre-trained model. Optionally, certain layers of the network can be excluded, e.g., freeze the layer, from the changing the weights, e.g., keep top layer as it is, or optionally choose different learning rates across the layered networks. These two options could be explored and optimized automatically using hyper-parameter optimization.
  • FIG. 2 shows a relation between the data template and the pre-trained machine learning model. The data template T is a list of data point for example, I1: temperature values, I2: pressure values, I3: level alarms, and I4: valve positions with information on the location on the process or asset (e.g., temperature on top section of processing column). Each prediction, the order of the training data is maintained across all training runs of the new machine learning model M, or in other words the transferred learning model. In this way, the weights the pre-trained machine learning model Mp has obtained during training still can be mapped to the same meaningful features F1-F5 across all training runs.
  • FIG. 3 shows a schematic view of reusing layers of a pre-trained machine learning model. A new machine learning model M comprises a plurality of layers, in this case, a first layer L1, a second layer L2, a third layer L3 and a fourth layer Ln. The first layer L1, the second layer L2, the third layer L3 and the fourth layer Ln are pre-trained layers that have been trained with plant data for a first plant A. In other words, weights obtained by training the first layer L1, the second layer L2, the third layer L3 and the fourth layer Ln are already known to the new machine learning model M. However, when training the new machine learning model M with plant data of a second plant B, not all weights are adjusted. In this case, the first layer L1, the second layer L2 and the third layer L3 are frozen. In other words, those weights are not adjusted during training with the plant data of the second plant B.
  • If the new machine learning model M that has been trained with the data of the second plant B does not perform to a predetermined satisfaction, an iterative process is executed in which it is decided which parts of the pre-trained machine learning model Mp can be reused and which parts should be dropped and retrained. The performance of the new machine learning model M is determined in an evaluation process using a score model, for example classification, regression values or anomaly scores. In other words, if the new machine learning model M does not perform satisfactory, an amount of frozen layers are iteratively unfrozen and retrained.
  • FIG. 4 shows a schematic view of a method of transfer learning for a specific production process.
  • In a first step S10, a plurality of data templates T defining expected data for a production process are provided. In a second step S20, plant data of the industrial plant, comprising data points of the specific production process, are provided, wherein the data points comprise information about input and output of the specific production process. The data template defines a grouping for the expected data according to their relation in the industrial plant. In a third step S30, a process instance I of the specific production process is determined, defining a mapping between the plant data to the expected data of the specific production process.
  • Historic process data H, being historic sensor data relating to the specific production process, is determined in a fourth step S40, using the determined process instance I. In a fifth step S50, training data is determined using the determined process instance I and the determined historic process data H; wherein the training data comprises a structured data matrix, wherein columns of the data matrix represent the sensor data that are grouped in accordance with the data template T and wherein rows of the data matrix represent timestamps of obtaining the sensor data. In a sixth step S60, a pre-trained machine learning model Mp is provided using the determined process instance I. In a seventh step S70, a new machine learning model Mn is trained using the provided pre-trained model Mp and the determined training data.
  • The one embodiment, the data points comprise information the specific production process, in particular an asset of the production process, with basic semantic information, for example sensor positions and/or sensor types.
  • The term “data templates”, as used herein, comprises a list of the typical data points or measurements that are typically available from an asset (e.g. a drive train (pump, motor, drive) or distillation columns (temperature, levels, pressures and flows on different height levels). Furthermore, the data template places measurements that are related in proximity in the list. e.g. the speed setpoint of the drive, the voltage/current of the motor and the vibration of pump and motor are subsequent elements of the list.
  • When the data templates are determined, typical signal combinations are identified in the expected data. Those typical signal combinations are always grouped together in the training data. Further preferably, the grouped signals are disposed in neighbouring columns of the data matrix. Thus, a machine learning model, in particular an artificial neural network, ANN, processes the grouped signals together, for example by convolutions, or control the network architecture, in particular which data is convoluted with which data. Thus, a performance of the new machine learning model is improved. Further, transfer learning is facilitated.
  • In other words, typical signal, A&E, combination, e.g. 2× level, 2× pressure, temperature, inflow, outflow of a processing columns, are identified. These signals are always grouped together in the plant data, e.g., neighboring columns, so that an artificial neural network processes the data together, e.g. by convolutions, or control the network architecture, e.g. which data is convoluted. This helps the performance of the machine learning model. It can be also used to facilitate transfer learning. If a new model is trained and also data is used from a process column, the network architecture and weights from previously learnt models can be partially extracted.
  • Digital libraries of data templates that define what data is expected from production processes are provided as inputs. Additionally, plant data, comprising a list of data points of a specific asset or processes with basic semantic information, e.g., sensors position and their types, are provided. Further, historic process data from the current process that are tried to transfer the machine learning model to are provided.
  • As an output, a new working machine learning model is achieved by tuning the pre-trained model to the current industrial plant. In addition, the new model is used to present the production process or asset status to the human user or to trigger automated actions, e.g., closing a valve.
  • In one embodiment, the data templates comprise digital libraries that define what data are expected from a production process.
  • In one embodiment, the data points comprise temperature values, pressure values, level alarms, valve positions.
  • In one embodiment, the pre-trained machine learning model has been trained from at least one asset or production process of an industrial plant.
  • In other words, the method provides working machine learning model by tuning a pre-trained machine learning model to the current industrial plant or in particular a component of the current industrial plant.
  • The described method allows for providing transfer learning for industrial applications based on data templates of industrial plant signals. Thus, an improved method for transfer learning for a specific production process of an industrial plant is provided.
  • In a preferred embodiment, determining the training data comprises pre-processing the historic process data, thereby standardizing a format of the training data.
  • Preferably, the pre-processing steps format the historic process data so that a data matrix is determined that is semantically identical to what the pre-trained model has been trained on. The determined data matrix is used as input for new machine learning model for training to obtain predictions from the new machine learning model that are either displayed to a human user or used to trigger automatic actions.
  • In one embodiment, pre-processing the historic process data comprises adapting a sampling frequency to a standardized data matrix format.
  • In one embodiment, pre-processing the historic process data comprises scaling the historic process data to a 0-1 domain.
  • In one embodiment, pre-processing the historic process data comprises fusing missing data points of the historic process data from available data points of the historic process data.
  • In one embodiment, pre-processing the historic process data comprises removing outliers of the historic process data.
  • In one embodiment, the pre-trained model comprises weights wherein training the new machine learning model comprises adjusting the weights
  • In other words, the weights are obtained from previous trainings of the pre-trained model.
  • Preferably, the weights are adjusted according to loss generated from data samples of new machine learning model, in other words the current industrial plant.
  • In a preferred embodiment, the pre-trained machine learning model comprises at least one layer wherein training the new machine learning model comprises the following steps. In a step, each layer is categorised, using the determined process instance, in one of the categories frozen or non-frozen. In another step, the frozen layers of the pre-trained machine learning model are reused and the non-frozen layers of the pre-trained machine learning model are retrained.
  • Preferably, for each layer, it is determined if the layer is a frozen layer that is not retrained or a non-frozen layer that is retrained, using the corresponding data template.
  • Preferably, reusing the frozen layers allows to use a network architecture and/or weights from the pre-trained machine learning model to train the new machine learning model.
  • Preferably, the determination of the layer is a frozen layer or a non-frozen layer is automatically optimized using hyper-parameter optimization.
  • Preferably, the retraining is performed in an iterative way where additional layers are retrained until a satisfactory level of performance is achieved.
  • Preferably, determining, which layer is a frozen layer and which layer is a non-frozen layer, is done based on the type of the layer. The aim is to retrain mainly the decision logic of the machine learning network. Usually, these layer have a different type of architecture (densely connected) then previous layers (e.g. convolutional and pooling layers or Recurrent Layers). Further preferably, the determination is done by trying out reusing different layers and selecting the configuration that yield the best results (best performance on a test data set, e.g. measured as root-mean-square error for regression or accuracy for classification).
  • Thus, an automatic matching of reusable pre-trained machine learning models based on their data templates is provided.
  • In a preferred embodiment, the pre-trained machine learning model comprises at least one layer, wherein training the new machine learning model comprises the following steps: In a step, each layer is categorised, using the determined process instance, in one of the categories frozen or non-frozen. In another step, different learning rates are applied on the at least one layer depending on the determination if the layer is a frozen layer or a non-frozen layer.
  • In other words, different learning rates can be chosen across the layers of the pre-trained machine learning model.
  • Preferably, the determination of the layer is a frozen layer or a non-frozen layer is automatically optimized using hyper-parameter optimization.
  • Preferably, the retraining is performed in an iterative way where additional layers are retrained until a satisfactory level of performance is achieved.
  • In a preferred embodiment, the data points comprise input/output names of the specific production process, wherein the historic process data is determined using the input/output names.
  • In a preferred embodiment, wherein training the new machine learning model comprises using the data matrix as input for the new machine learning model to obtain a prediction as output from the new machine learning model.
  • Preferably, the prediction comprises a classification, regression values and/or an anomaly score.
  • According to an aspect of the disclosure, the new machine learning model, trained by a method, as described herein, is used to provide status data of the industrial plant.
  • In other words, the working new machine learning model allows presenting a process status or an asset status of the industrial plant to a human user or to trigger an automated action, for example closing a valve of the industrial plant.
  • According to an aspect of the invention, a data processing system comprising means for carrying out the steps of a method, as described herein, is provided.
  • According to an aspect of the invention, a computer program comprising instructions, which, when the program is executed by a computer, cause the computer to carry out the steps of a method, as used herein, is provided.
  • LIST OF REFERENCE SYMBOLS
    • T data template
    • M new machine learning model
    • Mp pre-trained machine learning model
    • H historic process data
    • P plant data
    • l1 first list
    • l2 second list
    • l3 third list
    • l4 fourth list
    • F1 first feature
    • F2 second feature
    • F3 third feature
    • F4 fourth feature
    • F5 fifth feature
    • L1 first layer
    • L2 second layer
    • L3 third layer
    • Ln fourth layer
    • A plant data of a first plant
    • B plant data of a second plant
  • All references, including publications, patent applications, and patents, cited herein are hereby incorporated by reference to the same extent as if each reference were individually and specifically indicated to be incorporated by reference and were set forth in its entirety herein.
  • The use of the terms “a” and “an” and “the” and “at least one” and similar referents in the context of describing the invention (especially in the context of the following claims) are to be construed to cover both the singular and the plural, unless otherwise indicated herein or clearly contradicted by context. The use of the term “at least one” followed by a list of one or more items (for example, “at least one of A and B”) is to be construed to mean one item selected from the listed items (A or B) or any combination of two or more of the listed items (A and B), unless otherwise indicated herein or clearly contradicted by context. The terms “comprising,” “having,” “including,” and “containing” are to be construed as open-ended terms (i.e., meaning “including, but not limited to,”) unless otherwise noted. Recitation of ranges of values herein are merely intended to serve as a shorthand method of referring individually to each separate value falling within the range, unless otherwise indicated herein, and each separate value is incorporated into the specification as if it were individually recited herein. All methods described herein can be performed in any suitable order unless otherwise indicated herein or otherwise clearly contradicted by context. The use of any and all examples, or exemplary language (e.g., “such as”) provided herein, is intended merely to better illuminate the invention and does not pose a limitation on the scope of the invention unless otherwise claimed. No language in the specification should be construed as indicating any non-claimed element as essential to the practice of the invention.
  • Preferred embodiments of this invention are described herein, including the best mode known to the inventors for carrying out the invention. Variations of those preferred embodiments may become apparent to those of ordinary skill in the art upon reading the foregoing description. The inventors expect skilled artisans to employ such variations as appropriate, and the inventors intend for the invention to be practiced otherwise than as specifically described herein. Accordingly, this invention includes all modifications and equivalents of the subject matter recited in the claims appended hereto as permitted by applicable law. Moreover, any combination of the above-described elements in all possible variations thereof is encompassed by the invention unless otherwise indicated herein or otherwise clearly contradicted by context.

Claims (11)

What is claimed is:
1. A method of transfer learning for a specific production process of an industrial plant, comprising:
providing a plurality of data templates defining expected data for a production process;
providing plant data of the industrial plant, comprising data points of the specific production process, wherein the data points comprise information about input and output of the specific production process;
wherein the data template defines a grouping for the expected data according to their relation in the industrial plant;
determining a process instance of the specific production process, defining a mapping between the plant data (P) to the expected data of the specific production process;
determining historic process data, being historic sensor data relating to the specific production process using the determined process instance;
determining training data using the determined process instance and the determined historic process data, wherein the training data comprises a structured data matrix, wherein columns of the data matrix represent the sensor data that are grouped in accordance with the data template, and wherein rows of the data matrix represent timestamps of obtaining the sensor data;
providing a pre-trained machine learning model using the determined process instance; and
training a new machine learning model using the provided pre-trained model and the determined training data.
2. The method of claim 1, wherein determining the training data comprises preprocessing the historic process data, thereby standardizing a format of the training data.
3. The method of claim 2, wherein preprocessing the historic process data comprises adapting a sampling frequency to a standardized data matrix format.
4. The method of claim 2, wherein preprocessing the historic process data comprises scaling the historic process data to a 0-1 domain.
5. The method of claim 2, wherein preprocessing the historic process data comprises fusing missing data points of the historic process data from available data points of the historic process data.
6. The method of claim 2, wherein preprocessing the historic process data comprises removing outliers from the historic process data.
7. The method of claim 1, wherein the pre-trained model comprises trained weights, and wherein training the new machine learning model comprises adjusting the trained weights.
8. The method of claim 1, wherein the pre-trained machine learning model comprises at least one layer, and wherein training the new machine learning model comprises:
categorizing each layer using the determined process instance in one of the categories frozen or non-frozen; and
reusing the frozen layers of the pre-trained machine learning model and retraining the non-frozen layers of the pre-trained machine learning model.
9. The method of claim 1, wherein the pre-trained machine learning model comprises at least one layer, and wherein training the new machine learning model comprises:
categorizing each layer using the determined process instance in one of the categories frozen or non-frozen; and
applying different learning rates on the at least one layer depending on the determination if the layer is a frozen layer or a non-frozen layer.
10. The method of claim 1, wherein the data points comprise input/output names of the specific production process, and wherein the historic process data is determined using the input/output names.
11. The method of claim 1, wherein training the new machine learning model comprises using the data matrix as input for the new machine learning model to obtain a prediction as output from the new machine learning model.
US17/957,592 2020-03-31 2022-09-30 Method of Transfer Learning for a Specific Production Process of an Industrial Plant Pending US20230023896A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EPPCT/EP2020/059169 2020-03-31
PCT/EP2021/058477 WO2021198357A1 (en) 2020-03-31 2021-03-31 Method of transfer learning for a specific production process of an industrial plant

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP2021/058477 Continuation WO2021198357A1 (en) 2020-03-31 2021-03-31 Method of transfer learning for a specific production process of an industrial plant

Publications (1)

Publication Number Publication Date
US20230023896A1 true US20230023896A1 (en) 2023-01-26

Family

ID=75302597

Family Applications (3)

Application Number Title Priority Date Filing Date
US17/956,076 Pending US20230019201A1 (en) 2020-03-31 2022-09-29 Industrial Plant Machine Learning System
US17/957,592 Pending US20230023896A1 (en) 2020-03-31 2022-09-30 Method of Transfer Learning for a Specific Production Process of an Industrial Plant
US17/957,609 Pending US20230029400A1 (en) 2020-03-31 2022-09-30 Method of Hierarchical Machine Learning for an Industrial Plant Machine Learning System

Family Applications Before (1)

Application Number Title Priority Date Filing Date
US17/956,076 Pending US20230019201A1 (en) 2020-03-31 2022-09-29 Industrial Plant Machine Learning System

Family Applications After (1)

Application Number Title Priority Date Filing Date
US17/957,609 Pending US20230029400A1 (en) 2020-03-31 2022-09-30 Method of Hierarchical Machine Learning for an Industrial Plant Machine Learning System

Country Status (4)

Country Link
US (3) US20230019201A1 (en)
EP (3) EP4128070A1 (en)
CN (3) CN115087996A (en)
WO (3) WO2021198357A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115618269B (en) * 2022-12-12 2023-03-03 江门市润宇传感器科技有限公司 Big data analysis method and system based on industrial sensor production

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3827387A1 (en) * 2018-08-27 2021-06-02 Siemens Corporation Systematic prognostic analysis with dynamic causal model

Also Published As

Publication number Publication date
US20230029400A1 (en) 2023-01-26
WO2021198354A1 (en) 2021-10-07
WO2021198356A1 (en) 2021-10-07
CN115087995A (en) 2022-09-20
EP4128070A1 (en) 2023-02-08
CN115087996A (en) 2022-09-20
CN115362454A (en) 2022-11-18
WO2021198357A1 (en) 2021-10-07
EP4128069A1 (en) 2023-02-08
EP4128071A1 (en) 2023-02-08
US20230019201A1 (en) 2023-01-19

Similar Documents

Publication Publication Date Title
Diez-Olivan et al. Data fusion and machine learning for industrial prognosis: Trends and perspectives towards Industry 4.0
US11216741B2 (en) Analysis apparatus, analysis method, and non-transitory computer readable medium
US10984338B2 (en) Dynamically updated predictive modeling to predict operational outcomes of interest
US11604442B2 (en) Predictive monitoring and diagnostics systems and methods
US11022965B2 (en) Controlling multi-stage manufacturing process based on internet of things (IOT) sensors and cognitive rule induction
US20190384255A1 (en) Autonomous predictive real-time monitoring of faults in process and equipment
US11754998B2 (en) System and methods for automated model development from plant historical data for advanced process control
US10739736B2 (en) Apparatus and method for event detection and duration determination
CN110678816B (en) Method and control device for controlling a technical system
JP4133627B2 (en) Construction machine state determination device, construction machine diagnosis device, construction machine state determination method, and construction machine diagnosis method
US20230023896A1 (en) Method of Transfer Learning for a Specific Production Process of an Industrial Plant
Angelov et al. Adaptive inferential sensors based on evolving fuzzy models
US20220004163A1 (en) Apparatus for predicting equipment damage
JP7164028B2 (en) LEARNING SYSTEM, DATA GENERATION DEVICE, DATA GENERATION METHOD, AND DATA GENERATION PROGRAM
CN108960421A (en) The unmanned surface vehicle speed of a ship or plane online forecasting method based on BP neural network of improvement
Glavan et al. Production modelling for holistic production control
CN117542169A (en) Automatic equipment temperature abnormality early warning method based on big data analysis
Mypati et al. A critical review on applications of artificial intelligence in manufacturing
CN114861522A (en) Precision manufacturing quality monitoring method and device based on artificial intelligence meta-learning technology
US20210197205A1 (en) Method and device for controlling a process within a system, in particular a grinding process in a grinding device
Angelov et al. Evolving inferential sensors in the chemical process industry
Schmid et al. Neural networks and advanced algorithms for intelligent monitoring in industry
Angelov et al. Evolving fuzzy inferential sensors for process industry
Biswas A brief appraisal of machine learning in industrial sensing probes
US20230281364A1 (en) Automated operating mode detection for a multi-modal system

Legal Events

Date Code Title Description
AS Assignment

Owner name: ABB SCHWEIZ AG, SWITZERLAND

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:SCHMIDT, BENEDIKT;AMIHAI, IDO;KOTRIWALA, ARZAM MUZAFFAR;AND OTHERS;SIGNING DATES FROM 20220828 TO 20220928;REEL/FRAME:061300/0218

STPP Information on status: patent application and granting procedure in general

Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION