DE102014207683A1

DE102014207683A1 - Method and device for creating a data-based function model

Info

Publication number: DE102014207683A1
Application number: DE102014207683.8A
Authority: DE
Inventors: The Duy Nguyen-Tuong; Heiner Markert; Rene Diener; Nils Tietze
Original assignee: Robert Bosch GmbH
Current assignee: Robert Bosch GmbH
Priority date: 2014-04-24
Filing date: 2014-04-24
Publication date: 2015-10-29

Abstract

Die Erfindung betrifft ein Verfahren zum Erstellen eines datenbasierten Funktionsmodells, insbesondere eines Gauß-Prozess-Modells, mit folgenden Schritten: – Bereitstellen (S1) von Trainingsdaten; – Unterteilen (S2) der Trainingsdaten in Untermengen von Trainingsdaten; – Erstellen (S3) eines datenbasierten lokalen Teilmodells (51) basierend auf jeder der Untermenge der Trainingsdaten; und – Kombinieren (S5) der lokalen Teilmodelle (51), um das datenbasierte Funktionsmodell zu erhalten.The invention relates to a method for creating a data-based function model, in particular a Gaussian process model, with the following steps: providing (S1) training data; - dividing (S2) the training data into subsets of training data; - creating (S3) a data-based local sub-model (51) based on each of the subset of the training data; and - combining (S5) the local submodels (51) to obtain the data-based functional model.

Description

Technisches GebietTechnical area

Die vorliegende Erfindung betrifft Steuergeräte für Motorsysteme, insbesondere Steuergeräte mit einer separaten Recheneinheit zum Auswerten von datenbasierten Funktionsmodellen, wie beispielsweise Gauß-Prozess-Modellen. Die vorliegende Erfindung betrifft weiterhin Algorithmen zum Erstellen von datenbasierten Funktionsmodellen basierend auf Trainingsdaten.The present invention relates to control devices for engine systems, in particular control devices with a separate computing unit for evaluating data-based function models, such as Gaussian process models. The present invention further relates to algorithms for creating data-based function models based on training data.

Stand der TechnikState of the art

Bislang werden Funktionsmodelle in Steuergeräten, d. h. Strecken- und Systemmodelle, durch die Vorgabe von Kennfeldern, Kennlinien oder ein physikalisches System nachbildenden Funktionen implementiert. Diese werden von einem Benutzer durch Anpassen der Modellparameter an die Gegebenheiten des physikalischen Systems adaptiert.So far, functional models in ECUs, i. H. Track and system models, implemented by the specification of maps, characteristics or a physical system replicating functions. These are adapted by a user by adapting the model parameters to the conditions of the physical system.

Eine Alternative stellt die Verwendung von nicht parametrischen, datenbasierten Funktionsmodellen dar, mit denen die Funktionen physikalischer Systeme im Wesentlichen ohne die Vorgabe von Parametern nachgebildet werden können. Als datenbasiertes Funktionsmodell kann beispielsweise ein Gauß-Prozess-Modell verwendet werden, das im Wesentlichen durch Hyperparameter und Stützstellen definiert wird. Die datenbasierten Funktionsmodelle werden auf Grundlage von Trainingsdaten erstellt, die in einem Prüfsystem ermittelt werden können. Die Stützstellen für das Gauß-Prozess-Modell können den Trainingsdaten entsprechen bzw. aus diesen ausgewählt sein oder generiert werden.An alternative is the use of non-parametric, data-based function models, with which the functions of physical systems can be simulated essentially without the specification of parameters. As a data-based functional model, for example, a Gaussian process model can be used, which is essentially defined by hyperparameters and support points. The data-based functional models are created on the basis of training data that can be determined in a test system. The support points for the Gaussian process model can correspond to the training data or can be selected from it or generated.

Insbesondere lokale Effekte können unter Umständen von dem erstellten datenbasierten Funktionsmodell nicht angemessen abgebildet werden. Wurde bereits ein datenbasiertes Funktionsmodell auf Grundlage von Trainingsdaten eines anfänglichen Trainingsdatensatzes bestimmt, so ist es zudem schwierig, Trainingsdaten eines nachträglich ermittelten Trainingsdatensatzes in angemessener Weise in dem bereits erstellten datenbasierten Funktionsmodell zu berücksichtigen. Durch das bloße Vereinigen der Trainingsdatensätze mit oder ohne Variation der Hyperparameter des datenbasierten Modells können lokale Effekte jedoch nur dann angemessen berücksichtigt werden, wenn die Trainingsdaten des nachträglich hinzugefügten Trainingsdatensatzes eine ausreichende Anzahl an Messpunkten aufweisen. Weiterhin ist es erforderlich, dass die Messpunkte des nachträglich hinzugefügten Trainingsdatensatzes nicht im Widerspruch zu bereits vorhandenen Messpunkten des anfänglichen Trainingsdatensatzes stehen, d. h. eine relativ große Abweichung zu diesen aufweisen. Anderenfalls erhält man datenbasierte Funktionsmodelle mit hohem Messrauschen und folglich hohem Modellfehler für die Funktionswerte im Bereich des lokalen Effekts.In particular, local effects may not be adequately mapped by the created data-based function model. If a data-based functional model has already been determined on the basis of training data of an initial training data record, then it is difficult to appropriately include training data of a subsequently determined training data record in the already created data-based functional model. However, by simply merging the training data sets with or without varying the hyperparameters of the data-based model, local effects can only be adequately considered if the training data of the subsequently added training data set has a sufficient number of measurement points. Furthermore, it is necessary that the measurement points of the subsequently added training data set are not inconsistent with already existing measurement points of the initial training data set, ie. H. have a relatively large deviation from these. Otherwise, one obtains data-based function models with high measurement noise and consequently high model error for the function values in the area of the local effect.

Aus dem Stand der Technik sind Steuergeräte mit einem Mikrocontroller und einer separaten Modellberechnungseinheit zur Berechnung von datenbasierten Modellen in einem Steuergerät bekannt. So ist beispielsweise aus DE 10 2010 028 259 A1 ein Steuergerät mit einer zusätzlichen Logikschaltung bekannt, die zur Berechnung von Exponentialfunktionen ausgebildet ist, um die Durchführung von Bayes-Regressionsverfahren, die insbesondere für die Berechnung von Gauß-Prozess-Modellen benötigt werden, zu unterstützen.Control units with a microcontroller and a separate model calculation unit for calculating data-based models in a control unit are known from the prior art. For example, this is off DE 10 2010 028 259 A1 a controller with an additional logic circuit, which is designed to calculate exponential functions, in order to support the implementation of Bayes regression methods, which are needed in particular for the calculation of Gaussian process models.

Weiterhin ist aus C. Plagemann, K. Kersting, W. Burgard, ”Nonstationary Gaussian Process Regression Using Point Estimates of Local Smoothness”, ICML Proceedings, Seiten 204–2116, 2006 , ein weiteres Verfahren zum Hinzufügen von Messpunkten eines weiteren Trainingsdatensatzes zu einem bestehenden Gauß-Prozess-Modell bekannt. Dieses Verfahren ist jedoch ineffizient, insbesondere da die Parameteroptimierung schwierig ist.Furthermore, it is off C. Plagemann, K. Kersting, W. Burgard, "Nonstationary Gaussian Process Regression Using Point Estimates of Local Smoothness", ICML Proceedings, pages 204-2116, 2006 , another method for adding measurement points of another training data set to an existing Gaussian process model is known. However, this method is inefficient, especially since the parameter optimization is difficult.

Offenbarung der ErfindungDisclosure of the invention

Erfindungsgemäß sind das Verfahren zum Erstellen eines datenbasierten Funktionsmodells basierend auf Trainingsdaten gemäß Anspruch 1 sowie die Vorrichtung, das Computerprogramm und das Steuergerät gemäß den nebengeordneten Ansprüchen vorgesehen.According to the invention, the method for creating a data-based functional model based on training data according to claim 1 and the device, the computer program and the control device are provided according to the independent claims.

Weitere Ausgestaltungen sind in den abhängigen Ansprüchen angegeben.Further embodiments are specified in the dependent claims.

Gemäß einem ersten Aspekt ist ein Verfahren zum Erstellen eines datenbasierten Funktionsmodells, insbesondere eines Gauß-Prozess-Modells, vorgesehen, das die folgenden Schritte umfasst:

– Bereitstellen von Trainingsdaten;
– Unterteilen der Trainingsdaten in Untermengen von Trainingsdaten;
– Erstellen eines datenbasierten lokalen Teilmodells basierend auf jeder der Untermengen der Trainingsdaten; und
– Kombinieren der lokalen Teilmodelle, um das datenbasierte Funktionsmodell zu erhalten.

According to a first aspect, a method for creating a data-based function model, in particular a Gaussian process model, is provided, which comprises the following steps:

- providing training data;
- dividing the training data into subsets of training data;
- creating a data-based local sub-model based on each of the subsets of the training data; and
Combining the local submodels to obtain the data-based function model.

Eine Idee des obigen Verfahrens zum Erstellen eines datenbasierten Funktionsmodells besteht darin, Trainingsdaten eines vorgegebenen Trainingsdatensatzes in mehrere Untermengen zu unterteilen und daraus datenbasierte lokale Teilmodelle zu generieren. Das zu erstellende datenbasierte Funktionsmodell wird dann als Kombination der gewichteten lokalen Teilmodellen gebildet. Das Erstellen der lokalen datenbasierten Teilmodellen erfolgt in gleicher Weise wie die an sich bekannte Erstellung von nicht parametrischen datenbasierten Modellen.One idea of the above method for creating a data-based function model is to subdivide training data of a given training data set into a plurality of sub-sets and to generate data-based local submodels therefrom. The data-based function model to be created is then formed as a combination of the weighted local submodels. The creation of the local data-based submodels takes place in the same way as the known creation of non-parametric data-based models.

Insbesondere beim Modellieren von dynamischem Verhalten ist das Erstellen des datenbasierten Funktionsmodells über lokale Teilmodelle vorteilhaft, da das Erstellen eines globalen datenbasierten Funktionsmodells basierend auf der gesamten Trainingsdatenmenge ungenau ist. Ein Grund hierfür liegt beispielsweise darin, dass man mit einem einzigen Datensatz (entweder stationär oder dynamisch) nicht die gesamte Dynamik des Systems abbilden kann.In particular, when modeling dynamic behavior, creating the data-based function model via local submodels is advantageous because creating a global data-based function model based on the total amount of training data is inaccurate. One reason for this, for example, is that one single record (either stationary or dynamic) can not map the overall dynamics of the system.

Üblicherweise weist ein Eingangsgrößenraum von dynamischen Systemen komplexere Hüllen auf und ist größer als Eingangsgrößenräume mit nur stationären Eingangsgrößen. Die Hüllen von dynamischen Eingangsgrößenräumen weisen zudem konkave Formen auf, was eine Schätzung der Hülle erschwert.Usually, an input size space of dynamic systems has more complex envelopes and is larger than input size spaces with only stationary input variables. The envelopes of dynamic input size spaces also have concave shapes, making it difficult to estimate the envelope.

Die Verwendung von lokalen Teilmodellen ermöglicht eine inkrementelle Adaptierung bzw. Erweiterung von datenbasierten Funktionsmodellen für dynamische Systeme durch Hinzufügen und gegebenenfalls Entfernen von datenbasierten Teilmodellen, die basierend auf dem dynamischen Eingangsgrößenraum des physikalischen Modells erstellt worden sind. Das obige Verfahren ermöglicht durch die Berücksichtigung des dynamikfüllenden Eingangsgrößenraums auch eine verbesserte Modellierung der lokalen Effekte, die durch das dynamische Verhalten des physikalischen Systems bewirkt werden.The use of local submodels allows incremental adaptation or expansion of data-based functional models for dynamic systems by adding and possibly removing data-based submodels that have been created based on the dynamic input space of the physical model. The above method also allows improved modeling of the local effects caused by the dynamic behavior of the physical system by taking into account the dynamics-filling input space.

Darüber hinaus kann, da die Rechenkomplexität bei herkömmlichen datenbasierten Funktionsmodellen n³ (n = Anzahl der Trainingsdatenpunkte) entspricht, die Komplexität durch Erstellen von Teilmodellen aus Teilmengen der dynamikfüllenden Trainingsdaten erheblich reduziert werden.In addition, because the computational complexity of traditional data-based function models is n ³ (n = number of training data points), complexity can be significantly reduced by building sub-models from subsets of the dynamics-filling training data.

Die Gewichtung der Teilmodelle kann mithilfe der prädiktiven Varianz erfolgen. Die prädiktive Varianz kann als Metrik dienen, welche den Abstand des Abfragepunktes von der Hülle des jeweiligen Teilmodells angibt. Das Abstandsmaß dient als Beschreibung der Unsicherheit des Teilmodells.The weighting of the submodels can be done using the predictive variance. The predictive variance can serve as a metric indicating the distance of the query point from the shell of the respective submodel. The distance measure serves as a description of the uncertainty of the submodel.

Die additive Zusammenführung von mehreren datenbasierten Teilmodellen hat den weiteren Vorteil, dass diese in einem Steuergerät mit einer zusätzlichen Modellberechnungseinheit in einfacher Weise gerechnet werden kann.The additive combination of several data-based submodels has the further advantage that they can be easily calculated in a control unit with an additional model calculation unit.

Durch das Vorsehen der Modellberechnungseinheit, die in selbstständiger Weise mehrere datenbasierte Teilmodelle nacheinander berechnet und die resultierenden Teilergebnisse aufaddiert, wird eine einfache Ermittlung eines Funktionswerts basierend auf dem obigen additiven Funktionsmodell ermöglicht. Indem vermieden wird, dass die Recheneinheit die Modellberechnungseinheit für die Berechnung jedes einzelnen Teilmodells vorbereiten muss, kann die Berechnung eines entsprechenden Funktionswerts deutlich beschleunigt werden.By providing the model calculation unit which independently calculates a plurality of data-based submodels one by one and adds up the resulting sub-results, a simple determination of a function value based on the above additive function model is enabled. By avoiding that the arithmetic unit must prepare the model calculation unit for the calculation of each individual submodel, the calculation of a corresponding function value can be significantly accelerated.

Weiterhin können die Trainingsdaten Trainingsdatenpunkte umfassen, wobei die Trainingsdaten Trainingsdatenpunkte umfassen, wobei jeder Trainingsdatenpunkt einen Wert mindestens einer Messgröße eines aktuellen Rechenzyklus, einen Wert mindestens einer Ausgangsgröße eines aktuellen Rechenzyklus, mindestens einen Wert einer der Messgrößen aus einem vorangehenden Rechenzyklus und/oder einen Wert der Ausgangsgröße aus einem vorangehenden Rechenzyklus als Eingangsgrößenwerte beinhaltet.Furthermore, the training data may comprise training data points, the training data comprising training data points, each training data point having a value of at least one measurement variable of a current calculation cycle, a value of at least one output variable of a current calculation cycle, at least one value of one of the measurement variables from a preceding calculation cycle and / or a value of Output variable from a previous calculation cycle includes as input values.

Gemäß einer Ausführungsform kann das Unterteilen der Trainingsdaten manuell oder automatisch, insbesondere mithilfe eines k-Means-Algorithmus oder eines Mean-Shift-Algorithmus durchgeführt werden.According to one embodiment, the subdivision of the training data may be performed manually or automatically, in particular using a k-means algorithm or a mean-shift algorithm.

Es kann vorgesehen sein, dass das Unterteilen der Trainingsdaten durchgeführt wird, indem Untermengen von Trainingsdatenpunkten gebildet werden, wobei die Trainingsdatenpunkte einer Untermenge so gewählt sind, dass sie eine euklidische Distanz zueinander aufweisen, die kleiner ist als ein vorgegebener Schwellenwert.It can be provided that the subdivision of the training data is performed by forming subsets of training data points, wherein the training data points of a subset so are selected to have an Euclidean distance from each other that is less than a predetermined threshold.

Insbesondere kann das Kombinieren der lokalen Teilmodelle durch Bilden eines gewichteten Durchschnitts der lokalen Teilmodelle durchgeführt werden.In particular, the combining of the local submodels may be performed by forming a weighted average of the local submodels.

Gemäß einer Ausführungsform können die lokalen Teilmodelle abhängig von einem Maß der Modellunsicherheit, insbesondere abhängig von einer prädiktiven Varianz, gewichtet werden.According to one embodiment, the local submodels may be weighted depending on a measure of the model uncertainty, in particular depending on a predictive variance.

Weiterhin können in dem erstellten datenbasierten Funktionsmodell neue Trainingsdatenpunkte berücksichtigt werden, indem diese einer der Untermengen der Trainingsdaten hinzugefügt werden.Furthermore, in the created data-based function model, new training data points can be taken into account by adding them to one of the subsets of the training data.

In dem erstellten datenbasierten Funktionsmodell kann ein neuer Trainingsdatenpunkt berücksichtigt werden, indem dieser einer der Untermengen der ursprünglichen Trainingsdaten hinzugefügt wird, das lokale Teilmodell aus der entsprechenden Untermenge der ursprünglichen Trainingsdaten erstellt wird und das entsprechende lokale Teilmodell anschließend mit den übrigen lokalen Teilmodellen kombiniert wird.In the created data-based function model, a new training data point may be taken into account by adding it to one of the subsets of the original training data, building the local sub-model from the corresponding subset of the original training data, and then combining the corresponding local sub-model with the remaining local sub-models.

Gemäß einem weiteren Aspekt ist ein Verfahren zum Bereitstellen eines datenbasierten Funktionsmodells vorgesehen, wobei Trainingsdatenpunkte aus einem oder mehreren Funktionsbereichen des datenbasierten Funktionsmodells ausgewählt werden und ein weiteres datenbasiertes Funktionsmodell basierend auf den ausgewählten Trainingsdatenpunkten entsprechend dem obigen Verfahren ermittelt wird, wobei das datenbasierte Funktionsmodell und das weitere datenbasierte Funktionsmodell kombiniert werden.According to a further aspect, a method for providing a data-based function model is provided, wherein training data points are selected from one or more functional areas of the data-based functional model and another data-based functional model is determined based on the selected training data points according to the above method, wherein the data-based functional model and the other data-based functional model can be combined.

Weiterhin können die Trainingsdatenpunkte aus den Trainingsdaten ausgewählt werden, wenn deren Varianz einen vorgegebenen Varianzschwellenwert übersteigt.Furthermore, the training data points may be selected from the training data if their variance exceeds a predetermined variance threshold.

Weiterhin können das datenbasierte Funktionsmodell und das weitere datenbasierte Funktionsmodell kombiniert werden, indem aus den Funktionsmodellen ein gewichteter Durchschnitt gebildet wird, wobei die Funktionsmodelle insbesondere basierend auf der prädiktiven Varianz gewichtet werden.Furthermore, the data-based function model and the further data-based function model can be combined by forming a weighted average from the function models, wherein the function models are weighted in particular on the basis of the predictive variance.

Gemäß einem weiteren Aspekt ist eine Vorrichtung, insbesondere eine Recheneinheit, zum Erstellen eines datenbasierten Funktionsmodells, insbesondere eines Gauß-Prozess-Modells, vorgesehen, wobei die Vorrichtung ausgebildet ist, um

– Trainingsdaten bereitzustellen;
– die Trainingsdaten in Untermengen von Trainingsdaten zu unterteilen;
– ein datenbasiertes lokales Teilmodell basierend auf jeder der Untermengen der Trainingsdaten zu erstellen; und
– die lokalen Teilmodelle zu kombinieren, um das datenbasierte Funktionsmodell zu erhalten.

According to a further aspect, a device, in particular a computing unit, for creating a data-based function model, in particular a Gaussian process model, is provided, wherein the device is designed to

- provide training data;
- subdivide the training data into subsets of training data;
To create a data-based local sub-model based on each of the subsets of the training data; and
- Combine the local submodels to obtain the data-based functional model.

Gemäß einem weiteren Aspekt ist ein Computerprogramm vorgesehen, welches dazu eingerichtet ist, alle Schritte des obigen Verfahrens zum Erstellen eines datenbasierten Funktionsmodells auszuführen.According to a further aspect, a computer program is provided, which is set up to execute all steps of the above method for creating a data-based function model.

Kurzbeschreibung der ZeichnungenBrief description of the drawings

Ausführungsformen werden nachfolgend anhand der beigefügten Zeichnungen näher erläutert. Es zeigen:Embodiments are explained below with reference to the accompanying drawings. Show it:

1 eine schematische Darstellung eines Prüfstands zum Vermessen eines physikalischen Systems, um eine Ausgangsgröße y für Messgrößen zu erhalten; 1 a schematic representation of a test bench for measuring a physical system to obtain an output variable y for measured variables;

2 eine schematische Darstellung eines zu modellierenden dynamischen Systems mit einem Eingangsgrößenvektor u und einer Ausgangsgröße y; 2 a schematic representation of a dynamic system to be modeled with an input variable vector u and an output variable y;

3 eine Funktionsmodellstruktur mit einem Eingangsgrößenvektor u, der Werte von Messgrößen und einer Ausgangsgröße y eines aktuellen Rechenzyklus sowie Werte von Messgrößen und der Ausgangsgröße von vorangehenden Rechenzyklen umfasst; 3 a function model structure with an input variable vector u comprising values of measured variables and an output variable y of a current computing cycle and values of measured variables and the output of preceding computing cycles;

4 ein Flussdiagramm zur Veranschaulichung des Verfahrens zum Erstellen eines datenbasierten Funktionsmodells; 4 a flowchart for illustrating the method for creating a data-based function model;

5 eine schematische Darstellung der grundlegenden Struktur der Ermittlung eines Funktionswerts für ein datenbasiertes Funktionsmodell, das aus lokalen Teilmodellen erstellt ist; 5 a schematic representation of the basic structure of determining a function value for a data-based function model, which is created from local sub-models;

6 ein Diagramm eines zweidimensionalen Eingangsgrößenraums mit Trainingsdatenpunkten in einer ersten Untermenge und einer zweiten Untermenge sowie hinzuzufügenden neuen Trainingsdatenpunkten; 6 a diagram of a two-dimensional input size space with training data points in a first subset and a second subset and to be added new training data points;

7 ein Flussdiagramm zur Veranschaulichung eines Verfahrens zum Hinzufügen eines neuen Trainingsdatenpunkts zu einem bestehenden datenbasierten Funktionsmodell; 7 a flowchart for illustrating a method for adding a new training data point to an existing data-based function model;

8 ein Diagramm zur Veranschaulichung des Hinzufügens der neuen Trainingsdaten zu den Untermengen; und 8th a diagram illustrating the addition of the new training data to the subsets; and

9 ein Flussdiagramm zur Veranschaulichung eines Verfahrens zum Hinzufügen eines neuen Trainingsdatenpunkts zu einem bestehenden datenbasierten Funktionsmodell. 9 a flowchart for illustrating a method for adding a new training data point to an existing data-based function model.

Beschreibung von AusführungsformenDescription of embodiments

1 zeigt eine schematische Darstellung eines Prüfstands 1 zum Testen eines physikalischen Systems 2, wie beispielsweise eines Verbrennungsmotors. Der Prüfstand 1 umfasst weiterhin eine Teststeuereinheit 3, die Messgrößen u (Eingangssignale) für den Verbrennungsmotor 2 bereitstellt. Entsprechend werden Ausgangsgrößen des Verbrennungsmotors 2 gemessen, die den Messgrößen zugeordnet und als Testdaten in einem Testdatenspeicher 4 gespeichert werden. Die so erhaltenen Testdaten stehen anschließend zur Auswertung des Verhaltens des Verbrennungsmotors 2 bzw. zur Erstellung von den Verbrennungsmotor 2 beschreibenden Modellen zur Verfügung. 1 shows a schematic representation of a test bench 1 for testing a physical system 2 , such as an internal combustion engine. The test bench 1 further comprises a test control unit 3 , the measured quantities u (input signals) for the internal combustion engine 2 provides. Corresponding output variables of the internal combustion engine 2 measured, which are assigned to the measured quantities and as test data in a test data memory 4 get saved. The test data thus obtained are then used to evaluate the behavior of the internal combustion engine 2 or for the creation of the internal combustion engine 2 descriptive models available.

Mögliche Messgrößen, die durch die Teststeuereinheit 3 generiert werden, stellen für einen Verbrennungsmotor 2 beispielsweise Stellgrößen für einen Drosselklappensteller, die Drehzahl des Verbrennungsmotors 2, die Last des Verbrennungsmotors 2, eine Stellgröße für einen Nockenwellenphasensteller sowie eine Stellgröße zum Einstellen eines Zündzeitpunkts, einer Einspritzmenge und dergleichen dar. Ausgangsgrößen des Verbrennungsmotors 2 können ein Lambdawert, ein Abgasgegendruck, eine Motortemperatur, eine Drehzahl, ein Motormoment und dergleichen sein.Possible measured variables generated by the test control unit 3 are generated, represent an internal combustion engine 2 For example, manipulated variables for a throttle valve actuator, the speed of the internal combustion engine 2 , the load of the internal combustion engine 2 , a manipulated variable for a camshaft phaser and a manipulated variable for setting an ignition timing, an injection quantity and the like. Output variables of the internal combustion engine 2 may be a lambda value, an exhaust backpressure, an engine temperature, a rotational speed, an engine torque, and the like.

Um das dynamische Verhalten des Verbrennungsmotors 2 vollumfänglich zu testen, ist es nicht ausreichend, die Messgrößen u in ihren Wertebereichen zu variieren, in dem Verbrennungsmotor 2 anzuwenden und das entsprechend resultierende Verhalten des Verbrennungsmotors 2 in Form der sich einstellenden Ausgangsgrößen zu ermitteln. Vielmehr ist es wünschenswert, auch das dynamische Verhalten des Verbrennungsmotors 2 in entsprechender Weise in einem zu erstellenden datenbasierten Funktionsmodell abzubilden. Daher können die Werte der angelegten Messgrößen u, die den Eingangsgrößenraum definieren, mit unterschiedlichem dynamischem Verhalten bzw. unterschiedlichen zeitlichen Gradienten variiert werden, um die entsprechende dynamische Antwort des Verbrennungsmotors 2 in Form der entsprechenden Ausgangsgrößen zu erhalten.To the dynamic behavior of the internal combustion engine 2 to fully test, it is not sufficient to vary the measurands u in their value ranges, in the internal combustion engine 2 apply and the corresponding resulting behavior of the internal combustion engine 2 in the form of self-adjusting output variables. Rather, it is desirable, even the dynamic behavior of the internal combustion engine 2 in a corresponding manner in a data-based function model to be created. Therefore, the values of the applied measured quantities u, which define the input quantity space, can be varied with different dynamic behavior or different temporal gradient, respectively, to the corresponding dynamic response of the internal combustion engine 2 in the form of the corresponding output quantities.

Die durch die Kombination der Werte der Messgrößen u definierten Messpunkte X in einem Eingangsgrößenraum sollen möglichst sowohl raumfüllend als auch dynamikfüllend sein. Raumfüllend sind die so generierten Messpunkte X, wenn der Abstand jedes durch die Kombination der diskretisierten Signalwerte der Eingangssignale bestimmten Messpunkts X von mindestens einem weiteren der so bestimmten Messpunkte X kleiner als ein vorgegebener Abstandswert ist. Analog sind die so generierten Messpunkte X dynamikfüllend, wenn der Abstand jedes durch die Kombination der diskretisierten Signalwerte der Eingangssignale und deren zeitlichen Gradienten bestimmten Messpunkts X von mindestens einem weiteren der so bestimmten Messpunkte X kleiner als ein vorgegebener Dynamik-Abstandswert ist.The measuring points X defined by the combination of the values of the measured quantities u in an input variable space should as far as possible be both space-filling and dynamic-filling. The measuring points X thus generated are space-filling if the distance of each measuring point X determined by the combination of the discretized signal values of the input signals from at least one further of the measuring points X thus determined is smaller than a predetermined distance value. Analogously, the measuring points X generated in this way are dynamic filling when the distance of each measuring point X determined by the combination of the discretized signal values of the input signals and their temporal gradients from at least one further of the measuring points X thus determined is smaller than a predetermined dynamic distance value.

Die mit dem obigen Prüfstand 1 bestimmten Messpunkte X dienen nun der Erstellung eines datenbasierten Funktionsmodells, das auch die dynamischen Eigenschaften des physikalischen Systems 2 abbildet.The with the above test bench 1 certain measurement points X are now used to create a data-based function model, which also maps the dynamic properties of the physical system 2.

Die Verwendung von nicht parametrischen, datenbasierten Funktionsmodellen basiert auf einem Bayes-Regressionsverfahren. Die Grundlagen der Bayes-Regression sind beispielsweise in C. E. Rasmussen et al., „Gaussian Processes for Machine Learning”, MIT Press 2006 , beschrieben. Bei der Bayes-Regression handelt es sich um ein datenbasiertes Verfahren, das auf einem Modell basiert. Zur Erstellung des Modells sind Messpunkte von Trainingsdaten sowie zugehörige Ausgangsdaten einer zu modellierenden Ausgangsgröße erforderlich. Die Erstellung des Modells erfolgt anhand der Verwendung von Stützstellendaten, die den Trainingsdaten ganz oder teilweise entsprechen oder aus diesen generiert werden. Weiterhin werden abstrakte Hyperparameter bestimmt, die den Raum der Modellfunktionen parametrieren und effektiv den Einfluss der einzelnen Messpunkte der Trainingsdaten auf die spätere Modellvorhersage gewichten.The use of non-parametric, data-based function models is based on a Bayes regression method. The basics of Bayesian regression are, for example, in CE Rasmussen et al., Gaussian Processes for Machine Learning, MIT Press 2006 , described. Bayesian regression is a data-based method based on a model. To create the model, measurement points of training data and associated output data of an output variable to be modeled are required. The modeling is done by using support point data that contains the Training data completely or partially correspond or be generated from these. Furthermore, abstract hyperparameters are determined which parameterize the space of the model functions and effectively weight the influence of the individual measurement points of the training data on the later model prediction.

Die abstrakten Hyperparameter werden durch ein Optimierungsverfahren bestimmt. Eine Möglichkeit für ein solches Optimierungsverfahren besteht in einer Optimierung einer Marginal Likelihood p(Y|H, X). Die Marginal Likelihood p(Y|H, X) beschreibt die Plausibilität der gemessenen y-Werte der Trainingsdaten, dargestellt als Vektor Y, gegeben die Modellparameter H und die x-Werte (Werte der Eingangsgrößen) der Trainingsdaten. Im Modelltraining wird p(Y|H, X) maximiert, indem geeignete Hyperparameter gesucht werden, die zu einem Verlauf der durch die Hyperparameter und die Trainingsdaten bestimmten Modellfunktion führen und die Trainingsdaten möglichst genau abbilden. Zur Vereinfachung der Berechnung wird der Logarithmus von p(Y|H, X) maximiert, da der Logarithmus die Stetigkeit der Plausibilitätsfunktion nicht verändert.The abstract hyperparameters are determined by an optimization method. One possibility for such an optimization method is an optimization of a marginal likelihood p (Y | H, X). The marginal likelihood p (Y | H, X) describes the plausibility of the measured y-values of the training data, represented as vector Y, given the model parameters H and the x-values (input values) of the training data. In model training, p (Y | H, X) is maximized by searching for suitable hyperparameters that result in a course of the model function determined by the hyperparameters and the training data and map the training data as accurately as possible. To simplify the calculation, the logarithm of p (Y | H, X) is maximized because the logarithm does not change the continuity of the plausibility function.

Die herkömmliche Modellierung von nicht parametrischen, datenbasierten Funktionsmodellen führt nicht immer zu ausreichend genauen Modellen. Ein Grund hierfür besteht darin, dass sich aus der großen Anzahl von dynamisch erfassten Messdaten 100.000 oder mehr Trainingsdatenpunkte ergeben, so dass die ursprünglichen Daten reduziert und/oder spärliche Gauß-Prozesse angewendet werden müssen. Somit können die resultierenden datenbasierten Funktionsmodelle hinsichtlich ihrer Genauigkeit aufgrund des Informationsverlusts ungenau sein.Conventional modeling of non-parametric, data-based function models does not always yield sufficiently accurate models. One reason for this is that the large number of dynamically acquired measurement data results in 100,000 or more training data points, so the original data must be reduced and / or sparse Gaussian processes applied. Thus, the resulting data-based functional models may be inaccurate in accuracy due to information loss.

Weiterhin ist es in der Regel nicht möglich, die Eingangsgrößen x zum Abbilden aller dynamischen Effekte des physikalischen Systems 2 zu variieren, so dass ein resultierendes datenbasiertes Funktionsmodell häufig nicht alle Effekte, die in dem physikalischen System 2 auftreten können, exakt abbilden kann. Herkömmliche Modellierungen für ein datenbasiertes Funktionsmodell, wie zum Beispiel das Gauß-Prozess-Modell, ergeben eine globale dynamische Repräsentation der Trainingsdaten, bei der lokale dynamische Effekte vernachlässigt werden können.Furthermore, it is generally not possible to vary the input quantities x for mapping all the dynamic effects of the physical system 2, so that a resulting data-based function model often can not accurately map all the effects that can occur in the physical system 2. Traditional modeling for a data-based function model, such as the Gaussian process model, provides a global dynamic representation of the training data that can neglect local dynamic effects.

Weiterhin ist es bei herkömmlichen datenbasierten Funktionsmodellen schwierig, bereits erstellte Modelle bei Vorhandensein von neuen Trainingsdaten zu adaptieren. Bislang ist es notwendig, das gesamte datenbasierte Funktionsmodell erneut zu lernen, wenn zusätzliche Trainingsdaten hinzugefügt werden sollen. Dies ist jedoch besonders schwierig, wenn die Gesamtmenge von Trainingsdaten den Rechenaufwand zur Erstellung des datenbasierten Funktionsmodells erheblich vergrößert.Furthermore, with traditional data-based function models, it is difficult to adapt already created models in the presence of new training data. So far, it is necessary to re-learn the entire data-based function model if additional training data is to be added. However, this is particularly difficult if the total amount of training data significantly increases the computational effort to create the data-based functional model.

2 zeigt schematisch ein zu modellierendes dynamisches System mit einem Messgrößenvektor u und einer Ausgangsgröße y. Die Elemente des Messgrößenvektors u sind u₁(k), u₂(k), ..., u_n(k). Eine sich aus einer Vermessung des dynamischen Systems ergebende Trainingsdatenmenge kann wie folgt angegeben werden:

2 schematically shows a dynamic system to be modeled with a measured variable vector u and an output variable y. The elements of the measurand vector u are u ₁ (k), u ₂ (k), ..., u _n (k). A training data set resulting from a measurement of the dynamic system can be given as follows:

Zum Modellieren von dynamischen Prozessen kann eine Eingangsgrößenstruktur verwendet werden, die auch als ”Nonlinear Autoregression With Exogenous Inputs (NARX)” bekannt ist. Eine typische NARX-Struktur ist in 3 gezeigt, wobei für mindestens eine der Messgrößen u historische Werte, d. h. Werte von vorangegangenen Rechenzyklen sowie historische Werte der Ausgangsgröße y, als Eingangsgrößen x in das zu erstellende datenbasierte Funktionsmodell eingehen. Mathematisch wird diese NARX-Struktur wie folgt beschrieben: y(k) = f_NL((u₁(k), ..., u₁(k – τ₁), ..., u_n(k), ..., u_n(k – τ_n), y(k – 1), ..., y(k – τ_y)) = f_NL((x(k)) wobei u den Messgrößenvektor, d die Eingangsdimension, k den Abfragezeitpunkt und x und y die Eingangsgrößen und Ausgangsgrößen zum angegebenen Zeitpunkt bezeichnen. f_NL entspricht einer nichtlinearen Funktion, die mithilfe des datenbasierten Funktionsmodells, wie zum Beispiel der oben beschriebenen Gauß-Prozess-Regression, beschrieben werden kann.To model dynamic processes, an input structure known as Nonlinear Autoregression With Exogenous Inputs (NARX) may be used. A typical NARX structure is in 3 shown, for at least one of the measured variables u historical values, ie values of previous calculation cycles and historical values of the output variable y, as input variables x in the data-based function model to be created. Mathematically, this NARX structure is described as follows: y (k) = f _NL ((u ₁ (k), ..., u ₁ (k - τ ₁ ), ..., u _n (k), ..., u _n (k - τ _n ) , y (k - 1), ..., y (k - τ _y)) = f _NL ((x (k)) where u is the measured variable vector, d is the input dimension, k is the sampling time, and x and y are the input quantities and output quantities at the specified time. f _NL corresponds to a nonlinear function that can be described using the data-based function model, such as the Gaussian process regression described above.

Ein Gauß-Prozess entspricht einer multivariaten Gauß-Funktion mit zufälligen variablen Punkten. Ein Gauß-Prozess wird vollständig durch die Mittelwertfunktion μ(x) und die Kovarianz k(X, X)-Funktion beschrieben. y ~ N(μ(X), k(X, X)) A Gaussian process corresponds to a multivariate Gaussian with random variable points. A Gaussian process is fully described by the mean value function μ (x) and the covariance k (X, X) function. y ~N (μ (X), k (X, X))

Die Kovarianzfunktion kann mit einem positiv definiten Kern, zum Beispiel einem quadratischen Exponentialkern, beschrieben werden: k(x_i, x_i) = σ 2 / fexp( 1 / 2Σ D / d=1(x_i – x_i)²/l_d ²), wobei σ_f und I_d den Hyperparametern entsprechen. Diese Parameter können mithilfe von Trainingsdaten optimiert werden. Damit kann der Gauß-Prozess geschrieben werden als

wobei σ_n ² dem Ausgangsrauschen und f_* dem zu schätzenden Ausgangswert entsprechen. Für den obige Gauß-Prozess entspricht die posterior Verteilung für X_* ebenfalls einem Gauß-Prozess und kann angegeben werden als:

f_*|X, Y, X_* ~ N(f(X_*), cov(X_*)),

mit einer Mittelwertfunktion, die bestimmt ist als:

f_* ≙ E[f_*|X, Y, X_*] = K(X_*, X)[K(X, X) + σ_n ²I]^–1Y

und der entsprechenden prädiktiven Varianz

cov(X_*) = K(X_*, X_*) – K(X_*, X)[K(X, X) + σ_n ²I]^–1K(X, X_*).

The covariance function can be described with a positive definite nucleus, for example a quadratic exponential nucleus:

k (x _i, x _i) = σ 2 / Fexp (1 / 2Σ D / d = 1 (x _i - x _i) ² / l _d ^2),

where σ _f and I _d correspond to the hyperparameters. These parameters can be optimized using training data. Thus the Gaussian process can be written as

where σ _n ² corresponds to the output noise and f _{* to} the initial value to be estimated. For the above Gaussian process, the posterior distribution for X _* also corresponds to a Gaussian process and can be given as:

f _* | X, Y, X _* N (f (X _* ), cov (X _* )),

with an average function determined as:

f _* ≙ E [f _* | X, Y, X _* ] = K (X _* , X) [K (X, X) + σ _n ² I] ^-1 Y

and the corresponding predictive variance

cov (X _*) = K (X _*, X _*) - K (X _* X) [K (X, X) + σ _n ² I] ^-1 K (X, X _*).

Die prädiktive Varianz kann als ein Maß für die Modellunsicherheit interpretiert werden. Je größer der Wert der Varianz ist, desto unsicherer ist das datenbasierte Funktionsmodell bezüglich der Vorhersage des Abfragepunkts. Diese Varianz kann für die Prädiktion in dem nachfolgend beschriebenen Verfahren zum Erstellen eines datenbasierten Funktionsmodells verwendet werden. Das beschriebene Verfahren basiert auf dem separaten Erstellen von lokalen Teilmodellen 51, d. h. lokalen datenbasierten Funktionsmodellen. Die Teilmodelle 51 werden mit Untermengen der ursprünglichen Messdaten als Trainingsdaten erstellt. Die Untermengen der Messdaten entsprechen kleineren lokalen Unterräumen der Messdaten.The predictive variance can be interpreted as a measure of model uncertainty. The larger the value of the variance, the more uncertain the data-based function model is in terms of prediction of the query point. This variance can be used for the prediction in the method described below for creating a data-based function model. The method described is based on the separate creation of local submodels 51 ie local data-based function models. The submodels 51 are created with subsets of the original measurement data as training data. The subsets of the measurement data correspond to smaller local subspaces of the measurement data.

In 4 ist ein Flussdiagramm zur Veranschaulichung eines Verfahrens zur Erstellung eines datenbasierten Funktionsmodells aus Teilmodellen dargestellt.In 4 FIG. 4 is a flow chart illustrating a method of creating a data-based function model from sub-models.

In Schritt S1 wird für jeden Rechenzyklus ein Eingangsgrößenvektor x bereitgestellt, der die Messgrößenwerte der Messgrößen u der durch das Prüfsystem 1 bereitgestellten Werte sowie von mindestens einer der Messgrößen u einen oder mehrere historische bzw. vergangene Messgrößenwerte umfasst. Die historischen Messwerte betreffen Werte einer Messgröße u aus vorangehenden Rechenzyklen. Weiterhin kann der Eingangsgrößenvektor x auch historische Werte der betrachteten Ausgangsgröße y umfassen. Der so bestimmte Eingangsgrößenvektor x und der für den betreffenden Messzeitpunkt erhaltene Wert der Ausgangsgröße y für eine Vielzahl von Messzeitpunkten stellen die Trainingsdaten für das zu erstellende Gauß-Prozess-Modell dar.In step S1, an input variable vector x is provided for each computing cycle, which measures the measured values of the measured quantities u u by the test system 1 and at least one of the measured quantities u comprises one or more historical or past measured variable values. The historical measured values relate to values of a measured variable u from previous calculation cycles. Furthermore, the input variable vector x may also include historical values of the considered output variable y. The input variable vector x determined in this way and the value of the output variable y obtained for the relevant measurement time for a plurality of measurement times represent the training data for the Gauss process model to be created.

In Schritt S2 werden die Trainingsdaten in Cluster unterteilt bzw. klassifiziert. Das Aufteilen der Trainingsdaten kann entweder manuell durch Expertenwissen oder mit herkömmlichen automatisierten Clustering-Verfahren durchgeführt werden, um Untermengen der Trainingsdaten zu erhalten. Die Aufteilung durch Expertenwissen kann manuell basierend auf physikalischen Erkenntnissen erfolgen. Zum Beispiel kann ein Experte bei Vorhandensein von verschiedenen Systemmodi die bestimmten Betriebsmodi anhand von Wertebereichen einer, mehrerer oder aller Eingangsgrößen x identifizieren und somit die bereitgestellten Trainingsdaten in geeignete Untermengen unterteilen. In vielen Anwendungsfällen kann der Experte die Trainingsdatenpunkte manuell selektieren und dadurch das zu erlernende Funktionsmodell verbessern.In step S2, the training data is divided into clusters. The splitting of the training data may be performed either manually by expert knowledge or by conventional automated clustering techniques to obtain subsets of the training data. The division by expert knowledge can be done manually based on physical findings. For example, in the presence of various system modes, an expert may identify the particular modes of operation based on ranges of values of one, more, or all inputs x, thus subdividing the provided training data into appropriate subsets. In many applications, the expert can manually select the training data points and thereby improve the functional model to be learned.

Liegt kein Expertenwissen vor, so können herkömmliche automatische Clustering- oder Klassifizierungsverfahren verwendet werden, wie zum Beispiel ein k-Means-Algorithmus, ein Mean-Shift-Algorithmus und dergleichen. Dabei müssen geeignete Metriken für die Klassifizierung der Trainingsdaten untersucht und ausgewählt werden.If no expert knowledge is available, conventional automatic clustering or classification methods can be used, such as a k-means algorithm, a mean-shift algorithm and the like. In doing so, suitable metrics for the classification of the training data must be examined and selected.

In Schritt S3 wird nun für jede der erhaltenen Untermengen von Trainingsdaten in oben beschriebener Weise ein lokales Teilmodell 51 erstellt. Jedes datenbasierte Funktionsmodell wird durch seine Stützstellen, die der Untermenge der Trainingsdaten entsprechen kann, den Vektor Q_y und die Hyperparameter bestimmt. In step S3, a local submodel is now determined for each of the subset of training data obtained in the manner described above 51 created. Each data-based function model is determined by its nodes, which can correspond to the subset of training data, the vector Q _y and the hyperparameters.

In Schritt S4 werden für alle lokalen Teilmodelle 51 zusätzlich die prädiktiven Varianzen cov(X_*) ermittelt. Der Trainingsprozess für die lokalen Teilmodelle 51 kann einem herkömmlich Prozess zur Erstellung eines Gauß-Prozess-Modells entsprechen. Auch andere an sich bekannte Trainingsprozesse können verwendet werden. Hier werden die lokalen Teilmodelle 51 unabhängig voneinander gelernt, d. h. die Hyperparameter der lokalen Teilmodelle 51 werden mithilfe der zuvor ermittelten Untermengen der Trainingsdaten getrennt voneinander optimiert.In step S4, for all local submodels 51 additionally determines the predictive variances cov (X _* ). The training process for the local submodels 51 may correspond to a conventional process for creating a Gaussian process model. Other training processes known per se can also be used. Here are the local submodels 51 learned independently, ie the hyperparameters of the local submodels 51 are optimized separately from each other using previously determined subsets of training data.

In Schritt S5 wird nun ein gewichteter Durchschnitt der lokalen Teilmodelle 51 ermittelt, wobei die Gewichtung jedes der lokalen Teilmodelle 51 von dessen prädiktiver Varianz cov(X_*) abhängt.In step S5, a weighted average of the local submodels now becomes 51 determines the weighting of each of the local submodels 51 depends on its predictive variance cov (X _* ).

Nach der Gewichtung der einzelnen Teilmodelle werden diese in Schritt S6 aufsummiert, um das datenbasierte Funktionsmodell zu erhalten. Bei einer Anzahl m von lokalen Teilmodellen 51 kann ein gewichteter Durchschnitt y von einem Abfragepunkt x_* berechnet werden, zum Beispiel:

After the weighting of the individual partial models, these are summed up in step S6 in order to obtain the data-based functional model. For a number m of local submodels 51 For example, a weighted average y can be calculated from a query point x _* , for example:

Hier wird die prädiktive Varianz covⁱ(x_*) zum Gewichten der entsprechenden mittleren Prädiktionswerte f ⁱ(x_*) für jedes lokale Teilmodell (i: Index des lokalen Teilmodells) verwendet.Here, the predictive variance cov ⁱ (x _* ) becomes the weighting of the corresponding mean prediction values f ⁱ (x _* ) for each local submodel (i: index of the local submodel) used.

Die grundlegende Struktur der gewichteten Prädiktion ist in 5 dargestellt. Man erkennt, dass die Funktionswerte f ¹(x_*), f ²(x_*), ..., f ⁿ(x_*) von m lokalen Teilmodellen 51 mit der entsprechenden prädiktiven Varianz cov¹(x_*), cov²(x_*), ..., covⁿ(x_*) für einen Abfragepunkt x_* miteinander in jeweiligen Multiplikationsblöcken 52 multipliziert und anschließend in einem Additionsblock 53 addiert werden. Die so erhaltene Summe wird in einem Dividierglied 54 durch die in einem Summierglied 55 erhaltenen Summe der prädiktiven Varianzen cov¹(x_*), cov²(x_*), ..., covⁿ(x_*) aller lokalen Teilmodelle 51 dividiert, um den gewichteten Durchschnitt ŷ für alle lokalen Teilmodelle 51 zu erhalten.The basic structure of weighted prediction is in 5 shown. One recognizes that the functional values f ¹ (x _* ), f ² (x _* ), ..., f ⁿ (x _* ) from m local submodels 51 with the corresponding predictive variance cov ¹ (x _* ), cov ² (x _* ), ..., cov ⁿ (x _* ) for a query point x _* together in respective multiplication blocks 52 multiplied and then in an addition block 53 be added. The sum thus obtained is in a divider 54 through the in a summator 55 obtained sum of the predictive variances cov ¹ (x _* ), cov ² (x _* ), ..., cov ⁿ (x _* ) of all local submodels 51 divided by the weighted average ŷ for all local submodels 51 to obtain.

Die entsprechende Varianz der lokalen Teilmodelle 51 kann bestimmt sein als

The corresponding variance of the local submodels 51 can be determined as

Mithilfe dieser Gleichungen kann die mittlere Prädiktion und Varianz der lokalen Teilmodelle 51 für einen Abfragepunkt x_* bestimmt werden.Using these equations, the mean prediction and variance of the local submodels 51 for a query point x _* .

Durch das obige Verfahren kann die Genauigkeit von Berechnungen von Funktionswerten mithilfe von datenbasierten Funktionsmodellen deutlich verbessert werden. Dies liegt insbesondere daran, dass durch die Kombination der dynamischen Betrachtung der Messgrößen u und/oder der Ausgangsgröße y in Verbindung mit der Unterteilung des so entstehenden Eingangsgrößenraums die komplexe Hülle des Eingangsgrößenraums eines dynamischen Systems besser in dem resultierenden datenbasierten Funktionsmodell abgebildet werden kann als dies bei einer bloßen stationären Betrachtung der Fall wäre.The above method can significantly improve the accuracy of function value calculations using data-based function models. This is due in particular to the fact that the complex envelope of the input variable space of a dynamic system can be mapped better in the resulting data-based functional model than in the case of the combination of the dynamic consideration of the measured quantities u and / or the output variable y in conjunction with the subdivision of the resulting input variable space a mere stationary consideration would be the case.

Bezüglich des obigen Schritts der Klassifizierung können die Metriken für die Klassifizierung der Trainingsdaten auch verwendet werden, um nachträglich neue Trainingsdaten zu einem lokalen Teilmodell 51 hinzuzufügen. Sind zum Beispiel die neuen Trainingsdaten ähnlich den bestehenden Trainingsdaten, so können diese Trainingsdaten zu einer Untermenge oder zu Teilen mehrerer bereits definierter Untermengen hinzugefügt werden. Zur Adaption des datenbasierten Funktionsmodells müssen dann lediglich die eine oder die mehreren veränderten Untermengen neu trainiert werden und es muss, wie oben beschrieben, die gewichtete Summe über alle Teilmodelle 51, d. h. die bestehenden Teilmodelle 51 basierend auf den unveränderten Untermengen und die veränderten Teilmodelle 51 basierend auf den veränderten Untermengen, bestimmt werden.With regard to the above classification step, the metrics for classifying the training data may also be used to retrofit new training data to a local submodel 51 add. For example, if the new training data is similar to the existing training data, then that training data may be added to a subset or portions of several already defined subsets. In order to adapt the data-based function model, only the one or more changed subsets need to be re-trained and, as described above, the weighted sum over all submodels has to be trained 51 ie the existing submodels 51 based on the unchanged subsets and the changed submodels 51 based on the changed subsets.

Kommen nach dem Erstellen eines datenbasierten Funktionsmodells basierend auf ursprünglichen Trainingsdaten neue Trainingsdatenpunkte NTP hinzu, so können diese auch den bestehenden Untermengen der ursprünglichen Trainingsdaten hinzugefügt werden. 6 zeigt für einen zweidimensionalen Eingangsgrößenraum die Möglichkeit, neue Trainingsdatenpunkte NTP einer ersten Untermenge GPR1 oder einer zweiten Untermenge GPR2 hinzuzufügen. Herkömmliche, auf dem Abstand zu dem Mittelpunkt der jeweiligen Untermenge basierende Verfahren können u. U. jedoch nicht zur optimalen Lösung führen. If new training data points NTP are added after the creation of a data-based function model based on original training data, these can also be added to the existing subsets of the original training data. 6 For a two-dimensional input size space, it is possible to add new training data points NTP to a first subset GPR1 or to a second subset GPR2. Conventional methods based on the distance to the midpoint of the respective subset may include: U. but not lead to the optimal solution.

In 7 ist ein Flussdiagramm zur Veranschaulichung eines Verfahrens zum Hinzufügen eines neuen Trainingsdatenpunkts NTP beschrieben.In 7 FIG. 3 is a flowchart for illustrating a method for adding a new training data point NTP.

In Schritt S11 wird der betreffende neue Trainingsdatenpunkt NTP zu jeder der zuvor ermittelten Untermengen der ursprünglichen Trainingsdatenpunkte hinzugefügt und die jeweilige prädiktive Varianz ermittelt. Der neue Trainingsdatenpunkt NTP wird in Schritt S12 derjenigen Untermenge hinzugefügt, zu der die geringste Varianz ermittelt worden ist. Da die prädiktive Varianz Teil des Ansatzes der lokalen Teilmodelle 51 ist, wird diese berechnet und kann direkt verwendet werden, um das Auswahlproblem zu lösen. Anschließend werden, wie oben beschrieben, in Schritt S13 die bezüglich der übrigen Untermengen und der Untermenge mit dem hinzugefügten neuen Trainingsdatenpunkt NTP ermittelten Teilmodelle 51 gewichtet und addiert, um das datenbasierte Funktionsmodell zu erhalten.In step S11, the respective new training data point NTP is added to each of the previously determined subsets of the original training data points and the respective predictive variance is determined. The new training data point NTP is added in step S12 to the subset at which the least variance has been found. Since the predictive variance is part of the approach of the local submodels 51 is calculated, and can be used directly to solve the selection problem. Subsequently, as described above, in step S13, the partial models determined with respect to the remaining subsets and the subset with the new training data point NTP added are obtained 51 weighted and added to obtain the data-based function model.

In dem Diagramm der 8 erkennt man, dass die außerhalb der Hüllen der Untermengen GPR1, GPR2 liegenden neuen Trainingsdatenpunkte NTP durch das obige Verfahren denjenigen Untermengen GPR1, GPR2 hinzugefügt werden, deren Hülle dem jeweiligen neuen Trainingsdatenpunkt NTP am nächsten liegt.In the diagram of 8th it can be seen that the new training data points NTP lying outside the shells of the subgroups GPR1, GPR2 are added by the above method to those subgroups GPR1, GPR2 whose sheath is closest to the respective new training data point NTP.

Um bei der Klassifizierung der Trainingsdaten ein Messrauschen zu berücksichtigen, das zu Abweichungen des Werts der Ausgangsgröße y an einer Position im Eingangsgrößenraum führt, kann das Aufteilen der Trainingsdaten in Untermengen mithilfe der euklidischen Distanz der Trainingsdatenpunkte im Eingangsgrößenraum vorgenommen werden. Das heißt, Trainingsdatenpunkte werden mithilfe ihrer euklidischen Distanz zueinander in Untermengen aufgeteilt, so dass die Unterschiede bezüglich ihrer Varianzen zueinander erfasst werden können.In order to take into account a measurement noise in the classification of the training data, which leads to deviations of the value of the output variable y at a position in the input size space, the division of the training data into subsets can be performed using the Euclidean distance of the training data points in the input size space. That is, training data points are subdivided into subsets using their Euclidean distance so that differences in their variances can be detected.

Das Aufteilen der Trainingsdaten unter Berücksichtigung der euklidischen Distanz kann dazu dienen, die Varianz im Eingangsgrößenraum zu optimieren. Die Aufteilung in Untermengen kann beispielsweise durch einen Schwellwertvergleich der euklidischen Distanz mit einem vorgegebenen Schwellenwert vorgenommen werden. Das heißt Trainingsdatenpunkte, deren euklidische Distanzen zueinander kleiner als der vorgegebene Schwellenwert sind, werden einer Untermenge der Trainingsdatenpunkte zugeordnet.The splitting of the training data taking into account the Euclidean distance can serve to optimize the variance in the input size space. The division into subsets can be carried out, for example, by a threshold value comparison of the Euclidean distance with a predetermined threshold value. That is, training data points whose Euclidean distances are smaller than the predetermined threshold are assigned to a subset of the training data points.

Eine weitere Variante bei der Klassifizierung besteht darin, dass nur diejenigen Untermengen der Trainingsdaten berücksichtigt werden, deren Varianzen jeweils unter einem vorgegebenen Varianzschwellenwert liegen. Für die so ausgewählten Untermengen werden lokale Funktionsmodelle 51 erstellt. Diese lokalen Teilmodelle 51 können gemäß ihren prädiktiven Varianzen gewichtet und addiert werden.Another variant in the classification is that only those subsets of the training data are taken into account whose variances are each below a predetermined variance threshold. The subsets thus selected become local function models 51 created. These local submodels 51 can be weighted and added according to their predictive variances.

Entsprechend dem durch das Flussdiagramm der 9 bestimmten Verfahren kann ein datenbasiertes Funktionsmodell, das auf herkömmliche Weise oder entsprechend einem der obigen Verfahren ermittelt wird, weiterhin verbessert werden. Dazu werden in Schritt S21 Trainingsdatenpunkte in Funktionsbereichen des erstellten datenbasierten Funktionsmodells bestimmt, deren Varianzen über einem vorgegebenen Varianzschwellenwert liegen.According to the flowchart of FIG 9 According to certain methods, a data-based functional model, which is determined in a conventional manner or according to one of the above methods, can be further improved. For this purpose, in step S21 training data points are determined in functional areas of the created data-based function model whose variances are above a predetermined variance threshold value.

In Schritt S22 wird für die Trainingsdatenpunkte, die in den entsprechenden Funktionsbereichen des datenbasierten Funktionsmodells liegen, wie in Verbindung mit dem Flussdiagramm der 4 beschrieben, ein weiteres datenbasiertes Funktionsmodell ermittelt. Das ursprüngliche datenbasierte Funktionsmodell und das weitere datenbasierte Funktionsmodell werden in Schritt S23 dann in geeigneter Weise kombiniert. Das Kombinieren kann, wie in Verbindung mit dem Schritt S5 beschrieben, durch Bilden eines gewichteten Durchschnitts, insbesondere mit einer von der Varianz abhängigen Gewichtung, erfolgen.In step S22, for the training data points residing in the respective functional areas of the data-based functional model, as described in connection with the flowchart of FIG 4 described another data-based functional model determined. The original data-based function model and the further data-based function model are then suitably combined in step S23. The combining may occur as described in connection with step S5 by forming a weighted average, in particular with a variance-dependent weighting.

Das Erstellen des Weiteren datenbasierten Funktionsmodells kann mit geringem Aufwand erfolgen, je nachdem bei welchem Wert der Varianzschwellenwert festgelegt wird. Insbesondere kann der Varianzschwellenwert so festgelegt werden, dass zwischen 10 und 50%, vorzugsweise zwischen 10 und 20% der Trainingsdatenpunkte der Erstellung des weiteren datenbasierten Funktionsmodells zugrunde gelegt werdenThe creation of the additional data-based function model can be done with little effort, depending on which value the variance threshold is set. In particular, the variance threshold can be set such that between 10 and 50%, preferably between 10 and 20%, of the training data points are used to create the further data-based function model

Durch Trainingsdatenpunkte, die auch in dem Eingangsgrößenraum des ursprünglichen datenbasierten Funktionsmodells liegen, erhält man eine Überlappung zwischen den beiden datenbasierten Funktionsmodellen bei dem ursprünglich bereitgestellten datenbasierten Funktionsmodell und dem lokalen Teilmodell 51. Diese Überlappung stört jedoch nicht die Vorhersage, da die Modellausgangsgröße y des globalen Modells mit der Varianz gewichtet wird. Training data points, which are also located in the input space of the original data-based function model, provide an overlap between the two data-based function models in the originally provided data-based function model and the local submodel 51 , However, this overlap does not disturb the prediction because the model output y of the global model is weighted with the variance.

ZITATE ENTHALTEN IN DER BESCHREIBUNG QUOTES INCLUDE IN THE DESCRIPTION

Diese Liste der vom Anmelder aufgeführten Dokumente wurde automatisiert erzeugt und ist ausschließlich zur besseren Information des Lesers aufgenommen. Die Liste ist nicht Bestandteil der deutschen Patent- bzw. Gebrauchsmusteranmeldung. Das DPMA übernimmt keinerlei Haftung für etwaige Fehler oder Auslassungen.This list of the documents listed by the applicant has been generated automatically and is included solely for the better information of the reader. The list is not part of the German patent or utility model application. The DPMA assumes no liability for any errors or omissions.

Zitierte PatentliteraturCited patent literature

DE 102010028259 A1 [0005]

Zitierte Nicht-PatentliteraturCited non-patent literature

C. Plagemann, K. Kersting, W. Burgard, "Nonstationary Gaussian Process Regression Using Point Estimates of Local Smoothness", ICML Proceedings, pages 204-2116, 2006 [0006]
CE Rasmussen et al., Gaussian Processes for Machine Learning, MIT Press 2006 [0045]

Claims

Method for creating a data-based function model, in particular a Gaussian process model, comprising the following steps: - providing (S1) training data; - dividing (S2) the training data into subsets of training data; - Create (S3) a data-based local submodel ( 51 ) based on each of the subsets of the training data; and - combining (S5) the local submodels ( 51 ) to obtain the data-based function model.

The method of claim 1, wherein the training data comprises training data points, each training data point having a value of at least one measurand (u) of a current computation cycle, a value of at least one output (y) of a current computation cycle, at least one value of one of the measurands (u) from a previous one Calculation cycle and / or a value of the output variable (y) from a previous calculation cycle includes as input values

The method of claim 1 or 2, wherein the subdivision (S2) of the training data manually or automatically, in particular by means of a k-means algorithm or a mean-shift algorithm is performed.

The method of claim 1 or 2, wherein the subdividing (S2) of the training data is performed by forming subsets of training data points, wherein the training data points of a subset are selected to have a Euclidean distance from each other that is less than a predetermined threshold.

Method according to one of claims 1 to 4, wherein the combining (S5) of the local submodels ( 51 ) by forming a weighted average of the local submodels ( 51 ) is carried out.

Method according to claim 5, wherein the local submodels ( 51 ) are weighted depending on a measure of the model uncertainty, in particular depending on a predictive variance.

Method according to one of claims 1 to 6, wherein in the created data-based function model new training data points (NTP) are taken into account by adding these to one of the subsets of the training data.

Method according to one of claims 1 to 7, wherein in the created data-based function model a new training data point (NTP) is taken into account by adding it to one of the subsets of the original training data, the local submodel ( 51 ) is created from the corresponding subset of the original training data and the corresponding local sub-model ( 51 ) with the remaining local submodels ( 51 ) is combined.

A method for providing a data-based functional model, wherein training data points are selected from one or more functional areas of the data-based functional model and another data-based functional model is determined based on the selected training data points according to a method of any one of claims 1 to 7, wherein the data-based functional model and the further data-based Functional model can be combined.

The method of claim 9, wherein the training data points are selected from the training data if their predictive variance exceeds a predetermined variance threshold.

The method of claim 9 or 10, wherein the data-based function model and the further data-based function model are combined by a weighted average is formed from the function models, wherein the function models are weighted in particular based on the predictive variance.

Device, in particular a computing unit, for creating a data-based function model, in particular a Gaussian process model, wherein the device is designed to: provide training data; - subdivide the training data into subsets of training data; A data-based local sub-model ( 51 ) based on each of the subsets of the training data; and - the local submodels ( 51 ) to obtain the data-based function model.

Computer program adapted to carry out all steps of a method according to one of Claims 1 to 11.

Machine-readable storage medium on which a computer program according to claim 13 is stored.

An electronic control device comprising an electronic storage medium according to claim 14.