CN107256425B - Random weight network generalization capability improvement method and device

Info

Publication number: CN107256425B (granted); other version: CN107256425A (application publication)
Application number: CN201710354539.0A
Authority: CN (China)
Prior art keywords: training, sample, output, random, RWN
Inventors: 何玉林 (He Yulin), 敖威 (Ao Wei)
Original and current assignee: Shenzhen University
Application filed by Shenzhen University; priority and filing date: 2017-05-18
Other languages: Chinese (zh)
Legal status: Active

Classifications

    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06N - COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 - Computing arrangements based on biological models
    • G06N3/02 - Neural networks
    • G06N3/08 - Learning methods
    • G06N3/082 - Learning methods modifying the architecture, e.g. adding, deleting or silencing nodes or connections

Abstract

The invention discloses a method and a device for improving the generalization ability of a random weight network. On the premise of not changing the framework structure of the random weight network, the target sample with the largest uncertainty value in the training sample is mined, a simulation sample approximately distributed with that target sample is generated, and the output layer weights of the random weight network are updated iteratively based on the simulation sample, so that the hidden information of the training sample is actively mined and the generalization ability of the random weight network is improved.

Description

Random weight network generalization capability improvement method and device
Technical Field
The invention relates to the technical field of data mining, in particular to a random weight network generalization capability improvement method and device.
Background
A random weight network (RWN) is a fully connected feedforward neural network that does not rely on an iterative weight update strategy. Unlike traditional training based on error back-propagation, an RWN randomly selects its input layer weights and computes an analytic solution for the output layer weights by solving the pseudo-inverse of the hidden layer output matrix. Because iterative weight adjustment is avoided, the random weight network trains extremely fast, while its convergence is guaranteed theoretically by the universal approximation theorem.
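To make the analytic training step concrete, a minimal sketch follows. It is illustrative only, not the patented implementation: numpy, the function names train_rwn and predict_rwn, the uniform [0, 1] sampling range and the Sigmoid activation (both of which appear in the detailed description below) are assumptions of this sketch.

```python
import numpy as np

_rng = np.random.default_rng(0)  # fixed seed for reproducibility (an assumption)

def train_rwn(X, y, K, rng=_rng):
    """Train a random weight network on inputs X (N x D) and targets y (N,).

    The input layer weights and hidden layer biases are drawn at random and
    never updated; only the output layer weights are obtained analytically,
    via the pseudo-inverse of the hidden layer output matrix.
    """
    N, D = X.shape
    W = rng.uniform(0.0, 1.0, size=(K, D))    # random input layer weights
    b = rng.uniform(0.0, 1.0, size=K)         # random hidden layer biases
    H = 1.0 / (1.0 + np.exp(-(X @ W.T + b)))  # hidden layer output matrix (Sigmoid)
    beta = np.linalg.pinv(H) @ y              # analytic output layer weights
    return W, b, beta

def predict_rwn(X, W, b, beta):
    """Forward pass of a trained random weight network."""
    H = 1.0 / (1.0 + np.exp(-(X @ W.T + b)))
    return H @ beta
```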
Existing research on improving the generalization capability of random weight networks focuses mainly on improving the network framework itself, including improving the input layer weights, selecting the optimal number of hidden layer nodes, and integrating multiple random weight networks. These measures improve the prediction performance of the random weight network to a certain extent, but they neglect the deep utilization of the information contained in the training data. In other words, existing improvement work uses the training data passively, treating it only as a test bed for checking the effect of the improvement, rather than actively mining the intrinsic information of the training data to guide how to improve the generalization ability of the random weight network.
Disclosure of Invention
The invention mainly aims to provide a method and a device for improving the generalization ability of a random weight network, and aims to solve the technical problem that the prior art does not actively mine the internal information of training data to guide how to improve the generalization ability of the random weight network.
To achieve the above object, a first aspect of the present invention provides a random weight network generalization capability improving method, including:

Step 1: train the random weight network RWN^(r) with the training sample T^(r), obtaining the trained random weight network RWN^(r+1) and the uncertainty value of each sample in T^(r), where r has an initial value of 0, T^(0) is the initial training sample, and RWN^(0) is the initial random weight network;

Step 2: select from the training sample T^(r) the target sample with the largest uncertainty value, and generate a simulation sample using the target sample and a preset neighborhood control factor;

Step 3: compute the union of the simulation sample and the training sample T^(r) as the new training sample T^(r+1);

Step 4: let r = r + 1 and return to step 1 until r = R, ending the training process after step 1 is executed, so as to obtain the improved random weight network RWN^(R), where R is the preset number of iterative training rounds.
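Read as a whole, steps 1 to 4 amount to the loop sketched below. This is a hedged illustration rather than the patented implementation: it reuses the train_rwn and predict_rwn sketches given earlier, generate_simulation_sample is sketched later in the description, and computing the uncertainty value as the absolute error between the real and actual outputs is an assumption consistent with the detailed description.

```python
import numpy as np

def improve_rwn(X0, y0, K, R, delta):
    """Iteratively augment the training sample and retrain the RWN (steps 1-4)."""
    X, y = X0.copy(), y0.copy()                    # T(0): initial training sample
    for r in range(R):                             # step 4 drives r = 0, 1, ..., R-1
        W, b, beta = train_rwn(X, y, K)                     # step 1: train RWN(r)
        U = np.abs(y - predict_rwn(X, W, b, beta))          # uncertainty value per sample
        i = int(np.argmax(U))                               # step 2: target sample
        x_sim, y_sim = generate_simulation_sample(X[i], y[i], delta)
        X = np.vstack([X, x_sim])                           # step 3: T(r+1) is the union of
        y = np.append(y, y_sim)                             # T(r) and the simulation sample
    return train_rwn(X, y, K)                      # final execution of step 1 -> RWN(R)
```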
To achieve the above object, a second aspect of the present invention further provides a random weight network generalization capability improving apparatus, including:

a training module, configured to train the random weight network RWN^(r) with the training sample T^(r), obtaining the trained random weight network RWN^(r+1) and the uncertainty value of each sample in T^(r), where r has an initial value of 0, T^(0) is the initial training sample, and RWN^(0) is the initial random weight network;

a selection generation module, configured to select from the training sample T^(r) the target sample with the largest uncertainty value, and to generate a simulation sample using the target sample and a preset neighborhood control factor;

a calculation module, configured to compute the union of the simulation sample and the training sample T^(r) as the new training sample T^(r+1);

a return ending module, configured to let r = r + 1 and return to the training module until r = R, the training process ending after the training module is executed, so as to obtain the improved random weight network RWN^(R), where R is the preset number of iterative training rounds.
The invention provides a random weight network generalization ability improvement method: on the premise of not changing the framework structure of the random weight network, the target sample with the largest uncertainty value in the training sample is mined to generate a simulation sample, and the random weight network is trained iteratively on the basis of the simulation sample, so that the training sample is actively mined to improve the generalization ability of the random weight network.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below. It is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained from these drawings without creative effort.
FIG. 1 is a flow chart of a method for improving the generalization capability of a random weight network according to a first embodiment of the present invention;

FIG. 2 is a flow chart of the refinement of the step in step 101 of training the random weight network RWN^(r) with the training sample T^(r) to obtain the uncertainty value of each sample in T^(r), according to the first embodiment of the present invention;

FIG. 3 is a flow chart of the refinement of the step in step 102 of generating a simulation sample using the target sample and a preset neighborhood control factor, according to the first embodiment of the present invention;

FIG. 4 is a flow chart of additional steps of the first embodiment of the present invention;

FIG. 5 is a schematic diagram of the functional modules of an apparatus for improving the generalization capability of a random weight network according to a second embodiment of the present invention;

FIG. 6 is a schematic diagram of the refined functional modules of the training module 501 according to the second embodiment of the present invention;

FIG. 7 is a schematic diagram of additional functional modules according to the second embodiment of the present invention;

FIG. 8 is a schematic diagram of the refined functional modules of the selection generation module in the second embodiment of the present invention.
Detailed Description
In order to make the objects, features and advantages of the present invention more obvious and understandable, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention, and it is apparent that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
In the prior art, there exists the technical problem that the internal information of the training data is not actively mined to guide the improvement of the generalization capability of the random weight network.
In order to solve the above technical problem, the present invention provides a method and an apparatus for improving the generalization capability of a random weight network. On the premise of not changing the framework structure of the random weight network, a simulation sample is generated by mining the target sample with the largest uncertainty value in the training sample, and the random weight network is trained iteratively based on the simulation sample, so that the training sample is actively mined to improve the generalization capability of the random weight network. Furthermore, the method for improving the generalization capability of the random weight network in the embodiment of the invention not only obviously improves the generalization capability of the random weight network, but also has an extremely strong capability of controlling overfitting of the random weight network.
Referring to FIG. 1, a flow chart of a method for improving the generalization capability of a random weight network according to a first embodiment of the present invention, the method comprises:
Step 101: train the random weight network RWN^(r) with the training sample T^(r), obtaining the trained random weight network RWN^(r+1) and the uncertainty value of each sample in T^(r), where r has an initial value of 0, T^(0) is the initial training sample, and RWN^(0) is the initial random weight network.

In an embodiment of the invention, the initial training sample T^(0) is a data set containing N D-dimensional training samples:

$T^{(0)} = \left\{ \left( \mathbf{x}_n, y_n \right) \mid \mathbf{x}_n = \left( x_{n1}, \ldots, x_{nD} \right) \in \mathbb{R}^{D},\ y_n \in \mathbb{R},\ n = 1, 2, \ldots, N \right\}$
The initial training sample T^(0) is used to train the initial random weight network RWN^(0), which has D input layer nodes, K hidden layer nodes, and 1 output layer node. The input layer input matrix of RWN^(0) is:

$\mathbf{X} = \left[ x_{nd} \right]_{N \times D}$

The output layer output matrix of RWN^(0) is:

$\mathbf{Y} = \left[ y_n \right]_{N \times 1}$

The hidden layer input matrix of RWN^(0) is:

$\mathbf{X} \mathbf{W}^{\mathrm{T}} + \mathbf{B}$

The hidden layer output matrix of RWN^(0) is:

$\mathbf{H} = g\left( \mathbf{X} \mathbf{W}^{\mathrm{T}} + \mathbf{B} \right)$

where $\mathbf{W} = \left[ \omega_{kd} \right]_{K \times D}$ is the input layer weight matrix and $\mathbf{B}$ is the hidden layer bias, each of its N rows being $\left( b_1, \ldots, b_K \right)$.

Here ω_kd and b_k, k = 1, 2, ..., K, are random numbers from an arbitrary interval, for example uniformly distributed random numbers on the interval [0, 1], and

$g(z) = \frac{1}{1 + e^{-z}}$

is the Sigmoid activation function.
In an embodiment of the invention, the initial training sample T^(0) is used to train the initial random weight network RWN^(0), obtaining the trained random weight network RWN^(1) and the uncertainty value of each sample in T^(0).

The number of completed training rounds is denoted by r, with initial value 0; that is, after one round of training, the obtained random weight network is RWN^(1), and in general RWN^(r) denotes the random weight network obtained after the r-th round of training.
In the embodiment of the present invention, the uncertainty is obtained from the actual output and the real output of the output layer output matrix of the random weight network. Referring to FIG. 2, a flow chart of the refinement of the step in step 101 of training the random weight network RWN^(r) with the training sample T^(r) to obtain the uncertainty value of each sample in T^(r), the steps comprise:
Step 201: train the random weight network RWN^(r) with the training sample T^(r) to obtain the output layer output matrix;

Step 202: take the output layer output matrix as the real output of the training sample, calculate the error between the real output and the actual output of the training sample, and take the error as the uncertainty value of each sample in the training sample.
For the training sample T^(r) and the random weight network RWN^(r) obtained in the r-th round of training, the training sample T^(r) is input into the random weight network RWN^(r) and trained to obtain the output layer output matrix.

The output layer output matrix is taken as the real output of the training sample T^(r), the error between the real output and the actual output is calculated, and the error is taken as the uncertainty value of each sample in T^(r).
The actual output is obtained using a test sample: a preset test sample is input into the initial random weight network RWN^(0), and the resulting output layer output matrix is taken as the actual output, which is then used throughout the iterative training process.
Since the output layer output matrix is

$\hat{\mathbf{Y}} = \left[ \hat{y}_n \right]_{N \times 1}$

each sample has a corresponding output layer value. The output layer value of each sample in the output layer matrix is taken as the real output, and the error between the actual output and the real output of each sample is calculated to obtain the uncertainty value of each sample.

The uncertainty value is

$U\left( \mathbf{x}_n \right) = \left| o_n - \hat{y}_n \right|, \quad n = 1, 2, \ldots, N$

where $o_n$ denotes the actual output of the n-th sample, $\hat{y}_n$ denotes the output of the random weight network for the n-th training sample, i.e. the real output, U denotes the uncertainty value, and $\mathbf{x}_n$ denotes the input value of sample n in the input layer input matrix.
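Under the absolute-error reading of the uncertainty value used in the loop sketched earlier, the computation is a single vectorized line; the helper below isolates it, with the name uncertainty_values being an assumption of this sketch.

```python
import numpy as np

def uncertainty_values(actual, real):
    """U(x_n) = |actual output - real output| for every sample n."""
    return np.abs(np.asarray(actual) - np.asarray(real))

U = uncertainty_values([0.2, 0.9, 0.5], [0.25, 0.4, 0.5])  # -> array([0.05, 0.5, 0.  ])
print(int(np.argmax(U)))                                   # target sample index: 1
```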
Step 102: from the training sample
Figure BDA00012983241800000611
Selecting a target sample with the largest uncertainty value, and generating a simulation sample by using the target sample and a preset neighborhood control factor;
in the embodiment of the invention, after one training is finished, a training sample is obtained
Figure BDA00012983241800000612
And from the training samples
Figure BDA0001298324180000071
And selecting the sample with the largest uncertainty value as a target sample, and generating a simulation sample by using the target sample and a preset neighborhood control factor.
The training sample T^(r) consists of the samples $\left( \mathbf{x}_n, y_n \right)$ accumulated up to the r-th round.

In the r-th round of training, the training sample T^(r-1) obtained in the previous round and the random weight network RWN^(r-1) obtained in the previous round are used. After the uncertainty value of each sample in T^(r) is obtained, the sample with the largest uncertainty value, denoted $\left( \mathbf{x}^{*(r)}, y^{*(r)} \right)$, is selected, and this sample is used to generate the simulation sample.
Compared with samples with small uncertainty values, samples with large uncertainty values play a more important role in improving the generalization capability of the random weight network. As an extreme example, if the uncertainty value of a sample is 0, i.e. it has no uncertainty, there is no need to adapt the current learning algorithm to that sample.
Referring to FIG. 3, a flow chart of the refinement of the step in step 102 of generating a simulation sample using the target sample and a preset neighborhood control factor, the steps comprise:

Step 301: determine the value range of the input layer input matrix and the value range of the output layer output matrix of the simulation sample to be generated, using the target sample and the neighborhood control factor;

Step 302: randomly extract random numbers from the value range of the input layer input matrix and generate the input layer input of the simulation sample with the extracted random numbers; randomly extract random numbers from the value range of the output layer output matrix and generate the output layer output of the simulation sample with the extracted random numbers.
In the embodiment of the invention, simulation samples approximately identically distributed with the high-uncertainty target sample are obtained based on that target sample.
Specifically, the value range of the input layer input matrix and the value range of the output layer output matrix of the simulation sample to be generated are determined by using the target sample and the preset neighborhood control factor.
The neighborhood of the target sample $\left( \mathbf{x}^{*(r)}, y^{*(r)} \right)$ controlled by the corresponding neighborhood control factor is a (D+1)-dimensional hyper-rectangle, and the value range of the input layer input matrix and the value range of the output layer output matrix of the simulation sample to be generated are, respectively:

$\left[ x_{d}^{*(r)} - \delta \Delta^{(r)},\ x_{d}^{*(r)} + \delta \Delta^{(r)} \right], \quad d = 1, 2, \ldots, D$

$\left[ y^{*(r)} - \delta \Delta^{(r)},\ y^{*(r)} + \delta \Delta^{(r)} \right]$

where $\mathbf{x}^{*(r)} = \left( x_{1}^{*(r)}, \ldots, x_{D}^{*(r)} \right)$ denotes the input layer input of the target sample obtained in the r-th round of training, $y^{*(r)}$ denotes the output layer output of the target sample obtained in the r-th round of training, δ denotes the neighborhood control factor, and $\Delta^{(r)}$ denotes the difference between the maximum value and the minimum value in the input layer input of the target sample. The neighborhood control factor satisfies δ > 0.
After the value ranges are obtained, random numbers are randomly extracted from the value range of the input layer input matrix and used to generate the input layer input of the simulation sample, and random numbers are randomly extracted from the value range of the output layer output matrix and used to generate the output layer output of the simulation sample, so as to obtain the simulation sample.
It will be appreciated that, in embodiments of the invention, the neighborhood control factor is set so as to obtain simulation samples approximately identically distributed with the sample with the largest uncertainty value; that is, simulation samples are generated in the δ-neighborhood of the high-uncertainty sample so as to reduce the current random weight network error.
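A simulation sample can then be drawn from this δ-neighborhood as sketched below, matching the generate_simulation_sample name used in the earlier loop sketch. The symmetric ranges, centered on the target sample and scaled by the max-min spread of its input, are an assumption consistent with the hyper-rectangle described above.

```python
import numpy as np

def generate_simulation_sample(x_star, y_star, delta, rng=np.random.default_rng(1)):
    """Draw one simulation sample near the target sample (x_star, y_star).

    x_star: (D,) input of the target sample; y_star: its scalar output;
    delta: the preset neighborhood control factor (delta > 0).
    """
    spread = x_star.max() - x_star.min()  # max-min difference of the target input
    x_sim = rng.uniform(x_star - delta * spread, x_star + delta * spread)  # step 302, input
    y_sim = rng.uniform(y_star - delta * spread, y_star + delta * spread)  # step 302, output
    return x_sim, y_sim
```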
Step 103: compute the union of the simulation sample and the training sample T^(r) as the new training sample T^(r+1).
Step 104: let r = r + 1 and return to step 101 until r = R, ending the training process after step 101 is executed, so as to obtain the improved random weight network RWN^(R), where R is the preset number of iterative training rounds.
In the embodiment of the invention, the union of the simulation sample and the training sample T^(r) is computed as the new training sample T^(r+1); therefore,

$T^{(r+1)} = T^{(r)} \cup \left\{ \left( \mathbf{x}^{(r+1)}, y^{(r+1)} \right) \right\}$

where $\left( \mathbf{x}^{(r+1)}, y^{(r+1)} \right)$ is the simulation sample generated in round r. That is, each round of training produces, from the training sample used in that round together with the simulation sample generated in that round, the training sample to be used in the next round.
In the embodiment of the invention, after the new training sample T^(r+1) is obtained, r is set to r + 1 and the process returns to step 101, so that the random weight network is trained iteratively; when r = R, the training process ends after step 101 is executed, and the improved random weight network RWN^(R) is obtained, where R is the preset number of iterative training rounds.
The output layer weight matrix of the random weight network RWN^(R) is:

$\boldsymbol{\beta}^{(R)} = \mathbf{H}^{\dagger} \mathbf{Y}$

where

$\mathbf{H} = g\left( \mathbf{X} \mathbf{W}^{\mathrm{T}} + \mathbf{B} \right)$

$\mathbf{H}^{\dagger}$ denotes the Moore-Penrose pseudo-inverse of the hidden layer output matrix, W denotes the input layer weight matrix, and B is the hidden layer bias.
Further, referring to FIG. 4, a flow chart of additional steps of the first embodiment of the present invention, the steps comprise:

Step 401: take M random numbers as the input layer weights and P random numbers as the hidden layer biases from a preset arbitrary interval;

Step 402: calculate a first average value of the M random numbers and set the input layer weight of the initial random weight network RWN^(0) according to the first average value; calculate a second average value of the P random numbers and set the hidden layer bias of the initial random weight network RWN^(0) according to the second average value.
In the embodiment of the present invention, before the initial random weight network is trained, its input layer weights and hidden layer biases need to be set, which may be done as follows: take M random numbers as the input layer weights and P random numbers as the hidden layer biases from a preset arbitrary interval; calculate the first average value of the M random numbers and set the input layer weight of the initial random weight network RWN^(0) according to it; calculate the second average value of the P random numbers and set the hidden layer bias of the initial random weight network RWN^(0) according to it.
For example, 100 random numbers may be taken from an arbitrary interval and the average of the 100 random numbers may be used as the input layer weight, and 100 random numbers may be taken from the arbitrary interval and the average of the 100 random numbers may be used as the hidden layer bias.
It can be understood that setting the input layer weights and the hidden layer biases to the average of a plurality of random numbers effectively eliminates the influence of this randomization on the training of the random weight network.
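A minimal sketch of steps 401 and 402 follows, assuming the interval [0, 1], per-parameter averaging, and M = P = 100 as in the example above; the function name is an assumption of this sketch.

```python
import numpy as np

def averaged_init(shape_w, shape_b, M=100, P=100, rng=np.random.default_rng(2)):
    """Set each input layer weight to the mean of M random numbers and each
    hidden layer bias to the mean of P random numbers (steps 401-402)."""
    W0 = rng.uniform(0.0, 1.0, size=(M, *shape_w)).mean(axis=0)  # first average values
    b0 = rng.uniform(0.0, 1.0, size=(P, *shape_b)).mean(axis=0)  # second average values
    return W0, b0

# Example: initialize a K x D input weight matrix and K hidden biases for RWN(0).
W0, b0 = averaged_init(shape_w=(10, 3), shape_b=(10,))
```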
In the embodiment of the invention, on the premise of not changing the frame structure of the random weight network, the target sample with the largest uncertainty value in the training sample is mined to generate the simulation sample, and the random weight network is trained in an iterative manner based on the simulation sample, so that the aim of actively mining the training sample to improve the generalization capability of the random weight network can be fulfilled. Furthermore, the method for improving the generalization capability of the random weight network in the embodiment of the invention not only obviously improves the generalization capability of the random weight network, but also has extremely strong capability of controlling the overfitting of the random weight network.
It can be understood that, after the random weight network RWN^(R) is obtained through R rounds of training, the generalization ability of RWN^(R) can be further verified, as follows:
assume initial random-weight network RWN(0)The test error on the data set of the independent test sample is E(0)The core problem to be solved by the embodiment of the invention is how to generate a data set x containing R ≧ 1 simulation sample(1),x(2),…,x(R)Is based on
Figure BDA0001298324180000101
Trained random weight network RWN(R)Test error E on a data set of a test sample(R)<E(0). Thus, a data set of test samples can be input into the random-weight network RWN(R)Obtaining the output test error E(R)And using the test error E(R)And test error E(0)And comparing to verify the improvement of generalization capability brought by training the random weight network by the technical scheme in the embodiment of the invention.
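The verification can be sketched end to end as follows, reusing the earlier sketch functions. The synthetic regression data, the RMSE error measure and all parameter values are assumptions for illustration; the patent does not fix a particular test set or error metric.

```python
import numpy as np

rng = np.random.default_rng(3)
X = rng.uniform(-1.0, 1.0, size=(200, 3))                    # synthetic inputs
y = np.sin(X).sum(axis=1) + 0.05 * rng.standard_normal(200)  # noisy synthetic targets
X_train, y_train, X_test, y_test = X[:150], y[:150], X[150:], y[150:]

def rmse(y_true, y_pred):
    return float(np.sqrt(np.mean((y_true - y_pred) ** 2)))

W0, b0, beta0 = train_rwn(X_train, y_train, K=20)            # initial network RWN(0)
E0 = rmse(y_test, predict_rwn(X_test, W0, b0, beta0))        # test error E(0)

WR, bR, betaR = improve_rwn(X_train, y_train, K=20, R=10, delta=0.1)  # improved RWN(R)
ER = rmse(y_test, predict_rwn(X_test, WR, bR, betaR))        # test error E(R)

print(f"E(0) = {E0:.4f}, E(R) = {ER:.4f}, improved: {ER < E0}")  # goal: E(R) < E(0)
```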
It is understood that the improvement in the embodiment of the present invention is an improvement on the output layer weights of the random weight network, and the generalization capability of the random weight network is improved by iteratively optimizing the output layer weights of the random weight network by using simulation samples. Compared with the improvement of the random weight network framework structure in the prior art, the technical scheme provided by the embodiment of the invention starts with training samples on the basis of not changing the random weight network framework structure, and achieves the purpose of improving the prediction performance of the random weight network by introducing more simulation samples, namely the improvement of the output layer weight of the random weight network is realized.
It can be understood that the technical solution in the embodiment of the present invention has the following advantages over the prior art: the framework structure of the random weight network does not need to be modified, so the generalization capability of the random weight network is improved more conveniently and easily; the method has strong expansibility and is also applicable to random weight networks that have been improved on the basis of the framework structure; it expands the knowledge space represented by the training samples by generating a batch of simulation samples distributed like the high-uncertainty training samples; and overfitting is efficiently controlled, greatly reducing the probability of the overfitting phenomenon.
Referring to FIG. 5, a schematic diagram of the functional modules of an apparatus for improving the generalization capability of a random weight network according to a second embodiment of the present invention, the apparatus comprises:
a training module 501, configured to train the random weight network RWN^(r) with the training sample T^(r), obtaining the trained random weight network RWN^(r+1) and the uncertainty value of each sample in T^(r), where r has an initial value of 0, T^(0) is the initial training sample, and RWN^(0) is the initial random weight network;

a selection generation module 502, configured to select from the training sample T^(r) the target sample with the largest uncertainty value, and to generate a simulation sample using the target sample and a preset neighborhood control factor;

a calculation module 503, configured to compute the union of the simulation sample and the training sample T^(r) as the new training sample T^(r+1);

a return ending module 504, configured to let r = r + 1 and return to the training module 501 until r = R, the training process ending after the training module is executed, so as to obtain the improved random weight network RWN^(R), where R is the preset number of iterative training rounds.
Further, referring to FIG. 6, a schematic diagram of the refined functional modules of the training module 501 according to the second embodiment of the present invention, the training module 501 comprises:

a network training module 601, configured to train the random weight network RWN^(r) with the training sample T^(r), obtaining the output layer output matrix and the trained random weight network RWN^(r+1);

an uncertainty calculation module 602, configured to take the output layer output matrix as the real output of the training sample, calculate the error between the real output and the actual output of the training sample, and take the error as the uncertainty value of each sample in the training sample.
Further, referring to FIG. 7, a schematic diagram of additional functional modules according to the second embodiment of the present invention, the additional functional modules comprise:

an extracting module 701, configured to take M random numbers as the input layer weights and P random numbers as the hidden layer biases from a preset arbitrary interval;

a mean value calculating module 702, configured to calculate a first average value of the M random numbers and set the input layer weight of the initial random weight network RWN^(0) according to the first average value, and to calculate a second average value of the P random numbers and set the hidden layer bias of the initial random weight network RWN^(0) according to the second average value.
Further, referring to FIG. 8, a schematic diagram of the refined functional modules of the selection generation module 502 according to the second embodiment of the present invention, the selection generation module 502 comprises:

a selection module 801, configured to select from the training sample T^(r) the target sample with the largest uncertainty value;

a range determining module 802, configured to determine the value range of the input layer input matrix and the value range of the output layer output matrix of the simulation sample to be generated, using the target sample and the neighborhood control factor;

a sample generation module 803, configured to randomly extract random numbers from the value range of the input layer input matrix and generate the input layer input of the simulation sample with the extracted random numbers, and to randomly extract random numbers from the value range of the output layer output matrix and generate the output layer output of the simulation sample with the extracted random numbers;

where the value range of the input layer input matrix and the value range of the output layer output matrix are, respectively:

$\left[ x_{d}^{*(r)} - \delta \Delta^{(r)},\ x_{d}^{*(r)} + \delta \Delta^{(r)} \right], \quad d = 1, 2, \ldots, D$

$\left[ y^{*(r)} - \delta \Delta^{(r)},\ y^{*(r)} + \delta \Delta^{(r)} \right]$

where $\mathbf{x}^{*(r)} = \left( x_{1}^{*(r)}, \ldots, x_{D}^{*(r)} \right)$ denotes the input layer input of the target sample obtained in the r-th round of training, $y^{*(r)}$ denotes the output layer output of the target sample obtained in the r-th round of training, δ denotes the neighborhood control factor, and $\Delta^{(r)}$ denotes the difference between the maximum value and the minimum value in the input layer input of the target sample.
In the embodiment of the invention, on the premise of not changing the frame structure of the random weight network, the target sample with the largest uncertainty value in the training sample is mined to generate the simulation sample, and the random weight network is trained in an iterative manner based on the simulation sample, so that the aim of actively mining the training sample to improve the generalization capability of the random weight network can be fulfilled. Furthermore, the method for improving the generalization capability of the random weight network in the embodiment of the invention not only obviously improves the generalization capability of the random weight network, but also has extremely strong capability of controlling the overfitting of the random weight network.
In the several embodiments provided in the present application, it should be understood that the disclosed apparatus and method may be implemented in other ways. For example, the above-described apparatus embodiments are merely illustrative, and for example, the division of the modules is merely a logical division, and in actual implementation, there may be other divisions, for example, multiple modules or components may be combined or integrated into another system, or some features may be omitted, or not implemented. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be an indirect coupling or communication connection through some interfaces, devices or modules, and may be in an electrical, mechanical or other form.
The modules described as separate parts may or may not be physically separate, and parts displayed as modules may or may not be physical modules, may be located in one place, or may be distributed on a plurality of network modules. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of the present embodiment.
In addition, functional modules in the embodiments of the present invention may be integrated into one processing module, or each of the modules may exist alone physically, or two or more modules are integrated into one module. The integrated module can be realized in a hardware mode, and can also be realized in a software functional module mode.
The integrated module, if implemented in the form of a software functional module and sold or used as a stand-alone product, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk, and other various media capable of storing program codes.
It should be noted that, for the sake of simplicity, the above-mentioned method embodiments are described as a series of acts or combinations, but those skilled in the art should understand that the present invention is not limited by the described order of acts, as some steps may be performed in other orders or simultaneously according to the present invention. Further, those skilled in the art will appreciate that the embodiments described in the specification are presently preferred and that no acts or modules are necessarily required of the invention.
In the above embodiments, the descriptions of the respective embodiments have respective emphasis, and for parts that are not described in detail in a certain embodiment, reference may be made to related descriptions of other embodiments.
The method and apparatus for improving the generalization capability of a random weight network provided by the present invention have been described in detail above. Those skilled in the art may make changes to the specific implementation and the application scope according to the concept of the embodiments of the present invention; in summary, the content of this specification should not be construed as limiting the present invention.

Claims (6)

1. A method for improving the generalization capability of a random weight network, the method comprising:

Step 1: training the random weight network RWN^(r) with the training sample T^(r), obtaining the trained random weight network RWN^(r+1) and the uncertainty value of each sample in T^(r), wherein r has an initial value of 0, T^(0) is the initial training sample, and RWN^(0) is the initial random weight network;

Step 2: selecting from the training sample T^(r) the target sample with the largest uncertainty value, and generating a simulation sample using the target sample and a preset neighborhood control factor;

Step 3: computing the union of the simulation sample and the training sample T^(r) as the new training sample T^(r+1);

Step 4: letting r = r + 1 and returning to step 1 until r = R, ending the training process after step 1 is executed, so as to obtain the improved random weight network RWN^(R), wherein R is the preset number of iterative training rounds;

wherein generating the simulation sample using the target sample and the preset neighborhood control factor comprises:

determining the value range of the input layer input matrix and the value range of the output layer output matrix of the simulation sample to be generated, using the target sample and the neighborhood control factor;

randomly extracting random numbers from the value range of the input layer input matrix and generating the input layer input of the simulation sample with the extracted random numbers; and randomly extracting random numbers from the value range of the output layer output matrix and generating the output layer output of the simulation sample with the extracted random numbers;

wherein the value range of the input layer input matrix and the value range of the output layer output matrix are, respectively:

$\left[ x_{d}^{*(r)} - \delta \Delta^{(r)},\ x_{d}^{*(r)} + \delta \Delta^{(r)} \right], \quad d = 1, 2, \ldots, D$

$\left[ y^{*(r)} - \delta \Delta^{(r)},\ y^{*(r)} + \delta \Delta^{(r)} \right]$

wherein $\mathbf{x}^{*(r)} = \left( x_{1}^{*(r)}, \ldots, x_{D}^{*(r)} \right)$ denotes the input layer input of the target sample obtained in the r-th round of training, $y^{*(r)}$ denotes the output layer output of the target sample obtained in the r-th round of training, δ denotes the neighborhood control factor, and $\Delta^{(r)}$ denotes the difference between the maximum value and the minimum value in the input layer input of the target sample.
2. The method of claim 1, wherein training the random weight network RWN^(r) with the training sample T^(r) to obtain the uncertainty value of each sample in T^(r) comprises:

training the random weight network RWN^(r) with the training sample T^(r) to obtain the output layer output matrix; and

taking the output layer output matrix as the real output of the training sample, calculating the error between the real output and the actual output of the training sample, and taking the error as the uncertainty value of each sample in the training sample.
3. The method of claim 1, further comprising:

taking M random numbers as the input layer weights and P random numbers as the hidden layer biases from a preset arbitrary interval; and

calculating a first average value of the M random numbers and setting the input layer weight of the initial random weight network RWN^(0) according to the first average value; calculating a second average value of the P random numbers and setting the hidden layer bias of the initial random weight network RWN^(0) according to the second average value.
4. An apparatus for improving the generalization capability of a random weight network, the apparatus comprising:

a training module, configured to train the random weight network RWN^(r) with the training sample T^(r), obtaining the trained random weight network RWN^(r+1) and the uncertainty value of each sample in T^(r), wherein r has an initial value of 0, T^(0) is the initial training sample, and RWN^(0) is the initial random weight network;

a selection generation module, configured to select from the training sample T^(r) the target sample with the largest uncertainty value and generate a simulation sample using the target sample and a preset neighborhood control factor;

a calculation module, configured to compute the union of the simulation sample and the training sample T^(r) as the new training sample T^(r+1);

a return ending module, configured to let r = r + 1 and return to the training module until r = R, the training process ending after the training module is executed, so as to obtain the improved random weight network RWN^(R), wherein R is the preset number of iterative training rounds;

wherein the selection generation module comprises:

a selection module, configured to select from the training sample T^(r) the target sample with the largest uncertainty value;

a range determining module, configured to determine the value range of the input layer input matrix and the value range of the output layer output matrix of the simulation sample to be generated, using the target sample and the neighborhood control factor;

a sample generation module, configured to randomly extract random numbers from the value range of the input layer input matrix and generate the input layer input of the simulation sample with the extracted random numbers, and to randomly extract random numbers from the value range of the output layer output matrix and generate the output layer output of the simulation sample with the extracted random numbers;

wherein the value range of the input layer input matrix and the value range of the output layer output matrix are, respectively:

$\left[ x_{d}^{*(r)} - \delta \Delta^{(r)},\ x_{d}^{*(r)} + \delta \Delta^{(r)} \right], \quad d = 1, 2, \ldots, D$

$\left[ y^{*(r)} - \delta \Delta^{(r)},\ y^{*(r)} + \delta \Delta^{(r)} \right]$

wherein $\mathbf{x}^{*(r)} = \left( x_{1}^{*(r)}, \ldots, x_{D}^{*(r)} \right)$ denotes the input layer input of the target sample obtained in the r-th round of training, $y^{*(r)}$ denotes the output layer output of the target sample obtained in the r-th round of training, δ denotes the neighborhood control factor, and $\Delta^{(r)}$ denotes the difference between the maximum value and the minimum value in the input layer input of the target sample.
5. The apparatus of claim 4, wherein the training module comprises:

a network training module, configured to train the random weight network RWN^(r) with the training sample T^(r), obtaining the output layer output matrix and the trained random weight network RWN^(r+1);

an uncertainty calculation module, configured to take the output layer output matrix as the real output of the training sample, calculate the error between the real output and the actual output of the training sample, and take the error as the uncertainty value of each sample in the training sample.
6. The apparatus of claim 4, further comprising:

an extraction module, configured to take M random numbers as the input layer weights and P random numbers as the hidden layer biases from a preset arbitrary interval;

a mean value calculating module, configured to calculate a first average value of the M random numbers and set the input layer weight of the initial random weight network RWN^(0) according to the first average value, and to calculate a second average value of the P random numbers and set the hidden layer bias of the initial random weight network RWN^(0) according to the second average value.
CN201710354539.0A, filed 2017-05-18 (priority date 2017-05-18): Random weight network generalization capability improvement method and device. Granted as CN107256425B (Active).

Priority Application (1)

CN201710354539.0A, priority date and filing date 2017-05-18: Random weight network generalization capability improvement method and device

Publications (2)

CN107256425A, published 2017-10-17 (application publication)
CN107256425B, published 2020-04-14 (granted patent)

Family

ID: 60027338
Family application: CN201710354539.0A (Active), priority date and filing date 2017-05-18 - Random weight network generalization capability improvement method and device
Country status: CN - CN107256425B

Families Citing this Family (1)

CN108564173A (priority date 2018-04-26, published 2018-09-21, 深圳大学): Random weight network generalization capability improvement method, device and computer-readable storage medium (cited by examiner)

Citations

Patent Citations (4)

CN102982373A (priority date 2012-12-31, published 2013-03-20, 山东大学): OIN (Optimal Input Normalization) neural network training method for a mixed SVM (Support Vector Machine) regression algorithm (cited by examiner)
CN104298999A (priority date 2014-09-30, published 2015-01-21, 西安电子科技大学): Hyperspectral feature learning method based on recursive automatic coding (cited by examiner)
WO2016145516A1 (priority date 2015-03-13, published 2016-09-22, Deep Genomics Incorporated): System and method for training neural networks (cited by examiner)
CN105550744A (priority date 2015-12-06, published 2016-05-04, 北京工业大学): Neural network clustering method based on iteration (cited by examiner)

Non-Patent Citations (1)

HUANG, Guang-Bin et al., "Extreme learning machine: Theory and applications", Neurocomputing, 2006-05-16 (cited by examiner)



Legal Events

PB01: Publication
SE01: Entry into force of request for substantive examination
GR01: Patent grant