CN113283373A

CN113283373A - Method for enhancing detection of limb motion parameters by depth camera

Info

Publication number: CN113283373A
Application number: CN202110642003.5A
Authority: CN
Inventors: 张若琦; 吴小鹰; 朱业安; 张丽; 张琳; 侯文生; 陈琳
Original assignee: Chongqing University
Current assignee: Taiji Computer Corp Ltd
Priority date: 2021-06-09
Filing date: 2021-06-09
Publication date: 2021-08-20
Anticipated expiration: 2041-06-09
Also published as: CN113283373B

Abstract

The invention relates to a method for enhancing detection of limb motion parameters by a depth camera, and belongs to the field of human body posture information detection. The method comprises the following steps: collecting limb movement data of a human body by using a depth camera; s2: when the phenomenon that all mark points of collected human body limb movement data are missing at a certain moment is detected, namely the data volume is insufficient, predicting and reconstructing the missing data by utilizing a trained neural network with an attention mechanism to obtain complete skeleton three-dimensional coordinate data; when the phenomenon that part of mark points of collected body limb movement data at a certain moment are missing is detected, namely three-dimensional coordinate data of individual skeletal joint points at a certain moment are missing, the missing data points are complemented by using a trained neural network with an attention mechanism, and finally complete skeletal three-dimensional coordinate data are obtained. The invention realizes non-contact human body free motion detection and effectively eliminates the problem of data loss which may occur when a depth camera is used for collecting data.

Description

Method for enhancing detection of limb motion parameters by depth camera

Technical Field

The invention belongs to the field of human body posture information detection, and relates to a method for enhancing the detection of limb motion parameters by a depth camera.

Background

The human body posture detection refers to measuring posture information of a human body or a local part of the human body during movement by a certain detection method, for example: the human body posture information is analyzed to obtain the instantaneous posture of the human body and the change rule of the posture of the human body.

Human motion posture detection is an essential detection technology for obtaining human body information, and is widely applied to the fields of physical training, gait analysis, action recognition, medical rehabilitation and military.

The patent application with the publication number of CN105046281A discloses a human behavior detection method based on Kinect, and the method adopts Kinect somatosensory equipment (a depth image sensor) to capture human gestures, so that compared with the traditional method of analyzing video images captured by a camera, the method reduces the influence of external environment on analysis results, and the detection results are more accurate. However, this method requires the motion of the subject to be restricted, and the subject needs to make a specified action according to the experimenter's request.

Patent application publication No. CN110477922A discloses a limb movement detection method and system, in which a movement capture device worn on a limb (including an arm and a leg, hereinafter referred to as a limb) is obtained to obtain limb movement posture data, a limb movement model is generated, and is compared and analyzed with a limb movement model of a person in good health condition, so as to determine the deviation degree between the generated limb movement model and a normal limb movement model of the person in good health condition. But wearable equipment may have certain hindrance to the motion of being tried, influences the motion condition of being tried, and receives outside environment to disturb greatly, and the operation is also relatively loaded down with trivial details.

The patent application with the publication number of CN112215172 discloses a human body lying posture three-dimensional posture estimation method fusing a human color image and depth information, because a depth camera is difficult to distinguish the human body lying posture, the method obtains the information of the human body lying posture by using a depth image and color image fusion mode, and has the advantages of low cost, no contact, high precision and the like, and the frame loss problem of the depth camera in the human body posture detection process is solved by using a spline interpolation method. However, the scheme only aims at the human body in the lying posture, and the detection and identification types of the human body movement posture are too single.

The solutions given in the market at present basically aim at some specific postures, namely, constraints are made on the movement to be tested, and the detected human posture information is not rich and comprehensive enough. Various problems may occur in the data acquisition process, for example, the hardware equipment is not operated properly, the body condition to be tested is not allowed, and the like, which results in serious shortage of the acquired data amount, a lot of frame data is lost, the data loss also has a certain influence on the subsequent data analysis result, and a phenomenon that partial bone joint point data is lost may also occur in the data acquisition process, which causes the data acquisition personnel to need to re-acquire the data to be tested, and wastes time and labor for the data acquisition personnel to be tested and the data acquisition personnel.

Therefore, a new method for detecting limb movement is needed to overcome the above drawbacks.

Disclosure of Invention

In view of the above, the present invention provides a method for enhancing the depth camera to detect the limb movement parameters, which utilizes a non-contact device, i.e. a depth camera, to capture the body posture under the condition of unconstrained free movement, perform predictive reconstruction on the limb movement data of the subject in case of insufficient data amount, or effectively and accurately complement the three-dimensional coordinate information of the individual bone joint point lost when the depth camera collects the limb movement data of the subject at a certain frame time.

In order to achieve the purpose, the invention provides the following technical scheme:

a method for enhancing detection of limb motion parameters by a depth camera specifically comprises the following steps:

s1: collecting limb movement data of a human body by using a depth camera;

s2: when the phenomenon that all mark points of collected human body limb movement data are missing at a certain moment is detected, namely the data volume is insufficient, predicting and reconstructing the missing data by utilizing a trained neural network with an attention mechanism to obtain complete skeleton three-dimensional coordinate data;

when the phenomenon that part of mark points of collected body limb movement data at a certain moment are missing is detected, namely three-dimensional coordinate data of individual skeletal joint points at a certain moment are missing, the missing data points are complemented by using a trained neural network with an attention mechanism, and finally complete skeletal three-dimensional coordinate data are obtained.

Further, the step S1 specifically includes: after the depth camera is connected with a computer, a testee stands in front of the depth camera, the three-dimensional coordinate data of the skeletal joint points in the free movement process of the testee are collected in real time by using a visual capture sensor in the depth camera, and the three-dimensional coordinate data of the skeletal joint points collected by the depth camera are stored as files.

Further, in step S2, when all the marker points after a certain time are missing, prediction reconstruction is performed on the future movement situation based on the existing data. The invention utilizes a neural network method in machine learning to predict and reconstruct the missing conditions of all the mark points after a certain time.

The neural network is actually referred to as "neural network learning" and the purpose of learning is to assign a correct value to the weight of the connection of each node of the hidden layer. These weights may determine the vector of outputs given an input vector. In the initial stage, all weights are randomly assigned. For each input value of the training set, after the forward calculation of the neural network, the obtained output value will be compared with the expected output value, and then the obtained error will be transmitted back to the previous network layer (back propagation algorithm). This error is noted and the weights are adjusted accordingly. The neural network needs to iterate repeatedly until the error of the output is lower than a set threshold, i.e. the difference between the output value and the expected output is small enough, and this process is also called a model training process.

In the invention, along with the continuous input of three-dimensional coordinate data of different tested skeletal joint points into the neural network with attention mechanism, the neural network continuously adjusts the mutual connection relationship among a large number of nodes in the neural network, the output is continuously close to the expected output, the reconstructed data is predicted to be gradually close to the real motion situation of the human body, namely the reconstructed motion track of the human body is predicted to be gradually close to the real motion track of the human body, and the predicted reconstructed data is used for calculating the limb motion parameters.

The specific steps of model training are as follows:

s201: calibrating collected three-dimensional coordinate data of skeletal joint points under the condition of free motion of a group of complete testees by using a three-dimensional motion capture system, then removing data after k frames, namely data after a certain moment, inputting missing data into a neural network, and automatically selecting a k value by an experimenter;

s202: taking the complete three-dimensional coordinate data of the skeletal joint points as expected output of the model, comparing the output of the neural network with the expected output, and forming a mapping relation between the input and the output in the neural network through multiple iterations;

s203: when the input and the output achieve a good mapping relation, the training of the neural network is completed, and then the test data is input into the neural network for prediction reconstruction. The test data is data of all the marker point missing conditions after a certain time appears in the data acquisition process.

Further, in step S2, when a partial marker point is missing at a certain time, a point is complemented based on the context information of the missing data.

The method comprises the steps of training a neural network with an attention mechanism along with continuously inputting three-dimensional coordinate data of different tested limb movement skeletal joint points into the neural network, continuously adjusting the interconnection relationship among a large number of nodes in the neural network, outputting the data which is continuously close to expected output, namely, continuously outputting a result which is close to a real result, continuously improving the accuracy of point supplement, enabling the data after the point supplement to be gradually close to the real skeleton information of a human body, namely, gradually approaching the real motion track of the human body after the point supplement, and then calculating limb motion parameters by using the data after the point supplement.

The specific steps of model training are as follows:

s211: utilizing a three-dimensional motion capture system to calibrate the collected three-dimensional coordinate data of the skeletal joint points under the condition of free motion of a group of complete testees, deducting data of a plurality of mark points from the complete data, and inputting the data of missing part of the mark points at a certain moment into a neural network;

s212: taking data containing complete information as expected output of the neural network, comparing the output of the neural network with the expected output, and forming a mapping relation between the input and the output in the neural network through multiple iterations;

s213: when the input and the output form a good mapping relation, the training of the neural network is completed, and then the test data is input into the neural network to complete the missing of part of the mark points at a certain moment. The test data is data of a part of mark point missing phenomenon at a certain moment in the data acquisition process.

Further, a neural network with attention mechanism includes: the coding component and the decoding component and the connection between the coding component and the decoding component; the coding component consists of a stack of encoders, and the decoding component consists of the same number of decoders; firstly, converting each frame of coordinate sequence input into a vector in an encoder at the bottommost layer, then enabling the vector to pass through two layers (a self-attention neural network and a feedforward neural network) of each encoder, enabling the self-attention to pay attention to the three-dimensional coordinates of each frame of the whole input sequence, helping the network to better encode the sequence, wherein the feedforward neural network aims to fit a function, and finally, the training time of the feedforward neural network is reduced by utilizing summation normalization; an attention vector set is output to a decoder after passing through all encoders, a real number vector is generated after passing through all decoders, and the real number vector is converted into a three-dimensional bone joint point sequence to be output after passing through a linear transformation layer and a flexible maximum value transfer function.

The invention has the beneficial effects that:

1) the depth camera adopted by the invention is a non-contact device, and cannot influence the movement of a tested object;

2) the invention does not restrict the movement of the testee, and the detected body and limb movement information of the human body is richer and more comprehensive;

3) the invention effectively eliminates the condition of data loss which can occur when the depth camera is used for collecting data.

Additional advantages, objects, and features of the invention will be set forth in part in the description which follows and in part will become apparent to those having ordinary skill in the art upon examination of the following or may be learned from practice of the invention. The objectives and other advantages of the invention may be realized and attained by the means of the instrumentalities and combinations particularly pointed out hereinafter.

Drawings

For the purposes of promoting a better understanding of the objects, aspects and advantages of the invention, reference will now be made to the following detailed description taken in conjunction with the accompanying drawings in which:

FIG. 1 is a schematic structural diagram of a method for detecting a motion parameter of a limb according to the present invention;

FIG. 2 is a flow chart of a method of detecting a limb movement parameter of the present invention;

FIG. 3 is a schematic diagram of a neural network model training with attention mechanism according to the present invention;

FIG. 4 is a human skeletal joint points extracted by the depth camera;

FIG. 5 is a schematic diagram of the internal structure of a neural network model with attention mechanism in the present invention.

Detailed Description

The embodiments of the present invention are described below with reference to specific embodiments, and other advantages and effects of the present invention will be easily understood by those skilled in the art from the disclosure of the present specification. The invention is capable of other and different embodiments and of being practiced or of being carried out in various ways, and its several details are capable of modification in various respects, all without departing from the spirit and scope of the present invention. It should be noted that the drawings provided in the following embodiments are only for illustrating the basic idea of the present invention in a schematic way, and the features in the following embodiments and examples may be combined with each other without conflict.

Referring to fig. 1 to 5, the present invention provides a technical method for enhancing the detection of limb movement parameters by a depth camera, comprising the following steps:

the method comprises the following steps: a depth camera is selected as data acquisition equipment, and the depth camera can acquire three-dimensional coordinate information of human body bone joint points (as shown in figure 4). The testee stands in front of the equipment and freely moves at will (as shown in figure 1), a visual capture sensor in the depth camera is used for acquiring the three-dimensional data of the bone joints in the process of the free movement of the testee in real time, and the motion data acquired by the depth camera, namely the three-dimensional coordinate sequence of the corresponding bone joint points, is stored as a file.

Step two: for the case that all the mark points are missing after a certain time, the neural network with attention mechanism is used to perform the predictive reconstruction of data, as shown in fig. 5, the neural network model with attention mechanism is divided into an encoding component and a decoding component and the connection between them, the encoding component is composed of a stack of encoders, and the decoding component is composed of the same number of decoders. The coordinate sequence of each frame of input is firstly converted into a vector in the encoder at the lowest layer, and then the vector passes through two layers of each encoder (self-attention and feedforward neural network), wherein the self-attention focuses on the three-dimensional coordinates of each frame of the whole input sequence to help the model to better encode the sequence, and the aim of the feedforward neural network is to fit a function, and the aim of summation normalization is to reduce the training time of the feedforward neural network. An attention vector set is output to a decoder after passing through all encoders, a real number vector is generated after passing through all decoders, and the real number vector is converted into a three-dimensional bone joint point sequence to be output after passing through a linear transformation layer and a flexible maximum value transfer function.

The method comprises the steps that a group of complete three-dimensional coordinate sequences of skeletal joint points of a human body with sufficient data volume during limb movement needs to be collected, after a start password is audited, the motion is freely made without any restriction on the audited motion until the end password is audited, after data collected by a depth camera is calibrated with three-dimensional motion capture data, the data at a certain frame time is removed and used as input of an untrained depth self-attention transformation model, the collected complete three-dimensional coordinate sequences of the skeletal joint points are used as expected output, the weight of the depth self-attention transformation model is continuously adjusted through a back propagation algorithm, and a model which is closer to expected output is trained. And (4) carrying out prediction reconstruction on the test data by using the trained model, and then calculating the limb movement parameters by using the data after prediction reconstruction.

Step three: a neural network with an attention mechanism is used for completing the missing of part of mark points at a certain moment, a complete group of data is firstly subjected to random removal of three-dimensional coordinates of certain joint points in certain frames, then the group of data is input into an untrained deep self-attention transformation model, parameters (weights) of the model are generated randomly at the moment, the complete data is used for being compared with an output result of the model, the weights of the deep self-attention transformation model are slightly adjusted by using a back propagation algorithm, and the model with the output result closer to real data is trained. And (3) performing partial mark point missing completion on the test data at a certain moment by using the trained model, and then calculating the limb motion parameters by using the data after point completion.

Example 1: technical method and implementation for enhancing the detection of upper limb movement parameters by a depth camera, comprising the steps of:

as shown in fig. 1 to 2, the subject stands in front of the depth camera and is tried to make various movements with the upper limbs without any constraint for a prescribed time, for example: reach motion, elbow flexion and extension motion, etc. The process needs to be repeated n times, and the value of n is automatically judged by an operator according to the current situation.

Selecting a group with the most complete data and calibrated data in the n times of experimental data as training set data;

when three-dimensional coordinates of skeletal joint points of upper limb movement of a subject are acquired, situations with insufficient data amount may occur, such as: the upper limb movement of the tested person is inconvenient, the coordinate sequence acquired within the specified time is short, the subsequent data analysis is not facilitated, the data of the tested person needs to be predicted and reconstructed at the moment to obtain sufficient data quantity, the accuracy of the data after prediction and reconstruction needs to be guaranteed to a certain extent, and the closer the data is to the real data, the better the analysis result of the upper limb movement data is. Aiming at the condition that all mark points are missing after a certain moment, model training is firstly carried out, sequence information (k value is selected by an operator) of a tested complete upper limb movement skeletal joint point three-dimensional coordinate sequence after a kth frame is removed, a front k frame sequence is input into an original depth self-attention transformation model, a prediction sequence is obtained according to original weight in the model, a result sequence is compared with expected output of the user, namely the tested complete upper limb movement skeletal joint point three-dimensional coordinate sequence, the difference between the two is returned to the model through a back propagation algorithm, the depth self-attention transformation model adjusts the weight of the depth self-attention transformation model according to the difference between the two, the mapping relation between the input and the output is continuously optimized through multiple iterations, the input result of the model is gradually close to the expected output of the invention, namely, the prediction reconstruction result obtained through the depth self-attention transformation model is gradually approximate to real motion data, when this mapping is applied to all the marker-point missing instances after a certain time, the missing part of the instance can be predicted according to the attribute characteristics of the instance. Then, the predicted and reconstructed data can be used for calculating the motion parameters of the tested limb.

In the data acquisition process, the information of part of the bone joint points at a certain frame time is not acquired and is blocked, so that the condition that the three-dimensional coordinates of the part of the bone joint points are lost at certain time is caused, namely, the three-dimensional coordinate lost coordinate sequence of the individual bone joint points is incomplete, and certain influence is caused on the analysis result of the subsequent upper limb movement data of the tested upper limb, at the moment, the point supplementation is required to be carried out on the three-dimensional coordinate data of the tested complete upper limb movement bone joint points to obtain a complete coordinate sequence, and the data after the point supplementation is as close as possible to the real movement data. Aiming at the condition that part of mark points at a certain moment are missing, model training is firstly carried out, three-dimensional coordinate data of a certain skeleton joint point at certain moment are randomly removed from a complete three-dimensional coordinate sequence of the skeleton joint point of the upper limb movement, the sequence is input into an untrained depth self-attention conversion model, a result sequence is obtained according to the original weight of the untrained model, the result sequence is compared with expected output of people, namely the three-dimensional coordinate sequence of the skeleton joint point in the complete walking process of a human body, the difference between the two is returned to the model through a back propagation algorithm, the depth self-attention conversion model adjusts the weight of the depth self-attention conversion model according to the difference between the two, the mapping relation of the input and the output is continuously optimized through multiple iterations, the input result of the model is gradually close to the expected output of the people, namely, the point supplement result obtained after the depth self-attention conversion model is closer to the real movement data, when this mapping is applied to a partially marked point missing instance at a certain time, the missing part of the instance can be predicted according to the attribute characteristics of the instance. And then, calculating the limb movement parameters of the testee by using the data after point supplementation.

Example two: the technical method and implementation for enhancing detection of human gait parameters by a depth camera comprise the following steps:

as shown in fig. 1, a tested station completes a gait data acquisition process before a distance depth camera, and the process needs to be repeated for n times;

selecting a group with the most complete data in the n times of experimental data and subjected to three-dimensional dynamic capture data calibration as training set data;

in the data acquisition process, various emergencies may occur, if a phenomenon that a coordinate sequence is too short occurs when acquiring the three-dimensional coordinate data of the skeletal joint point in the walking process of the tested human body, namely the data volume is insufficient, and subsequent gait analysis cannot be performed, the three-dimensional coordinate data of the skeletal joint point in the walking process of the tested human body needs to be predicted and reconstructed to obtain sufficient data volume, but the accuracy requirement on the data after prediction and reconstruction is higher, and the gait analysis result is more accurate as the data is closer to the real data. Aiming at the condition that all mark points are missing after a certain moment, model training is firstly carried out (as shown in figure 3), sequence information after the k frame is removed from a three-dimensional coordinate sequence of skeletal joint points acquired in the process of trial walking is input into an original depth self-attention transformation model, a result sequence is obtained according to the original weight in the model, the result sequence is compared with expected output of people, namely the three-dimensional coordinate sequence of skeletal joint points in the complete human body walking process, the difference between the two is returned to the model through a back propagation algorithm, the depth self-attention transformation model adjusts the weight of the depth self-attention transformation model according to the difference between the two, the input result of the model gradually approaches the expected output of the people through a plurality of times of iterative cycle, namely, the predicted reconstruction result obtained through the depth self-attention transformation model is close to real motion data, when the mapping relation is applied to all the marker point missing examples after a certain time, the missing part of the example can be predicted according to the attribute characteristics of the example, and then the motion parameters of the tested limb can be calculated by using the data after prediction and reconstruction;

inputting the three-dimensional coordinate sequences of all the marker point deletions after a certain moment in the acquisition process into a trained depth self-attention conversion model, and calculating the model to obtain complete coordinate information after prediction and reconstruction.

In the data acquisition process, the information of part of the bone joint points at a certain frame time is not acquired and is blocked, so that the condition that the three-dimensional coordinates of the part of the bone joint points are missing at certain time is caused, namely, the three-dimensional coordinate missing coordinate sequence of the individual bone joint point is incomplete, and certain influence is caused on the subsequent gait analysis result of the tested person, at the moment, the three-dimensional coordinate data of the bone joint points in the walking process of the tested person needs to be supplemented to obtain a complete coordinate sequence, and the data after point supplementation is close to the real motion data as much as possible. Aiming at the condition that part of mark points at a certain moment are missing, model training is firstly carried out (as shown in figure 3), three-dimensional coordinate data of a certain skeleton joint point at certain moment are randomly removed from a complete three-dimensional coordinate sequence of the skeleton joint point, the sequence is input into an untrained depth self-attention transformation model, a result sequence is obtained according to the original weight of the untrained model, the result sequence is compared with expected output of people, namely the three-dimensional coordinate sequence of the skeleton joint point in the whole human body walking process, the difference between the three-dimensional coordinate sequence and the untrained depth self-attention transformation model is returned to the model through a back propagation algorithm, the depth self-attention transformation model adjusts the weight of the depth self-attention transformation model according to the difference between the two, and through multiple iterations, the mapping relation between the input and the output is continuously optimized, the input result of the model is gradually close to the expected output of people, namely, the point complementing result obtained through the depth self-attention transformation model is closer to the real motion data, when the mapping relation is applied to a part of the mark point missing example at a certain moment, the missing part of the example can be predicted according to the attribute characteristics of the example, and then the data after point supplementation can be used for calculating the limb movement parameters of the testee.

Finally, the above embodiments are only intended to illustrate the technical solutions of the present invention and not to limit the present invention, and although the present invention has been described in detail with reference to the preferred embodiments, it will be understood by those skilled in the art that modifications or equivalent substitutions may be made on the technical solutions of the present invention without departing from the spirit and scope of the technical solutions, and all of them should be covered by the claims of the present invention.

Claims

1. A method for enhancing detection of limb motion parameters by a depth camera, the method comprising the steps of:

s1: collecting limb movement data of a human body by using a depth camera;

2. The method for enhancing detection of limb movement parameters by a depth camera according to claim 1, wherein the step S1 specifically comprises: and acquiring the three-dimensional coordinate data of the skeletal joint points in the free movement process of the testee in real time by using the depth camera, and storing the three-dimensional coordinate data of the skeletal joint points acquired by the depth camera as a file.

3. The method for enhancing the detection of the limb movement parameters by the depth camera according to claim 1, wherein in step S2, when it is detected that all the marker points of the collected human limb movement data are missing at a certain time, the specific steps of training the neural network with attention mechanism are as follows:

s203: when the input and the output reach a proper mapping relation, the training of the neural network is completed, and then the test data is input into the neural network for prediction reconstruction.

4. The method for enhancing the detection of the limb movement parameters by the depth camera according to claim 3, wherein in step S203, the test data is data of all the marker point missing situations after a certain time in the data acquisition process.

5. The method for enhancing the detection of the limb movement parameters by the depth camera according to claim 1, wherein in step S2, when the phenomenon that part of the markers of the collected human limb movement data are missing at a certain moment is detected, the training of the neural network with attention mechanism includes the following steps:

s213: when the input and the output form a proper mapping relation, the training of the neural network is completed, and then the test data is input into the neural network to complete the missing of part of the mark points at a certain moment.

6. The method for enhancing detection of limb movement parameters by a depth camera according to claim 5, wherein in step S213, the test data is data of partial marker point missing phenomenon at a certain moment in the data acquisition process.

7. The method for enhancing depth camera detection of limb motion parameters according to claim 3 or 5, wherein the neural network comprises: assigning a correct value to the weight of the connection of each node of the hidden layer; when an input vector is given, determining an output vector according to the weight; in the initial stage, all weights are randomly assigned; for each input value of the training set, after forward calculation of the neural network, the obtained output value is compared with an expected output value, and then the obtained error is transmitted back to the previous network layer, namely a back propagation algorithm; then the weight is correspondingly adjusted according to the error; the neural network needs to iterate repeatedly until the error of the output is lower than a set threshold.

8. The method for enhancing depth camera detection of limb motion parameters according to claim 3 or 5, wherein the neural network with attention mechanism comprises: the coding component and the decoding component and the connection between the coding component and the decoding component; the coding component consists of a stack of encoders, and the decoding component consists of the same number of decoders; firstly, converting each frame of coordinate sequence input into a vector in an encoder at the bottom layer, then enabling the vector to pass through a self-attention and feedforward neural network of each encoder, enabling the self-attention to pay attention to the three-dimensional coordinates of each frame of the whole input sequence, helping the network to better encode the sequence, wherein the feedforward neural network aims to fit a function, and finally reducing the training time of the feedforward neural network by using summation normalization; an attention vector set is output to a decoder after passing through all encoders, a real number vector is generated after passing through all decoders, and the real number vector is converted into a three-dimensional bone joint point sequence to be output after passing through a linear transformation layer and a flexible maximum value transfer function.