CN110349673B - Group constitution evaluation method based on Gaussian mixture distribution - Google Patents

Group constitution evaluation method based on Gaussian mixture distribution Download PDF

Info

Publication number
CN110349673B
CN110349673B CN201910570304.4A CN201910570304A CN110349673B CN 110349673 B CN110349673 B CN 110349673B CN 201910570304 A CN201910570304 A CN 201910570304A CN 110349673 B CN110349673 B CN 110349673B
Authority
CN
China
Prior art keywords
data
model
group
distribution
gaussian mixture
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910570304.4A
Other languages
Chinese (zh)
Other versions
CN110349673A (en
Inventor
赵宏伟
张宝亮
赵浩宇
范丽丽
胡黄水
李星
姚瑶
张原瑞
王万鹏
刘萍萍
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Jilin University
Original Assignee
Jilin University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Jilin University filed Critical Jilin University
Priority to CN201910570304.4A priority Critical patent/CN110349673B/en
Publication of CN110349673A publication Critical patent/CN110349673A/en
Application granted granted Critical
Publication of CN110349673B publication Critical patent/CN110349673B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • G16H50/30ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for calculating health indices; for individual health risk assessment

Abstract

The invention discloses a population constitution evaluation method based on Gaussian mixture distribution, which comprises the following steps: step 1: randomly dividing unlabeled physical test data into a plurality of sections as test data and training data; step 2: preprocessing each section of test data and training data in the step 1; and step 3: extracting each group of characteristic data by using an unsupervised learning algorithm; and 4, step 4: fitting each group of characteristic data obtained in the step 3, and judging the number of mixed distribution; and 5: calculating the weight and the average value of each mixed distribution by using an EM algorithm; step 6: and (4) establishing a three-level evaluation model, and substituting the observation and calculation results obtained in the step (4) and the step (5) into the three-level evaluation model and the group constitution evaluation quantification formula to obtain a grade and a grading result. The method is completely independent of individual constitution evaluation results, and the group constitution evaluation results can be obtained without depending on the individual evaluation results.

Description

Group constitution evaluation method based on Gaussian mixture distribution
Technical Field
The invention belongs to the field of physique evaluation, relates to a physique health evaluation method, and particularly relates to a population physique evaluation method based on Gaussian mixture distribution.
Background
Constitutions refer to the quality of the human body, and are the comprehensive and relatively stable characteristics of morphological structure, physiological functions, and psychological factors of the human body expressed on the basis of heredity and acquirement. Physical health assessment has been a hot topic in the field of health research. The physique evaluation can evaluate the national physique and health condition by using scientific indexes and methods, and further continuously improve and strengthen the national physique. At present, many domestic scholars have made a lot of active and beneficial exploration and practice on the research of physique monitoring and evaluation systems, and have achieved good effects. Most of the existing research results are obtained by obtaining expert knowledge through early investigation to obtain various evaluation indexes and weight coefficients thereof, and then, the existing statistical formulas or curve fitting technology is utilized to evaluate the individual constitutions. In the last two decades of published literature, assessing the physical health of a population is a simple statistic of the results of individual assessments.
Machine learning is a branch of artificial intelligence and in many cases is almost a pronoun for artificial intelligence. Machine learning systems are used to identify objects in images, transcribe speech into text, match news items, posts or products to the interests of the user, and select relevant results for searches. It is also an important medical auxiliary means, and has important application value in the field of medical care. Although the evaluation model is widely applied in other fields, the problem of assessing the health of the population in a complex data environment is still a worthwhile and not deeply researched problem.
Disclosure of Invention
In order to solve the problem of the existing group constitution evaluation, the invention provides a group constitution evaluation method based on Gaussian mixture distribution. The method has the core idea that a convolutional neural network is used for automatically learning characteristics from original sports test data without supervision, a group constitution three-level evaluation model is provided based on Gaussian mixture distribution, and the learned characteristics are sent to the evaluation model to obtain a group constitution evaluation result.
The purpose of the invention is realized by the following technical scheme:
a population constitution evaluation method based on Gaussian mixture distribution comprises the following steps:
step 1: randomly dividing unlabeled physical test data into a plurality of sections as test data and training data;
step 2: preprocessing each section of test data and training data in the step 1;
and step 3: taking the training data preprocessed in the step 2 as each input of a convolutional neural network model, and extracting each group of characteristic data by using an unsupervised learning algorithm;
and 4, step 4: fitting each group of characteristic data obtained in the step 3, and judging the number of mixed distribution;
and 5: calculating the weight and mean value of each mixed distribution by using an EM (effective electromagnetic) algorithm according to each group of characteristic data obtained in the step 3;
step 6: and (4) establishing a three-level evaluation model, and substituting the observation and calculation results obtained in the step (4) and the step (5) into the three-level evaluation model and the group constitution evaluation quantification formula to obtain a grade and a grading result.
Compared with the prior art, the invention has the following advantages:
1. the method is completely independent of individual constitution evaluation results, and the group constitution evaluation results can be obtained without depending on the individual evaluation results.
2. The method fully considers the population constitution distribution characteristics, can be used for the constitution evaluation of all regions and all classes of people, and has the characteristics of global property and universality.
Drawings
FIG. 1 is a training flow chart of the method for assessing population constitution based on Gaussian mixture distribution according to the present invention;
FIG. 2 is a diagram of a feature extraction convolutional neural network in the present invention;
FIG. 3 is a test data feature distribution diagram of the Gaussian mixture distribution based population fitness evaluation method of the present invention;
FIG. 4 is a graph of two sets of Gaussian mixture distributions under test data in the present invention.
Detailed Description
The technical solution of the present invention is further described below with reference to the accompanying drawings, but not limited thereto, and any modification or equivalent replacement of the technical solution of the present invention without departing from the spirit and scope of the technical solution of the present invention shall be covered by the protection scope of the present invention.
The invention provides a group constitution evaluation method based on Gaussian mixture distribution, which takes the characteristic distribution situation of group constitution into consideration and has important influence on group constitution evaluation. The method provides a physical health assessment model mainly comprising characteristic learning and characteristic assessment by using a machine learning method, and aims to establish a group physical health state assessment model by analyzing group physical test data independent of individual assessment results so as to master group health states. As shown in fig. 1, the physical fitness evaluation method specifically includes the following steps:
step 1: the unlabeled sports test data is divided into several segments as training data.
And dividing the unlabeled sports test data into a plurality of sections as training data, and inputting the training data into the model as the input of the convolutional neural network. According to different test items, data are divided into 7 dimensions, 1800 data items are randomly extracted from each dimension to serve as test data, and the rest data serve as training data. Of these training data, 1800 data items are randomly drawn at a time as a training input.
Step 2: and carrying out preprocessing operation on each section of data.
Before feature extraction, we need to perform normalization processing on the data. In feature extraction, normalization is very important. This is because there may be multiple features from sample to sample, and the scale of the values of the different features is different. Without normative handling, the large differences in dimensions may cause the entire model to fail.
In this step, we adopt 7-dimensional data, and in order to maintain the distribution characteristics of the original data, we adopt a 0-1 normalization method:
Figure GDA0003662649060000041
wherein X is an input data item comprising training data and test data, X max For the maximum term in this set of data, X min The min term for this set of data. In the invention, max is 1, and min is 0. The data normalization formula can be written as:
Figure GDA0003662649060000042
eventually all data is normalized to between 0-1.
And step 3: feature information is extracted using an unsupervised learning algorithm.
In the step, a model consisting of two layers of convolutional neural networks is used to convert a large amount of original signals into reduction set characteristics. As shown in fig. 2, the convolutional neural network includes two convolutional layers, two activation layers, and two pooling layers. In the convolutional layer, the size of a convolutional kernel is set to be 3 multiplied by 1, and the step length is designed to be 1; the activation layer adopts a ReLu activation function; the pooling layer filter was set to 2 x 1, using the maximum pooling function. And (3) taking the data in the step (2) as each input of the model, and obtaining the feature mapping of the constitution data after passing through the convolution layer, the activation layer and the pooling layer. And by combining the thought of self-coding, the reconstruction error between the input data and the reconstruction input is analyzed, and the network parameters are fed back and adjusted, so that better learning characteristics can be obtained finally. And 3, 5000 times of iteration is needed, and when the error approaches zero, the corresponding characteristic column can be taken out.
And 4, step 4: and fitting the characteristic data and judging the number of Gaussian mixture distribution.
In this step, the mixed distribution of the feature data needs to be observed. And (3) fitting the characteristic data obtained in the step (3) by using a fitting function in a Python language, observing the data distribution condition, and recording the distribution number of the mixed distribution.
And 5: and calculating the weight and the average value of each mixed distribution by using an EM algorithm.
In this step, the weight and the average value of each mixed distribution in the corresponding group are calculated by using the characteristic data of each group obtained in the step 3. The EM algorithm process is shown in table 1:
TABLE 1
Figure GDA0003662649060000051
Figure GDA0003662649060000061
Step 6: and (4) establishing a three-level evaluation model, and substituting the calculation result of the step (5) into a group constitution evaluation quantification formula to obtain a grade and a grading result.
Step 6.1:
the three-level evaluation model established in this step is shown in table 2:
TABLE 2
Figure GDA0003662649060000071
Figure GDA0003662649060000081
Wherein K is the number of partial models, alpha max Is the largest weight, mu, of the K fractional models n And mu m The average values are respectively corresponding to the two maximum weight component models. a (0 < a < 1) is a threshold used to describe the weight difference, and b (0 < b < 1) is a distance threshold used to describe the two distributions with the greatest weight. In this step, the weight difference threshold value a is set to 0.3, and the distance threshold value b is set to 0.3. Substituting the distribution number, weight and average value observed and calculated in the steps 4 and 5 into the table 2 to obtain the constitution evaluation grade.
When the feature is expressed as a single Gaussian model or a multi-Gaussian model and satisfies the inequality alpha max -(1-α max ) At > alpha, scoreSetting as A grade;
when the features are expressed as multiple Gaussian models and satisfy the inequality set
Figure GDA0003662649060000082
When the test is finished, the test is rated as B;
when the features are expressed as multiple Gaussian models and satisfy the inequality set
Figure GDA0003662649060000083
When the test was completed, the rating was C.
Step 6.2: and calculating an evaluation result.
According to the distribution number, the weight and the average value, the constitution evaluation result of the belonged group can be calculated by combining the group constitution evaluation quantification formula designed in the invention. The quantitative formula for the group constitution evaluation is as follows:
Figure GDA0003662649060000091
in the formula, the function h is expressed as that when the input is greater than 0, the function value is the input itself, otherwise, the function value is 0, and the other parameters are the same as those in step 6.1.
In the experiment, the method is tested by using the sports test result of the whole school (girl) published by a certain college website. According to the latest revised national student physical health standards of the ministry of education of China, the BIM, the vital capacity, the standing long jump, the forward bending of the sitting position, the running at 50 meters, the running at 800 meters and the sit-up at one minute are selected as seven test items, and a characteristic probability distribution diagram is obtained, as shown in figure 3.
From this probability distribution map, the features follow a substantially gaussian distribution. However, the probability distribution of some features is not subjected to single gaussian distribution, but to mixed gaussian distribution, and the results are obtained by using the group constitution evaluation quantitative formula according to the three-level evaluation model established in step 6.1, as shown in table 3.
TABLE 3
Figure GDA0003662649060000092
Finally, the gaussian mixture distribution map of BIM and vital capacity can be split into a single gaussian distribution map, as shown in fig. 4.

Claims (6)

1. A population constitution assessment method based on Gaussian mixture distribution is characterized by comprising the following steps:
step 1: randomly dividing unlabeled physical test data into a plurality of sections as test data and training data;
step 2: preprocessing each section of test data and training data in the step 1;
and step 3: taking the training data preprocessed in the step 2 as each input of a convolutional neural network model, and extracting each group of characteristic data by using an unsupervised learning algorithm;
and 4, step 4: fitting each group of characteristic data obtained in the step 3, and judging the number of mixed distribution;
and 5: calculating the weight and the mean value of each mixed distribution by using an EM (effective electromagnetic radiation) algorithm according to each group of characteristic data obtained in the step 3;
step 6: establishing a third-level evaluation model, substituting the observation and calculation results of the step 4 and the step 5 into a third-level evaluation model and a group constitution evaluation quantification formula to obtain a grade and a grading result, wherein the third-level evaluation model comprises a grade A, a grade B and a grade C, and the third-level evaluation model comprises the following steps:
the A grade satisfies one of the following conditions:
the characteristics are expressed as a single Gaussian model, and K is 1;
the characteristics are expressed as a Gaussian mixture model, K > is 2, and the following conditions are satisfied:
α max -(1-α max )>a;
the B-level features are expressed as a multiple Gaussian model, K > 2, and satisfy the set of inequalities:
Figure FDA0003662649050000011
the C grade characteristic is expressed as a multiple Gaussian model, K > is 2, and satisfies the inequality group:
Figure FDA0003662649050000021
in the formula, K is the number of the partial models, alpha max Is the largest weight, mu, of the K fractional models n And mu m The average values respectively corresponding to the two maximum weight component models are used for describing a threshold value of the weight difference, b is used for describing a distance threshold value of the two maximum weight distributions, and the group constitution assessment quantification formula is defined as follows:
Figure FDA0003662649050000022
in the formula, K is the number of the partial models, alpha max Is the largest weight, mu, of the K fractional models n And mu m The function h is expressed as that when the input is greater than 0, the function value is the input itself, otherwise, the function value is 0.
2. The method for assessing the fitness of a population based on the Gaussian mixture distribution as claimed in claim 1, wherein in the step 1, each set of the test data and the training data comprises 1800 data items.
3. The method for assessing the group constitution based on Gaussian mixture distribution as claimed in claim 1, wherein in the step 2, each segment of test data and training data is preprocessed by a 0-1 normalization method, wherein: the data normalization formula is as follows:
Figure FDA0003662649050000023
in which X is an input data item, X max For the maximum term in this set of data, X min The min term for this set of data.
4. The method for population quality assessment based on Gaussian mixture distribution as claimed in claim 1, wherein in said step 3, the convolutional neural network comprises two convolutional layers, two activation layers and two pooling layers, wherein: in the convolutional layer, the size of a convolutional kernel is set to be 3 multiplied by 1, and the step length is designed to be 1; the activation layer adopts a ReLu activation function; the pooling layer filter was set to 2 x 1, using the maximum pooling function.
5. The method according to claim 1, wherein in the step 4, the characteristic data of each group obtained in the step 3 are fitted by using a Python fitting function, the data distribution is observed, and the distribution number of the mixture distribution is recorded.
6. The method for population health assessment based on Gaussian mixture distribution as claimed in claim 1, wherein in said step 5, the E step of the EM algorithm is:
calculating the partial model k to the observation data y according to the current model parameters i The formula is as follows:
Figure FDA0003662649050000031
wherein alpha is k Is a coefficient, α k ≥0,
Figure FDA0003662649050000032
φ(y ik ) Is the density of the gaussian distribution,
Figure FDA0003662649050000033
μ k is the mean difference of the kth partial model,
Figure FDA0003662649050000034
is the variance of the kth partial model;
the M steps of the EM algorithm are as follows:
calculating the model parameters of a new iteration, and the formula is as follows:
Figure FDA0003662649050000035
Figure FDA0003662649050000041
Figure FDA0003662649050000042
wherein the content of the first and second substances,
Figure FDA0003662649050000043
is the mean value in the new round of iterative model;
Figure FDA0003662649050000044
is the variance in the new iteration model;
Figure FDA0003662649050000045
is the weight in the new iteration model;
Figure FDA0003662649050000046
is the model k to the observed data y i Is the total number of observed data.
CN201910570304.4A 2019-06-27 2019-06-27 Group constitution evaluation method based on Gaussian mixture distribution Active CN110349673B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910570304.4A CN110349673B (en) 2019-06-27 2019-06-27 Group constitution evaluation method based on Gaussian mixture distribution

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910570304.4A CN110349673B (en) 2019-06-27 2019-06-27 Group constitution evaluation method based on Gaussian mixture distribution

Publications (2)

Publication Number Publication Date
CN110349673A CN110349673A (en) 2019-10-18
CN110349673B true CN110349673B (en) 2022-08-05

Family

ID=68177158

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910570304.4A Active CN110349673B (en) 2019-06-27 2019-06-27 Group constitution evaluation method based on Gaussian mixture distribution

Country Status (1)

Country Link
CN (1) CN110349673B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117114436A (en) * 2023-07-27 2023-11-24 中冶建筑研究总院有限公司 Existing prestressed concrete member performance evaluation method based on measured data

Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103745122A (en) * 2014-01-23 2014-04-23 山东农业大学 Computer processing method for student exercise prescriptions
CN104391987A (en) * 2014-12-13 2015-03-04 吉林大学 Large-scale target identification method based on mobile platform
CN104504296A (en) * 2015-01-16 2015-04-08 湖南科技大学 Gaussian mixture hidden Markov model and regression analysis remaining life prediction method
CN105338343A (en) * 2015-10-20 2016-02-17 北京理工大学 No-reference stereo image quality evaluation method based on binocular perception
CN105956392A (en) * 2016-04-28 2016-09-21 南京师范大学 Student physical health evaluation management system and method
CN107832716A (en) * 2017-11-15 2018-03-23 中国科学技术大学 Method for detecting abnormality based on active-passive Gauss on-line study
CN108038300A (en) * 2017-12-07 2018-05-15 长春理工大学 Optical fiber state evaluating method based on improved membership function combination neutral net
CN108172297A (en) * 2018-01-29 2018-06-15 广东工业大学 A kind of appraisal procedure of upper-limbs rehabilitation training robot rehabilitation training function
CN108563806A (en) * 2018-01-05 2018-09-21 哈尔滨工业大学(威海) Engine air passage parameter long-range forecast method based on similitude and system
CN109101738A (en) * 2018-08-24 2018-12-28 河北工业大学 A kind of IGBT module degree of aging appraisal procedure
CN109583500A (en) * 2018-11-30 2019-04-05 中共中央办公厅电子科技学院 A kind of aesthetic images quality prediction system and method based on depth drift-diffusion method
CN109710763A (en) * 2018-12-27 2019-05-03 郑州云海信息技术有限公司 A kind of classification method of text data, device and system
CN109859110A (en) * 2018-11-19 2019-06-07 华南理工大学 The panchromatic sharpening method of high spectrum image of control convolutional neural networks is tieed up based on spectrum

Patent Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103745122A (en) * 2014-01-23 2014-04-23 山东农业大学 Computer processing method for student exercise prescriptions
CN104391987A (en) * 2014-12-13 2015-03-04 吉林大学 Large-scale target identification method based on mobile platform
CN104504296A (en) * 2015-01-16 2015-04-08 湖南科技大学 Gaussian mixture hidden Markov model and regression analysis remaining life prediction method
CN105338343A (en) * 2015-10-20 2016-02-17 北京理工大学 No-reference stereo image quality evaluation method based on binocular perception
CN105956392A (en) * 2016-04-28 2016-09-21 南京师范大学 Student physical health evaluation management system and method
CN107832716A (en) * 2017-11-15 2018-03-23 中国科学技术大学 Method for detecting abnormality based on active-passive Gauss on-line study
CN108038300A (en) * 2017-12-07 2018-05-15 长春理工大学 Optical fiber state evaluating method based on improved membership function combination neutral net
CN108563806A (en) * 2018-01-05 2018-09-21 哈尔滨工业大学(威海) Engine air passage parameter long-range forecast method based on similitude and system
CN108172297A (en) * 2018-01-29 2018-06-15 广东工业大学 A kind of appraisal procedure of upper-limbs rehabilitation training robot rehabilitation training function
CN109101738A (en) * 2018-08-24 2018-12-28 河北工业大学 A kind of IGBT module degree of aging appraisal procedure
CN109859110A (en) * 2018-11-19 2019-06-07 华南理工大学 The panchromatic sharpening method of high spectrum image of control convolutional neural networks is tieed up based on spectrum
CN109583500A (en) * 2018-11-30 2019-04-05 中共中央办公厅电子科技学院 A kind of aesthetic images quality prediction system and method based on depth drift-diffusion method
CN109710763A (en) * 2018-12-27 2019-05-03 郑州云海信息技术有限公司 A kind of classification method of text data, device and system

Also Published As

Publication number Publication date
CN110349673A (en) 2019-10-18

Similar Documents

Publication Publication Date Title
CN105279495B (en) A kind of video presentation method summarized based on deep learning and text
CN106503055B (en) A kind of generation method from structured text to iamge description
CN110781298B (en) Medicine classification method, apparatus, computer device and storage medium
CN109165692B (en) User character prediction device and method based on weak supervised learning
CN111126386B (en) Sequence domain adaptation method based on countermeasure learning in scene text recognition
CN109620152B (en) MutifacolLoss-densenert-based electrocardiosignal classification method
CN110070895B (en) Mixed sound event detection method based on factor decomposition of supervised variational encoder
CN109994201B (en) Diabetes and hypertension probability calculation system based on deep learning
CN110705247B (en) Based on x2-C text similarity calculation method
CN104636580A (en) Health monitoring mobile phone based on human face
CN111461025B (en) Signal identification method for self-evolving zero-sample learning
CN109657011A (en) A kind of data digging method and system screening attack of terrorism criminal gang
CN112989802B (en) Bullet screen keyword extraction method, bullet screen keyword extraction device, bullet screen keyword extraction equipment and bullet screen keyword extraction medium
CN108567418A (en) A kind of pulse signal inferior health detection method and detecting system based on PCANet
CN110751216A (en) Judgment document industry classification method based on improved convolutional neural network
CN115952292B (en) Multi-label classification method, apparatus and computer readable medium
CN110349673B (en) Group constitution evaluation method based on Gaussian mixture distribution
CN115309860A (en) False news detection method based on pseudo twin network
CN113011330B (en) Electroencephalogram signal classification method based on multi-scale neural network and cavity convolution
CN114547303A (en) Text multi-feature classification method and device based on Bert-LSTM
CN112466284B (en) Mask voice identification method
CN113069117A (en) Electroencephalogram emotion recognition method and system based on time convolution neural network
CN111312394A (en) Psychological health condition evaluation system based on combined emotion and processing method thereof
CN111798455A (en) Thyroid nodule real-time segmentation method based on full convolution dense cavity network
CN116421200A (en) Brain electricity emotion analysis method of multi-task mixed model based on parallel training

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant