CN103294893A - Machine learning method for reducing inconsistency between traditional Chinese medicine subjective questionnaires - Google Patents

Machine learning method for reducing inconsistency between traditional Chinese medicine subjective questionnaires Download PDF

Info

Publication number
CN103294893A
CN103294893A CN2013101597369A CN201310159736A CN103294893A CN 103294893 A CN103294893 A CN 103294893A CN 2013101597369 A CN2013101597369 A CN 2013101597369A CN 201310159736 A CN201310159736 A CN 201310159736A CN 103294893 A CN103294893 A CN 103294893A
Authority
CN
China
Prior art keywords
questionnaire
npq
mpq
subjective
solution
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2013101597369A
Other languages
Chinese (zh)
Other versions
CN103294893B (en
Inventor
张钢
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong University of Technology
Original Assignee
Guangdong University of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong University of Technology filed Critical Guangdong University of Technology
Priority to CN201310159736.9A priority Critical patent/CN103294893B/en
Publication of CN103294893A publication Critical patent/CN103294893A/en
Application granted granted Critical
Publication of CN103294893B publication Critical patent/CN103294893B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Other Investigation Or Analysis Of Materials By Electrical Means (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a machine learning method for reducing the inconsistency between traditional Chinese medicine subjective questionnaires. The machine learning method comprises the following steps that 1), subjective questionnaire data is vectorized, wherein a subjective questionnaire consists of questions, weight and point values, and the vectorization allows a structure of the questionnaire to be converted into a vector; 2), a consistency target of questionnaire groups is defined and expressed; a contradiction function C(x) is defined to express the consistency between the questionnaire groups; the contradiction function takes a value obtained by converting a point value of the questionnaire group as an input, wherein negative correlation serves as the contradiction function in the process, and accords with a statistical learning theory; 3), main subjective questionnaires such as an NPQ (Nonverbal Personality Questionnaire), an MPQ (Mcgill Pain Questionnaire) and an SF-36 (Short Form-36) used by traditional Chinese medicine are subjected to consistency analysis, wherein each of the NPQ and the MPQ has a sub-questionnaire, and the SF-36 has eight sub-questionnaires; the following objective function of the consistency is defined according to the contradiction function in Step 2); and 4), the objective function is solved. The machine learning method reduces the inconsistency between results of different traditional Chinese medicine treatment effect evaluation questionnaires, and improves the accuracy of the evaluation on a traditional Chinese medicine treatment effect.

Description

A kind of machine learning method that reduces subjective the differing property of questionnaire of the traditional Chinese medical science
Technical field
The present invention is a kind of machine learning method that reduces subjective the differing property of questionnaire of the traditional Chinese medical science, belongs to the renovation technique of the machine learning method that reduces subjective the differing property of questionnaire of the traditional Chinese medical science.
Background technology
Survey is a kind of data collect means that social investigation causes.To be investigator's questionnaire of using unified design go and find out what's going on or the investigation method of consultation to the respondent who is selected for it.Survey is to write in a kind of research method that the mode of problem is gathered information.Survey is a kind of research mode of excavating true present situation, maximum purpose is to collect, accumulate the basic document of every education of science attribute of a certain target group, should consider and whether can reach goal in research smoothly and note the fitness of research sample on questionnaire.But there is subjectivity in survey.
In the Chinese traditional treatment process, the doctor needs to be grasped patient's subjective sensation with the order of severity of judgement disease and the effect for the treatment of.The result of cervical spondylopathy patient treatment report is the theory and technology based on psychometrics, obtains the patient in daily life, healthy impression with to the physical experience of aspects such as treatment measure subjectivity satisfaction by forms such as interview, self-appraisal questionnaires.Pain is a kind of subjective symptom of patient's perception, still is difficult to the service-strong objective indicator at present and measures.More accurate in order to make the subjective sensation measurement, researchers have designed the various questionnaires (scale) at different aspect.As in the treatment of cervical spondylopathy, use three kinds of different questionnaires of NPQ, MPQ and SF-36 to determine the sensation of patient's cervical spondylopathy usually, wherein NPQ and MPQ are for the questionnaire of weighing pain degree, and SF-36 is used for weighing life quality.In the treatment of the knee joint of orthopaedics, adopt WOMAC and SF-36 to determine the kneed sensation of patient usually, wherein WOMAC is the scale of international knee osteoarthritis, SF-36 is with aforementioned consistent.Patient after having answered each questionnaire, score values that can have one or more processes to add up and obtain all, wherein NPQ, MPQ and WOMAC are a score value, and SF-36 has 8 score values, represents the different aspect of life quality respectively.
The doctor rule of thumb judges patient's state then mainly by observing the data of several questionnaires simultaneously at present, or judges the effect for the treatment of according to the lifting of the score value of twice questionnaire before and after the treatment.Through regular meeting's inconsistent situation of patient condition that questionnaire shows of coming out, as a certain patient NPQ questionnaire is shown that its pain index is very high, but the SF-36 questionnaire shows that again its life quality is pretty good but in practice; Twice MPQ value is to have risen before and after a certain patient, and the pain increased to show it, but the NPQ score value descended, and shows that its pain slows down.When the doctor faces this situation, can only carry out subjective judgement according to the experience of self, be used for addressing this problem and also have special technology at present.
Above questionnaire all is that its theoretical foundation and correctness are undoubted through the medical science of strictness demonstration, and the generation of this contradiction mainly is the subjective deviation when being answered by patient, and dissimilar questionnaires has different potential weights to cause to different patients.
Summary of the invention
The objective of the invention is to consider the problems referred to above and inconsistency between a kind of result who lowers different Chinese traditional treatment recruitment evaluation questionnaires is provided, to improve the machine learning method of subjective the differing property of questionnaire of the minimizing traditional Chinese medical science of accuracy that its centering is cured the assessment of therapeutic effect.The present invention is convenient and practical.
Technical scheme of the present invention is: the machine learning method of subjective the differing property of questionnaire of the minimizing traditional Chinese medical science of the present invention comprises the steps:
1) the subjective questionnaire data of vectorization: subjective questionnaire is made of problem, weight and score value, and vectorization is the thaumatropy of questionnaire vector, establishes subjective questionnaire and is expressed as Q={q 1, q 2..., q m, q wherein iIt is the score of i questionnaire item;
2) the consistance target of definition questionnaire group, and formalization is expressed; Definition contradiction function C (x) is expressed consistance between the questionnaire group, the contradiction function with the score value of questionnaire group through the value that obtains after the conversion as input, this process uses negative correlation as the contradiction function, coincidence statistics is practised theoretical;
3) main subjective questionnaire NPQ, MPQ and the SF-36 that the traditional Chinese medical science is used carries out consistency analysis, and wherein NPQ and MPQ respectively have a sub-questionnaire, and SF-36 has 8 sub-questionnaires, according to the 2nd) the contradiction function in step, define following conforming target:
max Φ corr ( φ 1 ( NPQ ) ) , φ 2 ( MPQ ) ) , φ 3 ( SF - 36 1 ) , · · · , φ 10 ( SF - 36 8 ) )
(1)
In (1) formula, corr represents the computing function of correlativity;
4) to the finding the solution of (1) formula, think that Φ has following linear forms: Φ (x)=w TX, owing to used the nuclear mapping, so its result can not be subjected to the restriction of the linear combination of basis function, used the thought that is similar to KCCA that the optimization problem shown in (1) formula is found the solution, release the maximized solution of consistance between two groups of questionnaires earlier, and then be generalized to the situation of many group questionnaires; Be example with NPQ and two groups of questionnaires of MPQ, establish it and have scale l, i.e. NPQ={NPQ 1, NPQ 2..., NPQ lAnd MPQ={MPQ 1, MPQ 2..., MPQ l, establish u=M N* NPQ, υ=M M* MPQ; Make the correlativity maximum of u and υ, be equivalent to and find the solution following optimization problem:
max M N , M M E [ uυ ] E [ u 2 ] E [ υ 2 ] - - - ( 3 )
E[wherein] be mathematical expectation, note C NPQ, MPQBe covariance matrix, then (2) can be expressed as:
max M N , M M M N T C N , M M M M N T C N , N M N M M T C M , M M M - - - ( 4 )
According to the KCCA theory, in optimization aim (3), consider u TU and υ Tυ rather than original u and υ help to introduce kernel function in optimization aim; Quote two standardization items:
Figure BDA00003136216800043
With
Figure BDA00003136216800044
It only calculates for simplifying, and can't influence the optimization aim of (3); Simultaneously, as long as levying the dimension in space, the data point number bit of training wants big, the optimum solution of (3)
Figure BDA00003136216800049
With
Figure BDA000031362168000410
One fixes on set span{ φ (x 1), φ (x 2) ..., φ (x n) and
Figure BDA00003136216800045
In, wherein x and y are respectively the training sample points of NPQ and MPQ, namely M M * = Σ i = 1 n α i φ ( x i ) ,
Figure BDA00003136216800047
Use the mode of maximum likelihood to control the model complexity, increase by two regularization term about α and β, obtain following optimization aim expression formula:
max α , β α T K X K Y β - η α | | α | | 2 - η β | | β | | 2 - - - ( 4 )
K wherein Xiφ (x i) Tφ (x i),
Figure BDA00003136216800051
(4) formula is and finds the solution target.
The represented optimization aim of above-mentioned formula (4) adopts the enclosed solving method to find the solution:
Note w=(α T, β T) T, then being had by representation theorem, the optimum solution of (4) is equivalent to secular equation B -1The solution of Aw=pw, p can regard square formation B as -1The proper vector of A, wherein:
A = 0 K X K Y K Y K X 0 B = K X K Y 0 0 K Y K X - - - ( 5 )
Step to above-mentioned solving-optimizing target is carried out following conclusion:
The 1st step: input questionnaire group Q iAnd Q j
The 2nd step: calculate questionnaire group Q iAnd Q jThe Gram matrix based on radial basis function nuclear
Figure BDA00003136216800056
With
The 3rd step: use formula (5) to calculate questionnaire group Q iAnd Q jCorrespondence With
The 4th step: compute matrix
Figure BDA00003136216800054
With
Figure BDA00003136216800055
Proper vector
Figure BDA000031362168000510
With
Figure BDA000031362168000511
The 5th step: output is as transformation matrix;
Top step is the enclosed solving method, can directly from training data, obtain transformation matrix, but because it will carry out the feature decomposition of matrix, therefore be only applicable to the little situation of amount of training data, be not more than 1000 examples, use a kind of based on the method for decomposing the index iteration convergence, respectively questionnaire group consistance is separately found the solution, connecting mapping then merges, algorithm uses a w at random as initial value, correlativity is once calculated in every optimization one time, and twice correlativity change is less than 0.5% the time when front and back, and algorithm finishes.
The represented optimization aim of above-mentioned formula (4) adopts following steps to find the solution:
The 1st step: input questionnaire group Q iAnd Q j, initializes weights vector w;
The 2nd step: calculate questionnaire group Q iAnd Q jThe Gram matrix based on radial basis function nuclear
Figure BDA00003136216800067
With
The 3rd step: the Gram matrix K based on radial basis function nuclear of calculating weight vectors w w
The 4th step: use formula (5), calculate questionnaire group Q iAnd Q j
Figure BDA00003136216800069
With
Figure BDA00003136216800061
The 5th step: compute matrix
Figure BDA00003136216800062
With
Figure BDA00003136216800063
Proper vector With
The 6th step: upgrade w = M Q i M Q j ;
The 7th step: calculate δ=corr (Q i* w, Q j* w);
The 8th step: if δ>0.5% then jumped to for the 3rd step, otherwise finishes;
Said process is the complexity that exchanges algorithm with the time for, has avoided big square formation is directly carried out feature decomposition, comes down to each composition is carried out based on the finding the solution of method one, and then carries out addition;
Because what said process carried out is to connect after optimizing again, therefore need one to connect the step result is connected, adopt the method shown in the following formula (6) to connect, its objective is the expression formula that obtains an expansion mapping, can be used for the data in future;
φ ( x ) = ( Σ i Q i / | | M Q i | | + Σ i Q j / | | M Q j | | ) x - - - ( 6 )
By formula (6), obtain a conversion that can be used for any unknown data It can change the distribution of unknown data, makes it and existing questionnaire group Q iAnd Q jScore value correlativity maximum.
The present invention is owing to adopt by conversion of historical questionnaire data study, and under the effect of this conversion, the inconsistency between the questionnaire reduces to minimum, eliminates the subjective deviation of patient when answering a questionnaire to greatest extent, makes it more to be conducive to doctor's diagnosis.Invention is the machine learning method of subjective the differing property of questionnaire of a kind of convenient and practical minimizing traditional Chinese medical science.
Embodiment
Embodiment:
In the present embodiment, the present invention reduces the inconsistency between the questionnaire to greatest extent by a conversion that acts on existing questionnaire data.For effective solution of clear expression, at first problem is carried out formal description.Be provided with the questionnaire data set
Q={Q 1, Q 2..., Q m, Q wherein iBe the vector of n * 1, represent the score value of i questionnaire, then target of the present invention is to seek a conversion Φ by machine learning method, makes contradiction functional value C (Φ (the Q)) minimum of definition on it.In the present invention, the consistency problem of a questionnaire fractional value that consideration changes namely reduces the inconsistency that changes between the twice questionnaire mark in treatment front and back, even C is (Φ (Q T+1)-Φ (Q t)) minimum, consider the convenience in the calculating, target is decided to be the correlativity maximum that makes through between the questionnaire mark after the Φ conversion.
The present invention adopts a kind of Kernel of being similar to Canonical Correlation Analysis(KCCA) method obtain optimum Φ value.The particular content of invention is described below:
With NPQ, the consistance of MPQ and three groups of questionnaires of SF-36 is that example describes.Because SF-36 has 8 score values, and NPQ and MPQ have only a score value, so establish Φ={ φ 1..., φ 10, target is:
max Φ corr ( φ 1 ( NPQ ) ) , φ 2 ( MPQ ) ) , φ 3 ( SF - 36 1 ) , · · · , φ 10 ( SF - 36 8 ) ) - - - ( 5 )
Set Φ and have following linear forms: Φ (x)=w TX, owing to used the nuclear mapping among the present invention, so its result can not be subjected to the restriction of the linear combination of basis function.Use is similar to the thought of KCCA the optimization problem shown in (1) formula is found the solution, and releases the maximized solution of consistance between two groups of questionnaires earlier, and then is generalized to the situation of many group questionnaires.Be example with NPQ and two groups of questionnaires of MPQ, establish it and have scale l, i.e. NPQ={NPQ 1, NPQ 2..., NPQ lAnd MPQ={MPQ 1, MPQ 2..., MPQ l, establish u=M N* NPQ, υ=M M* MPQ.Make the correlativity maximum of u and υ, be equivalent to and find the solution following optimization problem:
max M N , M M E [ uυ ] E [ u 2 ] E [ υ 2 ] - - - ( 6 )
E[wherein] be mathematical expectation, note C NPQ, MPQBe covariance matrix, then (2) can be expressed as:
max M N , M M M N T C N , M M M M N T C N , N M N M M T C M , M M M - - - ( 7 )
According to the KCCA theory, in optimization aim (3), consider u TU and υ Tυ rather than original u and υ help to introduce kernel function in optimization aim.Quote two standardization items:
Figure BDA00003136216800084
With
Figure BDA00003136216800085
It only calculates for simplifying, and can't influence the optimization aim of (3).Simultaneously, as long as levying the dimension in space, the data point number bit of training wants big, the optimum solution of (3) With
Figure BDA000031362168000810
One fixes on set span{ φ (x 1), φ (x 2) ..., φ (x n) and
Figure BDA00003136216800086
In, wherein x and y are respectively the training sample points of NPQ and MPQ, namely M M * = Σ i = 1 n α i φ ( x i ) ,
Figure BDA00003136216800088
Use the mode of maximum likelihood to control the model complexity, increase by two regularization term about α and β, obtain following optimization aim expression formula:
max α , β α T K X K Y β - η α | | α | | 2 - η β | | β | | 2 - - - ( 4 )
K wherein Xiφ (x i) Tφ (x i),
Figure BDA00003136216800092
(4) formula is the target of finding the solution of the present invention.
The present invention proposes to adopt two kinds of methods that (4) represented optimization aim is found the solution.Method one is the enclosed solving method.Note w=(α T, β T) T, then being had by representation theorem, the optimum solution of (4) is equivalent to secular equation B -1The solution of Aw=pw, p can regard square formation B as -1The proper vector of A, wherein:
A = 0 K X K Y K Y K X 0 B = K X K Y 0 0 K Y K X - - - ( 5 )
1 pair of above-mentioned steps of algorithm is concluded.
The 1st step: input questionnaire group Q iAnd Q j
The 2nd step: calculate questionnaire group Q iAnd Q jThe Gram matrix based on radial basis function nuclear With
Figure BDA00003136216800098
The 3rd step: use formula (5) to calculate questionnaire group Q iAnd Q jCorrespondence
Figure BDA00003136216800099
With
Figure BDA000031362168000910
The 4th step: compute matrix With
Figure BDA00003136216800096
Proper vector
Figure BDA000031362168000911
With
The 5th step: output is as transformation matrix
Top step is the enclosed solving method, can directly obtain transformation matrix from training data, but because it will carry out the feature decomposition of matrix, therefore is only applicable to the little situation of amount of training data (being not more than 1000 examples).Therefore we use a kind of based on the method for decomposing the index iteration convergence in the present invention, respectively questionnaire group consistance is separately found the solution, connecting mapping then merges, algorithm uses a w at random as initial value, correlativity is once calculated in every optimization one time, twice correlativity change is less than 0.5% the time when front and back, and algorithm finishes.Method two is the complexity that exchanges algorithm with the time for, has avoided big square formation is directly carried out feature decomposition, comes down to each composition is carried out based on the finding the solution of method one, and then carries out addition.The key step of method two method for solving is summarized as follows:
The 1st step: input questionnaire group Q iAnd Q j, initializes weights vector w
The 2nd step: calculate questionnaire group Q iAnd Q jThe Gram matrix based on radial basis function nuclear
Figure BDA00003136216800107
With
Figure BDA00003136216800108
The 3rd step: the Gram matrix K based on radial basis function nuclear of calculating weight vectors w w
The 4th step: use formula (5), calculate questionnaire group Q iAnd Q j
Figure BDA00003136216800101
With
Figure BDA00003136216800102
The 5th step: compute matrix
Figure BDA00003136216800103
With Proper vector
Figure BDA00003136216800105
With
Figure BDA00003136216800106
The 6th step: upgrade w = M Q i M Q j
The 7th step: calculate δ=corr (Q i* w, Q j* w)
The 8th step: if δ>0.5% then jumped to for the 3rd step, otherwise finishes
Because what method two carried out is to connect after optimizing again, therefore need a connection step that the result is connected, we adopt the method shown in the formula (6) to connect in the present invention, its objective is the expression formula that obtains an expansion mapping, can be used for following data.
φ ( x ) = ( Σ i Q i / | | M Q i | | + Σ i Q j / | | M Q j | | ) x - - - ( 6 )
By formula (6), obtain a conversion that can be used for any unknown data It can change the distribution of unknown data, makes it and existing questionnaire group Q iAnd Q jScore value correlativity maximum.
Content of the present invention is concentrated at the cervical spondylopathy of certain institute of traditional Chinese medicine and orthopaedics clinical data and has been carried out application testing, has all obtained effect preferably.
The result of cervical spondylopathy patient treatment report is the theory and technology based on psychometrics, obtains the patient in daily life, healthy impression with to the physical experience of aspects such as treatment measure subjectivity satisfaction by forms such as interview, self-appraisal questionnaires.Pain is a kind of subjective symptom of patient's perception, still is difficult to the service-strong objective indicator at present and measures.At the deficiency of domestic similar research on the scientific research and design methodology, introduce NPQ cervicodynia scale, McGill pain scale (MPQ) and the SF-36 life quality scale patient subjective assessment index of international endorsement as therapeutic evaluation, carried out comprehensive evaluation from aspects such as pain relief, functional rehabilitation, life qualities.
Adopted the patient data of 794 examples from different branches in this test, each patient record the subjective questionnaire score value in 5 stages to represent the order of severity of its cervical spondylopathy, be respectively before the treatment, treatment beginning 1 month, treatment finish, treatment finished back 1 month and treatment finishes back 3 months.Ideal situation is that the variation of all indexs is all synchronous, and following three kinds of possibilities occur but actual conditions are meetings: synchronously, part is synchronous and asynchronous.Synchronously: in therapeutic process, the variation of NPQ, MPQ and SF-36 is consistent, and as the treatment through a period, NPQ and MPQ descend, and SF-36 rises, and shows treatment pain decline afterwards, and life quality rises, and such three indexs are synchronous; Part is synchronously: in therapeutic process, the variation of NPQ, MPQ and SF-36 is not quite identical, in certain treatment stage, the phenomenon that NPQ descends, MPQ rises occurs, and then the doctor can't judge that the patient alleviates through the pain after the treatment or increased the weight of; Asynchronous: the phenomenon of subjective questionnaire contradiction has all appearred in each treatment stage in therapeutic process.
Be synchronously with explanatory note explanation, part synchronously and asynchronous, not with scheming (seeing above-mentioned blue literal)
For nonsynchronous index, the doctor has been difficult to judge the effect for the treatment of.Therefore adopted method of the present invention to improve consistance between the different indexs in clinical, table 1 has showed that the consistance that Guangdong Provincial TCM Hospital cervical spondylopathy index gathers improves degree:
Table 1. gathers the raising degree of indicator consilience
The present invention also uses at the treatment of cervical spondylopathy achievement data of each branch and concentrates, and table 2 has showed that the indicator consilience of different branches after using the present invention improves degree:
Table 2. gathers the raising degree (NPQ vs MPQ) of indicator consilience
Figure BDA00003136216800131
Annotate: employed data from " national Eleventh Five-Year Plan science and technology support planning item: the clinical research of different acupuncture-moxibustion therapy method treatment cervical spondylopathy cervicodynia prioritization schemes, bullets: 2006BAI12B04-1 "
The present invention can significantly improve the consistance between the index as can be seen, is conducive to the doctor and diagnoses.
6.2 the clinical trial of orthopaedics data
Adopt at random, contrast, polycentric test design, for the knee osteoarthritis patient who meets the standard of including in, accept traditional Chinese medicine for oral administration, external application Chinese medicine, gimmick, acupuncture, the treatment of traditional medicine synthetic therapy scheme respectively, and carry out simultaneously in 5 tame hospitals.Adopt international osteoarthritis research scale (WOMAC), life quality scale (SF-36) to carry out observation of curative effect, form the suitable prioritization scheme of promoting with the effect characteristics of clear and definite above therapy and after analysis-by-synthesis.
Since WOMAC scale and SF-36 scale, and inconsistency can appear between 8 territories of SF-36 scale itself, and use the present invention reduces the inconsistent degree between these indexs.Table 3 has been showed in Guangdong Provincial TCM Hospital " research of the knee osteoarthritis Chinese medicine scheme optimization " project raising situation of using consistent degree after the method for the present invention.
The correlativity of table 3. orthopaedics index WOMAC and life quality index S F-36
Figure BDA00003136216800141

Claims (3)

1. a machine learning method that reduces subjective the differing property of questionnaire of the traditional Chinese medical science is characterized in that comprising the steps:
1) the subjective questionnaire data of vectorization: subjective questionnaire is made of problem, weight and score value, and vectorization is the thaumatropy of questionnaire vector, establishes subjective questionnaire and is expressed as Q={q 1, q 2..., q m, q wherein iIt is the score of i questionnaire item;
2) the consistance target of definition questionnaire group, and formalization is expressed; Definition contradiction function C (x) is expressed consistance between the questionnaire group, the contradiction function with the score value of questionnaire group through the value that obtains after the conversion as input, this process uses negative correlation as the contradiction function, coincidence statistics is practised theoretical;
3) main subjective questionnaire NPQ, MPQ and the SF-36 that the traditional Chinese medical science is used carries out consistency analysis, and wherein NPQ and MPQ respectively have a sub-questionnaire, and SF-36 has 8 sub-questionnaires, according to the 2nd) the contradiction function in step, define following conforming target:
max Φ corr ( φ 1 ( NPQ ) ) , φ 2 ( MPQ ) ) , φ 3 ( SF - 36 1 ) , · · · , φ 10 ( SF - 36 8 ) )
(1)
In (1) formula, corr represents the computing function of correlativity;
4) to the finding the solution of (1) formula, think that Φ has following linear forms: Φ (x)=w TX, owing to used the nuclear mapping, so its result can not be subjected to the restriction of the linear combination of basis function, used the thought that is similar to KCCA that the optimization problem shown in (1) formula is found the solution, release the maximized solution of consistance between two groups of questionnaires earlier, and then be generalized to the situation of many group questionnaires; Be example with NPQ and two groups of questionnaires of MPQ, establish it and have scale l, i.e. NPQ={NPQ 1, NPQ 2..., NPQ lAnd MPQ={MPQ 1, MPQ 2..., MPQ l, establish u=M N* NPQ, υ=M M* MPQ; Make the correlativity maximum of u and υ, be equivalent to and find the solution following optimization problem:
max M N , M M E [ uυ ] E [ u 2 ] E [ υ 2 ] - - - ( 1 )
E[wherein] be mathematical expectation, note C NPQ, MPQBe covariance matrix, then (2) can be expressed as:
max M N , M M M N T C N , M M M M N T C N , N M N M M T C M , M M M - - - ( 2 )
According to the KCCA theory, in optimization aim (3), consider u TU and υ Tυ rather than original u and υ help to introduce kernel function in optimization aim; Quote two standardization items:
Figure FDA00003136216700023
With
Figure FDA00003136216700024
It only calculates for simplifying, and can't influence the optimization aim of (3); Simultaneously, as long as levying the dimension in space, the data point number bit of training wants big, the optimum solution of (3)
Figure FDA00003136216700028
With
Figure FDA00003136216700029
One fixes on set span{ φ (x 1), φ (x 2) ..., φ (x n) and In, wherein x and y are respectively the training sample points of NPQ and MPQ, namely M M * = Σ i = 1 n α i φ ( x i ) ,
Figure FDA00003136216700026
Use the mode of maximum likelihood to control the model complexity, increase by two regularization term about α and β, obtain following optimization aim expression formula:
max α , β α T K X K Y β - η α | | α | | 2 - η β | | β | | 2 - - - ( 4 )
K wherein Xiφ (x i) Tφ (x i),
Figure FDA00003136216700031
(4) formula is and finds the solution target.
2. the machine learning method of subjective the differing property of questionnaire of the minimizing traditional Chinese medical science according to claim 1 is characterized in that the represented optimization aim of above-mentioned formula (4) adopts the enclosed solving method to find the solution:
Note w=(α T, β T) T, then being had by representation theorem, the optimum solution of (4) is equivalent to secular equation B -1The solution of Aw=pw, p can regard square formation B as -1The proper vector of A, wherein:
A = 0 K X K Y K Y K X 0 B = K X K Y 0 0 K Y K X - - - ( 5 )
Step to above-mentioned solving-optimizing target is carried out following conclusion:
The 1st step: input questionnaire group Q iAnd Q j
The 2nd step: calculate questionnaire group Q iAnd Q jThe Gram matrix based on radial basis function nuclear
Figure FDA00003136216700034
With
The 3rd step: use formula (5) to calculate questionnaire group Q iAnd Q jCorrespondence
Figure FDA00003136216700036
With
Figure FDA00003136216700037
The 4th step: compute matrix
Figure FDA00003136216700038
With
Figure FDA00003136216700039
Proper vector
Figure FDA000031362167000310
With
Figure FDA000031362167000311
The 5th step: output is as transformation matrix;
Top step is the enclosed solving method, can directly from training data, obtain transformation matrix, but because it will carry out the feature decomposition of matrix, therefore be only applicable to the little situation of amount of training data, be not more than 1000 examples, use a kind of based on the method for decomposing the index iteration convergence, respectively questionnaire group consistance is separately found the solution, connecting mapping then merges, algorithm uses a w at random as initial value, correlativity is once calculated in every optimization one time, and twice correlativity change is less than 0.5% the time when front and back, and algorithm finishes.
3. the machine learning method of subjective the differing property of questionnaire of the minimizing traditional Chinese medical science according to claim 2 is characterized in that the represented optimization aim of above-mentioned formula (4) adopts following steps to find the solution:
The 1st step: input questionnaire group Q iAnd Q j, initializes weights vector w;
The 2nd step: calculate questionnaire group Q iAnd Q jThe Gram matrix based on radial basis function nuclear With
Figure FDA00003136216700045
The 3rd step: the Gram matrix K based on radial basis function nuclear of calculating weight vectors w w
The 4th step: use formula (5), calculate questionnaire group Q iAnd Q j With
Figure FDA00003136216700041
The 5th step: compute matrix
Figure FDA00003136216700042
With
Figure FDA00003136216700043
Proper vector
Figure FDA00003136216700047
With
Figure FDA00003136216700048
The 6th step: upgrade w = M Q i M Q j ;
The 7th step: calculate δ=corr (Q i* w, Q j* w);
The 8th step: if δ>0.5% then jumped to for the 3rd step, otherwise finishes;
Said process is the complexity that exchanges algorithm with the time for, has avoided big square formation is directly carried out feature decomposition, comes down to each composition is carried out based on the finding the solution of method one, and then carries out addition;
Because what said process carried out is to connect after optimizing again, therefore need one to connect the step result is connected, adopt the method shown in the following formula (6) to connect, its objective is the expression formula that obtains an expansion mapping, can be used for the data in future;
φ ( x ) = ( Σ i Q i / | | M Q Q i | | + Σ i Q j / | | M Q j | | ) x - - - ( 6 )
By formula (6), obtain a conversion that can be used for any unknown data
Figure FDA00003136216700052
It can change the distribution of unknown data, makes it and existing questionnaire group Q iAnd Q jScore value correlativity maximum.
CN201310159736.9A 2013-05-02 2013-05-02 A kind of machine learning method for reducing differing property of traditional Chinese medical science subjective questionnaire Expired - Fee Related CN103294893B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310159736.9A CN103294893B (en) 2013-05-02 2013-05-02 A kind of machine learning method for reducing differing property of traditional Chinese medical science subjective questionnaire

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310159736.9A CN103294893B (en) 2013-05-02 2013-05-02 A kind of machine learning method for reducing differing property of traditional Chinese medical science subjective questionnaire

Publications (2)

Publication Number Publication Date
CN103294893A true CN103294893A (en) 2013-09-11
CN103294893B CN103294893B (en) 2017-08-25

Family

ID=49095745

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310159736.9A Expired - Fee Related CN103294893B (en) 2013-05-02 2013-05-02 A kind of machine learning method for reducing differing property of traditional Chinese medical science subjective questionnaire

Country Status (1)

Country Link
CN (1) CN103294893B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107194144A (en) * 2017-04-05 2017-09-22 深圳前海合泰生命健康技术有限公司 A kind of method and device for assessing scale

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110184729A1 (en) * 2008-09-29 2011-07-28 Sang Hyob Nam Apparatus and method for extracting and analyzing opinion in web document
US20120284080A1 (en) * 2011-05-04 2012-11-08 Telefonica S.A. Customer cognitive style prediction model based on mobile behavioral profile

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110184729A1 (en) * 2008-09-29 2011-07-28 Sang Hyob Nam Apparatus and method for extracting and analyzing opinion in web document
US20120284080A1 (en) * 2011-05-04 2012-11-08 Telefonica S.A. Customer cognitive style prediction model based on mobile behavioral profile

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
DA GUO等: ""Research on optimal Traditional Chinese Medicine treatment of knee ostarthritis with data mining algorithms"", 《2012 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOPS (BIBMW)》 *
GANG ZHANG等: ""Therapeutic effectiveness evaluation algorithm based on KCCA for neck pain caused by different diagnostic sub-types of cervical spondylosis"", 《2011 4TH INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND INFORMATICS (BMEI)》 *
ZHONG DI等: ""A Clinical Outcome Evaluation Model with Local Sample Selection: a Study on Efficacy of Acupuncture for Cervical Spondylosis"", 《2011 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOPS》 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107194144A (en) * 2017-04-05 2017-09-22 深圳前海合泰生命健康技术有限公司 A kind of method and device for assessing scale

Also Published As

Publication number Publication date
CN103294893B (en) 2017-08-25

Similar Documents

Publication Publication Date Title
Roberts et al. The design and analysis of reliability studies for the use of epidemiological and audit indices in orthodontics
Sintonen et al. A fifteen-dimensional measure of health-related quality of life (15D) and its applications
Tan Misuses of KR-20 and Cronbach's alpha reliability coefficients
Osteen An introduction to using multidimensional item response theory to assess latent factor structures
Gallegos et al. A measure of teamwork perceptions for team-based learning
Koballa Jr The determinants of female junior high school students' intentions to enroll in elective physical science courses in high school: Testing the applicability of the theory of reasoned action
Hatala et al. How well is each learner learning? Validity investigation of a learning curve-based assessment approach for ECG interpretation
Dennett et al. Adherence and drop-out in randomized controlled trials of exercise interventions in people with multiple sclerosis: a systematic review and meta-analyses
CN102670190A (en) Heart rate variability nonlinear characteristic-based automatic diagnosis method for congestive heart failure
Burns How to establish interrater reliability
Tosun et al. The Turkish version of the pregnancy physical activity questionnaire: cross-cultural adaptation, reliability, and validity
Ehmke et al. Measuring mathematics competence in international and national large scale assessments: Linking PISA and the national educational panel study in Germany
CN107945873A (en) A kind of online health control method, system, electronic equipment and storage medium
JP2022125154A (en) Assessment and management system for rehabilitative conditions and related methods
Yan et al. Developing a Rasch measurement physical fitness scale for Hong Kong primary school-aged students
Gignac Residual group-level factor associations: Possibly negative implications for the mutualism theory of general intelligence
Chatterjee et al. Assessment of a simulated case-based measurement of physician diagnostic performance
CN107085661A (en) A kind of the elderly's physical function Quantitative Calculation Method
de Beurs et al. A common measurement scale for self-report instruments in mental health care: T scores with a normal distribution.
Tan KR-20 ve Cronbach Alfa katsayılarının yanlış kullanımları
Friend Group empowerment in nursing education
CN106295202B (en) Juvenile healthy situation dynamic analysing method based on Hale indexes
CN103294893A (en) Machine learning method for reducing inconsistency between traditional Chinese medicine subjective questionnaires
Dorrel et al. Role strain in dual role collegiate athletic trainers
Caruso et al. The properties of equally and differentially weighted WAIS-III factor scores.

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20170825

Termination date: 20200502

CF01 Termination of patent right due to non-payment of annual fee