CN111242156B - Hyperplane nearest neighbor classification method for microangioma medical record images - Google Patents

Hyperplane nearest neighbor classification method for microangioma medical record images Download PDF

Info

Publication number
CN111242156B
CN111242156B CN201911104118.8A CN201911104118A CN111242156B CN 111242156 B CN111242156 B CN 111242156B CN 201911104118 A CN201911104118 A CN 201911104118A CN 111242156 B CN111242156 B CN 111242156B
Authority
CN
China
Prior art keywords
microangioma
medical record
disthyper
image
data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201911104118.8A
Other languages
Chinese (zh)
Other versions
CN111242156A (en
Inventor
丁卫平
丁帅荣
任龙杰
鞠恒荣
曹金鑫
孙颖
冯志豪
李铭
胡彬
陈森博
万杰
赵理莉
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nantong University Technology Transfer Center Co ltd
Original Assignee
Nantong University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nantong University filed Critical Nantong University
Priority to CN201911104118.8A priority Critical patent/CN111242156B/en
Publication of CN111242156A publication Critical patent/CN111242156A/en
Application granted granted Critical
Publication of CN111242156B publication Critical patent/CN111242156B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2413Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on distances to training or reference patterns
    • G06F18/24133Distances to prototypes
    • G06F18/24143Distances to neighbourhood prototypes, e.g. restricted Coulomb energy networks [RCEN]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2411Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on the proximity to a decision surface, e.g. support vector machines
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/74Image or video pattern matching; Proximity measures in feature spaces
    • G06V10/75Organisation of the matching processes, e.g. simultaneous or sequential comparisons of image or video features; Coarse-fine approaches, e.g. multi-scale approaches; using context analysis; Selection of dictionaries
    • G06V10/751Comparing pixel values or logical combinations thereof, or feature values having positional relevance, e.g. template matching
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V2201/00Indexing scheme relating to image or video recognition or understanding
    • G06V2201/03Recognition of patterns in medical or anatomical images
    • G06V2201/032Recognition of patterns in medical or anatomical images of protuberances, polyps nodules, etc.

Abstract

The invention discloses a hyperplane nearest neighbor classification method for a microangioma medical record image. Firstly, preprocessing and segmenting diabetic fundus image data, and extracting a lesion area of a microangioma medical record image from a processed fundus medical record image; then, the morphological characteristics, the textural characteristics and the gray-scale characteristics of the image area of the microangioma lesion are converted into a data vector x with the dimension of li(ii) a The data is then divided into training data XtrAnd test data XteBy comparing training data XtrTraining to obtain a set X comprising classified hyperplane and support vectorsvThe efficient classification model comprises a distance threshold t, the number k of nearest neighbors and the length nb of a spectral Hash coding code; final test data XteAnd during prediction, according to the relation between the distance from the test sample to the classified hyperplane and the distance threshold t, respectively adopting a support vector machine model and a neighbor algorithm of a fusion spectrum hash algorithm to perform prediction, and integrating related prediction results. The method can quickly and effectively classify the extracted characteristic of the microangioma medical record image in the fundus medical record, has higher classification accuracy, and greatly reduces the execution time of the characteristic classification of the microangioma medical record image.

Description

Hyperplane nearest neighbor classification method for microangioma medical record images
The technical field is as follows:
the invention relates to medical image classification, in particular to a hyperplane nearest neighbor classification method for a microangioma medical record image.
Background art:
diabetes is a disease with high morbidity and is now a great threat to human health. The cost of treatment for the advanced stages of diabetes is quite expensive and detection of diabetes as early as possible can effectively reduce the cost of treatment. Diabetes mellitus often results in retinal abnormalities, a microvascular complication of diabetes, known as diabetic retinopathy. Fundus images can monitor retinal abnormalities, so fundus image classification has now become an effective method for non-invasive detection of diabetes. The fundus images can be converted into feature vectors by segmenting lesion areas and then classified and predicted in a machine learning mode, but in view of reasons such as high dimensionality of the feature vectors or insufficient machine learning, classification accuracy and efficiency need to be improved.
The support vector machine basically does not involve probability measures, law of large numbers and the like, and therefore is different from the existing statistical method. In essence, the method avoids the traditional process from induction to deduction, realizes efficient transduction reasoning from the training sample to the forecast sample, and greatly simplifies the problems of common classification, regression and the like. The final result is determined by a few support vectors, which not only can help us to grasp key samples and 'reject' a large number of redundant samples, but also ensures that the method is simple in algorithm and has better robustness. But the classification accuracy of support vector machines tends to be low near the classification hyperplane. The kNN nearest neighbor method has no hypothesis on data, high accuracy, insensitivity to abnormal points, and is more suitable for automatic classification of class domains with larger sample capacity, the training time complexity is lower than that of a support vector machine, but the overall classification accuracy is not higher than that of the support vector machine. The traditional kNN nearest neighbor method generally uses Euclidean distance, and the calculation amount is huge. The encoding process of the image feature vector by the spectral hash can be regarded as a graph segmentation problem, a relaxation solution can be provided for the graph segmentation problem by analyzing the characteristic value of the Laplacian matrix of the similar graph and the characteristic vector, and binary encoding is generated by thresholding the characteristic vector. The calculation of the Hamming distance between the binary codes is greatly reduced compared with the calculation amount of the Euclidean distance used by the traditional kNN nearest neighbor method. The combination of the three can be better applied to image feature classification.
The invention content is as follows:
the invention aims to provide a hyperplane nearest neighbor classification method for a microangioma medical record image, which is accurate and rapid in classification and comprises the following specific steps:
A. firstly, preprocessing a diabetic fundus medical record image, removing a medical record image background by adopting a median filtering method, removing a medical record blood vessel structure by utilizing image morphology, extracting pathological change information in the image, carrying out corrosion operation on the image by utilizing a linear element, and carrying out Gaussian filtering on the diabetic fundus medical record image to enhance the contrast of a microangioma region;
B. the method comprises the following steps of selecting microangioma lesions in a diabetic fundus medical record image as a target for template matching, designing a function model to match the target according to the gray level and shape parameters of the microangioma in the medical record image, and enabling the gray level of the microangioma image to obey Gaussian distribution, wherein the matching formula of the function template is as follows:
Figure BDA0002270748130000021
where l is the lowest value of the gray scale, h is the height of the gray scale, e is the natural base number, and d is the point-to-templateThe distance between the centers of the circles, r is the radius, s is the gray scale steepness (x)0,y0) Is the central point of the microangioma;
C. transforming the morphological characteristics, textural characteristics and gray-scale characteristics of the obtained microangioma medical record pathological image into an l-dimensional data vector xi=(xi1,xi2,...,xil),i=1,2,3...;
D. Dividing microangioma medical record image data into training data Xtr=(x1,x2,...,xn) N-1, 2,3te=(x1',x2',...,xm'),m=1,2,3...;
E. Training data X for microangioma medical record imagestr=(x1,x2,...,xn) Training to obtain a set X of support vectors including classification hyperplane and support vectorsv=(x1”,x2”,...,xs"), s ═ 1,2,3., a distance threshold t spectral hash code length nb and a classification Model of the number k of nearest neighbors;
F. data X of a microangioma medical record image test set is classified according to a classification Modelte=(x1',x2',...,xm'), m is 1,2,3, mte, and test sample x is judged first when predictingi'=(xi1',xi2',...,xil'), i ═ 1,2,. times, m distance distHyper to the classification hyperplaneiI is 1,2, K, m, if it is greater than the distance threshold t, if the test sample x is not equal to the threshold valuei'=(xi1',xi2',...,xil'), i is 1,2, the distance from m to the classification hyperplane is greater than a distance threshold t, then the support vector machine model is used for prediction, if the distance from the test set data to the classification hyperplane is not greater than the distance threshold, then a neighbor algorithm combined with a spectrum hash algorithm is used for prediction, and finally a prediction result Y is obtained through synthesiste=(y1',y2',...,ym'),m=1,2,3...。
The invention is further improved in that: the step E specifically comprises the following steps: training data X for microangioma medical record imagestr=(x1,x2,...,xn),n=1,2,3..Training to obtain a set X including classified hyperplane and support vectorsv=(x1”,x2”,...,xs"), s ═ 1,2,3., a distance threshold t spectral hash code length nb and a classification Model of the number k of nearest neighbors; the method comprises the following steps:
a. training data X of medical record images of microangiomastr=(x1,x2,...,xn) N is 1,2,3. the support vector set X is obtained through the training of a support vector machine modelsv=(x1”,x2”,...,xs"), s ═ 1,2,3.. and classification hyperplane Hyper: w is aT·x+b=0,
Wherein w is a classification hyperplane normal vector, T is a transposition operation, b is an offset, and x is a point on the hyperplane;
b. calculating microangioma medical record image training data Xtr=(x1,x2,...,xn) N-1, 2,3.. distance matrix distHyper-w to the classification Hyper-plane HyperT·Xtr+b;
c. Respectively obtaining the training data sets X predicted by using the support vector machine through the set distance threshold ttrsvm={xi|distHyperiT, i ═ 1, 2.. n } and a training data set X predicted using a neighbor algorithmtrknn={xi|distHyperi≤t,i=1,2,...,n};
d. Data set X for microangioma medical record image trainingtrsvm={xi|distHyperiAnd if t, i is 1,2, 1, n, predicting by using a support vector machine model obtained by training to obtain a microangioma medical record image prediction label set Ytrsvm={yi|distHyperi>t,i=1,2,...,n};
e. For support vector set Xsv=(x1”,x2”,...,xs1,2,3, training a corresponding parameter SHAparam by using a spectral hash method through a set code length nb;
f. training data X of microangioma medical record images according to parameter SHAparamtr=(x1,x2,...,xn) Support vector set X of 1,2,3sv=(x1”,x2”,...,xs1,2, 3.) and a microangioma medical record image training dataset X predicted using a neighbor algorithmtrknn={xi|distHyperiCompressing t, i ═ 1, 2.. and n ≦ into binary codes with the same length by a spectral hash method;
g. calculating microangioma medical record image training dataset Xtrknn={xi|distHyperiT, i ≦ 1,2,.., n } and a set of support vectors Xsv=(x1”,x2”,...,xsCompressing the obtained binary codes, calculating Hamming distances corresponding to all the binary codes, and storing the Hamming distances into a Hamming distance table Dhamm _ train;
h. reading a Hamming distance table Dhamm _ train, taking the labels of the first k support vectors with the closest distance, and recording the label with the most occurrence times as a microangioma medical record image training data set Xtrknn={xi|distHyperiObtaining a label set Y of a microangioma medical record image training set prediction label set corresponding to the label of the corresponding sample in t, i ═ 1,2trknn={yi|distHyperi≤t,i=1,2,...,n};
i. Combining two parts to predict tag set YtrsvmAnd YtrknnAnd solving a microangioma medical record image training set prediction label set Ytr=(y1,y2,...,yn) Optimizing three parameters of a distance threshold t, a binary code length nb and the number k of nearest neighbors according to the prediction accuracy to obtain an optimal solution, wherein n is 1,2 and 3;
j. obtaining a set X containing classified hyperplane and support vectorsv=(x1”,x2”,...,xs"), s ═ 1,2,3., distance threshold t, spectral hash code length nb, and number of nearest neighbors k.
The invention is further improved in that: the step F specifically comprises the following steps: data X of a microangioma medical record image test set is classified according to a classification Modelte=(x1',x2',...,xm'), m is 1,2,3This xi'=(xi1',xi2',...,xil'), i ═ 1,2,. times, m distance distHyper to the classification hyperplaneiI is 1,2, K, m, if it is greater than the distance threshold t, if the test sample x is not equal to the threshold valuei'=(xi1',xi2',...,xil'), i is 1,2, the distance from m to the classification hyperplane is greater than a distance threshold t, then the support vector machine model is used for prediction, if the distance from the test set data to the classification hyperplane is not greater than the distance threshold, then a neighbor algorithm combined with a spectrum hash algorithm is used for prediction, and finally a prediction result Y is obtained through synthesiste=(y1',y2',...,ym'), m ═ 1,2,3.
a. Calculating microangioma medical record image test data Xte=(x1',x2',...,xm'), m 1,2,3.. the distance matrix distHyper w to the classification Hyper plane HyperT·Xte+b;
b. Respectively obtaining microangioma medical record test data sets X predicted by using a support vector machine through a set distance threshold ttesvm={xi|distHyperiT, i 1,2, m and microangioma medical record test dataset X predicted using a neighbor algorithmteknn={xi|distHyperi≤t,i=1,2,...,m};
c. To Xtesvm={xi|distHyperiT, i ═ 1, 2.., m } using the classified hyperplane Hyper-plane obtained by training to carry out support vector machine model prediction to obtain a prediction label set Ytesvm={yi|distHyperi>t,i=1,2,...,m};
d. The microangioma medical record test data set Xteknn={xi|distHyperiT, i ≦ 1,2,.., m } and the set of support vectors X in the Modelsv=(x1”,x2”,...,xs1,2,3. compressing the code length nb of the spectral hash coding in the classification Model into binary codes with the same length by a spectral hash method, calculating the Hamming distances corresponding to all the binary codes and storing the Hamming distances into a Hamming distance table Dhamm _ test;
e. reading Chinese charactersAnd (3) taking labels of the first k support vectors closest to the distance table Dhamm _ test, and recording the label with the most statistical occurrence times as a microangioma medical record image test data set Xteknn={xi|distHyperiObtaining a microangioma medical record prediction label set Y by using labels of corresponding samples in t, i ═ 1, 2.., m ≦teknn={yi|distHyperi≤t,i=1,2,...,m};
f. Microangioma medical record prediction tag set Y obtained by comprehensively using support vector machine model for predictiontesvm={yi|distHyperiT, i-1, 2, m and a microangioma medical record prediction tag set Y obtained by prediction by using a nearest neighbor algorithm combined with a spectral hash methodteknn={yi|distHyperiT, i ═ 1,2,.. m }, and the final microangioma medical record prediction result Y is obtainedte=(y1',y2',...,ym'),m=1,2,3...。
Firstly, preprocessing and segmenting diabetic fundus image data, and extracting a lesion area of a microangioma medical record image from a processed fundus medical record image; then, the morphological characteristics, the textural characteristics and the gray-scale characteristics of the image area of the microangioma lesion are converted into a data vector x with the dimension of li(ii) a The data is then divided into training data XtrAnd test data XteBy comparing training data XtrTraining a support vector machine to obtain a set X comprising classified hyperplane and support vectorsvA high-efficiency classification model of a distance threshold t and a spectral hash code length nb; final test data XteAnd during prediction, according to the relation between the distance from the test sample to the classified hyperplane and the distance threshold t, respectively adopting a support vector machine model and a neighbor algorithm of a fusion spectrum hash algorithm to perform prediction, and integrating related prediction results.
Compared with the prior art, the invention has the following advantages:
1. the method has better classification accuracy: the method combines the adjacent algorithm with the support vector machine, divides the diabetic fundus image data into a part which is classified by using the support vector and a part which is classified by using the adjacent algorithm by setting the distance threshold t, simultaneously solves the problems that the classification accuracy of the support vector machine near a classification hyperplane is not high and the total classification accuracy of the adjacent algorithm is not higher than that of the support vector machine, and well improves the classification accuracy.
2. The method has the advantages of good rapidity: the invention aims at the problem that the traditional kNN nearest neighbor method has huge calculation amount by using Euclidean distance, selects the part classified by using the nearest neighbor algorithm in the diabetic fundus image data through the set spectral Hash coding code length nb and the support vector set X of the part classified by using the support vector in the diabetic fundus image datasv=(x1”,x2”,...,xs"), s ═ 1,2,3
Figure BDA0002270748130000062
And classifying the part X in the diabetic fundus image data using a nearest neighbor algorithm according to the parameter SHAparam _ trainteknn={xi|distHyperiT, i ≦ 1, 2.. multidot.m } with the set of support vectors Xsv=(x1”,x2”,...,xs1,2,3, compressing the binary codes into binary codes with the same length, and then performing classification prediction on the obtained binary codes in a Hamming space by adopting a neighbor algorithm. Compared with the traditional kNN nearest neighbor method for calculating the Euclidean distance, the calculation time of the Hamming distance is greatly reduced, and the method has very high rapidity.
Drawings
FIG. 1 is a general flow diagram of the present invention;
FIG. 2 is a flow chart of a specific joint classifier training process;
FIG. 3 is a flow chart of a specific joint classifier test.
Detailed Description
For the purpose of enhancing understanding of the present invention, the present invention will be further described in detail with reference to the following examples, which are provided for illustration only and are not to be construed as limiting the scope of the present invention.
Fig. 1-3 show a specific embodiment of a hyperplane nearest neighbor classification method for a medical record image of microangioma:
the method comprises the following specific steps:
A. firstly, preprocessing a diabetic fundus medical record image, removing a medical record image background by adopting a median filtering method, removing a medical record blood vessel structure by utilizing image morphology, extracting pathological change information in the image, carrying out corrosion operation on the image by utilizing a linear element, and carrying out Gaussian filtering on the diabetic fundus medical record image to enhance the contrast of a microangioma region;
B. the method comprises the following steps of selecting microangioma lesions in a diabetic fundus medical record image as a target for template matching, designing a function model to match the target according to the gray level and shape parameters of the microangioma in the medical record image, and enabling the gray level of the microangioma image to obey Gaussian distribution, wherein the matching formula of the function template is as follows:
Figure BDA0002270748130000071
wherein l is the lowest value of gray scale, h is the height of gray scale, e is the natural base number, d is the distance from the point to the center of the template circle, r is the radius, s is the steepness of gray scale, (x)0,y0) Is the central point of the microangioma;
C. transforming the morphological characteristics, textural characteristics and gray-scale characteristics of the obtained microangioma medical record pathological image into an l-dimensional data vector xi=(xi1,xi2,...,xil),i=1,2,3...;
D. Dividing microangioma medical record image data into training data Xtr=(x1,x2,...,xn) N-1, 2,3te=(x1',x2',...,xm'),m=1,2,3...;
E. Training data X for microangioma medical record imagestr=(x1,x2,...,xn) Training to obtain a set X of support vectors including classification hyperplane and support vectorsv=(x1”,x2”,...,xs"), s ═ 1,2,3., distance threshold t, number of nearest neighbors k, and spectral hash code length nb, classifying Model;
the method specifically comprises the following steps: training data X for microangioma medical record imagestr=(x1,x2,...,xn) Training to obtain a set X of support vectors including classification hyperplane and support vectorsv=(x1”,x2”,...,xs"), s ═ 1,2,3., a distance threshold t spectral hash code length nb and a classification Model of the number k of nearest neighbors;
the method comprises the following steps:
a. training data X of medical record images of microangiomastr=(x1,x2,...,xn) N is 1,2,3. the support vector set X is obtained through the training of a support vector machine modelsv=(x1”,x2”,...,xs"), s ═ 1,2,3.. and classification hyperplane Hyper: w is aT·x+b=0,
Wherein w is a classification hyperplane normal vector, T is a transposition operation, b is an offset, and x is a point on the hyperplane;
b. calculating microangioma medical record image training data Xtr=(x1,x2,...,xn) N-1, 2,3.. distance matrix distHyper-w to the classification Hyper-plane HyperT·Xtr+b;
c. Respectively obtaining the training data sets X predicted by using the support vector machine through the set distance threshold ttrsvm={xi|distHyperiT, i ═ 1, 2.. n } and a training data set X predicted using a neighbor algorithmtrknn={xi|distHyperi≤t,i=1,2,...,n}
d. Data set X for microangioma medical record image trainingtrsvm={xi|distHyperiAnd if t, i is 1,2, 1, n, predicting by using a support vector machine model obtained by training to obtain a microangioma medical record image prediction label set Ytrsvm={yi|distHyperi>t,i=1,2,...,n};
e. For support vector set Xsv=(x1”,x2”,...,xs”),s=1,2,3...,Training a corresponding parameter SHAparam by a spectral hash method through a set code length nb;
f. training data X of microangioma medical record images according to parameter SHAparamtr=(x1,x2,...,xn) Support vector set X of 1,2,3sv=(x1”,x2”,...,xs1,2, 3.) and a microangioma medical record image training dataset X predicted using a neighbor algorithmtrknn={xi|distHyperiCompressing t, i ═ 1, 2.. and n ≦ into binary codes with the same length by a spectral hash method;
g. calculating microangioma medical record image training dataset Xtrknn={xi|distHyperiT, i ≦ 1,2,.., n } and a set of support vectors Xsv=(x1”,x2”,...,xsCompressing the obtained binary codes, calculating Hamming distances corresponding to all the binary codes, and storing the Hamming distances into a Hamming distance table Dhamm _ train;
h. reading a Hamming distance table Dhamm _ train, taking the labels of the first k support vectors with the closest distance, and recording the label with the most occurrence times as a microangioma medical record image training data set Xtrknn={xi|distHyperiObtaining a label set Y of a microangioma medical record image training set prediction label set corresponding to the label of the corresponding sample in t, i ═ 1,2trknn={yi|distHyperi≤t,i=1,2,...,n};
i. Combining two parts to predict tag set YtrsvmAnd YtrknnAnd solving a microangioma medical record image training set prediction label set Ytr=(y1,y2,...,yn) Optimizing three parameters, namely a distance threshold t, the number k of nearest neighbors and a spectrum hash coding code length nb, according to the prediction accuracy to obtain an optimal solution;
j. obtaining a set X containing classified hyperplane and support vectorsv=(x1”,x2”,...,xs"), s ═ 1,2,3., distance threshold t, number of nearest neighbors k, and spectral hash code length nb
F. Data X of a microangioma medical record image test set is classified according to a classification Modelte=(x1',x2',...,xm'), m is 1,2,3, mte, and test sample x is judged first when predictingi'=(xi1',xi2',...,xil'), i ═ 1,2,. times, m distance distHyper to the classification hyperplaneiI is 1,2, K, m, if it is greater than the distance threshold t, if the test sample x is not equal to the threshold valuei'=(xi1',xi2',...,xil'), i is 1,2, the distance from m to the classification hyperplane is greater than a distance threshold t, then the support vector machine model is used for prediction, if the distance from the test set data to the classification hyperplane is not greater than the distance threshold, then a neighbor algorithm combined with a spectrum hash algorithm is used for prediction, and finally a prediction result Y is obtained through synthesiste=(y1',y2',...,ym'), m ═ 1,2, 3.; the method specifically comprises the following steps:
data X of microangioma medical record image test set according to classification model Modte=(x1',x2',...,xm'), m is 1,2,3i'=(xi1',xi2',...,xil'), i ═ 1,2,. times, m distance distHyper to the classification hyperplaneiI is 1,2, K, m, if it is greater than the distance threshold t, if the test sample x is not equal to the threshold valuei'=(xi1',xi2',...,xil'), i is 1,2, the distance from m to the classification hyperplane is greater than a distance threshold t, then the support vector machine model is used for prediction, if the distance from the test set data to the classification hyperplane is not greater than the distance threshold, then a neighbor algorithm combined with a spectrum hash algorithm is used for prediction, and finally a prediction result Y is obtained through synthesiste=(y1',y2',...,ym'), m ═ 1,2,3.
a. Calculating microangioma medical record image test data Xte=(x1',x2',...,xm'), m 1,2,3.. the distance matrix distHyper w to the classification Hyper plane HyperT·Xte+b;
b. Respectively obtaining microangioma medical record test data sets predicted by using support vector machine through set distance threshold tXtesvm={xi|distHyperiT, i 1,2, m and microangioma medical record test dataset X predicted using a neighbor algorithmteknn={xi|distHyperi≤t,i=1,2,...,m};
c. To Xtesvm={xi|distHyperiT, i ═ 1, 2.., m } using the classified hyperplane Hyper-plane obtained by training to carry out support vector machine model prediction to obtain a prediction label set Ytesvm={yi|distHyperi>t,i=1,2,...,m};
d. The microangioma medical record test data set Xteknn={xi|distHyperiT, i ≦ 1,2,.., m } and the set of support vectors X in the Modelsv=(x1”,x2”,...,xs1,2,3. compressing the code length nb of the spectral hash coding in the classification Model into binary codes with the same length by a spectral hash method, calculating the Hamming distances corresponding to all the binary codes and storing the Hamming distances into a Hamming distance table Dhamm _ test;
e. reading a Hamming distance table Dhamm _ test, taking the labels of the first k support vectors with the closest distance, and recording the label with the most occurrence times as a microangioma medical record image test data set Xteknn={xi|distHyperiObtaining a microangioma medical record prediction label set Y by using labels of corresponding samples in t, i ═ 1, 2.., m ≦teknn={yi|distHyperi≤t,i=1,2,...,m};
f. Microangioma medical record prediction tag set Y obtained by comprehensively using support vector machine model for predictiontesvm={yi|distHyperiT, i-1, 2, m and a microangioma medical record prediction tag set Y obtained by prediction by using a nearest neighbor algorithm combined with a spectral hash methodteknn={yi|distHyperiT, i ═ 1,2,.. m }, and the final microangioma medical record prediction result Y is obtainedte=(y1',y2',...,ym'),m=1,2,3...。
The invention has better classification accuracy: the invention combines the proximity algorithm with the support vector machineThe method has the advantages that the distance threshold t is set, the diabetic fundus image data are divided into a part which is classified by using the support vector and a part which is classified by using the neighbor algorithm, the problems that the classification accuracy of the support vector machine near the classification hyperplane is not high and the total classification accuracy of the neighbor algorithm is not higher than that of the support vector machine are solved, and the classification accuracy is improved well. The method has the advantages of good rapidity: the invention aims at the problem that the traditional kNN nearest neighbor method has huge calculation amount by using Euclidean distance, selects the part classified by using the nearest neighbor algorithm in the diabetic fundus image data through the set spectral Hash coding code length nb and the support vector set X of the part classified by using the support vector in the diabetic fundus image datasv=(x1”,x2”,...,xs1,2,3. training to obtain a parameter SHparam _ train, and classifying a part X of the diabetic fundus image data by using a neighbor algorithm according to the parameter SHparam _ trainteknn={xi|distHyperiT, i ≦ 1, 2.. multidot.m } with the set of support vectors Xsv=(x1”,x2”,...,xs1,2,3, compressing the binary codes into binary codes with the same length, and then performing classification prediction on the obtained binary codes in a Hamming space by adopting a neighbor algorithm. Compared with the traditional kNN nearest neighbor method for calculating the Euclidean distance, the calculation time of the Hamming distance is greatly reduced, and the method has very high rapidity.
The previous description of the disclosed embodiments is provided to enable any person skilled in the art to make or use the present invention. Various modifications to these embodiments will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other embodiments without departing from the spirit or scope of the invention.
Thus, the present invention is not intended to be limited to the embodiments shown herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.

Claims (2)

1. A hyperplane nearest neighbor classification method for a microangioma medical record image is characterized by comprising the following steps: the method comprises the following specific steps:
A. firstly, preprocessing a diabetic fundus medical record image, removing a medical record image background by adopting a median filtering method, removing a medical record blood vessel structure by utilizing image morphology, extracting pathological change information in the image, carrying out corrosion operation on the image by utilizing a linear element, and carrying out Gaussian filtering on the diabetic fundus medical record image to enhance the contrast of a microangioma region;
B. the method comprises the following steps of selecting microangioma lesions in a diabetic fundus medical record image as a target for template matching, designing a function model to match the target according to the gray level and shape parameters of the microangioma in the medical record image, and enabling the gray level of the microangioma image to obey Gaussian distribution, wherein the matching formula of the function template is as follows:
Figure FDA0003356183160000011
wherein l is the lowest value of gray scale, h is the height of gray scale, e is the natural base number, d is the distance from the point to the center of the template circle, r is the radius, s is the steepness of gray scale, (x)0,y0) Is the central point of the microangioma;
C. transforming the morphological characteristics, textural characteristics and gray-scale characteristics of the obtained microangioma medical record pathological image into an l-dimensional data vector xi=(xi1,xi2,...,xil),i=1,2,3...,Ns
Wherein N issThe number of the pathological change image data of the microangioma medical record is shown;
D. dividing microangioma medical record image data into training data Xtr=(x1,x2,...,xn),n=1,2,3...,NtrWherein N is the number of microangioma images in the training set, and the value range of N is 1,2,3trAnd test data Xte=(x1',x2',...,xm'),m=1,2,3K,NteWherein m is the number of microangioma images in the test set, and m represents that the value range of m is 1,2,3te
Wherein N istrFor the pathological changes of microangiomasNumber of image training data, NteNumber of data tested for microangioma medical record lesion images, and Ntr+Nte=Ns
E. Training data X for microangioma medical record imagestr=(x1,x2,...,xn),n=1,2,3...,NtrTraining to obtain a set X comprising classified hyperplane and support vectorsv=(x1”,x2”,...,xs"), s 1,2,3, sv, distance threshold t, number of nearest neighbors k, and a classification Model of spectral hash code length nb,
wherein sv is the number of support vectors of lesion image data of a microangioma medical record, and the method comprises the following specific steps:
a. training data X of medical record images of microangiomastr=(x1,x2,...,xn),n=1,2,3...,NtrObtaining a support vector set X through the training of a support vector machine modelsv=(x1”,x2”,...,xs"), s ═ 1,2,3., sv and classification hyperplane Hyper: w is aT·x+b=0,
Wherein w is a classification hyperplane normal vector, T is a transposition operation, b is an offset, and x is a point on the hyperplane;
b. calculating microangioma medical record image training data Xtr=(x1,x2,...,xn),n=1,2,3...,NtrDistance matrix distHyper to sort Hyper plane Hyper wT·Xtr+b;
c. Respectively obtaining the training data sets X predicted by using the support vector machine through the set distance threshold ttrsvm={xi|distHyperiT, i ═ 1,2,. n }, where m is the microangioma image in the training set, xiThe index i of (a) has a value in the range of 1, 2.. multidot.m, and the training data set X predicted using a neighbor algorithmtrknn={xi|distHyperiT, i ═ 1,2,., n }, where n is the microangioma image in the training set, xiThe subscript i of (a) has a value range of 1, 2.. multidot.n;
d. data set X for microangioma medical record image trainingtrsvm={xi|distHyperiAnd if t, i is 1,2, 1, n, predicting by using a support vector machine model obtained by training to obtain a microangioma medical record image prediction label set Ytrsvm={yi|distHyperi>t,i=1,2,...,n};
e. For support vector set Xsv=(x1”,x2”,...,xs1,2,3, sv, training a corresponding parameter SHparam by a spectral hash method through a set code length nb;
f. training data X of microangioma medical record images according to parameter SHAparamtr=(x1,x2,...,xn),n=1,2,3...,NtrSupport vector set X ofsv=(x1”,x2”,...,xs1,2,3., sv and a microvascular tumor medical record image training dataset X predicted using a neighbor algorithmtrknn={xi|distHyperiCompressing t, i ═ 1, 2.. and n ≦ into binary codes with the same length by a spectral hash method;
g. calculating microangioma medical record image training dataset Xtrknn={xi|distHyperiT, i ≦ 1,2,.., n } and a set of support vectors Xsv=(x1”,x2”,...,xs"), s ═ 1,2,3, sv compresses the binary code, calculates the hamming distance corresponding to all binary codes and stores it in the hamming distance table Dhamm _ train;
h. reading a Hamming distance table Dhamm _ train, taking the labels of the first k support vectors with the closest distance, and recording the label with the most occurrence times as a microangioma medical record image training data set Xtrknn={xi|distHyperiObtaining a label set Y of a microangioma medical record image training set prediction label set corresponding to the label of the corresponding sample in t, i ═ 1,2trknn={yi|distHyperi≤t,i=1,2,...,n};
i. Combining two parts to predict tag set YtrsvmAnd YtrknnAnd solving a microangioma medical record image training set prediction label set Ytr=(y1,y2,...,yn),n=1,2,3K,NtrOptimizing three parameters of a distance threshold t, the number k of nearest neighbors and the length nb of the spectral hash code according to the prediction accuracy to obtain an optimal solution;
j. obtaining a set X containing classified hyperplane and support vectorsv=(x1”,x2”,...,xs"), s ═ 1,2,3., sv, distance threshold t, number of nearest neighbors k, and classification Model of spectral hash code length nb;
F. data X of a microangioma medical record image test set is classified according to a classification Modelte=(x1',x2',...,xm'),m=1,2,3K,NteFirst judging test sample x in predictioni'=(xi1',xi2',...,xil'), i ═ 1,2,. times, m distance distHyper to the classification hyperplaneiI 1, 2.. m, if it is greater than the distance threshold t, if the test sample x is testedi'=(xi1',xi2',...,xil'), i is 1,2, the distance from m to the classification hyperplane is greater than a distance threshold t, then the support vector machine model is used for prediction, if the distance from the test set data to the classification hyperplane is not greater than the distance threshold, then a neighbor algorithm combined with a spectrum hash algorithm is used for prediction, and finally a prediction result Y is obtained through synthesiste=(y1',y2',...,ym'),m=1,2,3K,Nte
2. The method of claim 1, wherein the method comprises the steps of: the step F specifically comprises the following steps: data X of a microangioma medical record image test set is classified according to a classification Modelte=(x1',x2',...,xm'),m=1,2,3K,NteFirst judging test sample x in predictioni'=(xi1',xi2',...,xil'), i ═ 1,2,. times, m distance distHyper to the classification hyperplaneiI 1, 2.. m, if it is greater than the distance threshold t, if the test sample x is testedi'=(xi1',xi2',...,xil'), i 1,2, the distance from m to the classification hyperplane is larger than a distance threshold t, and if the test set data is classified into classification, the support vector machine model is used for predictionIf the distance of the hyperplane is not more than the distance threshold, predicting by using a neighbor algorithm combined with the spectral hash algorithm, and finally synthesizing to obtain a prediction result Yte=(y1',y2',...,ym'),m=1,2,3K,NteThe method comprises the following steps:
a. calculating microangioma medical record image test data Xte=(x1',x2',...,xm'),m=1,2,3K,NteDistance matrix distHyper to sort Hyper plane Hyper wT·Xte+b;
b. Respectively obtaining microangioma medical record test data sets X predicted by using a support vector machine through a set distance threshold ttesvm={xi|distHyperiT, i 1,2, m and microangioma medical record test dataset X predicted using a neighbor algorithmteknn={xi|distHyperi≤t,i=1,2,...,m};
c. To Xtesvm={xi|distHyperiT, i ═ 1, 2.., m } using the classified hyperplane Hyper-plane obtained by training to carry out support vector machine model prediction to obtain a prediction label set Ytesvm={yi|distHyperi>t,i=1,2,...,m};
d. The microangioma medical record test data set Xteknn={xi|distHyperiT, i ≦ 1,2,.., m } and the set of support vectors X in the Modelsv=(x1”,x2”,...,xs1,2,3, sv compresses binary codes with the same length by a spectral hash method according to the length nb of the spectral hash coding code in the classification Model, calculates the hamming distances corresponding to all the binary codes and stores the hamming distances into a hamming distance table Dhamm _ test;
e. reading a Hamming distance table Dhamm _ test, taking the labels of the first k support vectors with the closest distance, and recording the label with the most occurrence times as a microangioma medical record image test data set Xteknn={xi|distHyperiObtaining a microangioma medical record prediction label set Y by using labels of corresponding samples in t, i ═ 1, 2.., m ≦teknn={yi|distHyperi≤t,i=1,2,...,m};
f. Microangioma medical record prediction tag set Y obtained by comprehensively using support vector machine model for predictiontesvm={yi|distHyperiT, i-1, 2, m and a microangioma medical record prediction tag set Y obtained by prediction by using a nearest neighbor algorithm combined with a spectral hash methodteknn={yi|distHyperiT, i ═ 1,2,.. m }, and the final microangioma medical record prediction result Y is obtainedte=(y1',y2',...,ym'),m=1,2,3K,Nte
CN201911104118.8A 2019-11-13 2019-11-13 Hyperplane nearest neighbor classification method for microangioma medical record images Active CN111242156B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911104118.8A CN111242156B (en) 2019-11-13 2019-11-13 Hyperplane nearest neighbor classification method for microangioma medical record images

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911104118.8A CN111242156B (en) 2019-11-13 2019-11-13 Hyperplane nearest neighbor classification method for microangioma medical record images

Publications (2)

Publication Number Publication Date
CN111242156A CN111242156A (en) 2020-06-05
CN111242156B true CN111242156B (en) 2022-02-08

Family

ID=70863910

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911104118.8A Active CN111242156B (en) 2019-11-13 2019-11-13 Hyperplane nearest neighbor classification method for microangioma medical record images

Country Status (1)

Country Link
CN (1) CN111242156B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113610007B (en) * 2021-08-09 2024-04-16 重庆医科大学 Nasal cavity sampling object detection device

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105069803A (en) * 2015-08-19 2015-11-18 西安交通大学 Classifier for micro-angioma of diabetes lesion based on colored image
CN110287787A (en) * 2019-05-21 2019-09-27 平安国际智慧城市科技股份有限公司 Image-recognizing method, device and computer readable storage medium

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP4057215A1 (en) * 2013-10-22 2022-09-14 Eyenuk, Inc. Systems and methods for automated analysis of retinal images
CN106530283A (en) * 2016-10-20 2017-03-22 北京工业大学 SVM (support vector machine)-based medical image blood vessel recognition method

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105069803A (en) * 2015-08-19 2015-11-18 西安交通大学 Classifier for micro-angioma of diabetes lesion based on colored image
CN110287787A (en) * 2019-05-21 2019-09-27 平安国际智慧城市科技股份有限公司 Image-recognizing method, device and computer readable storage medium

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
基于自学哈希的信息检索;李洋;《中国优秀博硕士学位论文全文数据库(硕士)信息科技辑》;20150815(第08期);第I138-1645页 *
基于自适应PSO的改进K-means算法及其在电子病历聚类分析应用;丁卫平 等;《计算机与数字工程》;20190820(第8期);第1-2页 *

Also Published As

Publication number Publication date
CN111242156A (en) 2020-06-05

Similar Documents

Publication Publication Date Title
Valindria et al. Reverse classification accuracy: predicting segmentation performance in the absence of ground truth
Yuan et al. Multi-level attention network for retinal vessel segmentation
Tabesh et al. Multifeature prostate cancer diagnosis and Gleason grading of histological images
JP7294695B2 (en) Program, Information Recording Medium, Classification Apparatus, and Classification Method Based on Trained Model
da Rocha et al. Texture analysis of masses malignant in mammograms images using a combined approach of diversity index and local binary patterns distribution
Vieira et al. Automatic segmentation and detection of small bowel angioectasias in WCE images
CN112651927A (en) Raman spectrum intelligent identification method based on convolutional neural network and support vector machine
Sukhia et al. Content-based histopathological image retrieval using multi-scale and multichannel decoder based LTP
CN111242156B (en) Hyperplane nearest neighbor classification method for microangioma medical record images
Renukalatha et al. Classification of glaucoma using simplified-multiclass support vector machine
Gudadhe et al. Classification of intracranial hemorrhage ct images based on texture analysis using ensemble-based machine learning algorithms: A comparative study
Viswanath et al. Breast cancer detection using image processing techniques and classification algorithms
Van Der Sommen et al. Computer-aided detection of early cancer in the esophagus using HD endoscopy images
Badeka et al. Evaluation of LBP variants in retinal blood vessels segmentation using machine learning
Dey et al. Red-plane asymmetry analysis of breast thermograms for cancer detection
Bhaskar et al. Pulmonary lung nodule detection and classification through image enhancement and deep learning
CN110176298B (en) Kernel principal component spectrum Hash method for diabetic fundus image classification
Shilaskar et al. Machine Learning Based Brain Tumor Detection and Classification using HOG Feature Descriptor
Liu et al. SASG-GCN: self-attention similarity guided graph convolutional network for multi-type lower-grade glioma classification
Alqudah et al. International Journal of Advanced Trends in Computer Science and Engineering
Yang et al. Feature-Based Discrimination of Thyroid Cancer on Ultrasound Images
UmaMaheswaran et al. Enhanced non-contrast computed tomography images for early acute stroke detection using machine learning approach
Pandala et al. Analysis, Prediction and Classification of Skin Cancer using Artificial Intelligence-A Brief Study and Review
Sundar et al. Decision Support System for Breast Cancer Risk Scoring
Dutta et al. Cross-validated AdaBoost classifier used for brain tumor detection

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right
TR01 Transfer of patent right

Effective date of registration: 20230816

Address after: 226000 No. 9 Siyuan Road, Chongchuan District, Nantong City, Jiangsu Province

Patentee after: Nantong University Technology Transfer Center Co.,Ltd.

Address before: 226000 No. 8, Siyuan Road, Chongchuan District, Nantong City, Jiangsu Province

Patentee before: NANTONG University