CN103530658A - Method for recognizing plant leaf data based on sparse representation - Google Patents
Method for recognizing plant leaf data based on sparse representation Download PDFInfo
- Publication number
- CN103530658A CN103530658A CN201310481334.0A CN201310481334A CN103530658A CN 103530658 A CN103530658 A CN 103530658A CN 201310481334 A CN201310481334 A CN 201310481334A CN 103530658 A CN103530658 A CN 103530658A
- Authority
- CN
- China
- Prior art keywords
- matrix
- data
- sigma
- vector data
- plant leaf
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 45
- 239000011159 matrix material Substances 0.000 claims description 70
- 238000012545 processing Methods 0.000 claims description 12
- 230000009466 transformation Effects 0.000 claims description 8
- 230000008859 change Effects 0.000 claims description 4
- 238000000354 decomposition reaction Methods 0.000 claims description 4
- 238000009499 grossing Methods 0.000 claims 1
- 241000196324 Embryophyta Species 0.000 abstract description 96
- 230000000694 effects Effects 0.000 abstract description 5
- 238000007781 pre-processing Methods 0.000 description 9
- 238000009826 distribution Methods 0.000 description 5
- 238000000605 extraction Methods 0.000 description 5
- 230000006870 function Effects 0.000 description 5
- 241000282414 Homo sapiens Species 0.000 description 4
- 238000005457 optimization Methods 0.000 description 4
- 238000000513 principal component analysis Methods 0.000 description 3
- 238000004458 analytical method Methods 0.000 description 2
- 238000013528 artificial neural network Methods 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 2
- 238000006243 chemical reaction Methods 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 238000010801 machine learning Methods 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 238000012549 training Methods 0.000 description 2
- 238000013473 artificial intelligence Methods 0.000 description 1
- 230000008033 biological extinction Effects 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000003909 pattern recognition Methods 0.000 description 1
- 235000017807 phytochemicals Nutrition 0.000 description 1
- 229930000223 plant secondary metabolite Natural products 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 239000002689 soil Substances 0.000 description 1
- 230000001629 suppression Effects 0.000 description 1
- 230000004083 survival effect Effects 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
Landscapes
- Image Analysis (AREA)
Abstract
本发明具体涉及一种基于稀疏表示的植物叶片数据识别方法。其技术方案是:本发明基于同类数据分布在同一流形上,不同数据分布于不同流形上的假设,一方面利用植物叶片数据类别信息定义流形间距离,另一方面对于流形局部邻域进行稀疏表示,通过在局部近邻图中建立一种稀疏表示关系来获取线性表示系数,建立目标函数寻找最佳投影低维空间,实现在该子空间内,流形间距离最大,而多流形的稀疏性能够较好地保持,最后采用最近邻分类方法在子空间内对植物叶片的稀疏特征进行分类和识别,提高了植物叶片数据的识别效果。The invention specifically relates to a method for identifying plant leaf data based on sparse representation. Its technical solution is: the present invention is based on the assumption that the same kind of data is distributed on the same manifold, and different data are distributed on different manifolds. On the one hand, the plant leaf data category information is used to define the distance between manifolds; The domain is sparsely represented, and the linear representation coefficient is obtained by establishing a sparse representation relationship in the local neighbor graph, and the objective function is established to find the best projected low-dimensional space, so that in this subspace, the distance between the manifolds is the largest, and the multi-flow The sparsity of the shape can be well maintained. Finally, the nearest neighbor classification method is used to classify and identify the sparse features of the plant leaves in the subspace, which improves the recognition effect of the plant leaf data.
Description
技术领域technical field
本发明属于植物叶片数据识别技术领域。具体涉及一种基于稀疏表示的植物叶片数据识别方法。The invention belongs to the technical field of plant leaf data identification. Specifically, it relates to a method for identifying plant leaf data based on sparse representation.
背景技术Background technique
植物是地球上物种数量最多、分布最广泛的生命形式之一。植物是人类生存与发展的重要遗传资源,是人类的重要食物来源,也是人类生产和生活必需的资源。同时,植物在水土保持、抑制荒漠和改善气候等方面起着至关重要的作用。近年来随着人类生产活动的日益增加,生态环境不断遭到破坏。据调查统计,世界上大约有3.4万种植物物种已处于灭绝的边缘,占世界上已知的25万种植物的13%。对植物进行保护已经是势在必行。Plants are one of the most abundant and widely distributed forms of life on Earth. Plants are an important genetic resource for human survival and development, an important source of food for human beings, and an essential resource for human production and life. At the same time, plants play a vital role in soil and water conservation, desert suppression and climate improvement. In recent years, with the increase of human production activities, the ecological environment has been continuously destroyed. According to survey statistics, about 34,000 plant species in the world are on the verge of extinction, accounting for 13% of the 250,000 known plant species in the world. It is imperative to protect plants.
目前的植物分类有很多种方法,如植物细胞分类学、植物化学分类学、植物血清分类学以及植物遗产学,但对于非专业人员,这些分类方法很难掌握或不实用。因此,有必要研究借助数字图像处理、模式识别、人工智能等信息技术进行计算机辅助的植物分类性状自动提取,实现植物物种的自动分类、机器识别,以及研究这些数字分类性状在植物物种生态分类中的意义。There are many methods of plant classification, such as plant cell taxonomy, phytochemical taxonomy, plant serotaxonomy and plant heritage, but for non-professionals, these classification methods are difficult or impractical. Therefore, it is necessary to study computer-aided automatic extraction of plant taxonomic traits with the help of information technologies such as digital image processing, pattern recognition, and artificial intelligence, to realize automatic classification and machine identification of plant species, and to study the role of these digital taxonomic traits in the ecological classification of plant species. meaning.
从目前的研究结果来看,应用在植物物种识别中比较广泛和成功的方法是结合植物叶片形状特征的神经网络方法,而此方法的成功在于神经网络的构造和植物图像特征矢量化,即如何从植物图像中提取特征。特征提取和选择对于机器学习方法来说是至关重要的,所抽取和选择的特征决定了分类器的性能和整个算法的结果。目前,绝大多数应用到植物物种识别中的机器学习方法的不同之处在于植物图像特征矢量化方法的不同,由此可见特征提取和选择在植物识别中的重要性。Judging from the current research results, the widely used and successful method used in plant species recognition is the neural network method combined with the shape characteristics of plant leaves, and the success of this method lies in the construction of the neural network and the vectorization of plant image features, that is, how to Extract features from plant images. Feature extraction and selection are crucial to machine learning methods, and the extracted and selected features determine the performance of the classifier and the results of the entire algorithm. At present, most of the machine learning methods applied to plant species identification are different in the vectorization method of plant image features, which shows the importance of feature extraction and selection in plant identification.
目前最常用的特征抽取技术就是主成分分析方法。在植物物种识别中,主成分分析也是一种常用的特征维数约简方法。主成分分析对具有线性结构的数据处理效果很好,它通过寻找数据的二阶统计性质来发现数据的线性结构,但对于高度非线性分布的数据并不能找到真正的分布结构。基于非线形分布数据本征维数分析的流形学习方法提供了一种新的解决途径。流形学习旨在发现高维流形分布数据的内在规律性,其本质是要从采样数据学习低维流形的内在几何结构.这就意味着流形学习方法比传统的维数约简方法更能体现事物的本质,更利于对数据的理解和进一步处理。因此,针对植物物种的多类别、高维分类数据,流形学习更有助于发现这些数据的内在分布规律和几何结构,这为植物分类学提供一种新型有效的分类性状分析工具。目前流形学习方法已经初步地应用到植物叶片数据特征提取和分类中,但是在流形学习中,需要大量的训练样本来学习流形的局部结构信息,而植物叶片训练数据还比较少。The most commonly used feature extraction technique is principal component analysis. In plant species identification, principal component analysis is also a commonly used feature dimensionality reduction method. Principal component analysis works well for data with a linear structure. It finds the linear structure of the data by looking for the second-order statistical properties of the data, but it cannot find the real distribution structure for the data with a highly nonlinear distribution. The manifold learning method based on the eigendimensional analysis of nonlinear distribution data provides a new solution. Manifold learning aims to discover the internal regularity of high-dimensional manifold distribution data, and its essence is to learn the internal geometric structure of low-dimensional manifolds from sampled data. This means that the manifold learning method is better than the traditional dimensionality reduction method. It can better reflect the essence of things, and is more conducive to the understanding and further processing of data. Therefore, for the multi-category and high-dimensional classification data of plant species, manifold learning is more helpful to discover the internal distribution and geometric structure of these data, which provides a new and effective tool for the analysis of taxonomic traits for plant taxonomy. At present, the manifold learning method has been preliminarily applied to the feature extraction and classification of plant leaf data, but in manifold learning, a large number of training samples are needed to learn the local structure information of the manifold, and the training data of plant leaves is still relatively small.
发明内容Contents of the invention
本发明旨在克服现有技术缺陷,目的是提出一种能提高识别效果的基于稀疏表示的植物叶片数据识别方法。The invention aims to overcome the defects of the prior art, and aims to propose a sparse representation-based plant leaf data recognition method that can improve the recognition effect.
为实现上述目的,本发明采用的技术方案的具体步骤如下:In order to achieve the above object, the concrete steps of the technical solution adopted in the present invention are as follows:
1)植物叶片数据的预处理1) Preprocessing of plant leaf data
先对原始采集的植物叶片图像进行去噪处理和平滑处理,然后进行植物叶片的图像分割,再将分割后的彩色图像转换为灰度图像,最后将灰度图像进行归一化和向量化处理,得到任一副植物叶片图像处理后的向量数据Xi和所有植物叶片图像预处理后的矩阵数据X。First, denoise and smooth the original collected plant leaf image, then segment the image of the plant leaf, then convert the segmented color image into a grayscale image, and finally normalize and vectorize the grayscale image , to obtain the vector data X i after processing any plant leaf image and the matrix data X after preprocessing all plant leaf images.
2)计算任一副植物叶片图像预处理后的向量数据Xi投影后的向量数据Yi 2) Calculate the vector data Y i after the projection of the vector data X i after the preprocessing of any pair of plant leaf images
A、建立流形之间差异度矩阵JD A. Establish the difference degree matrix J D between manifolds
根据所有植物叶片图像预处理后的矩阵数据X和类别信息矩阵H,建立流形之间差异度矩阵JD According to the preprocessed matrix data X and category information matrix H of all plant leaf images, establish the difference matrix J D between manifolds
式(1)中:Hij表示类别信息矩阵H的第i行第j列元素,In formula (1): H ij represents the i-th row and j-th column element of the category information matrix H,
Qii表示类别信息矩阵对角化矩阵Q的第i行第i列元素,Q ii represents the i-th row and i-column element of the category information matrix diagonalization matrix Q,
Qii=ΣjHij (3)Q ii =Σ j H ij (3)
B、建立基于稀疏表示流形局部结构矩阵JL B. Establish the local structure matrix J L based on the sparse representation manifold
根据所有植物叶片图像预处理后的矩阵数据X,建立基于稀疏表示流形局部结构矩阵JL According to the preprocessed matrix data X of all plant leaf images, establish a local structure matrix J L based on the sparse representation manifold
式(4)中:Sij表示稀疏表示系数矩阵第i行第j列元素;In formula (4): S ij represents the element in row i and column j of the sparse representation coefficient matrix;
Dii表示稀疏表示系数矩阵对角化矩阵D的第i行第i列元素;D ii represents the element in row i and column i of the diagonalization matrix D of the sparse representation coefficient matrix;
C、计算一副植物叶片图像处理后的向量数据Xi投影后的向量数据Yi C. Calculate the vector data Y i after projection of the vector data X i after the image processing of a pair of plant leaves
通过线性变化,得到一副植物叶片图像处理后的向量数据Xi投影后的向量数据Yi Through linear change, a pair of plant leaf image processed vector data X i and projected vector data Y i are obtained
Yi=WTXi (7)Y i = W T X i (7)
式(7)中:W表示变换矩阵,变换矩阵W通过如下目标函数获得:In formula (7): W represents the transformation matrix, and the transformation matrix W is obtained by the following objective function:
maxtr{WT(JD-JL)W} (8)maxtr{W T (J D -J L )W} (8)
对(JD-JL)进行特征值分解,Eigenvalue decomposition of (J D -J L ),
(JD-JL)ν=λν (9)(J D -J L ) ν = λν (9)
式(9)中:λ表示特征值;In formula (9): λ represents the eigenvalue;
ν表示特征向量。ν denotes the eigenvector.
将特征值λ按照由大到小顺序排列取前d个特征值所对应特征向量ν,组成投影矩阵W。Arrange the eigenvalues λ in descending order and take the eigenvectors ν corresponding to the first d eigenvalues to form the projection matrix W.
3)植物叶片数据的识别3) Identification of plant leaf data
对于每一未知类别的每一植物叶片图像处理后的向量数据Xi,获得一副植物叶片图像处理后的向量数据Xi投影后的向量数据Yi,然后在低维空间内采用最近邻法识别一副植物叶片图像处理后的向量数据Xi投影后的向量数据Yi的类别。For each plant leaf image processed vector data X i of each unknown category, obtain a pair of plant leaf image processed vector data X i and projected vector data Y i , and then use the nearest neighbor method in the low-dimensional space Identify the category of the vector data Y i after the projection of the vector data X i processed by a pair of plant leaf images.
所述最近邻法是:在低维空间进行分类时,采用K近邻分类器,K为1。The nearest neighbor method is: when classifying in a low-dimensional space, a K-nearest neighbor classifier is used, and K is 1.
由于采用上述技术方案,本发明的有益效果是:本发明为解决面向植物叶片图像的识别问题,基于同类植物叶片数据分布在同一流形上,不同类别植物叶片数据分布在不同流形上的假设,提出了一种基于稀疏表示的植物叶片数据识别方法,针对植物叶片数据,通过在局部近邻图中建立一种稀疏表示关系来获取线性表示系数,与传统的流形学习方法比较而言,本发明提供了一种更加鲁棒的线性表示系数的优化学习方法,同时以植物叶片数据类别信息定义流形间距离,并建立目标函数以最大化流形间距离为优化目标实现植物叶片数据分类需求,提高了植物叶片数据的识别效果。Due to the adoption of the above technical solution, the beneficial effects of the present invention are: the present invention solves the identification problem facing plant leaf images, based on the assumption that the data of similar plant leaves are distributed on the same manifold, and the data of different types of plant leaves are distributed on different manifolds , proposed a method for plant leaf data recognition based on sparse representation. For plant leaf data, a sparse representation relationship was established in the local neighbor graph to obtain linear representation coefficients. Compared with traditional manifold learning methods, this The invention provides a more robust optimization learning method for linear representation coefficients. At the same time, the distance between manifolds is defined by the category information of plant leaf data, and the objective function is established to maximize the distance between manifolds as the optimization goal to realize the classification requirements of plant leaf data. , which improves the recognition effect of plant leaf data.
具体实施方式Detailed ways
下面结合本具体实施方式对本发明作进一步的描述,并非对其保护范围的限制。The present invention will be further described below in conjunction with the present specific embodiment, which is not intended to limit the scope of protection thereof.
实施例1Example 1
一种基于稀疏表示的植物叶片数据识别方法。其具体步骤如下:A recognition method for plant leaf data based on sparse representation. The specific steps are as follows:
1)植物叶片数据的预处理1) Preprocessing of plant leaf data
本实施例原始采集的20类数据共1000副植物叶片图像,每幅图像为64×64像素。先对植物叶片图像进行去噪处理和平滑处理,然后进行植物叶片的图像分割,再按照RGB图像与灰度图像的转换方法,将分割后的彩色图像转换为灰度图像。最后将灰度图像进行归一化和向量化处理,得到一副植物叶片图像处理后的向量数据Xi为4096维和所有植物叶片图像预处理后的规模为1000×4096的矩阵数据X。In this embodiment, 20 types of data were originally collected, and a total of 1000 plant leaf images were collected, and each image was 64×64 pixels. First, denoise and smooth the plant leaf image, then segment the image of the plant leaf, and then convert the segmented color image into a grayscale image according to the conversion method of RGB image and grayscale image. Finally, the grayscale image is normalized and vectorized to obtain a vector data X i of 4096 dimensions after processing a pair of plant leaf images and a matrix data X of 1000×4096 after preprocessing all plant leaf images.
2)计算任一副植物叶片图像预处理后的向量数据Xi投影后的向量数据Yi 2) Calculate the vector data Y i after the projection of the vector data X i after the preprocessing of any pair of plant leaf images
A、建立流形之间差异度矩阵JD A. Establish the difference degree matrix J D between manifolds
根据所有植物叶片图像预处理后的矩阵数据X和类别信息矩阵H,建立规模为4096×4096的流形之间差异度矩阵JD According to the preprocessed matrix data X and category information matrix H of all plant leaf images, establish a difference matrix J D between manifolds with a scale of 4096×4096
式(1)中:Hij表示类别信息矩阵H的第i行第j列元素,In formula (1): H ij represents the i-th row and j-th column element of the category information matrix H,
Qii表示类别信息矩阵对角化矩阵Q的第i行第i列元素,Q ii represents the i-th row and i-column element of the category information matrix diagonalization matrix Q,
Qii=ΣjHij (3)Q ii =Σ j H ij (3)
B、建立基于稀疏表示流形局部结构矩阵JL B. Establish the local structure matrix J L based on the sparse representation manifold
根据所有植物叶片图像预处理后的矩阵数据X,建立规模为4096×4096的基于稀疏表示流形局部结构矩阵JL According to the preprocessed matrix data X of all plant leaf images, a local structure matrix J L based on a sparse representation manifold with a scale of 4096×4096 is established
式(4)中:Sij表示稀疏表示系数矩阵第i行第j列元素,In formula (4): S ij represents the sparse representation coefficient matrix element in row i and column j,
Dii表示稀疏表示系数矩阵对角化矩阵D的第i行第i列元素,D ii represents the element in the i-th row and i-column of the diagonalization matrix D of the sparse representation coefficient matrix,
C、计算一副植物叶片图像处理后的向量数据Xi投影后的向量数据Yi C. Calculate the vector data Y i after projection of the vector data X i after the image processing of a pair of plant leaves
通过线性变化,得到一副植物叶片图像处理后的向量数据Xi投影后的向量数据Yi Through linear change, a pair of plant leaf image processed vector data X i and projected vector data Y i are obtained
Yi=WTXi (7)Y i = W T X i (7)
式(7)中:W表示变换矩阵,变换矩阵W通过如下目标函数获得:In formula (7): W represents the transformation matrix, and the transformation matrix W is obtained by the following objective function:
maxtr{WT(JD-JL)W} (8)maxtr{W T (J D -J L )W} (8)
对(JD-JL)进行特征值分解,Eigenvalue decomposition of (J D -J L ),
(JD-JL)ν=λν (9)(J D -J L ) ν = λν (9)
式(9)中:λ表示特征值;In formula (9): λ represents the eigenvalue;
ν表示特征向量。ν denotes the eigenvector.
将特征值λ按照由大到小顺序排列取前d个特征值所对应特征向量ν,组成投影矩阵W,在对20类数据共1000副植物叶片图像进行计算时,取前38个特征值所对应特征向量,组成规模为4096×38的投影矩阵W。The eigenvalues λ are arranged in order from large to small, and the eigenvectors ν corresponding to the first d eigenvalues are taken to form the projection matrix W. When calculating 1000 plant leaf images of 20 types of data, the first 38 eigenvalues are taken. Corresponding to the eigenvectors, a projection matrix W with a size of 4096×38 is formed.
3)植物叶片数据的识别3) Identification of plant leaf data
对于每一未知类别的每一植物叶片图像处理后的向量数据Xi,获得一副植物叶片图像处理后的向量数据Xi投影后的向量数据Yi,然后在低维空间内采用最近邻法识别一副植物叶片图像处理后的向量数据Xi投影后的向量数据Yi的类别。For each plant leaf image processed vector data X i of each unknown category, obtain a pair of plant leaf image processed vector data X i and projected vector data Y i , and then use the nearest neighbor method in the low-dimensional space Identify the category of the vector data Y i after the projection of the vector data X i after the image processing of a pair of plant leaves.
本实施例所述最近邻法是:在低维空间进行分类时,采用K近邻分类器,K为1。The nearest neighbor method described in this embodiment is: when performing classification in a low-dimensional space, a K-nearest neighbor classifier is used, and K is 1.
重复实验10次,并对预测正确率计算平均值,比较于传统的流形学习方法LPP,本发明方法能提高凭据识别率2.34%。Repeat the experiment 10 times, and calculate the average value of the prediction accuracy rate. Compared with the traditional manifold learning method LPP, the method of the present invention can improve the credential recognition rate by 2.34%.
实施例2Example 2
一种基于稀疏表示的植物叶片数据识别方法。其具体步骤如下:A recognition method for plant leaf data based on sparse representation. The specific steps are as follows:
2)植物叶片数据的预处理2) Preprocessing of plant leaf data
本实施例原始采集的50类数据共1500副植物叶片图像,每幅图像为32×32像素。先对植物叶片图像进行去噪处理和平滑处理,然后进行植物叶片的图像分割,再按照RGB图像与灰度图像的转换方法,将分割后的彩色图像转换为灰度图像。最后将灰度图像进行归一化和向量化处理,得到一副植物叶片图像处理后的向量数据Xi为1024维和所有植物叶片图像预处理后的规模为1500×1024的矩阵数据X。In this embodiment, 50 categories of data were originally collected, with a total of 1500 plant leaf images, and each image is 32×32 pixels. First, denoise and smooth the plant leaf image, then segment the image of the plant leaf, and then convert the segmented color image into a grayscale image according to the conversion method of RGB image and grayscale image. Finally, the grayscale image is normalized and vectorized to obtain a vector data X i of 1024 dimensions after processing a pair of plant leaf images and a matrix data X of 1500×1024 after preprocessing all plant leaf images.
2)计算任一副植物叶片图像预处理后的向量数据Xi投影后的向量数据Yi 2) Calculate the vector data Y i after the projection of the vector data X i after the preprocessing of any pair of plant leaf images
A、建立流形之间差异度矩阵JD A. Establish the difference degree matrix J D between manifolds
根据所有植物叶片图像预处理后的矩阵数据X和类别信息矩阵H,建立规模为1024×1024的流形之间差异度矩阵JD:According to the preprocessed matrix data X and category information matrix H of all plant leaf images, a difference matrix J D between manifolds with a scale of 1024×1024 is established:
式(1)中:Hij表示类别信息矩阵H的第i行第j列元素,In formula (1): H ij represents the i-th row and j-th column element of the category information matrix H,
Qii表示类别信息矩阵对角化矩阵Q的第i行第i列元素,Q ii represents the i-th row and i-column element of the category information matrix diagonalization matrix Q,
Qii=ΣjHij (3)Q ii =Σ j H ij (3)
B、建立基于稀疏表示流形局部结构矩阵JL B. Establish the local structure matrix J L based on the sparse representation manifold
根据所有植物叶片图像预处理后的矩阵数据X,建立规模为1024×1024的基于稀疏表示流形局部结构矩阵JL According to the preprocessed matrix data X of all plant leaf images, establish a local structure matrix J L based on a sparse representation manifold with a scale of 1024×1024
式(4)中:Sij表示稀疏表示系数矩阵第i行第j列元素,In formula (4): S ij represents the sparse representation coefficient matrix element in row i and column j,
Dii表示稀疏表示系数矩阵对角化矩阵D的第i行第i列元素,D ii represents the element in the i-th row and i-column of the diagonalization matrix D of the sparse representation coefficient matrix,
C、计算一副植物叶片图像处理后的向量数据Xi投影后的向量数据Yi C. Calculate the vector data Y i after projection of the vector data X i after the image processing of a pair of plant leaves
通过线性变化,得到一副植物叶片图像处理后的向量数据Xi投影后的向量数据Yi Through linear change, a pair of plant leaf image processed vector data X i and projected vector data Y i are obtained
Yi=WTXi (7)Y i = W T X i (7)
式(7)中:W表示变换矩阵,变换矩阵W通过如下目标函数获得:In formula (7): W represents the transformation matrix, and the transformation matrix W is obtained by the following objective function:
maxtr{WT(JD-JL)W} (8)maxtr{W T (J D -J L )W} (8)
对(JD-JL)进行特征值分解,Eigenvalue decomposition of (J D -J L ),
(JD-JL)ν=λν (9)(J D -J L ) ν = λν (9)
式(9)中:λ表示特征值;In formula (9): λ represents the eigenvalue;
ν表示特征向量。ν represents the eigenvector.
将特征值λ按照由大到小顺序排列取前d个特征值所对应特征向量ν,组成投影矩阵W,在对50类数据共1500副植物叶片图像进行计算时,取前102个特征值所对应特征向量,组成规模为1024×102的投影矩阵W。The eigenvalues λ are arranged in order from large to small, and the eigenvectors ν corresponding to the first d eigenvalues are taken to form the projection matrix W. When calculating 1500 plant leaf images of 50 types of data, the first 102 eigenvalues are taken. Corresponding to the eigenvectors, a projection matrix W with a size of 1024×102 is formed.
3)植物叶片数据的识别3) Identification of plant leaf data
对于每一未知类别的每一植物叶片图像处理后的向量数据Xi,获得一副植物叶片图像处理后的向量数据Xi投影后的向量数据Yi,然后在低维空间内采用最近邻法识别一副植物叶片图像处理后的向量数据Xi投影后的向量数据Yi的类别。For each plant leaf image processed vector data X i of each unknown category, obtain a pair of plant leaf image processed vector data X i and projected vector data Y i , and then use the nearest neighbor method in the low-dimensional space Identify the category of the vector data Y i after the projection of the vector data X i after the image processing of a pair of plant leaves.
本实施例所述最近邻法是:在低维空间进行分类时,采用K近邻分类器,K为1。The nearest neighbor method described in this embodiment is: when performing classification in a low-dimensional space, a K-nearest neighbor classifier is used, and K is 1.
重复实验10次,并对预测正确率计算平均值,比较于传统的流形学习方法LPP,本发明方法能提高凭据识别率1.67%。Repeat the experiment 10 times, and calculate the average value of the prediction accuracy rate. Compared with the traditional manifold learning method LPP, the method of the present invention can improve the credential recognition rate by 1.67%.
本具体实施方式的有益效果是:本具体实施方式为解决面向植物叶片图像的识别问题,基于同类植物叶片数据分布在同一流形上,不同类别植物叶片数据分布在不同流形上的假设,提出了一种基于稀疏表示的植物叶片数据识别方法,针对植物叶片数据,通过在局部近邻图中建立一种稀疏表示关系来获取线性表示系数,与传统的流形学习方法比较而言,本具体实施方式提供了一种更加鲁棒的线性表示系数的优化学习方法,同时以植物叶片数据类别信息定义流形间距离,并建立目标函数以最大化流形间距离为优化目标实现植物叶片数据分类需求,提高了植物叶片数据的识别效果。The beneficial effects of this specific embodiment are: this specific embodiment is to solve the recognition problem of plant leaf images, based on the assumption that the data of similar plant leaves are distributed on the same manifold, and the data of different types of plant leaves are distributed on different manifolds. A method for identifying plant leaf data based on sparse representation is proposed. For plant leaf data, a sparse representation relationship is established in the local neighbor graph to obtain linear representation coefficients. Compared with traditional manifold learning methods, this specific implementation The method provides a more robust optimization learning method for linear representation coefficients, at the same time defines the inter-manifold distance with the plant leaf data category information, and establishes an objective function to maximize the inter-manifold distance as the optimization goal to meet the plant leaf data classification requirements , which improves the recognition effect of plant leaf data.
Claims (2)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310481334.0A CN103530658B (en) | 2013-10-15 | 2013-10-15 | A recognition method of plant leaf data based on sparse representation |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310481334.0A CN103530658B (en) | 2013-10-15 | 2013-10-15 | A recognition method of plant leaf data based on sparse representation |
Publications (2)
Publication Number | Publication Date |
---|---|
CN103530658A true CN103530658A (en) | 2014-01-22 |
CN103530658B CN103530658B (en) | 2016-08-10 |
Family
ID=49932654
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201310481334.0A Expired - Fee Related CN103530658B (en) | 2013-10-15 | 2013-10-15 | A recognition method of plant leaf data based on sparse representation |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103530658B (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104376566A (en) * | 2014-11-26 | 2015-02-25 | 武汉科技大学 | Strip steel surface defect image characteristic extraction method based on local characteristic spatial distance |
CN105160354A (en) * | 2015-08-19 | 2015-12-16 | 师韵 | Apple disease identifying method based on sparse representation |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101398898A (en) * | 2008-10-20 | 2009-04-01 | 中国科学院合肥物质科学研究院 | Plant leaf identification method based on manifold learning |
CN102156878A (en) * | 2011-04-08 | 2011-08-17 | 南京邮电大学 | Sparse embedding with manifold information-based human face identification method |
-
2013
- 2013-10-15 CN CN201310481334.0A patent/CN103530658B/en not_active Expired - Fee Related
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101398898A (en) * | 2008-10-20 | 2009-04-01 | 中国科学院合肥物质科学研究院 | Plant leaf identification method based on manifold learning |
CN102156878A (en) * | 2011-04-08 | 2011-08-17 | 南京邮电大学 | Sparse embedding with manifold information-based human face identification method |
Non-Patent Citations (4)
Title |
---|
RONGXIANG HU 等: "Multiscale Distance Matrix for Fast Plant Leaf Recognition", 《IEEE TRANSACTIONS ON IMAGE PROCESSING》 * |
张善文 等: "一种鲁棒的监督流形学习算法及其在植物叶片分类中的应用", 《模式识别与人工智能》 * |
王伟 等: "基于稀疏流形学习的SAR图像识别", 《电子学报》 * |
肖雪洋: "植物叶片图像识别特征的研究和在线识别系统实现", 《中国优秀硕士学位论文全文数据库 信息科技辑》 * |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104376566A (en) * | 2014-11-26 | 2015-02-25 | 武汉科技大学 | Strip steel surface defect image characteristic extraction method based on local characteristic spatial distance |
CN104376566B (en) * | 2014-11-26 | 2017-08-08 | 武汉科技大学 | Steel strip surface defect image characteristic extracting method based on local feature space length |
CN105160354A (en) * | 2015-08-19 | 2015-12-16 | 师韵 | Apple disease identifying method based on sparse representation |
Also Published As
Publication number | Publication date |
---|---|
CN103530658B (en) | 2016-08-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111860612B (en) | Unsupervised hyperspectral image hidden low-rank projection learning feature extraction method | |
WO2022160771A1 (en) | Method for classifying hyperspectral images on basis of adaptive multi-scale feature extraction model | |
WO2021003951A1 (en) | Hyperspectral image classification method based on label-constrained elastic network graph model | |
CN104268593B (en) | The face identification method of many rarefaction representations under a kind of Small Sample Size | |
CN103258214B (en) | Based on the Classifying Method in Remote Sensing Image of image block Active Learning | |
CN106023065A (en) | Tensor hyperspectral image spectrum-space dimensionality reduction method based on deep convolutional neural network | |
CN107145836B (en) | Hyperspectral image classification method based on stacked boundary identification self-encoder | |
CN109241995B (en) | An Image Recognition Method Based on Improved ArcFace Loss Function | |
CN109858557B (en) | Novel semi-supervised classification method for hyperspectral image data | |
CN109446985B (en) | Multi-angle Plant Recognition Method Based on Vector Neural Network | |
CN102915453B (en) | Real-time feedback and update vehicle detection method | |
CN104732244A (en) | Wavelet transform, multi-strategy PSO (particle swarm optimization) and SVM (support vector machine) integrated based remote sensing image classification method | |
CN103868865B (en) | A kind of material optimal classification recognition methods based on high-spectral data Informax | |
CN102289673A (en) | Method for selecting hyperspectral remote sensing image bands based on partial least squares | |
CN103049767B (en) | Aurora image classification method based on biological stimulation characteristic and manifold learning | |
CN107590515A (en) | The hyperspectral image classification method of self-encoding encoder based on entropy rate super-pixel segmentation | |
CN105205449A (en) | Sign language recognition method based on deep learning | |
CN101667292B (en) | SAR Image Segmentation System and Segmentation Method Based on Immune Cloning and Projection Pursuit | |
CN109543546B (en) | Gait age estimation method based on depth sequence distribution regression | |
CN106127240A (en) | A kind of classifying identification method of plant image collection based on nonlinear reconstruction model | |
CN111242056B (en) | Maximum-minimum distance embedded unsupervised hyperspectral image classification method | |
CN105320963B (en) | The semi-supervised feature selection approach of large scale towards high score remote sensing images | |
CN104182767A (en) | Active learning and neighborhood information combined hyperspectral image classification method | |
CN115496950A (en) | Image classification method based on semi-supervised discriminative dictionary pair learning with neighborhood information embedding | |
CN105631478A (en) | Plant classification method based on sparse expression dictionary learning |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20160810 Termination date: 20181015 |
|
CF01 | Termination of patent right due to non-payment of annual fee |