CN106203528B - It is a kind of that intelligent classification algorithm is drawn based on the 3D of Fusion Features and KNN - Google Patents
It is a kind of that intelligent classification algorithm is drawn based on the 3D of Fusion Features and KNN Download PDFInfo
- Publication number
- CN106203528B CN106203528B CN201610569138.2A CN201610569138A CN106203528B CN 106203528 B CN106203528 B CN 106203528B CN 201610569138 A CN201610569138 A CN 201610569138A CN 106203528 B CN106203528 B CN 106203528B
- Authority
- CN
- China
- Prior art keywords
- algorithm
- feature
- knn
- picture
- fusion
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 230000004927 fusion Effects 0.000 title claims abstract description 26
- 238000007635 classification algorithm Methods 0.000 title claims abstract description 12
- 238000004422 calculation algorithm Methods 0.000 claims abstract description 39
- 239000013598 vector Substances 0.000 claims abstract description 16
- 101100477520 Homo sapiens SHOX gene Proteins 0.000 claims abstract description 12
- 102000048489 Short Stature Homeobox Human genes 0.000 claims abstract description 12
- 108700025071 Short Stature Homeobox Proteins 0.000 claims abstract description 12
- 238000000034 method Methods 0.000 claims description 12
- 238000001914 filtration Methods 0.000 claims description 9
- 238000012549 training Methods 0.000 claims description 7
- 238000012360 testing method Methods 0.000 claims description 4
- 239000000203 mixture Substances 0.000 claims description 3
- 238000010606 normalization Methods 0.000 claims description 3
- 238000005070 sampling Methods 0.000 claims description 3
- 238000009472 formulation Methods 0.000 claims 2
- 238000000605 extraction Methods 0.000 claims 1
- 238000010422 painting Methods 0.000 abstract description 58
- 238000013461 design Methods 0.000 abstract description 5
- 238000002474 experimental method Methods 0.000 description 5
- 230000014509 gene expression Effects 0.000 description 4
- 238000011161 development Methods 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000010339 dilation Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/40—Extraction of image or video features
- G06V10/50—Extraction of image or video features by performing operations within image blocks; by using histograms, e.g. histogram of oriented gradients [HoG]; by summing image-intensity values; Projection analysis
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- General Physics & Mathematics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Biology (AREA)
- Evolutionary Computation (AREA)
- Bioinformatics & Computational Biology (AREA)
- General Engineering & Computer Science (AREA)
- Artificial Intelligence (AREA)
- Life Sciences & Earth Sciences (AREA)
- Multimedia (AREA)
- Image Processing (AREA)
- Image Analysis (AREA)
Abstract
一种基于特征融合和KNN的3D画智能分类算法,包括如下步骤:1)分别采用Lbp算法、Gist算法和Phog算法进行3D画图片特征提取得到对应的特征向量,2)将提取到的特征向量进行融合,得到融合特征;3)基于融合特征,采用KNN算法构建3D画智能分类器。本发明是一种将计算机与艺术相结合的创新性尝试,可以促进3D画的自动手机存储,为设计人员和大众提供丰富全面的3D画浏览和检索。在算法设计方面,LBP特征能提取图像中的纹理,Gist特征能提取图像空间包络,PHOG特征能提取局部图像边缘,这些特征的融合能全面地反映出墙画、地画、墙地画和凹墙角画的艺术区别,有助于提高3D画分类的准确度。
A 3D painting intelligent classification algorithm based on feature fusion and KNN, comprising the following steps: 1) using Lbp algorithm, Gist algorithm and Phog algorithm to extract 3D painting picture features to obtain corresponding feature vectors; 2) extracting the extracted feature vectors Fusion is performed to obtain fusion features; 3) Based on the fusion features, KNN algorithm is used to construct a 3D painting intelligent classifier. The invention is an innovative attempt to combine computer and art, which can promote automatic mobile phone storage of 3D paintings, and provide designers and the public with rich and comprehensive browsing and retrieval of 3D paintings. In terms of algorithm design, the LBP feature can extract the texture in the image, the Gist feature can extract the image space envelope, and the PHOG feature can extract the local image edge. The artistic distinction of concave corner paintings helps to improve the accuracy of 3D painting classification.
Description
技术领域technical field
本发明涉及3D画分类领域,特别是一种基于特征融合和KNN的3D画智能分类算法。The invention relates to the field of 3D painting classification, in particular to a 3D painting intelligent classification algorithm based on feature fusion and KNN.
背景技术Background technique
近年来,裸眼3D画以其特殊的艺术表现、超强的视觉震撼力以及极具趣味的互动性受到越来越多的关注与追捧,覆盖了广告,展会,家居等多个领域,具有广阔的发展前景。一般情况下,3D画是根据其艺术表现形式进行分类的,比如墙画、地画、墙地画、凹墙角画等。然而,随着3D画艺术的流行发展和商业应用,越来越多的作品被艺术家们创作出来。如何有效地收集汇总大量的3D画,构建3D画数据库,进行准确而快速的管理和检索3D画,成了3D画设计和应用行业的迫切需求。针对3D画这个垂直领域的服务,尚未有企业和组织进行相关工作。In recent years, naked-eye 3D paintings have attracted more and more attention and sought after due to their special artistic expression, strong visual shock and very interesting interactivity, covering advertising, exhibitions, home furnishing and other fields. development prospects. In general, 3D paintings are classified according to their artistic expressions, such as wall paintings, floor paintings, wall paintings, concave wall paintings, etc. However, with the popular development and commercial application of 3D painting art, more and more works are created by artists. How to effectively collect and summarize a large number of 3D paintings, build a 3D painting database, and manage and retrieve 3D paintings accurately and quickly has become an urgent need for the 3D painting design and application industry. For services in the vertical field of 3D painting, no enterprises and organizations have yet carried out related work.
发明内容SUMMARY OF THE INVENTION
本发明的主要目的在于提出一种基于特征融合和KNN的3D画智能分类方法,能对用户上传的图片进行基于艺术表现形式的智能识别和分类,有助于3D画的自动化存储系统构建。The main purpose of the present invention is to propose an intelligent classification method for 3D paintings based on feature fusion and KNN, which can intelligently identify and classify pictures uploaded by users based on artistic expressions, and is helpful for the construction of an automatic storage system for 3D paintings.
本发明采用如下技术方案:The present invention adopts following technical scheme:
一种基于特征融合和KNN的3D画智能分类算法,其特征在于:1)分别采用Lbp算法、Gist算法和Phog算法进行3D画图片特征提取得到对应的特征向量,2)将提取到的特征向量进行融合,得到融合特征;3)基于融合特征,采用KNN算法构建3D画智能分类器。A 3D painting intelligent classification algorithm based on feature fusion and KNN, which is characterized in that: 1) using Lbp algorithm, Gist algorithm and Phog algorithm to extract 3D painting image features to obtain corresponding feature vectors, 2) extracting the extracted feature vectors Fusion is performed to obtain fusion features; 3) Based on the fusion features, KNN algorithm is used to construct a 3D painting intelligent classifier.
优选的,在步骤1)中,将一张大小为M×N的3D画图像img通过旋转不变模式LPB算法提取若干个关键特征点,生成特征向量FVLBP。Preferably, in step 1), a 3D drawing image img with a size of M×N is used to extract several key feature points through a rotation invariant mode LPB algorithm to generate a feature vector FV LBP .
优选的,所述LPB算法具体如下Preferably, the LPB algorithm is specifically as follows
1.1A)以R为半径的P点邻域,gc为中心,gp为邻域点,区分邻域比中心亮度大还是小的方法是式中s(x)=1(x≥0),s(x)=0(x<0);1.1A) The P point neighborhood with R as the radius, g c as the center, and g p as the neighborhood point, the method to distinguish whether the neighborhood is larger or smaller than the center brightness is: where s(x)=1(x≥0), s(x)=0(x<0);
1.2A)在环形邻居的点集中,若中心像素的位置为(x,y),则邻近像素点gi的位置计算为: 1.2A) In the point set of circular neighbors, if the position of the central pixel is (x, y), the position of the adjacent pixel gi is calculated as:
1.3A)对环形邻居集上的编码进行按右循环右移操作ROR,获得LBP旋转不变编码,取值最小的编码为最后的LBP编码。旋转操作如下: 1.3A) Perform the ROR operation on the code on the circular neighbor set to obtain the LBP rotation invariant code, and the code with the smallest value is the last LBP code. The rotation operation is as follows:
1.4A)通过测度U将LBP编码中的U≤2的编码归为等价模式类,除等价模式类以外的模式都归为另一类,称为混合模式类,测度U定义为 1.4A) The code with U≤2 in LBP coding is classified into the equivalent pattern class by the measure U, and the patterns other than the equivalent pattern class are classified into another class, which is called the mixed pattern class, and the measure U is defined as
1.5A)经过旋转和归一化,一幅图片的LBP特征是包含P+2个特征点。1.5A) After rotation and normalization, the LBP feature of a picture contains P+2 feature points.
优选的,在步骤1)中,将一幅大小为M×N的3D画图像img通过Gist算法提取若干个关键特征点,生成特征向量FVGist。Preferably, in step 1), a 3D painting image img with a size of M×N is used to extract several key feature points through the Gist algorithm to generate a feature vector FV Gist .
5、如权利要求4所述的一种基于特征融合和KNN的3D画智能分类算法,其特征在于:所述Gist算法具体如下5. A 3D painting intelligent classification algorithm based on feature fusion and KNN according to claim 4, wherein the Gist algorithm is as follows:
1.1B)将3D图相灰度化并缩放成128*128;1.1B) Grayscale and scale the 3D image to 128*128;
1.2B)将一幅大小为r×c的灰度图像f(x,y)划分成np*np规格的网格,则网格块数为ng=np*np,各个网格块按行依次记作Pi,其中i=1,2,...,ng;网格块的大小为r′×c′,其中r′=r/np,c′=c/np;1.2B) Divide a grayscale image f(x, y) of size r×c into grids of n p *n p specifications, then the number of grid blocks is n g =n p *n p . The grid blocks are denoted as Pi in row order, where i =1,2,...,n g ; the size of grid blocks is r′×c′, where r′=r/n p ,c′=c/ n p ;
1.3B)分别用nc个通道的滤波器对图像进行卷积滤波,则每个网格块各通道滤波后级联的结果成为块Gist(PG)特征,即1.3B) Perform convolution filtering on the image with filters of n c channels respectively, then the result of concatenation after filtering of each channel of each grid block becomes the block Gist (PG) feature, that is,
其中(x,y)∈Pi,gmn(x,y)=α-mg(x′,y′),α>1是多尺度多方向Gabor滤波器,x′=α-m(xcosθ+ycosθ),y′=α-m(-xsinθ+ycosθ),θ=nπ/(n+1),式中x,y为图像像素坐标位置,σx,σy分别是x和y方向上高斯因子的方差,f0是滤波器中心频率,φ是该谐波因子的相位差,α-m为母小波膨胀的尺度因子,θ为旋转角度,m为尺度数,n为方向数,GP的维数为nc×r′×c′;where (x,y)∈P i , g mn (x,y)=α- m g(x′,y′),α>1 is a multi-scale and multi-directional Gabor filter, x′=α- m (xcosθ +ycosθ),y′=α- m (-xsinθ+ycosθ),θ=nπ/(n+1), where x and y are the image pixel coordinate positions, σ x , σ y are the variances of the Gaussian factors in the x and y directions, respectively, f 0 is the center frequency of the filter, φ is the phase difference of the harmonic factor, and α -m is The scale factor of the mother wavelet dilation, θ is the rotation angle, m is the scale number, n is the direction number, and the dimension of G P is n c ×r′×c′;
1.4B)对GP各通道滤波结果取均值后按行组合的结果称为全局Gist(GG)特征,即其中GG的维数为nc×ng。1.4B) The result of taking the average of the filtering results of each channel of G P and then combining them by row is called the global Gist (GG) feature, that is, in The dimension of G G is n c ×n g .
优选的,在步骤1)中,将一幅大小为M×N的3D画图像img通过PHOG算法提取若干个关键特征点,生成特征向量FVPHOG。Preferably, in step 1), a 3D painting image img with a size of M×N is used to extract several key feature points through the PHOG algorithm to generate a feature vector FV PHOG .
优选的,所述PHOG算法具体如下Preferably, the PHOG algorithm is specifically as follows
1.1C)选择第一层划分,划分成1*1小cells;1.1C) Select the first layer division and divide it into 1*1 small cells;
1.2C)计算每个cell中每个pixel的gradient,即式中Ix,Iy代表水平和垂直方向上的梯度值,M(x,y),θ(x,y)分别表示梯度的幅度值和方向;1.2C) Calculate the gradient of each pixel in each cell, ie where I x and I y represent the gradient values in the horizontal and vertical directions, and M(x, y) and θ(x, y) represent the magnitude and direction of the gradient, respectively;
1.3C)将360度分割成8个bin,,每个bin包含45度,整个直方图包含8维,然后根据每个像素点的梯度方向,利用双线性内插值法将其幅值累加到直方图中,此时得到整幅图的小HOG特征为1*8=8;1.3C) Divide 360 degrees into 8 bins, each bin contains 45 degrees, and the entire histogram contains 8 dimensions, and then according to the gradient direction of each pixel point, the bilinear interpolation method is used to accumulate its amplitude to In the histogram, the small HOG feature of the whole image is obtained at this time as 1*8=8;
1.4C)选择第二层划分,划分成2*2小cells,回到1.2C),直至整幅图的小HOG特征为4*8=32,进入1.5C);1.4C) Select the second layer division, divide it into 2*2 small cells, go back to 1.2C), until the small HOG feature of the whole picture is 4*8=32, enter 1.5C);
1.5C)选择第三层划分,划分成4*4小cells,回到1.2C),直至整幅图的小HOG特征为16*8=128,进入1.6);1.5C) Select the third layer division, divide it into 4*4 small cells, return to 1.2C), until the small HOG feature of the whole picture is 16*8=128, enter 1.6);
1.6C)选择第四层划分,划分成8*8小cells,回到1.2),直至整幅图的小HOG特征为64*8=512,进入1.7);1.6C) Select the fourth layer division, divide it into 8*8 small cells, go back to 1.2), until the small HOG feature of the whole picture is 64*8=512, enter 1.7);
1.7C)对四层小HOG特征进行归一化后级联,则得到总特征为8+32+128+512=680。1.7C) After normalizing and concatenating the four-layer small HOG features, the total features are 8+32+128+512=680.
优选的,在步骤3)中基于融合特征使用KNN分类器对3D画数据集进行算法实验,具体为:数据集包括墙画、地画、墙地画和凹墙角;各分类随机选择一半样本作为训练数据集,余下的样本作为测试集数据,该随机抽样实验重复3次,取平均值作为报告结果;在训练过程中,距离度量为欧式距离。Preferably, in step 3), based on the fusion feature, KNN classifier is used to perform algorithm experiments on the 3D painting data set, specifically: the data set includes wall paintings, floor paintings, wall and floor paintings and concave wall corners; each classification randomly selects half of the samples as The training data set, the remaining samples are used as the test set data, the random sampling experiment is repeated 3 times, and the average value is taken as the report result; in the training process, the distance metric is the Euclidean distance.
由上述对本发明的描述可知,与现有技术相比,本发明具有如下有益效果:As can be seen from the above description of the present invention, compared with the prior art, the present invention has the following beneficial effects:
本发明是一种将计算机与艺术相结合的创新性尝试,可以促进3D画的自动手机存储,为设计人员和大众提供丰富全面的3D画浏览和检索。在算法设计方面,LBP特征能提取图像中的纹理,Gist特征能提取图像空间包络,PHOG特征能提取局部图像边缘,这些特征的融合能全面地反映出墙画、地画、墙地画和凹墙角画的艺术区别,有助于提高3D画分类的准确度。The invention is an innovative attempt to combine computer and art, which can promote automatic mobile phone storage of 3D paintings, and provide designers and the public with rich and comprehensive browsing and retrieval of 3D paintings. In terms of algorithm design, the LBP feature can extract the texture in the image, the Gist feature can extract the image space envelope, and the PHOG feature can extract the local image edge. The artistic distinction of concave corner paintings helps to improve the accuracy of 3D painting classification.
附图说明Description of drawings
图1为本发明分类结果示意图。Figure 1 is a schematic diagram of the classification results of the present invention.
具体实施方式Detailed ways
以下通过具体实施方式对本发明作进一步的描述。The present invention will be further described below through specific embodiments.
一种基于特征融合和KNN的3D画智能分类算法,包括如下步骤A 3D painting intelligent classification algorithm based on feature fusion and KNN, including the following steps
1)分别采用Lbp算法、Gist算法和Phog算法进行3D画图片特征提取得到对应的特征向量,具体如下:1) The Lbp algorithm, the Gist algorithm and the Phog algorithm are respectively used to extract the features of the 3D pictures to obtain the corresponding feature vectors, as follows:
1A)将一张大小为M×N的3D画图像img通过旋转不变模式LPB算法提取18个关键特征点,生成特征向量FVLBP,记为FVLBP=(x1,x2,…,x18),具体如下:1A) Extract 18 key feature points from a 3D painting image img with a size of M×N through the rotation invariant mode LPB algorithm to generate a feature vector FV LBP , denoted as FV LBP = (x 1 , x 2 ,...,x 18 ), as follows:
1.1A)以R为半径的P点邻域,gc为中心,gp为邻域点,区分邻域比中心亮度大还是小的方法是式中s(x)=1(x≥0),s(x)=0(x<0)。具体地,R=2,P=16。1.1A) The P point neighborhood with R as the radius, g c as the center, and g p as the neighborhood point, the method to distinguish whether the neighborhood is larger or smaller than the center brightness is: In the formula, s(x)=1 (x≥0), s(x)=0 (x<0). Specifically, R=2, P=16.
1.2A)在环形邻居的点集中,那些没有刚好落在像素中心位置的像素点的灰度值是由邻近像素通过双线性插值计算得到的。如果中心像素的位置为(x,y),则邻近像素点gi的位置计算为: 1.2A) In the point set of circular neighbors, the gray values of those pixels that do not just fall in the center of the pixel are calculated by bilinear interpolation from neighboring pixels. If the position of the center pixel is (x, y), the position of the adjacent pixel gi is calculated as:
1.3A)对环形邻居集上的编码进行按右循环右移操作ROR,获得LBP旋转不变编码,取值最小的编码为最后的LBP编码。旋转操作如下: 1.3A) Perform the ROR operation on the code on the circular neighbor set to obtain the LBP rotation invariant code, and the code with the smallest value is the last LBP code. The rotation operation is as follows:
1.4A)通过测度U将LBP编码中的U≤2的编码归为等价模式类,除等价模式类以为的模式都归为另一类,称为混合模式类。测度U定义为 1.4A) The codes with U≤2 in LBP coding are classified into the equivalent mode class by measuring U, and the modes except the equivalent mode class are classified into another class, which is called the mixed mode class. The measure U is defined as
1.5A)经过旋转和归一化,一幅图片的LBP模式(特征)是P+2=18。1.5A) After rotation and normalization, the LBP mode (feature) of a picture is P+2=18.
1B)将一幅大小为M×N的3D画图像img通过Gist算法提取512个关键特征点,生成特征向量FVGist,记为FVGist=(x1,x2,…,x512),具体如下:1B) Extract 512 key feature points from a 3D painting image img with a size of M×N through the Gist algorithm, and generate a feature vector FV Gist , denoted as FV Gist = (x 1 , x 2 , . . . , x 512 ), specifically as follows:
1.1B)将图片灰度化并缩放成128*128。1.1B) Grayscale and scale the image to 128*128.
1.2B)将一幅大小为r×c的灰度图像f(x,y)划分成np*np的规格网格,则网格块数为ng=np*np。各个网格块按行依次记作Pi,其中i=1,2,...,ng。网格块的大小为r′×c′,其中r′=r/np,c′=c/np。具体地,r=128,c=128,np=4。1.2B) Divide a grayscale image f(x, y) with a size of r×c into a standard grid of n p *n p , then the number of grid blocks is n g =n p *n p . Each grid block is denoted as P i row by row, where i=1,2,..., ng . The size of the grid block is r'xc', where r'=r/ np and c'=c/ np . Specifically, r=128, c=128, np =4.
1.3B)分别用nc个通道的滤波器对图像进行卷积滤波,则每个网格块各通道滤波后级联的结果成为块Gist(PG)特征,即其中(x,y)∈Pi,gmn(x,y)是多尺度多方向Gabor滤波器,即gmn(x,y)=α-mg(x′,y′),α>1,其中x′=α-m(xcosθ+ycosθ),y′=α-m(-xsinθ+ycosθ),θ=nπ/(n+1),式中x,y为图像像素坐标位置,σx,σy分别是x和y方向上高斯因子的方差,f0是滤波器中心频率,φ是该谐波因子的相位差,α-m为母小波膨胀的尺度因子,θ为旋转角度,即滤波器的方向。m为尺度数,n为方向数。GP的维数为nc×r′×c′。具体地,nc为4尺度8方向共32个Gabor滤波器。1.3B) Perform convolution filtering on the image with filters of n c channels respectively, then the result of concatenation after filtering of each channel of each grid block becomes the block Gist (PG) feature, that is, where (x,y)∈P i , g mn (x,y) is a multi-scale and multi-directional Gabor filter, that is, g mn (x, y)=α- m g(x′,y′),α>1 , where x′=α- m (xcosθ+ycosθ), y′=α- m (-xsinθ+ycosθ), θ=nπ/(n+1), where x and y are the image pixel coordinate positions, σ x , σ y are the variances of the Gaussian factors in the x and y directions, respectively, f 0 is the center frequency of the filter, φ is the phase difference of the harmonic factor, and α -m is The scale factor of the mother wavelet expansion, θ is the rotation angle, that is, the direction of the filter. m is the number of scales, and n is the number of directions. The dimension of G P is n c ×r′×c′. Specifically, n c is a total of 32 Gabor filters in 4 scales and 8 directions.
1.4B)对GP各通道滤波结果取均值后按行组合的结果称为全局Gist(GG)特征,即其中GG的维数为nc×ng=32*16=512。1.4B) The result of taking the average of the filtering results of each channel of G P and then combining them by row is called the global Gist (GG) feature, that is, in The dimension of G G is n c ×n g =32*16=512.
1C)将一幅大小为M×N的3D画图像img通过PHOG算法提取680个关键特征点,生成特征向量FVPHOG,记为FVPHOG=(x1,x2,…,x680),具体如下:1C) Extract 680 key feature points from a 3D painting image img with a size of M×N through the PHOG algorithm to generate a feature vector FV PHOG , denoted as FV PHOG = (x 1 , x 2 , . . . , x 680 ), specifically as follows:
1.1C)选择第一层划分,划分成1*1小cells。1.1C) Select the first layer division and divide it into 1*1 small cells.
1.2C)计算每个cell中每个pixel的gradient(即orientation),即式中Ix,Iy计代表水平和垂直方向上的梯度值,M(x,y),θ(x,y)分别表示梯度的幅度值和方向。1.2C) Calculate the gradient (ie orientation) of each pixel in each cell, ie In the formula, I x and I y represent the gradient values in the horizontal and vertical directions, and M(x, y) and θ(x, y) represent the magnitude and direction of the gradient, respectively.
1.3C)将360度分割成8个bin,,每个bin包含45度,整个直方图包含8维。然后根据每个像素点的梯度方向,利用双线性内插值法将其幅值累加到直方图中。此时得到整幅图的小HOG特征为1*8=8。1.3C) Divide 360 degrees into 8 bins, each bin contains 45 degrees, and the entire histogram contains 8 dimensions. Then according to the gradient direction of each pixel point, its amplitude is accumulated into the histogram using bilinear interpolation method. At this time, the small HOG feature of the whole image is obtained as 1*8=8.
1.4C)选择第二层划分,划分成2*2小cells。回到步骤1.2C),直至整幅图的小HOG特征为4*8=32,进入1.5C)。1.4C) Select the second layer division and divide it into 2*2 small cells. Go back to step 1.2C), until the small HOG feature of the whole image is 4*8=32, and enter 1.5C).
1.5C)选择第三层划分,划分成4*4小cells。回到步骤1.2C),直至整幅图的小HOG特征为16*8=128,进入1.6C)。1.5C) Select the third layer division and divide it into 4*4 small cells. Go back to step 1.2C), until the small HOG feature of the whole picture is 16*8=128, and enter 1.6C).
1.6C)选择第四层划分,划分成8*8小cells。回到步骤1.2C),直至整幅图的小HOG特征为64*8=512,进入1.7C)。1.6C) Select the fourth layer to divide into 8*8 small cells. Go back to step 1.2C), until the small HOG feature of the whole picture is 64*8=512, and enter 1.7C).
1.7C)对四层小HOG特征进行归一化后级联,则得到总特征为8+32+128+512=680。1.7C) After normalizing and concatenating the four-layer small HOG features, the total features are 8+32+128+512=680.
2)将上述三种算法提取到的特征进行融合,得到最后的融合特征FVmix=(FVLBP,FVGist,FVPHOG)。该融合特征的个数是18+512+680=1210。2) The features extracted by the above three algorithms are fused to obtain the final fusion feature FV mix =(FV LBP , FV Gist , FV PHOG ). The number of the fusion features is 18+512+680=1210.
3)基于融合特征使用KNN分类器对3D画数据集进行算法实验。本发明主要是对3D画进行基于艺术表现形式的智能分类,因此数据集按照墙画786、地画800、墙地画138、凹墙角46来构成。对1770*1210的4类数据集按5:5选取训练集和测试集,各分类随机选择一半样本作为训练数据集,余下的样本作为测试集数据。该随机抽样实验重复3次,取平均值值为报告结果。在训练过程中,距离度量为欧式距离。对K设置不同值,从1到20进行实验,取最优值K=4为分类器参数。以上KNN算法模型能够对新出现的3D画进行智能分类。3) Based on the fusion features, the KNN classifier is used to perform algorithm experiments on the 3D painting dataset. The present invention mainly performs intelligent classification of 3D paintings based on artistic expressions, so the data set is composed of wall paintings 786, floor paintings 800, wall and floor paintings 138, and concave wall corners 46. For the 1770*1210 4 types of data sets, the training set and the test set are selected at a ratio of 5:5, and half of the samples are randomly selected as the training data set for each classification, and the remaining samples are used as the test set data. The random sampling experiment was repeated 3 times, and the average value was taken as the reported result. During training, the distance metric is Euclidean distance. Different values are set for K, from 1 to 20 for experiments, and the optimal value K=4 is taken as the classifier parameter. The above KNN algorithm model can intelligently classify new 3D paintings.
参照图1,将本发明方法嵌入到3D画展览系统中。该系统允许用户上传3D画图片并且按照“场所”、“表现”和“题材”进行分类展示。本算法完成了对3D画的“表现”类别的自动智能识别。用户上传3D画后,系统将自动识别所上传3D画的表现形式(墙画、地画、墙地画或凹墙角),其余分类仍需用户手动选择。分类结果如图1所示。Referring to Fig. 1, the method of the present invention is embedded in a 3D painting exhibition system. The system allows users to upload 3D pictures and display them in categories according to "place", "performance" and "theme". This algorithm completes the automatic and intelligent identification of the "performance" category of 3D paintings. After the user uploads the 3D painting, the system will automatically identify the representation of the uploaded 3D painting (wall painting, floor painting, wall painting or concave wall corner), and the user still needs to manually select the remaining categories. The classification results are shown in Figure 1.
本发明是一种将计算机与艺术相结合的创新性尝试,可以促进3D画的自动手机存储,为设计人员和大众提供丰富全面的3D画浏览和检索。在算法设计方面,LBP特征能提取图像中的纹理,Gist特征能提取图像空间包络,PHOG特征能提取局部图像边缘,这些特征的融合能全面地反映出墙画、地画、墙地画和凹墙角画的艺术区别,有助于提高3D画分类的准确度。The invention is an innovative attempt to combine computer and art, which can promote automatic mobile phone storage of 3D paintings, and provide designers and the public with rich and comprehensive browsing and retrieval of 3D paintings. In terms of algorithm design, the LBP feature can extract the texture in the image, the Gist feature can extract the image space envelope, and the PHOG feature can extract the local image edge. The artistic distinction of concave corner paintings helps to improve the accuracy of 3D painting classification.
上述仅为本发明的具体实施方式,但本发明的设计构思并不局限于此,凡利用此构思对本发明进行非实质性的改动,均应属于侵犯本发明保护范围的行为。The above are only specific embodiments of the present invention, but the design concept of the present invention is not limited to this, and any non-substantial modification of the present invention by using this concept should be regarded as an act of infringing the protection scope of the present invention.
Claims (6)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610569138.2A CN106203528B (en) | 2016-07-19 | 2016-07-19 | It is a kind of that intelligent classification algorithm is drawn based on the 3D of Fusion Features and KNN |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610569138.2A CN106203528B (en) | 2016-07-19 | 2016-07-19 | It is a kind of that intelligent classification algorithm is drawn based on the 3D of Fusion Features and KNN |
Publications (2)
Publication Number | Publication Date |
---|---|
CN106203528A CN106203528A (en) | 2016-12-07 |
CN106203528B true CN106203528B (en) | 2019-07-09 |
Family
ID=57493290
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610569138.2A Active CN106203528B (en) | 2016-07-19 | 2016-07-19 | It is a kind of that intelligent classification algorithm is drawn based on the 3D of Fusion Features and KNN |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106203528B (en) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106960176B (en) * | 2017-02-22 | 2020-03-10 | 华侨大学 | Pedestrian gender identification method based on transfinite learning machine and color feature fusion |
CN109376680A (en) * | 2018-11-06 | 2019-02-22 | 大连理工大学 | A fast face recognition method based on the efficient fusion of Hog and Gabor features based on near-infrared face images |
CN110136124A (en) * | 2019-05-17 | 2019-08-16 | 江门市中心医院 | A Pleural Contact Pulmonary Nodule Segmentation Method Based on Robust Velocity Function |
CN113688829B (en) * | 2021-08-05 | 2024-02-20 | 南京国电南自电网自动化有限公司 | Automatic identification method and system for monitoring picture of transformer substation |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103400144A (en) * | 2013-07-17 | 2013-11-20 | 山东师范大学 | Active learning method based on K-neighbor for support vector machine (SVM) |
CN104392240A (en) * | 2014-10-28 | 2015-03-04 | 中国疾病预防控制中心寄生虫病预防控制所 | Parasite egg identification method based on multi-feature fusion |
CN105512682A (en) * | 2015-12-07 | 2016-04-20 | 南京信息工程大学 | Secret level marking identification method based on Krawtchouk moment and KNN-SMO classifier |
-
2016
- 2016-07-19 CN CN201610569138.2A patent/CN106203528B/en active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103400144A (en) * | 2013-07-17 | 2013-11-20 | 山东师范大学 | Active learning method based on K-neighbor for support vector machine (SVM) |
CN104392240A (en) * | 2014-10-28 | 2015-03-04 | 中国疾病预防控制中心寄生虫病预防控制所 | Parasite egg identification method based on multi-feature fusion |
CN105512682A (en) * | 2015-12-07 | 2016-04-20 | 南京信息工程大学 | Secret level marking identification method based on Krawtchouk moment and KNN-SMO classifier |
Non-Patent Citations (1)
Title |
---|
基于叶片图像多特征融合的观叶植物种类识别;王丽君 等;《北京林业大学学报》;20150130;第37卷(第1期);第55-61页 |
Also Published As
Publication number | Publication date |
---|---|
CN106203528A (en) | 2016-12-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN105550381B (en) | A kind of efficient image search method based on improvement SIFT feature | |
Saxena et al. | Make3d: Learning 3d scene structure from a single still image | |
US20190156567A1 (en) | 3D Virtual Environment Generating Method and Device | |
US7421125B1 (en) | Image analysis, editing and search techniques | |
US9626585B2 (en) | Composition modeling for photo retrieval through geometric image segmentation | |
CN105809651B (en) | Image saliency detection method based on edge dissimilarity contrast | |
CN106203528B (en) | It is a kind of that intelligent classification algorithm is drawn based on the 3D of Fusion Features and KNN | |
CN104200240A (en) | Sketch retrieval method based on content adaptive Hash encoding | |
CN104616247B (en) | A kind of method for map splicing of being taken photo by plane based on super-pixel SIFT | |
CN110019912A (en) | Graphic searching based on shape | |
CN105913377A (en) | Image splicing method for reserving image correlation information | |
Magliani et al. | An accurate retrieval through R-MAC+ descriptors for landmark recognition | |
US20210295478A1 (en) | Method and apparatus for recognizing landmark in panoramic image and non-transitory computer-readable medium | |
Zhang et al. | Perception-based shape retrieval for 3D building models | |
Li et al. | The research on traffic sign recognition based on deep learning | |
Park et al. | Estimating the camera direction of a geotagged image using reference images | |
CN104809721B (en) | A kind of caricature dividing method and device | |
Zhou et al. | Computation and memory efficient image segmentation | |
Andrushia et al. | An efficient visual saliency detection model based on Ripplet transform | |
CN112884884A (en) | Candidate region generation method and system | |
CN106980878A (en) | The determination method and device of three-dimensional model geometric style | |
CN108256425B (en) | A method of harbour container is extracted using Remote Spectra efficient information rate | |
Yang et al. | MLFNet-point cloud semantic segmentation convolution network based on multi-scale feature fusion | |
Kobyshev et al. | 3D saliency for finding landmark buildings | |
Shen et al. | Raster-based method for building selection in the multi-scale representation of two-dimensional maps |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |