CN112364726A - Part code spraying character positioning method based on improved EAST - Google Patents
Part code spraying character positioning method based on improved EAST Download PDFInfo
- Publication number
- CN112364726A CN112364726A CN202011163480.5A CN202011163480A CN112364726A CN 112364726 A CN112364726 A CN 112364726A CN 202011163480 A CN202011163480 A CN 202011163480A CN 112364726 A CN112364726 A CN 112364726A
- Authority
- CN
- China
- Prior art keywords
- improved
- candidate
- east
- area
- loss function
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 36
- 238000005507 spraying Methods 0.000 title claims abstract description 10
- 230000006870 function Effects 0.000 claims abstract description 31
- 238000012545 processing Methods 0.000 claims abstract description 14
- 238000012805 post-processing Methods 0.000 claims abstract description 7
- 238000012549 training Methods 0.000 claims description 17
- 230000008447 perception Effects 0.000 claims description 7
- 238000011176 pooling Methods 0.000 claims description 6
- 230000009467 reduction Effects 0.000 claims description 5
- 230000006872 improvement Effects 0.000 claims description 4
- 230000008859 change Effects 0.000 claims description 3
- 230000007704 transition Effects 0.000 claims description 3
- 238000013135 deep learning Methods 0.000 abstract description 3
- 230000000007 visual effect Effects 0.000 abstract description 2
- 230000000694 effects Effects 0.000 description 10
- 238000001514 detection method Methods 0.000 description 9
- 230000008569 process Effects 0.000 description 5
- 238000005516 engineering process Methods 0.000 description 3
- 238000013528 artificial neural network Methods 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 238000002372 labelling Methods 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 238000004891 communication Methods 0.000 description 1
- 238000013527 convolutional neural network Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 239000004576 sand Substances 0.000 description 1
- 230000011218 segmentation Effects 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 230000001629 suppression Effects 0.000 description 1
- 230000002123 temporal effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/20—Image preprocessing
- G06V10/22—Image preprocessing by selection of a specific region containing or referencing a pattern; Locating or processing of specific regions to guide the detection or recognition
- G06V10/225—Image preprocessing by selection of a specific region containing or referencing a pattern; Locating or processing of specific regions to guide the detection or recognition based on a marking or identifier characterising the area
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/40—Document-oriented image-based pattern recognition
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- Computational Linguistics (AREA)
- Molecular Biology (AREA)
- Health & Medical Sciences (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Multimedia (AREA)
- Data Mining & Analysis (AREA)
- Evolutionary Computation (AREA)
- General Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Character Input (AREA)
- Image Analysis (AREA)
Abstract
The invention provides a part code spraying character positioning method based on improved EAST, which is based on the existing EAST network structure, improves the network structure, label generation, loss function and candidate area processing of the existing EAST network, obtains the improved network structure, the improved label generation, the improved loss function and candidate area post-processing, and positions characters by using the improved EAST network. According to the method and the device, on the basis of the existing EAST algorithm, the network structure, the label generation, the loss function and the candidate area post-processing are optimized, the improved deep learning network can effectively enlarge the experience visual field, the recognition efficiency of long texts is improved, and the accurate positioning of part code spraying characters is realized.
Description
Technical Field
The invention relates to the technical field of character recognition, in particular to a part code spraying character positioning method based on improved EAST.
Background
With the development of information science and technology, image processing and machine vision technologies have been widely used in the industrial field. As the most widely used machine vision technology, character text detection has been the focus and difficulty of research. The primary step of character text detection is to locate regions of character text. The EAST algorithm, one of the excellent natural scene text localization algorithms, performs well in a number of public data sets. However, due to the limitations of the size of the receptive field and the setting of the training process, the EAST algorithm is not ideal enough in character positioning, and the detection effect of the EAST algorithm on long texts needs to be improved.
Therefore, an end-to-end character recognition method with less dependency on character segmentation and high communication is needed.
Disclosure of Invention
In view of the above, the present invention provides a method for positioning a part code-spraying character based on improved EAST, which is characterized in that: the method is based on the existing EAST network structure, improves the network structure, label generation, loss function and candidate area processing of the existing EAST network, obtains the improved network structure, the improved label generation, the improved loss function and candidate area post-processing, and uses the improved EAST network to position characters;
the network structure for improving the existing EAST network specifically comprises:
constructing a VGG16 model, wherein the VGG16 model comprises 5 convolutional layers and 5 pooling layers, the size of a convolutional kernel is 3 multiplied by 3, and the step length is 1;
improving a VGG16 model, and replacing the convolution operation of the last stage in the VGG16 network structure with the mixed hole convolution HDC with a hole coefficient of [1,2,5 ];
the improved tag generation specifically comprises: reducing a quadrangle formed by a character area of the target training image by adopting a mode of differential treatment of long and short sides;
the improvement loss function specifically includes: replacing a balance cross entropy loss function in the EAST algorithm with a dice loss function;
the processing of the improved candidate region specifically includes: and sorting all the candidate frames according to the value of the coordinates of the upper left corner of the candidate frames by adopting a local perception NMS algorithm based on pre-sorting.
Further, the improved tag generation specifically comprises the following steps:
reducing a quadrangle formed by a character area of the target training image in a mode of respectively treating long and short sides;
the short side of the quadrangle is reduced by 0.3 time on the basis of the length of the short side of the original label remark;
the length of the long side of the quadrangle is reduced by a scrub _ rate multiple on the basis of the length of the long side of the original label remark:
wherein the scrub _ rate represents the reduction factor of the long side, lengthsLength, representing the length of the short side of the quadrilaterallIndicating the length of the long side of the quadrilateral.
Further, the dice loss function is determined by the following method:
L=Ls+λgLg=Ls+λg(LIoU+λθLθ) (2)
wherein L represents the modified dice loss function, LsRepresents the score map loss, LgDenotes the geometric loss, λgCoefficient representing geometric loss, LIoUDenotes the loss of overlap area, λθCoefficient representing angular loss, LθIndicating the loss of angle.
Further, the processing of the improved candidate region specifically includes a pre-ranking based locally-aware NMS algorithm, and the pre-ranking based locally-aware NMS algorithm specifically includes the following steps:
s1: initializing a set box, an overlapping threshold lambda and a catastrophe threshold omega of all candidate rectangular frames;
s2: extracting coordinates (x) of upper left corner of all rectangular boxes in boxes1,y1) According to x respectively1And y1Arranging in sequence from small to large, respectively calculating and comparing x according to omega1And y1The blocks are arranged according to the order of the rapid change times from small to large;
s3: setting s as an empty set, and setting p as an empty set, wherein for g belonging to boxes, g represents elements in a set box of all candidate rectangular frames, namely the candidate rectangular frames, and the steps are sequentially performed according to the order of the transition times of all the rectangular frames in the boxes from small to large: if p is not empty and the overlapping area ratio of p and g is larger than lambda, p is a rectangular frame formed by combining p and g; if p is not empty and the ratio of the overlapping areas of p and g is not more than lambda, putting p into s; if p is empty, then p is g;
s4: if p is not empty, put p into s;
s5: and outputting a set S, wherein the S represents the filtered rectangular frame set.
Further, the processing of improving the candidate region further includes candidate frame merging, where the candidate frame merging is determined by the following method:
S(a)=ηgS(g)+ηpS(p) (3)
ai=ηgS(g)gi+ηpS(p)pi (4)
where a is a merged rectangular frame, g and p denote rectangular frame candidates satisfying an overlap condition, i denotes a parameter index in the rectangular frame, i is 1,2,3 … 9, 8 coordinates including 4 vertices, 1 angular value, S denotes a score of the rectangular frame, and ηgArea coefficient, η, representing candidate rectangular box gpAn area coefficient representing a candidate rectangular frame p;
the area coefficient etagAnd ηpThe following method is adopted for setting:
judging whether the area of the candidate rectangular frame g is larger than that of the candidate rectangular frame p, if so, etag=1.1,ηpIf not, ηg=1,ηp=1.1。
The invention has the beneficial technical effects that: according to the method and the device, on the basis of the existing EAST algorithm, the network structure, the label generation, the loss function and the candidate area post-processing are optimized, the improved deep learning network can effectively enlarge the experience visual field, the recognition efficiency of long texts is improved, and the accurate positioning of part code spraying characters is realized.
Drawings
The invention is further described below with reference to the following figures and examples:
fig. 1 is a schematic diagram of a network structure of an improved EAST algorithm proposed by the present invention.
FIG. 2 is a schematic diagram of the tag generation process of the present invention.
FIG. 3 is a comparison graph of the detection effect of the present invention and the detection effect of the original algorithm.
Detailed Description
The invention is further described with reference to the accompanying drawings in which:
the invention provides a part code spraying character positioning method based on improved EAST, which is characterized by comprising the following steps: the method is based on the existing EAST network structure, improves the network structure, label generation, loss function and candidate area processing of the existing EAST network, obtains the improved network structure, the improved label generation, the improved loss function and candidate area post-processing, and uses the improved EAST network to position characters;
the network structure for improving the existing EAST network specifically comprises:
constructing a VGG16 model, wherein the VGG16 model comprises 5 convolutional layers and 5 pooling layers, the size of a convolutional kernel is 3 multiplied by 3, and the step length is 1;
improving a VGG16 model, and replacing the convolution operation of the last stage in the VGG16 network structure with the mixed hole convolution HDC with a hole coefficient of [1,2,5 ];
the improved tag generation specifically comprises: reducing a quadrangle formed by a character area of the target training image by adopting a mode of differential treatment of long and short sides;
the improvement loss function specifically includes: replacing a balance cross entropy loss function in the EAST algorithm with a dice loss function;
the processing of the improved candidate region specifically includes: and sorting all the candidate frames according to the value of the coordinates of the upper left corner of the candidate frames by adopting a local perception NMS algorithm based on pre-sorting.
In the convolutional neural network, the size of the receptive field is determined by parameters such as convolution kernel and convolution step. In the standard VGG16 model, the expansion of the receptive field is achieved by the superposition and pooling of convolutional layers. However, this method of enlarging the field is limited by the number of layers of convolution, so the effect is very limited, enlarging the size of the convolution kernel will result in too many parameters, and increasing the field through the pooling layer will result in information loss. These factors limit the size of the VGG16 network receptive field. The hole convolution is a method that the size of a convolution kernel is not changed, but a plurality of holes are added to enlarge the whole convolution matrix and expand a convolution area, so that parameters are not increased and the receptive field is enlarged, but a grid effect and a small target cannot be detected. However, a hybrid hole convolution scheme is proposed, and it is considered that the above-mentioned problems can be avoided as long as the convolution coefficient setting in the process of building the neural network meets the following three requirements, according to the idea, the convolution operation (Conv _5) in the last stage of the VGG16 is replaced, the original standard convolution for 3 consecutive times is replaced by the hybrid hole convolution (HDC) with the hole coefficient of [1,2,5], and the original 7 × 7 receptive field of the part is expanded to 17 × 17, so that the VGG16 network with a larger receptive field is obtained. The new network structure is shown in fig. 1, and the dark blue part is an improved part of the VGG16 network, where maxporoling represents maximum pooling operation, Up Sampling represents upsampling operation, and Concat represents feature map splicing.
According to the technical scheme, the network structure, the label generation, the loss function and the candidate area post-processing are optimized on the basis of the conventional EAST algorithm, the improved deep learning network can effectively enlarge the perception field of view, the recognition efficiency of long texts is improved, and the accurate positioning of part code spraying characters is realized.
In this embodiment, the improved tag generation specifically includes the following steps:
reducing a quadrangle formed by a character area of the target training image in a mode of respectively treating long and short sides;
the short side of the quadrangle is reduced by 0.3 time on the basis of the length of the short side of the original label remark;
the length of the long side of the quadrangle is reduced by a scrub _ rate multiple on the basis of the length of the long side of the original label remark:
wherein the scrub _ rate represents the reduction factor of the long side, lengthsLength, representing the length of the short side of the quadrilaterallIndicating the length of the long side of the quadrilateral.
In the network training process, the character area labels of all training images are quadrangles, and the coordinates of four points are expressed by (x)1,y1,x2,y2,x3y3,x4,y4) Is shown in the format of (1). x and y represent the abscissa and ordinate of the point, respectively. The subscripts "1", "2", "3", "4" indicate the points of the upper left, upper right, lower right and lower left corners, respectively, i.e., the clockwise order notation. In EAST algorithm training, the RBOX form is adopted to generate a training label, namely 5 channels { d }1,d2,d3d4θ, where d isiThe distances from points within the training labels to the four sides of the labels (text boxes in fig. 1) are shown, and θ represents the rotation angle (angle in fig. 1). There are 1 more channels to record the likelihood score (score map in fig. 1) that each point in the image is a character point. In order to reduce interference caused by label labeling errors, a score map (score map) in an EAST algorithm is reduced by 0.3 times on the basis of the minimum bounding rectangle of the original label labeling. The whole label generation process is shown in fig. 2. The method for reducing the label rectangle can reduce the interference caused by the character edge area and improve the character positioning accuracy. However, as is apparent from fig. 2, since the long-side base of the character region is large, the reduction by a factor of 0.3 may result in a very large area of the character region not entering the score map. Such training is very poor for long character positioning. Aiming at the problem, the label training of the EAST algorithm is improved, the long and short sides are treated in a different mode, the short sides of the label rectangle are still reduced by 0.3 time, and the long sides of the label rectangle are longThe reduction factor of the sides is 0.3 times the ratio of the length of the short side of the rectangle to the length of the long side of the rectangle.
In this embodiment, the dice loss function is determined by the following method:
L=Ls+λgLg=Ls+λg(LIoU+λθLθ) (2)
wherein L represents the modified dice loss function, LsRepresents the score map loss, LgDenotes the geometric loss, λgCoefficient representing geometric loss, LIoUDenotes the loss of overlap area, λθCoefficient representing angular loss, LθIndicating the loss of angle.
For neural network training, a loss function is an optimization target of the whole network training, and the quality of the loss function setting directly influences the training result of the network and the detection effect of final parameters. For the FCN network proposed by the present invention, the loss function L is mainly composed of the score map loss LsAnd geometric loss LgTwo-part, in which the geometric loss LgAgain, the loss of L from the overlap areaIoUAnd angle loss LθAnd (4) forming. Loss of L for score mapsThe method is essentially a two-classification loss function, namely, a character region is a positive sample, other regions are negative samples, the positive and negative samples are quite unbalanced, and the area of the character region is far smaller than that of the other regions. The EAST algorithm adopts a balance cross entropy loss function to solve the sample imbalance problem, and recent researches find that the dice loss has a better performance in dealing with the sample imbalance problem. Therefore, the method adopts dice loss to express the score map loss, and can better deal with the sample imbalance problem compared with the EAST algorithm.
In this embodiment, the processing of the improved candidate region specifically includes a locally aware NMS algorithm based on pre-ranking, and the steps of the locally aware NMS algorithm based on pre-ranking are as follows:
s1: initializing a set box, an overlapping threshold lambda and a catastrophe threshold omega of all candidate rectangular frames;
s2: extracting in boxesAll the coordinates (x) of the upper left corner of the rectangular frame1,y1) According to x respectively1And y1Arranging in sequence from small to large, respectively calculating and comparing x according to omega1And y1The blocks are arranged according to the order of the rapid change times from small to large;
s3: setting S as an empty set, wherein p is an empty set, and for g belonging to boxes, wherein g represents elements in a set box of all candidate rectangular frames, namely the candidate rectangular frames, sequentially executing the following steps in the order from small to large according to the transition times of all the rectangular frames in the box: if p is not empty and the overlapping area ratio of p and g is larger than lambda, p is a rectangular frame formed by combining p and g; if p is not empty and the ratio of the overlapping areas of p and g is not more than lambda, putting p into S; if p is empty, then p is g;
s5: and outputting a set S, wherein the S represents the filtered rectangular frame set.
In general, when character region detection is performed using the network, there may be hundreds or even thousands of candidate boxes extracted by FCN. Most of them are overlapped, so that the combination of the candidate boxes with large number is the last step of the whole character area positioning algorithm. The former subsequent block merging often adopts non-maximum suppression, namely NMS algorithm, but the time complexity of the algorithm is O (n)2) And is very time-consuming. For this case, the EAST algorithm employs a locally aware NMS algorithm. The algorithm only compares two adjacent candidate areas in sequence, and if the overlapping degree is greater than a threshold value, the two candidate areas are combined, and if the overlapping degree is smaller than the threshold value, the two candidate areas are kept. The time complexity depends on the arrangement of the candidate regions, the best case is O (n) (i.e. the candidate frames that should be merged are arranged together), and the worst case is O (n)2) (alternate candidate frames that do not overlap with each other). Meanwhile, for two candidate frames meeting the overlapping degree condition, one candidate frame is not discarded and the other candidate frame is left according to the standard NMS, and the two candidate frames are merged according to the scoring conditions of the two candidate frames.
It can be seen that the temporal complexity of the locally-aware NMS algorithm depends on the random candidate arrangement. Aiming at the whole problem, the invention improves the local perception NMS algorithm, provides the local perception NMS algorithm based on pre-sorting, and sorts all the candidate frames according to the value of the coordinate at the upper left corner of the candidate frames, so that all the candidate frames which are possibly merged can be arranged together, and the effect that the time complexity is O (n) as much as possible is achieved. Meanwhile, arranging the candidate frames which are possibly merged together is also beneficial to improving the merging precision, so that the final text positioning effect is improved.
In this embodiment, the processing of improving the candidate region further includes candidate frame merging, where the candidate frame merging is determined by the following method:
S(a)=ηgS(g)+ηpS(p) (3)
ai=ηgS(g)gi+ηpS(p)pi (4)
where a is a merged rectangular frame, g and p denote rectangular frame candidates satisfying an overlap condition, i denotes a parameter index in the rectangular frame, i is 1,2,3 … 9, 8 coordinates including 4 vertices, 1 angular value, S denotes a score of the rectangular frame, and ηgArea coefficient, η, representing candidate rectangular box gpAn area coefficient representing a candidate rectangular frame p;
the area coefficient etagAnd ηpThe following method is adopted for setting:
judging whether the area of the candidate rectangular frame g is larger than that of the candidate rectangular frame p, if so, etag=1.1,ηpIf not, ηg=1,ηp=1.1。
In addition, in the original local perception NMS algorithm, when merging the candidate rectangular boxes, only the score condition is considered, and for long texts, the area information of the candidate boxes is also important, and the probability that the candidate box with a larger area covers the whole text area is higher. Therefore, in the candidate box merging, η of the area coefficients η is introduced, and η of the larger area of g and p is set to 1.05, and the other is 1.
According to the implementation of the improved character positioning algorithm based on EAST, the detection effect before and after the improvement of the algorithm is compared, as shown in FIG. 3. It is obvious that the defect of the original EAST algorithm for the detection of long text is that the rightmost rectangle of the first line of characters does not cover all the character areas, and the last "1" of the second line of characters is only covered by half. The optimized algorithm of the invention is used for detecting the long text, so that the whole character area is well covered, and the condition that the head and the tail of the character area are missed does not occur.
Finally, the above embodiments are only for illustrating the technical solutions of the present invention and not for limiting, although the present invention has been described in detail with reference to the preferred embodiments, it should be understood by those skilled in the art that modifications or equivalent substitutions may be made to the technical solutions of the present invention without departing from the spirit and scope of the technical solutions of the present invention, and all of them should be covered in the claims of the present invention.
Claims (5)
1. A method for positioning part code spraying characters based on improved EAST is characterized in that: the method is based on the existing EAST network structure, improves the network structure, label generation, loss function and candidate area processing of the existing EAST network, obtains the improved network structure, the improved label generation, the improved loss function and candidate area post-processing, and uses the improved EAST network to position characters;
the network structure for improving the existing EAST network specifically comprises:
constructing a VGG16 model, wherein the VGG16 model comprises 5 convolutional layers and 5 pooling layers, the size of a convolutional kernel is 3 multiplied by 3, and the step length is 1;
improving a VGG16 model, and replacing the convolution operation of the last stage in the VGG16 network structure with the mixed hole convolution HDC with a hole coefficient of [1,2,5 ];
the improved tag generation specifically comprises: reducing a quadrangle formed by a character area of the target training image by adopting a mode of differential treatment of long and short sides;
the improvement loss function specifically includes: replacing a balance cross entropy loss function in the EAST algorithm with a diceloss loss function;
the processing of the improved candidate region specifically includes: and sorting all the candidate frames according to the value of the coordinates of the upper left corner of the candidate frames by adopting a local perception NMS algorithm based on pre-sorting.
2. The method for improved EAST-based part inkjet character positioning as recited in claim 1, wherein: the improved label generation specifically comprises the following steps:
reducing a quadrangle formed by a character area of the target training image in a mode of respectively treating long and short sides;
the short side of the quadrangle is reduced by 0.3 time on the basis of the length of the short side of the original label remark;
the length of the long side of the quadrangle is reduced by a scrub _ rate multiple on the basis of the length of the long side of the original label remark:
wherein the scrub _ rate represents the reduction factor of the long side, lengthsLength, representing the length of the short side of the quadrilaterallIndicating the length of the long side of the quadrilateral.
3. The method for improved EAST-based part inkjet character positioning as recited in claim 1, wherein: the dice loss function is determined by the following method:
L=Ls+λgLg=Ls+λg(LIoU+λθLθ) (2)
wherein L represents the modified dice loss function, LsRepresents the score map loss, LgDenotes the geometric loss, λgCoefficient representing geometric loss, LIoUDenotes the loss of overlap area, λθCoefficient representing angular loss, LθIndicating the loss of angle.
4. The method for improved EAST-based part inkjet character positioning as claimed in claim 3, wherein: the processing of the improved candidate region specifically includes a pre-ranking based locally-aware NMS algorithm, which specifically includes the following steps:
s1: initializing a set box, an overlapping threshold lambda and a catastrophe threshold omega of all candidate rectangular frames;
s2: extracting coordinates (x) of upper left corner of all rectangular boxes in boxes1,y1) According to x respectively1And y1Arranging in sequence from small to large, respectively calculating and comparing x according to omega1And y1The blocks are arranged according to the order of the rapid change times from small to large;
s3: setting S as an empty set, wherein p is an empty set, and for g belonging to boxes, wherein g represents elements in a set box of all candidate rectangular frames, namely the candidate rectangular frames, sequentially executing the following steps in the order from small to large according to the transition times of all the rectangular frames in the box: if p is not empty and the overlapping area ratio of p and g is larger than lambda, p is a rectangular frame formed by combining p and g; if p is not empty and the ratio of the overlapping areas of p and g is not more than lambda, putting p into S; if p is empty, then p is g;
s4: if p is not empty, putting p into S;
s5: and outputting a set S, wherein the S represents the filtered rectangular frame set.
5. The method for improved EAST-based part inkjet character positioning as recited in claim 4, wherein: the processing for improving the candidate region further comprises candidate frame merging, wherein the candidate frame merging is determined by adopting the following method:
S(a)=ηgS(g)+ηpS(p) (3)
ai=ηgS(g)gi+ηpS(p)pi (4)
where a is a merged rectangular frame, g and p denote rectangular frame candidates satisfying an overlap condition, i denotes a parameter index in the rectangular frame, i is 1,2,3 … 9, 8 coordinates including 4 vertices, 1 angular value, S denotes a score of the rectangular frame, and ηgArea coefficient, η, representing candidate rectangular box gpAn area coefficient representing a candidate rectangular frame p;
the area coefficient etagAnd ηpThe following method is adopted for setting:
judging whether the area of the candidate rectangular frame g is larger than that of the candidate rectangular frame p, if so, etag=1.1,ηpIf not, ηg=1,ηp=1.1。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011163480.5A CN112364726B (en) | 2020-10-27 | 2020-10-27 | Part code-spraying character positioning method based on improved EAST |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011163480.5A CN112364726B (en) | 2020-10-27 | 2020-10-27 | Part code-spraying character positioning method based on improved EAST |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112364726A true CN112364726A (en) | 2021-02-12 |
CN112364726B CN112364726B (en) | 2024-06-04 |
Family
ID=74510665
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011163480.5A Active CN112364726B (en) | 2020-10-27 | 2020-10-27 | Part code-spraying character positioning method based on improved EAST |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112364726B (en) |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1997017988A1 (en) * | 1995-11-13 | 1997-05-22 | Fibrogen, Inc. | Type ix collagen and chimeras |
US7143153B1 (en) * | 2000-11-09 | 2006-11-28 | Ciena Corporation | Internal network device dynamic health monitoring |
US20160191194A1 (en) * | 2014-12-29 | 2016-06-30 | Juniper Networks, Inc. | Network topology optimization with feasible optical paths |
CN107133616A (en) * | 2017-04-02 | 2017-09-05 | 南京汇川图像视觉技术有限公司 | A kind of non-division character locating and recognition methods based on deep learning |
US20180359029A1 (en) * | 2017-01-18 | 2018-12-13 | Ciena Corporation | Distributed measurements and analysis in networks |
CN110287960A (en) * | 2019-07-02 | 2019-09-27 | 中国科学院信息工程研究所 | The detection recognition method of curve text in natural scene image |
CN111310760A (en) * | 2020-02-13 | 2020-06-19 | 辽宁师范大学 | Method for detecting onychomycosis characters by combining local prior characteristics and depth convolution characteristics |
WO2020164281A1 (en) * | 2019-02-13 | 2020-08-20 | 平安科技(深圳)有限公司 | Form parsing method based on character location and recognition, and medium and computer device |
CN111583180A (en) * | 2020-04-03 | 2020-08-25 | 苏宁云计算有限公司 | Image tampering identification method and device, computer equipment and storage medium |
-
2020
- 2020-10-27 CN CN202011163480.5A patent/CN112364726B/en active Active
Patent Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1997017988A1 (en) * | 1995-11-13 | 1997-05-22 | Fibrogen, Inc. | Type ix collagen and chimeras |
US7143153B1 (en) * | 2000-11-09 | 2006-11-28 | Ciena Corporation | Internal network device dynamic health monitoring |
US20160191194A1 (en) * | 2014-12-29 | 2016-06-30 | Juniper Networks, Inc. | Network topology optimization with feasible optical paths |
US20180359029A1 (en) * | 2017-01-18 | 2018-12-13 | Ciena Corporation | Distributed measurements and analysis in networks |
CN107133616A (en) * | 2017-04-02 | 2017-09-05 | 南京汇川图像视觉技术有限公司 | A kind of non-division character locating and recognition methods based on deep learning |
WO2020164281A1 (en) * | 2019-02-13 | 2020-08-20 | 平安科技(深圳)有限公司 | Form parsing method based on character location and recognition, and medium and computer device |
CN110287960A (en) * | 2019-07-02 | 2019-09-27 | 中国科学院信息工程研究所 | The detection recognition method of curve text in natural scene image |
CN111310760A (en) * | 2020-02-13 | 2020-06-19 | 辽宁师范大学 | Method for detecting onychomycosis characters by combining local prior characteristics and depth convolution characteristics |
CN111583180A (en) * | 2020-04-03 | 2020-08-25 | 苏宁云计算有限公司 | Image tampering identification method and device, computer equipment and storage medium |
Non-Patent Citations (3)
Title |
---|
LEW, S: "Environmental controls on the abundance of methanotrophs and methanogens in peat bog lakes", SCIENCE OF THE TOTAL ENVIRONMENT, pages 1201 - 1211 * |
唐倩文;陈良育;: "基于复杂网络理论的Java开源系统演化分析", 计算机科学, no. 08, pages 173 - 180 * |
李代杨: "基于深度神经网络的汽车刹车片喷码检测识别技术研究", 万方硕士学位论文, pages 1 - 7 * |
Also Published As
Publication number | Publication date |
---|---|
CN112364726B (en) | 2024-06-04 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110032998B (en) | Method, system, device and storage medium for detecting characters of natural scene picture | |
CN110766008A (en) | Text detection method facing any direction and shape | |
CN108334881B (en) | License plate recognition method based on deep learning | |
CN110738207A (en) | character detection method for fusing character area edge information in character image | |
CN110334709B (en) | License plate detection method based on end-to-end multi-task deep learning | |
CN109886174A (en) | A kind of natural scene character recognition method of warehouse shelf Sign Board Text region | |
CN113435240A (en) | End-to-end table detection and structure identification method and system | |
CN104615911B (en) | Method based on sparse coding and chain study prediction memebrane protein beta barrel trans-membrane regions | |
CN110880176B (en) | Semi-supervised industrial image defect segmentation method based on countermeasure generation network | |
CN109948533B (en) | Text detection method, device and equipment and readable storage medium | |
CN115457565A (en) | OCR character recognition method, electronic equipment and storage medium | |
CN111027538A (en) | Container detection method based on instance segmentation model | |
CN110598698A (en) | Natural scene text detection method and system based on adaptive regional suggestion network | |
CN112330699B (en) | Three-dimensional point cloud segmentation method based on overlapping region alignment | |
CN110852327A (en) | Image processing method, image processing device, electronic equipment and storage medium | |
CN110008900A (en) | A kind of visible remote sensing image candidate target extracting method by region to target | |
CN114266794A (en) | Pathological section image cancer region segmentation system based on full convolution neural network | |
CN111553351A (en) | Semantic segmentation based text detection method for arbitrary scene shape | |
CN117893481A (en) | Hybrid wafer defect identification and segmentation method based on self-supervision contrast learning | |
CN111274863A (en) | Text prediction method based on text peak probability density | |
CN114693966A (en) | Target detection method based on deep learning | |
CN114155540A (en) | Character recognition method, device and equipment based on deep learning and storage medium | |
CN111832497B (en) | Text detection post-processing method based on geometric features | |
CN112418106A (en) | Ship detection method based on dense key point guidance | |
CN113780040A (en) | Lip key point positioning method and device, storage medium and electronic equipment |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |