CN106709489A - Processing method and device of character identification - Google Patents

Processing method and device of character identification Download PDF

Info

Publication number
CN106709489A
CN106709489A CN201510410166.5A CN201510410166A CN106709489A CN 106709489 A CN106709489 A CN 106709489A CN 201510410166 A CN201510410166 A CN 201510410166A CN 106709489 A CN106709489 A CN 106709489A
Authority
CN
China
Prior art keywords
character
page
row
altitude range
current
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201510410166.5A
Other languages
Chinese (zh)
Other versions
CN106709489B (en
Inventor
周龙沙
王红法
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tencent Technology Shenzhen Co Ltd
Original Assignee
Tencent Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tencent Technology Shenzhen Co Ltd filed Critical Tencent Technology Shenzhen Co Ltd
Priority to CN201510410166.5A priority Critical patent/CN106709489B/en
Publication of CN106709489A publication Critical patent/CN106709489A/en
Application granted granted Critical
Publication of CN106709489B publication Critical patent/CN106709489B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Character Input (AREA)

Abstract

The invention discloses a processing method and device of character identification, and the method and device are used to improve the character identification effect. The method comprises that characters in a page belong to rows in the page according to height information of the characters in the page, the characters included by each row of the page are obtained, and the height information of each character in the page comprises a ordinate of the character in the page as well as a height range of the character; row correction is carried out on the characters included by each row in the page according to overlap information of the characters in the height range, and row-corrected characters included by each row in the page are obtained; and a semantic analysis model is used to carry out semantic correction on the row-corrected characters included by each row in the page.

Description

A kind for the treatment of method and apparatus of character recognition
Technical field
The present invention relates to field of computer technology, more particularly to a kind for the treatment of method and apparatus of character recognition.
Background technology
Character segmentation and character recognition are optical character identification (English full name:Optical Character Recognition, english abbreviation:OCR) most important two aspects, the direct shadow of this two parts in technology The effect and result to character recognition are rung, needs the character that will have split to carry out line direction in OCR technique On sequence after be presented to user, therefore the line direction ordering techniques of character can be directly influenced and be presented to use The recognition effect of family viewing.But currently for OCR branch's technology be mainly based upon segmentation after intercharacter Away to character carrying out simple branch.
Enter the merging and fractionation of line character according to the character pitch after segmentation in the prior art, when not apposition When occurring after the character of formula carries out typesetting, situations such as the every line character for photographing has very big inclination in the page, There is larger error to the character recognition on the page, and semanteme is carried out in later use recognition result It also is difficult to reach accuracy very high during analysis.In addition, being in the prior art according to character to character branch What spacing was realized, but with environmental change when character block combination is embarked on journey, there is a strong possibility can by other words Symbol is influenceed, so as to final given recognition effect can be influenceed.
The content of the invention
A kind for the treatment of method and apparatus of character recognition are the embodiment of the invention provides, is known for improving character Other recognition effect.
In order to solve the above technical problems, the embodiment of the present invention provides following technical scheme:
In a first aspect, the embodiment of the present invention provides a kind of processing method of character recognition, including:
Multiple characters on the page are belonged to the page by the elevation information according to character on the page On multiple rows on, obtain multiple characters that the every a line on the page includes, the character is in the page On elevation information include:The altitude range of character ordinate on the page and the character;
According to the overlay information between character on the page on altitude range to each on the page Multiple characters that row includes enter every trade correction, after obtaining the row correction that the every a line on the page includes Multiple characters;
Entered using the multiple characters after the row correction that semantic analysis model includes to the every a line on the page Row missed suppression.
Second aspect, the embodiment of the present invention also provides a kind of processing unit of character recognition, including:
Row splits module, for the elevation information according to character on the page by the multiple words on the page Symbol is belonged on the multiple rows on the page, obtains multiple characters that the every a line on the page includes, Elevation information of the character on the page includes:Character ordinate on the page and the character Altitude range;
Row correction module, for according to the overlay information pair between character on the page on altitude range Multiple characters that each row on the page includes enter every trade correction, obtain the every a line on the page Including row correction after multiple characters;
Missed suppression module, for the row included to the every a line on the page using semantic analysis model Multiple characters after correction carry out missed suppression.
As can be seen from the above technical solutions, the embodiment of the present invention has advantages below:
In embodiments of the present invention, the elevation information first according to character on the page is by the multiple on the page Character is belonged on the multiple rows on the page, obtains multiple characters that the every a line on the page includes, character Elevation information on the page includes:The altitude range of ordinate of the character on the page and the character, Next each row on the page is included according to the overlay information between character on the page on altitude range Multiple characters enter every trade correction, obtain the multiple characters after the row that every a line on the page includes is corrected, Multiple characters after the row for finally being included to the every a line on the page using semantic analysis model is corrected carry out language Justice correction.The all characters on the page are belonged to using elevation information of the character on the page in the present invention Multiple rows, because the altitude range of the ordinate and character of the character on the page of same a line is all relatively solid It is fixed, thus according to character elevation information all characters on the page are carried out branch to branch's result be Accurately, and in the present invention can also be according to the overlay information between character on the page on altitude range Multiple characters that each row includes are entered with every trade correction, therefore can be detected to difference by overlay information The row that character should belong in the character typesetting of form, the original of shooting is may also detect that by overlay information Inclined character is there may be in the beginning page, in addition can also be using semantic analysis model to word in the present invention Symbol carry out missed suppression, therefore character is entered every trade correction and missed suppression can change because of character typesetting There is identification mistake caused by inclined character in form difference and shooting parent page, improve character knowledge Other recognition effect.
Brief description of the drawings
Technical scheme in order to illustrate more clearly the embodiments of the present invention, in being described to embodiment below The required accompanying drawing for using is briefly described, it should be apparent that, drawings in the following description are only this Some embodiments of invention, to those skilled in the art, can also obtain according to these accompanying drawings Other accompanying drawings.
Fig. 1 is a kind of process blocks schematic diagram of the processing method of character recognition provided in an embodiment of the present invention;
Fig. 2-a are that a kind of processing method to character recognition provided in an embodiment of the present invention realizes that scene is shown It is intended to;
Fig. 2-b are a kind of implementation schematic diagram of the row attribute of correction character provided in an embodiment of the present invention;
Fig. 2-c be use semantic analysis model provided in an embodiment of the present invention carry out before missed suppression one Plant content of pages schematic diagram;
Fig. 2-d be use semantic analysis model provided in an embodiment of the present invention carry out after missed suppression one Plant content of pages schematic diagram;
Fig. 3-a are a kind of composition structural representation of the processing unit of character recognition provided in an embodiment of the present invention Figure;
Fig. 3-b are the composition structural representation that a kind of row provided in an embodiment of the present invention splits module;
Fig. 3-c are a kind of composition structural representation of row correction module provided in an embodiment of the present invention;
Fig. 3-d are that the composition structure of the processing unit of another character recognition provided in an embodiment of the present invention is shown It is intended to;
Fig. 3-e are that the composition structure of the processing unit of another character recognition provided in an embodiment of the present invention is shown It is intended to;
Fig. 4 is that the processing method of character recognition provided in an embodiment of the present invention is applied to the composition knot of server Structure schematic diagram.
Specific embodiment
A kind for the treatment of method and apparatus of character recognition are the embodiment of the invention provides, is known for improving character Other recognition effect.
To enable that goal of the invention of the invention, feature, advantage are more obvious and understandable, below will With reference to the accompanying drawing in the embodiment of the present invention, the technical scheme in the embodiment of the present invention is carried out clear, complete Ground description, it is clear that the embodiments described below are only a part of embodiment of the invention, and not all Embodiment.Based on the embodiment in the present invention, the every other implementation that those skilled in the art is obtained Example, belongs to the scope of protection of the invention.
Term " comprising " and " having " in description and claims of this specification and above-mentioned accompanying drawing And their any deformation, it is intended that covering is non-exclusive to be included, so as to comprising a series of units Process, method, system, product or equipment are not necessarily limited to those units, but may include without clearly It is listing or for these processes, method, product or other intrinsic units of equipment.
It is described in detail individually below.
One embodiment of the processing method of character recognition of the present invention, in specifically can apply to COR technologies, Refer to shown in Fig. 1, the processing method of the character recognition that one embodiment of the invention is provided can include Following steps:
101st, the elevation information according to character on the page belongs on the page the multiple characters on the page On multiple rows, multiple characters that the every a line on the page includes are obtained.
Wherein, elevation information of the character on the page includes:Ordinate of the character on the page and the word The altitude range of symbol.
In embodiments of the present invention, the character on the page can be partitioned into from the page by OCR technique Single character and the character obtained after each character is identified, each character on the page is got first Elevation information, wherein, the elevation information of character refers to:Ordinate of the character on the page and the word The altitude range of symbol, the elevation information by character on the page can accurately determine character in the page Fixed coordinates on upper ordinate direction, and minimum point coordinates and highest of the character on ordinate direction Point coordinates, wherein, the difference between highest point coordinates and minimum point coordinates on ordinate direction is word The altitude range of symbol.For example, can using the top left corner apex of current page as the origin of coordinates, with Top left corner apex is as a reference point, measures the top left corner apex of the character on ordinate direction to the coordinate The distance of origin as character ordinate, in addition the top left corner apex of character be character in ordinate direction On highest point coordinates, the lower-left angular vertex of character is minimum point coordinates of the character on ordinate direction, Go out the altitude range of character by the mathematic interpolation between highest point coordinates and minimum point coordinates, it is also possible to claim Be height size.
In the embodiment of the present invention, the elevation information of all characters on page-out is calculated, then according to the page The elevation information of upper character enters the fractionation of every trade ownership to the multiple characters on the page, i.e., according to the height of character Degree information belongs to the multiple characters on the page on the multiple rows on the page, and character is used with prior art Spacing enter every trade and divide different, the row ownership determination method in the embodiment of the present invention to character is used The height model of the elevation information of the page, the i.e. ordinate using character on the page and the character where character Enclose to determine the ownership of row, because the height model of the ordinate and character with the character of a line on the page Enclose and be all relatively fixed, therefore elevation information according to character carries out to all characters on the page branch and arrives Branch's result is accurate.Further, in some embodiments of the invention, step 101 is according to word The elevation information accorded with the page belongs to the multiple characters on the page on the multiple rows on the page, obtains Multiple characters that every a line on the page includes, specifically may include steps of:
A1, from the multiple characters on the page arbitrarily selection one character as current character, according to current Elevation information of the character on the page calculates ordinate of the central point of current character on the page;
A2, judge the central point of current character whether in the altitude range of the previous character of current character, If the central point of current character is in the altitude range of the previous character of current character, current character and Previous character belongs to same row, if the central point of current character is not in the previous character of current character Altitude range in, then current character is belonging respectively to two different rows with previous character.
Wherein, the determination methods that a character enters every trade ownership are given in step A1 to A2, for step The row ownership judgment mode of rapid A1 to A2 is applied to the row ownership determination process to each character on the page, Wherein, using an optional character in the multiple characters on the page as current character, step A1 and In step A2 so that the row ownership to current character judges as an example, all characters in the page can be used To the row ownership judgment mode of current character.Wherein, the central point of current character refers to character in vertical seat The median of altitude range on mark direction, for example, the altitude range of current character is (y1, y2), then word Ordinate of the central point of symbol on ordinate direction is (y1+y2)/2, by the ordinate of current character (y1+y2The altitude range of the previous character of)/2 and current character is judged, such that it is able to judge to work as The row ownership of the previous character of preceding character and current character.
It should be noted that in some embodiments of the invention, step 101 is according to character on the page Elevation information the multiple characters on the page are belonged on the page multiple rows on before, the present invention is implemented The processing method of the character recognition that example is provided can also comprise the following steps:
The multiple original characters on multiple symbolic blocks identification page-out that B1, basis are partitioned into from the page;
B2, all originals according to the altitude range and width range of each original character on the page from the page Excessive character is weeded out in beginning character or small characters are crossed, the multiple characters on the page are obtained.
Wherein, the original character on the page can be partitioned into single character from the page by OCR technique And the character obtained after each character is identified, the height of each original character on the page is got first Degree information and width information, wherein, the height of the character of elevation information and the foregoing teachings description of original character Degree information is similar, and the width information of original character refers to:Abscissa of the original character on the page and The width range of the original character, elevation information and width information by original character on the page can be with Accurately determine original character on the fixed coordinates and abscissa direction on ordinate direction on the page Fixed coordinates, and altitude range and width range of the original character on ordinate direction.Other step Excessive character described in B2 refers to the word more than certain numerical value on altitude range and/or width range Symbol, the mistake small characters described in step B2 are referred on altitude range and/or width range less than certain The character of numerical value, excessive character is weeded out from all original characters on the page or small characters are crossed, can be with Obtain the multiple characters on the page described in step 101.
Further, altitude ranges and width range of the above-mentioned steps B2 according to each original character on the page Excessive character is weeded out from all original characters on the page or small characters are crossed, the multiple on the page is obtained Character, specifically may include steps of:
B21, calculate the flat of the page respectively according to the altitude range and width range of each original character on the page Equal character height and average character duration;
B22, altitude range is weeded out from all original characters on the page according to average character height it is more than The original of original character or altitude range less than M times of average character height of N times of average character height Beginning character, and from all original characters on the page to weed out width range according to average character duration big In M times of N times of average character duration of original character or width range less than average character duration Original character, completes to obtain the character on the page after rejecting, and N is the numerical value more than 1, and M is more than 0 Numerical value less than 1.
Wherein, the average character height of the page refers to the height of all original characters on the page in step B21 Average value, the average character duration of the page refers to the average value of the width of all original characters on the page, The value of N can be average to need to weed out altitude range in 1.5, i.e. step B22 in step B22 The original character of 1.5 times of character height, it is also desirable to weed out that width range is average character duration 1.5 Times original character, the value of M can be to need to weed out in 0.2, i.e. step B22 in step B22 Altitude range is the original character of 0.2 times of average character height, it is also desirable to which it is average to weed out width range The original character of 0.2 times of character duration, rejects from all original characters on the page in the manner described above Fall excessive character or cross small characters, obtain the character on the page described in step 101, wherein N and M Specific value can be not limited to it is foregoing for example, can be combined with specific application scenarios determine N, The specific value of M.
In some embodiments of the invention, elevation information of the step 101 according to character on the page is by page Multiple characters on face are belonged on the multiple rows on the page, obtain the multiple that the every a line on the page includes After character, the processing method of character recognition provided in an embodiment of the present invention can also comprise the following steps:
C1, according between two neighboring character in the every a line on the page character pitch calculate the page on Character field segmentation distance per a line;
C2, according to character field split distance multiple characters that the every a line on the page includes are segmented, Obtain multiple character fields that the every a line on the page includes.
Wherein, in step C1, the width information of each character on the page, the width of character are got first Information refers to:The width range of abscissa of the character on the page and the character, by character in the page On width information can accurately determine fixed coordinates of the character on the abscissa direction on the page, And width range of the character on ordinate direction.Then between two neighboring character in same row Character pitch is exactly the abscissa difference between latter character and previous character, according to phase in a row Character pitch between adjacent two characters calculates the character field segmentation distance of each row on the page, Carried out to how to set character field segmentation distance according to the character pitch between two neighboring character in every a line Set, wherein, character field be can split into the set of multigroup character in a row, illustrate such as Under:The content recorded in a row in the page is as follows:
Name:Xiao Ming grade:Two classes of sexes of Third school grade:Man
Then having just in row as above can have 3 character fields, respectively " name:Xiao Ming ", " grade: Two classes of Third school grade ", " sex:Man ".Predict after character field segmentation distance, split using the character field Distance is segmented to multiple characters that every a line includes, obtains multiple words that the every a line on the page includes Symbol section.
Further, step C1 is according to the character pitch between two neighboring character in the every a line on the page The character field segmentation distance of the every a line on the page is calculated, specifically be may include steps of:
Character pitch in every a line on C11, calculating page-out between two neighboring character;
C12, descending row is carried out according to numerical values recited to the character pitch between two neighboring character in every a line Row, the median in selection character pitch splits distance as the character field of the every a line on the page.
Wherein, after calculating the character pitch in every a line between two neighboring character, to getting Have character pitch carries out descending arrangement according to numerical values recited, and selection is in the intercharacter of median in the ranking Split distance away from as character field, the foundation that character field segmentation distance is split as character field, selection is all Median in character pitch can accurately get the character in every a line as character field segmentation distance Section.
Further, step C2 splits multiple words that distance includes to the every a line on the page according to character field Symbol is segmented, and obtains multiple character fields that the every a line on the page includes, can specifically include following step Suddenly:
C21, arbitrarily one character of selection as current character, obtains current from the multiple characters on the page Character pitch between character and adjacent character;
If character pitch between C22, current character and adjacent character less than or equal to character field split away from From, current character and adjacent character are divided into a character field, if current character and adjacent character it Between character pitch more than character field split distance, by current character and adjacent character be divided into two it is different Character field in.
Wherein, a determination methods for character field are given in step C21 to C22, for step C21 Character field judgment mode to C22 is applied to the character field determination process of each character on the page, wherein, An optional character is used as current character, step C21 and step using in the multiple characters on the page In C22 so that the character field to current character judges as an example, all characters in the page can use right The character field judgment mode of current character.
102nd, according to the overlay information between character on the page on altitude range to each the row bag on the page The multiple characters for including enter every trade correction, obtain the multiple characters after the row correction that the every a line on the page includes.
In embodiments of the present invention, for all characters on the page, obtain on the page between all characters There is the character of overlap on altitude range, every trade is entered to there is the character for overlapping between character on altitude range Correction, wherein, on the page between character on altitude range exist overlap may determine that it is big on page-out The multiple characters in same row are caused, because belonging to the height of multiple characters of same row on the page Scope is all similar, if because when character produces inclination in the picture of the typesetting of character or the shooting page, Some characters may be divided into the row of mistake, therefore character in the embodiment of the present invention on to the page is returned Belong to after multiple rows, multiple characters that each row on the page includes can also be entered according to overlay information Every trade is corrected, thus correct be likely to occur enter the misjudgment that produces when every trade belongs to character.
In some embodiments of the invention, step 102 according between character on the page on altitude range Overlay information multiple characters that each row on the page includes are entered every trade correction, specifically can include such as Lower step:
D1, arbitrarily one character of selection as current character, obtains height from the multiple characters on the page Scope has overlap multiple characters with the altitude range of current character;
If D2, the altitude range for getting have overlap multiple characters all to belong to the altitude range of current character In same row, then the row where keeping current character is constant;
If D3, the altitude range for getting have overlap multiple characters to distinguish with the altitude range of current character Belong to two rows, altitude range has overlap with the altitude range of current character during two rows are calculated respectively The number of character, the row where current character is defined as into altitude range has with the altitude range of current character The most row of the number of the character of overlap.
Wherein, the method that a character enters every trade correction is given in step D1 to D2, for step The row correction judgment mode of D1 to D3 is applied to the determination process to the row correction of each character on the page, Wherein, using an optional character in the multiple characters on the page, used as current character, step D1 is extremely In step D3 by taking the row correction to current character as an example, all characters in the page can be used to working as The row correcting mode of preceding character.Wherein, altitude range has overlap multiple with the altitude range of current character Character is probably same row, it is also possible to two adjacent rows, is sentenced by way of statistics in step D3 Break and the row that current character should belong to, after can judging entering every trade ownership in former step 101 further Realization row correction, such that it is able to realize the character recognition of high accuracy.
103rd, the multiple characters after the row included to the every a line on the page using semantic analysis model is corrected enter Row missed suppression.
In embodiments of the present invention, the multiple characters after the row correction that each row on the page includes are obtained Afterwards, missed suppression can be carried out to above-mentioned character according to default semantic analysis model, wherein, this hair The semantic analysis model used in bright embodiment can be word2vec, or HMM etc., The further optimization that missed suppression can be realized to character identification result is carried out to character, is more met language Say the character recognition effect of custom.
In previously described embodiments of the present invention, if performing step C1 and C2, carried out by step 102 After row correction, all include multiple character fields, each character field bag in multiple character fields on the page per a line Include:Row correction after multiple characters, it is this realize scene under, step 103 use semantic analysis model Multiple characters after the row correction included to the every a line on the page carry out missed suppression, including:
E1, using semantic analysis model to information and the intersegmental letter of character in character field in the every a line on the page Breath carries out missed suppression respectively.
Wherein, if having got the character field that the every a line on the page includes, can it is intersegmental to character and Missed suppression is carried out respectively in character field, and the process that specifically used semantic analysis model carries out missed suppression can Refering to prior art.
Description by above example to the embodiment of the present invention, first according to character on the page Elevation information belongs to the multiple characters on the page on the multiple rows on the page, obtains each on the page Multiple characters that row includes, elevation information of the character on the page includes:Vertical seat of the character on the page The altitude range of mark and the character, next believes according to the overlap between character on the page on altitude range Breath enters every trade correction to multiple characters that each row on the page includes, the every a line obtained on the page includes Row correction after multiple characters, the row for finally being included to the every a line on the page using semantic analysis model Multiple characters after correction carry out missed suppression.Elevation information in the present invention using character on the page will All characters on the page belong to multiple rows, due to same a line ordinate of the character on the page and The altitude range of character is all relatively fixed, thus according to character elevation information to all characters on the page Carry out branch to branch's result be accurate, and can also be according between character on the page in the present invention Overlay information on altitude range enters every trade correction to multiple characters that each row includes, therefore can lead to The row that lap over infomation detection should belong to character in the character typesetting to different-format, is believed by overlapping Breath there may be inclined character in may also detect that the parent page of shooting, may be used also in the present invention in addition Missed suppression is carried out to character with using semantic analysis model, therefore enters every trade correction and semantic school to character Can just change causes because there is inclined character in the form difference of character typesetting and shooting parent page Identification mistake, improve character recognition recognition effect.
For ease of being better understood from and implementing the such scheme of the embodiment of the present invention, illustrate accordingly should below It is specifically described with scene.For getting split character block in the embodiment of the present invention, not only Employ mutual logical relation between character block, and row is not lined up in itself to also allow for character in branch Problem, i.e., row correction when captured character row is not horizontal.
Due to many people in current OCR technique it is contemplated that Character segmentation and identification division, for final Display result, split by character row and merged, can be to OCR the semantic analysis that needs to use is combined Whole structure have very big lifting, especially generation OCR character rows it is not parallel, have various situations such as noise Under, the embodiment of the present invention can well carry out character row fractionation, and usage scenario is extensive.One kind of the invention Application scenarios, specifically may include steps of, and refer to as shown in Fig. 2-a.
The single character that step 1, OCR are identified
In the embodiment of the present invention, first by OCR technique to single Character segmentation, identification.First with Character segmentation method (method such as such as image binaryzation, convex closure) in OCR technique split after it is each Individual character block, then by the way that relation between logical sum character block is rational single Character segmentation and identifies original Beginning character.Recognition methods is such as:Based on the matching of convex closure profile, the Gradient Features matching based on gradation of image Deng, the character and its recognition result of all segmentations under a page have been obtained, based on the above results, connect Each fritter for having obtained each character by methods such as binaryzation, convex closures to constitute, such as character " small " The fritter of three parts is obtained, respectively:Zuo Dian, erects hook, right point, is recognized by combining each character block, Finally identify that this three pieces of characters merge the Chinese character of gained.
Noise on step 2, the removal page
Char_height (is used by the width (being represented with char_width) and height that calculate overall character in the page Represent) summation, obtain average character duration (char_average_width) on the page and averagely Character height (char_average_height), then removes from the original character of the page and is more than The original character of 1.5*char_average_width or 1.5*char_average_height, because in a word In the page of symbol, it is all based on the distribution of character in most cases, and character is distributed with certain rule Rule property, or on height wide unanimously (for example:State and), or it is wide consistent (for example:First, two), Height is consistent (for example such as:!, [etc.), total some larger blocks or less piece can in character recognition Influence branch's effect of character, such as relatively large character height that can take two rows makes during remerging The character field that currently must be originally divided into two rows merges into a character field, so as to influence final character recognition As a result.
Step 3, the calculating page are per a line and character field
First, the preliminary split result of the character branch being calculated on the page.Obtained using in step 1 Rational character distribution, the merging for entering line character according to following logic (is used so as to obtain preliminary character field Char_section is represented).
(1), to traveling row label belonging to each character:The position of current character can be defined as (char_x, char_y, char_width, char_height), wherein, char_x is current character upper left corner place The x coordinate of the page, char_y is the y-coordinate of the page where the current character upper left corner, and char_width is to work as Preceding character duration, the width range of current character is (char_x, char_x+char_width), char_height For current character highly, the altitude range of current character is (char_y, char_y+char_height).Profit With character location information affiliated here, the y-coordinate that can calculate the central point of each character is: Char_y+char_height/2, if the central point y-coordinate of current character is in the previous character institute of current character Altitude range in, then current character and previous character are belonging to a line, formulate as follows:
chari+1_y+chari+1_height/2∈[chari_y,chari_y+chari_height];
Wherein, i=(1,2,3 ... .n), n are total number of characters of current page.
Each character can be belonged on corresponding row according to the above method, thus obtain current The first walking property distribution of page character.
(2) character field merged into line character is calculated to the affiliated character per a line and splits distance:
After by above, the method for (1) obtains the affiliated character of every a line, calculated currently using following methods Character pitch in row between connected two characters:
distancek=chari+1_x-(chari_x+chari_width);
Wherein i=1,2,3 ... m, k=1,2,3 ... ..m-1, m are the number of characters in current line. Each adjacent character spacing (distance) to being stored is ranked up according to descending, inside selection in Between value (i.e. distance_sort_middle) as current line character field split distance.
(3) split distance according to character field to be segmented the character of current line, obtain each word of current line Symbol section:
Character for the character pitch in current line more than distance_sort_middle is split to two words In symbol section, it is merged into same character field for the character less than distance_sort_middle.Wherein, Here fractionation is exactly that, separately as new set expression, merging is that current character is merged current character It is used to build character field in the set for meet condition.
The segment information of character in the affiliated row of character and row under a page has been obtained according to the method described above, so Step 4 is performed afterwards.
Step 4, character is entered every trade correction
Wherein, to the character correction character row attribute on the page:Entering every trade Attribute transposition process to character It is middle because shooting angle problem occur this for a line character no longer horizontal direction on, as shown in figure Fig. 2-b, Each frame is a character distribution, originally belongs to the character of the second row because the outside cause such as shooting angle is marked The attribute of the first row is noted, the character row that wire frame frame long is outlined with point frame is according to the method in step 3 Obtain, can be as follows using the method for correction in this step 4:
(1) the row attribute of all characters of current page, is obtained in the case where step 3 method is completed, to each character The row attaching information of current character is calculated as follows:
Current character line range:chari_ range=[chari_y,chari_y+chari_height];
If chariThe char of _ range and another characterj_ range has overlap, then right chariThis one-dimensional record array relevant position of _ line_record increases a record value.Finally compare two It is individually present in row after the number of the character of overlap, takes chari_ line_record the insides numerical value highest institute is right The current char of behavior for answeringiRow attribute.By taking Fig. 2-b as an example, for last in two rows in the page Character, chari_ line_record array lengths are 2, are judged using current character line range, can be obtained It is 5, char to the character number that Fig. 2-b last characters are overlapped in the first rowi_ line_record [0]=5 And the character number for overlapping in a second row is:6, chari_ line_record [1]=6, at this moment in selection array Numerical value highest 6 is expert at (i.e. the second row) as the row attribute of current character, then last in Fig. 2-b The row attribute of one character is defined as the second row.
Wherein, i, j=(1,2,3 ... .n), n is total number of characters of current page, chari_ range is one Individual character y directions distribution, chari_ line_record is the dimension group that initial value is 0, Number of dimensions is total line number resulting under step 3 method.
Step 5, missed suppression is carried out to information in character field and the intersegmental information of character
For the segmented each character field for having merged, character field can again be entered using semantic analysis model Row missed suppression, and be that the semantic analysis model of each character content fusion of identification in character field is sentenced Disconnected, semantic analysis model can be using the technology of current comparative maturity such as:Word2vec, or hidden Ma Er Section's husband's model etc..So it is corrected for itself wrong part of identification in character field, is such as originally used for depth Zhen Shi, is erroneously identified as Shen Xun cities etc., intersegmental for character, then can be entered using semantic analysis model The further missed suppression of row, please refers to as shown in Fig. 2-c and Fig. 2-d respectively, and Fig. 2-c are for before missed suppression Character field schematic diagram, Fig. 2-d be missed suppression after character field schematic diagram, according to semantic analysis model, " knot " and " fruit " should belong to a semantic section, therefore can carry out the merging of semantic section.
By the foregoing citing description of this invention, the present invention can utilize character pitch and character institute In the position of the page, character content and semantic analysis in itself is merged, the fractionation to partial character row is rectified Just, so that character row segmentation more rationally, logic of language is more met on result is presented.By a large amount of Experiment test prove that method provided in an embodiment of the present invention can be than other original methods to OCR words In the segmentation of symbol branch more rationally, the custom of language and is more met in terms of content and in charcter topology arrangement.
It should be noted that for foregoing each method embodiment, in order to be briefly described, therefore by its all table It is a series of combination of actions to state, but those skilled in the art should know, the present invention does not receive to be retouched The limitation of the sequence of movement stated because according to the present invention, some steps can using other order or Carry out simultaneously.Secondly, those skilled in the art should also know, embodiment described in this description Preferred embodiment is belonged to, necessary to involved action and the module not necessarily present invention.
For ease of preferably implementing the such scheme of the embodiment of the present invention, it is also provided below for implementation State the relevant apparatus of scheme.
Refer to shown in Fig. 3-a, a kind of processing unit 300 of character recognition provided in an embodiment of the present invention, Can include:Row splits module 301, row correction module 302, missed suppression module 303, wherein,
Row splits module 301, for the elevation information according to character on the page by the multiple on the page Character is belonged on the multiple rows on the page, obtains multiple words that the every a line on the page includes Symbol, elevation information of the character on the page includes:Character ordinate on the page and should The altitude range of character;
Row correction module 302, for according to the overlay information between character on the page on altitude range Multiple characters that each row on the page includes are entered with every trade correction, obtains each on the page Multiple characters after the row correction that row includes;
Missed suppression module 303, for what is included to the every a line on the page using semantic analysis model Multiple characters after row correction carry out missed suppression.
In some embodiments of the invention, as shown in Fig. 3-b, the row splits module 301, including:
Character center point determining unit 3011, for any selection one from the multiple characters on the page Individual character calculates described current as current character, the elevation information according to the current character on the page The central point of character ordinate on the page;
Row judging unit 3012, for judging the central point of the current character whether in the current character Previous character altitude range in, if the central point of current character is in the previous of the current character In the altitude range of character, then the current character and the previous character belong to same row, if working as The central point of preceding character is not in the altitude range of the previous character of the current character, then described current Character is belonging respectively to two different rows with the previous character.
In some embodiments of the invention, as shown in Fig. 3-c, the row correction module 302, including:
High superposed character determining unit 3021, for arbitrarily being selected from the multiple characters on the page Used as current character, obtain altitude range has overlap to one character with the altitude range of the current character Multiple characters;
Row determining unit 3022, if for the altitude range of the altitude range that gets and the current character The multiple characters for having overlap belong to same row, then the row where keeping the current character is constant;If The altitude range for getting has overlap multiple characters to be belonging respectively to two with the altitude range of the current character Individual row, altitude range has overlap word with the altitude range of the current character during two rows are calculated respectively The number of symbol, the row where the current character is defined as the height of altitude range and the current character Scope has the most row of the number of the character of overlap.
In some embodiments of the invention, as shown in Fig. 3-d, the processing unit 300 of the character recognition, Also include:Character denoising module 304, height of the module 301 according to character on the page is split for the row Before on multiple rows that degree information belongs to the multiple characters on the page on the page, according to from The multiple symbolic blocks being partitioned on the page identify the multiple original characters on the page;According to institute State all original characters of the altitude range and width range of each original character on the page from the page In weed out excessive character or cross small characters, obtain the multiple characters on the page.
In some embodiments of the invention, as shown in Fig. 3-e, the processing unit 300 of the character recognition, Also include:Character field determining module 305, for the row split module 301 according to character on the page Elevation information belongs to the multiple characters on the page on the multiple rows on the page, obtains described After multiple characters that every a line on the page includes, according to two neighboring in the every a line on the page Character pitch between character calculates the character field segmentation distance of the every a line on the page;According to described Character field segmentation distance is segmented to multiple characters that the every a line on the page includes, obtains described Multiple character fields that every a line on the page includes.
In some embodiments of the invention, each character field includes in the multiple character field:Row correction Character afterwards;
The missed suppression module 303, specifically for using semantic analysis model to each on the page Information and the intersegmental information of character carry out missed suppression respectively in character field in row.
Description more than to the embodiment of the present invention, the height letter first according to character on the page Breath belongs to the multiple characters on the page on the multiple rows on the page, and the every a line obtained on the page includes Multiple characters, elevation information of the character on the page include:Ordinate of the character on the page and should The altitude range of character, next according to the overlay information between character on the page on altitude range to page Multiple characters that each row on face includes enter every trade correction, obtain the row school that the every a line on the page includes Multiple characters after just, after the row for finally being included to the every a line on the page using semantic analysis model is corrected Multiple characters carry out missed suppression.Using elevation information of the character on the page by the page in the present invention All characters belong to multiple rows, due to the ordinate and character of the character on the page of same a line Altitude range is all relatively fixed, therefore elevation information according to character is divided all characters on the page The branch's result gone is accurate, and in the present invention can also according between character on the page height Overlay information in scope enters every trade correction to multiple characters that each row includes, therefore can be by overlapping The row that infomation detection should belong to character in the character typesetting to different-format, also may be used by overlay information There may be inclined character in the parent page for detecting shooting, can also be utilized in the present invention in addition Semantic analysis model carries out missed suppression to character, therefore enters every trade correction to character and missed suppression can be with Change because the form of character typesetting is different and shoots in parent page to exist and recognize caused by inclined character Mistake, improves the recognition effect of character recognition.
Fig. 4 is that the processing method of character recognition provided in an embodiment of the present invention is applied to a kind of knot of server Structure schematic diagram, the server 400 be able to can be wrapped because of configuration or performance is different and the larger difference of producing ratio One or more central processing units (central processing units, CPU) 422 is included (for example, one Individual or more than one processor) and memory 432, one or more storage application programs 442 or number According to 444 storage medium 430 (such as one or more mass memory units).Wherein, memory 432 and storage medium 430 can be it is of short duration storage or persistently storage.Store the program in storage medium 430 One or more modules (diagram is not marked) can be included, each module can include in server Series of instructions operation.Further, central processing unit 422 could be arranged to and storage medium 430 Communication, the series of instructions operation in performing storage medium 430 on server 400.
Server 400 can also include one or more power supplys 426, one or more it is wired or Radio network interface 450, one or more input/output interfaces 458, and/or, one or one with Upper operating system 441, such as Windows ServerTM, Mac OS XTM, UnixTM, LinuxTM, FreeBSDTM etc..
Step in above-described embodiment as performed by server can be based on the character recognition shown in the Fig. 1 Processing method.
In addition it should be noted that, device embodiment described above is only schematical, wherein described The unit illustrated as separating component can be or may not be it is physically separate, it is aobvious as unit The part for showing can be or may not be physical location, you can with positioned at a place, or also may be used To be distributed on multiple NEs.Some or all of mould therein can according to the actual needs be selected Block realizes the purpose of this embodiment scheme.In addition, in the device embodiment accompanying drawing of present invention offer, mould Annexation between block represents between them there is communication connection, specifically can be implemented as one or more Communication bus or holding wire.Those of ordinary skill in the art without creative efforts, i.e., It is appreciated that and implements.
Through the above description of the embodiments, it is apparent to those skilled in the art that originally Invention can add the mode of required common hardware to realize by software, naturally it is also possible to by specialized hardware Realized including application specific integrated circuit, dedicated cpu, private memory, special components and parts etc..General feelings Under condition, all functions of being completed by computer program can be realized easily with corresponding hardware, and And, the particular hardware structure for realizing same function can also be it is diversified, such as analog circuit, Digital circuit or special circuit etc..But, it is more for the purpose of the present invention in the case of software program realize be more Good implementation method.Based on such understanding, technical scheme is substantially in other words to existing skill The part that art contributes can be embodied in the form of software product, computer software product storage In the storage medium that can read, such as computer floppy disk, USB flash disk, mobile hard disk, read-only storage (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), magnetic Dish or CD etc., including some instructions are used to so that computer equipment (can be personal computer, Server, or the network equipment etc.) perform method described in each embodiment of the invention.
In sum, the above embodiments are merely illustrative of the technical solutions of the present invention, rather than its limitations; Although being described in detail to the present invention with reference to above-described embodiment, one of ordinary skill in the art should Work as understanding:It can still modify to the technical scheme described in the various embodiments described above, or to it Middle some technical characteristics carry out equivalent;And these modifications or replacement, do not make appropriate technical solution Essence depart from various embodiments of the present invention technical scheme spirit and scope.

Claims (12)

1. a kind of processing method of character recognition, it is characterised in that including:
Multiple characters on the page are belonged to the page by the elevation information according to character on the page On multiple rows on, obtain multiple characters that the every a line on the page includes, the character is in the page On elevation information include:The altitude range of character ordinate on the page and the character;
According to the overlay information between character on the page on altitude range to each on the page Multiple characters that row includes enter every trade correction, after obtaining the row correction that the every a line on the page includes Multiple characters;
Entered using the multiple characters after the row correction that semantic analysis model includes to the every a line on the page Row missed suppression.
2. method according to claim 1, it is characterised in that it is described according to character on the page Elevation information belongs to the multiple characters on the page on the multiple rows on the page, obtains described Multiple characters that every a line on the page includes, including:
A character is arbitrarily selected from the multiple characters on the page as current character, according to described Elevation information of the current character on the page calculates the central point of the current character on the page vertical Coordinate;
Judge the current character central point whether the current character previous character height model In enclosing, if the central point of current character is in the altitude range of the previous character of the current character, The current character and the previous character belong to same row, if the central point of current character is not in institute State in the altitude range of previous character of current character, then the current character and the previous character It is belonging respectively to two different rows.
3. method according to claim 1, it is characterised in that described according to character on the page Between overlay information on altitude range every trade is entered to multiple characters that each row on the page includes Correction, including:
Arbitrarily one character of selection obtains height as current character from the multiple characters on the page Scope has overlap multiple characters with the altitude range of the current character;
If the altitude range for getting has overlap multiple characters all to belong to the altitude range of the current character In same row, then the row where keeping the current character is constant;
If the altitude range for getting has overlap multiple characters to distinguish with the altitude range of the current character Belong to two rows, altitude range has weight with the altitude range of the current character during two rows are calculated respectively The number of folded character, altitude range is defined as with the current character by the row where the current character Altitude range have the most row of the number of the character of overlap.
4. method according to claim 1, it is characterised in that it is described according to character on the page It is described before on multiple rows that elevation information belongs to the multiple characters on the page on the page Method also includes:
Multiple symbolic blocks according to being partitioned into from the page identify the multiple original words on the page Symbol;
The institute of altitude range and width range according to each original character on the page from the page Have in original character and weed out excessive character or cross small characters, obtain the multiple characters on the page.
5. method according to any one of claim 1 to 4, it is characterised in that described according to word Accord with multiple rows that the elevation information on the page belongs to the multiple characters on the page on the page On, obtaining after multiple characters that the every a line on the page includes, methods described also includes:
The page is calculated according to the character pitch between two neighboring character in the every a line on the page On every a line character field segmentation distance;
Split distance according to the character field to divide multiple characters that the every a line on the page includes Section, obtains multiple character fields that the every a line on the page includes.
6. method according to claim 5, it is characterised in that each word in the multiple character field Symbol section includes:Multiple characters after row correction;
Multiple words after the row correction that the use semantic analysis model includes to the every a line on the page Symbol carries out missed suppression, including:
Using semantic analysis model to information and the intersegmental letter of character in character field in the every a line on the page Breath carries out missed suppression respectively.
7. a kind of processing unit of character recognition, it is characterised in that including:
Row splits module, for the elevation information according to character on the page by the multiple words on the page Symbol is belonged on the multiple rows on the page, obtains multiple characters that the every a line on the page includes, Elevation information of the character on the page includes:Character ordinate on the page and the character Altitude range;
Row correction module, for according to the overlay information pair between character on the page on altitude range Multiple characters that each row on the page includes enter every trade correction, obtain the every a line on the page Including row correction after multiple characters;
Missed suppression module, for the row included to the every a line on the page using semantic analysis model Multiple characters after correction carry out missed suppression.
8. device according to claim 7, it is characterised in that the row splits module, including:
Character center point determining unit, for arbitrarily selecting a word from the multiple characters on the page Symbol calculates the current character as current character, the elevation information according to the current character on the page Central point ordinate on the page;
Row judging unit, for judging the central point of the current character whether before the current character In one altitude range of character, if the central point of current character is in the previous character of the current character Altitude range in, then the current character and the previous character belong to same row, if current word The central point of symbol not in the altitude range of the previous character of the current character, then the current character Two different rows are belonging respectively to the previous character.
9. device according to claim 7, it is characterised in that the row correction module, including:
High superposed character determining unit, for any selection one from the multiple characters on the page Used as current character, obtain altitude range has overlap multiple to character with the altitude range of the current character Character;
Row determining unit, if the altitude range and the altitude range of the current character for getting have weight Folded multiple characters belong to same row, then the row where keeping the current character is constant;If obtaining To altitude range there are overlap multiple characters to be belonging respectively to two with the altitude range of the current character OK, altitude range has overlap character with the altitude range of the current character during two rows are calculated respectively Number, the row where the current character is defined as the height model of altitude range and the current character It is with the most row of the number of the character of overlap.
10. device according to claim 7, it is characterised in that the treatment dress of the character recognition Put, also include:Character denoising module, height of the module according to character on the page is split for the row Before on multiple rows that information belongs to the multiple characters on the page on the page, according to from institute State multiple original characters that the multiple symbolic blocks being partitioned on the page are identified on the page;According to described The altitude range and width range of each original character are from all original characters on the page on the page Weed out excessive character or cross small characters, obtain the multiple characters on the page.
11. device according to any one of claim 7 to 10, it is characterised in that the character The processing unit of identification, also includes:Character field determining module, module is split according to character for the row Multiple characters on the page are belonged to elevation information on the page the multiple rows on the page On, after obtaining multiple characters that the every a line on the page includes, according to each on the page In row character pitch between two neighboring character calculate every a line on the page character field split away from From;Split distance according to the character field to divide multiple characters that the every a line on the page includes Section, obtains multiple character fields that the every a line on the page includes.
12. devices according to claim 11, it is characterised in that in the multiple character field each Character field includes:Character after row correction;
The missed suppression module, specifically for using semantic analysis model to the every a line on the page Information and the intersegmental information of character carry out missed suppression respectively in middle character field.
CN201510410166.5A 2015-07-13 2015-07-13 Character recognition processing method and device Active CN106709489B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510410166.5A CN106709489B (en) 2015-07-13 2015-07-13 Character recognition processing method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510410166.5A CN106709489B (en) 2015-07-13 2015-07-13 Character recognition processing method and device

Publications (2)

Publication Number Publication Date
CN106709489A true CN106709489A (en) 2017-05-24
CN106709489B CN106709489B (en) 2020-03-03

Family

ID=58898678

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510410166.5A Active CN106709489B (en) 2015-07-13 2015-07-13 Character recognition processing method and device

Country Status (1)

Country Link
CN (1) CN106709489B (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107437084A (en) * 2017-07-24 2017-12-05 南京晓庄学院 A kind of character center of gravity localization method of line Handwritten text identification
CN108182432A (en) * 2017-12-28 2018-06-19 北京百度网讯科技有限公司 Information processing method and device
CN109871743A (en) * 2018-12-29 2019-06-11 口碑(上海)信息技术有限公司 The localization method and device of text data, storage medium, terminal
CN110135417A (en) * 2018-02-09 2019-08-16 北京世纪好未来教育科技有限公司 Sample mask method and computer storage medium
CN110378347A (en) * 2019-07-04 2019-10-25 北京爱医生智慧医疗科技有限公司 A kind of the key message extracting method and device of medical inspection list

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3435374B2 (en) * 1999-10-04 2003-08-11 沖電気工業株式会社 Character reading device and character recognition method
CN102024139A (en) * 2009-09-18 2011-04-20 富士通株式会社 Device and method for recognizing character strings
CN102779275A (en) * 2012-07-04 2012-11-14 广州广电运通金融电子股份有限公司 Paper characteristic identification method and relative device
CN103577818A (en) * 2012-08-07 2014-02-12 北京百度网讯科技有限公司 Method and device for recognizing image characters
CN104683629A (en) * 2013-11-26 2015-06-03 柯尼卡美能达株式会社 Image forming apparatus, text data embedding method

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3435374B2 (en) * 1999-10-04 2003-08-11 沖電気工業株式会社 Character reading device and character recognition method
CN102024139A (en) * 2009-09-18 2011-04-20 富士通株式会社 Device and method for recognizing character strings
CN102779275A (en) * 2012-07-04 2012-11-14 广州广电运通金融电子股份有限公司 Paper characteristic identification method and relative device
CN103577818A (en) * 2012-08-07 2014-02-12 北京百度网讯科技有限公司 Method and device for recognizing image characters
CN104683629A (en) * 2013-11-26 2015-06-03 柯尼卡美能达株式会社 Image forming apparatus, text data embedding method

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107437084A (en) * 2017-07-24 2017-12-05 南京晓庄学院 A kind of character center of gravity localization method of line Handwritten text identification
CN107437084B (en) * 2017-07-24 2020-12-08 南京晓庄学院 Character gravity center positioning method for off-line handwritten text recognition
CN108182432A (en) * 2017-12-28 2018-06-19 北京百度网讯科技有限公司 Information processing method and device
US10963760B2 (en) 2017-12-28 2021-03-30 Beijing Baidu Netcom Science And Technology Co., Ltd. Method and apparatus for processing information
CN110135417A (en) * 2018-02-09 2019-08-16 北京世纪好未来教育科技有限公司 Sample mask method and computer storage medium
CN109871743A (en) * 2018-12-29 2019-06-11 口碑(上海)信息技术有限公司 The localization method and device of text data, storage medium, terminal
CN110378347A (en) * 2019-07-04 2019-10-25 北京爱医生智慧医疗科技有限公司 A kind of the key message extracting method and device of medical inspection list
CN110378347B (en) * 2019-07-04 2021-10-08 北京爱医生智慧医疗科技有限公司 Method and device for extracting key information of medical examination sheet

Also Published As

Publication number Publication date
CN106709489B (en) 2020-03-03

Similar Documents

Publication Publication Date Title
CN110084292B (en) Target detection method based on DenseNet and multi-scale feature fusion
CN109816012B (en) Multi-scale target detection method fusing context information
CN108960211B (en) Multi-target human body posture detection method and system
CN110991311B (en) Target detection method based on dense connection deep network
CN106709489A (en) Processing method and device of character identification
CN106570453B (en) Method, device and system for pedestrian detection
CN110738207A (en) character detection method for fusing character area edge information in character image
CN108537824B (en) Feature map enhanced network structure optimization method based on alternating deconvolution and convolution
CN110738101A (en) Behavior recognition method and device and computer readable storage medium
CN106960195A (en) A kind of people counting method and device based on deep learning
CN110598788B (en) Target detection method, target detection device, electronic equipment and storage medium
CN109492596B (en) Pedestrian detection method and system based on K-means clustering and regional recommendation network
CN106548169A (en) Fuzzy literal Enhancement Method and device based on deep neural network
CN105303163B (en) A kind of method and detection device of target detection
CN109472193A (en) Method for detecting human face and device
CN114140683A (en) Aerial image target detection method, equipment and medium
CN109685145A (en) A kind of small articles detection method based on deep learning and image procossing
Wang et al. Deep learning model for target detection in remote sensing images fusing multilevel features
CN114783021A (en) Intelligent detection method, device, equipment and medium for wearing of mask
Hung et al. Skyline localization for mountain images
CN105023264A (en) Infrared image remarkable characteristic detection method combining objectivity and background property
CN111027551B (en) Image processing method, apparatus and medium
CN104463896A (en) Image corner point detection method and system based on kernel similar region distribution characteristics
CN111797737A (en) Remote sensing target detection method and device
CN116778386A (en) Sulfur hexafluoride leakage detection method, device, equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant