CN108171239A - The extracting method of certificate pictograph, apparatus and system, computer storage media - Google Patents

The extracting method of certificate pictograph, apparatus and system, computer storage media Download PDF

Info

Publication number
CN108171239A
CN108171239A CN201810104851.9A CN201810104851A CN108171239A CN 108171239 A CN108171239 A CN 108171239A CN 201810104851 A CN201810104851 A CN 201810104851A CN 108171239 A CN108171239 A CN 108171239A
Authority
CN
China
Prior art keywords
certificate
image
character
text message
pictograph
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810104851.9A
Other languages
Chinese (zh)
Inventor
李梓萁
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hangzhou Qing Technology Co Ltd
Original Assignee
Hangzhou Qing Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hangzhou Qing Technology Co Ltd filed Critical Hangzhou Qing Technology Co Ltd
Priority to CN201810104851.9A priority Critical patent/CN108171239A/en
Publication of CN108171239A publication Critical patent/CN108171239A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/14Image acquisition
    • G06V30/148Segmentation of character regions
    • G06V30/153Segmentation of character regions using recognition of characters or words
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/28Character recognition specially adapted to the type of the alphabet, e.g. Latin alphabet
    • G06V30/287Character recognition specially adapted to the type of the alphabet, e.g. Latin alphabet of Kanji, Hiragana or Katakana characters

Abstract

The present invention provides a kind of extracting method of certificate pictograph, apparatus and system, computer storage media, wherein, method includes:Certificate type is determined according to certificate image;The reference certificate text message of same type is transferred from far-end server according to the certificate type;Extract the character format with reference to certificate text message;Slit mode is determined according to the character format;The certificate image, which is cut, using the slit mode obtains character picture and non-character image.The present invention can effectively cut the character of different fonts form in certificate image, and character identification rate and reliability are high.

Description

The extracting method of certificate pictograph, apparatus and system, computer storage media
Technical field
The present invention relates to technical field of image processing more particularly to it is a kind of extract certificate image Chinese word method, specifically For be exactly a kind of extracting method of certificate pictograph, apparatus and system, computer storage media.
Background technology
With increasingly dependence of the people to computer technology and network communication, need to calculate a large amount of papery data typing Machine for example, during movable property and real estate sales/purchase, lease, is scanned certificate, material, qualifications, to historical document, Books carry out electronic disposal.In order to preferably preserve, retrieving, check word class paper material, people develop word knowledge again Other technology, automatic identification, extraction picture or photo in text information.
So-called Text region is exactly the technology using Computer Automatic Recognition character, is a weight of application of pattern recognition Want field.Particularly as being using a large amount of character sample, by complicated neural network learning, corresponding model file is generated, So as to achieve the purpose that identify character in picture or photo.Wherein, OCR (optical character recognition) texts Word identification is the representative of character recognition technology.OCR technique mainly identifies the character in shooting, scanned picture, it is necessary first to will scheme Character string cutting as in is opened, and forms the small picture for including single word, then the word after cutting is identified.Existing text Character segmentation is sciagraphy into common method, is by after pictograph binary conversion treatment, two are found by vertical projection method Character segmentation is come according to line of demarcation in line of demarcation between a word.
However, for the certificates such as diploma, business license, between the character in the certificate image of scanning or shooting Difference is very big, for example, the font of character, character boundary, character color and luster difference are very big in certificate image, and in certificate image There is adhesion, existing projecting method is difficult the character preferably in cutting certificate image, while cutting quality is good between character Bad to directly influence OCR Text region effects, correct text information can not be extracted from certificate image by eventually leading to.
Therefore, those skilled in the art improve OCR texts there is an urgent need for researching and developing a kind of method of character in effective cutting certificate image The accuracy of word identification.
Invention content
In view of this, the technical problem to be solved in the present invention is to provide a kind of extracting method of certificate pictograph, dress It puts and system, computer storage media, certificate image character segmentation can not be adapted to by solving conventional images word cutting mode, The problem of causing OCR technique that can not correctly identify character in certificate image.
In order to solve the above-mentioned technical problem, specific embodiment of the invention provides a kind of extraction side of certificate pictograph Method, including:Certificate type is determined according to certificate image;The ginseng of same type is transferred from far-end server according to the certificate type Investigate book text message;Extract the character format with reference to certificate text message;Cutting side is determined according to the character format Formula;The certificate image, which is cut, using the slit mode obtains character picture and non-character image.
The specific embodiment of the present invention also provides a kind of extraction element of certificate pictograph, including:First determines list Member, for determining certificate type according to certificate image;Unit is transferred, for being transferred according to the certificate type from far-end server The reference certificate text message of same type;Extraction unit, for extracting the character format with reference to certificate text message;The Two determination units, for determining slit mode according to the character format;Cutting unit, for being cut using the slit mode The certificate image obtains character picture and non-character image.
The specific embodiment of the present invention also provides a kind of extraction system of certificate pictograph, including:Multiple extraction dresses The far-end server put and connect with the extraction element.Wherein, the extraction element is used for true according to the certificate image Determine certificate type;The far-end server is used to be provided to the offer device according to the certificate type of the certificate image Reference certificate text message;The extraction element is additionally operable to extract the character format with reference to certificate text message, and root Slit mode is determined according to the character format, to cut the certificate image using the slit mode.
The specific embodiment of the present invention also provides a kind of computer storage media for including computer executed instructions, described When computer executed instructions are handled via data processing equipment, which performs the extraction side of certificate pictograph Method.
Above-mentioned specific embodiment according to the present invention is it is found that the extracting method of certificate pictograph, apparatus and system, meter Calculation machine storage medium at least has the advantages that:Certificate type is determined according to certificate image first, further according to certificate type Text message (Chinese, English, number etc.) of the same type with reference to certificate image is transferred from far-end server, extracts text message Character format, slit mode is then determined according to character format, slit mode is recycled to carry out cutting to certificate image, finally Identify the character picture after cutting.The present invention can effectively cut the character of different fonts form in certificate image, and be not required to Complicated neural network model is constructed, character identification rate and reliability are high, adapt to the needs of OCR character recognition technologies.
It is to be understood that above-mentioned general description and detailed description below are merely illustrative and illustrative, not It can the limitation range of the invention to be advocated.
Description of the drawings
Following appended attached drawing is the part of specification of the present invention, depicts example embodiments of the present invention, institute Attached drawing is used for illustrating the principle of the present invention together with the description of specification.
Fig. 1 is the stream of the embodiment one of the extracting method of a kind of certificate pictograph that the specific embodiment of the invention provides Cheng Tu.
Fig. 2 is the stream of the embodiment two of the extracting method of a kind of certificate pictograph that the specific embodiment of the invention provides Cheng Tu.
Fig. 3 is the stream of the embodiment three of the extracting method of a kind of certificate pictograph that the specific embodiment of the invention provides Cheng Tu.
Fig. 4 is the stream of the example IV of the extracting method of a kind of certificate pictograph that the specific embodiment of the invention provides Cheng Tu.
Fig. 5 is the knot of the embodiment one of the extraction element of a kind of certificate pictograph that the specific embodiment of the invention provides Structure schematic diagram.
Fig. 6 is the knot of the embodiment two of the extraction element of a kind of certificate pictograph that the specific embodiment of the invention provides Structure schematic diagram.
Fig. 7 is the knot of the embodiment three of the extraction element of a kind of certificate pictograph that the specific embodiment of the invention provides Structure schematic diagram.
Fig. 8 is the knot of the example IV of the extraction element of a kind of certificate pictograph that the specific embodiment of the invention provides Structure schematic diagram.
Fig. 9 is the application schematic diagram of the extraction system of a kind of certificate pictograph that the specific embodiment of the invention provides.
Specific embodiment
Purpose, technical scheme and advantage to make the embodiment of the present invention are more clearly understood, below will with attached drawing and in detail Narration clearly illustrates the spirit of disclosed content, and any skilled artisan is understanding the content of present invention After embodiment, when the technology that can be taught by the content of present invention, it is changed and modifies, without departing from the essence of the content of present invention God and range.
The illustrative embodiments of the present invention and their descriptions are used to explain the present invention, but not as a limitation of the invention. In addition, element/component of the same or like label used in drawings and the embodiments is for representing same or like portion Point.
About " first " used herein, " second " ... etc., not especially censure the meaning of order or cis-position, It is non-to limit the present invention, only for distinguishing the element described with same technique term or operation.
About direction term used herein, such as:Upper and lower, left and right, front or rear etc. are only the sides of refer to the attached drawing To.Therefore, the direction term used is intended to be illustrative and not intended to limit this creation.
It is open term, i.e., about "comprising" used herein, " comprising ", " having ", " containing " etc. Mean including but not limited to.
About it is used herein " and/or ", including any of the things or all combination.
Include " two " and " two or more " about " multiple " herein;Include " two groups " about " multigroup " herein And " more than two ".
About term used herein " substantially ", " about " etc., to modify it is any can be with the quantity or mistake of microvariations Difference, but this slight variations or error can't change its essence.In general, microvariations that such term is modified or error Range in some embodiments can be 20%, in some embodiments can be 10%, can be in some embodiments 5% or its His numerical value.It will be understood by those skilled in the art that the aforementioned numerical value referred to can be adjusted according to actual demand, it is not limited thereto.
It is certain to describe the word of the application by lower or discuss in the other places of this specification, to provide art technology Personnel's guiding additional in relation to the description of the present application.
Fig. 1 is the stream of the embodiment one of the extracting method of a kind of certificate pictograph that the specific embodiment of the invention provides Cheng Tu as shown in Figure 1, transferring the reference certificate text message of same type from far-end server according to certificate type, and is extracted With reference to the character format of certificate text message, slit mode is determined further according to character format, is finally cut and demonstrate,proved using slit mode Book image.
In the specific embodiment shown in the drawings, the extracting method of certificate pictograph includes:
Step 101:Certificate type is determined according to certificate image.In specific embodiments of the present invention, certificate can be graduation Card, marriage certificate, business license, honorary certificate etc.;Certificate image is the scanned copy of above-mentioned certificate or shooting photo.Certificate type For specific certificate issued department provide particular certificate, for example, certain colleges and universities provide diploma.
Step 102:The reference certificate text message of same type is transferred from far-end server according to the certificate type.This In the specific embodiment of invention, far-end server can be cloud server, server cluster etc..With reference to certificate text message just Refer to that the text on certificate image can be edited.
Step 103:Extract the character format with reference to certificate text message.In specific embodiments of the present invention, reference The character format of certificate text message is with specific reference to character boundary, character font, font color of certificate text message etc..
Step 104:Slit mode is determined according to the character format.In specific embodiments of the present invention, by corresponding Slit mode can be just rationally separated by the character in certificate image, convenient for later stage OCR Text region.
Step 105:The certificate image, which is cut, using the slit mode obtains character picture and non-character image.This hair In bright specific embodiment, character picture refers in image as a character (such as Chinese character, number, English word), non-character figure As referring in image as mark (e.g., figure, insignia, personage etc.).
Referring to Fig. 1, the reference certificate text of same type is transferred from far-end server according to the certificate type of certificate image Information, slit mode is determined using with reference to the character format of certificate text message, and certificate image is cut using slit mode, can be with Slit mode is rationally determined with reference to the form of certificate image context sheet, so as to which the character in certificate image is rationally separated, Improve the accuracy and discrimination of OCR Text regions.
Fig. 2 is the stream of the embodiment two of the extracting method of a kind of certificate pictograph that the specific embodiment of the invention provides Cheng Tu, as shown in Fig. 2, after cutting certificate image using slit mode, identification character picture obtains editable digital certificate text This information.
In the specific embodiment shown in the drawings, after step 105, the extracting method of certificate pictograph further includes:
Step 106:Identify that the character picture obtains editable digital certificate text message.Specifically, it can utilize OCR character recognition technologies identify character picture, and obtained editable digital certificate text message does not have any special form, text Information can be edited, be deleted.
Referring to Fig. 2, editable digital certificate text message is obtained using OCR character recognition technologies identification character picture, side Just user edits, and user experience is good.
Fig. 3 is the stream of the embodiment three of the extracting method of a kind of certificate pictograph that the specific embodiment of the invention provides Cheng Tu, as shown in figure 3, after identification character picture obtains editable digital certificate text message, using character format, editor can Digital certificate text message is edited, certificate graphs are restored according to non-character image and edited editable digital certificate text message Picture.
In the specific embodiment shown in the drawings, after step 106, the extracting method of certificate pictograph further includes:
Step 107:Utilize editable digital certificate text message described in the character format editor.The specific reality of the present invention It applies in example, after character format editor's editable digital certificate text message, the form and card of digital certificate text message The form of book image context sheet is identical.
Step 108:Contained according to the non-character image and the edited editable digital certificate text message recovery There is the certificate image of the editable digital certificate text message.In specific embodiments of the present invention, by non-character image It is punctured into edited editable digital certificate text message, the final space of a whole page for restoring certificate image.
Referring to Fig. 3, restore certificate image, card using non-character image and edited editable digital certificate text message Text message can be edited in book image, facilitate editor and the storage of certificate image, meet the particular demands of user.
Fig. 4 is the stream of the example IV of the extracting method of a kind of certificate pictograph that the specific embodiment of the invention provides Cheng Tu, as shown in figure 4, determining certificate type according to certificate image.
In the specific embodiment shown in the drawings, step 101 specifically includes:
Step 1011:Extract the main feature of the certificate image.In specific embodiments of the present invention, the master of certificate image Feature is wanted to include:Realizing text information, identification information, layout information etc. in certificate image.For example, in marriage certificate image " marriage certificate " three words, the layout information of marriage certificate is the main feature of certificate image.
Step 1012:The certificate type is determined according to the main feature.In specific embodiments of the present invention, according to card One or more main features of book image determine certificate type.
Referring to Fig. 4, certificate type is determined according to certificate image like clockwork, so as to according to certificate type from remote service Device accurately transfers the reference certificate text message of same type.
Fig. 5 is the knot of the embodiment one of the extraction element of a kind of certificate pictograph that the specific embodiment of the invention provides Structure schematic diagram, device as shown in Figure 5 can be applied in Fig. 1~method shown in Fig. 4, according to certificate type from remote service Device transfers the reference certificate text message of same type, and extracts the character format with reference to certificate text message, further according to character Form determines slit mode, finally cuts certificate image using slit mode.
In the specific embodiment shown in the drawings, the extraction element of certificate pictograph includes:First determination unit 1, Transfer unit 2, extraction unit 3, the second determination unit 4 and cutting unit 5.Wherein, the first determination unit 1 is used for according to certificate graphs As determining certificate type;It transfers unit 2 and is demonstrate,proved for transferring the reference of same type from far-end server according to the certificate type Book text message;Extraction unit 3 is used to extract the character format with reference to certificate text message;Second determination unit 4 is used for Slit mode is determined according to the character format;Cutting unit 5 is used to obtain using the slit mode cutting certificate image To character picture and non-character image.
Referring to Fig. 5, the reference certificate text of same type is transferred from far-end server according to the certificate type of certificate image Information, slit mode is determined using with reference to the character format of certificate text message, and certificate image is cut using slit mode, can be with Slit mode is rationally determined with reference to the form of certificate image context sheet, so as to which the character in certificate image is rationally separated, Improve the accuracy and discrimination of OCR Text regions.
Fig. 6 is the knot of the embodiment two of the extraction element of a kind of certificate pictograph that the specific embodiment of the invention provides Structure schematic diagram, as shown in fig. 6, after cutting certificate image using slit mode, identification character picture obtains editable number card Book text message.
In the specific embodiment shown in the drawings, the extraction element of certificate pictograph further includes recognition unit 6.Its In, recognition unit 6 is used to identify that the character picture obtains editable digital certificate text message.
Referring to Fig. 6, editable digital certificate text message is obtained using OCR character recognition technologies identification character picture, side Just user edits, and user experience is good.
Fig. 7 is the knot of the embodiment three of the extraction element of a kind of certificate pictograph that the specific embodiment of the invention provides Structure schematic diagram as shown in fig. 7, after identification character picture obtains editable digital certificate text message, is compiled using character format Editable digital certificate text message is collected, card is restored according to non-character image and edited editable digital certificate text message Book image.
In the specific embodiment shown in the drawings, the extraction element of certificate pictograph further includes:Edit cell 7 and extensive Multiple unit 8.Wherein, edit cell 7 is used to utilize editable digital certificate text message described in the character format editor;Restore Unit 8 is used for can containing described according to the non-character image and the edited editable digital certificate text message recovery Edit the certificate image of digital certificate text message.
Referring to Fig. 7, restore certificate image, card using non-character image and edited editable digital certificate text message Text message can be edited in book image, facilitate editor and the storage of certificate image, meet the particular demands of user.
Fig. 8 is the knot of the example IV of the extraction element of a kind of certificate pictograph that the specific embodiment of the invention provides Structure schematic diagram, as shown in figure 8, the first determination unit specifically includes the extraction module of the main feature of extraction certificate image and determines The determining module of certificate type.
In the specific embodiment shown in the drawings, the first determination unit 1 specifically includes extraction module 11 and determining module 12.Wherein, first determination unit 1 specifically includes:Extraction module 11 is used to extract the main feature of the certificate image;Really Cover half block 12 is used to determine the certificate type according to the main feature.
Referring to Fig. 8, certificate type is determined according to certificate image like clockwork, so as to according to certificate type from remote service Device accurately transfers the reference certificate text message of same type
Fig. 9 is the application schematic diagram of the extraction system of a kind of certificate pictograph that the specific embodiment of the invention provides, As shown in figure 9, the system includes:Multiple extraction elements 100 and the far-end server being connect with the extraction element 100 200.Wherein, the extraction element 100 is used to determine certificate type according to the certificate image;The far-end server 200 is used In the reference certificate text message that the certificate type according to the certificate image is provided to the offer device 100;It is described Extraction element 100 is additionally operable to extract the character format with reference to certificate text message, and determine to cut according to the character format The mode of dividing, to cut the certificate image using the slit mode.
Referring to Fig. 9, the character of different fonts form in certificate image can be effectively cut, and not need to construction complexity Neural network model, character identification rate and reliability are high, adapt to the needs of OCR character recognition technologies.
The specific embodiment of the invention provides a kind of computer storage media for including computer executed instructions, the computer When execute instruction is handled via data processing equipment, which performs the extracting method of certificate pictograph.Method Include the following steps:
Step 101:Certificate type is determined according to certificate image.
Step 102:The reference certificate text message of same type is transferred from far-end server according to the certificate type.
Step 103:Extract the character format with reference to certificate text message.
Step 104:Slit mode is determined according to the character format.
Step 105:The certificate image, which is cut, using the slit mode obtains character picture and non-character image.
The specific embodiment of the invention also provides a kind of computer storage media for including computer executed instructions, the calculating When machine execute instruction is handled via data processing equipment, which performs the extracting method of certificate pictograph.Side Method includes the following steps:
Step 101:Certificate type is determined according to certificate image.
Step 102:The reference certificate text message of same type is transferred from far-end server according to the certificate type.
Step 103:Extract the character format with reference to certificate text message.
Step 104:Slit mode is determined according to the character format.
Step 105:The certificate image, which is cut, using the slit mode obtains character picture and non-character image.
Step 106:Identify that the character picture obtains editable digital certificate text message.
The specific embodiment of the invention also provides a kind of computer storage media for including computer executed instructions, the calculating When machine execute instruction is handled via data processing equipment, which performs the extracting method of certificate pictograph.Side Method includes the following steps:
Step 101:Certificate type is determined according to certificate image.
Step 102:The reference certificate text message of same type is transferred from far-end server according to the certificate type.
Step 103:Extract the character format with reference to certificate text message.
Step 104:Slit mode is determined according to the character format.
Step 105:The certificate image, which is cut, using the slit mode obtains character picture and non-character image.
Step 106:Identify that the character picture obtains editable digital certificate text message.
Step 107:Utilize editable digital certificate text message described in the character format editor.
Step 108:Contained according to the non-character image and the edited editable digital certificate text message recovery There is the certificate image of the editable digital certificate text message.
The specific embodiment of the invention provides a kind of computer storage media for including computer executed instructions, the computer When execute instruction is handled via data processing equipment, which performs the extracting method of certificate pictograph.Method Include the following steps:
Step 1011:Extract the main feature of the certificate image.
Step 1012:The certificate type is determined according to the main feature.
Step 102:The reference certificate text message of same type is transferred from far-end server according to the certificate type.
Step 103:Extract the character format with reference to certificate text message.
Step 104:Slit mode is determined according to the character format.
Step 105:The certificate image, which is cut, using the slit mode obtains character picture and non-character image.
The specific embodiment of the invention provides a kind of extracting method of certificate pictograph, apparatus and system, computer storage Medium determines certificate type according to certificate image first, and same type reference is transferred from far-end server further according to certificate type The text message (Chinese, English, number etc.) of certificate image extracts the character format of text message, then according to character format It determines slit mode, slit mode is recycled to carry out cutting to certificate image, finally identify the character picture after cutting.The present invention The character of different fonts form in certificate image can be effectively cut, and not need to the complicated neural network model of construction, word It accords with discrimination and reliability is high, adapt to the needs of OCR character recognition technologies.
The above-mentioned embodiment of the present invention can be implemented in various hardware, Software Coding or both combination.For example, this hair Bright embodiment, which is alternatively in data signal processor (Digital Signal Processor, DSP), performs the above method Program code.The present invention can also refer to computer processor, digital signal processor, microprocessor or field-programmable gate array Arrange the multiple functions that (Field Programmable Gate Array, FPGA) is performed.Can above-mentioned processing be configured according to the present invention Device performs particular task, and the machine-readable software code of ad hoc approach or the firmware generation that the present invention discloses are defined by performing Code is completed.Software code or firmware code can be developed into different program languages and different forms or form.Or Different target platform composing software codes.However, in generation, is configured according to the software code of execution task of the present invention and other types Different code pattern, type and the language of code do not depart from spirit and scope of the invention.
The foregoing is merely the schematical specific embodiment of the present invention, before the design of the present invention and principle is not departed from It puts, the equivalent variations and modification that any those skilled in the art is made should all belong to the scope of protection of the invention.

Claims (10)

1. a kind of extracting method of certificate pictograph, which is characterized in that this method includes:
Certificate type is determined according to certificate image;
The reference certificate text message of same type is transferred from far-end server according to the certificate type;
Extract the character format with reference to certificate text message;
Slit mode is determined according to the character format;And
The certificate image, which is cut, using the slit mode obtains character picture and non-character image.
2. the extracting method of certificate pictograph as described in claim 1, which is characterized in that cut using the slit mode After the step of certificate image, this method further includes:
Identify that the character picture obtains editable digital certificate text message.
3. the extracting method of certificate pictograph as claimed in claim 2, which is characterized in that identify that the character picture obtains After the step of editable digital certificate text message, this method further includes:
Utilize editable digital certificate text message described in the character format editor;And
Restore to contain the editable according to the non-character image and the edited editable digital certificate text message The certificate image of digital certificate text message.
4. the extracting method of certificate pictograph as described in claim 1, which is characterized in that certificate is determined according to certificate image The step of type, specifically includes:
Extract the main feature of the certificate image;And
The certificate type is determined according to the main feature.
5. a kind of extraction element of certificate pictograph, which is characterized in that the device includes:
First determination unit, for determining certificate type according to certificate image;
Unit is transferred, for transferring the reference certificate text message of same type from far-end server according to the certificate type;
Extraction unit, for extracting the character format with reference to certificate text message;
Second determination unit, for determining slit mode according to the character format;And
Cutting unit obtains character picture and non-character image for cutting the certificate image using the slit mode.
6. the extraction element of certificate pictograph as claimed in claim 5, which is characterized in that the device further includes:
Recognition unit, for identifying that the character picture obtains editable digital certificate text message.
7. the extraction element of certificate pictograph as claimed in claim 6, which is characterized in that the device further includes:
Edit cell, for utilizing editable digital certificate text message described in the character format editor;And
Recovery unit, for being contained according to the non-character image and the edited editable digital certificate text message recovery There is the certificate image of the editable digital certificate text message.
8. the extraction element of certificate pictograph as claimed in claim 5, which is characterized in that first determination unit is specific Including:
Extraction module, for extracting the main feature of the certificate image;And
Determining module, for determining the certificate type according to the main feature.
9. a kind of extraction system of certificate pictograph, which is characterized in that the system includes:Multiple such as claims 5~8 are any The extraction element and the far-end server being connect with the extraction element, wherein,
The extraction element is used to determine certificate type according to the certificate image;
The far-end server is used for the reference provided according to the certificate type of the certificate image to the offer device Certificate text message;
The extraction element is additionally operable to extract the character format with reference to certificate text message, and true according to the character format Slit mode is determined, to cut the certificate image using the slit mode.
10. a kind of computer storage media for including computer executed instructions, the computer executed instructions are via data processing During equipment processing, the extracting method of 1~4 any certificate pictograph of data processing equipment perform claim requirement.
CN201810104851.9A 2018-02-02 2018-02-02 The extracting method of certificate pictograph, apparatus and system, computer storage media Pending CN108171239A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810104851.9A CN108171239A (en) 2018-02-02 2018-02-02 The extracting method of certificate pictograph, apparatus and system, computer storage media

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810104851.9A CN108171239A (en) 2018-02-02 2018-02-02 The extracting method of certificate pictograph, apparatus and system, computer storage media

Publications (1)

Publication Number Publication Date
CN108171239A true CN108171239A (en) 2018-06-15

Family

ID=62513061

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810104851.9A Pending CN108171239A (en) 2018-02-02 2018-02-02 The extracting method of certificate pictograph, apparatus and system, computer storage media

Country Status (1)

Country Link
CN (1) CN108171239A (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110442744A (en) * 2019-08-09 2019-11-12 泰康保险集团股份有限公司 Extract method, apparatus, electronic equipment and the readable medium of target information in image
CN111160395A (en) * 2019-12-05 2020-05-15 北京三快在线科技有限公司 Image recognition method and device, electronic equipment and storage medium
WO2020113561A1 (en) * 2018-12-07 2020-06-11 华为技术有限公司 Method for extracting structural data from image, apparatus and device
CN111405191A (en) * 2020-04-24 2020-07-10 Oppo(重庆)智能科技有限公司 Image management method, device, terminal and storage medium
CN112686237A (en) * 2020-12-21 2021-04-20 福建新大陆软件工程有限公司 Certificate OCR recognition method

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH10272874A (en) * 1997-03-31 1998-10-13 Toppan Forms Co Ltd Cut form with variable information and method for forming the information
CN102402684A (en) * 2010-09-15 2012-04-04 富士通株式会社 Method and device for determining type of certificate and method and device for translating certificate
CN103885723A (en) * 2014-03-04 2014-06-25 广东数字证书认证中心有限公司 Digital certificate storage method, digital certificate storage system, digital certificate reading method and digital certificate reading system
US20140219561A1 (en) * 2013-02-06 2014-08-07 Nidec Sankyo Corporation Character segmentation device and character segmentation method
CN104079587A (en) * 2014-07-21 2014-10-01 深圳天祥质量技术服务有限公司 Certificate identification device and certificate check system
CN106886776A (en) * 2017-02-23 2017-06-23 山东浪潮云服务信息科技有限公司 The application model of license electronization is realized in a kind of utilization image recognition

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH10272874A (en) * 1997-03-31 1998-10-13 Toppan Forms Co Ltd Cut form with variable information and method for forming the information
CN102402684A (en) * 2010-09-15 2012-04-04 富士通株式会社 Method and device for determining type of certificate and method and device for translating certificate
US20140219561A1 (en) * 2013-02-06 2014-08-07 Nidec Sankyo Corporation Character segmentation device and character segmentation method
CN103885723A (en) * 2014-03-04 2014-06-25 广东数字证书认证中心有限公司 Digital certificate storage method, digital certificate storage system, digital certificate reading method and digital certificate reading system
CN104079587A (en) * 2014-07-21 2014-10-01 深圳天祥质量技术服务有限公司 Certificate identification device and certificate check system
CN106886776A (en) * 2017-02-23 2017-06-23 山东浪潮云服务信息科技有限公司 The application model of license electronization is realized in a kind of utilization image recognition

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2020113561A1 (en) * 2018-12-07 2020-06-11 华为技术有限公司 Method for extracting structural data from image, apparatus and device
CN111615702A (en) * 2018-12-07 2020-09-01 华为技术有限公司 Method, device and equipment for extracting structured data from image
CN111615702B (en) * 2018-12-07 2023-10-17 华为云计算技术有限公司 Method, device and equipment for extracting structured data from image
CN110442744A (en) * 2019-08-09 2019-11-12 泰康保险集团股份有限公司 Extract method, apparatus, electronic equipment and the readable medium of target information in image
CN110442744B (en) * 2019-08-09 2022-11-04 泰康保险集团股份有限公司 Method and device for extracting target information in image, electronic equipment and readable medium
CN111160395A (en) * 2019-12-05 2020-05-15 北京三快在线科技有限公司 Image recognition method and device, electronic equipment and storage medium
WO2021110174A1 (en) * 2019-12-05 2021-06-10 北京三快在线科技有限公司 Image recognition method and device, electronic device, and storage medium
CN111405191A (en) * 2020-04-24 2020-07-10 Oppo(重庆)智能科技有限公司 Image management method, device, terminal and storage medium
CN112686237A (en) * 2020-12-21 2021-04-20 福建新大陆软件工程有限公司 Certificate OCR recognition method

Similar Documents

Publication Publication Date Title
CN108171239A (en) The extracting method of certificate pictograph, apparatus and system, computer storage media
CN109308476B (en) Billing information processing method, system and computer readable storage medium
CN111046784A (en) Document layout analysis and identification method and device, electronic equipment and storage medium
CN110442744A (en) Extract method, apparatus, electronic equipment and the readable medium of target information in image
JP4993319B2 (en) Apparatus and method for supporting verification of software internationalization
CN114299528B (en) Information extraction and structuring method for scanned document
CN108090445A (en) The electronics of a kind of papery operation or paper corrects method
CN108090400A (en) A kind of method and apparatus of image text identification
CN111652232A (en) Bill identification method and device, electronic equipment and computer readable storage medium
CN112508011A (en) OCR (optical character recognition) method and device based on neural network
CN111062791A (en) Method, device and equipment for reimbursing and filling bill
CN106980857B (en) Chinese calligraphy segmentation and recognition method based on copybook
CN110516664A (en) Bank slip recognition method, apparatus, electronic equipment and storage medium
CN109271951A (en) A kind of method and system promoting book keeping operation review efficiency
CN110210470A (en) Merchandise news image identification system
CN110458014A (en) Answering card reading method, device and computer readable storage medium
JP2019079347A (en) Character estimation system, character estimation method, and character estimation program
CN109992752A (en) Label labeling method, device, computer installation and the storage medium of contract documents
CN112668580A (en) Text recognition method, text recognition device and terminal equipment
CN109726369A (en) A kind of intelligent template questions record Implementation Technology based on normative document
Elanwar et al. Extracting text from scanned Arabic books: a large-scale benchmark dataset and a fine-tuned Faster-R-CNN model
CN111144445A (en) Error detection method and system for printing book and periodical writing format and electronic equipment
CN115130437B (en) Intelligent document filling method and device and storage medium
CN115690819A (en) Big data-based identification method and system
Kumar et al. Line based robust script identification for indianlanguages

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20180615