CN109934227A - System for recognizing characters from image and method - Google Patents

System for recognizing characters from image and method

Publication number
CN109934227A
Authority
CN
China
Prior art keywords
text
model
image
annotation results
detection
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910191038.4A
Other languages
Chinese (zh)
Inventor
周钊
郑莹斌
叶浩
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Chengguan Information Technology Co Ltd
Original Assignee
Shanghai Chengguan Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Chengguan Information Technology Co Ltd
Priority to CN201910191038.4A
Publication of CN109934227A

Abstract

An image text recognition system comprising a detection model and/or a recognition model. The detection model detects the positions of the regions containing text in an image; the recognition model extracts the text content from the image. An operation module calls the detection model and/or the recognition model to detect and recognize the text in the image, producing preliminary annotation results. A sample selection module selects some or all of the samples in the preliminary annotation results. An annotation correction module corrects the annotation results selected by the sample selection module to obtain refined annotation results, which are used to further train and optimize the detection model and/or the recognition model.

Description

System for recognizing characters from image and method
Technical field
The invention belongs to the technical field of image processing, and in particular relates to a system and method for recognizing text in images.
Background art
Intelligent image text understanding refers to using artificial intelligence methods and models to process and analyze an image and to determine the specific position and content of the text in it. Identifying where text appears in an image and what it says helps in understanding the image as a whole, and provides a basis for abstracting the image content into a structured form and for identifying key information, which solves practical problems. Examples include sensitive text recognition and intelligent OCR document recognition.
Intelligent image text understanding may comprise two steps:
1. Extracting the regions of the image that contain text; this process is also called image text detection.
2. Extracting the text content from a local region of the input image or from the whole image; this process is also called image text recognition.
Intelligent understanding of image text may use either one of the two steps above, or both in combination (image text detection first, followed by image text recognition).
Existing general methods for intelligent image text understanding usually train an image text detection model and/or an image text recognition model on a dataset whose text has been fully annotated. These methods rely mainly on deep neural network algorithms, which generally need a large number of data samples for model training. For image text detection and recognition specifically, this means that the text positions or text contents must be annotated for a massive number of images containing text.
Because existing general methods typically annotate all of the data directly and then train the model on the annotated content, and because an image containing text often has one or more text regions, the annotation workload is large; moreover, many of the annotations may be repetitive, so efficiency is low.
Summary of the invention
Embodiments of the present invention provide a system and method for recognizing text in images. By using an intelligent hybrid annotation method, they address the following problems in current intelligent text understanding: text detection models and text recognition models trained on only partially annotated samples are not accurate enough, new training data is difficult to annotate, and manual annotation is costly.
An image text recognition method according to one embodiment of the present invention performs intelligent image text understanding based on human-machine hybrid annotation through the following steps:
Call the basic text detection and recognition models to obtain the text regions and text contents in the images, and take these results as the preliminary annotation results;
Use an automatic sample selection algorithm to select some or all of the samples, which are then corrected manually. Manual correction may include: adjusting detection results that contain errors, correcting recognized content, filling in missing annotations, and deleting erroneous results;
Take the refined annotation results and use them to further train the detection model and/or the recognition model, improving model accuracy;
Repeat the above steps until a result that meets the requirements is obtained.
With the method described in the embodiments of the present invention, only part of the text content needs to be annotated during image text detection and recognition in order to support image text understanding, which reduces the data annotation and construction cost of an intelligent image text understanding system. After one or more rounds of the above human-machine hybrid annotation and model training, the image text recognition system obtains better text understanding results. The manual annotation may also be completed automatically by a programmed system.
The present invention can annotate new training data through automatic or human-machine hybrid annotation to optimize the text detection model and the text recognition model, iteratively improving model accuracy. In particular, when new data is annotated, an operation module composed of the existing text detection and/or text recognition models can be called to produce preliminary annotations in advance. This reduces repeated annotation of similar image data, greatly reduces the manual annotation workload required, and allows an intelligent image text understanding model that meets the requirements to be built quickly.
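The steps above can be sketched as a simple loop. This is only a minimal illustration under stated assumptions, not the patented implementation: the function name `hybrid_annotation_loop`, its callback parameters, the `max_rounds` cap and the toy lookup-table "model" are all hypothetical and do not appear in the source.

```python
from typing import Callable, Dict, List

Annotation = Dict[str, object]  # hypothetical record: image id, text, confidence


def hybrid_annotation_loop(
    images: List[str],
    annotate: Callable[[str], Annotation],
    select: Callable[[List[Annotation]], List[Annotation]],
    correct: Callable[[Annotation], Annotation],
    retrain: Callable[[List[Annotation]], None],
    meets_requirements: Callable[[], bool],
    max_rounds: int = 10,
) -> int:
    """Run annotate -> select -> correct -> retrain rounds; return rounds used."""
    for round_index in range(1, max_rounds + 1):
        preliminary = [annotate(image) for image in images]  # preliminary annotation
        selected = select(preliminary)                        # automatic sample selection
        refined = [correct(item) for item in selected]        # manual correction
        retrain(refined)                                      # further model training
        if meets_requirements():
            return round_index
    return max_rounds


# Toy stand-ins (hypothetical): the "model" is a lookup table that only
# knows the labels it has already been retrained on.
TRUTH = {"img1": "INVOICE", "img2": "TOTAL", "img3": "DATE"}
learned: Dict[str, str] = {}


def annotate(image: str) -> Annotation:
    known = image in learned
    return {"image": image, "text": learned.get(image, "?"),
            "confidence": 1.0 if known else 0.2}


def select(annotations: List[Annotation]) -> List[Annotation]:
    # Send at most two low-confidence samples per round to the annotator.
    return [a for a in annotations if a["confidence"] < 0.5][:2]


def correct(annotation: Annotation) -> Annotation:
    # Stands in for the human annotator, who supplies the true text.
    return {**annotation, "text": TRUTH[annotation["image"]], "confidence": 1.0}


def retrain(refined: List[Annotation]) -> None:
    for item in refined:
        learned[item["image"]] = item["text"]


def meets_requirements() -> bool:
    return learned == TRUTH


rounds = hybrid_annotation_loop(list(TRUTH), annotate, select, correct,
                                retrain, meets_requirements)
```

In this toy run only a subset of samples is corrected in each round, which mirrors the idea that the annotator reviews selected samples rather than the whole dataset.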
Brief description of the drawings
The above and other objects, features and advantages of exemplary embodiments of the present invention will become easier to understand by reading the following detailed description with reference to the accompanying drawings. In the drawings, several embodiments of the present invention are shown by way of example and not limitation, in which:
Fig. 1 is a flow diagram of a method for recognizing text contained in an image according to one embodiment of the present invention.
Fig. 2 is a flow diagram of a method for recognizing text contained in an image according to one embodiment of the present invention.
Fig. 3 is a flow diagram of a method for recognizing text contained in an image according to one embodiment of the present invention.
Fig. 4 is a flow diagram of a system for recognizing text contained in an image according to one embodiment of the present invention.
Fig. 5 is a flow diagram of a system for recognizing text contained in an image according to one embodiment of the present invention.
Fig. 6 is a flow diagram of a system for recognizing text contained in an image according to one embodiment of the present invention.
Detailed description of the embodiments
According to one or more embodiments, as shown in Fig. 1, an image text recognition method for recognizing the text information contained in an image comprises the following steps:
S101: train a basic text detection model and a basic text recognition model on an existing or already annotated image text dataset. The text detection model here may be any kind of image text detection model, including but not limited to: models for standard rectangular box annotation (such as Connectionist Text Proposal Network), text detection models for text with a certain rotation angle (such as RRPN, EAST, DMP-Net), and text detection models for text of arbitrary shape (such as Total-Text, TextSnake, etc.). The text recognition model may be any kind of image text recognition model, including but not limited to: Attention-based deep network models, CTC-based deep network models, etc. If the preliminary models are trained on an existing image dataset whose text content has been annotated, image text understanding can be achieved using only the part of the text content that has already been annotated, which reduces the data annotation cost of the image text system. The annotated text content may come from text detection or recognition datasets published by the academic community, or from text regions or contents annotated by hand by the implementation team.
S102: combine the text detection model and the text recognition model into an operation module. For an input image, the operation module first calls the text detection model to compute the regions where text is located; each such region, after a certain geometric transformation (for example, straightening a tilted or curved region), is then used as the input of the text recognition model, so as to obtain the positions and contents of the text regions in the image;
S103: call the operation module to perform text detection and recognition on new image data, and pass the detection and recognition results to the subsequent steps of the system as the preliminary annotation results;
S104: call the automatic sample selection algorithm to select all or some of the samples in the preliminary annotation results. Automatic sample selection algorithms include but are not limited to: sample selection algorithms based on a sample confidence threshold, and sample selection algorithms based on the sample confidence distribution;
S105: manually correct the detection results and the recognition results respectively: revise and fill in all text boxes in the image, and correct the text content corresponding to each text box, to obtain the refined annotation results;
S106: further train the detection model and the recognition model with the refined annotation data, tuning the text detection model and the text recognition model at the same time. Model training may use the models used in step S101, or other models that support incremental learning;
S107: replace the basic models with the optimized detection model and recognition model, integrate them into the operation module, and thereby build a better optimized image text understanding and recognition system;
S108: return to step S103 for the next round of iterative optimization, until a text detection model and a text recognition model that meet the requirements are obtained. The criteria for meeting the requirements include but are not limited to one or more of the following: the accuracy on the test set or validation set exceeds a preset value; the recall on the test set or validation set exceeds a preset value; the number of model parameters is below a preset value; the running time of the model on the test set or validation set is below a preset value; and so on.
This embodiment provides a complete image text detection and recognition method that can be applied to various OCR-related application scenarios.
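The stopping criteria listed in S108 can be expressed as a small check. The sketch below is an assumption about how such a check might look; the class names `Metrics` and `Requirements`, their fields and the example numbers are hypothetical, and a criterion left as `None` is simply not applied.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Metrics:
    """Measurements of a model on the test or validation set (hypothetical)."""
    accuracy: float
    recall: float
    num_parameters: int
    runtime_seconds: float


@dataclass
class Requirements:
    """Preset values; None means the criterion is not used."""
    min_accuracy: Optional[float] = None
    min_recall: Optional[float] = None
    max_parameters: Optional[int] = None
    max_runtime_seconds: Optional[float] = None


def meets_requirements(metrics: Metrics, req: Requirements) -> bool:
    """Return True when every configured criterion is satisfied."""
    checks = [
        req.min_accuracy is None or metrics.accuracy > req.min_accuracy,
        req.min_recall is None or metrics.recall > req.min_recall,
        req.max_parameters is None or metrics.num_parameters < req.max_parameters,
        req.max_runtime_seconds is None
        or metrics.runtime_seconds < req.max_runtime_seconds,
    ]
    return all(checks)


# Example values are made up for illustration only.
current = Metrics(accuracy=0.93, recall=0.88, num_parameters=20_000_000,
                  runtime_seconds=41.5)
target = Requirements(min_accuracy=0.90, min_recall=0.90)
keep_iterating = not meets_requirements(current, target)
```

Here the recall criterion is not yet met, so another round starting from S103 would be run.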
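The two families of selection algorithm named in S104 can be illustrated as follows. The source does not specify how confidence is used, so this sketch assumes that low-confidence samples are the ones sent for manual correction; the function names, the `confidence` field and the `fraction` parameter are hypothetical.

```python
import math
from typing import Dict, List

Sample = Dict[str, object]  # hypothetical record with "id" and "confidence"


def select_by_threshold(samples: List[Sample], threshold: float) -> List[Sample]:
    """Threshold-based selection: keep samples whose confidence is below threshold."""
    return [s for s in samples if s["confidence"] < threshold]


def select_by_distribution(samples: List[Sample], fraction: float) -> List[Sample]:
    """Distribution-based selection: keep the least confident fraction of samples."""
    if not 0 < fraction <= 1:
        raise ValueError("fraction must be in (0, 1]")
    ranked = sorted(samples, key=lambda s: s["confidence"])
    count = math.ceil(fraction * len(ranked))
    return ranked[:count]


samples = [
    {"id": "a", "confidence": 0.95},
    {"id": "b", "confidence": 0.40},
    {"id": "c", "confidence": 0.72},
    {"id": "d", "confidence": 0.10},
]
below_half = select_by_threshold(samples, 0.5)
lowest_quarter = select_by_distribution(samples, 0.25)
```

A fixed threshold selects a varying number of samples per round, while a fraction of the ranked distribution keeps the manual workload per round roughly constant; which trade-off is preferable depends on the annotation budget.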
According to one or more embodiments, in some application scenarios only the text regions in the image need to be detected, and text recognition is not directly required, for example in applications that determine whether copy or subtitles have been inserted into an image. This embodiment optimizes the image text detection model; its flow chart is shown in Fig. 2. The steps are:
S201: train a basic text detection model on an existing or already annotated image text dataset, and use it as the operation module;
S202: call the operation module to perform text detection on newly input image data, and use the detection results as the preliminary annotation results, which serve as the input to subsequent processing;
S203: call the automatic sample selection algorithm to select all or some of the samples in the preliminary annotation results;
S204: manually correct the preliminary detection results, including: adjusting text boxes that deviate from the text, deleting annotation boxes that are not text regions, filling in missing annotation boxes, etc., to obtain the refined annotation results;
S205: further train the detection model with the refined annotation data;
S206: integrate the optimized detection model into the operation module;
S207: return to step S202 for the next round of iterative optimization, until a text detection model that meets the requirements is obtained.
This embodiment optimizes the image text detection method, and its effect is more prominent in certain applications. One example is determining whether copy or subtitles have been inserted into an image. Another is applications in which, for lack of data samples, no suitable or usable text recognition model exists (such as recognition of ancient scripts or of foreign-language text): the computer performs text region detection automatically, and the regions are then handed to domain experts for manual text recognition.
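The three kinds of manual correction in S204 (adjusting a box, deleting a non-text box, adding a missing box) can be modelled as edits applied to the preliminary boxes. This is a minimal sketch; the box format, the correction tuples and the function name `apply_box_corrections` are assumptions made for illustration.

```python
from typing import Dict, List, Tuple

Box = Tuple[int, int, int, int]  # hypothetical format: (left, top, right, bottom)


def apply_box_corrections(
    boxes: Dict[str, Box], corrections: List[tuple]
) -> Dict[str, Box]:
    """Return refined boxes; the preliminary boxes are left unmodified."""
    refined = dict(boxes)
    for correction in corrections:
        action, box_id = correction[0], correction[1]
        if action == "adjust":      # fix a box that deviates from the text
            if box_id not in refined:
                raise KeyError(f"cannot adjust unknown box {box_id!r}")
            refined[box_id] = correction[2]
        elif action == "delete":    # remove a box that is not a text region
            del refined[box_id]
        elif action == "add":       # fill in a box the detector missed
            if box_id in refined:
                raise KeyError(f"box {box_id!r} already exists")
            refined[box_id] = correction[2]
        else:
            raise ValueError(f"unknown action {action!r}")
    return refined


preliminary = {"b1": (10, 10, 100, 30), "b2": (5, 200, 60, 220)}
corrections = [
    ("adjust", "b1", (8, 10, 104, 32)),
    ("delete", "b2"),
    ("add", "b3", (20, 50, 90, 70)),
]
refined = apply_box_corrections(preliminary, corrections)
```

The refined boxes would then serve as the training data for the further training of the detection model in S205.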
According to one or more embodiments, as shown in Fig. 3, an image text recognition method comprises the following steps:
S301: train a basic text detection model and a basic text recognition model on an existing or already annotated image text dataset, and combine the detection model and the recognition model into an operation module;
S302: call the operation module to perform text detection and recognition on new image data, and use the recognition results as the preliminary annotation results and as the input to the subsequent steps of the system;
S303: call the automatic sample selection algorithm to select all or some of the samples in the preliminary annotation results;
S304: manually correct the preliminary recognition results to obtain the refined annotation results;
S305: further train the recognition model with the refined annotation data;
S306: integrate the optimized recognition model into the operation module;
S307: return to step S302 for the next round of iterative optimization, until a text recognition model that meets the requirements is obtained.
This embodiment further optimizes the text recognition model, iterating only on the image text recognition model. It targets application scenarios such as bills with certain fixed formats, where the positions of the text regions can be obtained directly by template matching, so only the recognition model needs to be updated.
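For the fixed-format bill scenario, the idea that region positions come from a template rather than from a detection model can be sketched as below. The example is deliberately simplified and rests on several assumptions: the "image" is a list of text rows, the template is assumed to be already aligned with the bill, and the names `TEMPLATE`, `read_fields` and the field names are hypothetical.

```python
from typing import Callable, Dict, List, Tuple

Region = Tuple[int, int, int]  # hypothetical: (row, start column, end column)

# Hypothetical template for one fixed bill layout: field name -> region.
TEMPLATE: Dict[str, Region] = {
    "invoice_no": (0, 12, 20),
    "total": (2, 7, 13),
}


def read_fields(
    image: List[str],
    template: Dict[str, Region],
    recognizer: Callable[[str], str],
) -> Dict[str, str]:
    """Crop each templated region and pass it to the recognition model only."""
    fields = {}
    for name, (row, start, end) in template.items():
        crop = image[row][start:end]     # no detection model is involved
        fields[name] = recognizer(crop)
    return fields


# Toy "image" made of text rows; str.strip stands in for a recognition model.
bill = ["Invoice No: A1234567", "", "Total: 199.00"]
fields = read_fields(bill, TEMPLATE, str.strip)
```

Because the regions are fixed by the template, only the recognizer passed to `read_fields` would need to be replaced after each training round.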
According to one or more embodiments, as shown in Fig. 4, an image text recognition system comprises a detection model and/or a recognition model. The detection model is used to detect the positions of the regions where text is located in an image; the recognition model is used to extract the text information in the image. The system further comprises:
An operation module, which calls the detection model and/or the recognition model to detect and recognize the text in the image, obtaining preliminary annotation results;
A sample selection module, which selects some or all of the samples in the preliminary annotation results;
An annotation correction module, which corrects the annotation results selected by the sample selection module to obtain refined annotation results; the refined annotation results are used to further train and optimize the detection model and/or the recognition model.
This embodiment provides a complete image text detection and recognition system that can be applied to various OCR-related application scenarios.
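An operation module that works with a detection model, a recognition model, or both might be structured as in the sketch below. It is an illustration under stated assumptions rather than the system of the embodiment: the class `OperationModule`, the toy detector, the toy crop function (which stands in for cropping and geometric rectification of a region) and the text-row "image" are all hypothetical.

```python
from typing import Callable, Dict, List, Optional, Tuple

Region = Tuple[int, int, int]  # hypothetical: (row, start column, end column)
Image = List[str]              # toy "image": a list of text rows


class OperationModule:
    """Calls a detection model and/or a recognition model on an image."""

    def __init__(
        self,
        crop: Callable[[Image, Optional[Region]], str],
        detector: Optional[Callable[[Image], List[Region]]] = None,
        recognizer: Optional[Callable[[str], str]] = None,
    ) -> None:
        if detector is None and recognizer is None:
            raise ValueError("at least one of detector or recognizer is required")
        self.crop = crop
        self.detector = detector
        self.recognizer = recognizer

    def annotate(self, image: Image) -> List[Dict[str, object]]:
        """Return preliminary annotations: one record per text region."""
        # Without a detector, the whole image is treated as a single region.
        regions = self.detector(image) if self.detector else [None]
        results = []
        for region in regions:
            text = None
            if self.recognizer is not None:
                text = self.recognizer(self.crop(image, region))
            results.append({"region": region, "text": text})
        return results


def toy_detector(image: Image) -> List[Region]:
    """Report the non-blank span of every non-empty row as a text region."""
    regions = []
    for row, line in enumerate(image):
        if line.strip():
            start = len(line) - len(line.lstrip())
            regions.append((row, start, len(line.rstrip())))
    return regions


def toy_crop(image: Image, region: Optional[Region]) -> str:
    """Stand-in for cropping and rectifying a region before recognition."""
    if region is None:
        return "\n".join(image)
    row, start, end = region
    return image[row][start:end]


image = ["   TOTAL  ", "          ", " 42.00"]
both = OperationModule(toy_crop, toy_detector, str.upper).annotate(image)
detect_only = OperationModule(toy_crop, detector=toy_detector).annotate(image)
```

Making both models optional lets the same module serve the detection-only, recognition-only and combined variants of the system.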
According to one or more embodiments, as shown in Fig. 5, an image text recognition system comprises: a detection model, used to detect the positions of the regions where text is located in an image; an operation module, which calls the detection model to detect the text in the image, obtaining preliminary annotation results; a sample selection module, which selects some or all of the samples in the preliminary annotation results; and an annotation correction module, which corrects the annotation results selected by the sample selection module to obtain refined annotation results, the refined annotation results being used to further train and optimize the detection model.
The image text recognition system of this embodiment optimizes the image text detection model, and its effect is more prominent in certain applications. One example is determining whether copy or subtitles have been inserted into an image. Another is applications in which, for lack of data samples, no suitable or usable text recognition model exists (such as recognition of ancient scripts or of foreign-language text): the computer performs text region detection automatically, and the regions are then handed to domain experts for manual text recognition.
According to one or more embodiments, as shown in Fig. 6, an image text recognition system comprises a detection model and/or a recognition model. The detection model is used to detect the positions of the regions where text is located in an image; the recognition model is used to extract the text information in the image. The system further comprises:
An operation module, which calls the detection model and/or the recognition model to detect and recognize the text in the image, obtaining preliminary annotation results;
A sample selection module, which selects some or all of the samples in the preliminary annotation results;
An annotation correction module, which corrects the annotation results selected by the sample selection module to obtain refined annotation results; the refined annotation results are used to further train and optimize the recognition model.
The image text recognition system of this embodiment further optimizes the text recognition model, iterating only on the image text recognition model. It targets application scenarios such as bills with certain fixed formats, where the positions of the text regions can be obtained directly by template matching, so only the recognition model needs to be updated.
According to one or more embodiments, a human-machine hybrid annotation system for intelligent image text understanding is provided, where intelligent text understanding comprises two main functions: image text region detection and image text recognition. The system uses an initially trained basic text detection model and basic text recognition model to perform preliminary text detection and/or recognition on the images to be annotated. Based on the preliminary output, the system selects some or all of the samples using an automatic selection algorithm; the detection and/or recognition results are then corrected through manual proofreading, giving the images a further, refined annotation. Finally, the refined annotation results are used to train the basic text detection and/or recognition models, improving the accuracy of the detection model and the recognition model. After one or more rounds of the above human-machine hybrid annotation and model training, the system obtains better text understanding results.
According to one or more embodiments, an image text recognition network platform comprises a server, the server having a memory; and
a processor coupled to the memory, the processor being configured to execute instructions stored in the memory, whereby the processor performs the following operations:
Train a text detection model and a text recognition model on an existing or already annotated image text dataset;
Combine the text detection model and the text recognition model into an operation module;
Call the operation module to perform text detection and recognition on image data, and use the detection and recognition results as preliminary annotation results;
Call the automatic sample selection algorithm to select all or some of the samples in the preliminary annotation results;
Correct the detection results and the recognition results respectively: revise and fill in all text boxes in the image, and correct the text content corresponding to each text box, to obtain refined annotation results;
Further train the text detection model and the text recognition model with the refined annotation data, tuning the text detection model and the text recognition model at the same time;
Replace the original text detection model and text recognition model with the optimized text detection model and text recognition model, building a better optimized image text understanding and recognition system.
According to one or more embodiments, an image text recognition server has a memory; and a processor coupled to the memory, the processor being configured to execute instructions stored in the memory, whereby the processor performs the following operations:
Train a text detection model and a text recognition model on an existing or already annotated image text dataset;
Combine the text detection model and the text recognition model into an operation module;
Call the operation module to perform text detection and recognition on image data, and use the detection and recognition results as preliminary annotation results;
Call the automatic sample selection algorithm to select all or some of the samples in the preliminary annotation results;
Correct the detection results and the recognition results respectively: revise and fill in all text boxes in the image, and correct the text content corresponding to each text box, to obtain refined annotation results;
Further train the text detection model and the text recognition model with the refined annotation data, tuning the text detection model and the text recognition model at the same time;
Replace the original text detection model and text recognition model with the optimized text detection model and text recognition model, building a better optimized image text understanding and recognition system.
It should be understood that, in the embodiments of the present invention, the term "and/or" merely describes an association between associated objects and indicates that three relationships may exist. For example, A and/or B may mean: A alone, both A and B, or B alone. In addition, the character "/" herein generally indicates that the associated objects before and after it are in an "or" relationship.
Those of ordinary skill in the art will appreciate that the units and algorithm steps of the examples described in connection with the embodiments disclosed herein can be implemented in electronic hardware, computer software, or a combination of the two. To clearly illustrate the interchangeability of hardware and software, the composition and steps of each example have been described above in general terms according to their functions. Whether these functions are implemented in hardware or software depends on the specific application and the design constraints of the technical solution. Skilled practitioners may use different methods to implement the described functions for each specific application, but such implementations should not be considered beyond the scope of the present invention.
In the several embodiments provided in this application, it should be understood that the disclosed systems, devices and methods may be implemented in other ways. For example, the device embodiments described above are merely illustrative: the division into units is only a division by logical function, and other divisions are possible in an actual implementation; for instance, multiple units or components may be combined or integrated into another system, or some features may be omitted or not executed. In addition, the mutual coupling, direct coupling or communication connection shown or discussed may be an indirect coupling or communication connection through some interfaces, devices or units, and may also be an electrical, mechanical or other form of connection.
The units described as separate components may or may not be physically separate, and the components shown as units may or may not be physical units; that is, they may be located in one place or distributed over multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the embodiments of the present invention.
In addition, the functional units in the various embodiments of the present invention may be integrated into one processing unit, each unit may exist physically on its own, or two or more units may be integrated into one unit. The integrated unit may be implemented in the form of hardware or in the form of a software functional unit.
If the integrated unit is implemented in the form of a software functional unit and sold or used as an independent product, it may be stored in a computer-readable storage medium. Based on this understanding, the technical solution of the present invention in essence, or the part that contributes to the prior art, or all or part of the technical solution, may be embodied in the form of a software product. The computer software product is stored in a storage medium and includes a number of instructions that cause a computer device (which may be a personal computer, a server, a network device, etc.) to execute all or some of the steps of the methods of the embodiments of the present invention. The aforementioned storage medium includes various media that can store program code, such as a USB flash drive, a removable hard disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), a magnetic disk or an optical disc.
The above is merely a specific embodiment of the present invention, but the scope of protection of the present invention is not limited thereto. Any person skilled in the art can readily conceive of various equivalent modifications or replacements within the technical scope disclosed by the present invention, and these modifications or replacements shall be covered by the scope of protection of the present invention. Therefore, the scope of protection of the present invention shall be subject to the scope of protection of the claims.

Claims (9)

1. An image text recognition system, the system comprising: a detection model and/or a recognition model,
the detection model being used to detect the positions of the regions where text is located in an image;
the recognition model being used to extract the text information in the image; and
an operation module, which calls the detection model and/or the recognition model to detect and recognize the text in the image, obtaining preliminary annotation results;
a sample selection module, which selects some or all of the samples in the preliminary annotation results;
an annotation correction module, which corrects the annotation results selected by the sample selection module to obtain refined annotation results, the refined annotation results being used to further train and optimize the detection model and/or the recognition model.
2. An image text recognition system, the system comprising:
a detection model, used to detect the positions of the regions where text is located in an image;
an operation module, which calls the detection model to detect the text in the image, obtaining preliminary annotation results;
a sample selection module, which selects some or all of the samples in the preliminary annotation results;
an annotation correction module, which corrects the annotation results selected by the sample selection module to obtain refined annotation results, the refined annotation results being used to further train and optimize the detection model.
3. An image text recognition system, the system comprising: a detection model and/or a recognition model,
the detection model being used to detect the positions of the regions where text is located in an image;
the recognition model being used to extract the text information in the image; and
an operation module, which calls the detection model and/or the recognition model to detect and recognize the text in the image, obtaining preliminary annotation results;
a sample selection module, which selects some or all of the samples in the preliminary annotation results;
an annotation correction module, which corrects the annotation results selected by the sample selection module to obtain refined annotation results, the refined annotation results being used to further train and optimize the recognition model.
4. An image text recognition method, the method comprising the following steps:
training a text detection model and a text recognition model on an existing or already annotated image text dataset; combining the text detection model and the text recognition model into an operation module;
calling the operation module to perform text detection and recognition on image data, and using the detection and recognition results as preliminary annotation results;
calling an automatic sample selection algorithm to select all or some of the samples in the preliminary annotation results;
correcting the detection results and the recognition results respectively, revising and filling in all text boxes in the image, and correcting the text content corresponding to each text box, to obtain refined annotation results;
further training the text detection model and the text recognition model with the refined annotation data, tuning the text detection model and the text recognition model at the same time;
replacing the original text detection model and text recognition model with the optimized text detection model and text recognition model, building a better optimized image text understanding and recognition system.
5. An image text recognition method, the method comprising the following steps:
training a text detection model on an existing or already annotated image text dataset, and using it as an operation module;
calling the operation module to perform text detection on image data, and using the detection results as preliminary annotation results;
calling an automatic sample selection algorithm to select all or some of the samples in the preliminary annotation results;
correcting the preliminary detection results to obtain refined annotation results;
further training the detection model with the refined annotation data;
integrating the optimized detection model into the operation module.
6. An image text recognition method, the method comprising the following steps:
training a text detection model and a text recognition model on an existing or already annotated image text dataset, and combining the detection model and the recognition model into an operation module;
calling the operation module to perform text detection and recognition on image data, and using the recognition results as preliminary annotation results;
calling an automatic sample selection algorithm to select all or some of the samples in the preliminary annotation results;
correcting the preliminary recognition results to obtain refined annotation results;
further training the recognition model with the refined annotation data;
integrating the optimized recognition model into the operation module.
7. An image text recognition network platform, characterized in that the network platform comprises a server, the server having a memory; and
a processor coupled to the memory, the processor being configured to execute instructions stored in the memory, the processor performing the following operations:
training a text detection model and a text recognition model with an existing or previously annotated image text data set;
forming the text detection model and the text recognition model into an operation module;
calling the operation module to perform text detection and recognition on image data, and taking the detection and recognition results as preliminary annotation results;
calling a sample auto-selection algorithm to select all or some of the samples in the preliminary annotation results;
correcting the detection results and the recognition results respectively, revising and completing all text boxes in the image while correcting the text content corresponding to each text box, to obtain refined annotation results;
further training the text detection model and the text recognition model with the refined annotation data, while tuning both models;
replacing the original text detection model and text recognition model with the optimized models, thereby constructing a better-optimized image text understanding and recognition system.
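The correction operation in claim 7 (revising and completing text boxes while correcting their text) can be pictured as applying a set of fixes to the preliminary annotation. The sketch below is only one way to represent that step; `TextBox`, `Fix`, and `refine` are hypothetical names, and the claim does not prescribe this data layout.

```python
from dataclasses import dataclass, replace
from typing import Dict, List, Optional, Tuple

Box = Tuple[int, int, int, int]  # (x, y, width, height); assumed box format


@dataclass(frozen=True)
class TextBox:
    box: Box
    text: str


@dataclass(frozen=True)
class Fix:
    box: Optional[Box] = None   # None keeps the detected box
    text: Optional[str] = None  # None keeps the recognized text


def refine(
    preliminary: List[TextBox],
    fixes: Dict[int, Fix],
    missing: List[TextBox],
) -> List[TextBox]:
    """Apply per-box fixes, then append boxes the detector missed."""
    refined: List[TextBox] = []
    for index, item in enumerate(preliminary):
        fix = fixes.get(index)
        if fix is not None:
            item = replace(
                item,
                box=fix.box if fix.box is not None else item.box,
                text=fix.text if fix.text is not None else item.text,
            )
        refined.append(item)
    # Completing the annotation: add text boxes absent from the preliminary result.
    refined.extend(missing)
    return refined
```

Keeping box fixes and text fixes separate mirrors the claim's wording that detection results and recognition results are corrected respectively.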
8. An image text recognition server, the server having a memory; and
a processor coupled to the memory, the processor being configured to execute instructions stored in the memory, the processor performing the following operations:
training a text detection model and a text recognition model with an existing or previously annotated image text data set;
forming the text detection model and the text recognition model into an operation module;
calling the operation module to perform text detection and recognition on image data, and taking the detection and recognition results as preliminary annotation results;
calling a sample auto-selection algorithm to select all or some of the samples in the preliminary annotation results;
correcting the detection results and the recognition results respectively, revising and completing all text boxes in the image while correcting the text content corresponding to each text box, to obtain refined annotation results;
further training the text detection model and the text recognition model with the refined annotation data, while tuning both models;
replacing the original text detection model and text recognition model with the optimized models, thereby constructing a better-optimized image text understanding and recognition system.
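The operation module in claim 8 combines a detection model and a recognition model, and the final operation swaps in the optimized pair. A minimal sketch of that composition follows, assuming the two models can be treated as plain callables; `OperationModule`, `annotate`, and `replace_models` are hypothetical names rather than anything defined in the patent.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

Box = Tuple[int, int, int, int]          # (x, y, width, height); assumed box format
Detector = Callable[[str], List[Box]]    # image id -> detected text boxes
Recognizer = Callable[[str, Box], str]   # image id and box -> recognized text


@dataclass
class OperationModule:
    detector: Detector
    recognizer: Recognizer

    def annotate(self, image_id: str) -> List[Tuple[Box, str]]:
        # Detection runs first; recognition then reads the text inside each box.
        return [(box, self.recognizer(image_id, box)) for box in self.detector(image_id)]

    def replace_models(self, detector: Detector, recognizer: Recognizer) -> None:
        # The optimized models take the place of the original ones.
        self.detector = detector
        self.recognizer = recognizer
```

Because callers only use `annotate`, the models can be replaced after each retraining round without changing the code that requests annotations.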
9. A storage medium having a computer program stored thereon, characterized in that, when the program is executed by a processor, the method according to any one of claims 4 to 6 is implemented.
CN201910191038.4A 2019-03-12 2019-03-12 System for recognizing characters from image and method Pending CN109934227A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910191038.4A CN109934227A (en) 2019-03-12 2019-03-12 System for recognizing characters from image and method

Publications (1)

Publication Number Publication Date
CN109934227A true CN109934227A (en) 2019-06-25

Family

ID=66987113

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910191038.4A Pending CN109934227A (en) 2019-03-12 2019-03-12 System for recognizing characters from image and method

Country Status (1)

Country Link
CN (1) CN109934227A (en)

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110334707A (en) * 2019-06-28 2019-10-15 苏宁云计算有限公司 A kind of model modification method and update the system
CN110458162A (en) * 2019-07-25 2019-11-15 上海兑观信息科技技术有限公司 A kind of method of intelligent extraction pictograph information
CN110533079A (en) * 2019-08-05 2019-12-03 贝壳技术有限公司 Form method, apparatus, medium and the electronic equipment of image pattern
CN110901788A (en) * 2019-11-27 2020-03-24 佛山科学技术学院 Biped mobile robot system with literacy ability
CN110991279A (en) * 2019-11-20 2020-04-10 北京灵伴未来科技有限公司 Document image analysis and recognition method and system
CN111104479A (en) * 2019-11-13 2020-05-05 中国建设银行股份有限公司 Data labeling method and device
CN111291742A (en) * 2020-02-10 2020-06-16 北京百度网讯科技有限公司 Object recognition method and device, electronic equipment and storage medium
CN112100431A (en) * 2020-11-16 2020-12-18 深圳壹账通智能科技有限公司 Evaluation method, device and equipment of OCR system and readable storage medium
CN112669515A (en) * 2020-12-28 2021-04-16 上海斑马来拉物流科技有限公司 Bill image recognition method and device, electronic equipment and storage medium
CN113762292A (en) * 2020-06-03 2021-12-07 杭州海康威视数字技术股份有限公司 Training data acquisition method and device and model training method and device

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7460735B1 (en) * 2004-09-28 2008-12-02 Google Inc. Systems and methods for using image duplicates to assign labels to images
CN107492135A (en) * 2017-08-21 2017-12-19 维沃移动通信有限公司 A kind of image segmentation mask method, device and computer-readable recording medium
CN108009589A (en) * 2017-12-12 2018-05-08 腾讯科技(深圳)有限公司 Sample data processing method, device and computer-readable recording medium
CN108052937A (en) * 2017-12-28 2018-05-18 百度在线网络技术(北京)有限公司 Based on Weakly supervised character machining device training method, device, system and medium
CN108288078A (en) * 2017-12-07 2018-07-17 腾讯科技(深圳)有限公司 Character identifying method, device and medium in a kind of image
CN108764226A (en) * 2018-04-13 2018-11-06 顺丰科技有限公司 Image text recognition methods, device, equipment and its storage medium
CN109086756A (en) * 2018-06-15 2018-12-25 众安信息技术服务有限公司 A kind of text detection analysis method, device and equipment based on deep neural network
CN109299296A (en) * 2018-11-01 2019-02-01 郑州云海信息技术有限公司 A kind of interactive image text marking method and system
CN109344914A (en) * 2018-10-31 2019-02-15 焦点科技股份有限公司 A kind of method and system of the Text region of random length end to end

Cited By (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110334707A (en) * 2019-06-28 2019-10-15 苏宁云计算有限公司 A kind of model modification method and update the system
CN110458162A (en) * 2019-07-25 2019-11-15 上海兑观信息科技技术有限公司 A kind of method of intelligent extraction pictograph information
CN110533079A (en) * 2019-08-05 2019-12-03 贝壳技术有限公司 Form method, apparatus, medium and the electronic equipment of image pattern
CN110533079B (en) * 2019-08-05 2022-05-24 贝壳技术有限公司 Method, apparatus, medium, and electronic device for forming image sample
CN111104479A (en) * 2019-11-13 2020-05-05 中国建设银行股份有限公司 Data labeling method and device
CN110991279A (en) * 2019-11-20 2020-04-10 北京灵伴未来科技有限公司 Document image analysis and recognition method and system
CN110991279B (en) * 2019-11-20 2023-08-22 北京灵伴未来科技有限公司 Document Image Analysis and Recognition Method and System
CN110901788A (en) * 2019-11-27 2020-03-24 佛山科学技术学院 Biped mobile robot system with literacy ability
CN111291742A (en) * 2020-02-10 2020-06-16 北京百度网讯科技有限公司 Object recognition method and device, electronic equipment and storage medium
CN111291742B (en) * 2020-02-10 2023-08-04 北京百度网讯科技有限公司 Object recognition method and device, electronic equipment and storage medium
CN113762292A (en) * 2020-06-03 2021-12-07 杭州海康威视数字技术股份有限公司 Training data acquisition method and device and model training method and device
CN113762292B (en) * 2020-06-03 2024-02-02 杭州海康威视数字技术股份有限公司 Training data acquisition method and device and model training method and device
CN112100431A (en) * 2020-11-16 2020-12-18 深圳壹账通智能科技有限公司 Evaluation method, device and equipment of OCR system and readable storage medium
CN112100431B (en) * 2020-11-16 2021-02-26 深圳壹账通智能科技有限公司 Evaluation method, device and equipment of OCR system and readable storage medium
WO2022100452A1 (en) * 2020-11-16 2022-05-19 深圳壹账通智能科技有限公司 Ocr system evaluation method, apparatus and device, and readable storage medium
CN112669515A (en) * 2020-12-28 2021-04-16 上海斑马来拉物流科技有限公司 Bill image recognition method and device, electronic equipment and storage medium
CN112669515B (en) * 2020-12-28 2022-09-27 上海斑马来拉物流科技有限公司 Bill image recognition method and device, electronic equipment and storage medium

Similar Documents

Publication Publication Date Title
CN109934227A (en) System for recognizing characters from image and method
CN110348441B (en) Value-added tax invoice identification method and device, computer equipment and storage medium
CN110399800B (en) License plate detection method and system based on deep learning VGG16 framework and storage medium
CN104572072B (en) A kind of language transfer method and equipment to the program based on MVC pattern
CN107392125A (en) Training method/system, computer-readable recording medium and the terminal of model of mind
CN110222330B (en) Semantic recognition method and device, storage medium and computer equipment
CN110728328B (en) Training method and device for classification model
EP3869385A1 (en) Method for extracting structural data from image, apparatus and device
CN110110726A (en) The recognition methods of power equipment nameplate, device, computer equipment and storage medium
CN107862327B (en) Security defect identification system and method based on multiple features
CN106156315B (en) A kind of data quality monitoring method based on disaggregated model judgement
TWI590095B (en) Verification system for software function and verification mathod therefor
CN105824756B (en) A kind of out-of-date demand automatic testing method and system based on code dependence
CN109445778A (en) A kind of method and apparatus that the interface auxiliary based on SVG file generates
CN111159241B (en) Click conversion estimation method and device
CN113742205A (en) Code vulnerability intelligent detection method based on man-machine cooperation
CN109800776A (en) Material mask method, device, terminal and computer readable storage medium
CN106066881A (en) Data processing method and device
CN108021353B (en) Device, method, equipment and storage medium for generating description document of software product
CN115116069A (en) Text processing method and device, electronic equipment and storage medium
CN112182413B (en) Intelligent recommendation method and server based on big teaching data
CN113128496B (en) Method, device and equipment for extracting structured data from image
CN103778210A (en) Method and device for judging specific file type of file to be analyzed
CN110414845B (en) Risk assessment method and device for target transaction
CN113012189A (en) Image recognition method and device, computer equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information
Address after: Room 15202, 15201 and 15203, Building 2, 498 Guoshoujing Road, Pudong New Area, Shanghai, 201203
Applicant after: SHANGHAI DUIGUAN INFORMATION TECHNOLOGY Co.,Ltd.
Address before: 201404 6 2595, No. 5995, Daye Highway, Fengxian District, Shanghai
Applicant before: SHANGHAI DUIGUAN INFORMATION TECHNOLOGY Co.,Ltd.
RJ01 Rejection of invention patent application after publication
Application publication date: 20190625