WO2019174130A1 - Bill recognition method, server, and computer readable storage medium - Google Patents

Bill recognition method, server, and computer readable storage medium Download PDF

Info

Publication number
WO2019174130A1
WO2019174130A1 PCT/CN2018/089202 CN2018089202W WO2019174130A1 WO 2019174130 A1 WO2019174130 A1 WO 2019174130A1 CN 2018089202 W CN2018089202 W CN 2018089202W WO 2019174130 A1 WO2019174130 A1 WO 2019174130A1
Authority
WO
WIPO (PCT)
Prior art keywords
ticket
picture
preset
training
image
Prior art date
Application number
PCT/CN2018/089202
Other languages
French (fr)
Chinese (zh)
Inventor
田野
刘鹏
王健宗
肖京
Original Assignee
平安科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 平安科技(深圳)有限公司 filed Critical 平安科技(深圳)有限公司
Publication of WO2019174130A1 publication Critical patent/WO2019174130A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/04Billing or invoicing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2413Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on distances to training or reference patterns
    • G06F18/24133Distances to prototypes
    • G06F18/24137Distances to cluster centroïds
    • G06F18/2414Smoothing the distance, e.g. radial basis function networks [RBFN]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q40/00Finance; Insurance; Tax strategies; Processing of corporate or income taxes
    • G06Q40/08Insurance
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/22Image preprocessing by selection of a specific region containing or referencing a pattern; Locating or processing of specific regions to guide the detection or recognition
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/24Aligning, centring, orientation detection or correction of the image
    • G06V10/245Aligning, centring, orientation detection or correction of the image by locating a pattern; Special marks for positioning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/30Noise filtering
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/34Smoothing or thinning of the pattern; Morphological operations; Skeletonisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/14Image acquisition
    • G06V30/148Segmentation of character regions
    • G06V30/153Segmentation of character regions using recognition of characters or words
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40Document-oriented image-based pattern recognition
    • G06V30/41Analysis of document content
    • G06V30/413Classification of content, e.g. text, photographs or tables
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition

Definitions

  • the ticket identification technology By introducing the ticket identification technology, it is possible to improve the digitization efficiency of the bill under certain conditions, reduce the work intensity of the business personnel, and improve the accuracy or refinement of the data. Different from the traditional ticket scanning recognition technology, the recognition difficulty of the picture uploaded by the user is greatly increased, mainly in the different shooting environments of the user, the lighting, the rotation angle, the image definition, the occlusion, and even the completion degree of the bills are different. These factors have brought great challenges to the ticket identification process.
  • the present application proposes a ticket identification method and a server to solve the problem of how to quickly and accurately identify a ticket picture.
  • the preset method includes: retaining the top ten digits of the bill number; using the cosine similarity in the tf-idf algorithm to match the hospital name of the hospital field; extracting the date and time on the original string result output by the algorithm As the date; the uppercase Chinese character amount is transferred to Arabic numerals; the non-related characters are removed and the two decimal places are reserved, and all the amount portions of the algorithm output are formatted.
  • Processing secondly, performing text detection on the ticket picture using a pre-trained text detection model to obtain a target character area including characters in the ticket picture, the target character area including a plurality of to-be-identified fields; a target character area, calling a corresponding text recognition model for character recognition to respectively identify character information included in the plurality of to-be-identified fields in the target character area; and finally, acquiring the text recognition model to identify the target character a confidence level generated when the character information is included in the area, and the obtained confidence level is compared with a preset confidence threshold. If the confidence level is higher than the confidence threshold, the target is output according to a preset method.
  • Character information contained in the character area if the confidence is lower than the confidence threshold, The document image to a third party identified by inspection, and the output of a third party verify identification.
  • the ticket identification method, the server and the computer readable storage medium proposed by the application can improve the digitization efficiency of the ticket, reduce the work intensity of the business personnel, improve the accuracy or refinement of the data, and combine the deep learning algorithm with the third party assistance.
  • the ticket can be more accurately identified, and the present application is more convenient, faster, and more accurate than the prior art, and significantly reduces the cost.
  • 1 is a schematic diagram of an optional hardware architecture of the server of the present application.
  • FIG. 2 is a schematic diagram of a program module of a first embodiment of the ticket identification system of the present application
  • FIG. 3 is a schematic diagram of a program module of a second embodiment of the ticket identification system of the present application.
  • FIG. 4 is a schematic flow chart of a first embodiment of the ticket identification method of the present application.
  • FIG. 5 is a schematic flow chart of a second embodiment of the ticket identification method of the present application.
  • FIG. 6 is a schematic flow chart of a third embodiment of the ticket identification method of the present application.
  • FIG. 7 is a schematic flow chart of a fourth embodiment of the ticket identification method of the present application.
  • the server 1 may include, but is not limited to, the memory 11, the processor 12, and the network interface 13 being communicably connected to each other through a system bus. It is pointed out that Figure 1 only shows the server 1 with the components 11-13, but it should be understood that not all illustrated components are required to be implemented, and more or fewer components may be implemented instead.
  • the server 1 may be a computing device such as a rack server, a blade server, a tower server, or a rack server.
  • the server 1 may be an independent server or a server cluster composed of multiple servers.
  • the memory 11 includes at least one type of readable storage medium including a flash memory, a hard disk, a multimedia card, a card type memory (eg, SD or DX memory, etc.), a random access memory (RAM), a static Random access memory (SRAM), read only memory (ROM), electrically erasable programmable read only memory (EEPROM), programmable read only memory (PROM), magnetic memory, magnetic disk, optical disk, and the like.
  • the memory 11 may be an internal storage unit of the server 1, such as a hard disk or memory of the server 1.
  • the processor 12 may be a Central Processing Unit (CPU), controller, microcontroller, microprocessor, or other data processing chip in some embodiments.
  • the processor 12 is typically used to control the overall operation of the server 1.
  • the processor 12 is configured to run program code or process data stored in the memory 11, such as running the ticket identification system 2 and the like.
  • the network interface 13 may comprise a wireless network interface or a wired network interface, which is typically used to establish a communication connection between the server 1 and other electronic devices.
  • FIG. 2 it is a program block diagram of the first embodiment of the ticket identification system 2 of the present application.
  • the ticket identification system 2 includes a series of computer program instructions stored on the memory 11, and when the computer program instructions are executed by the processor 12, the ticket identification operation of the embodiments of the present application can be implemented.
  • the ticket identification system 2 can be divided into one or more modules based on the particular operations implemented by the various portions of the computer program instructions. For example, in FIG. 2, the ticket identification system 2 can be divided into a pre-processing module 21, a text detection module 22, a text recognition module 23, and a comparison module 24-level output module 25. among them:
  • the pre-processing module 21 is configured to receive a picture of the ticket to be identified, receive a picture of the ticket to be identified, and process the ticket picture by the pre-trained ticket picture recognition model to obtain the processed ticket picture.
  • the pre-processing module 21 receives the bill image to be identified, and performs pre-processing on the bill image according to a preset step, where the preset step may be to classify, denoise, correct, and intercept the bill image. And according to the preset step, the pre-trained ticket picture recognition model performs classification processing, denoising processing, correction processing, and intercepting ticket processing on the ticket picture.
  • the deep convolutional neural network model used in this application consists of one input layer, 13 convolutional layers, 5 pooling layers, 2 fully connected layers, and 1 sorting layer.
  • the detailed structure of the deep convolutional neural network model is shown in Table 1.
  • Layer Name column indicates the name of each layer
  • Input indicates the input layer
  • Conv indicates the convolution layer of the model
  • Conv1 indicates the first convolution layer of the model
  • MaxPool indicates the maximum pooling layer of the model
  • MaxPool1 indicates the model.
  • the text detection module 22 is configured to perform text detection on the ticket image processed by the pre-processing module 21 by using a pre-trained text detection model, and determine that the target character region including the character and the target character region included in the ticket image are included. The field to be identified.
  • the text detection model uses a CTPN (Connectionist Text Proposal Network) model based on CaffeNet
  • CTPN model structure includes VGG16 (convolution neural network), LSTM, fully connected layer, etc., wherein VGG is developed from Alex-net.
  • VGG convolution neural network
  • LSTM Long Short-Term Memory
  • LSTM Long Short-Term Memory
  • the step of performing text detection on the ticket picture using the text detection model includes:
  • Convolutional Layers perform feature extraction on input image dicing
  • the obtained feature sequences are placed in a loop network layer (Recurrent Layers) for character recognition;
  • the training steps of the model include:
  • the number of picture samples is greater than the number of picture samples in the second data set, the first data set is used as a training set, and the second data set is used as a test set;
  • the image samples in the first data set are sent to the text recognition model for model training, and the model is tested using the second data set at intervals (for example, every 1000 iterations) to evaluate the effect of the currently trained model.
  • the model obtained by the training is used to identify the character information of the picture in the second data set, and compares with the name of the tested picture to calculate the error of the recognition result and the labeling result. If the model at the time of the test diverge the error in the recognition of the bill picture, the training parameters are adjusted and retrained, so that the error of the recognition of the bill picture by the model at the training can converge. When the error converges, the model training is ended and the generated model is used as the final text recognition model.
  • a corresponding confidence is generated for the recognized character information.
  • the step of obtaining the confidence may be: estimating the generalized confidence using a corresponding formula for different fields to be identified; and obtaining the confidence according to the generalized confidence.
  • the generalized confidence may be obtained from the distance calculation of the unknown sample from the representative sample, or the multi-layer forward neural network may be used to obtain the generalized confidence, and the confidence may be inferred from the generalized confidence using a statistical method. It should be noted that the technician can select a suitable formula and tool according to the need to generate a corresponding confidence for the recognized character information, and details are not described herein again.
  • the comparing module 24 is configured to compare the obtained confidence level with a preset confidence threshold.
  • the confidence level is higher than the confidence threshold, the character information included in the target character region is retained, and if the confidence level is lower than the confidence threshold, the document image is passed to a third party. Carry out inspection identification.
  • the output module 25 is configured to output an output of the character recognition result according to the output value of the comparison module 24, and if the comparator inputs the confidence level higher than the confidence threshold, output the target character region according to a preset rule. Character information, if the confidence level is lower than the confidence threshold, the document picture is verified by a third party, and the result of the third party verification identification is output.
  • the preset rule includes: the first ten digits that the ticket number can be reserved; the hospital field uses the cosine similarity in the tf-idf algorithm to match the best hospital name; the original string result of the date part output in the algorithm The year, month, and day are extracted; the amount of capital Chinese characters is processed by Arabic numerals; all the amount parts are formatted uniformly for the algorithm output, and the non-related characters are removed and the two decimal places are retained.
  • the third party adopts a mechanism for randomly distributing tasks, and each task is distributed to a certain number of users, and then the majority of the same answers are obtained. That is, the result is finally recovered through a cross-validation mechanism.
  • the pre-processing module 21 in the ticket identification system 2 includes a classification module 210, a denoising module 220, a correction module 230, and an intercepting module 240.
  • the classification module 210 is configured to identify a ticket category in the received picture by using a pre-trained ticket picture recognition model after receiving the bill picture to be processed, and output a category identification result of the ticket (for example,
  • the categories of medical bills include outpatient bills, hospital bills, and other types of bills.
  • the denoising module 220 performs image smoothing processing and wavelet filtering processing on the ticket image, wherein the image smoothing processing may adopt a neighborhood averaging method and a median filtering method, and the neighborhood averaging method is to perform one pixel.
  • the average value of all the pixels in the neighborhood is assigned to the corresponding pixel in the output image to achieve the purpose of smoothing.
  • the process is to make a window slide on the image.
  • the value of the center position of the window is the average value of each point in the window. Instead, the grayscale average of a few pixels is used instead of the grayscale of one pixel.
  • the median filtering is a nonlinear smoothing filter based on the sorting statistics theory that can effectively suppress noise.
  • the correction module 230 performs a correction process on the ticket picture such that the ticket is rotated to the correct direction.
  • the present application also proposes a ticket identification method.
  • FIG. 4 it is a schematic flowchart of the first embodiment of the ticket identification method of the present application.
  • the order of execution of the steps in the flowchart shown in FIG. 5 may be changed according to different requirements, and some steps may be omitted.
  • Step S110 Receive a picture of the ticket to be identified, and process the ticket picture by a pre-trained ticket picture recognition model.
  • performing area recognition on a character area of the ticket picture, and identifying a small frame containing character information and having a fixed width of a preset value (for example, 16 pixel width) from the ticket picture, and including the included Small boxes whose character information is on the same line are stitched together in order to form a target line character area containing character information.
  • a preset value for example, 16 pixel width
  • identifying the input ticket picture may specifically be as follows:
  • the feature map (W*H*C) is obtained from the first five convolutional layers of VGG16.
  • the features of the window of 3*3*C are taken at each position of the feature map of the fifth convolutional layer, and these features are used to predict the category information and location information corresponding to the k anchors at the position.
  • the full connectivity layer feature is entered into three classification or regression layers, because by default the width of each anchor is 16 and no longer changes. The width of the returned rectangles is fixed.
  • step S140 the obtained confidence level is compared with a preset confidence threshold. If the confidence level is higher than the confidence threshold, the character information included in the target character region is output according to a preset method. If the confidence level is lower than the confidence threshold, the document picture is verified by a third party, and the result of the third party verification identification is output.
  • step S110 of the ticket identification method includes the following steps:
  • Step S210 classifying the ticket picture.
  • the correcting process includes the steps of:
  • Step S240 intercepting the ticket picture.
  • FIG. 6 is a schematic flowchart diagram of a third embodiment of the ticket identification method of the present application.
  • the training step of the text detection model in step S120 of the ticket identification method includes:
  • Step S310 preparing a preset number of ticket picture samples marked with corresponding picture categories for each preset ticket picture category.
  • the preset picture category includes an outpatient ticket and a hospitalization ticket, and the preset number is 1000 sheets.
  • the first ratio and the second ratio are 80% and 20%.
  • Step S340 the accuracy of the ticket picture recognition model of the training is verified by using the verification set. If the accuracy rate is greater than or equal to the preset accuracy rate, the training ends; if the accuracy rate is less than the preset accuracy rate, the installation is increased. The number of picture samples corresponding to each preset picture category is described, and the above steps are re-executed.
  • Step S420 splicing column by column from left to right on all channels outputted by the convolution layer to obtain a feature sequence.
  • Step S430 placing the obtained feature sequence into the loop network layer for character recognition.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Business, Economics & Management (AREA)
  • Development Economics (AREA)
  • Accounting & Taxation (AREA)
  • Finance (AREA)
  • Artificial Intelligence (AREA)
  • Strategic Management (AREA)
  • General Business, Economics & Management (AREA)
  • Marketing (AREA)
  • Economics (AREA)
  • Data Mining & Analysis (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • General Engineering & Computer Science (AREA)
  • Technology Law (AREA)
  • Image Analysis (AREA)
  • Character Discrimination (AREA)

Abstract

Disclosed is a bill recognition method. The method comprises: receiving a bill image to be recognized, and processing the bill image by means of a pre-trained bill image recognition model; performing text detection on the bill image by using a pre-trained text detection model, and determining a target character zone comprising characters in the bill image and fields to be recognized in the target character zone; and invoking, for the fields to be recognized, a corresponding text recognition model for character recognition, so as to separately recognize character information contained in the multiple fields to be recognized in the target character zone, and outputting the recognition result. The present application further provides a server and a computer readable storage medium. The bill recognition method, the server, and the computer readable storage medium provided in the present application can improve the digitalization efficiency of bills, reduce the work intensity of service staff, and enhance the accuracy or refinement level of data.

Description

票据识别方法、服务器及计算机可读存储介质Ticket identification method, server and computer readable storage medium
优先权申明Priority claim
本申请要求于2018年03月14日提交中国专利局、申请号为201810208586.9,名称为“票据识别方法、服务器及计算机可读存储介质”的中国专利申请的优先权,该中国专利申请的整体内容以参考的方式结合本申请中。The present application claims priority to Chinese Patent Application No. 201810208586.9, entitled "Note Recognition Method, Server and Computer Readable Storage Medium", which is filed on March 14, 2018, the entire contents of which is incorporated herein by reference. This application is incorporated by reference.
技术领域Technical field
本申请涉及图像识别领域,尤其涉及一种票据识别方法、服务器计算机可读存储介质。The present application relates to the field of image recognition, and in particular, to a ticket identification method and a server computer readable storage medium.
背景技术Background technique
如今随着经济的发展和人们生活水平的提高,越来越多的人选择购买医疗,商业,金融等保险。其中某些保险公司慢慢开始了自助理赔业务,比如用户在进行医疗理赔过程中,只需要将门诊或住院发票拍照上传到保险公司系统,保险公司业务员会将用户上传的发票图片上的信息录入到理赔系统中,以进行下一步操作,这种方式大大方便了用户进行理赔的过程。但是另一方面,也增加了保险公司方面的工作压力。问题主要表现在需要花费大量的人力来处理用户上传的票据图像,许多时候业务员也会对单一的工作产生疲惫感,使得数据录入错误率升高。Nowadays, with the development of the economy and the improvement of people's living standards, more and more people choose to purchase medical, commercial, financial and other insurance. Some insurance companies have slowly started self-service claims business. For example, in the process of medical claims, users only need to upload photos of outpatient or hospital invoices to the insurance company system. The insurance company salesperson will upload the information on the invoice pictures uploaded by the users. Entering into the claims system for the next step, this method greatly facilitates the user's process of claim settlement. But on the other hand, it also increases the pressure on the insurance company. The problem is mainly caused by the need to spend a lot of manpower to process the image uploaded by the user. In many cases, the salesman is also tired of a single job, which makes the data entry error rate increase.
通过引入票据识别技术,可以在一定条件下提高票据的数字化效率,降低业务人员的工作强度,提高数据的准确性或精细化。不同于传统的票据扫描识别技术,用户拍照上传的票据图片的识别难度大大增加,主要表现在用户的拍照环境不同,光照,旋转角度,图像清晰度,遮挡,甚至票据的完成程度都各不相同,这些因素都给票据识别过程带来了极大的挑战。By introducing the ticket identification technology, it is possible to improve the digitization efficiency of the bill under certain conditions, reduce the work intensity of the business personnel, and improve the accuracy or refinement of the data. Different from the traditional ticket scanning recognition technology, the recognition difficulty of the picture uploaded by the user is greatly increased, mainly in the different shooting environments of the user, the lighting, the rotation angle, the image definition, the occlusion, and even the completion degree of the bills are different. These factors have brought great challenges to the ticket identification process.
发明内容Summary of the invention
有鉴于此,本申请提出一种票据识别方法及服务器,以解决如何快速、准确识别票据图片的问题。In view of this, the present application proposes a ticket identification method and a server to solve the problem of how to quickly and accurately identify a ticket picture.
首先,为实现上述目的,本申请提出一种票据识别方法,该方法包括步骤:First, in order to achieve the above object, the present application provides a ticket identification method, the method comprising the steps of:
接收待识别的票据图片,预先训练的票据图片识别模型对所述票据图片进行处理得到处理后的票据图片;Receiving a picture of the ticket to be identified, and processing the picture of the ticket by the pre-trained ticket picture recognition model to obtain a processed picture of the ticket;
使用预先训练的文本检测模型对所述处理后的票据图片进行文本检测,确定所述处理后的票据图片中包括字符的目标字符区域及所述目标字符区域包括的待识别字段;Performing text detection on the processed ticket image by using a pre-trained text detection model, and determining that the processed ticket image includes a target character region of the character and a to-be-identified field included in the target character region;
针对所述待识别字段,调用对应的文本识别模型进行字符识别,所述文 本识别模型识别出所述待识别字段包含的字符信息,并针对识别的所述字符信息生成置信度;及And corresponding to the to-be-identified field, calling a corresponding text recognition model for character recognition, the text recognition model identifying character information included in the to-be-identified field, and generating a confidence level for the recognized character information;
将所述置信度与预设的置信度阈值进行比较,若所述置信度高于所述置信度阈值,则按照预设方法输出所述目标字符区域包含的字符信息,若所述置信度低于所述置信度阈值,则将所述单据图片通过第三方进行检验识别,并将所述第三方检验识别的结果输出;Comparing the confidence level with a preset confidence threshold, if the confidence is higher than the confidence threshold, outputting the character information included in the target character region according to a preset method, if the confidence is low At the confidence threshold, the document picture is verified by a third party, and the result of the third party verification identification is output;
其中,所述预设方法包括:保留票据单号前十位;使用tf-idf算法中的余弦相似度匹配医院字段最佳的医院名称;在算法输出的原始字符串结果上提取出年月日作为日期;将大写汉字金额进行转阿拉伯数字处理;去除非相关字符并保留小数点后两位,对算法输出的所有金额部分进行格式统一。The preset method includes: retaining the top ten digits of the bill number; using the cosine similarity in the tf-idf algorithm to match the hospital name of the hospital field; extracting the date and time on the original string result output by the algorithm As the date; the uppercase Chinese character amount is transferred to Arabic numerals; the non-related characters are removed and the two decimal places are reserved, and all the amount portions of the algorithm output are formatted.
此外,为实现上述目的,本申请还提供一种服务器,包括存储器、处理器及存储在所述存储器上并可在所述处理器上运行的票据识别系统,所述票据识别系统被所述处理器执行时实现如上述的票据识别方法的步骤。In addition, in order to achieve the above object, the present application further provides a server including a memory, a processor, and a ticket identification system stored on the memory and operable on the processor, the ticket identification system being processed The steps of the ticket identification method as described above are implemented when the device is executed.
进一步地,为实现上述目的,本申请还提供一种计算机可读存储介质,所述计算机可读存储介质存储有票据识别系统,所述票据识别系统可被至少一个处理器执行,以使所述至少一个处理器执行如上述的票据识别方法的步骤。Further, in order to achieve the above object, the present application further provides a computer readable storage medium storing a ticket identification system, the ticket identification system being executable by at least one processor to enable the At least one processor performs the steps of the ticket identification method as described above.
相较于现有技术,本申请所提出的票据识别方法、服务器及计算机可读存储介质,首先接收待识别的票据图片,预先训练的票据图片识别模型根据预设规则对所述票据图片进行预处理;其次,使用预先训练的文本检测模型对所述票据图片进行文本检测,获得所述票据图片中包括字符的目标字符区域,所述目标字符区域包括多个待识别字段;再次,针对所述目标字符区域,调用对应的文本识别模型进行字符识别,以分别识别出所述目标字符区域中的所述多个待识别字段包含的字符信息;最后,获取所述文本识别模型识别所述目标字符区域包含的字符信息时生成的置信度,将获得的所述置信度与预设的置信度阈值进行比较,若所述置信度高于所述置信度阈值,则按照预设方法输出所述目标字符区域包含的字符信息,若所述置信度低于所述置信度阈值,则将所述单据图片通过第三方进行检验识别,并将所述第三方检验识别的结果输出。采用本申请所提出的票据识别方法、服务器及计算机可读存储介质可以提高票据的数字化效率,降低业务人员的工作强度,提高数据的准确性或精细化,并且,结合深度学习算法及第三方辅助可以更加精确的识别票据,相较于现有技术,本申请更加方便、快捷、准确,且显著降低了成本。Compared with the prior art, the ticket identification method, the server and the computer readable storage medium proposed by the present application first receive a picture of the ticket to be identified, and the pre-trained ticket picture recognition model pre-predicts the ticket picture according to a preset rule. Processing; secondly, performing text detection on the ticket picture using a pre-trained text detection model to obtain a target character area including characters in the ticket picture, the target character area including a plurality of to-be-identified fields; a target character area, calling a corresponding text recognition model for character recognition to respectively identify character information included in the plurality of to-be-identified fields in the target character area; and finally, acquiring the text recognition model to identify the target character a confidence level generated when the character information is included in the area, and the obtained confidence level is compared with a preset confidence threshold. If the confidence level is higher than the confidence threshold, the target is output according to a preset method. Character information contained in the character area, if the confidence is lower than the confidence threshold, The document image to a third party identified by inspection, and the output of a third party verify identification. The ticket identification method, the server and the computer readable storage medium proposed by the application can improve the digitization efficiency of the ticket, reduce the work intensity of the business personnel, improve the accuracy or refinement of the data, and combine the deep learning algorithm with the third party assistance. The ticket can be more accurately identified, and the present application is more convenient, faster, and more accurate than the prior art, and significantly reduces the cost.
附图说明DRAWINGS
图1是本申请服务器一可选的硬件架构的示意图;1 is a schematic diagram of an optional hardware architecture of the server of the present application;
图2是本申请票据识别系统第一实施例的程序模块示意图;2 is a schematic diagram of a program module of a first embodiment of the ticket identification system of the present application;
图3是本申请票据识别系统第二实施例的程序模块示意图;3 is a schematic diagram of a program module of a second embodiment of the ticket identification system of the present application;
图4是本申请票据识别方法第一实施例的流程示意图;4 is a schematic flow chart of a first embodiment of the ticket identification method of the present application;
图5是本申请票据识别方法第二实施例的流程示意图;5 is a schematic flow chart of a second embodiment of the ticket identification method of the present application;
图6是本申请票据识别方法第三实施例的流程示意图;6 is a schematic flow chart of a third embodiment of the ticket identification method of the present application;
图7是本申请票据识别方法第四实施例的流程示意图。FIG. 7 is a schematic flow chart of a fourth embodiment of the ticket identification method of the present application.
本申请目的的实现、功能特点及优点将结合实施例,参照附图做进一步说明。The implementation, functional features and advantages of the present application will be further described with reference to the accompanying drawings.
具体实施方式detailed description
为了使本申请的目的、技术方案及优点更加清楚明白,以下结合附图及实施例,对本申请进行进一步详细说明。应当理解,此处所描述的具体实施例仅用以解释本申请,并不用于限定本申请。基于本申请中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。In order to make the objects, technical solutions, and advantages of the present application more comprehensible, the present application will be further described in detail below with reference to the accompanying drawings and embodiments. It is understood that the specific embodiments described herein are merely illustrative of the application and are not intended to be limiting. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present application without departing from the inventive scope are the scope of the present application.
需要说明的是,在本申请中涉及“第一”、“第二”等的描述仅用于描述目的,而不能理解为指示或暗示其相对重要性或者隐含指明所指示的技术特征的数量。由此,限定有“第一”、“第二”的特征可以明示或者隐含地包括至少一个该特征。另外,各个实施例之间的技术方案可以相互结合,但是必须是以本领域普通技术人员能够实现为基础,当技术方案的结合出现相互矛盾或无法实现时应当认为这种技术方案的结合不存在,也不在本申请要求的保护范围之内。It should be noted that the descriptions of "first", "second" and the like in the present application are for the purpose of description only, and are not to be construed as indicating or implying their relative importance or implicitly indicating the number of technical features indicated. . Thus, features defining "first" or "second" may include at least one of the features, either explicitly or implicitly. In addition, the technical solutions between the various embodiments may be combined with each other, but must be based on the realization of those skilled in the art, and when the combination of the technical solutions is contradictory or impossible to implement, it should be considered that the combination of the technical solutions does not exist. Nor is it within the scope of protection required by this application.
参阅图1所示,是本申请服务器1一可选的硬件架构的示意图。Referring to FIG. 1, it is a schematic diagram of an optional hardware architecture of the server 1 of the present application.
本实施例中,所述服务器1可包括,但不仅限于,可通过系统总线相互通信连接存储器11、处理器12、网络接口13。需要指出的是,图1仅示出了具有组件11-13的服务器1,但是应理解的是,并不要求实施所有示出的组件,可以替代的实施更多或者更少的组件。In this embodiment, the server 1 may include, but is not limited to, the memory 11, the processor 12, and the network interface 13 being communicably connected to each other through a system bus. It is pointed out that Figure 1 only shows the server 1 with the components 11-13, but it should be understood that not all illustrated components are required to be implemented, and more or fewer components may be implemented instead.
其中,所述服务器1可以是机架式服务器、刀片式服务器、塔式服务器或机柜式服务器等计算设备,该服务器1可以是独立的服务器,也可以是多个服务器所组成的服务器集群。The server 1 may be a computing device such as a rack server, a blade server, a tower server, or a rack server. The server 1 may be an independent server or a server cluster composed of multiple servers.
所述存储器11至少包括一种类型的可读存储介质,所述可读存储介质包括闪存、硬盘、多媒体卡、卡型存储器(例如,SD或DX存储器等)、随机访 问存储器(RAM)、静态随机访问存储器(SRAM)、只读存储器(ROM)、电可擦除可编程只读存储器(EEPROM)、可编程只读存储器(PROM)、磁性存储器、磁盘、光盘等。在一些实施例中,所述存储器11可以是所述服务器1的内部存储单元,例如该服务器1的硬盘或内存。在另一些实施例中,所述存储器11也可以是所述服务器1的外部存储设备,例如该服务器1上配备的插接式硬盘,智能存储卡(Smart Media Card,SMC),安全数字(Secure Digital,SD)卡,闪存卡(Flash Card)等。当然,所述存储器11还可以既包括所述服务器1的内部存储单元也包括其外部存储设备。本实施例中,所述存储器11通常用于存储安装于所述服务器1的操作系统和各类应用软件,例如票据识别系统2的程序代码等。此外,所述存储器11还可以用于暂时地存储已经输出或者将要输出的各类数据。The memory 11 includes at least one type of readable storage medium including a flash memory, a hard disk, a multimedia card, a card type memory (eg, SD or DX memory, etc.), a random access memory (RAM), a static Random access memory (SRAM), read only memory (ROM), electrically erasable programmable read only memory (EEPROM), programmable read only memory (PROM), magnetic memory, magnetic disk, optical disk, and the like. In some embodiments, the memory 11 may be an internal storage unit of the server 1, such as a hard disk or memory of the server 1. In other embodiments, the memory 11 may also be an external storage device of the server 1, such as a plug-in hard disk equipped on the server 1, a smart memory card (SMC), and a secure digital (Secure) Digital, SD) cards, flash cards, etc. Of course, the memory 11 can also include both the internal storage unit of the server 1 and its external storage device. In this embodiment, the memory 11 is generally used to store an operating system installed in the server 1 and various types of application software, such as program codes of the ticket identification system 2. Further, the memory 11 can also be used to temporarily store various types of data that have been output or are to be output.
所述处理器12在一些实施例中可以是中央处理器(Central Processing Unit,CPU)、控制器、微控制器、微处理器、或其他数据处理芯片。该处理器12通常用于控制所述服务器1的总体操作。本实施例中,所述处理器12用于运行所述存储器11中存储的程序代码或者处理数据,例如运行所述的票据识别系统2等。The processor 12 may be a Central Processing Unit (CPU), controller, microcontroller, microprocessor, or other data processing chip in some embodiments. The processor 12 is typically used to control the overall operation of the server 1. In this embodiment, the processor 12 is configured to run program code or process data stored in the memory 11, such as running the ticket identification system 2 and the like.
所述网络接口13可包括无线网络接口或有线网络接口,该网络接口13通常用于在所述服务器1与其他电子设备之间建立通信连接。The network interface 13 may comprise a wireless network interface or a wired network interface, which is typically used to establish a communication connection between the server 1 and other electronic devices.
至此,己经详细介绍了本申请相关设备的硬件结构和功能。下面,将基于上述介绍提出本申请的各个实施例。So far, the hardware structure and functions of the devices related to this application have been described in detail. Hereinafter, various embodiments of the present application will be made based on the above description.
首先,本申请提出一种票据识别系统2。First, the present application proposes a ticket identification system 2.
参阅图2所示,是本申请票据识别系统2第一实施例的程序模块图。Referring to FIG. 2, it is a program block diagram of the first embodiment of the ticket identification system 2 of the present application.
本实施例中,所述票据识别系统2包括一系列的存储于存储器11上的计算机程序指令,当该计算机程序指令被处理器12执行时,可以实现本申请各实施例的票据识别操作。在一些实施例中,基于该计算机程序指令各部分所实现的特定的操作,票据识别系统2可以被划分为一个或多个模块。例如,在图2中,所述票据识别系统2可以被分割成预处理模块21、文本检测模块22、文本识别模块23、比较模块24级输出模块25。其中:In the present embodiment, the ticket identification system 2 includes a series of computer program instructions stored on the memory 11, and when the computer program instructions are executed by the processor 12, the ticket identification operation of the embodiments of the present application can be implemented. In some embodiments, the ticket identification system 2 can be divided into one or more modules based on the particular operations implemented by the various portions of the computer program instructions. For example, in FIG. 2, the ticket identification system 2 can be divided into a pre-processing module 21, a text detection module 22, a text recognition module 23, and a comparison module 24-level output module 25. among them:
所述预处理模块21,用于接收待识别的票据图片,接收待识别的票据图片,预先训练的票据图片识别模型对所述票据图片进行处理得到处理后的票据图片。The pre-processing module 21 is configured to receive a picture of the ticket to be identified, receive a picture of the ticket to be identified, and process the ticket picture by the pre-trained ticket picture recognition model to obtain the processed ticket picture.
具体地,所述预处理模块21接收待识别的票据图片,根据预设步骤对所述票据图片进行预处理,所述预设步骤可以是对所述票据图片进行分类、去 噪、矫正、截取票据,根据所述预设步骤,预先训练的票据图片识别模型对所述票据图片进行分类处理、去噪处理、矫正处理及截取票据处理。Specifically, the pre-processing module 21 receives the bill image to be identified, and performs pre-processing on the bill image according to a preset step, where the preset step may be to classify, denoise, correct, and intercept the bill image. And according to the preset step, the pre-trained ticket picture recognition model performs classification processing, denoising processing, correction processing, and intercepting ticket processing on the ticket picture.
具体地,所述分类处理用于将接收到的票据图片进行分类,便于后续的处理,所述去噪处理可以消除票据图片的噪声点,可以使图像产生较少的模糊,用户上传的票据可能会有多种旋转角度,我们必须将票据旋转到正确的方向,才能进行下一步操作,对票据图片进行矫正处理可以使得票据旋转到正确的方向,所述截取票据为将票据从原始票据图片中截取出来,原始票据图片包括票据及背景图片,通过截取票据可以去除背景图片的干扰。Specifically, the classification process is used to classify the received ticket pictures to facilitate subsequent processing, and the denoising process can eliminate noise points of the ticket picture, and can generate less blurring of the image, and the user uploaded the ticket may There are a variety of rotation angles, we have to rotate the ticket to the correct direction in order to proceed to the next step, the correction of the ticket picture can make the ticket rotate to the correct direction, the interception ticket is to take the ticket from the original ticket picture Intercepted, the original bill picture includes the bill and the background image, and the interception of the bill can remove the interference of the background image.
具体地,所述票据图片识别模型为深度卷积神经网络(例如,该深度卷积神经网络可以为在CaffeNet的环境下选取的基于深度卷积神经网络SSD(Single Shot MultiBox Detector)算法模型),其基本网络结构使用了VGG16的网络结构,再去掉了最后面的全连接层,加入了额外的六个不同尺度的特征层。Specifically, the ticket picture recognition model is a deep convolutional neural network (for example, the deep convolutional neural network may be a SSD (Single Shot MultiBox Detector) algorithm model selected in a CaffeNet environment), The basic network structure uses the network structure of VGG16, and then removes the last fully connected layer, adding an additional six different scale feature layers.
本申请采用的深度卷积神经网络模型由1个输入层,13个卷积层,5个池化层,2个全连接层,1个分类层构成。所述深度卷积神经网络模型的详细结构如表1所示。The deep convolutional neural network model used in this application consists of one input layer, 13 convolutional layers, 5 pooling layers, 2 fully connected layers, and 1 sorting layer. The detailed structure of the deep convolutional neural network model is shown in Table 1.
Figure PCTCN2018089202-appb-000001
Figure PCTCN2018089202-appb-000001
Figure PCTCN2018089202-appb-000002
Figure PCTCN2018089202-appb-000002
表1Table 1
其中:Layer Name列表示每一层的名称,Input表示输入层,Conv表示模型的卷积层,Conv1表示模型的第1个卷积层,MaxPool表示模型的最大值池化层,MaxPool1表示模型的第1个最大值池化层,Fc表示模型中的全连接层,Fc1表示模型中第1个全连接层,Softmax表示Softmax分类器;Batch Size表示当前层的输入图像数目;Kernel Size表示当前层卷积核的尺度(例如,Kernel Size可以等于3,表示卷积核的尺度为3x 3);Stride Size表示卷积核的移动步长,即做完一次卷积之后移动到下一个卷积位置的距离;Pad Size表示对当前网络层之中的图像填充的大小。Among them: Layer Name column indicates the name of each layer, Input indicates the input layer, Conv indicates the convolution layer of the model, Conv1 indicates the first convolution layer of the model, MaxPool indicates the maximum pooling layer of the model, and MaxPool1 indicates the model. The first maximum pooling layer, Fc represents the fully connected layer in the model, Fc1 represents the first fully connected layer in the model, Softmax represents the Softmax classifier; Batch Size represents the number of input images of the current layer; Kernel Size represents the current layer The scale of the convolution kernel (for example, the Kernel Size can be equal to 3, indicating that the scale of the convolution kernel is 3x 3); the Stride Size indicates the moving step size of the convolution kernel, that is, moving to the next convolution position after completing one convolution The distance; Pad Size indicates the size of the image fill in the current network layer.
所述文本检测模块22,用于使用预先训练的文本检测模型对预处理模块21处理后的票据图片进行文本检测,确定所述票据图片中包括字符的目标字符区域及所述目标字符区域包括的待识别字段。The text detection module 22 is configured to perform text detection on the ticket image processed by the pre-processing module 21 by using a pre-trained text detection model, and determine that the target character region including the character and the target character region included in the ticket image are included. The field to be identified.
具体地,所述文本检测模型使用基于CaffeNet的CTPN(Connectionist Text Proposal Network)模型,CTPN模型结构包括VGG16(卷积神经网络)、LSTM、全连接层等,其中,VGG是在从Alex-net发展而来的网络,LSTM(Long Short-Term Memory)是长短期记忆网络,是一种时间递归神经网络。Specifically, the text detection model uses a CTPN (Connectionist Text Proposal Network) model based on CaffeNet, and the CTPN model structure includes VGG16 (convolution neural network), LSTM, fully connected layer, etc., wherein VGG is developed from Alex-net. The network, LSTM (Long Short-Term Memory) is a long-term and short-term memory network, which is a time recurrent neural network.
使用所述文本检测模型对所述票据图片进行文本检测的步骤包括:The step of performing text detection on the ticket picture using the text detection model includes:
使用VGG16得到深度特征;Use VGG16 to get depth features;
用固定宽度(例如,16个像素宽度)的框来检测text proposal(文本线的一部分),并把同一行框对应的特征串成序列,输入到LSTM中;A frame with a fixed width (for example, 16 pixels width) is used to detect a text proposal (a part of a text line), and a feature string corresponding to the same line frame is serialized and input into the LSTM;
使用全连接层来回归以及分类,并将符合条件的text proposal合并成最终的文本线,所述文本线即所述字符区域。The fully connected layer is used to regress and classify, and the eligible text proposals are merged into a final text line, which is the character region.
CTPN充分利用text line具有上下文连接的特点,结合RNN与CNN,提升了文本检测的精度。CTPN makes full use of the contextual connection of text line, combined with RNN and CNN, improves the accuracy of text detection.
所述文本识别模块23,用于针对所述待识别字段,调用对应的文本识别模型进行字符识别,所述文本识别模型识别出所述待识别字段包含的字符信息,并针对识别的所述字符信息生成置信度。The text recognition module 23 is configured to call, according to the to-be-identified field, a corresponding text recognition model for character recognition, where the text recognition model identifies character information included in the to-be-identified field, and the recognized character is Information generation confidence.
具体地,所述文本识别模型基于MXNet的CNN+LSTM+CTC的模型结构,其中,CNN(Convolutional Neural Networks)为卷积神经网络,CTC(Connectionist temporal classification)接在CNN网络的最后一层用于序列学习所用,所述文本识别模型的结构包括卷积层(Convolutional Layers)、循环网络层(Recurrent Layers)及转译层(Transcription Layer),该模型对目标字符区域进行字符识别的步骤包括:Specifically, the text recognition model is based on a model structure of CNN+LSTM+CTC of MXNet, wherein CNN (Convolutional Neural Networks) is a convolutional neural network, and CTC (Connectionist temporal classification) is connected to the last layer of the CNN network. For the sequence learning, the structure of the text recognition model includes Convolutional Layers, Recurrent Layers, and Transcription Layer. The steps of character recognition of the target character region include:
卷积层(Convolutional Layers)对输入图片切块进行特征提取;Convolutional Layers perform feature extraction on input image dicing;
在最后一个卷积层输出的所有通道上,从左到右逐列拼接,得到特征序列;On all channels of the last convolutional layer output, splicing from left to right column by column to obtain a sequence of features;
把得到的特征序列放入循环网络层(Recurrent Layers)中进行字符的识别;The obtained feature sequences are placed in a loop network layer (Recurrent Layers) for character recognition;
经过转译层(Transcription Layer)对识别的结果进行处理,根据字符字典生成最后的识别结果。The result of the recognition is processed by the Transcription Layer, and the final recognition result is generated according to the character dictionary.
该模型的训练步骤包括:The training steps of the model include:
获取预设数量(例如,10万)的票据图片样本,将所述票据图片样本按照X:Y(例如,8:2)的比例分成第一数据集和第二数据集,第一数据集中的图片样本数量大于第二数据集中的图片样本数量,第一数据集作为训练集,第二数据集作为测试集;Obtaining a preset number (for example, 100,000) of bill picture samples, and dividing the bill picture samples into a first data set and a second data set according to a ratio of X:Y (for example, 8:2), in the first data set The number of picture samples is greater than the number of picture samples in the second data set, the first data set is used as a training set, and the second data set is used as a test set;
将第一数据集中的图片样本送入文本识别模型进行模型训练,每隔一段时间(例如每进行1000次迭代),对模型使用第二数据集进行测试,以评估当前训练的模型效果。测试时,使用训练得到的模型对第二数据集中的图片进行字符信息识别,并和测试的图片的名称做对比,以计算识别的结果和标注结果的误差。若测试时的模型对票据图片识别的误差出现发散,则调整训练参数并重新训练,使训练时模型对票据图片的识别的误差能够收敛。当误差收敛后,结束模型训练,生成的模型作为最终的文本识别模型。The image samples in the first data set are sent to the text recognition model for model training, and the model is tested using the second data set at intervals (for example, every 1000 iterations) to evaluate the effect of the currently trained model. During the test, the model obtained by the training is used to identify the character information of the picture in the second data set, and compares with the name of the tested picture to calculate the error of the recognition result and the labeling result. If the model at the time of the test diverge the error in the recognition of the bill picture, the training parameters are adjusted and retrained, so that the error of the recognition of the bill picture by the model at the training can converge. When the error converges, the model training is ended and the generated model is used as the final text recognition model.
具体地的,所述文本识别模型在进行字符识别时,对识别出的字符信息会生成对应的置信度。获取置信度的步骤可为:针对不同的待识别字段,使用对应的公式估计广义置信度;根据广义置信度获取所述置信度。例如,可以根据未知样本与代表样本的距离计算获取所述广义置信度,也可以使用多层前向神经网络获取广义置信度,可以使用统计的方法从所述广义置信度推知所述置信度。应该说明的是,技术人员可以根据需要选择合适的公式及工具针对识别的字符信息生成对应的置信度,这里不再赘述。Specifically, when the character recognition model performs character recognition, a corresponding confidence is generated for the recognized character information. The step of obtaining the confidence may be: estimating the generalized confidence using a corresponding formula for different fields to be identified; and obtaining the confidence according to the generalized confidence. For example, the generalized confidence may be obtained from the distance calculation of the unknown sample from the representative sample, or the multi-layer forward neural network may be used to obtain the generalized confidence, and the confidence may be inferred from the generalized confidence using a statistical method. It should be noted that the technician can select a suitable formula and tool according to the need to generate a corresponding confidence for the recognized character information, and details are not described herein again.
所述比较模块24,用于将获得的所述置信度与预设的置信度阈值进行比 较。The comparing module 24 is configured to compare the obtained confidence level with a preset confidence threshold.
具体地,若所述置信度高于所述置信度阈值,则保留所述目标字符区域包含的字符信息,若所述置信度低于所述置信度阈值,则将所述单据图片通过第三方进行检验识别。Specifically, if the confidence level is higher than the confidence threshold, the character information included in the target character region is retained, and if the confidence level is lower than the confidence threshold, the document image is passed to a third party. Carry out inspection identification.
所述输出模块25,用于根据比较模块24的输出值进行字符识别结果的输出,若比较器输入所述置信度高于所述置信度阈值,则按照预设规则输出所述目标字符区域包含的字符信息,若所述置信度低于所述置信度阈值,则将所述单据图片通过第三方进行检验识别,并将所述第三方检验识别的结果输出。The output module 25 is configured to output an output of the character recognition result according to the output value of the comparison module 24, and if the comparator inputs the confidence level higher than the confidence threshold, output the target character region according to a preset rule. Character information, if the confidence level is lower than the confidence threshold, the document picture is verified by a third party, and the result of the third party verification identification is output.
具体地,所述预设规则包括:票据单号可保留的前十位;医院字段使用tf-idf算法中的余弦相似度去匹配最佳的医院名称;日期部分在算法输出的原始字符串结果上提取出年月日;大写汉字金额进行转阿拉伯数字处理;所有金额部分对算法输出结果进行了格式统一,去除非相关字符并保留小数点后两位。Specifically, the preset rule includes: the first ten digits that the ticket number can be reserved; the hospital field uses the cosine similarity in the tf-idf algorithm to match the best hospital name; the original string result of the date part output in the algorithm The year, month, and day are extracted; the amount of capital Chinese characters is processed by Arabic numerals; all the amount parts are formatted uniformly for the algorithm output, and the non-related characters are removed and the two decimal places are retained.
具体地,所述第三方可以为众包平台,众包指的是一个公司或机构把过去由员工执行的工作任务,以自由自愿的形式外包给非特定的(而且通常是大型的)大众网络的做法。具体地,所述众包平台主要完成以下工作:Specifically, the third party may be a crowdsourcing platform, and the crowdsourcing refers to a company or organization that outsources tasks previously performed by employees to a non-specific (and usually large) mass network in a free and voluntary manner. way of doing. Specifically, the crowdsourcing platform mainly performs the following tasks:
1,协助算法的开发,具体包括:数据标注,数据清洗,将人工校验结果返回给识别学习系统继续进行训练,以不断提高识别模型的准确率;1. Assist in the development of algorithms, including: data annotation, data cleaning, returning the manual verification results to the recognition learning system to continue training, so as to continuously improve the accuracy of the recognition model;
2,算法人工相结合,对于复杂字段,算法实现检测文本块,再由人工解决算法难以完成的部分,比如通过人工去实现复杂的文本切分与生僻文字识别;2, the algorithm artificially combines, for complex fields, the algorithm realizes the detection of the text block, and then the part that is difficult to complete by the artificial solution algorithm, such as manually implementing complex text segmentation and unconventional text recognition;
3,人工修正算法输出的结果,将置信度低的算法输出结果转入众包,人工再进行校验,以提高最终识别准确率。3. The result of the artificial correction algorithm output, the output of the algorithm with low confidence is transferred to the crowdsourcing, and the verification is performed manually to improve the final recognition accuracy.
具体地,所述第三方在人工辅助算法输出结果过程中,为了保证准确率,我们采取任务随机发放的机制,并且每个任务发放给一定数量的用户,再取其中大多数人相同的答案,也就是最后通过交叉验证的机制来回收结果。Specifically, in the process of outputting the result of the artificial auxiliary algorithm, in order to ensure the accuracy, the third party adopts a mechanism for randomly distributing tasks, and each task is distributed to a certain number of users, and then the majority of the same answers are obtained. That is, the result is finally recovered through a cross-validation mechanism.
参阅图3所示,是本申请票据识别系统2第二实施例的程序模块图。本实施例中,所述的票据识别系统2中预处理模块21包括分类模块210、去噪模块220、矫正模块230及截取模块240。Referring to FIG. 3, it is a program block diagram of the second embodiment of the ticket identification system 2 of the present application. In this embodiment, the pre-processing module 21 in the ticket identification system 2 includes a classification module 210, a denoising module 220, a correction module 230, and an intercepting module 240.
具体地,所述分类模块210用于在收到待处理的票据图片后,利用预先训练的票据图片识别模型对收到的图片中的票据类别进行识别,并输出票据的类别识别结果(例如,医疗票据的类别包括门诊票据,住院票据,以及其他类票据)。Specifically, the classification module 210 is configured to identify a ticket category in the received picture by using a pre-trained ticket picture recognition model after receiving the bill picture to be processed, and output a category identification result of the ticket (for example, The categories of medical bills include outpatient bills, hospital bills, and other types of bills.
具体地,所述去噪模块220对所述票据图片进行图像平滑处理及小波滤波处理,其中,所述图像平滑处理可采用邻域平均法及中值滤波法,邻域平均法是将一个像素及其邻域中所有像素的平均值赋给输出图像中相应的像素,从而达到平滑的目的,其过程是使一个窗口在图像上滑动,窗口中心位置的值用窗内各点值的平均值来代替,即用几个像素的灰度平均值来代替一个像素的灰度。所述中值滤波是一种基于排序统计理论的可有效抑制噪声的非线性平滑滤波。其滤波原理是:首先确定一个以某个像素为中心点的邻域,一般为方形邻域,然后将邻域中各像素的灰度值进行排序,取中间值作为中心像素灰度的新值,这里的邻域通常被称为窗口;当窗口在图像中上下左右进行移动后,利用中值滤波算法可以很好地对图像进行平滑处理。中值滤波的输出像素是由邻域图像的中间值决定的,因而中值滤波对极限像素值(与周围像素灰度值差别较大的像素)远不如平均值那么敏感,从而可以消除孤立的噪声点,可以使图像产生较少的模糊。Specifically, the denoising module 220 performs image smoothing processing and wavelet filtering processing on the ticket image, wherein the image smoothing processing may adopt a neighborhood averaging method and a median filtering method, and the neighborhood averaging method is to perform one pixel. The average value of all the pixels in the neighborhood is assigned to the corresponding pixel in the output image to achieve the purpose of smoothing. The process is to make a window slide on the image. The value of the center position of the window is the average value of each point in the window. Instead, the grayscale average of a few pixels is used instead of the grayscale of one pixel. The median filtering is a nonlinear smoothing filter based on the sorting statistics theory that can effectively suppress noise. The filtering principle is as follows: firstly, a neighborhood with a certain pixel as a center point is determined, which is generally a square neighborhood, and then the gray values of each pixel in the neighborhood are sorted, and the intermediate value is taken as the new value of the central pixel gray scale. The neighborhood here is usually called a window; when the window moves up and down and left and right in the image, the median filtering algorithm can be used to smooth the image well. The median filtered output pixel is determined by the median value of the neighborhood image, so the median filter is far less sensitive to the extreme pixel values (pixels that differ greatly from the surrounding pixel gray values), thus eliminating isolated Noise points can make the image produce less blur.
具体地,所述矫正模块230对票据图片进行矫正处理使得票据旋转到正确的方向。Specifically, the correction module 230 performs a correction process on the ticket picture such that the ticket is rotated to the correct direction.
具体地,所述截取模块240将票据从原始票据图片中截取出来。Specifically, the intercept module 240 intercepts the ticket from the original ticket picture.
此外,本申请还提出一种票据识别方法。In addition, the present application also proposes a ticket identification method.
参阅图4所示,是本申请票据识别方法第一实施例的流程示意图。在本实施例中,根据不同的需求,图5所示的流程图中的步骤的执行顺序可以改变,某些步骤可以省略。Referring to FIG. 4, it is a schematic flowchart of the first embodiment of the ticket identification method of the present application. In this embodiment, the order of execution of the steps in the flowchart shown in FIG. 5 may be changed according to different requirements, and some steps may be omitted.
步骤S110,接收待识别的票据图片,预先训练的票据图片识别模型对所述票据图片进行处理。Step S110: Receive a picture of the ticket to be identified, and process the ticket picture by a pre-trained ticket picture recognition model.
具体地,处理方式包括对所述票据图片进行分类、去噪、矫正、截取票据。Specifically, the processing manner includes classifying, denoising, correcting, and intercepting the ticket picture.
步骤S120,使用预先训练的文本检测模型对所述票据图片进行文本检测,确定所述票据图片中包括字符的目标字符区域及所述目标字符区域包括的待识别字段。Step S120: Perform text detection on the ticket picture by using a pre-trained text detection model, and determine a target character area including characters in the ticket picture and a to-be-identified field included in the target character area.
具体地,对所述票据图片的字符区域进行区域识别,从所述票据图片上识别出包含字符信息且固定宽度为预设值(例如,16个像素宽度)的小框,并将所包含的字符信息处于同一行的小框按照先后顺序拼接在一起形成包含字符信息的目标行字符区域。Specifically, performing area recognition on a character area of the ticket picture, and identifying a small frame containing character information and having a fixed width of a preset value (for example, 16 pixel width) from the ticket picture, and including the included Small boxes whose character information is on the same line are stitched together in order to form a target line character area containing character information.
具体地,对输入的票据图片进行识别具体可为以下步骤:Specifically, identifying the input ticket picture may specifically be as follows:
第一,用VGG16的前5个卷积层得到特征地图(W*H*C)First, the feature map (W*H*C) is obtained from the first five convolutional layers of VGG16.
第二,在第五各卷积层的特征地图的每个位置上取3*3*C的窗口的特征,这些特征将用于预测该位置k个anchor对应的类别信息,位置信息。Secondly, the features of the window of 3*3*C are taken at each position of the feature map of the fifth convolutional layer, and these features are used to predict the category information and location information corresponding to the k anchors at the position.
第三,将每一行的所有窗口对应的3*3*C的特征(W*3*3*C)输入到LSTM中,得到W*256的输出Third, input the 3*3*C features (W*3*3*C) corresponding to all windows of each row into the LSTM to obtain the W*256 output.
第四,将LSTM的W*256输入到512维的全连接层Fourth, input the W*256 of LSTM to the 512-dimensional fully connected layer.
第五,全连接层特征输入到三个分类或者回归层中,因为这里默认了每个anchor的宽度是16,且不再变化。回归出来的矩形框们的宽度是一定的。Fifth, the full connectivity layer feature is entered into three classification or regression layers, because by default the width of each anchor is 16 and no longer changes. The width of the returned rectangles is fixed.
第六,用简单的文本线构造算法,把分类得到的文字的proposal中的细长的矩形框合并成文本线。Sixth, a simple text line construction algorithm is used to merge the elongated rectangular boxes in the proposal's text into a text line.
步骤S130,针对所述待识别字段,调用对应的文本识别模型进行字符识别,以分别识别出所述目标字符区域中的所述多个待识别字段包含的字符信息及获取所述文本识别模型识别所述目标字符区域包含的字符信息时生成的置信度。Step S130: Calling a corresponding text recognition model for character recognition to identify the character information included in the plurality of to-be-identified fields in the target character region and acquiring the text recognition model identification The confidence generated when the character information contained in the target character region is generated.
步骤S140,将获得的所述置信度与预设的置信度阈值进行比较,若所述置信度高于所述置信度阈值,则按照预设方法输出所述目标字符区域包含的字符信息,若所述置信度低于所述置信度阈值,则将所述单据图片通过第三方进行检验识别,并将所述第三方检验识别的结果输出。In step S140, the obtained confidence level is compared with a preset confidence threshold. If the confidence level is higher than the confidence threshold, the character information included in the target character region is output according to a preset method. If the confidence level is lower than the confidence threshold, the document picture is verified by a third party, and the result of the third party verification identification is output.
如图5所示,是本申请票据识别方法的第二实施例的流程示意图。本实施例中,所述票据识别方法的步骤S110中预处理包括步骤:As shown in FIG. 5, it is a schematic flowchart of a second embodiment of the ticket identification method of the present application. In this embodiment, the preprocessing in step S110 of the ticket identification method includes the following steps:
步骤S210,对所述票据图片进行分类。Step S210, classifying the ticket picture.
步骤S220,对所述票据图片进行去噪。Step S220, denoising the ticket picture.
步骤S230,对所述票据图片进行矫正。Step S230, correcting the picture of the ticket.
具体地,所述矫正处理包括步骤:Specifically, the correcting process includes the steps of:
确定票据中心点击票据中的印章中心点的位置;Determining the position of the center point of the stamp in the ticket center click ticket;
根据票据中心点与印章中心点的相对位置关系,确定票据的旋转角度;Determining the rotation angle of the bill according to the relative positional relationship between the center point of the bill and the center point of the stamp;
根据该角度把票据旋转到水平方向来(进行顺时针或者逆时针旋转)。Rotate the ticket to the horizontal direction according to this angle (clockwise or counterclockwise rotation).
步骤S240,对所述票据图片进行截取票据。Step S240, intercepting the ticket picture.
如图6所示,是本申请票据识别方法的第三实施例的流程示意图。本实施例中,所述票据识别方法的步骤S120中的文本检测模型的训练步骤包括:FIG. 6 is a schematic flowchart diagram of a third embodiment of the ticket identification method of the present application. In this embodiment, the training step of the text detection model in step S120 of the ticket identification method includes:
步骤S310,为每一个预设票据图片类别准备预设数量的标注有对应的图片类别的票据图片样本。Step S310, preparing a preset number of ticket picture samples marked with corresponding picture categories for each preset ticket picture category.
具体地,所述预设图片类别包括门诊票据和住院票据,所述预设数量为1000张。Specifically, the preset picture category includes an outpatient ticket and a hospitalization ticket, and the preset number is 1000 sheets.
步骤S320,将所述每一个预设图片类别对应的图片样本分为第一比例的训练子集和第二比例的验证子集,将各个训练子集中的图片样本进行混合以得到训练集,并将所述各个验证子集中的图片样本进行混合以得到验证集。Step S320, the picture samples corresponding to each preset picture category are divided into a training subset of a first ratio and a verification subset of a second ratio, and the picture samples in each training subset are mixed to obtain a training set, and The picture samples in the respective verification subsets are mixed to obtain a verification set.
具体地,所述第一比例及第二比例为80%、20%。Specifically, the first ratio and the second ratio are 80% and 20%.
步骤S330,利用所述训练集训练所述票据图片识别模型。Step S330, training the ticket picture recognition model by using the training set.
步骤S340,利用所述验证集验证训练的所述票据图片识别模型的准确率,若准确率大于或者等于预设准确率,则训练结束;若准确率小于所述预设准确率,则增加所述每一个预设图片类别对应的图片样本的数量,并重新执行以上步骤。Step S340, the accuracy of the ticket picture recognition model of the training is verified by using the verification set. If the accuracy rate is greater than or equal to the preset accuracy rate, the training ends; if the accuracy rate is less than the preset accuracy rate, the installation is increased. The number of picture samples corresponding to each preset picture category is described, and the above steps are re-executed.
具体地,所述预设准确率可为90%。Specifically, the preset accuracy rate may be 90%.
如图7所示,是本申请票据识别方法的第三实施例的流程示意图。本实施例中,所述票据识别方法的步骤S130中的文本识别模型对票据图片包括字符区域中的字符进行识别的步骤包括:FIG. 7 is a schematic flowchart diagram of a third embodiment of the ticket identification method of the present application. In this embodiment, the step of the text recognition model in the step S130 of the ticket identification method for identifying the characters in the ticket image including the character region includes:
步骤S410,所述卷积层对所述票据图片切块进行特征提取。Step S410, the convolution layer performs feature extraction on the ticket image dicing.
步骤S420,在所述卷积层输出的所有通道上,从左到右逐列拼接,得到特征序列。Step S420, splicing column by column from left to right on all channels outputted by the convolution layer to obtain a feature sequence.
步骤S430,把得到的所述特征序列放入所述循环网络层中进行字符的识别。Step S430, placing the obtained feature sequence into the loop network layer for character recognition.
步骤S440,所述转译层对识别的结果进行处理,根据字符字典生成最后的识别结果。Step S440, the translation layer processes the result of the recognition, and generates a final recognition result according to the character dictionary.
上述本申请实施例序号仅仅为了描述,不代表实施例的优劣。The serial numbers of the embodiments of the present application are merely for the description, and do not represent the advantages and disadvantages of the embodiments.
通过以上的实施方式的描述,本领域的技术人员可以清楚地了解到上述实施例方法可借助软件加必需的通用硬件平台的方式来实现,当然也可以通过硬件,但很多情况下前者是更佳的实施方式。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质(如ROM/RAM、磁碟、光盘)中,包括若干指令用以使得一台终端设备(可以是手机,计算机,服务器,空调器,或者网络设备等)执行本申请各个实施例所述的方法。Through the description of the above embodiments, those skilled in the art can clearly understand that the foregoing embodiment method can be implemented by means of software plus a necessary general hardware platform, and of course, can also be through hardware, but in many cases, the former is better. Implementation. Based on such understanding, the technical solution of the present application, which is essential or contributes to the prior art, may be embodied in the form of a software product stored in a storage medium (such as ROM/RAM, disk, The optical disc includes a number of instructions for causing a terminal device (which may be a mobile phone, a computer, a server, an air conditioner, or a network device, etc.) to perform the methods described in various embodiments of the present application.
以上仅为本申请的优选实施例,并非因此限制本申请的专利范围,凡是利用本申请说明书及附图内容所作的等效结构或等效流程变换,或直接或间接运用在其他相关的技术领域,均同理包括在本申请的专利保护范围内。The above is only a preferred embodiment of the present application, and is not intended to limit the scope of the patent application, and the equivalent structure or equivalent process transformations made by the specification and the drawings of the present application, or directly or indirectly applied to other related technical fields. The same is included in the scope of patent protection of this application.

Claims (20)

  1. 一种票据识别方法,应用于服务器,其特征在于,所述方法包括步骤:A ticket identification method is applied to a server, characterized in that the method comprises the steps of:
    接收待识别的票据图片,利用预先训练的票据图片识别模型对所述票据图片进行处理得到处理后的票据图片;Receiving a picture of the ticket to be identified, processing the picture of the ticket by using a pre-trained ticket picture recognition model to obtain a processed picture of the ticket;
    使用预先训练的文本检测模型对所述处理后的票据图片进行文本检测,确定所述处理后的票据图片中包括字符的目标字符区域及所述目标字符区域包括的待识别字段;Performing text detection on the processed ticket image by using a pre-trained text detection model, and determining that the processed ticket image includes a target character region of the character and a to-be-identified field included in the target character region;
    针对所述待识别字段,调用对应的文本识别模型进行字符识别,所述文本识别模型识别出所述待识别字段包含的字符信息,并针对识别的所述字符信息生成置信度;及And corresponding to the to-be-identified field, calling a corresponding text recognition model for character recognition, the text recognition model identifying character information included in the to-be-identified field, and generating a confidence level for the recognized character information;
    将所述置信度与预设的置信度阈值进行比较,若所述置信度高于所述置信度阈值,则按照预设方法输出所述目标字符区域包含的字符信息,若所述置信度低于所述置信度阈值,则将所述单据图片通过第三方进行检验识别,并将所述第三方检验识别的结果输出;Comparing the confidence level with a preset confidence threshold, if the confidence is higher than the confidence threshold, outputting the character information included in the target character region according to a preset method, if the confidence is low At the confidence threshold, the document picture is verified by a third party, and the result of the third party verification identification is output;
    其中,所述预设方法包括:保留票据单号前十位;使用tf-idf算法中的余弦相似度匹配医院字段最佳的医院名称;在算法输出的原始字符串结果上提取出年月日作为日期;将大写汉字金额进行转阿拉伯数字处理;去除非相关字符并保留小数点后两位,对算法输出的所有金额部分进行格式统一。The preset method includes: retaining the top ten digits of the bill number; using the cosine similarity in the tf-idf algorithm to match the hospital name of the hospital field; extracting the date and time on the original string result output by the algorithm As the date; the uppercase Chinese character amount is transferred to Arabic numerals; the non-related characters are removed and the two decimal places are reserved, and all the amount portions of the algorithm output are formatted.
  2. 如权利要求1所述的票据识别方法,其特征在于,所述票据图片识别模型对所述票据图片进行处理包括:对所述票据图片进行分类处理、去噪处理、矫正处理及截取票据处理,将通过所述分类处理、去噪处理、矫正处理及截取票据处理的所述票据图片作为处理后的票据图片。The ticket identification method according to claim 1, wherein the processing of the ticket image by the ticket picture recognition model comprises: classifying, denoising, correcting, and intercepting the ticket image, The ticket picture processed by the classification processing, the denoising processing, the correction processing, and the interception ticket processing is taken as the processed ticket picture.
  3. 如权利要求2述的票据识别方法,其特征在于,所述分类处理包括:将所述票据图片分为门诊票据,住院票据,以及其他类票据三种类别;所述去噪处理为:对所述票据图片进行图像平滑处理及小波滤波处理;所述矫正处理包括步骤:确定所述票据图片的票据中心点及所述票据图片中的印章中心点的位置,根据所述票据中心点与印章中心点的相对位置关系,确定票据的旋转角度,根据该角度把票据旋转到水平方向;所述截取票据为:将票据从原始票据图片中截取出来,去除原始票据图片的背景图片。The ticket identification method according to claim 2, wherein the classification processing comprises: dividing the bill picture into three categories: an outpatient bill, a hospital bill, and other types of bills; and the denoising processing is: The bill picture performs image smoothing processing and wavelet filtering processing; the rectifying processing includes the steps of: determining a bill center point of the bill picture and a position of a stamp center point in the bill picture, according to the bill center point and the stamp center The relative positional relationship of the points determines the rotation angle of the ticket, and rotates the ticket to the horizontal direction according to the angle; the intercepting the ticket is: cutting the ticket from the original ticket image to remove the background image of the original ticket image.
  4. 如权利要求1-3所述的票据识别方法,其特征在于,所述票据图片识别模型为深度卷积神经网络,该深度卷积神经网络为在CaffeNet的环境下选取的基于深度卷积神经网络SSD(Single Shot MultiBox Detector)的算法模型,所述票据图片识别模型的训练过程包括步骤:The ticket identification method according to any one of claims 1-3, wherein the bill picture recognition model is a deep convolutional neural network, and the deep convolutional neural network is a deep convolutional neural network selected in a CaffeNet environment. An algorithm model of the SSD (Single Shot MultiBox Detector), the training process of the ticket picture recognition model includes the steps of:
    为每一个预设票据图片类别准备预设数量的标注有对应的图片类别的票据图片样本;Preparing a preset number of ticket picture samples marked with corresponding picture categories for each preset ticket picture category;
    将所述每一个预设图片类别对应的图片样本分为第一比例的训练子集和第二比例的验证子集,将各个训练子集中的图片样本进行混合以得到训练集,并将所述各个验证子集中的图片样本进行混合以得到验证集;Dividing the picture samples corresponding to each of the preset picture categories into a training subset of the first ratio and a verification subset of the second ratio, mixing the picture samples in each training subset to obtain a training set, and The image samples in each verification subset are mixed to obtain a verification set;
    利用所述训练集训练所述票据图片识别模型;及Training the ticket picture recognition model with the training set; and
    利用所述验证集验证训练的所述票据图片识别模型的准确率,若准确率大于或者等于预设准确率,则训练结束;若准确率小于所述预设准确率,则增加所述每一个预设图片类别对应的图片样本的数量,并重新执行以上步骤;Using the verification set to verify the accuracy of the ticket picture recognition model of the training, if the accuracy rate is greater than or equal to the preset accuracy rate, the training ends; if the accuracy rate is less than the preset accuracy rate, each of the Predetermine the number of image samples corresponding to the image category, and re-execute the above steps;
    其中,所述预设图片类别包括门诊票据和住院票据,所述预设数量为1000张,所述第一比例及第二比例为80%、20%。The preset picture category includes an outpatient ticket and a hospitalization ticket, the preset number is 1000, and the first ratio and the second ratio are 80% and 20%.
  5. 如权利要求1所述的票据识别方法,其特征在于,所述文本检测模型为基于CaffeNet的CTPN(Connectionist Text Proposal Network)模型,所述文本检测模型对所述处理后的票据图片的字符区域进行区域识别,从所述处理后的票据图片上识别出包含字符信息且固定宽度为预设值的小框,将处于同一行包含字符信息的小框按照先后顺序拼接在一起,形成包含字符信息的目标行字符区域,其中,所述预设值为16个像素宽度。The ticket identification method according to claim 1, wherein the text detection model is a CaffeNet-based CTPN (Connectionist Text Proposal Network) model, and the text detection model performs a character region of the processed ticket image. The area is identified, and a small frame containing the character information and the fixed width is a preset value is identified from the processed ticket image, and the small frames containing the character information in the same line are stitched together in sequence to form the character information. A target line character area, wherein the preset value is 16 pixel widths.
  6. 如权利要求5所述的票据识别方法,其特征在于,所述文本检测模型的训练过程包括步骤:The ticket identification method according to claim 5, wherein the training process of the text detection model comprises the steps of:
    S1,针对待识别字段,获取预设数量的票据图片样本;S1. Obtain a preset number of bill picture samples for the to-be-identified field;
    S2,在各个票据图片样本上每隔第一预设数量的像素,设置第二预设数量的不同高宽比的且固定宽度为预设值的小框,在所述各个票据图片样本上对包含该待识别字段的部分或者全部字符信息的小框进行标记,将包含该待识别字段的字符信息的票据图片样本归入第一训练集,并将不包含该待识别字段的字符信息的票据图片样本归入第二训练集;S2, a second preset number of small frames of different aspect ratios with fixed widths and preset values are set on the first preset number of pixels on each ticket picture sample, and are performed on the respective ticket picture samples. a small frame containing part or all of the character information of the to-be-identified field is marked, and the ticket picture sample containing the character information of the to-be-identified field is classified into the first training set, and the ticket that does not contain the character information of the to-be-identified field is included The image sample is classified into the second training set;
    S3,分别从第一训练集和第二训练集中提取出第一预设比例的所述票据图片样本作为待训练的样本图片,并将第一训练集和第二训练集中剩余的票据图片样本作为待验证的样本图片;S3, extracting, from the first training set and the second training set, the first preset ratio of the ticket picture samples as a sample picture to be trained, and using the remaining ticket picture samples in the first training set and the second training set as Sample image to be verified;
    S4,利用提取的各个待训练的样本图片进行模型训练,以生成所述文本识别模型,并利用各个待验证的样本图片对生成的所述文本识别模型进行验证;及S4: performing model training by using the extracted sample images to be trained to generate the text recognition model, and verifying the generated text recognition model by using each sample image to be verified; and
    S5,若验证通过率大于等于预设阈值,则训练完成,若验证通过率小于预设阈值,则增加单据图片样本的数量,并重复执行步骤S2、S3、S4;S5, if the verification pass rate is greater than or equal to the preset threshold, the training is completed, if the verification pass rate is less than the preset threshold, increase the number of document picture samples, and repeat steps S2, S3, S4;
    其中,所述预设数量为10万,所述第一预设数量为16,所述第二预设数量为10,所述预设值为16个像素宽度,所述第一预设比例为80%,所述预设阈值为98%。The preset number is 100, the first preset number is 16, the second preset number is 10, the preset value is 16 pixel width, and the first preset ratio is 80%, the preset threshold is 98%.
  7. 如权利要求1所述的票据识别方法,其特征在于,所述文本识别模型包括卷积层、循环网络层及转译层,所述文本识别模型对所述目标字符区域进行字符识别的步骤包括:The ticket identification method according to claim 1, wherein the text recognition model comprises a convolution layer, a cyclic network layer and a translation layer, and the step of the text recognition model performing character recognition on the target character region comprises:
    所述卷积层对所述处理后的票据图片切块进行特征提取;The convolution layer performs feature extraction on the processed ticket picture dicing;
    在所述卷积层输出的所有通道上,从左到右逐列拼接,得到特征序列;On all the channels output by the convolutional layer, splicing from left to right column by column to obtain a feature sequence;
    把得到的所述特征序列放入所述循环网络层中进行字符的识别;Putting the obtained feature sequence into the loop network layer for character recognition;
    所述转译层对识别的结果进行处理,根据字符字典生成最后的识别结果。The translation layer processes the identified result and generates a final recognition result based on the character dictionary.
  8. 如权利要求7所述的票据识别方法,其特征在于,所述文本识别模型的训练过程包括步骤:The ticket identification method according to claim 7, wherein the training process of the text recognition model comprises the steps of:
    获取预设数量的票据图片样本,将所述票据图片样本按照预设比例分成第一数据集和第二数据集,所述第一数据集中的图片样本数量大于所述第二数据集中的图片样本数量,所述第一数据集作为训练集,所述第二数据集作为测试集;及Obtaining a preset number of ticket picture samples, and dividing the ticket picture sample into a first data set and a second data set according to a preset ratio, where the number of picture samples in the first data set is greater than the picture sample in the second data set Quantity, the first data set as a training set, and the second data set as a test set;
    将所述第一数据集中的图片样本送入所述文本识别模型进行模型训练,每进行预设次数迭代,对所述文本识别模型使用所述第二数据集进行测试,若测试时的所述文本识别模型对票据图片识别的误差出现发散,则调整训练参数并重新训练,使训练时所述文本识别模型对票据图片的识别的误差收敛。Sending a picture sample in the first data set to the text recognition model for model training, and performing a preset number of iterations, using the second data set to test the text recognition model, if the test The text recognition model diverges the error of the ticket picture recognition, adjusts the training parameters and retrains, so that the error of the recognition of the ticket picture by the text recognition model converges during training.
  9. 一种服务器,其特征在于,所述服务器包括存储器、处理器及存储在所述存储器上并可在所述处理器上运行的票据识别系统,所述票据识别系统被所述处理器执行时实现如下步骤:A server, comprising: a memory, a processor, and a ticket identification system stored on the memory and operable on the processor, the ticket identification system being implemented by the processor The following steps:
    接收待识别的票据图片,利用预先训练的票据图片识别模型对所述票据图片进行处理得到处理后的票据图片;Receiving a picture of the ticket to be identified, processing the picture of the ticket by using a pre-trained ticket picture recognition model to obtain a processed picture of the ticket;
    使用预先训练的文本检测模型对所述处理后的票据图片进行文本检测,确定所述处理后的票据图片中包括字符的目标字符区域及所述目标字符区域包括的待识别字段;Performing text detection on the processed ticket image by using a pre-trained text detection model, and determining that the processed ticket image includes a target character region of the character and a to-be-identified field included in the target character region;
    针对所述待识别字段,调用对应的文本识别模型进行字符识别,所述文本识别模型识别出所述待识别字段包含的字符信息,并针对识别的所述字符信息生成置信度;及And corresponding to the to-be-identified field, calling a corresponding text recognition model for character recognition, the text recognition model identifying character information included in the to-be-identified field, and generating a confidence level for the recognized character information;
    将所述置信度与预设的置信度阈值进行比较,若所述置信度高于所述置信度阈值,则按照预设方法输出所述目标字符区域包含的字符信息,若所述 置信度低于所述置信度阈值,则将所述单据图片通过第三方进行检验识别,并将所述第三方检验识别的结果输出;Comparing the confidence level with a preset confidence threshold, if the confidence is higher than the confidence threshold, outputting the character information included in the target character region according to a preset method, if the confidence is low At the confidence threshold, the document picture is verified by a third party, and the result of the third party verification identification is output;
    其中,所述预设方法包括:保留票据单号前十位;使用tf-idf算法中的余弦相似度匹配医院字段最佳的医院名称;在算法输出的原始字符串结果上提取出年月日作为日期;将大写汉字金额进行转阿拉伯数字处理;去除非相关字符并保留小数点后两位,对算法输出的所有金额部分进行格式统一。The preset method includes: retaining the top ten digits of the bill number; using the cosine similarity in the tf-idf algorithm to match the hospital name of the hospital field; extracting the date and time on the original string result output by the algorithm As the date; the uppercase Chinese character amount is transferred to Arabic numerals; the non-related characters are removed and the two decimal places are reserved, and all the amount portions of the algorithm output are formatted.
  10. 如权利要求9所述的服务器,其特征在于,所述票据图片识别模型对所述票据图片进行处理包括:对所述票据图片进行分类处理、去噪处理、矫正处理及截取票据处理,将通过所述分类处理、去噪处理、矫正处理及截取票据处理的所述票据图片作为处理后的票据图片。The server according to claim 9, wherein the processing of the ticket picture by the ticket picture recognition model comprises: classifying, denoising, correcting, and intercepting the ticket image, which will pass The classification process, the denoising process, the correction process, and the note picture of the interception ticket processing are used as the processed ticket picture.
  11. 如权利要求10所述的服务器,其特征在于,所述分类处理包括:将所述票据图片分为门诊票据,住院票据,以及其他类票据三种类别;所述去噪处理为:对所述票据图片进行图像平滑处理及小波滤波处理;所述矫正处理包括步骤:确定所述票据图片的票据中心点及所述票据图片中的印章中心点的位置,根据所述票据中心点与印章中心点的相对位置关系,确定票据的旋转角度,根据该角度把票据旋转到水平方向;所述截取票据为:将票据从原始票据图片中截取出来,去除原始票据图片的背景图片。The server according to claim 10, wherein said sorting processing comprises: dividing said bill picture into three categories of outpatient bills, hospital bills, and other types of bills; said denoising processing is: said The bill picture performs image smoothing processing and wavelet filtering processing; the rectifying processing includes the steps of: determining a bill center point of the bill picture and a position of a stamp center point in the bill picture, according to the bill center point and the stamp center point The relative positional relationship determines the rotation angle of the ticket, and rotates the ticket to the horizontal direction according to the angle; the intercepting the ticket is: cutting the ticket from the original ticket image to remove the background image of the original ticket image.
  12. 如权利要求9-11所述的服务器,其特征在于,所述票据图片识别模型为深度卷积神经网络,该深度卷积神经网络为在CaffeNet的环境下选取的基于深度卷积神经网络SSD(Single Shot MultiBox Detector)的算法模型,所述票据图片识别模型的训练过程包括步骤:The server according to any one of claims 9-11, wherein the ticket picture recognition model is a deep convolutional neural network, which is a deep convolutional neural network SSD (selected in a CaffeNet environment) The algorithm model of the Single Shot MultiBox Detector), the training process of the ticket picture recognition model includes the steps:
    为每一个预设票据图片类别准备预设数量的标注有对应的图片类别的票据图片样本;Preparing a preset number of ticket picture samples marked with corresponding picture categories for each preset ticket picture category;
    将所述每一个预设图片类别对应的图片样本分为第一比例的训练子集和第二比例的验证子集,将各个训练子集中的图片样本进行混合以得到训练集,并将所述各个验证子集中的图片样本进行混合以得到验证集;Dividing the picture samples corresponding to each of the preset picture categories into a training subset of the first ratio and a verification subset of the second ratio, mixing the picture samples in each training subset to obtain a training set, and The image samples in each verification subset are mixed to obtain a verification set;
    利用所述训练集训练所述票据图片识别模型;及Training the ticket picture recognition model with the training set; and
    利用所述验证集验证训练的所述票据图片识别模型的准确率,若准确率大于或者等于预设准确率,则训练结束;若准确率小于所述预设准确率,则增加所述每一个预设图片类别对应的图片样本的数量,并重新执行以上步骤;Using the verification set to verify the accuracy of the ticket picture recognition model of the training, if the accuracy rate is greater than or equal to the preset accuracy rate, the training ends; if the accuracy rate is less than the preset accuracy rate, each of the Predetermine the number of image samples corresponding to the image category, and re-execute the above steps;
    其中,所述预设图片类别包括门诊票据和住院票据,所述预设数量为1000张,所述第一比例及第二比例为80%、20%。The preset picture category includes an outpatient ticket and a hospitalization ticket, the preset number is 1000, and the first ratio and the second ratio are 80% and 20%.
  13. 如权利要求9所述的服务器,其特征在于,所述文本检测模型为基于 CaffeNet的CTPN(Connectionist Text Proposal Network)模型,所述文本检测模型对所述处理后的票据图片的字符区域进行区域识别,从所述处理后的票据图片上识别出包含字符信息且固定宽度为预设值的小框,将处于同一行包含字符信息的小框按照先后顺序拼接在一起,形成包含字符信息的目标行字符区域,其中,所述预设值为16个像素宽度。The server according to claim 9, wherein the text detection model is a CaffeNet-based CTPN (Connectionist Text Proposal Network) model, and the text detection model performs area identification on a character region of the processed ticket picture. Recognizing a small frame containing the character information and having a fixed width as a preset value from the processed ticket image, and splicing the small frames in the same line containing the character information in a sequential order to form a target line containing the character information. a character area, wherein the preset value is 16 pixels wide.
  14. 如权利要求13所述的服务器,其特征在于,所述文本检测模型的训练过程包括步骤:The server according to claim 13, wherein the training process of the text detection model comprises the steps of:
    S1,针对待识别字段,获取预设数量的票据图片样本;S1. Obtain a preset number of bill picture samples for the to-be-identified field;
    S2,在各个票据图片样本上每隔第一预设数量的像素,设置第二预设数量的不同高宽比的且固定宽度为预设值的小框,在所述各个票据图片样本上对包含该待识别字段的部分或者全部字符信息的小框进行标记,将包含该待识别字段的字符信息的票据图片样本归入第一训练集,并将不包含该待识别字段的字符信息的票据图片样本归入第二训练集;S2, a second preset number of small frames of different aspect ratios with fixed widths and preset values are set on the first preset number of pixels on each ticket picture sample, and are performed on the respective ticket picture samples. a small frame containing part or all of the character information of the to-be-identified field is marked, and the ticket picture sample containing the character information of the to-be-identified field is classified into the first training set, and the ticket that does not contain the character information of the to-be-identified field is included The image sample is classified into the second training set;
    S3,分别从第一训练集和第二训练集中提取出第一预设比例的所述票据图片样本作为待训练的样本图片,并将第一训练集和第二训练集中剩余的票据图片样本作为待验证的样本图片;S3, extracting, from the first training set and the second training set, the first preset ratio of the ticket picture samples as a sample picture to be trained, and using the remaining ticket picture samples in the first training set and the second training set as Sample image to be verified;
    S4,利用提取的各个待训练的样本图片进行模型训练,以生成所述文本识别模型,并利用各个待验证的样本图片对生成的所述文本识别模型进行验证;及S4: performing model training by using the extracted sample images to be trained to generate the text recognition model, and verifying the generated text recognition model by using each sample image to be verified; and
    S5,若验证通过率大于等于预设阈值,则训练完成,若验证通过率小于预设阈值,则增加单据图片样本的数量,并重复执行步骤S2、S3、S4;S5, if the verification pass rate is greater than or equal to the preset threshold, the training is completed, if the verification pass rate is less than the preset threshold, increase the number of document picture samples, and repeat steps S2, S3, S4;
    其中,所述预设数量为10万,所述第一预设数量为16,所述第二预设数量为10,所述预设值为16个像素宽度,所述第一预设比例为80%,所述预设阈值为98%。The preset number is 100, the first preset number is 16, the second preset number is 10, the preset value is 16 pixel width, and the first preset ratio is 80%, the preset threshold is 98%.
  15. 如权利要求9所述的服务器,其特征在于,所述文本识别模型包括卷积层、循环网络层及转译层,所述文本识别模型对所述目标字符区域进行字符识别的步骤包括:The server according to claim 9, wherein the text recognition model comprises a convolution layer, a loop network layer and a translation layer, and the step of the text recognition model performing character recognition on the target character region comprises:
    所述卷积层对所述处理后的票据图片切块进行特征提取;The convolution layer performs feature extraction on the processed ticket picture dicing;
    在所述卷积层输出的所有通道上,从左到右逐列拼接,得到特征序列;On all the channels output by the convolutional layer, splicing from left to right column by column to obtain a feature sequence;
    把得到的所述特征序列放入所述循环网络层中进行字符的识别;Putting the obtained feature sequence into the loop network layer for character recognition;
    所述转译层对识别的结果进行处理,根据字符字典生成最后的识别结果。The translation layer processes the identified result and generates a final recognition result based on the character dictionary.
  16. 如权利要求15所述的服务器,其特征在于,所述文本识别模型的训练过程包括步骤:The server according to claim 15, wherein the training process of the text recognition model comprises the steps of:
    获取预设数量的票据图片样本,将所述票据图片样本按照预设比例分成第一数据集和第二数据集,所述第一数据集中的图片样本数量大于所述第二数据集中的图片样本数量,所述第一数据集作为训练集,所述第二数据集作为测试集;及Obtaining a preset number of ticket picture samples, and dividing the ticket picture sample into a first data set and a second data set according to a preset ratio, where the number of picture samples in the first data set is greater than the picture sample in the second data set Quantity, the first data set as a training set, and the second data set as a test set;
    将所述第一数据集中的图片样本送入所述文本识别模型进行模型训练,每进行预设次数迭代,对所述文本识别模型使用所述第二数据集进行测试,若测试时的所述文本识别模型对票据图片识别的误差出现发散,则调整训练参数并重新训练,使训练时所述文本识别模型对票据图片的识别的误差收敛。Sending a picture sample in the first data set to the text recognition model for model training, and performing a preset number of iterations, using the second data set to test the text recognition model, if the test The text recognition model diverges the error of the ticket picture recognition, adjusts the training parameters and retrains, so that the error of the recognition of the ticket picture by the text recognition model converges during training.
  17. 一种计算机可读存储介质,所述计算机可读存储介质存储有票据识别系统,所述票据识别系统可被至少一个处理器执行时,实现如下步骤:A computer readable storage medium storing a ticket identification system, wherein when the ticket identification system is executable by at least one processor, the following steps are implemented:
    接收待识别的票据图片,利用预先训练的票据图片识别模型对所述票据图片进行处理得到处理后的票据图片;Receiving a picture of the ticket to be identified, processing the picture of the ticket by using a pre-trained ticket picture recognition model to obtain a processed picture of the ticket;
    使用预先训练的文本检测模型对所述处理后的票据图片进行文本检测,确定所述处理后的票据图片中包括字符的目标字符区域及所述目标字符区域包括的待识别字段;Performing text detection on the processed ticket image by using a pre-trained text detection model, and determining that the processed ticket image includes a target character region of the character and a to-be-identified field included in the target character region;
    针对所述待识别字段,调用对应的文本识别模型进行字符识别,所述文本识别模型识别出所述待识别字段包含的字符信息,并针对识别的所述字符信息生成置信度;及And corresponding to the to-be-identified field, calling a corresponding text recognition model for character recognition, the text recognition model identifying character information included in the to-be-identified field, and generating a confidence level for the recognized character information;
    将所述置信度与预设的置信度阈值进行比较,若所述置信度高于所述置信度阈值,则按照预设方法输出所述目标字符区域包含的字符信息,若所述置信度低于所述置信度阈值,则将所述单据图片通过第三方进行检验识别,并将所述第三方检验识别的结果输出;Comparing the confidence level with a preset confidence threshold, if the confidence is higher than the confidence threshold, outputting the character information included in the target character region according to a preset method, if the confidence is low At the confidence threshold, the document picture is verified by a third party, and the result of the third party verification identification is output;
    其中,所述预设方法包括:保留票据单号前十位;使用tf-idf算法中的余弦相似度匹配医院字段最佳的医院名称;在算法输出的原始字符串结果上提取出年月日作为日期;将大写汉字金额进行转阿拉伯数字处理;去除非相关字符并保留小数点后两位,对算法输出的所有金额部分进行格式统一。The preset method includes: retaining the top ten digits of the bill number; using the cosine similarity in the tf-idf algorithm to match the hospital name of the hospital field; extracting the date and time on the original string result output by the algorithm As the date; the uppercase Chinese character amount is transferred to Arabic numerals; the non-related characters are removed and the two decimal places are reserved, and all the amount portions of the algorithm output are formatted.
  18. 如权利要求17所述的计算机可读存储介质,其特征在于,所述票据图片识别模型对所述票据图片进行处理包括:对所述票据图片进行分类处理、去噪处理、矫正处理及截取票据处理,将通过所述分类处理、去噪处理、矫正处理及截取票据处理的所述票据图片作为处理后的票据图片。The computer readable storage medium according to claim 17, wherein the processing of the ticket image by the ticket picture recognition model comprises: classifying, denoising, correcting, and intercepting the ticket image Processing, the ticket picture processed by the classification processing, the denoising processing, the correction processing, and the interception ticket processing is used as the processed ticket picture.
  19. 如权利要求18所述的计算机可读存储介质,其特征在于,所述分类处理包括:将所述票据图片分为门诊票据,住院票据,以及其他类票据三种类别;所述去噪处理为:对所述票据图片进行图像平滑处理及小波滤波处理; 所述矫正处理包括步骤:确定所述票据图片的票据中心点及所述票据图片中的印章中心点的位置,根据所述票据中心点与印章中心点的相对位置关系,确定票据的旋转角度,根据该角度把票据旋转到水平方向;所述截取票据为:将票据从原始票据图片中截取出来,去除原始票据图片的背景图片。The computer readable storage medium according to claim 18, wherein said sorting processing comprises: dividing said bill picture into three categories of outpatient bills, hospital bills, and other types of bills; said denoising processing is Performing an image smoothing process and a wavelet filtering process on the ticket image; the correcting process includes the steps of: determining a ticket center point of the ticket picture and a position of a stamp center point in the ticket picture, according to the ticket center point The relative positional relationship with the center point of the stamp determines the rotation angle of the ticket, and rotates the ticket to the horizontal direction according to the angle; the intercepting ticket is: the ticket is taken out from the original ticket image, and the background image of the original ticket image is removed.
  20. 如权利要求17-19所述的计算机可读存储介质,其特征在于,所述票据图片识别模型为深度卷积神经网络,该深度卷积神经网络为在CaffeNet的环境下选取的基于深度卷积神经网络SSD(Single Shot MultiBox Detector)的算法模型,所述票据图片识别模型的训练过程包括步骤:A computer readable storage medium according to any of claims 17-19, wherein said ticket picture recognition model is a deep convolutional neural network, which is based on deep convolution selected in the environment of CaffeNet An algorithm model of a Sin (Single Shot MultiBox Detector), the training process of the bill picture recognition model includes the steps of:
    为每一个预设票据图片类别准备预设数量的标注有对应的图片类别的票据图片样本;Preparing a preset number of ticket picture samples marked with corresponding picture categories for each preset ticket picture category;
    将所述每一个预设图片类别对应的图片样本分为第一比例的训练子集和第二比例的验证子集,将各个训练子集中的图片样本进行混合以得到训练集,并将所述各个验证子集中的图片样本进行混合以得到验证集;Dividing the picture samples corresponding to each of the preset picture categories into a training subset of the first ratio and a verification subset of the second ratio, mixing the picture samples in each training subset to obtain a training set, and The image samples in each verification subset are mixed to obtain a verification set;
    利用所述训练集训练所述票据图片识别模型;及Training the ticket picture recognition model with the training set; and
    利用所述验证集验证训练的所述票据图片识别模型的准确率,若准确率大于或者等于预设准确率,则训练结束;若准确率小于所述预设准确率,则增加所述每一个预设图片类别对应的图片样本的数量,并重新执行以上步骤;Using the verification set to verify the accuracy of the ticket picture recognition model of the training, if the accuracy rate is greater than or equal to the preset accuracy rate, the training ends; if the accuracy rate is less than the preset accuracy rate, each of the Predetermine the number of image samples corresponding to the image category, and re-execute the above steps;
    其中,所述预设图片类别包括门诊票据和住院票据,所述预设数量为1000张,所述第一比例及第二比例为80%、20%。The preset picture category includes an outpatient ticket and a hospitalization ticket, the preset number is 1000, and the first ratio and the second ratio are 80% and 20%.
PCT/CN2018/089202 2018-03-14 2018-05-31 Bill recognition method, server, and computer readable storage medium WO2019174130A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201810208586.9A CN108446621A (en) 2018-03-14 2018-03-14 Bank slip recognition method, server and computer readable storage medium
CN201810208586.9 2018-03-14

Publications (1)

Publication Number Publication Date
WO2019174130A1 true WO2019174130A1 (en) 2019-09-19

Family

ID=63194307

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/089202 WO2019174130A1 (en) 2018-03-14 2018-05-31 Bill recognition method, server, and computer readable storage medium

Country Status (2)

Country Link
CN (1) CN108446621A (en)
WO (1) WO2019174130A1 (en)

Cited By (113)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110728277A (en) * 2019-09-27 2020-01-24 达而观信息科技(上海)有限公司 Seal intelligent detection and identification method
CN110766017A (en) * 2019-10-22 2020-02-07 国网新疆电力有限公司信息通信公司 Mobile terminal character recognition method and system based on deep learning
CN110826330A (en) * 2019-10-12 2020-02-21 上海数禾信息科技有限公司 Name recognition method and device, computer equipment and readable storage medium
CN110827333A (en) * 2019-10-31 2020-02-21 国网河北省电力有限公司电力科学研究院 Relay protection pressing plate splicing identification method, system and medium
CN110866495A (en) * 2019-11-14 2020-03-06 杭州睿琪软件有限公司 Bill image recognition method, bill image recognition device, bill image recognition equipment, training method and storage medium
CN110941717A (en) * 2019-11-22 2020-03-31 深圳马可孛罗科技有限公司 Passenger ticket rule analysis method and device, electronic equipment and computer readable medium
CN110991520A (en) * 2019-11-29 2020-04-10 汉海信息技术(上海)有限公司 Method and device for generating training sample
CN111046886A (en) * 2019-12-12 2020-04-21 吉林大学 Automatic identification method, device and equipment for number plate and computer readable storage medium
CN111079117A (en) * 2019-11-28 2020-04-28 上海三零卫士信息安全有限公司 LeNet and SSD-based point-contact type verification code automatic identification method
CN111079641A (en) * 2019-12-13 2020-04-28 科大讯飞股份有限公司 Answering content identification method, related device and readable storage medium
CN111126125A (en) * 2019-10-15 2020-05-08 平安科技(深圳)有限公司 Method, device and equipment for extracting target text in certificate and readable storage medium
CN111126396A (en) * 2019-12-25 2020-05-08 北京科技大学 Image recognition method and device, computer equipment and storage medium
CN111144358A (en) * 2019-12-30 2020-05-12 上海眼控科技股份有限公司 Vehicle quality certificate verification method and device, computer equipment and storage medium
CN111160265A (en) * 2019-12-30 2020-05-15 Oppo(重庆)智能科技有限公司 File conversion method and device, storage medium and electronic equipment
CN111178355A (en) * 2019-12-27 2020-05-19 中化资本有限公司 Seal identification method and device and storage medium
CN111191695A (en) * 2019-12-19 2020-05-22 杭州安恒信息技术股份有限公司 Website picture tampering detection method based on deep learning
CN111209792A (en) * 2019-12-02 2020-05-29 河南科技学院 Image identification method and device for value-added tax general invoice
CN111291726A (en) * 2020-03-12 2020-06-16 泰康保险集团股份有限公司 Medical bill sorting method, device, equipment and medium
CN111325104A (en) * 2020-01-22 2020-06-23 平安科技(深圳)有限公司 Text recognition method, device and storage medium
CN111340022A (en) * 2020-02-24 2020-06-26 深圳市华云中盛科技股份有限公司 Identity card information identification method and device, computer equipment and storage medium
CN111353484A (en) * 2020-02-28 2020-06-30 深圳前海微众银行股份有限公司 Image character recognition method, device, equipment and readable storage medium
CN111414911A (en) * 2020-03-23 2020-07-14 湖南信息学院 Card number identification method and system based on deep learning
CN111414917A (en) * 2020-03-18 2020-07-14 民生科技有限责任公司 Identification method of low-pixel-density text
CN111414906A (en) * 2020-03-05 2020-07-14 北京交通大学 Data synthesis and text recognition method for paper bill picture
CN111445058A (en) * 2020-03-04 2020-07-24 中国平安人寿保险股份有限公司 Data analysis method, device, equipment and computer readable storage medium
CN111476243A (en) * 2020-04-13 2020-07-31 中国工商银行股份有限公司 Image character recognition method and device
CN111489347A (en) * 2020-04-15 2020-08-04 深圳前海环融联易信息科技服务有限公司 Business license picture quality detection method and device, computer equipment and storage medium
CN111507324A (en) * 2020-03-16 2020-08-07 平安科技(深圳)有限公司 Card frame identification method, device, equipment and computer storage medium
CN111507356A (en) * 2020-04-20 2020-08-07 恒银金融科技股份有限公司 Segmentation method of handwritten characters of lower case money of financial bills
CN111581242A (en) * 2020-04-30 2020-08-25 北京贝禾无忧科技有限公司 Method and system for identifying enterprise bill use
CN111582786A (en) * 2020-04-29 2020-08-25 上海中通吉网络技术有限公司 Express bill number identification method, device and equipment based on machine learning
CN111582259A (en) * 2020-04-10 2020-08-25 支付宝实验室(新加坡)有限公司 Machine-readable code identification method and device, electronic equipment and storage medium
CN111598122A (en) * 2020-04-01 2020-08-28 深圳壹账通智能科技有限公司 Data verification method and device, electronic equipment and storage medium
CN111639648A (en) * 2020-05-26 2020-09-08 浙江大华技术股份有限公司 Certificate identification method and device, computing equipment and storage medium
CN111639566A (en) * 2020-05-19 2020-09-08 浙江大华技术股份有限公司 Method and device for extracting form information
CN111666886A (en) * 2020-06-08 2020-09-15 成都知识视觉科技有限公司 Image preprocessing method for medical document structured knowledge extraction
CN111666932A (en) * 2020-05-27 2020-09-15 平安科技(深圳)有限公司 Document auditing method and device, computer equipment and storage medium
CN111680691A (en) * 2020-04-30 2020-09-18 北京三快在线科技有限公司 Character detection method and device, electronic equipment and computer readable storage medium
CN111753744A (en) * 2020-06-28 2020-10-09 北京百度网讯科技有限公司 Method, device and equipment for classifying bill images and readable storage medium
CN111783785A (en) * 2020-07-02 2020-10-16 上海海事大学 Water meter identification system and method
CN111814785A (en) * 2020-06-11 2020-10-23 浙江大华技术股份有限公司 Invoice recognition method, training method of related model, related equipment and device
CN111814539A (en) * 2020-05-28 2020-10-23 平安科技(深圳)有限公司 Character recognition method and device based on infrared light and ultraviolet light and computer equipment
CN111832657A (en) * 2020-07-20 2020-10-27 上海眼控科技股份有限公司 Text recognition method and device, computer equipment and storage medium
CN111860489A (en) * 2019-12-09 2020-10-30 北京嘀嘀无限科技发展有限公司 Certificate image correction method, device, equipment and storage medium
CN111914729A (en) * 2020-07-28 2020-11-10 中国建设银行股份有限公司 Voucher association method and device, computer equipment and storage medium
CN111914706A (en) * 2020-07-22 2020-11-10 厦门商集网络科技有限责任公司 Method and device for detecting and controlling quality of character detection output result
CN111932593A (en) * 2020-07-21 2020-11-13 湖南中联重科智能技术有限公司 Image registration method, system and equipment based on touch screen gesture correction
CN111931229A (en) * 2020-07-10 2020-11-13 深信服科技股份有限公司 Data identification method and device and storage medium
CN111950356A (en) * 2020-06-30 2020-11-17 深圳市雄帝科技股份有限公司 Seal text positioning method and device and electronic equipment
CN111950380A (en) * 2020-07-19 2020-11-17 中国建设银行股份有限公司 Bill auditing method and device, electronic equipment and computer-readable storage medium
CN112052853A (en) * 2020-09-09 2020-12-08 国家气象信息中心 Text positioning method of handwritten meteorological archive data based on deep learning
CN112070086A (en) * 2020-09-09 2020-12-11 平安科技(深圳)有限公司 Method for optimizing text recognition system, computer device, and storage medium
CN112069894A (en) * 2020-08-03 2020-12-11 许继集团有限公司 Wire strand scattering identification method based on fast-RCNN model
CN112100366A (en) * 2020-09-17 2020-12-18 广联达科技股份有限公司 Pavement structure layer display method and device, computer equipment and storage medium
CN112115921A (en) * 2020-09-30 2020-12-22 北京百度网讯科技有限公司 True and false identification method and device and electronic equipment
CN112232340A (en) * 2020-10-15 2021-01-15 马婧 Method and device for identifying printed information on surface of object
CN112232296A (en) * 2020-11-09 2021-01-15 北京爱笔科技有限公司 Hyper-parameter exploration method and device
CN112232338A (en) * 2020-10-13 2021-01-15 中国平安人寿保险股份有限公司 Data entry method, device, equipment and storage medium for underwriting and claim settlement process
CN112257613A (en) * 2020-10-23 2021-01-22 中国平安人寿保险股份有限公司 Physical examination report information structured extraction method and device and computer equipment
CN112257768A (en) * 2020-10-19 2021-01-22 广州金融科技股份有限公司 Method and device for identifying illegal financial picture and computer storage medium
CN112270323A (en) * 2020-09-29 2021-01-26 西安电子科技大学 Intelligent medical reinspection control method, system, medium, computer equipment and application
CN112287898A (en) * 2020-11-26 2021-01-29 深源恒际科技有限公司 Method and system for evaluating text detection quality of image
CN112288039A (en) * 2020-11-26 2021-01-29 深源恒际科技有限公司 Sample labeling method and system for OCR model training
CN112329708A (en) * 2020-11-24 2021-02-05 北京百度网讯科技有限公司 Bill identification method and device
CN112381702A (en) * 2020-12-02 2021-02-19 北京皮尔布莱尼软件有限公司 Image privacy processing method, computing device and storage medium
CN112396048A (en) * 2020-11-17 2021-02-23 中国平安人寿保险股份有限公司 Picture information extraction method and device, computer equipment and storage medium
CN112418206A (en) * 2020-11-20 2021-02-26 平安普惠企业管理有限公司 Picture classification method based on position detection model and related equipment thereof
CN112464941A (en) * 2020-10-23 2021-03-09 北京思特奇信息技术股份有限公司 Invoice identification method and system based on neural network
CN112488108A (en) * 2020-12-11 2021-03-12 广州小鹏自动驾驶科技有限公司 Parking space number identification method and device, electronic equipment and storage medium
CN112507973A (en) * 2020-12-29 2021-03-16 中国电子科技集团公司第二十八研究所 Text and picture recognition system based on OCR technology
CN112508017A (en) * 2020-12-15 2021-03-16 通号智慧城市研究设计院有限公司 Intelligent digital display instrument reading identification method, system, processing equipment and storage medium
CN112507946A (en) * 2020-12-18 2021-03-16 北京百度网讯科技有限公司 Method, apparatus, device and storage medium for processing image
CN112651913A (en) * 2020-12-17 2021-04-13 广州市申迪计算机系统有限公司 Invoice seal desalination method, system and device and computer storage medium
CN112818961A (en) * 2021-03-26 2021-05-18 北京东方金朔信息技术有限公司 Image feature identification method and device
CN112818951A (en) * 2021-03-11 2021-05-18 南京大学 Ticket identification method
CN112837466A (en) * 2020-12-18 2021-05-25 北京百度网讯科技有限公司 Bill recognition method, device, equipment and storage medium
CN112926565A (en) * 2021-02-25 2021-06-08 中国平安人寿保险股份有限公司 Picture text recognition method, system, device and storage medium
CN112949450A (en) * 2021-02-25 2021-06-11 北京百度网讯科技有限公司 Bill processing method, bill processing device, electronic device and storage medium
CN112947944A (en) * 2019-12-11 2021-06-11 中盈优创资讯科技有限公司 Business work order processing method and device
CN112966583A (en) * 2021-02-26 2021-06-15 深圳壹账通智能科技有限公司 Image processing method, image processing device, computer equipment and storage medium
CN112990172A (en) * 2019-12-02 2021-06-18 阿里巴巴集团控股有限公司 Text recognition method, character recognition method and device
CN112990208A (en) * 2019-12-12 2021-06-18 搜狗(杭州)智能科技有限公司 Text recognition method and device
CN112989990A (en) * 2021-03-09 2021-06-18 平安科技(深圳)有限公司 Medical bill identification method, device, equipment and storage medium
CN113033240A (en) * 2019-12-09 2021-06-25 上海高德威智能交通系统有限公司 Multi-line text recognition method, model training method, device, equipment and medium
CN113111882A (en) * 2021-02-24 2021-07-13 北京邮电大学 Card identification method and device, electronic equipment and storage medium
CN113128490A (en) * 2021-04-28 2021-07-16 湖南荣冠智能科技有限公司 Prescription information scanning and automatic identification method
CN113221869A (en) * 2021-05-25 2021-08-06 中国平安人寿保险股份有限公司 Medical invoice structured information extraction method, device and equipment and storage medium
CN113239807A (en) * 2021-05-14 2021-08-10 北京百度网讯科技有限公司 Method and device for training bill recognition model and bill recognition
CN113255642A (en) * 2021-05-07 2021-08-13 北京精友时代信息技术发展有限公司 Medical bill information integration method for injury claims
CN113255767A (en) * 2021-05-25 2021-08-13 深圳壹账通智能科技有限公司 Bill classification method, device, equipment and storage medium
CN113269102A (en) * 2021-05-28 2021-08-17 中邮信息科技(北京)有限公司 Seal information identification method and device, computer equipment and storage medium
CN113313114A (en) * 2021-06-11 2021-08-27 北京百度网讯科技有限公司 Certificate information acquisition method, device, equipment and storage medium
CN113435439A (en) * 2021-06-30 2021-09-24 青岛海尔科技有限公司 Document auditing method and device, storage medium and electronic device
CN113537229A (en) * 2021-08-27 2021-10-22 广州广电运通金融电子股份有限公司 Bill image generation method and device, computer equipment and storage medium
CN113591705A (en) * 2021-07-30 2021-11-02 华东理工大学 Inspection robot instrument recognition system and method and storage medium
CN113627430A (en) * 2021-08-12 2021-11-09 石家庄铁道大学 Cable number detection method and device and electronic equipment
CN113627395A (en) * 2021-09-17 2021-11-09 平安银行股份有限公司 Text recognition method, text recognition device, text recognition medium and electronic equipment
CN113657377A (en) * 2021-07-22 2021-11-16 西南财经大学 Structured recognition method for airplane ticket printing data image
CN113869131A (en) * 2021-09-01 2021-12-31 南京烽火天地通信科技有限公司 Method for structuring textualized business license picture
CN113963147A (en) * 2021-09-26 2022-01-21 西安交通大学 Key information extraction method and system based on semantic segmentation
CN114240407A (en) * 2021-11-17 2022-03-25 广东电网有限责任公司 Bill risk conduction quantitative evaluation system and method based on block chain
CN114267115A (en) * 2021-12-27 2022-04-01 江苏银承网络科技股份有限公司 Bill recognition method and system
CN114299528A (en) * 2021-12-27 2022-04-08 万达信息股份有限公司 Information extraction and structuring method for scanned document
CN114358943A (en) * 2022-01-12 2022-04-15 平安普惠企业管理有限公司 Online bill processing method and device, electronic equipment and storage medium
CN114708603A (en) * 2022-05-25 2022-07-05 杭州咏柳科技有限公司 Method, system, device and medium for identifying key information in medical bill
CN114818995A (en) * 2022-06-27 2022-07-29 广东电网有限责任公司佛山供电局 Method, device and equipment for identifying and early warning illegal work without work ticket
CN115062722A (en) * 2022-07-06 2022-09-16 哈尔滨璧能科技有限公司 AI training method based on cloud service big data cleaning and artificial intelligence cloud system
CN115170261A (en) * 2022-06-08 2022-10-11 姚远 Systematic method for hospital financial audit
CN115690803A (en) * 2022-10-31 2023-02-03 中电金信软件(上海)有限公司 Digital image recognition method and device, electronic equipment and readable storage medium
WO2023078264A1 (en) * 2021-11-03 2023-05-11 中移(苏州)软件技术有限公司 Method and apparatus for training business card information extraction system, and computer-readable storage medium
CN117197816A (en) * 2023-06-19 2023-12-08 珠海盈米基金销售有限公司 User material identification method and system
CN117408764A (en) * 2023-12-13 2024-01-16 江西曼荼罗软件有限公司 Medical fee change method, system, storage medium and electronic equipment
CN117649672A (en) * 2024-01-30 2024-03-05 湖南大学 Font type visual detection method and system based on active learning and transfer learning

Families Citing this family (117)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109241271B (en) * 2018-08-30 2021-09-17 天津做票君机器人科技有限公司 Method for identifying draft due date in natural language by draft transaction robot
CN109117848B (en) * 2018-09-07 2022-11-18 泰康保险集团股份有限公司 Text line character recognition method, device, medium and electronic equipment
CN110895691B (en) * 2018-09-12 2023-06-23 阿里巴巴集团控股有限公司 Image processing method and device and electronic equipment
CN109460765A (en) * 2018-09-25 2019-03-12 平安科技(深圳)有限公司 Driving license is taken pictures recognition methods, device and the electronic equipment of image in natural scene
CN109637000B (en) * 2018-10-23 2021-12-28 深圳壹账通智能科技有限公司 Invoice detection method and device, storage medium and electronic terminal
CN109635627A (en) * 2018-10-23 2019-04-16 中国平安财产保险股份有限公司 Pictorial information extracting method, device, computer equipment and storage medium
CN111091499B (en) * 2018-10-24 2023-05-23 方正国际软件(北京)有限公司 Mobile terminal image correction method and device
CN109461247A (en) * 2018-10-29 2019-03-12 北京慧流科技有限公司 Note validating method and device, electronic equipment and storage medium
CN109685100B (en) * 2018-11-12 2024-05-10 平安科技(深圳)有限公司 Character recognition method, server and computer readable storage medium
CN109583443B (en) * 2018-11-15 2022-10-18 四川长虹电器股份有限公司 Video content judgment method based on character recognition
CN109635796B (en) * 2018-11-20 2021-09-28 泰康保险集团股份有限公司 Questionnaire recognition method, device and equipment
CN109542649A (en) * 2018-11-23 2019-03-29 于兴虎 A method of image memory is quickly shared across language under vision servo system
CN109784339A (en) * 2018-12-13 2019-05-21 平安普惠企业管理有限公司 Picture recognition test method, device, computer equipment and storage medium
CN109670494B (en) * 2018-12-13 2023-09-01 北京深智恒际科技有限公司 Text detection method and system with recognition confidence
CN109886077B (en) * 2018-12-28 2021-07-09 北京旷视科技有限公司 Image recognition method and device, computer equipment and storage medium
CN109784235A (en) * 2018-12-29 2019-05-21 广东益萃网络科技有限公司 Method for automatically inputting, device, computer equipment and the storage medium of paper form
CN111461105B (en) * 2019-01-18 2023-11-28 顺丰科技有限公司 Text recognition method and device
CN109858420A (en) * 2019-01-24 2019-06-07 国信电子票据平台信息服务有限公司 A kind of bill processing system and processing method
CN109886257B (en) * 2019-01-30 2022-10-18 四川长虹电器股份有限公司 Method for correcting invoice image segmentation result by adopting deep learning in OCR system
CN109829437B (en) * 2019-02-01 2022-03-25 北京旷视科技有限公司 Image processing method, text recognition device and electronic system
CN109978890B (en) * 2019-02-25 2023-07-07 平安科技(深圳)有限公司 Target extraction method and device based on image processing and terminal equipment
CN109872444B (en) * 2019-02-27 2021-03-09 杭州睿琪软件有限公司 Bill identification method and device
CN111626076A (en) * 2019-02-27 2020-09-04 富士通株式会社 Information processing method, information processing apparatus, and scanner
CN109948549B (en) * 2019-03-20 2022-11-29 深圳市华付信息技术有限公司 OCR data generation method and device, computer equipment and storage medium
CN110109922B (en) * 2019-04-04 2024-05-24 广州易策医管科技有限公司 Performance data acquisition method, device, computer equipment and storage medium
CN110110123B (en) * 2019-04-04 2023-07-25 平安科技(深圳)有限公司 Training set updating method and device for detection model
CN110135409B (en) * 2019-04-04 2023-11-03 平安科技(深圳)有限公司 Optimization method and device for recognition model
CN111652259B (en) * 2019-04-16 2024-03-08 上海铼锶信息技术有限公司 Method and system for cleaning data
CN110189141A (en) * 2019-04-17 2019-08-30 深圳壹账通智能科技有限公司 Verification method, device and the computer equipment of invoice
CN110070042A (en) * 2019-04-23 2019-07-30 北京字节跳动网络技术有限公司 Character recognition method, device and electronic equipment
CN110276253A (en) * 2019-05-15 2019-09-24 中国科学院信息工程研究所 A kind of fuzzy literal detection recognition method based on deep learning
CN110245658A (en) * 2019-05-21 2019-09-17 深圳壹账通智能科技有限公司 A kind of bank slip recognition method, apparatus, storage medium and server
CN110175610B (en) * 2019-05-23 2023-09-05 上海交通大学 Bill image text recognition method supporting privacy protection
CN110210400B (en) * 2019-06-03 2020-11-17 上海眼控科技股份有限公司 Table file detection method and equipment
CN110210503B (en) * 2019-06-14 2021-01-01 厦门历思科技服务有限公司 Seal identification method, device and equipment
CN110287952B (en) * 2019-07-01 2021-07-20 中科软科技股份有限公司 Method and system for recognizing characters of dimension picture
CN110414927B (en) * 2019-07-05 2023-04-07 西安网算数据科技有限公司 Method and device for automatically generating voucher during bill processing
CN110490193B (en) * 2019-07-24 2022-11-08 西安网算数据科技有限公司 Single character area detection method and bill content identification method
CN110399875A (en) * 2019-07-31 2019-11-01 山东浪潮人工智能研究院有限公司 A kind of form of general use information extracting method based on deep learning and pixel projection
CN110647956B (en) * 2019-08-12 2024-03-29 深圳市华付信息技术有限公司 Invoice information extraction method combining two-dimension code recognition
CN112258259A (en) * 2019-08-14 2021-01-22 北京京东尚科信息技术有限公司 Data processing method, device and computer readable storage medium
CN110525069A (en) * 2019-08-21 2019-12-03 珠海思格特智能系统有限公司 A kind of OCR comparison system and its method for intelligent stamper machine
CN110490267B (en) * 2019-08-23 2022-03-01 四川长虹电器股份有限公司 Bill sorting method based on deep learning
CN110738203B (en) * 2019-09-06 2024-04-05 中国平安财产保险股份有限公司 Field structured output method, device and computer readable storage medium
CN110717516B (en) * 2019-09-06 2023-08-25 中国平安财产保险股份有限公司 Bill image classification method, device and computer readable storage medium
CN110647829A (en) * 2019-09-12 2020-01-03 全球能源互联网研究院有限公司 Bill text recognition method and system
CN110807455A (en) * 2019-09-19 2020-02-18 平安科技(深圳)有限公司 Bill detection method, device and equipment based on deep learning and storage medium
CN110765910A (en) * 2019-10-14 2020-02-07 西安网算数据科技有限公司 Bill region identification method and device in dense scene
CN111626279B (en) * 2019-10-15 2023-06-02 西安网算数据科技有限公司 Negative sample labeling training method and highly-automatic bill identification method
CN110991270B (en) * 2019-11-15 2023-10-20 泰康保险集团股份有限公司 Text recognition method, device, electronic equipment and storage medium
CN111079562A (en) * 2019-11-27 2020-04-28 浙江大学 Multi-stage data generation self-circulation financial invoice text intelligent identification system and method
CN110956133A (en) * 2019-11-29 2020-04-03 上海眼控科技股份有限公司 Training method of single character text normalization model, text recognition method and device
CN111027443B (en) * 2019-12-04 2023-04-07 华南理工大学 Bill text detection method based on multitask deep learning
CN111027532A (en) * 2019-12-11 2020-04-17 上海眼控科技股份有限公司 System and method for identifying tax amount of insurance policy vehicle and ship for forced insurance
CN111160188A (en) * 2019-12-20 2020-05-15 中国建设银行股份有限公司 Financial bill identification method, device, equipment and storage medium
CN111192660B (en) * 2019-12-24 2023-10-13 上海联影智能医疗科技有限公司 Image report analysis method, device and computer storage medium
CN111209856B (en) * 2020-01-06 2023-10-17 泰康保险集团股份有限公司 Invoice information identification method and device, electronic equipment and storage medium
CN111259882B (en) * 2020-01-14 2023-12-26 平安科技(深圳)有限公司 Bill identification method and device and computer equipment
CN111242034A (en) * 2020-01-14 2020-06-05 支付宝(杭州)信息技术有限公司 Document image processing method and device, processing equipment and client
CN111291619A (en) * 2020-01-14 2020-06-16 支付宝(杭州)信息技术有限公司 Method, device and client for on-line recognition of characters in claim settlement document
CN111444912A (en) * 2020-01-14 2020-07-24 国网电子商务有限公司 Bill image character recognition method and device
CN111414905B (en) * 2020-02-25 2023-08-18 泰康保险集团股份有限公司 Text detection method, text detection device, electronic equipment and storage medium
CN111368902A (en) * 2020-02-28 2020-07-03 北京三快在线科技有限公司 Data labeling method and device
CN111325207A (en) * 2020-03-05 2020-06-23 中国银行股份有限公司 Bill identification method and device based on preprocessing
CN111401221B (en) * 2020-03-12 2023-04-25 重庆农村商业银行股份有限公司 Card ticket identification method, device, equipment and storage medium
CN111444792B (en) * 2020-03-13 2023-05-09 安诚迈科(北京)信息技术有限公司 Bill identification method, electronic equipment, storage medium and device
CN111310762A (en) * 2020-03-16 2020-06-19 天津得迈科技有限公司 Intelligent medical bill identification method based on Internet of things
CN111340032A (en) * 2020-03-16 2020-06-26 天津得迈科技有限公司 Character recognition method based on application scene in financial field
CN111476227B (en) * 2020-03-17 2024-04-05 平安科技(深圳)有限公司 Target field identification method and device based on OCR and storage medium
CN111428599B (en) * 2020-03-17 2023-10-20 北京子敬科技有限公司 Bill identification method, device and equipment
CN111462388A (en) * 2020-03-19 2020-07-28 广州市玄武无线科技股份有限公司 Bill inspection method and device, terminal equipment and storage medium
CN113469161A (en) * 2020-03-31 2021-10-01 顺丰科技有限公司 Method, device and storage medium for processing logistics list
CN111461100B (en) * 2020-03-31 2023-04-28 重庆农村商业银行股份有限公司 Bill identification method and device, electronic equipment and storage medium
CN113496115B (en) * 2020-04-08 2023-07-28 中国移动通信集团广东有限公司 File content comparison method and device
CN111428725A (en) * 2020-04-13 2020-07-17 北京令才科技有限公司 Data structuring processing method and device and electronic equipment
CN111553334A (en) * 2020-04-21 2020-08-18 招商局金融科技有限公司 Questionnaire image recognition method, electronic device, and storage medium
CN111695439B (en) * 2020-05-20 2024-05-10 平安科技(深圳)有限公司 Image structured data extraction method, electronic device and storage medium
CN111652232B (en) * 2020-05-29 2023-08-22 泰康保险集团股份有限公司 Bill identification method and device, electronic equipment and computer readable storage medium
CN111666868A (en) * 2020-06-03 2020-09-15 阳光保险集团股份有限公司 Insurance policy identification method and device and computer equipment
CN111832423A (en) * 2020-06-19 2020-10-27 北京邮电大学 Bill information identification method, device and system
WO2022006829A1 (en) * 2020-07-09 2022-01-13 国网电子商务有限公司 Bill image recognition method and system, electronic device, and storage medium
CN111814906B (en) * 2020-07-23 2023-07-11 上海东普信息科技有限公司 Express delivery face list recognition model transplanting method, device, equipment and storage medium
CN111881882A (en) * 2020-08-10 2020-11-03 晶璞(上海)人工智能科技有限公司 Medical bill rotation correction method and system based on deep learning
CN111931664B (en) * 2020-08-12 2024-01-12 腾讯科技(深圳)有限公司 Mixed-pasting bill image processing method and device, computer equipment and storage medium
CN111968056A (en) * 2020-08-18 2020-11-20 中国银行股份有限公司 Online check deposit business processing method, client, cloud server and system
CN112036304A (en) * 2020-08-31 2020-12-04 平安医疗健康管理股份有限公司 Medical bill layout identification method and device and computer equipment
CN112052858B (en) * 2020-09-02 2023-09-12 中国银行股份有限公司 Method and related device for extracting target field in bill image
CN112036396B (en) * 2020-09-14 2022-09-02 上海高德威智能交通系统有限公司 Ship name recognition method and device, electronic equipment and computer readable storage medium
CN112396047B (en) * 2020-10-30 2022-03-08 中电金信软件有限公司 Training sample generation method and device, computer equipment and storage medium
CN112241727A (en) * 2020-10-30 2021-01-19 深圳供电局有限公司 Multi-ticket identification method and system and readable storage medium
CN112434689A (en) * 2020-12-01 2021-03-02 天冕信息技术(深圳)有限公司 Method, device and equipment for identifying information in picture and storage medium
CN112464845B (en) * 2020-12-04 2022-09-16 山东产研鲲云人工智能研究院有限公司 Bill recognition method, equipment and computer storage medium
CN112560861B (en) * 2020-12-10 2022-11-18 上海亿保健康管理有限公司 Bill processing method, device, equipment and storage medium
CN112541505B (en) * 2020-12-11 2021-07-23 上海云从企业发展有限公司 Text recognition method, text recognition device and computer-readable storage medium
CN112507914A (en) * 2020-12-15 2021-03-16 江苏国光信息产业股份有限公司 OCR (optical character recognition) method and recognition system based on bankbook and bill characters
CN112613553B (en) * 2020-12-18 2022-03-08 中电金信软件有限公司 Picture sample set generation method and device, computer equipment and storage medium
CN112580657B (en) * 2020-12-23 2022-11-01 陕西天诚软件有限公司 Self-learning character recognition method
CN112613510B (en) * 2020-12-25 2023-10-31 创新奇智(青岛)科技有限公司 Picture preprocessing method, character recognition model training method and character recognition method
CN112632926B (en) * 2020-12-29 2023-10-31 平安科技(深圳)有限公司 Bill data processing method and device, electronic equipment and storage medium
CN112651353B (en) * 2020-12-30 2024-04-16 南京红松信息技术有限公司 Target calculation positioning and identifying method based on custom label
CN112784932B (en) * 2021-03-01 2024-06-07 北京百炼智能科技有限公司 Font identification method, device and storage medium
CN112699860B (en) * 2021-03-24 2021-06-22 成都新希望金融信息有限公司 Method for automatically extracting and sorting effective information in personal tax APP operation video
CN113077018A (en) * 2021-06-07 2021-07-06 浙江大华技术股份有限公司 Target object identification method and device, storage medium and electronic device
CN113095307B (en) * 2021-06-09 2021-08-20 国网浙江省电力有限公司 Automatic identification method for financial voucher information
CN113283421B (en) * 2021-06-24 2024-03-01 中国平安人寿保险股份有限公司 Information identification method, device, equipment and storage medium
CN113326895A (en) * 2021-06-25 2021-08-31 湖南星汉数智科技有限公司 Passenger ticket travel itinerary identification method and device, computer equipment and storage medium
CN113344690A (en) * 2021-06-30 2021-09-03 中国工商银行股份有限公司 Invoice reimbursement processing method and device
CN113469092B (en) * 2021-07-13 2023-09-08 深圳思谋信息科技有限公司 Character recognition model generation method, device, computer equipment and storage medium
CN113642556A (en) * 2021-08-04 2021-11-12 五八有限公司 Image processing method and device, electronic equipment and storage medium
CN113780116A (en) * 2021-08-26 2021-12-10 众安在线财产保险股份有限公司 Invoice classification method and device, computer equipment and storage medium
CN113723508B (en) * 2021-08-30 2024-04-19 杭州米数科技有限公司 Bill image classification method, device, computing equipment and storage medium
CN113705559B (en) * 2021-08-31 2024-05-10 平安银行股份有限公司 Character recognition method and device based on artificial intelligence and electronic equipment
CN113920294B (en) * 2021-10-20 2024-06-11 广东亿迅科技有限公司 Method, system and storage medium for identifying massive invoices
CN113963149A (en) * 2021-10-29 2022-01-21 平安科技(深圳)有限公司 Medical bill picture fuzzy judgment method, system, equipment and medium
CN114743198A (en) * 2022-03-14 2022-07-12 中国银行股份有限公司 Method and device for identifying bill with form
CN114926831A (en) * 2022-05-31 2022-08-19 平安普惠企业管理有限公司 Text-based recognition method and device, electronic equipment and readable storage medium
CN118195555A (en) * 2024-05-16 2024-06-14 鲁班(北京)电子商务科技有限公司 Method and device for inputting and managing building enterprise receipts

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170004370A1 (en) * 2014-07-15 2017-01-05 Google Inc. Extracting card identification data
CN107403130A (en) * 2017-04-19 2017-11-28 北京粉笔未来科技有限公司 A kind of character identifying method and character recognition device
CN107481238A (en) * 2017-09-20 2017-12-15 众安信息技术服务有限公司 Image quality measure method and device
CN107766809A (en) * 2017-10-09 2018-03-06 平安科技(深圳)有限公司 Electronic installation, billing information recognition methods and computer-readable recording medium

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103488999A (en) * 2013-09-11 2014-01-01 东华大学 Invoice data recording method
CN105095842B (en) * 2014-05-22 2018-12-11 口碑控股有限公司 A kind of method and apparatus of the information identification of document
CN105574513B (en) * 2015-12-22 2017-11-24 北京旷视科技有限公司 Character detecting method and device
CN107346580B (en) * 2016-05-05 2019-11-05 腾讯科技(深圳)有限公司 Billing information recognition methods and device
CN107798299B (en) * 2017-10-09 2020-02-07 平安科技(深圳)有限公司 Bill information identification method, electronic device and readable storage medium

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170004370A1 (en) * 2014-07-15 2017-01-05 Google Inc. Extracting card identification data
CN107403130A (en) * 2017-04-19 2017-11-28 北京粉笔未来科技有限公司 A kind of character identifying method and character recognition device
CN107481238A (en) * 2017-09-20 2017-12-15 众安信息技术服务有限公司 Image quality measure method and device
CN107766809A (en) * 2017-10-09 2018-03-06 平安科技(深圳)有限公司 Electronic installation, billing information recognition methods and computer-readable recording medium

Cited By (181)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110728277A (en) * 2019-09-27 2020-01-24 达而观信息科技(上海)有限公司 Seal intelligent detection and identification method
CN110728277B (en) * 2019-09-27 2024-04-05 达观数据有限公司 Intelligent seal detection and identification method and system
CN110826330A (en) * 2019-10-12 2020-02-21 上海数禾信息科技有限公司 Name recognition method and device, computer equipment and readable storage medium
CN110826330B (en) * 2019-10-12 2023-11-07 上海数禾信息科技有限公司 Name recognition method and device, computer equipment and readable storage medium
CN111126125A (en) * 2019-10-15 2020-05-08 平安科技(深圳)有限公司 Method, device and equipment for extracting target text in certificate and readable storage medium
CN111126125B (en) * 2019-10-15 2023-08-01 平安科技(深圳)有限公司 Method, device, equipment and readable storage medium for extracting target text in certificate
CN110766017A (en) * 2019-10-22 2020-02-07 国网新疆电力有限公司信息通信公司 Mobile terminal character recognition method and system based on deep learning
CN110827333A (en) * 2019-10-31 2020-02-21 国网河北省电力有限公司电力科学研究院 Relay protection pressing plate splicing identification method, system and medium
CN110827333B (en) * 2019-10-31 2022-05-03 国网河北省电力有限公司电力科学研究院 Relay protection pressing plate splicing identification method, system and medium
CN110866495A (en) * 2019-11-14 2020-03-06 杭州睿琪软件有限公司 Bill image recognition method, bill image recognition device, bill image recognition equipment, training method and storage medium
CN110866495B (en) * 2019-11-14 2022-06-28 杭州睿琪软件有限公司 Bill image recognition method, bill image recognition device, bill image recognition equipment, training method and storage medium
CN110941717B (en) * 2019-11-22 2023-08-11 深圳马可孛罗科技有限公司 Passenger ticket rule analysis method and device, electronic equipment and computer readable medium
CN110941717A (en) * 2019-11-22 2020-03-31 深圳马可孛罗科技有限公司 Passenger ticket rule analysis method and device, electronic equipment and computer readable medium
CN111079117A (en) * 2019-11-28 2020-04-28 上海三零卫士信息安全有限公司 LeNet and SSD-based point-contact type verification code automatic identification method
CN111079117B (en) * 2019-11-28 2024-02-13 上海三零卫士信息安全有限公司 Automatic point-contact verification code identification method based on LeNet and SSD
CN110991520B (en) * 2019-11-29 2023-05-02 汉海信息技术(上海)有限公司 Method and device for generating training samples
CN110991520A (en) * 2019-11-29 2020-04-10 汉海信息技术(上海)有限公司 Method and device for generating training sample
CN111209792A (en) * 2019-12-02 2020-05-29 河南科技学院 Image identification method and device for value-added tax general invoice
CN112990172A (en) * 2019-12-02 2021-06-18 阿里巴巴集团控股有限公司 Text recognition method, character recognition method and device
CN112990172B (en) * 2019-12-02 2023-12-22 阿里巴巴集团控股有限公司 Text recognition method, character recognition method and device
CN113033240B (en) * 2019-12-09 2023-05-02 上海高德威智能交通系统有限公司 Multi-line text recognition method, model training method, device, equipment and medium
CN111860489A (en) * 2019-12-09 2020-10-30 北京嘀嘀无限科技发展有限公司 Certificate image correction method, device, equipment and storage medium
CN113033240A (en) * 2019-12-09 2021-06-25 上海高德威智能交通系统有限公司 Multi-line text recognition method, model training method, device, equipment and medium
CN112947944B (en) * 2019-12-11 2023-12-05 中盈优创资讯科技有限公司 Business work order processing method and device
CN112947944A (en) * 2019-12-11 2021-06-11 中盈优创资讯科技有限公司 Business work order processing method and device
CN111046886A (en) * 2019-12-12 2020-04-21 吉林大学 Automatic identification method, device and equipment for number plate and computer readable storage medium
CN112990208A (en) * 2019-12-12 2021-06-18 搜狗(杭州)智能科技有限公司 Text recognition method and device
CN112990208B (en) * 2019-12-12 2024-04-30 北京搜狗科技发展有限公司 Text recognition method and device
CN111079641B (en) * 2019-12-13 2024-04-16 科大讯飞股份有限公司 Answer content identification method, related device and readable storage medium
CN111079641A (en) * 2019-12-13 2020-04-28 科大讯飞股份有限公司 Answering content identification method, related device and readable storage medium
CN111191695B (en) * 2019-12-19 2023-05-23 杭州安恒信息技术股份有限公司 Website picture tampering detection method based on deep learning
CN111191695A (en) * 2019-12-19 2020-05-22 杭州安恒信息技术股份有限公司 Website picture tampering detection method based on deep learning
CN111126396A (en) * 2019-12-25 2020-05-08 北京科技大学 Image recognition method and device, computer equipment and storage medium
CN111126396B (en) * 2019-12-25 2023-08-22 北京科技大学 Image recognition method, device, computer equipment and storage medium
CN111178355A (en) * 2019-12-27 2020-05-19 中化资本有限公司 Seal identification method and device and storage medium
CN111178355B (en) * 2019-12-27 2024-05-10 中化资本有限公司 Seal identification method, device and storage medium
CN111160265B (en) * 2019-12-30 2023-01-10 Oppo(重庆)智能科技有限公司 File conversion method and device, storage medium and electronic equipment
CN111160265A (en) * 2019-12-30 2020-05-15 Oppo(重庆)智能科技有限公司 File conversion method and device, storage medium and electronic equipment
CN111144358A (en) * 2019-12-30 2020-05-12 上海眼控科技股份有限公司 Vehicle quality certificate verification method and device, computer equipment and storage medium
CN111325104A (en) * 2020-01-22 2020-06-23 平安科技(深圳)有限公司 Text recognition method, device and storage medium
CN111340022A (en) * 2020-02-24 2020-06-26 深圳市华云中盛科技股份有限公司 Identity card information identification method and device, computer equipment and storage medium
CN111353484A (en) * 2020-02-28 2020-06-30 深圳前海微众银行股份有限公司 Image character recognition method, device, equipment and readable storage medium
CN111445058A (en) * 2020-03-04 2020-07-24 中国平安人寿保险股份有限公司 Data analysis method, device, equipment and computer readable storage medium
CN111445058B (en) * 2020-03-04 2024-05-28 中国平安人寿保险股份有限公司 Data analysis method, device, equipment and computer readable storage medium
CN111414906A (en) * 2020-03-05 2020-07-14 北京交通大学 Data synthesis and text recognition method for paper bill picture
CN111414906B (en) * 2020-03-05 2024-05-24 北京交通大学 Data synthesis and text recognition method for paper bill pictures
CN111291726A (en) * 2020-03-12 2020-06-16 泰康保险集团股份有限公司 Medical bill sorting method, device, equipment and medium
CN111291726B (en) * 2020-03-12 2023-08-08 泰康保险集团股份有限公司 Medical bill sorting method, device, equipment and medium
CN111507324B (en) * 2020-03-16 2024-05-31 平安科技(深圳)有限公司 Card frame recognition method, device, equipment and computer storage medium
CN111507324A (en) * 2020-03-16 2020-08-07 平安科技(深圳)有限公司 Card frame identification method, device, equipment and computer storage medium
CN111414917A (en) * 2020-03-18 2020-07-14 民生科技有限责任公司 Identification method of low-pixel-density text
CN111414911A (en) * 2020-03-23 2020-07-14 湖南信息学院 Card number identification method and system based on deep learning
CN111598122A (en) * 2020-04-01 2020-08-28 深圳壹账通智能科技有限公司 Data verification method and device, electronic equipment and storage medium
CN111582259B (en) * 2020-04-10 2024-04-16 支付宝实验室(新加坡)有限公司 Machine-readable code identification method, device, electronic equipment and storage medium
CN111582259A (en) * 2020-04-10 2020-08-25 支付宝实验室(新加坡)有限公司 Machine-readable code identification method and device, electronic equipment and storage medium
CN111476243A (en) * 2020-04-13 2020-07-31 中国工商银行股份有限公司 Image character recognition method and device
CN111489347B (en) * 2020-04-15 2023-10-31 深圳前海环融联易信息科技服务有限公司 Business license picture quality detection method and device, computer equipment and storage medium
CN111489347A (en) * 2020-04-15 2020-08-04 深圳前海环融联易信息科技服务有限公司 Business license picture quality detection method and device, computer equipment and storage medium
CN111507356A (en) * 2020-04-20 2020-08-07 恒银金融科技股份有限公司 Segmentation method of handwritten characters of lower case money of financial bills
CN111582786A (en) * 2020-04-29 2020-08-25 上海中通吉网络技术有限公司 Express bill number identification method, device and equipment based on machine learning
CN111581242A (en) * 2020-04-30 2020-08-25 北京贝禾无忧科技有限公司 Method and system for identifying enterprise bill use
CN111680691B (en) * 2020-04-30 2023-06-02 北京三快在线科技有限公司 Text detection method, text detection device, electronic equipment and computer readable storage medium
CN111680691A (en) * 2020-04-30 2020-09-18 北京三快在线科技有限公司 Character detection method and device, electronic equipment and computer readable storage medium
CN111639566A (en) * 2020-05-19 2020-09-08 浙江大华技术股份有限公司 Method and device for extracting form information
CN111639648A (en) * 2020-05-26 2020-09-08 浙江大华技术股份有限公司 Certificate identification method and device, computing equipment and storage medium
CN111639648B (en) * 2020-05-26 2023-09-19 浙江大华技术股份有限公司 Certificate identification method, device, computing equipment and storage medium
CN111666932B (en) * 2020-05-27 2023-07-14 平安科技(深圳)有限公司 Document auditing method, device, computer equipment and storage medium
CN111666932A (en) * 2020-05-27 2020-09-15 平安科技(深圳)有限公司 Document auditing method and device, computer equipment and storage medium
CN111814539B (en) * 2020-05-28 2023-07-21 平安科技(深圳)有限公司 Character recognition method and device based on infrared light and ultraviolet light and computer equipment
CN111814539A (en) * 2020-05-28 2020-10-23 平安科技(深圳)有限公司 Character recognition method and device based on infrared light and ultraviolet light and computer equipment
CN111666886A (en) * 2020-06-08 2020-09-15 成都知识视觉科技有限公司 Image preprocessing method for medical document structured knowledge extraction
CN111814785B (en) * 2020-06-11 2024-03-29 浙江大华技术股份有限公司 Invoice recognition method, training method of relevant model, relevant equipment and device
CN111814785A (en) * 2020-06-11 2020-10-23 浙江大华技术股份有限公司 Invoice recognition method, training method of related model, related equipment and device
CN111753744A (en) * 2020-06-28 2020-10-09 北京百度网讯科技有限公司 Method, device and equipment for classifying bill images and readable storage medium
CN111950356B (en) * 2020-06-30 2024-04-19 深圳市雄帝科技股份有限公司 Seal text positioning method and device and electronic equipment
CN111950356A (en) * 2020-06-30 2020-11-17 深圳市雄帝科技股份有限公司 Seal text positioning method and device and electronic equipment
CN111783785A (en) * 2020-07-02 2020-10-16 上海海事大学 Water meter identification system and method
CN111783785B (en) * 2020-07-02 2024-04-09 上海海事大学 Water meter identification system and identification method
CN111931229B (en) * 2020-07-10 2023-07-11 深信服科技股份有限公司 Data identification method, device and storage medium
CN111931229A (en) * 2020-07-10 2020-11-13 深信服科技股份有限公司 Data identification method and device and storage medium
CN111950380A (en) * 2020-07-19 2020-11-17 中国建设银行股份有限公司 Bill auditing method and device, electronic equipment and computer-readable storage medium
CN111832657A (en) * 2020-07-20 2020-10-27 上海眼控科技股份有限公司 Text recognition method and device, computer equipment and storage medium
CN111932593B (en) * 2020-07-21 2024-04-09 湖南中联重科智能技术有限公司 Image registration method, system and equipment based on touch screen gesture correction
CN111932593A (en) * 2020-07-21 2020-11-13 湖南中联重科智能技术有限公司 Image registration method, system and equipment based on touch screen gesture correction
CN111914706B (en) * 2020-07-22 2023-11-17 厦门商集网络科技有限责任公司 Method and device for detecting and controlling quality of text detection output result
CN111914706A (en) * 2020-07-22 2020-11-10 厦门商集网络科技有限责任公司 Method and device for detecting and controlling quality of character detection output result
CN111914729A (en) * 2020-07-28 2020-11-10 中国建设银行股份有限公司 Voucher association method and device, computer equipment and storage medium
CN112069894A (en) * 2020-08-03 2020-12-11 许继集团有限公司 Wire strand scattering identification method based on fast-RCNN model
CN112052853B (en) * 2020-09-09 2024-02-02 国家气象信息中心 Text positioning method of handwriting meteorological archive data based on deep learning
CN112070086B (en) * 2020-09-09 2024-05-07 平安科技(深圳)有限公司 Optimization method of text recognition system, computer equipment and storage medium
CN112070086A (en) * 2020-09-09 2020-12-11 平安科技(深圳)有限公司 Method for optimizing text recognition system, computer device, and storage medium
CN112052853A (en) * 2020-09-09 2020-12-08 国家气象信息中心 Text positioning method of handwritten meteorological archive data based on deep learning
CN112100366B (en) * 2020-09-17 2023-10-27 广联达科技股份有限公司 Pavement structure layer display method and device, computer equipment and storage medium
CN112100366A (en) * 2020-09-17 2020-12-18 广联达科技股份有限公司 Pavement structure layer display method and device, computer equipment and storage medium
CN112270323B (en) * 2020-09-29 2024-02-02 西安电子科技大学 Intelligent medical review control method, system, medium, computer equipment and application
CN112270323A (en) * 2020-09-29 2021-01-26 西安电子科技大学 Intelligent medical reinspection control method, system, medium, computer equipment and application
CN112115921A (en) * 2020-09-30 2020-12-22 北京百度网讯科技有限公司 True and false identification method and device and electronic equipment
CN112232338B (en) * 2020-10-13 2023-09-08 中国平安人寿保险股份有限公司 Data entry method, device, equipment and storage medium for warranty claim checking process
CN112232338A (en) * 2020-10-13 2021-01-15 中国平安人寿保险股份有限公司 Data entry method, device, equipment and storage medium for underwriting and claim settlement process
CN112232340A (en) * 2020-10-15 2021-01-15 马婧 Method and device for identifying printed information on surface of object
CN112257768A (en) * 2020-10-19 2021-01-22 广州金融科技股份有限公司 Method and device for identifying illegal financial picture and computer storage medium
CN112464941B (en) * 2020-10-23 2024-05-24 北京思特奇信息技术股份有限公司 Invoice identification method and system based on neural network
CN112257613A (en) * 2020-10-23 2021-01-22 中国平安人寿保险股份有限公司 Physical examination report information structured extraction method and device and computer equipment
CN112257613B (en) * 2020-10-23 2023-08-08 中国平安人寿保险股份有限公司 Physical examination report information structured extraction method and device and computer equipment
CN112464941A (en) * 2020-10-23 2021-03-09 北京思特奇信息技术股份有限公司 Invoice identification method and system based on neural network
CN112232296A (en) * 2020-11-09 2021-01-15 北京爱笔科技有限公司 Hyper-parameter exploration method and device
CN112396048B (en) * 2020-11-17 2023-09-29 中国平安人寿保险股份有限公司 Picture information extraction method and device, computer equipment and storage medium
CN112396048A (en) * 2020-11-17 2021-02-23 中国平安人寿保险股份有限公司 Picture information extraction method and device, computer equipment and storage medium
CN112418206B (en) * 2020-11-20 2024-02-27 上海昇晔网络科技有限公司 Picture classification method based on position detection model and related equipment thereof
CN112418206A (en) * 2020-11-20 2021-02-26 平安普惠企业管理有限公司 Picture classification method based on position detection model and related equipment thereof
CN112329708A (en) * 2020-11-24 2021-02-05 北京百度网讯科技有限公司 Bill identification method and device
CN112288039B (en) * 2020-11-26 2024-01-23 深源恒际科技有限公司 Sample labeling method and system for OCR model training
CN112287898A (en) * 2020-11-26 2021-01-29 深源恒际科技有限公司 Method and system for evaluating text detection quality of image
CN112288039A (en) * 2020-11-26 2021-01-29 深源恒际科技有限公司 Sample labeling method and system for OCR model training
CN112381702B (en) * 2020-12-02 2024-03-15 北京皮尔布莱尼软件有限公司 Image privacy processing method, computing device and storage medium
CN112381702A (en) * 2020-12-02 2021-02-19 北京皮尔布莱尼软件有限公司 Image privacy processing method, computing device and storage medium
CN112488108A (en) * 2020-12-11 2021-03-12 广州小鹏自动驾驶科技有限公司 Parking space number identification method and device, electronic equipment and storage medium
CN112508017A (en) * 2020-12-15 2021-03-16 通号智慧城市研究设计院有限公司 Intelligent digital display instrument reading identification method, system, processing equipment and storage medium
CN112651913B (en) * 2020-12-17 2024-03-29 广州市申迪计算机系统有限公司 Invoice seal desalination method, system, device and computer storage medium
CN112651913A (en) * 2020-12-17 2021-04-13 广州市申迪计算机系统有限公司 Invoice seal desalination method, system and device and computer storage medium
CN112507946A (en) * 2020-12-18 2021-03-16 北京百度网讯科技有限公司 Method, apparatus, device and storage medium for processing image
CN112837466A (en) * 2020-12-18 2021-05-25 北京百度网讯科技有限公司 Bill recognition method, device, equipment and storage medium
CN112507973A (en) * 2020-12-29 2021-03-16 中国电子科技集团公司第二十八研究所 Text and picture recognition system based on OCR technology
CN112507973B (en) * 2020-12-29 2022-09-06 中国电子科技集团公司第二十八研究所 Text and picture recognition system based on OCR technology
CN113111882A (en) * 2021-02-24 2021-07-13 北京邮电大学 Card identification method and device, electronic equipment and storage medium
CN113111882B (en) * 2021-02-24 2022-12-20 北京邮电大学 Card identification method and device, electronic equipment and storage medium
CN112949450B (en) * 2021-02-25 2024-01-23 北京百度网讯科技有限公司 Bill processing method, device, electronic equipment and storage medium
CN112926565B (en) * 2021-02-25 2024-02-06 中国平安人寿保险股份有限公司 Picture text recognition method, system, equipment and storage medium
CN112949450A (en) * 2021-02-25 2021-06-11 北京百度网讯科技有限公司 Bill processing method, bill processing device, electronic device and storage medium
CN112926565A (en) * 2021-02-25 2021-06-08 中国平安人寿保险股份有限公司 Picture text recognition method, system, device and storage medium
CN112966583A (en) * 2021-02-26 2021-06-15 深圳壹账通智能科技有限公司 Image processing method, image processing device, computer equipment and storage medium
CN112989990B (en) * 2021-03-09 2023-08-04 平安科技(深圳)有限公司 Medical bill identification method, device, equipment and storage medium
CN112989990A (en) * 2021-03-09 2021-06-18 平安科技(深圳)有限公司 Medical bill identification method, device, equipment and storage medium
CN112818951B (en) * 2021-03-11 2023-11-21 南京大学 Ticket identification method
CN112818951A (en) * 2021-03-11 2021-05-18 南京大学 Ticket identification method
CN112818961A (en) * 2021-03-26 2021-05-18 北京东方金朔信息技术有限公司 Image feature identification method and device
CN113128490A (en) * 2021-04-28 2021-07-16 湖南荣冠智能科技有限公司 Prescription information scanning and automatic identification method
CN113128490B (en) * 2021-04-28 2023-12-05 湖南荣冠智能科技有限公司 Prescription information scanning and automatic identification method
CN113255642A (en) * 2021-05-07 2021-08-13 北京精友时代信息技术发展有限公司 Medical bill information integration method for injury claims
CN113239807A (en) * 2021-05-14 2021-08-10 北京百度网讯科技有限公司 Method and device for training bill recognition model and bill recognition
CN113239807B (en) * 2021-05-14 2024-04-02 北京百度网讯科技有限公司 Method and device for training bill identification model and bill identification
CN113255767A (en) * 2021-05-25 2021-08-13 深圳壹账通智能科技有限公司 Bill classification method, device, equipment and storage medium
CN113255767B (en) * 2021-05-25 2023-11-24 深圳壹账通智能科技有限公司 Bill classification method, device, equipment and storage medium
CN113221869B (en) * 2021-05-25 2023-09-05 中国平安人寿保险股份有限公司 Medical invoice structured information extraction method, device equipment and storage medium
CN113221869A (en) * 2021-05-25 2021-08-06 中国平安人寿保险股份有限公司 Medical invoice structured information extraction method, device and equipment and storage medium
CN113269102A (en) * 2021-05-28 2021-08-17 中邮信息科技(北京)有限公司 Seal information identification method and device, computer equipment and storage medium
CN113313114B (en) * 2021-06-11 2023-06-30 北京百度网讯科技有限公司 Certificate information acquisition method, device, equipment and storage medium
CN113313114A (en) * 2021-06-11 2021-08-27 北京百度网讯科技有限公司 Certificate information acquisition method, device, equipment and storage medium
CN113435439B (en) * 2021-06-30 2023-11-28 青岛海尔科技有限公司 Document auditing method and device, storage medium and electronic device
CN113435439A (en) * 2021-06-30 2021-09-24 青岛海尔科技有限公司 Document auditing method and device, storage medium and electronic device
CN113657377B (en) * 2021-07-22 2023-11-14 西南财经大学 Structured recognition method for mechanical bill image
CN113657377A (en) * 2021-07-22 2021-11-16 西南财经大学 Structured recognition method for airplane ticket printing data image
CN113591705A (en) * 2021-07-30 2021-11-02 华东理工大学 Inspection robot instrument recognition system and method and storage medium
CN113591705B (en) * 2021-07-30 2024-03-19 华东理工大学 Inspection robot instrument identification system and method and storage medium
CN113627430B (en) * 2021-08-12 2023-06-20 石家庄铁道大学 Cable number detection method and device and electronic equipment
CN113627430A (en) * 2021-08-12 2021-11-09 石家庄铁道大学 Cable number detection method and device and electronic equipment
CN113537229A (en) * 2021-08-27 2021-10-22 广州广电运通金融电子股份有限公司 Bill image generation method and device, computer equipment and storage medium
CN113869131A (en) * 2021-09-01 2021-12-31 南京烽火天地通信科技有限公司 Method for structuring textualized business license picture
CN113869131B (en) * 2021-09-01 2024-03-29 南京烽火天地通信科技有限公司 Method for structuring text business license picture
CN113627395B (en) * 2021-09-17 2023-11-17 平安银行股份有限公司 Text recognition method, device, medium and electronic equipment
CN113627395A (en) * 2021-09-17 2021-11-09 平安银行股份有限公司 Text recognition method, text recognition device, text recognition medium and electronic equipment
CN113963147B (en) * 2021-09-26 2023-09-15 西安交通大学 Key information extraction method and system based on semantic segmentation
CN113963147A (en) * 2021-09-26 2022-01-21 西安交通大学 Key information extraction method and system based on semantic segmentation
WO2023078264A1 (en) * 2021-11-03 2023-05-11 中移(苏州)软件技术有限公司 Method and apparatus for training business card information extraction system, and computer-readable storage medium
CN114240407A (en) * 2021-11-17 2022-03-25 广东电网有限责任公司 Bill risk conduction quantitative evaluation system and method based on block chain
CN114267115B (en) * 2021-12-27 2024-03-08 江苏银承网络科技股份有限公司 Bill identification method and system
CN114299528A (en) * 2021-12-27 2022-04-08 万达信息股份有限公司 Information extraction and structuring method for scanned document
CN114267115A (en) * 2021-12-27 2022-04-01 江苏银承网络科技股份有限公司 Bill recognition method and system
CN114299528B (en) * 2021-12-27 2024-03-22 万达信息股份有限公司 Information extraction and structuring method for scanned document
CN114358943A (en) * 2022-01-12 2022-04-15 平安普惠企业管理有限公司 Online bill processing method and device, electronic equipment and storage medium
CN114708603A (en) * 2022-05-25 2022-07-05 杭州咏柳科技有限公司 Method, system, device and medium for identifying key information in medical bill
CN115170261A (en) * 2022-06-08 2022-10-11 姚远 Systematic method for hospital financial audit
CN114818995B (en) * 2022-06-27 2022-10-04 广东电网有限责任公司佛山供电局 Method, device and equipment for identifying and early warning non-work ticket violation operation
CN114818995A (en) * 2022-06-27 2022-07-29 广东电网有限责任公司佛山供电局 Method, device and equipment for identifying and early warning illegal work without work ticket
CN115062722A (en) * 2022-07-06 2022-09-16 哈尔滨璧能科技有限公司 AI training method based on cloud service big data cleaning and artificial intelligence cloud system
CN115690803A (en) * 2022-10-31 2023-02-03 中电金信软件(上海)有限公司 Digital image recognition method and device, electronic equipment and readable storage medium
CN117197816A (en) * 2023-06-19 2023-12-08 珠海盈米基金销售有限公司 User material identification method and system
CN117408764B (en) * 2023-12-13 2024-05-03 江西曼荼罗软件有限公司 Medical fee change method, system, storage medium and electronic equipment
CN117408764A (en) * 2023-12-13 2024-01-16 江西曼荼罗软件有限公司 Medical fee change method, system, storage medium and electronic equipment
CN117649672B (en) * 2024-01-30 2024-04-26 湖南大学 Font type visual detection method and system based on active learning and transfer learning
CN117649672A (en) * 2024-01-30 2024-03-05 湖南大学 Font type visual detection method and system based on active learning and transfer learning

Also Published As

Publication number Publication date
CN108446621A (en) 2018-08-24

Similar Documents

Publication Publication Date Title
WO2019174130A1 (en) Bill recognition method, server, and computer readable storage medium
CN107798299B (en) Bill information identification method, electronic device and readable storage medium
US11074445B2 (en) Remote sensing image recognition method and apparatus, storage medium and electronic device
WO2018010657A1 (en) Structured text detection method and system, and computing device
US10528841B2 (en) Method, system, electronic device, and medium for classifying license plates based on deep learning
CN107766809B (en) Electronic device, bill information identification method, and computer-readable storage medium
WO2019109526A1 (en) Method and device for age recognition of face image, storage medium
WO2018166116A1 (en) Car damage recognition method, electronic apparatus and computer-readable storage medium
US11106891B2 (en) Automated signature extraction and verification
WO2018205467A1 (en) Automobile damage part recognition method, system and electronic device and storage medium
US20190294921A1 (en) Field identification in an image using artificial intelligence
WO2019037259A1 (en) Electronic device, method and system for categorizing invoices, and computer-readable storage medium
WO2021012494A1 (en) Deep learning-based face recognition method and apparatus, and computer-readable storage medium
CN108491866B (en) Pornographic picture identification method, electronic device and readable storage medium
WO2020253508A1 (en) Abnormal cell detection method and apparatus, and computer readable storage medium
US10339373B1 (en) Optical character recognition utilizing hashed templates
CN112036295B (en) Bill image processing method and device, storage medium and electronic equipment
US10025976B1 (en) Data normalization for handwriting recognition
WO2019056503A1 (en) Store monitoring evaluation method, device and storage medium
US20220092353A1 (en) Method and device for training image recognition model, equipment and medium
CN111368632A (en) Signature identification method and device
WO2022103564A1 (en) Fraud detection via automated handwriting clustering
CN116311214A (en) License plate recognition method and device
US10896339B2 (en) Detecting magnetic ink character recognition codes
CN110796145A (en) Multi-certificate segmentation association method based on intelligent decision and related equipment

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18910227

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 07/12/2020)

122 Ep: pct application non-entry in european phase

Ref document number: 18910227

Country of ref document: EP

Kind code of ref document: A1