WO2018233055A1 - 保单信息录入的方法、装置、计算机设备及存储介质 - Google Patents
保单信息录入的方法、装置、计算机设备及存储介质 Download PDFInfo
- Publication number
- WO2018233055A1 WO2018233055A1 PCT/CN2017/100057 CN2017100057W WO2018233055A1 WO 2018233055 A1 WO2018233055 A1 WO 2018233055A1 CN 2017100057 W CN2017100057 W CN 2017100057W WO 2018233055 A1 WO2018233055 A1 WO 2018233055A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- image
- policy
- field
- information
- area
- Prior art date
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/40—Document-oriented image-based pattern recognition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/22—Matching criteria, e.g. proximity measures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q40/00—Finance; Insurance; Tax strategies; Processing of corporate or income taxes
- G06Q40/08—Insurance
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/40—Document-oriented image-based pattern recognition
- G06V30/41—Analysis of document content
- G06V30/414—Extracting the geometrical structure, e.g. layout tree; Block segmentation, e.g. bounding boxes for graphics or text
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
Definitions
- the present application relates to the field of computer processing, and in particular, to a method, device, computer device and storage medium for policy information entry.
- a method, apparatus, computer device, and storage medium for policy information entry are provided.
- a method for entering policy information including:
- Identifying information in the target data area to obtain editable data information storing the data information according to the corresponding field information, and completing information entry.
- a device for entering policy information comprising:
- a type determining module configured to obtain a policy image of the information to be entered, and determine a policy type of the policy image
- a template determining module configured to determine, according to the policy type, a field image template corresponding to the policy image
- a search module configured to search, by using an image matching algorithm, a field area that matches the field image template in the policy image
- An extracting module configured to determine a corresponding target data area on the policy image according to the field area, and extract the target data area
- An identification module configured to identify information in the target data area to obtain editable data information
- the input module is configured to store the data information according to the corresponding field information, and complete information entry.
- a computer device comprising a memory and a processor, wherein the memory stores computer readable instructions, the computer readable instructions being executed by the processor, such that the processor performs the step of: acquiring information to be entered a policy image to determine the type of policy for the policy image;
- the data information is stored according to the corresponding field information, and information entry is completed.
- One or more non-volatile readable storage media storing computer-executable instructions, when executed by one or more processors, cause the one or more processors to perform the steps of: acquiring The policy image of the information to be entered, determining the policy type of the policy image;
- Identifying information in the target data area to obtain editable data information storing the data information according to the corresponding field information, and completing information entry.
- FIG. 1 is a block diagram showing the internal structure of a computer device in an embodiment
- FIG. 2 is a flow chart of a method for policy information entry in an embodiment
- FIG. 3 is a flow chart of a method for searching a field region matching a field image template by an image matching algorithm in a policy image in an embodiment
- FIG. 5 is a flowchart of a method for identifying information in the target data area to obtain editable data information in an embodiment
- FIG. 6 is a structural block diagram of an apparatus for entering policy information in an embodiment
- FIG. 7 is a structural block diagram of a search module in an embodiment
- Figure 8 is a block diagram showing the structure of an apparatus for entering policy information in another embodiment.
- the computer device can be a terminal or a server.
- the terminal may be a personal computer or a mobile electronic device including at least one of a mobile phone, a tablet, a personal digital assistant, or a wearable device.
- the server can be a standalone server or a server cluster.
- the computer device includes a processor connected by a system bus, a non-volatile storage medium, an internal memory, and a network interface.
- the non-volatile storage medium of the computer device can store an operating system and computer readable instructions that, when executed, cause the processor to perform a method of policy information entry.
- the processor of the computer device is used to provide computing and control capabilities to support the operation of the entire computer device.
- the internal memory can store computer readable instructions that, when executed by the processor, cause the processor to perform a method of policy information entry.
- the network interface of the computer device is used for network communication. It will be understood by those skilled in the art that the structure shown in FIG. 1 is only a block diagram of a part of the structure related to the solution of the present application, and does not constitute a limitation of the computer device to which the solution of the present application is applied.
- the specific computer device may It includes more or fewer components than those shown in the figures, or some components are combined, or have different component arrangements.
- a method for policy information entry is proposed, which may be applied to a terminal or a server, and specifically includes the following steps:
- Step 202 Obtain a policy image of the information to be entered, and determine a policy type of the policy image.
- the policy image refers to a policy that cannot be directly edited in the form of a picture, such as a scanned copy of a policy, a photo, and the like.
- the policy is the most intensive insurance contract signed between the insurer and the insured.
- the name of the insurer and the insured, the subject matter of the insurance, and the insurance are mainly recorded on the policy.
- the policy type of the policy image There are various ways to determine the policy type of the policy image. For example, you can preset the location where different policy types are stored (for example, different policy types are stored in different folders), and then according to the location of the obtained policy image. The type of policy that directly determines the policy image.
- the policy type of the policy image may be determined according to the policy image number, wherein the policy image number is used to uniquely identify a policy image, and the policy image number may include a number representing the type of the policy, for example, The second-to-last position of the policy image number can be set to represent the policy type.
- Step 204 Determine a field image template corresponding to the policy image according to the policy type.
- the field image template corresponding to the policy image may be obtained according to the policy type.
- a field image template is an image in units of fields.
- the field refers to the content of the topic that contains a certain topic information.
- the items "name”, “gender”, and "date of birth” in a policy are so-called "fields".
- the fields contained in it are the same, and the specifications of the fields are the same, so the field image template can be set in advance.
- different policies contain the same fields, and the same field size is often the same, so many times for the same field, only You need to make a field image template.
- the field image template is created by separately separating the regions in which the fields in the policy are located, and each field is correspondingly formed into a field image template, wherein the field image template adopts a rectangular block diagram, that is, a rectangular frame including field information. .
- the fields included in the different policy types are different. Therefore, you need to set the corresponding relationship between the policy type and the field in advance. After obtaining the policy type corresponding to the policy image, you can determine the field image template corresponding to the policy image.
- Step 206 Search for a field area matching the field image template by using an image matching algorithm in the policy image.
- the field area can search for the field area matching the field image template in the policy image by the image matching algorithm.
- the field image template is a known small image, and it is known that there is an area in the policy image having the same size and the same content as the image template of the field, and the matching process is found in the large image of the policy image. The same field area of the image and the coordinate position of the field area in the policy image.
- the field image template T is composed of m ⁇ n pixels
- the policy image S is composed of W ⁇ H pixels
- the field image template is superimposed on the searched policy image S and translated, and the field image template overlay is
- the area searched is called subgraph Sij, where i, j is the coordinates of the upper left corner of the subgraph on the searched policy image S.
- the search range is l ⁇ i ⁇ W-M, l ⁇ j ⁇ H-N.
- the process of matching is to calculate the similarity between the subgraph and the field image template, and the subgraph with the highest similarity is used as the matching field area.
- the similarity between the measurement field image template T and the sub-picture Sij can be calculated by the SAD algorithm.
- SAD Sud of absolute differences
- SAD is an image matching algorithm. Basic idea: the sum of the absolute values of the differences. This algorithm is often used for image block matching. The absolute value of the difference between the corresponding values of each pixel is summed, and the similarity between the two image blocks is evaluated accordingly. The smaller the absolute value of the difference, the higher the similarity.
- Step 208 Determine a target data area corresponding to the field image according to the field area, and extract the target data area.
- the purpose of performing field area matching is to find a target data area corresponding to the field, so as to facilitate subsequent identification of information in the target data area. Therefore, after the field area matching the field image template is found on the policy image, the corresponding target data area is determined on the policy image according to the field area, and then the target data area is extracted. Specifically, first, the coordinate position of the field area on the policy image is determined. Since the shape and size of the field image template are determined (for example, rectangle, length 3 cm, width 1 cm), the matched field area has an image with the field.
- the template is the same size, so you only need to determine the coordinate position of one corner of the field, you can find the coordinate position of the other three corners, and in the process of determining the field area matching the field image template by the image matching algorithm.
- the coordinate position of one of the corners of the field area has been determined, so it is equivalent to determining the coordinate position of the field area on the policy image. Since the field area corresponds to the corresponding target data area, and the target data area also has a fixed size. Therefore, according to the coordinate position of the field area and the size of the target data area, the coordinate position of the target data area can be calculated, and then the target data area can be extracted according to the coordinate position of the target data area.
- the "name” field, its corresponding target data area has been previously specified to be on the right side of the "name” field, and has the same width as the "name” field, for example, the width is 1em, and the length can be preset to 5cm. Therefore, if the coordinate position of the area where the "name" field is located is determined, the coordinate positions of the four corners of the target data area can be obtained.
- Step 210 Identify information in the target data area to obtain editable data information.
- the information in the target data area needs to be identified to obtain editable data information.
- the data information includes text information, digital information, and symbol information.
- the identification of the information in the target data area is mainly divided into two steps. The first step is to detect the area where the data information in the target data area is located, that is, to locate the data information in the target data area to determine the corresponding target area. In the second step, the data information in the target area is identified to obtain editable data information.
- the identification method can adopt the existing picture text recognition technology. For example, OCR (Optical Character Recognition) method can be used for identification.
- the recognition process is to identify the data information in the picture as editable data information.
- Step 212 Store the data information according to the corresponding field information, and complete information entry.
- each policy image has a policy number for uniquely identifying the policy image, and after identifying the data information corresponding to each field information, in addition to storing the data information corresponding to the corresponding field information, The data information corresponding to the same policy image is stored in association, thereby completing information entry.
- the method of information entry is fully automated, and the data information in the policy image can be recorded into the system without manual participation, which not only saves time and labor, but also can process multiple policy images in parallel, thereby improving the efficiency of input. .
- the field image template corresponding to the policy image is obtained according to the policy type of the policy image, and the field image template is matched with the field area in the policy image, and determined.
- the target data area corresponding to the field area is then identified by the information in the target data area to obtain editable data information, and then the data information is automatically entered into the corresponding location.
- the method automatically acquires and identifies the target data area corresponding to each field in the policy image, and then automatically records the identified data information, and the whole process is fully automated, saving time and labor, and improving the accuracy and efficiency of the input.
- the step 206 of searching for a field region matching the field image template by the image matching algorithm in the policy image includes:
- step 206A the field image template is superimposed on the policy image and translated according to a preset rule, and the similarity between the field image template and the corresponding coverage area is calculated by the image matching algorithm.
- the field image matching the field image template is searched on the policy image according to the field image template, and the search method is performed by stacking the field image template on the policy image and performing translation according to a preset rule.
- the preset rule refers to the distance (for example, 0.1 mm) of the image image template to be translated on the policy image, and the direction of the movement. For example, the image can be moved up and down, or left and right.
- the image matching algorithm can use existing matching algorithms, such as the SAD algorithm, the Surf algorithm, and the like.
- the SAD algorithm is taken as an example to illustrate the calculation process.
- Step 206B Determine a field area that matches the field image template according to the calculated similarity.
- the field area matching the field image template is determined by calculating the similarity between the field image template and each sub-picture on the policy image, and then the sub-graph with the highest similarity is used as the matching field area.
- a method for policy information entry comprising:
- Step 402 Obtain a policy image of the information to be entered, and determine a policy type of the policy image.
- Step 404 Determine a field image template corresponding to the policy image according to the policy type.
- Step 406 Search for a field area matching the field image template by using an image matching algorithm in the policy image.
- Step 408 calculating a coordinate position corresponding to the matched field area, where the coordinate position of the field area is the coordinate of each vertex of the field area;
- Step 410 Calculate a coordinate position corresponding to the target data area according to a preset rule according to a coordinate position corresponding to the field area, and extract a corresponding target data area according to the calculated coordinate position corresponding to the target data area.
- Step 412 Identify information in the target data area to obtain editable data information.
- step 414 the data information is stored according to the corresponding field information, and information entry is completed.
- the field image template is superimposed on the searched policy image S, and the field image template covers the searched area, which is called the subgraph Sij, where i, j is the top left corner of the subgraph is searched.
- the coordinates on the policy image S Therefore, once the subgraph matching the subgraph is determined, the coordinates of the upper left corner of the subgraph are determined.
- the coordinate positions of the other three vertices of the corresponding subgraph can be determined, that is, the field area is determined.
- the coordinates of the four corners are then calculated according to preset rules, and the coordinates of the four corners corresponding to the target data region corresponding to the field region are calculated according to the positional relationship between the target data region and the field region and the shape of the target data region itself.
- the size of the specification determines the coordinates of the vertices of the target data area.
- the coordinates of the four vertices of the target data area can be calculated. For example, suppose the coordinates of the four corners of the field in which the "name" field is located are: upper left corner (1,1), lower left corner (1,0), upper right corner (4,1), lower right corner ( 4,0). It is assumed that the target data area corresponding to the field area is on the right side of the field area, and the specification of the target data area is: the width is the same as the field area and the length is 5 cm, then the corresponding target data area can be calculated.
- the coordinates of the four corners are: upper left corner (4, 1), lower left corner (4, 0), upper right corner (9, 1), lower right corner (9, 0).
- the coordinates of the four corners of the target data region are calculated according to the calculation to extract the corresponding target data region.
- the target data area can also be set to other shapes, such as triangles, hexagons, etc., depending on the actual situation.
- the data information in the target data area can be identified, thereby obtaining editable data information, and the data information is stored according to the corresponding field template, thereby completing the entry of the policy information.
- the step 210 of identifying information in the target data region to obtain editable data information includes:
- step 210A the data information in the target data area is located.
- the data information in the target data region image is located, so that the located data information is included in the minimum circumscribed rectangle, that is, the target data.
- the data information in the area is separately extracted by a minimum circumscribed rectangle to obtain a target area picture, wherein the four sides of the minimum circumscribed rectangle are tangent to the uppermost, lowermost, leftmost, and rightmost ends of the data information.
- the target area picture containing the data information is subsequently identified as the identification object.
- step 210B the data information is identified by using a picture text recognition technology to obtain editable data information.
- the extracted minimum circumscribed rectangular image is used as the identification object, and the image character recognition technology is used to include
- the data information is identified to obtain editable data information.
- the image text recognition technology can adopt existing identification methods, for example, OCR (Optical Character Recognition) method can be used for identification.
- OCR Optical Character Recognition
- the process of recognition is to identify the data information in the picture as editable data information.
- the area containing the data information is segmented to form a plurality of sub-images, each of which contains partial data information, and the plurality of sub-images are Recognizing in parallel improves the speed of text recognition.
- the step of obtaining the policy image of the information to be entered, and determining the policy type of the policy image includes: acquiring the policy image of the information to be entered, and extracting the image number of the policy image, according to the preset image number and the policy type The correspondence relationship determines the policy type of the policy image.
- the image number is used to uniquely identify a policy image, wherein the image number may directly use the policy number or may be a number assigned to the image separately.
- the image number is associated with the policy type in advance, for example, the second digit of the image number can be associated with the policy type, for example, if the image number is the second The digit number is 1, which represents the accident insurance. If the second digit is 2, it represents auto insurance.
- an apparatus for entering policy information comprising:
- a type determining module 602 configured to acquire a policy image of the information to be entered, and determine a policy type of the policy image
- the template determining module 604 is configured to determine a field image template corresponding to the policy image according to the policy type
- the searching module 606 is configured to search, by the image matching algorithm, a field area that matches the field image template in the policy image;
- the extracting module 608 is configured to determine a corresponding target data area on the policy image according to the field area, and extract the target data area;
- the identification module 610 is configured to identify information in the target data area to obtain an editable Data information
- the entry module 612 is configured to store the data information according to the corresponding field information, and complete the information entry.
- the search module 606 includes:
- the calculating module 606A is configured to stack the field image template on the policy image and perform translation, and calculate the similarity between the field image template and the corresponding coverage area by using an image matching algorithm;
- the matching module 606B is configured to determine a field area that matches the field image template according to the calculated similarity.
- the device for entering the policy information further includes:
- a coordinate calculation module 607 configured to calculate a coordinate position corresponding to the matched field region, where a coordinate position is a coordinate of each vertex of the field region;
- the extraction module 608 is further configured to calculate a coordinate position corresponding to the target data region according to a preset rule according to the coordinate position corresponding to the field region, and extract a corresponding target data region according to the calculated coordinate position corresponding to the target data region.
- the identification module is further configured to locate data information in the target data area, and use the image text recognition technology to identify the data information to obtain editable data information.
- the type determining module is further configured to acquire a policy image of the information to be entered, extract an image number of the policy image, and determine a policy type of the policy image according to a correspondence between the preset image number and the policy type.
- the various modules in the device for entering the policy information described above may be implemented in whole or in part by software, hardware, and combinations thereof.
- the network interface may be an Ethernet card or a wireless network card.
- the above modules may be embedded in the hardware in the processor or in the memory in the server, or may be stored in the memory in the server, so that the processor calls the corresponding operations of the above modules.
- the processor can be a central processing unit (CPU), a microprocessor, a microcontroller, or the like.
- a computer device is proposed.
- the internal structure of the computer device may correspond to the structure as shown in FIG. 1, that is, the computer device may be either a server or a terminal.
- the searching by the processor, searching for a field area matching the field image template by using an image matching algorithm in the policy image, including: overlaying the field image template on the policy image And performing translation, the image matching method is used to calculate the similarity between the field image template and the corresponding coverage area; and the field area matching the field image template is determined according to the calculated similarity.
- the processor before the determining the target data area on the policy image according to the field area, and extracting the target data area, the processor is further configured to perform the following steps: calculating the matched a coordinate position corresponding to the field area, where a coordinate position is a coordinate of each vertex of the field area;
- the performing, by the processor, the information in the target data area to identify editable data information including: locating data information in the target data area;
- the picture text recognition technology identifies the data information to obtain editable data information.
- the policy image obtained by the processor to obtain the policy image to be entered, determining the policy type of the policy image includes: acquiring a policy image of the information to be entered, and extracting the policy The image number of the policy image; determining the policy type of the policy image according to the correspondence between the preset image number and the policy type.
- a computer readable storage medium having stored thereon computer instructions that, when executed by a processor, implement the steps of: acquiring a policy image of information to be entered, determining a policy type of the policy image Determining, according to the policy type, a field image template corresponding to the policy image; searching, in the policy image, an image region matching the field image template by an image matching algorithm; and the policy image according to the field region Determining a corresponding target data area, extracting the target data area; identifying information in the target data area to obtain editable data information; storing the data information according to the corresponding field information, completing the information Enter.
- the searching by the processor, searching for a field area matching the field image template by using an image matching algorithm in the policy image, including: overlaying the field image template on the policy image And performing translation, the image matching method is used to calculate the similarity between the field image template and the corresponding coverage area; and the field area matching the field image template is determined according to the calculated similarity.
- the processor before the determining the target data area on the policy image according to the field area, and extracting the target data area, the processor is further configured to perform the following steps: calculating the matched a coordinate position corresponding to the field area, where a coordinate position is a coordinate of each vertex of the field area;
- the performing, by the processor, the information in the target data area to identify editable data information including: locating data information in the target data area;
- the picture text recognition technology identifies the data information to obtain editable data information.
- the policy image obtained by the processor to obtain the policy image to be recorded, determining the policy type of the policy image includes: acquiring a policy image of the information to be entered, and extracting the image number of the policy image.
- the policy type of the policy image is determined according to the correspondence between the preset image number and the policy type.
- the storage medium may be a non-volatile storage medium such as a magnetic disk, an optical disk, or a read-only memory (ROM).
Landscapes
- Engineering & Computer Science (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- Business, Economics & Management (AREA)
- Multimedia (AREA)
- Finance (AREA)
- Data Mining & Analysis (AREA)
- Accounting & Taxation (AREA)
- Computer Graphics (AREA)
- Geometry (AREA)
- Evolutionary Biology (AREA)
- Economics (AREA)
- General Engineering & Computer Science (AREA)
- Bioinformatics & Computational Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Development Economics (AREA)
- Evolutionary Computation (AREA)
- Marketing (AREA)
- Strategic Management (AREA)
- Technology Law (AREA)
- General Business, Economics & Management (AREA)
- Image Analysis (AREA)
- Character Input (AREA)
Abstract
Description
Claims (20)
- 一种保单信息录入的方法,包括:获取待录入信息的保单影像,确定所述保单影像的保单类型;根据所述保单类型确定与所述保单影像对应的字段影像模板;在所述保单影像中通过图像匹配算法搜索与所述字段影像模板匹配的字段区域;根据所述字段区域在所述保单影像上确定对应的目标数据区域,提取所述目标数据区域;对所述目标数据区域中的信息进行识别,得到可编辑的数据信息;及将所述数据信息按照所对应的字段信息进行存储,完成信息录入。
- 根据权利要求1所述的方法,其特征在于,所述在所述保单影像中通过图像匹配算法搜索与所述字段影像模板匹配的字段区域,包括:将所述字段影像模板叠放在所述保单影像上并按照预设的规则进行平移,通过图像匹配算法计算字段影像模板与相应的覆盖区域的相似度;根据计算得到的相似度确定与所述字段影像模板匹配的字段区域。
- 根据权利要求1所述的方法,其特征在于,在所述根据所述字段区域在所述保单影像上确定对应的目标数据区域,提取所述目标数据区域之前,还包括:计算匹配到的所述字段区域对应的坐标位置,所述字段区域的坐标位置为字段区域的各个顶点的坐标;在所述保单影像上确定与所述字段区域对应的目标数据区域,提取所述目标数据区域的步骤包括:根据所述字段区域对应的坐标位置,按照预设的规则计算所述目标数据区域对应的坐标位置,根据计算得到的目标数据区域对应的坐标位置提取相应的目标数据区域。
- 根据权利要求1所述的方法,其特征在于,所述对所述目标数据区域中的信息进行识别,得到可编辑的数据信息,包括:对所述目标数据区域中的数据信息进行定位;采用图片文字识别技术对所述数据信息进行识别,得到可编辑的数据信息。
- 根据权利要求1所述的方法,其特征在于,所述获取待录入信息的保单影像,确定所述保单影像的保单类型,包括:获取待录入信息的保单影像,提取所述保单影像的影像编号;根据预设的影像编号和保单类型之间的对应关系,确定所述保单影像的保单类型。
- 一种保单信息录入的装置,包括:类型确定模块,用于获取待录入信息的保单影像,确定所述保单影像的保单类型;模板确定模块,用于根据所述保单类型确定与所述保单影像对应的字段影像模板;搜索模块,用于在所述保单影像中通过图像匹配算法搜索与所述字段影像模板匹配的字段区域;提取模块,用于根据所述字段区域在所述保单影像上确定对应的目标数据区域,提取所述目标数据区域;识别模块,用于对所述目标数据区域中的信息进行识别,得到可编辑的数据信息;及录入模块,用于将所述数据信息按照所对应的字段信息进行存储,完成信息录入。
- 根据权利要求6所述的装置,其特征在于,所述搜索模块包括:计算模块,用于将所述字段影像模板叠放在所述保单影像上并按照预设的规则进行平移,通过图像匹配算法计算字段影像模板与相应的覆盖区域的相似度;匹配模块,用于根据计算得到的相似度确定与所述字段影像模板匹配的 字段区域。
- 根据权利要求6所述的装置,其特征在于,所述装置还包括:坐标计算模块,用于计算匹配到的所述字段区域对应的坐标位置,所述字段区域的坐标位置为字段区域的各个顶点的坐标;所述提取模块还用于根据所述字段区域对应的坐标位置,按照预设的规则计算所述目标数据区域对应的坐标位置,根据计算得到的目标数据区域对应的坐标位置提取相应的目标数据区域。
- 根据权利要求6所述的装置,其特征在于,所述识别模块还用于对所述目标数据区域中的数据信息进行定位,采用图片文字识别技术对所述数据信息进行识别,得到可编辑的数据信息。
- 根据权利要求6所述的装置,其特征在于,所述类型确定模块还用于获取待录入信息的保单影像,提取所述保单影像的影像编号,根据预设的影像编号和保单类型之间的对应关系,确定所述保单影像的保单类型。
- 一种计算机设备,包括存储器和处理器,所述存储器中存储有计算机可读指令,所述计算机可读指令被所述处理器执行时,使得所述处理器执行以下步骤:获取待录入信息的保单影像,确定所述保单影像的保单类型;根据所述保单类型确定与所述保单影像对应的字段影像模板;在所述保单影像中通过图像匹配算法搜索与所述字段影像模板匹配的字段区域;根据所述字段区域在所述保单影像上确定对应的目标数据区域,提取所述目标数据区域;对所述目标数据区域中的信息进行识别,得到可编辑的数据信息;及将所述数据信息按照所对应的字段信息进行存储,完成信息录入。
- 根据权利要求11所述的计算机设备,其特征在于,所述在所述保单影像中通过图像匹配算法搜索与所述字段影像模板匹配的字段区域,包括:将所述字段影像模板叠放在所述保单影像上并按照预设的规则进行平移,通过图像匹配算法计算字段影像模板与相应的覆盖区域的相似度;根据计算得到的相似度确定与所述字段影像模板匹配的字段区域。
- 根据权利要求11所述的计算机设备,其特征在于,在所述根据所述字段区域在所述保单影像上确定对应的目标数据区域,提取所述目标数据区域之前,所述处理器还用于执行以下步骤:计算匹配到的所述字段区域对应的坐标位置,所述字段区域的坐标位置为字段区域的各个顶点的坐标;在所述保单影像上确定与所述字段区域对应的目标数据区域,提取所述目标数据区域的步骤包括:根据所述字段区域对应的坐标位置,按照预设的规则计算所述目标数据区域对应的坐标位置,根据计算得到的目标数据区域对应的坐标位置提取相应的目标数据区域。
- 根据权利要求11所述的计算机设备,其特征在于,所述对所述目标数据区域中的信息进行识别,得到可编辑的数据信息,包括:对所述目标数据区域中的数据信息进行定位;采用图片文字识别技术对所述数据信息进行识别,得到可编辑的数据信息。
- 根据权利要求11所述的计算机设备,其特征在于,所述获取待录入信息的保单影像,确定所述保单影像的保单类型,包括:获取待录入信息的保单影像,提取所述保单影像的影像编号;根据预设的影像编号和保单类型之间的对应关系,确定所述保单影像的保单类型。
- 一个或多个存储有计算机可执行指令的非易失性可读存储介质,所述计算机可执行指令被一个或多个处理器执行时,使得所述一个或多个处理器执行以下步骤:获取待录入信息的保单影像,确定所述保单影像的保单类型;根据所述保单类型确定与所述保单影像对应的字段影像模板;在所述保单影像中通过图像匹配算法搜索与所述字段影像模板匹配的字段区域;根据所述字段区域在所述保单影像上确定对应的目标数据区域,提取所述目标数据区域;对所述目标数据区域中的信息进行识别,得到可编辑的数据信息;及将所述数据信息按照所对应的字段信息进行存储,完成信息录入。
- 根据权利要求16所述的非易失性可读存储介质,其特征在于,所述在所述保单影像中通过图像匹配算法搜索与所述字段影像模板匹配的字段区域,包括:将所述字段影像模板叠放在所述保单影像上并按照预设的规则进行平移,通过图像匹配算法计算字段影像模板与相应的覆盖区域的相似度;根据计算得到的相似度确定与所述字段影像模板匹配的字段区域。
- 根据权利要求16所述的非易失性可读存储介质,其特征在于,在所述根据所述字段区域在所述保单影像上确定对应的目标数据区域,提取所述目标数据区域之前,所述处理器还用于执行以下步骤:计算匹配到的所述字段区域对应的坐标位置,所述字段区域的坐标位置为字段区域的各个顶点的坐标;在所述保单影像上确定与所述字段区域对应的目标数据区域,提取所述目标数据区域的步骤包括:根据所述字段区域对应的坐标位置,按照预设的规则计算所述目标数据区域对应的坐标位置,根据计算得到的目标数据区域对应的坐标位置提取相应的目标数据区域。
- 根据权利要求16所述的非易失性可读存储介质,其特征在于,所述对所述目标数据区域中的信息进行识别,得到可编辑的数据信息,包括:对所述目标数据区域中的数据信息进行定位;采用图片文字识别技术对所述数据信息进行识别,得到可编辑的数据信 息。
- 根据权利要求16所述的非易失性可读存储介质,其特征在于,所述获取待录入信息的保单影像,确定所述保单影像的保单类型,包括:获取待录入信息的保单影像,提取所述保单影像的影像编号;根据预设的影像编号和保单类型之间的对应关系,确定所述保单影像的保单类型。
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710487869.7 | 2017-06-23 | ||
CN201710487869.7A CN107688772B (zh) | 2017-06-23 | 2017-06-23 | 保单信息录入的方法、装置、计算机设备及存储介质 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2018233055A1 true WO2018233055A1 (zh) | 2018-12-27 |
Family
ID=61152701
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2017/100057 WO2018233055A1 (zh) | 2017-06-23 | 2017-08-31 | 保单信息录入的方法、装置、计算机设备及存储介质 |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN107688772B (zh) |
WO (1) | WO2018233055A1 (zh) |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110764721A (zh) * | 2019-09-19 | 2020-02-07 | 北京三快在线科技有限公司 | 模板生成方法、装置、电子设备和计算机可读介质 |
CN111159201A (zh) * | 2019-12-31 | 2020-05-15 | 上海亿保健康管理有限公司 | 数据录入方法、装置和设备 |
CN111199570A (zh) * | 2019-12-31 | 2020-05-26 | 贵阳朗玛信息技术股份有限公司 | 一种可多次发布信息的打码方法及系统 |
CN111813773A (zh) * | 2020-04-22 | 2020-10-23 | 深圳市科陆电子科技股份有限公司 | 一种电网抄表数据存储方法、上传方法、装置和存储装置 |
CN111882429A (zh) * | 2020-07-28 | 2020-11-03 | 中国银行股份有限公司 | 一种银行系统字段长度分段方法和装置 |
CN115034876A (zh) * | 2022-05-26 | 2022-09-09 | 重庆银行股份有限公司 | 基于ocr识别技术的贷款信息审核方法、装置及计算机设备 |
Families Citing this family (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108563618A (zh) * | 2018-03-15 | 2018-09-21 | 链家网(北京)科技有限公司 | 一种纸质文件线上化方法和系统 |
CN108710880A (zh) * | 2018-05-16 | 2018-10-26 | 深圳市众信电子商务交易保障促进中心 | 一种数据抓取方法及终端 |
CN110633457B (zh) * | 2018-06-22 | 2023-05-26 | 广州金山移动科技有限公司 | 一种内容替换方法、装置、电子设备及可读存储介质 |
CN109034735A (zh) * | 2018-07-10 | 2018-12-18 | 安心财产保险有限责任公司 | 电子保单生成方法、装置、设备及存储介质 |
CN109544389A (zh) * | 2018-11-29 | 2019-03-29 | 爱保科技(横琴)有限公司 | 基于保单的智能分析系统 |
CN109783781B (zh) * | 2018-12-13 | 2023-08-11 | 中国平安财产保险股份有限公司 | 基于图像识别的保单录入方法和相关装置 |
CN109840469B (zh) * | 2018-12-13 | 2023-10-24 | 中国平安财产保险股份有限公司 | 基于图像识别的保单录入方法和相关装置 |
CN109918679B (zh) * | 2019-03-22 | 2023-04-11 | 成都晟堃科技有限责任公司 | 一种解析纸质保单数据的方法 |
CN110245530B (zh) * | 2019-05-27 | 2023-03-07 | 苏州迈洛智能系统有限公司 | 一种条码信息录入方法、装置及存储介质 |
CN110490190B (zh) * | 2019-07-04 | 2021-10-26 | 贝壳技术有限公司 | 一种结构化图像文字识别方法及系统 |
CN110362802A (zh) * | 2019-07-18 | 2019-10-22 | 中国工商银行股份有限公司 | 用于将单据信息录入系统的方法、装置、计算设备、介质 |
CN110599338A (zh) * | 2019-08-13 | 2019-12-20 | 上海陆家嘴国际金融资产交易市场股份有限公司 | 交易数据处理方法、装置、计算机设备和存储介质 |
CN110942069A (zh) * | 2019-10-22 | 2020-03-31 | 云南大学 | 车险保单的关键信息提取方法、装置、存储介质及设备 |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101469986A (zh) * | 2007-12-27 | 2009-07-01 | 鸿富锦精密工业(深圳)有限公司 | 模糊边界的自动寻点系统及方法 |
CN105095842A (zh) * | 2014-05-22 | 2015-11-25 | 阿里巴巴集团控股有限公司 | 一种单据的信息识别的方法和装置 |
CN105608452A (zh) * | 2014-11-11 | 2016-05-25 | 金蝶软件(中国)有限公司 | 单据录入的方法和系统 |
-
2017
- 2017-06-23 CN CN201710487869.7A patent/CN107688772B/zh active Active
- 2017-08-31 WO PCT/CN2017/100057 patent/WO2018233055A1/zh active Application Filing
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101469986A (zh) * | 2007-12-27 | 2009-07-01 | 鸿富锦精密工业(深圳)有限公司 | 模糊边界的自动寻点系统及方法 |
CN105095842A (zh) * | 2014-05-22 | 2015-11-25 | 阿里巴巴集团控股有限公司 | 一种单据的信息识别的方法和装置 |
CN105608452A (zh) * | 2014-11-11 | 2016-05-25 | 金蝶软件(中国)有限公司 | 单据录入的方法和系统 |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110764721A (zh) * | 2019-09-19 | 2020-02-07 | 北京三快在线科技有限公司 | 模板生成方法、装置、电子设备和计算机可读介质 |
CN111159201A (zh) * | 2019-12-31 | 2020-05-15 | 上海亿保健康管理有限公司 | 数据录入方法、装置和设备 |
CN111199570A (zh) * | 2019-12-31 | 2020-05-26 | 贵阳朗玛信息技术股份有限公司 | 一种可多次发布信息的打码方法及系统 |
CN111813773A (zh) * | 2020-04-22 | 2020-10-23 | 深圳市科陆电子科技股份有限公司 | 一种电网抄表数据存储方法、上传方法、装置和存储装置 |
CN111882429A (zh) * | 2020-07-28 | 2020-11-03 | 中国银行股份有限公司 | 一种银行系统字段长度分段方法和装置 |
CN111882429B (zh) * | 2020-07-28 | 2023-08-18 | 中国银行股份有限公司 | 一种银行系统字段长度分段方法和装置 |
CN115034876A (zh) * | 2022-05-26 | 2022-09-09 | 重庆银行股份有限公司 | 基于ocr识别技术的贷款信息审核方法、装置及计算机设备 |
Also Published As
Publication number | Publication date |
---|---|
CN107688772A (zh) | 2018-02-13 |
CN107688772B (zh) | 2019-12-17 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2018233055A1 (zh) | 保单信息录入的方法、装置、计算机设备及存储介质 | |
US12019675B2 (en) | Recognizing text in image data | |
CN109492643B (zh) | 基于ocr的证件识别方法、装置、计算机设备及存储介质 | |
CN111476227B (zh) | 基于ocr的目标字段识别方法、装置及存储介质 | |
CN110348294B (zh) | Pdf文档中图表的定位方法、装置及计算机设备 | |
RU2651144C2 (ru) | Ввод данных с изображений документов с фиксированной структурой | |
WO2019169772A1 (zh) | 图片处理方法、电子装置及存储介质 | |
ES2609953T3 (es) | Procedimiento de recorte de imágenes de texto | |
EP3940589B1 (en) | Layout analysis method, electronic device and computer program product | |
WO2021017272A1 (zh) | 病理图像标注方法、装置、计算机设备及存储介质 | |
WO2018233171A1 (zh) | 单证信息录入的方法、装置、计算机设备及存储介质 | |
WO2021051527A1 (zh) | 基于图像分割的文字定位方法、装置、设备及存储介质 | |
CN112396047B (zh) | 训练样本生成方法、装置、计算机设备和存储介质 | |
CN114463567A (zh) | 一种基于区块链的智慧教育作业大数据防抄袭方法与系统 | |
CN110796145B (zh) | 基于智能决策的多证件分割关联方法及相关设备 | |
WO2020232866A1 (zh) | 扫描文本分段方法、装置、计算机设备和存储介质 | |
CN115205113A (zh) | 一种图像拼接方法、装置、设备及存储介质 | |
CN114494751A (zh) | 证照信息识别方法、装置、设备及介质 | |
CN113780116A (zh) | 发票分类方法、装置、计算机设备和存储介质 | |
CN112308057A (zh) | 一种基于文字位置信息的ocr优化方法及系统 | |
CN114220103B (zh) | 图像识别方法、装置、设备及计算机可读存储介质 | |
CN116030472A (zh) | 文字坐标确定方法及装置 | |
CN116092100A (zh) | 文本内容提取方法及装置 | |
Heitzler et al. | A modular process to improve the georeferencing of the Siegfried map | |
TWI384418B (zh) | 採用區域架構之影像處理方法及系統 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 17914332 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 17914332 Country of ref document: EP Kind code of ref document: A1 |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 14/05/2020) |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 17914332 Country of ref document: EP Kind code of ref document: A1 |