CN111259888A

CN111259888A - Image-based information comparison method and device and computer-readable storage medium

Info

Publication number: CN111259888A
Application number: CN202010041070.7A
Authority: CN
Inventors: 罗林锋
Original assignee: Ping An International Smart City Technology Co Ltd
Current assignee: Ping An International Smart City Technology Co Ltd
Priority date: 2020-01-15
Filing date: 2020-01-15
Publication date: 2020-06-09
Anticipated expiration: 2040-01-15
Also published as: CN111259888B; WO2021143058A1

Abstract

The invention relates to an artificial intelligence technology, and discloses an information comparison method based on image processing, which comprises the following steps: receiving an image set to be compared, performing inclination correction operation and image block cutting operation on the image set to be compared to obtain an image block set, performing image classification on the image block set, performing type identification on the image block set subjected to image classification according to an optical character identification technology to obtain a multi-type image set, extracting a standard information image set corresponding to the image set to be compared from the database, and comparing the multi-type image set with the standard information image set to obtain an information comparison result of the image set to be compared. The invention also provides an information comparison device based on image processing and a computer readable storage medium. The invention can realize more accurate information comparison function based on image processing.

Description

Image-based information comparison method and device and computer-readable storage medium

Technical Field

The invention relates to the technical field of artificial intelligence, in particular to an information comparison method and device based on image processing and a computer readable storage medium.

Background

At present, the information acquisition of the image is manually completed or a small part of the information acquisition is read by using a machine, the machine reading still needs manual cooperation, for example, an examinee writes a selected question answer in a specific answer sheet, then the machine adopts a matching algorithm to match the answers in the answer sheet, and because the matching technologies do not relate to image processing, the intelligent degree is not high, and the situation of recognition error is easy to occur in the recognition process, namely, the recognition accuracy is not high.

Disclosure of Invention

The invention provides an information comparison method and device based on image processing and a computer readable storage medium, and mainly aims to solve the problems that the intelligent degree of image information acquisition is not high, and identification errors are easy to occur in the identification process.

In order to achieve the above object, the present invention provides an information comparison method based on image processing, which includes:

receiving an image set to be compared, and performing inclination correction operation and image square cutting operation on the image set to be compared to obtain an image square set;

carrying out image classification on the image square set, and carrying out type identification on the image square set subjected to image classification according to an optical character recognition technology to obtain a multi-type image set;

and extracting a standard information image set corresponding to the image set to be compared from the database, and comparing the multi-type image set with the standard information image set to obtain an information comparison result of the image set to be compared.

Optionally, the performing a tilt correction operation and an image square cutting operation on the image set to be compared to obtain an image square set includes:

constructing a plane coordinate system, projecting the images in the image set to be compared according to the plane coordinate system, and dividing the images in the image set to be compared according to the scales of the plane coordinate system to obtain a plurality of matrix blocks;

sequentially calculating barycentric coordinates of the plurality of matrix blocks;

adjusting the inclination angles of the plurality of operation matrix blocks according to a pre-constructed linear equation and the barycentric coordinate to finish the inclination correction operation;

mapping the image set to be compared after the inclination correction operation is completed in the plane coordinate system;

according to the preset number of squares, dividing the squares in the horizontal direction and the vertical direction of the image set to be compared to obtain a plurality of image squares;

and judging whether characters exist in the image square blocks according to an optical detection technology, and reserving the image square blocks with the characters to obtain the image square block set.

Optionally, the linear equation is:

Y＝a+bX

Y＝c+dX

wherein, b is tg δ,

a and c are arbitrary constants, δ is an inclination angle corresponding to the barycentric coordinate, and (X, Y) represent coordinates in the planar coordinate system.

Optionally, the image classifying the image square block set includes:

performing character cutting on the data of the image square set to obtain a multi-character image set;

extracting character features in the multi-character image set;

and carrying out template matching on the character features and a pre-constructed feature template library to finish the image classification.

Optionally, the extracting, from the database, a standard information image set corresponding to the image set to be compared, and comparing the multi-type image set with the standard information image set includes:

the pre-constructed projection coordinate system is used for segmenting the multi-type image set according to lines to obtain a multi-line image set;

identifying a first character at the beginning of each multi-line image set using the optical character recognition technique;

if the first character is a numeric character, the first character is reserved, if the first character is not a numeric character, the first character is removed until the recognition is completed, and all the numeric characters are collected to obtain a question mark set;

extracting a standard information image set which is the same as the question number set from an answer storage area of the database according to the question number set;

and comparing the standard information image set with the multi-type images according to the optical character recognition technology.

In addition, in order to achieve the above object, the present invention further provides an information comparing device based on image processing, the device including a memory and a processor, the memory storing therein an information comparing program based on image processing executable on the processor, the information comparing program based on image processing, when executed by the processor, implementing the steps of:

Optionally, the linear equation is:

Y＝a+bX

Y＝c+dX

wherein, b is tg δ,

Optionally, the image classifying the image square block set includes:

extracting character features in the multi-character image set;

In addition, to achieve the above object, the present invention also provides a computer readable storage medium, on which an information comparison program based on image processing is stored, the information comparison program based on image processing being executable by one or more processors to implement the steps of the information comparison method based on image processing as described above.

According to the invention, the image set is divided into a plurality of small squares through the inclination correction operation and the image square cutting operation, so that the recognition accuracy of subsequent template matching and optical character recognition is improved on the premise of reducing the subsequent calculation pressure, and meanwhile, the intellectualization degree of image information acquisition is further improved through template matching and optical character recognition and comparison operation. Therefore, the information comparison method and device based on image processing and the computer readable storage medium provided by the invention can achieve the purpose of image information acquisition.

Drawings

Fig. 1 is a schematic flowchart of an information comparison method based on image processing according to an embodiment of the present invention;

fig. 2 is a schematic diagram of an internal structure of an information comparison apparatus based on image processing according to an embodiment of the present invention;

fig. 3 is a block diagram illustrating an information comparison program based on image processing in an information comparison device based on image processing according to an embodiment of the present invention.

The implementation, functional features and advantages of the objects of the present invention will be further explained with reference to the accompanying drawings.

Detailed Description

It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.

The invention provides an information comparison method based on image processing. Fig. 1 is a schematic flow chart of an information comparison method based on image processing according to an embodiment of the present invention. The method may be performed by an apparatus, which may be implemented by software and/or hardware.

In this embodiment, the information comparison method based on image processing includes:

s1, receiving an image set to be compared, and performing inclination correction operation and image square cutting operation on the image set to be compared to obtain an image square set.

Currently, a relatively large field related to image information acquisition is intelligent job correction, so the correction request includes the number and date of the job, such as a correction request of 2019, 9 and 20 for the user, and a correction request of a third job.

Further, the serial number and date of the job are in one-to-one correspondence with the jobs stored in the pre-constructed database, for example, in a certain high, middle and high three simulation test, the three (1) to three (9) high classes are divided into 1-9 job serial numbers according to the class serial numbers, and if the user wants to modify the jobs of the three (6) high classes, the user only needs to input the job serial numbers and the job dates of the three (6) high classes.

Preferably, the image sets to be compared are in the form of image scanning versions, such as three (6) shifts high full-scale simulation test text test paper, which are all uniformly collected and placed in the pre-constructed database.

Since the image set to be compared is generally obtained by image scanning, and image scanning may cause a certain tilt of the obtained image, the tilt correction operation and the image block cutting operation are performed on the image set to be compared first.

In detail, the performing a tilt correction operation and an image square cutting operation on the image set to be compared to obtain an image square set includes: and constructing a plane coordinate system, projecting the images in the image set to be compared according to the plane coordinate system, dividing the images in the image set to be compared according to the scales of the plane coordinate system to obtain a plurality of matrix blocks, sequentially calculating barycentric coordinates of the matrix blocks, and adjusting the inclination angles of the operation matrix blocks according to a pre-constructed linear equation and the barycentric coordinates to finish the inclination correction operation.

Further, the equation of the straight line is:

Y＝a+bX

Y＝c+dX

wherein, b is tg δ,

a, c are arbitrary constants, δ is an inclination angle from the barycentric coordinate, and (X, Y) represent coordinates in the plane coordinate system, and the barycentric coordinate is (P)_i,P_i). The barycentric coordinate X is equal to P_iThe value of Y is obtained by being substituted into the above linear equation and is then compared with P_iAnd adjusting the inclination angle delta to finish the inclination correction operation.

Since the image of the whole block is too large, which has a great influence on the identification and calculation pressure of the subsequent model, it is necessary to perform an image block cutting operation on the image set to be compared. The image square cutting operation comprises: mapping the image set to be compared, which is subjected to the inclination correction operation, in the plane coordinate system, dividing blocks in the horizontal direction and the vertical direction of the image set to be compared according to the number of preset blocks to obtain a plurality of image blocks, judging whether characters exist in the image blocks according to an optical detection technology, and reserving the image blocks with the characters to obtain the image block set.

The optical detection technology can judge whether characters exist in each square block or not based on an optical character recognition principle or a refractive index change principle, if some square blocks do not have characters, the square blocks are blank area square blocks, the correction effect on subsequent intelligent operation is not great, and the correction effect can be directly removed. The principle of refractive index change is to emit light rays with the same incident angle in each picture block, and determine whether the deviation of the refraction angle is greater than a threshold value, thereby determining whether characters exist in the block.

Specifically, the image square block set is represented by the following method:

DDL(I，n，B₁((X₁，Y₁)，(L₁，W₁)，Attri)，B₂((X₂，Y₂)，(L₂，W₂)，Attri，…，B_n((X_n，Y_n)，(L_n，W_n)，Attri))

wherein I is the serial number of each square in the image square set, n is the total number of the image square set, and X_iPosition information in a horizontal direction for each square in the image square set; y is_iFor each block in the set of image blocks, L_iIs the length of the square; w_iIs the width of the square; the Attri represents the attribute of the square, when the Attri is 0, it represents that there is a character in the square, and when the Attri is 1, it represents that there is no character in the square.

And S2, carrying out image classification on the image square set, and carrying out type recognition on the image square set after image classification according to an optical character recognition technology to obtain a multi-type image set.

Preferably, the set of image tiles can be subject to disciplinary classification based on a disciplinary recognition model of optical character recognition technology (OCR technology). If the image square set contains different disciplines (such as Chinese, mathematics, English and the like), the discipline recognition model is used for discipline classification, and the image square set can be divided into a Chinese discipline operation set, a mathematical discipline operation set, an English discipline operation set and other discipline operation sets.

In detail, the S2 includes: and performing character cutting on the data of the image square set to obtain a multi-character image set, extracting character features in the multi-character image set, and performing template matching on the character features and a pre-constructed feature template library to obtain a subject operation set.

Further, the character cutting is to divide the character images in the image square set by line units to obtain a plurality of groups of multi-character image sets by line units. The extraction method of the character features and the character cutting can be completed based on the optical character recognition technology (OCR technology). The template matching may be based on a K-nearest neighbor algorithm.

For example, in homework correction, the question types may include choice questions, blank filling questions, judgment questions, calculation questions, etc., but the answer forms corresponding to different question types are different, for example, the choice questions are A, B, C, D written answers, and the judgment questions are right or wrong, so it is necessary to identify the type.

Preferably, the set of image tiles is also type-recognized based on a type recognition model of optical character recognition technology (OCR technology). The whole topic type identification process is the same as the above S3, and finally, topic type job image sets based on different disciplines are obtained, such as a choice topic job image set and a reading understanding job image set under a Chinese subject job set, a choice topic job image set, a calculation topic job image set and a choice topic job image set and a judgment topic job image set under a mathematic subject job set.

S3, extracting a standard information image set corresponding to the image set to be compared from the database, and comparing the multi-type image set with the standard information image set to obtain an information comparison result of the image set to be compared.

In detail, the extracting of the standard information image set corresponding to the multi-type image set from the answer storage area of the database includes: and identifying the question numbers of the multi-type image sets according to an image identification technology to obtain a question number set, and extracting the standard information image set which is the same as the question number set from a pre-constructed database according to the question number set.

In detail, the identifying the question numbers of the multi-type image set according to the image identification technology to obtain a question number set comprises: and a pre-constructed projection coordinate system is used for segmenting the multi-type image set according to lines to obtain a multi-line image set, the optical character recognition technology is used for recognizing the first character at the beginning in each multi-line image set, if the first character is a digital character, the first character is reserved, if the first character is not a digital character, the first character is removed until the recognition is completed, and all digital characters are collected to obtain a question mark set.

In a preferred embodiment of the present invention, if the characters of the standard information image set having the question-selecting job image set with the question number 4 corresponding to the question 4 under the language subject job set are matched, if the question-selecting job image set with the question 4 is selected as C, the standard information image set with the question 4 is also selected as C, the matching is successful, and if the question-selecting job image set with the question 4 is selected as B, the standard information image set with the question 4 is selected as C, the matching is unsuccessful.

The inventive method described above may preferably be based on a GPU, which is an image processor, a microprocessor dedicated to image operations on personal computers, workstations and some mobile devices. There are many types of GPUs, such as: tetam, RTX, and the like. The speed of the student homework correction can be accelerated through the GPU.

The invention also provides an information comparison device based on image processing. Fig. 2 is a schematic diagram illustrating an internal structure of an information comparison apparatus based on image processing according to an embodiment of the present invention.

In this embodiment, the information comparing apparatus 1 based on image processing may be a PC (personal computer), or a terminal device such as a smart phone, a tablet computer, and a mobile computer, or may be a server. The information comparison device 1 based on image processing at least comprises a memory 11, a processor 12, a communication bus 13 and a network interface 14.

The memory 11 includes at least one type of readable storage medium, which includes a flash memory, a hard disk, a multimedia card, a card type memory (e.g., SD or DX memory, etc.), a magnetic memory, a magnetic disk, an optical disk, and the like. The memory 11 may be an internal storage unit of the image processing based information comparing apparatus 1 in some embodiments, for example, a hard disk of the image processing based information comparing apparatus 1. The memory 11 may also be an external storage device of the image processing-based information comparing apparatus 1 in other embodiments, such as a plug-in hard disk, a Smart Media Card (SMC), a Secure Digital (SD) Card, a flash Card (FlashCard), and the like provided on the image processing-based information comparing apparatus 1. Further, the memory 11 may also include both an internal storage unit of the information comparing apparatus 1 based on image processing and an external storage device. The memory 11 may be used not only to store application software installed in the image processing-based information collating apparatus 1 and various types of data, such as a code of the image processing-based information collating program 01, but also to temporarily store data that has been output or is to be output.

Processor 12, which in some embodiments may be a Central Processing Unit (CPU), controller, microcontroller, microprocessor or other data Processing chip, is configured to execute program codes stored in memory 11 or process data, such as executing image-Processing-based information comparison program 01.

The communication bus 13 is used to realize connection communication between these components.

The network interface 14 may optionally include a standard wired interface, a wireless interface (e.g., WI-FI interface), typically used to establish a communication link between the apparatus 1 and other electronic devices.

Optionally, the apparatus 1 may further comprise a user interface, which may comprise a Display (Display), an input unit such as a Keyboard (Keyboard), and optionally a standard wired interface, a wireless interface. Alternatively, in some embodiments, the display may be an LED display, a liquid crystal display, a touch-sensitive liquid crystal display, an OLED (Organic Light-Emitting Diode) touch device, or the like. The display, which may also be referred to as a display screen or a display unit, is suitable for displaying information processed in the image processing-based information comparison apparatus 1 and for displaying a visualized user interface.

Fig. 2 shows only the image processing-based information comparing apparatus 1 having the components 11 to 14 and the image processing-based information comparing program 01, and it will be understood by those skilled in the art that the structure shown in fig. 1 does not constitute a limitation of the image processing-based information comparing apparatus 1, and may include fewer or more components than those shown, or combine some components, or different arrangement of components.

In the embodiment of the apparatus 1 shown in fig. 2, the memory 11 stores an information comparison program 01 based on image processing; the processor 12 implements the following steps when executing the image processing-based information matching program 01 stored in the memory 11:

the method comprises the steps of firstly, receiving an image set to be compared, and carrying out inclination correction operation and image square cutting operation on the image set to be compared to obtain an image square set.

Further, the equation of the straight line is:

Y＝a+bX

Y＝c+dX

wherein, b is tg δ,

a, c are arbitrary constants, δ is an inclination angle from the barycentric coordinate, and (X, Y) represent coordinates in the plane coordinate system, and the barycentric coordinate is (P)_i，P_i). The barycentric coordinate X is equal to P_iThe value of Y is obtained by being substituted into the above linear equation and is then compared with P_iAnd adjusting the inclination angle delta to finish the inclination correction operation.

And secondly, carrying out image classification on the image block set, and carrying out type recognition on the image block set subjected to image classification according to an optical character recognition technology to obtain a multi-type image set.

In detail, the second step includes: and performing character cutting on the data of the image square set to obtain a multi-character image set, extracting character features in the multi-character image set, and performing template matching on the character features and a pre-constructed feature template library to obtain a subject operation set.

And step three, extracting a standard information image set corresponding to the image set to be compared from the database, and comparing the multi-type image set with the standard information image set to obtain an information comparison result of the image set to be compared.

Alternatively, in other embodiments, the information comparison program based on image processing may be further divided into one or more modules, and the one or more modules are stored in the memory 11 and executed by one or more processors (in this embodiment, the processor 12) to implement the present invention.

For example, referring to fig. 3, a schematic diagram of program modules of an image processing-based information comparison program in an embodiment of the image processing-based information comparison apparatus according to the present invention is shown, in this embodiment, the image processing-based information comparison program may be divided into a data receiving and block dividing module 10, a classification module 20, a type identification module 30, and an information result output module 40, which exemplarily:

the data receiving and sorting module 10 is configured to: receiving an image set to be compared, and performing inclination correction operation and image square cutting operation on the image set to be compared to obtain an image square set.

The classification module 20 is configured to: and carrying out image classification on the image square block set to obtain a classified image square block set.

The type identification module 30 is configured to: and according to an optical character recognition technology, carrying out type recognition on the image block set after the image classification to obtain a multi-type image set.

The job correction result output module 40 is configured to: and extracting a standard information image set corresponding to the image set to be compared from the database, and comparing the multi-type image set with the standard information image set to obtain an information comparison result of the image set to be compared.

The functions or operation steps implemented by the program modules such as the data receiving and block cutting module 10, the classification module 20, the type identification module 30, and the information result output module 40 are substantially the same as those of the above embodiments, and are not described herein again.

Furthermore, an embodiment of the present invention further provides a computer-readable storage medium, where an information comparison program based on image processing is stored, and the information comparison program based on image processing is executable by one or more processors to implement the following operations:

receiving an image set to be compared, and performing inclination correction operation and image square cutting operation on the image set to be compared to obtain an image square set.

And carrying out image classification on the image square block set to obtain a classified image square block set.

And according to an optical character recognition technology, carrying out type recognition on the image block set after the image classification to obtain a multi-type image set.

It should be noted that the above-mentioned numbers of the embodiments of the present invention are merely for description, and do not represent the merits of the embodiments. And the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, apparatus, article, or method that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, apparatus, article, or method. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other like elements in a process, apparatus, article, or method that includes the element.

Through the above description of the embodiments, those skilled in the art will clearly understand that the method of the above embodiments can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware, but in many cases, the former is a better implementation manner. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium (e.g., ROM/RAM, magnetic disk, optical disk) as described above and includes instructions for enabling a terminal device (e.g., a mobile phone, a computer, a server, or a network device) to execute the method according to the embodiments of the present invention.

The above description is only a preferred embodiment of the present invention, and not intended to limit the scope of the present invention, and all modifications of equivalent structures and equivalent processes, which are made by using the contents of the present specification and the accompanying drawings, or directly or indirectly applied to other related technical fields, are included in the scope of the present invention.

Claims

1. An information comparison method based on image processing is characterized in that the method comprises the following steps:

2. The method as claimed in claim 1, wherein the performing the tilt correction operation and the image block cutting operation on the image set to be compared to obtain an image block set comprises:

3. The image processing-based information comparison method according to claim 2, wherein the linear equation is:

Y＝a+bX

Y＝c+dX

wherein, b is tg δ,

4. The method for comparing information based on image processing as claimed in claim 1, wherein said classifying the image block set comprises:

extracting character features in the multi-character image set;

5. The method according to any one of claims 1 to 4, wherein the extracting a standard information image set corresponding to the image set to be compared from the database, comparing the multi-type image set and the standard information image set, comprises:

6. An information comparison device based on image processing, comprising a memory and a processor, wherein the memory stores an information comparison program based on image processing, which can run on the processor, and when the information comparison program based on image processing is executed by the processor, the following steps are implemented:

7. The image-processing-based information matching device of claim 6, wherein the performing the tilt correction operation and the image block cutting operation on the image set to be matched to obtain an image block set comprises:

8. The image-processing-based information comparing device according to claim 7, wherein the linear equation is:

Y＝a+bX

Y＝c+dX

wherein, b is tg δ,

9. The apparatus according to claim 6, wherein the image classification of the image square set comprises:

extracting character features in the multi-character image set;

10. A computer-readable storage medium, wherein an image processing-based information comparison program is stored on the computer-readable storage medium, and the image processing-based information comparison program can be executed by one or more processors to implement the steps of the image processing-based information comparison method according to any one of claims 1 to 5.