WO2021143058A1 - Image-based information comparison method, apparatus, electronic device, and computer-readable storage medium - Google Patents

Image-based information comparison method, apparatus, electronic device, and computer-readable storage medium Download PDF

Info

Publication number
WO2021143058A1
WO2021143058A1 PCT/CN2020/099066 CN2020099066W WO2021143058A1 WO 2021143058 A1 WO2021143058 A1 WO 2021143058A1 CN 2020099066 W CN2020099066 W CN 2020099066W WO 2021143058 A1 WO2021143058 A1 WO 2021143058A1
Authority
WO
WIPO (PCT)
Prior art keywords
image
image set
compared
character
coordinate system
Prior art date
Application number
PCT/CN2020/099066
Other languages
French (fr)
Chinese (zh)
Inventor
罗林锋
Original Assignee
平安国际智慧城市科技股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 平安国际智慧城市科技股份有限公司 filed Critical 平安国际智慧城市科技股份有限公司
Publication of WO2021143058A1 publication Critical patent/WO2021143058A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/24Aligning, centring, orientation detection or correction of the image
    • G06V10/243Aligning, centring, orientation detection or correction of the image by compensating for image skew or non-uniform image deformations
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/74Image or video pattern matching; Proximity measures in feature spaces
    • G06V10/75Organisation of the matching processes, e.g. simultaneous or sequential comparisons of image or video features; Coarse-fine approaches, e.g. multi-scale approaches; using context analysis; Selection of dictionaries
    • G06V10/751Comparing pixel values or logical combinations thereof, or feature values having positional relevance, e.g. template matching

Definitions

  • This application relates to the field of artificial intelligence technology, and in particular to an information comparison method, device, electronic device, and computer-readable storage medium based on image processing.
  • the inventor realizes that the current image information acquisition mostly relies on manual completion or a small part of the use of machine reading.
  • the machine reading still requires manual cooperation. For example, candidates write multiple-choice answers in a specific answer sheet, and then the machine uses matching The algorithm matches the answers in the answer sheet. Because these matching technologies do not involve image processing, they are not highly intelligent, and recognition errors are prone to occur during the recognition process, that is, the recognition accuracy is not high.
  • This application provides an information comparison method, device, electronic device, and computer-readable storage medium based on image processing. Its main purpose is to solve the problem of low intelligence in image information acquisition and easy recognition errors during the recognition process. problem.
  • an information comparison method based on image processing includes:
  • the standard information image set corresponding to the image set to be compared is extracted from the database, and the multi-type image set is compared with the standard information image set to obtain an information comparison result of the image set to be compared.
  • the present application also provides an electronic device including a memory and a processor, and the memory stores an information comparison program based on image processing that can run on the processor.
  • the image processing-based information comparison program is executed by the processor, the following steps are implemented:
  • the standard information image set corresponding to the image set to be compared is extracted from the database, and the multi-type image set is compared with the standard information image set to obtain an information comparison result of the image set to be compared.
  • the present application also provides a computer-readable storage medium on which is stored an information comparison program based on image processing, and the information comparison program based on image processing can be One or more processors are executed to implement the steps of the image processing-based information comparison method as described below:
  • the standard information image set corresponding to the image set to be compared is extracted from the database, and the multi-type image set is compared with the standard information image set to obtain an information comparison result of the image set to be compared.
  • the present application also provides an information comparison device based on image processing, and the sequence of the information comparison device based on image processing includes:
  • the data receiving and block cutting module is used to: receive the image set to be compared, and perform a tilt correction operation and an image block cutting operation on the image set to be compared to obtain an image block set;
  • the classification module is configured to: perform image classification on the image block set to obtain a classified image block set;
  • the type recognition module is configured to: perform type recognition on the image block set after the image classification according to the optical character recognition technology to obtain a multi-type image set;
  • the job correction result output module is used to: extract the standard information image set corresponding to the image set to be compared from the database, compare the multi-type image set and the standard information image set to obtain the The result of comparing the information of the image set.
  • This application divides the image set into multiple small squares through the tilt correction operation and the image block cutting operation, and under the premise of reducing the pressure of subsequent calculations, it also improves the recognition accuracy of subsequent template matching and optical character recognition. At the same time, it first passes the template matching And optical character recognition, and then through the comparison operation to further improve the degree of intelligence of image information acquisition. Therefore, the image processing-based information comparison method, device, and computer-readable storage medium proposed in this application can achieve the purpose of image information acquisition.
  • FIG. 1 is a schematic flowchart of an information comparison method based on image processing provided by an embodiment of this application;
  • FIG. 2 is a schematic diagram of the internal structure of an electronic device provided by an embodiment of the application.
  • FIG. 3 is a schematic diagram of modules of an information comparison program based on image processing in an information comparison device based on image processing provided by an embodiment of the application.
  • This application provides an information comparison method based on image processing.
  • FIG. 1 it is a schematic flowchart of an information comparison method based on image processing provided by an embodiment of this application.
  • the method can be executed by a device, and the device can be implemented by software and/or hardware.
  • the information comparison method based on image processing includes:
  • the correction request includes the job number and date. For example, the user enters a correction request for the third batch of jobs on September 20, 2019.
  • the number and date of the homework are in a one-to-one correspondence with the homework stored in the pre-built database. For example, a senior high school simulation test for high school three (1) to high school (9) class according to the class The number is divided into 1-9 job numbers. If the user wants to correct the job of the third grade (6) class, he only needs to enter the job number and the job date of the third grade (6) class.
  • the image set to be compared is in the form of a scanned image version, such as the text test papers of the whole class mock examination of the third grade (6) class, which are all collected and placed in the pre-built database.
  • the image set to be compared is generally obtained through image scanning, and image scanning may cause a certain tilt of the obtained image, first perform the tilt correction operation and the image block cutting operation on the image set to be compared .
  • performing the tilt correction operation and the image block cutting operation on the image set to be compared to obtain the image block set includes: constructing a plane coordinate system, and arranging the images in the image set to be compared according to the plane The coordinate system is projected, and the images in the image set to be compared are divided according to the scale of the plane coordinate system to obtain a plurality of matrix blocks, and the barycentric coordinates of the plurality of matrix blocks are calculated in turn, according to the pre-built linear equation Adjust the inclination angles of the multiple work matrix blocks with the coordinates of the center of gravity to complete the inclination correction operation.
  • the inclination angle [delta] is compared with the center of gravity coordinates
  • (X, Y) represents the coordinate of the plane coordinate system
  • the tilt angle ⁇ is adjusted by the error with P i to complete the tilt correction operation.
  • the image block cutting operation includes: mapping the image set to be compared after the tilt correction operation is completed in the plane coordinate system, and leveling the image set to be compared according to a preset number of squares A plurality of image squares are obtained by dividing the squares in the direction and the vertical direction. According to the optical detection technology, it is determined whether there are characters in the image squares, and the image squares with the characters are retained to obtain the image square set.
  • the optical detection technology can use the principle of optical character recognition or the principle of refractive index change to determine whether there is text in each square. If there are no texts in some squares, it is a square with a blank area, which has little effect on subsequent smart work corrections. Can be removed directly.
  • the principle of changing the refractive index is to emit light rays with the same incident angle in each picture block, and determine whether the deviation of the refraction angle is greater than a threshold value, thereby determining whether there is text in the block.
  • the representation method of the image block set is:
  • I is the block number of each block within the image set
  • n is the total number of blocks of the image set
  • X i for each block within the image block position information set in the horizontal direction
  • Y i is the each block within the current image block location information in the vertical direction
  • L i is the length of the block
  • W i is the width of the block
  • Attri representation attribute box when Attri is 0, which indicates the block has a character, Attri 1 , It means that there is no text in the box.
  • the subject classification of the image block set can be performed based on a subject recognition model of optical character recognition technology (OCR technology).
  • OCR technology optical character recognition technology
  • the subject recognition model is used to classify subjects, and the image block set can be split into a set of Chinese subjects and mathematics subjects. Sets, English subject work sets, and other subject work sets.
  • the S2 includes: performing character cutting on the data of the image block set to obtain a multi-character image set, extracting character features in the multi-character image set, and comparing the character features with a pre-built feature template library.
  • the template matching obtains the subject assignment set.
  • the character cutting is to segment the text images in the image block set in units of lines to obtain multiple sets of multi-character image sets in units of lines.
  • Both the character feature extraction method and the character cutting can be completed based on the optical character recognition technology (OCR technology).
  • OCR technology optical character recognition technology
  • the template matching may be based on the K-nearest neighbor algorithm for matching.
  • the question types may include multiple-choice questions, fill-in-the-blank questions, true or false questions, calculation questions, etc., and different question types correspond to different answer forms.
  • multiple-choice questions are written in A, B, C, and D.
  • True or False questions are judged in the form of right and wrong, so it is extremely necessary to carry out type recognition.
  • the type recognition of the image block set can also be performed based on the type recognition model of the optical character recognition technology (OCR technology).
  • OCR technology optical character recognition technology
  • the recognition process of the entire question type is the same as the above S3, and finally a collection of image assignments based on different disciplines is obtained. For example, there are multiple-choice homework image collections and reading comprehension homework image collections under the Chinese language homework collection, and choices under the mathematics homework collection There are multiple-choice homework image collections, judgment homework image collections, etc. under the title homework image collection, calculation homework image collection, and English subject homework collection.
  • the extracting the standard information image set corresponding to the multi-type image set from the answer storage area of the database includes: identifying the question number of the multi-type image set according to the image recognition technology to obtain the question number set, Extracting the same standard information image set as the question number set from a pre-built database according to the question number set.
  • the question number set obtained by recognizing the question numbers of the multi-type image set according to the image recognition technology includes: a pre-built projection coordinate system, and performing row-wise segmentation on the multi-type image set to obtain a multi-line image set,
  • the optical character recognition technology is used to recognize the first character at the beginning of each multi-line image set. If the first character is a numeric character, it will be retained, and if the first character is not a numeric character, it will be eliminated until it is recognized. Complete, sum up all the numeric characters to get the question number set.
  • the fourth question of the homework image set of Chinese subjects with multiple choice questions is matched with the standard information image set of the corresponding question number of the fourth question, if the homework image set of multiple choice questions is the first If the 4th question is selected as C, and the fourth question of the standard information image set is also C, the matching is successful. If the 4th question of the multiple-choice job image set is selected as B, the fourth question of the standard information image set is C, Then the match was unsuccessful.
  • the above-mentioned inventive method may preferably be based on a GPU.
  • the GPU is an image processor, which is a microprocessor that specializes in image operations on personal computers, workstations and some mobile devices. There are many types of the GPU, such as Titan, RTX, etc.
  • the GPU can speed up the correction of the student's homework.
  • the application also provides an electronic device.
  • FIG. 2 it is a schematic diagram of the internal structure of an electronic device provided by an embodiment of this application.
  • the electronic device 1 may be a PC (Personal Computer, personal computer), or a terminal device such as a smart phone, a tablet computer, or a portable computer, or a server.
  • the electronic device 1 at least includes a memory 11, a processor 12, a communication bus 13, and a network interface 14.
  • the memory 11 includes at least one type of readable storage medium, and the readable storage medium includes flash memory, hard disk, multimedia card, card-type memory (for example, SD or DX memory, etc.), magnetic memory, magnetic disk, optical disk, and the like.
  • the memory 11 may be an internal storage unit of the electronic device 1 in some embodiments, such as a hard disk of the electronic device 1.
  • the memory 11 may also be an external storage device of the electronic device 1, such as a plug-in hard disk equipped on the electronic device 1, a smart media card (SMC), or a secure digital (SD) Card, Flash Card, etc.
  • the memory 11 may also include both an internal storage unit of the electronic device 1 and an external storage device.
  • the memory 11 can be used not only to store application software and various data installed in the electronic device 1, such as the code of the information comparison program 01 based on image processing, etc., but also to temporarily store data that has been output or will be output.
  • the processor 12 may be a central processing unit (CPU), controller, microcontroller, microprocessor, or other data processing chip, for running program codes or processing stored in the memory 11 Data, such as information comparison program 01 based on image processing.
  • CPU central processing unit
  • controller microcontroller
  • microprocessor or other data processing chip, for running program codes or processing stored in the memory 11 Data, such as information comparison program 01 based on image processing.
  • the communication bus 13 is used to realize the connection and communication between these components.
  • the network interface 14 may optionally include a standard wired interface and a wireless interface (such as a WI-FI interface), and is usually used to establish a communication connection between the device 1 and other electronic devices.
  • the device 1 may also include a user interface.
  • the user interface may include a display (Display) and an input unit such as a keyboard (Keyboard).
  • the optional user interface may also include a standard wired interface and a wireless interface.
  • the display may be an LED display, a liquid crystal display, a touch-sensitive liquid crystal display, an OLED (Organic Light-Emitting Diode, organic light-emitting diode) touch device, etc.
  • the display can also be appropriately called a display screen or a display unit, which is used to display the information processed in the electronic device 1 and to display a visualized user interface.
  • FIG. 2 only shows the electronic device 1 with components 11-14 and the information comparison program 01 based on image processing.
  • FIG. 1 does not constitute a limitation on the electronic device 1 It may include fewer or more components than shown, or a combination of some components, or a different component arrangement.
  • the memory 11 stores an information comparison program 01 based on image processing; when the processor 12 executes the information comparison program 01 stored in the memory 11, the following steps are implemented:
  • Step 1 Receive an image set to be compared, and perform a tilt correction operation and an image block cutting operation on the image set to be compared to obtain an image block set.
  • the correction request includes the job number and date. For example, the user enters a correction request for the third batch of jobs on September 20, 2019.
  • the number and date of the homework are in a one-to-one correspondence with the homework stored in the pre-built database. For example, a senior high school simulation test for high school three (1) to high school (9) class according to the class The number is divided into 1-9 job numbers. If the user wants to correct the job of the third grade (6) class, he only needs to enter the job number and the job date of the third grade (6) class.
  • the image set to be compared is in the form of a scanned image version, such as the text test papers of the whole class mock examination of the third grade (6) class, which are all collected and placed in the pre-built database.
  • the image set to be compared is generally obtained through image scanning, and image scanning may cause a certain tilt of the obtained image, first perform the tilt correction operation and the image block cutting operation on the image set to be compared .
  • performing the tilt correction operation and the image block cutting operation on the image set to be compared to obtain the image block set includes: constructing a plane coordinate system, and arranging the images in the image set to be compared according to the plane The coordinate system is projected, and the images in the image set to be compared are divided according to the scale of the plane coordinate system to obtain a plurality of matrix blocks, and the barycentric coordinates of the plurality of matrix blocks are calculated in turn, according to the pre-built linear equation Adjust the inclination angles of the multiple work matrix blocks with the coordinates of the center of gravity to complete the inclination correction operation.
  • the inclination angle [delta] is compared with the center of gravity coordinates
  • (X, Y) represents the coordinate of the plane coordinate system
  • the tilt angle ⁇ is adjusted by the error with P i to complete the tilt correction operation.
  • the image block cutting operation includes: mapping the image set to be compared after the tilt correction operation is completed in the plane coordinate system, and leveling the image set to be compared according to a preset number of squares A plurality of image squares are obtained by dividing the squares in the direction and the vertical direction. According to the optical detection technology, it is determined whether there are characters in the image squares, and the image squares with the characters are retained to obtain the image square set.
  • the optical detection technology can use the principle of optical character recognition or the principle of refractive index change to determine whether there is text in each square. If there are no texts in some squares, it is a square with a blank area, which has little effect on subsequent smart work corrections. Can be removed directly.
  • the principle of changing the refractive index is to emit light with the same incident angle in each picture block, and determine whether the deviation of the refraction angle is greater than a threshold value, thereby determining whether there are text in the block.
  • the representation method of the image block set is:
  • I is the block number of each block within the image set
  • n is the total number of blocks of the image set
  • X i for each block within the image block position information set in the horizontal direction
  • Y i is the each block within the current image block location information in the vertical direction
  • L i is the length of the block
  • W i is the width of the block
  • Attri representation attribute box when Attri is 0, which indicates the block has a character, Attri 1 , It means that there is no text in the box.
  • Step 2 Perform image classification on the image block set, and perform type recognition on the image block set after the image classification according to the optical character recognition technology to obtain a multi-type image set.
  • the subject classification of the image block set can be performed based on a subject recognition model of optical character recognition technology (OCR technology).
  • OCR technology optical character recognition technology
  • the subject recognition model is used to classify subjects, and the image block set can be split into a set of Chinese subjects and mathematics subjects. Sets, English subject work sets, and other subject work sets.
  • the second step includes: cutting the data of the image block set to obtain a multi-character image set, extracting character features in the multi-character image set, and combining the character features with a pre-built feature template library Perform template matching to get the subject assignment set.
  • the character cutting is to segment the text images in the image block set in units of lines to obtain multiple sets of multi-character image sets in units of lines.
  • Both the character feature extraction method and the character cutting can be completed based on the optical character recognition technology (OCR technology).
  • OCR technology optical character recognition technology
  • the template matching may be based on the K-nearest neighbor algorithm for matching.
  • the question types may include multiple-choice questions, fill-in-the-blank questions, true or false questions, calculation questions, etc., and different question types correspond to different answer forms.
  • multiple-choice questions are written in A, B, C, and D.
  • True or False questions are judged in the form of right and wrong, so it is extremely necessary to carry out type recognition.
  • the type recognition of the image block set can also be performed based on the type recognition model of the optical character recognition technology (OCR technology).
  • OCR technology optical character recognition technology
  • the recognition process of the entire question type is the same as the above S3, and finally a collection of image assignments based on different disciplines is obtained. For example, there are multiple-choice homework image collections and reading comprehension homework image collections under the Chinese language homework collection, and choices under the mathematics homework collection There are multiple-choice homework image collections, judgment homework image collections, etc. under the task homework image collection, calculation homework image collection and English subject homework collection.
  • Step 3 Extract the standard information image set corresponding to the image set to be compared from the database, compare the multi-type image set and the standard information image set to obtain the information comparison of the image set to be compared result.
  • the extracting the standard information image set corresponding to the multi-type image set from the answer storage area of the database includes: identifying the question number of the multi-type image set according to the image recognition technology to obtain the question number set, Extracting the same standard information image set as the question number set from a pre-built database according to the question number set.
  • the question number set obtained by recognizing the question numbers of the multi-type image set according to the image recognition technology includes: a pre-built projection coordinate system, and performing row-wise segmentation on the multi-type image set to obtain a multi-line image set,
  • the optical character recognition technology is used to recognize the first character at the beginning of each multi-line image set. If the first character is a numeric character, it will be retained, and if the first character is not a numeric character, it will be eliminated until it is recognized. Complete, sum up all the numeric characters to get the question number set.
  • the fourth question of the homework image set of Chinese subjects with multiple choice questions is matched with the standard information image set of the corresponding question number of the fourth question, if the homework image set of multiple choice questions is the first If the 4th question is selected as C, and the fourth question of the standard information image set is also C, the matching is successful. If the 4th question of the multiple-choice job image set is selected as B, the fourth question of the standard information image set is C, Then the match was unsuccessful.
  • the above-mentioned inventive method may preferably be based on a GPU.
  • the GPU is an image processor, which is a microprocessor that specializes in image operations on personal computers, workstations and some mobile devices. There are many types of the GPU, such as Titan, RTX, etc.
  • the GPU can speed up the correction of the student's homework.
  • the image processing-based information comparison program can also be divided into one or more modules, and the one or more modules are stored in the memory 11 and run by one or more processors ( This embodiment is executed by the processor 12) to complete this application.
  • the module referred to in this application refers to a series of computer program instruction segments that can complete specific functions, and is used to describe the image processing-based information comparison program in an electronic device. The implementation process.
  • FIG. 3 a schematic diagram of the program modules of the image processing-based information comparison program in an embodiment of the image processing-based information comparison apparatus of this application.
  • the image processing-based information comparison program can be divided into a data receiving and block cutting module 10, a classification module 20, a type recognition module 30, and an information result output module 40.
  • a data receiving and block cutting module 10 a data receiving and block cutting module 10
  • a classification module 20 a classification module 20
  • a type recognition module 30 a type recognition module
  • an information result output module 40 Illustratively:
  • the data receiving and classification module 10 is configured to receive an image set to be compared, and perform a tilt correction operation and an image block cutting operation on the image set to be compared to obtain an image block set.
  • the classification module 20 is configured to: perform image classification on the image block set to obtain a classified image block set.
  • the type recognition module 30 is configured to perform type recognition on the image block set after the image classification according to the optical character recognition technology to obtain a multi-type image set.
  • the job correction result output module 40 is configured to: extract a standard information image set corresponding to the image set to be compared from the database, compare the multi-type image set and the standard information image set, to obtain the The information comparison result of the image set to be compared.
  • the embodiment of the present application also proposes a computer-readable storage medium.
  • the computer-readable storage medium may be non-volatile or volatile.
  • the computer-readable storage medium stores image processing-based An information comparison program, the image processing-based information comparison program can be executed by one or more processors to implement the following operations:
  • the image set to be compared is received, and the image set to be compared is subjected to a tilt correction operation and an image block cutting operation to obtain an image block set.
  • Image classification is performed on the image block set to obtain a classified image block set.
  • the image block set after the classification of the image is type-recognized to obtain a multi-type image set.
  • the standard information image set corresponding to the image set to be compared is extracted from the database, and the multi-type image set is compared with the standard information image set to obtain an information comparison result of the image set to be compared.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Data Mining & Analysis (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • General Engineering & Computer Science (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Health & Medical Sciences (AREA)
  • Computing Systems (AREA)
  • Databases & Information Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Software Systems (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Character Input (AREA)

Abstract

Provided are an image-based information comparison method and apparatus, and a computer-readable storage medium, said method comprising: receiving an image set to be compared, and performing a tilt correction operation and an image block cutting operation on the image set to be compared to obtain an image block set (S1); classifying said image block set into images, and on the basis of optical character recognition technology, performing type recognition on the image block set after image classification to obtain a multi-type image set (S2); extracting from a database a standard information image set corresponding to the image set to be compared, and comparing the multi-type image set and the standard information image set to obtain an information comparison result of the image set to be compared (S3). The method can achieve a more accurate information comparison function based on image processing.

Description

基于图像的信息比对方法、装置、电子设备及计算机可读存储介质Image-based information comparison method, device, electronic equipment and computer readable storage medium
本申请要求于2020年01月15日提交中国专利局、申请号为202010041070.7、发明名称为“基于图像的信息比对方法、装置及计算机可读存储介质”的中国专利申请的优先权,其全部内容通过引用结合在申请中。This application claims the priority of a Chinese patent application filed with the Chinese Patent Office, the application number is 202010041070.7, and the invention title is "image-based information comparison method, device and computer-readable storage medium" on January 15, 2020. All of them The content is incorporated in the application by reference.
技术领域Technical field
本申请涉及人工智能技术领域,尤其涉及一种基于基于图像处理的信息比对方法、装置、电子设备及计算机可读存储介质。This application relates to the field of artificial intelligence technology, and in particular to an information comparison method, device, electronic device, and computer-readable storage medium based on image processing.
背景技术Background technique
发明人意识到目前图像的信息获取多依赖人工完成或少部分使用机器读取,所述机器读取多依然需要人工配合,如考生将选择题答案撰写在特定的答题卷内,然后机器采用匹配算法匹配答题卷内的答案,由于这些匹配技术并没有涉及到图像处理,所以智能化程度不高,且在识别的过程中容易出现识别出错的情况,即识别的准确率不高。The inventor realizes that the current image information acquisition mostly relies on manual completion or a small part of the use of machine reading. The machine reading still requires manual cooperation. For example, candidates write multiple-choice answers in a specific answer sheet, and then the machine uses matching The algorithm matches the answers in the answer sheet. Because these matching technologies do not involve image processing, they are not highly intelligent, and recognition errors are prone to occur during the recognition process, that is, the recognition accuracy is not high.
发明内容Summary of the invention
本申请提供一种基于图像处理的信息比对方法、装置、电子设备及计算机可读存储介质,其主要目的在于解决图像信息获取智能化程度不高,且在识别的过程中容易出现识别出错的问题。This application provides an information comparison method, device, electronic device, and computer-readable storage medium based on image processing. Its main purpose is to solve the problem of low intelligence in image information acquisition and easy recognition errors during the recognition process. problem.
为实现上述目的,本申请提供的一种基于图像处理的信息比对方法,包括:In order to achieve the above objectives, an information comparison method based on image processing provided by this application includes:
接收待比对图像集,将所述待比对图像集进行倾斜矫正操作及图像方块切割操作,得到图像方块集;Receiving an image set to be compared, and performing a tilt correction operation and an image block cutting operation on the image set to be compared to obtain an image block set;
将所述图像方块集进行图像分类,并根据光学字符识别技术,将所述图像分类后的图像方块集进行类型识别得到多类型图像集;Performing image classification on the image block set, and performing type recognition on the image block set after the image classification according to optical character recognition technology to obtain a multi-type image set;
从所述数据库中提取与所述待比对图像集对应的标准信息图像集,对比所述多类型图像集和所述标准信息图像集,得到所述待比对图像集的信息对比结果。The standard information image set corresponding to the image set to be compared is extracted from the database, and the multi-type image set is compared with the standard information image set to obtain an information comparison result of the image set to be compared.
此外,为实现上述目的,本申请还提供一种电子设备,该电子设备包括存储器和处理器,所述存储器中存储有可在所述处理器上运行的基于图像处理的信息比对程序,所述基于图像处理的信息比对程序被所述处理器执行时实现如下步骤:In addition, in order to achieve the above object, the present application also provides an electronic device including a memory and a processor, and the memory stores an information comparison program based on image processing that can run on the processor. When the image processing-based information comparison program is executed by the processor, the following steps are implemented:
接收待比对图像集,将所述待比对图像集进行倾斜矫正操作及图像方块切割操作,得到图像方块集;Receiving an image set to be compared, and performing a tilt correction operation and an image block cutting operation on the image set to be compared to obtain an image block set;
将所述图像方块集进行图像分类,并根据光学字符识别技术,将所述图像分类后的图像方块集进行类型识别得到多类型图像集;Performing image classification on the image block set, and performing type recognition on the image block set after the image classification according to optical character recognition technology to obtain a multi-type image set;
从所述数据库中提取与所述待比对图像集对应的标准信息图像集,对比所述多类型图像集和所述标准信息图像集,得到所述待比对图像集的信息对比结果。The standard information image set corresponding to the image set to be compared is extracted from the database, and the multi-type image set is compared with the standard information image set to obtain an information comparison result of the image set to be compared.
此外,为实现上述目的,本申请还提供一种计算机可读存储介质,所述计算机可读存储介质上存储有基于图像处理的信息比对程序,所述基于图像处理的信息比对程序可被一个或者多个处理器执行,以实现如下所述的基于图像处理的信息比对方法的步骤:In addition, in order to achieve the above-mentioned object, the present application also provides a computer-readable storage medium on which is stored an information comparison program based on image processing, and the information comparison program based on image processing can be One or more processors are executed to implement the steps of the image processing-based information comparison method as described below:
接收待比对图像集,将所述待比对图像集进行倾斜矫正操作及图像方块切割操作,得到图像方块集;Receiving an image set to be compared, and performing a tilt correction operation and an image block cutting operation on the image set to be compared to obtain an image block set;
将所述图像方块集进行图像分类,并根据光学字符识别技术,将所述图像分类后的图像方块集进行类型识别得到多类型图像集;Performing image classification on the image block set, and performing type recognition on the image block set after the image classification according to optical character recognition technology to obtain a multi-type image set;
从所述数据库中提取与所述待比对图像集对应的标准信息图像集,对比所述多类型图像集和所述标准信息图像集,得到所述待比对图像集的信息对比结果。The standard information image set corresponding to the image set to be compared is extracted from the database, and the multi-type image set is compared with the standard information image set to obtain an information comparison result of the image set to be compared.
此外,为实现上述目的,本申请还提供一种基于图像处理的信息比对装置,所述基于图像处理的信息比对装置序包括:In addition, in order to achieve the above object, the present application also provides an information comparison device based on image processing, and the sequence of the information comparison device based on image processing includes:
数据接收及方块切割模块,用于:接收待比对图像集,将所述待比对图像集进行倾斜矫正操作及图像方块切割操作,得到图像方块集;The data receiving and block cutting module is used to: receive the image set to be compared, and perform a tilt correction operation and an image block cutting operation on the image set to be compared to obtain an image block set;
分类模块,用于:将所述图像方块集进行图像分类得到分类后的图像方块集;The classification module is configured to: perform image classification on the image block set to obtain a classified image block set;
类型识别模块,用于:根据光学字符识别技术,将所述图像分类后的图像方块集进行类型识别得到多类型图像集;The type recognition module is configured to: perform type recognition on the image block set after the image classification according to the optical character recognition technology to obtain a multi-type image set;
作业批改结果输出模块,用于:从所述数据库中提取与所述待比对图像集对应的标准信息图像集,对比所述多类型图像集和所述标准信息图像集,得到所述待比对图像集的信息对比结果。The job correction result output module is used to: extract the standard information image set corresponding to the image set to be compared from the database, compare the multi-type image set and the standard information image set to obtain the The result of comparing the information of the image set.
本申请通过倾斜矫正操作及图像方块切割操作,将图像集划分为多个小方块,减轻后续计算压力的前提下,也提高了后续模板匹配及光学字符识别的识别准确率,同时先通过模板匹配及光学字符识别,再通过对比操作进一步地提高图像信息获取的智能化程度。因此本申请提出的基于图像处理的信息比对方法、装置及计算机可读存储介质,可以实现图像信息获取的目的。This application divides the image set into multiple small squares through the tilt correction operation and the image block cutting operation, and under the premise of reducing the pressure of subsequent calculations, it also improves the recognition accuracy of subsequent template matching and optical character recognition. At the same time, it first passes the template matching And optical character recognition, and then through the comparison operation to further improve the degree of intelligence of image information acquisition. Therefore, the image processing-based information comparison method, device, and computer-readable storage medium proposed in this application can achieve the purpose of image information acquisition.
附图说明Description of the drawings
图1为本申请一实施例提供的基于图像处理的信息比对方法的流程示意图;FIG. 1 is a schematic flowchart of an information comparison method based on image processing provided by an embodiment of this application;
图2为本申请一实施例提供的电子设备的内部结构示意图;2 is a schematic diagram of the internal structure of an electronic device provided by an embodiment of the application;
图3为本申请一实施例提供的基于图像处理的信息比对装置中基于图像处理的信息比对程序的模块示意图。FIG. 3 is a schematic diagram of modules of an information comparison program based on image processing in an information comparison device based on image processing provided by an embodiment of the application.
本申请目的的实现、功能特点及优点将结合实施例,参照附图做进一步说明。The realization, functional characteristics, and advantages of the purpose of this application will be further described in conjunction with the embodiments and with reference to the accompanying drawings.
具体实施方式Detailed ways
应当理解,此处所描述的具体实施例仅仅用以解释本申请,并不用于限定本申请。It should be understood that the specific embodiments described here are only used to explain the present application, and are not used to limit the present application.
本申请提供一种基于图像处理的信息比对方法。参照图1所示,为本申请一实施例提供的基于图像处理的信息比对方法的流程示意图。该方法可以由一个装置执行,该装置可以由软件和/或硬件实现。This application provides an information comparison method based on image processing. Referring to FIG. 1, it is a schematic flowchart of an information comparison method based on image processing provided by an embodiment of this application. The method can be executed by a device, and the device can be implemented by software and/or hardware.
在本实施例中,基于图像处理的信息比对方法包括:In this embodiment, the information comparison method based on image processing includes:
S1、接收待比对图像集,将所述待比对图像集进行倾斜矫正操作及图像方块切割操作,得到图像方块集。S1. Receive an image set to be compared, and perform a tilt correction operation and an image block cutting operation on the image set to be compared to obtain an image block set.
当前关于图像信息获取应用比较多的领域是智能作业批改,因此所述批改请求包括作业的编号和日期,如用户输入批改2019年9月20号,第三批作业的批改请求。Currently, there are more applications for image information acquisition in smart job corrections. Therefore, the correction request includes the job number and date. For example, the user enters a correction request for the third batch of jobs on September 20, 2019.
进一步地,所述作业的编号和日期与所述预构建的数据库中所存储的作业是一一对应的关系,如某高中高三模拟考试,将高三(1)班级到高三(9)班级按照班级编号共分为1-9个作业编号,若用户想批改高三(6)班的作业,则只需要输入高三(6)班的作业编号和作业日期即可。Further, the number and date of the homework are in a one-to-one correspondence with the homework stored in the pre-built database. For example, a senior high school simulation test for high school three (1) to high school (9) class according to the class The number is divided into 1-9 job numbers. If the user wants to correct the job of the third grade (6) class, he only needs to enter the job number and the job date of the third grade (6) class.
优选地,所述待比对图像集是图像扫描版的形式,如高三(6)班的全班模拟考试文本试卷,都统一收集并放置在所述预构建的数据库中。Preferably, the image set to be compared is in the form of a scanned image version, such as the text test papers of the whole class mock examination of the third grade (6) class, which are all collected and placed in the pre-built database.
因为所述待比对图像集一般是通过图像扫描得到的,而图像扫描有可能会使得到的图像产生一定的倾斜,故先对所述待比对图像集进行倾斜矫正操作及图像方块切割操作。Because the image set to be compared is generally obtained through image scanning, and image scanning may cause a certain tilt of the obtained image, first perform the tilt correction operation and the image block cutting operation on the image set to be compared .
详细地,所述将所述待比对图像集进行倾斜矫正操作及图像方块切割操作,得到图像方块集,包括:构建平面坐标系,将所述待比对图像集内的图像按照所述平面坐标系进行投影,并根据所述平面坐标系的刻度分割所述待比对图像集内的图像,得到多个矩阵块, 依次计算所述多个矩阵块的重心坐标,根据预先构建的直线方程和所述重心坐标调整所述多个作业矩阵块的倾斜角度,完成所述倾斜矫正操作。In detail, performing the tilt correction operation and the image block cutting operation on the image set to be compared to obtain the image block set includes: constructing a plane coordinate system, and arranging the images in the image set to be compared according to the plane The coordinate system is projected, and the images in the image set to be compared are divided according to the scale of the plane coordinate system to obtain a plurality of matrix blocks, and the barycentric coordinates of the plurality of matrix blocks are calculated in turn, according to the pre-built linear equation Adjust the inclination angles of the multiple work matrix blocks with the coordinates of the center of gravity to complete the inclination correction operation.
进一步地,所述直线方程为:Further, the linear equation is:
Y=a+bXY=a+bX
Y=c+dXY=c+dX
其中,b=tgδ,
Figure PCTCN2020099066-appb-000001
a,c为任意常数,δ为与所述重心坐标相比的倾斜角度,(X,Y)表示在所述平面坐标系的坐标,所述重心坐标为(P i,P i)。将所述重心坐标X=P i,带入至上述直线方程内得到Y值,通过与P i的误差调节所述倾斜角度δ,完成所述倾斜矫正操作。
Among them, b=tgδ,
Figure PCTCN2020099066-appb-000001
a, c is an arbitrary constant, the inclination angle [delta] is compared with the center of gravity coordinates, (X, Y) represents the coordinate of the plane coordinate system, the coordinates of the center of gravity (P i, P i). The center of gravity coordinate X=P i is brought into the above linear equation to obtain the Y value, and the tilt angle δ is adjusted by the error with P i to complete the tilt correction operation.
因为整块的图像由于图像太大,对于后续模型的识别和计算压力都有较大影响,因此需要对所述待比对图像集进行图像方块切割操作。所述图像方块切割操作包括:将完成所述倾斜矫正操作的所述待比对图像集映射在所述平面坐标系内,按照预先设定的方块数,对所述待比对图像集进行水平方向、垂直方向的方块划分得到多个图像方块,根据光学检测技术判断所述图像方块内是否有文字,保留具有文字的图像方块,得到所述图像方块集。Because the entire image is too large, it has a greater impact on the recognition and calculation pressure of the subsequent model. Therefore, it is necessary to perform an image block cutting operation on the image set to be compared. The image block cutting operation includes: mapping the image set to be compared after the tilt correction operation is completed in the plane coordinate system, and leveling the image set to be compared according to a preset number of squares A plurality of image squares are obtained by dividing the squares in the direction and the vertical direction. According to the optical detection technology, it is determined whether there are characters in the image squares, and the image squares with the characters are retained to obtain the image square set.
所述光学检测技术可采用基于光学字符识别原理或折射率改变原理,判断每个方块内是否有文字,若有的方块没有文字,则为空白区域的方块,对于后续智能作业批改作用不大,可直接去除。所述折射率改变原理即在每个图片方块内发射相同入射角的光线,判断折射角的偏差是否大于阈值,从而判断方块内是否有文字。The optical detection technology can use the principle of optical character recognition or the principle of refractive index change to determine whether there is text in each square. If there are no texts in some squares, it is a square with a blank area, which has little effect on subsequent smart work corrections. Can be removed directly. The principle of changing the refractive index is to emit light rays with the same incident angle in each picture block, and determine whether the deviation of the refraction angle is greater than a threshold value, thereby determining whether there is text in the block.
具体地,所述图像方块集的表示方法为:Specifically, the representation method of the image block set is:
DDL(I,n,B 1((X 1,Y 1),(L 1,W 1),Attri),B 2((X 2,Y 2),(L 2,W 2),Attri,…,B n((X n,Y n),(L n,W n),Attri)) DDL(I,n,B 1 ((X 1 ,Y 1 ),(L 1 ,W 1 ),Attri),B 2 ((X 2 ,Y 2 ),(L 2 ,W 2 ),Attri,... ,B n ((X n ,Y n ),(L n ,W n ),Attri))
其中,I为所述图像方块集内每个方块的序号,n为所述图像方块集的总数,X i为所述图像方块集内每个方块在水平方向的位置信息;Y i为所述图像方块集内每个方块在垂直方向的位置信息,L i为方块的长度;W i为方块的宽度;Attri表示方块的属性,当Attri为0时,表示该方块内具有文字,Attri为1时,表示该方块内没有文字。 Wherein, I is the block number of each block within the image set, n is the total number of blocks of the image set, X i for each block within the image block position information set in the horizontal direction; Y i is the each block within the current image block location information in the vertical direction, L i is the length of the block; W i is the width of the block; Attri representation attribute box, when Attri is 0, which indicates the block has a character, Attri 1 , It means that there is no text in the box.
S2、将所述图像方块集进行图像分类,并根据光学字符识别技术,将所述图像分类后的图像方块集进行类型识别得到多类型图像集。S2. Perform image classification on the image block set, and perform type recognition on the image block set after the image classification according to the optical character recognition technology to obtain a multi-type image set.
优选地,可基于光学字符识别技术(OCR技术)的学科识别模型对所述图像方块集进行学科分类。如所述图像方块集中包含了不同的学科(如语文、数学、英语等),利用所述学科识别模型进行学科分类,可将所述图像方块集拆分为到语文学科作业集、数学学科作业集、英语学科作业集等各个学科作业集。Preferably, the subject classification of the image block set can be performed based on a subject recognition model of optical character recognition technology (OCR technology). If the image block set contains different subjects (such as Chinese, mathematics, English, etc.), the subject recognition model is used to classify subjects, and the image block set can be split into a set of Chinese subjects and mathematics subjects. Sets, English subject work sets, and other subject work sets.
详细地,所述S2包括:将所述图像方块集的数据进行字符切割得到多字符图像集,提取所述多字符图像集内的字符特征,将所述字符特征与预构建的特征模板库进行模板匹配得到学科作业集。In detail, the S2 includes: performing character cutting on the data of the image block set to obtain a multi-character image set, extracting character features in the multi-character image set, and comparing the character features with a pre-built feature template library. The template matching obtains the subject assignment set.
进一步地,所述字符切割是将所述图像方块集内的文字图像以行为单位进行分割,得到多组以行为单位的多字符图像集。所述字符特征的提取方法以及所述字符切割都可基于所述光学字符识别技术(OCR技术)完成。所述模板匹配可基于K近邻算法进行匹配。Further, the character cutting is to segment the text images in the image block set in units of lines to obtain multiple sets of multi-character image sets in units of lines. Both the character feature extraction method and the character cutting can be completed based on the optical character recognition technology (OCR technology). The template matching may be based on the K-nearest neighbor algorithm for matching.
比如作业批改中,题型都可能包含选择题、填空题、判断题、计算题等,而不同的题型对应的答案形式不尽相同,如选择题是以A、B、C、D撰写答案,判断题是以对、错形式判断等,因此进行类型识别是极其有必要的。For example, in homework correction, the question types may include multiple-choice questions, fill-in-the-blank questions, true or false questions, calculation questions, etc., and different question types correspond to different answer forms. For example, multiple-choice questions are written in A, B, C, and D. , True or False questions are judged in the form of right and wrong, so it is extremely necessary to carry out type recognition.
较佳地,同样可基于光学字符识别技术(OCR技术)的类型识别模型对所述图像方块集进行类型识别。整个题目类型的识别过程与上述S3相同,最终得到基于不同学科的题型作业图像集,如语文学科作业集下有选择题作业图像集、阅读理解作业图像集等,数学学科作业集下有选择题作业图像集、计算题作业图像集、英语学科作业集下有选择题作业图像集、判断题作业图像集等。Preferably, the type recognition of the image block set can also be performed based on the type recognition model of the optical character recognition technology (OCR technology). The recognition process of the entire question type is the same as the above S3, and finally a collection of image assignments based on different disciplines is obtained. For example, there are multiple-choice homework image collections and reading comprehension homework image collections under the Chinese language homework collection, and choices under the mathematics homework collection There are multiple-choice homework image collections, judgment homework image collections, etc. under the title homework image collection, calculation homework image collection, and English subject homework collection.
S3、从所述数据库中提取与所述待比对图像集对应的标准信息图像集,对比所述多类型图像集和所述标准信息图像集,得到所述待比对图像集的信息对比结果。S3. Extract a standard information image set corresponding to the image set to be compared from the database, compare the multi-type image set and the standard information image set to obtain an information comparison result of the image set to be compared .
详细地,所述从所述数据库的答案存储区域中提取与所述多类型图像集对应的标准信息图像集,包括:根据图像识别技术识别所述多类型图像集的题号得到题号集,根据所述题号集从预构建的数据库中提取与所述题号集相同的标准信息图像集。In detail, the extracting the standard information image set corresponding to the multi-type image set from the answer storage area of the database includes: identifying the question number of the multi-type image set according to the image recognition technology to obtain the question number set, Extracting the same standard information image set as the question number set from a pre-built database according to the question number set.
详细地,所述根据图像识别技术识别所述多类型图像集的题号得到题号集,包括:预构建的投影坐标系,对所述多类型图像集进行按行分割得到多行图像集,采用所述光学字符识别技术识别每个多行图像集内开头的第一个字符,所述第一个字符若为数字字符则保留,所述第一个字符若不是数字字符则剔除,直至识别完成,汇总所有数字字符得到题号集。In detail, the question number set obtained by recognizing the question numbers of the multi-type image set according to the image recognition technology includes: a pre-built projection coordinate system, and performing row-wise segmentation on the multi-type image set to obtain a multi-line image set, The optical character recognition technology is used to recognize the first character at the beginning of each multi-line image set. If the first character is a numeric character, it will be retained, and if the first character is not a numeric character, it will be eliminated until it is recognized. Complete, sum up all the numeric characters to get the question number set.
本申请较佳实施例,如将语文学科作业集下有选择题作业图像集第4题与所述第4题对应题号的标准信息图像集进行字符匹配,若所述选择题作业图像集第4题选择为C,所述标准信息图像集第4题也为C,则匹配成功,若所述选择题作业图像集第4题选择为B,所述标准信息图像集第4题为C,则匹配未成功。In a preferred embodiment of the present application, if the fourth question of the homework image set of Chinese subjects with multiple choice questions is matched with the standard information image set of the corresponding question number of the fourth question, if the homework image set of multiple choice questions is the first If the 4th question is selected as C, and the fourth question of the standard information image set is also C, the matching is successful. If the 4th question of the multiple-choice job image set is selected as B, the fourth question of the standard information image set is C, Then the match was unsuccessful.
以上所述的发明方法,较佳地可基于GPU,所述GPU为图像处理器,是一种专门在个人电脑、工作站和一些移动设备上进行图像运算工作的微处理器。所述GPU的种类有很多,如:泰坦,RTX等。通过GPU可加快所述学生作业批改的速度。The above-mentioned inventive method may preferably be based on a GPU. The GPU is an image processor, which is a microprocessor that specializes in image operations on personal computers, workstations and some mobile devices. There are many types of the GPU, such as Titan, RTX, etc. The GPU can speed up the correction of the student's homework.
本申请还提供一种电子设备。参照图2所示,为本申请一实施例提供的电子设备的内部结构示意图。The application also provides an electronic device. Referring to FIG. 2, it is a schematic diagram of the internal structure of an electronic device provided by an embodiment of this application.
在本实施例中,所述电子设备1可以是PC(Personal Computer,个人电脑),或者是智能手机、平板电脑、便携计算机等终端设备,也可以是一种服务器等。该电子设备1至少包括存储器11、处理器12,通信总线13,以及网络接口14。In this embodiment, the electronic device 1 may be a PC (Personal Computer, personal computer), or a terminal device such as a smart phone, a tablet computer, or a portable computer, or a server. The electronic device 1 at least includes a memory 11, a processor 12, a communication bus 13, and a network interface 14.
其中,存储器11至少包括一种类型的可读存储介质,所述可读存储介质包括闪存、硬盘、多媒体卡、卡型存储器(例如,SD或DX存储器等)、磁性存储器、磁盘、光盘等。存储器11在一些实施例中可以是电子设备1的内部存储单元,例如该电子设备1的硬盘。存储器11在另一些实施例中也可以是电子设备1的外部存储设备,例如电子设备1上配备的插接式硬盘,智能存储卡(Smart Media Card,SMC),安全数字(Secure Digital,SD)卡,闪存卡(Flash Card)等。进一步地,存储器11还可以既包括电子设备1的内部存储单元也包括外部存储设备。存储器11不仅可以用于存储安装于电子设备1的应用软件及各类数据,例如基于图像处理的信息比对程序01的代码等,还可以用于暂时地存储已经输出或者将要输出的数据。The memory 11 includes at least one type of readable storage medium, and the readable storage medium includes flash memory, hard disk, multimedia card, card-type memory (for example, SD or DX memory, etc.), magnetic memory, magnetic disk, optical disk, and the like. The memory 11 may be an internal storage unit of the electronic device 1 in some embodiments, such as a hard disk of the electronic device 1. In other embodiments, the memory 11 may also be an external storage device of the electronic device 1, such as a plug-in hard disk equipped on the electronic device 1, a smart media card (SMC), or a secure digital (SD) Card, Flash Card, etc. Further, the memory 11 may also include both an internal storage unit of the electronic device 1 and an external storage device. The memory 11 can be used not only to store application software and various data installed in the electronic device 1, such as the code of the information comparison program 01 based on image processing, etc., but also to temporarily store data that has been output or will be output.
处理器12在一些实施例中可以是一中央处理器(Central Processing Unit,CPU)、控制器、微控制器、微处理器或其他数据处理芯片,用于运行存储器11中存储的程序代码或处理数据,例如执行基于图像处理的信息比对程序01等。In some embodiments, the processor 12 may be a central processing unit (CPU), controller, microcontroller, microprocessor, or other data processing chip, for running program codes or processing stored in the memory 11 Data, such as information comparison program 01 based on image processing.
通信总线13用于实现这些组件之间的连接通信。The communication bus 13 is used to realize the connection and communication between these components.
网络接口14可选的可以包括标准的有线接口、无线接口(如WI-FI接口),通常用于在该装置1与其他电子设备之间建立通信连接。The network interface 14 may optionally include a standard wired interface and a wireless interface (such as a WI-FI interface), and is usually used to establish a communication connection between the device 1 and other electronic devices.
可选地,该装置1还可以包括用户接口,用户接口可以包括显示器(Display)、输入单元比如键盘(Keyboard),可选的用户接口还可以包括标准的有线接口、无线接口。可选地,在一些实施例中,显示器可以是LED显示器、液晶显示器、触控式液晶显示器以及OLED(Organic Light-Emitting Diode,有机发光二极管)触摸器等。其中,显示器也可以适当的称为显示屏或显示单元,用于显示在电子设备1中处理的信息以及用于显示可视化的用户界面。Optionally, the device 1 may also include a user interface. The user interface may include a display (Display) and an input unit such as a keyboard (Keyboard). The optional user interface may also include a standard wired interface and a wireless interface. Optionally, in some embodiments, the display may be an LED display, a liquid crystal display, a touch-sensitive liquid crystal display, an OLED (Organic Light-Emitting Diode, organic light-emitting diode) touch device, etc. Among them, the display can also be appropriately called a display screen or a display unit, which is used to display the information processed in the electronic device 1 and to display a visualized user interface.
图2仅示出了具有组件11-14以及基于图像处理的信息比对程序01的电子设备1,本领域技术人员可以理解的是,图1示出的结构并不构成对电子设备1的限定,可以包括比 图示更少或者更多的部件,或者组合某些部件,或者不同的部件布置。FIG. 2 only shows the electronic device 1 with components 11-14 and the information comparison program 01 based on image processing. Those skilled in the art will understand that the structure shown in FIG. 1 does not constitute a limitation on the electronic device 1 It may include fewer or more components than shown, or a combination of some components, or a different component arrangement.
在图2所示的装置1实施例中,存储器11中存储有基于图像处理的信息比对程序01;处理器12执行存储器11中存储的基于图像处理的信息比对程序01时实现如下步骤:In the embodiment of the apparatus 1 shown in FIG. 2, the memory 11 stores an information comparison program 01 based on image processing; when the processor 12 executes the information comparison program 01 stored in the memory 11, the following steps are implemented:
步骤一、接收待比对图像集,将所述待比对图像集进行倾斜矫正操作及图像方块切割操作,得到图像方块集。Step 1: Receive an image set to be compared, and perform a tilt correction operation and an image block cutting operation on the image set to be compared to obtain an image block set.
当前关于图像信息获取应用比较多的领域是智能作业批改,因此所述批改请求包括作业的编号和日期,如用户输入批改2019年9月20号,第三批作业的批改请求。Currently, there are more applications for image information acquisition in smart job corrections. Therefore, the correction request includes the job number and date. For example, the user enters a correction request for the third batch of jobs on September 20, 2019.
进一步地,所述作业的编号和日期与所述预构建的数据库中所存储的作业是一一对应的关系,如某高中高三模拟考试,将高三(1)班级到高三(9)班级按照班级编号共分为1-9个作业编号,若用户想批改高三(6)班的作业,则只需要输入高三(6)班的作业编号和作业日期即可。Further, the number and date of the homework are in a one-to-one correspondence with the homework stored in the pre-built database. For example, a senior high school simulation test for high school three (1) to high school (9) class according to the class The number is divided into 1-9 job numbers. If the user wants to correct the job of the third grade (6) class, he only needs to enter the job number and the job date of the third grade (6) class.
优选地,所述待比对图像集是图像扫描版的形式,如高三(6)班的全班模拟考试文本试卷,都统一收集并放置在所述预构建的数据库中。Preferably, the image set to be compared is in the form of a scanned image version, such as the text test papers of the whole class mock examination of the third grade (6) class, which are all collected and placed in the pre-built database.
因为所述待比对图像集一般是通过图像扫描得到的,而图像扫描有可能会使得到的图像产生一定的倾斜,故先对所述待比对图像集进行倾斜矫正操作及图像方块切割操作。Because the image set to be compared is generally obtained through image scanning, and image scanning may cause a certain tilt of the obtained image, first perform the tilt correction operation and the image block cutting operation on the image set to be compared .
详细地,所述将所述待比对图像集进行倾斜矫正操作及图像方块切割操作,得到图像方块集,包括:构建平面坐标系,将所述待比对图像集内的图像按照所述平面坐标系进行投影,并根据所述平面坐标系的刻度分割所述待比对图像集内的图像,得到多个矩阵块,依次计算所述多个矩阵块的重心坐标,根据预先构建的直线方程和所述重心坐标调整所述多个作业矩阵块的倾斜角度,完成所述倾斜矫正操作。In detail, performing the tilt correction operation and the image block cutting operation on the image set to be compared to obtain the image block set includes: constructing a plane coordinate system, and arranging the images in the image set to be compared according to the plane The coordinate system is projected, and the images in the image set to be compared are divided according to the scale of the plane coordinate system to obtain a plurality of matrix blocks, and the barycentric coordinates of the plurality of matrix blocks are calculated in turn, according to the pre-built linear equation Adjust the inclination angles of the multiple work matrix blocks with the coordinates of the center of gravity to complete the inclination correction operation.
进一步地,所述直线方程为:Further, the linear equation is:
Y=a+bXY=a+bX
Y=c+dXY=c+dX
其中,b=tgδ,
Figure PCTCN2020099066-appb-000002
a,c为任意常数,δ为与所述重心坐标相比的倾斜角度,(X,Y)表示在所述平面坐标系的坐标,所述重心坐标为(P i,P i)。将所述重心坐标X=P i,带入至上述直线方程内得到Y值,通过与P i的误差调节所述倾斜角度δ,完成所述倾斜矫正操作。
Among them, b=tgδ,
Figure PCTCN2020099066-appb-000002
a, c is an arbitrary constant, the inclination angle [delta] is compared with the center of gravity coordinates, (X, Y) represents the coordinate of the plane coordinate system, the coordinates of the center of gravity (P i, P i). The center of gravity coordinate X=P i is brought into the above linear equation to obtain the Y value, and the tilt angle δ is adjusted by the error with P i to complete the tilt correction operation.
因为整块的图像由于图像太大,对于后续模型的识别和计算压力都有较大影响,因此需要对所述待比对图像集进行图像方块切割操作。所述图像方块切割操作包括:将完成所述倾斜矫正操作的所述待比对图像集映射在所述平面坐标系内,按照预先设定的方块数,对所述待比对图像集进行水平方向、垂直方向的方块划分得到多个图像方块,根据光学检测技术判断所述图像方块内是否有文字,保留具有文字的图像方块,得到所述图像方块集。Because the entire image is too large, it has a greater impact on the recognition and calculation pressure of the subsequent model. Therefore, it is necessary to perform an image block cutting operation on the image set to be compared. The image block cutting operation includes: mapping the image set to be compared after the tilt correction operation is completed in the plane coordinate system, and leveling the image set to be compared according to a preset number of squares A plurality of image squares are obtained by dividing the squares in the direction and the vertical direction. According to the optical detection technology, it is determined whether there are characters in the image squares, and the image squares with the characters are retained to obtain the image square set.
所述光学检测技术可采用基于光学字符识别原理或折射率改变原理,判断每个方块内是否有文字,若有的方块没有文字,则为空白区域的方块,对于后续智能作业批改作用不大,可直接去除。所述折射率改变原理即在每个图片方块内发射相同入射角的光线,判断折射角的偏差是否大于阈值,从而判断方块内是否有文字。The optical detection technology can use the principle of optical character recognition or the principle of refractive index change to determine whether there is text in each square. If there are no texts in some squares, it is a square with a blank area, which has little effect on subsequent smart work corrections. Can be removed directly. The principle of changing the refractive index is to emit light with the same incident angle in each picture block, and determine whether the deviation of the refraction angle is greater than a threshold value, thereby determining whether there are text in the block.
具体地,所述图像方块集的表示方法为:Specifically, the representation method of the image block set is:
DDL(I,n,B 1((X 1,Y 1),(L 1,W 1),Attri),B 2((X 2,Y 2),(L 2,W 2),Attri,…,B n((X n,Y n),(L n,W n),Attri)) DDL(I,n,B 1 ((X 1 ,Y 1 ),(L 1 ,W 1 ),Attri),B 2 ((X 2 ,Y 2 ),(L 2 ,W 2 ),Attri,... ,B n ((X n ,Y n ),(L n ,W n ),Attri))
其中,I为所述图像方块集内每个方块的序号,n为所述图像方块集的总数,X i为所述图像方块集内每个方块在水平方向的位置信息;Y i为所述图像方块集内每个方块在垂直方向的位置信息,L i为方块的长度;W i为方块的宽度;Attri表示方块的属性,当Attri为0时,表示该方块内具有文字,Attri为1时,表示该方块内没有文字。 Wherein, I is the block number of each block within the image set, n is the total number of blocks of the image set, X i for each block within the image block position information set in the horizontal direction; Y i is the each block within the current image block location information in the vertical direction, L i is the length of the block; W i is the width of the block; Attri representation attribute box, when Attri is 0, which indicates the block has a character, Attri 1 , It means that there is no text in the box.
步骤二、将所述图像方块集进行图像分类,并根据光学字符识别技术,将所述图像分类后的图像方块集进行类型识别得到多类型图像集。Step 2: Perform image classification on the image block set, and perform type recognition on the image block set after the image classification according to the optical character recognition technology to obtain a multi-type image set.
优选地,可基于光学字符识别技术(OCR技术)的学科识别模型对所述图像方块集进行学科分类。如所述图像方块集中包含了不同的学科(如语文、数学、英语等),利用所述学科识别模型进行学科分类,可将所述图像方块集拆分为到语文学科作业集、数学学科作业集、英语学科作业集等各个学科作业集。Preferably, the subject classification of the image block set can be performed based on a subject recognition model of optical character recognition technology (OCR technology). If the image block set contains different subjects (such as Chinese, mathematics, English, etc.), the subject recognition model is used to classify subjects, and the image block set can be split into a set of Chinese subjects and mathematics subjects. Sets, English subject work sets, and other subject work sets.
详细地,所述步骤二包括:将所述图像方块集的数据进行字符切割得到多字符图像集,提取所述多字符图像集内的字符特征,将所述字符特征与预构建的特征模板库进行模板匹配得到学科作业集。In detail, the second step includes: cutting the data of the image block set to obtain a multi-character image set, extracting character features in the multi-character image set, and combining the character features with a pre-built feature template library Perform template matching to get the subject assignment set.
进一步地,所述字符切割是将所述图像方块集内的文字图像以行为单位进行分割,得到多组以行为单位的多字符图像集。所述字符特征的提取方法以及所述字符切割都可基于所述光学字符识别技术(OCR技术)完成。所述模板匹配可基于K近邻算法进行匹配。Further, the character cutting is to segment the text images in the image block set in units of lines to obtain multiple sets of multi-character image sets in units of lines. Both the character feature extraction method and the character cutting can be completed based on the optical character recognition technology (OCR technology). The template matching may be based on the K-nearest neighbor algorithm for matching.
比如作业批改中,题型都可能包含选择题、填空题、判断题、计算题等,而不同的题型对应的答案形式不尽相同,如选择题是以A、B、C、D撰写答案,判断题是以对、错形式判断等,因此进行类型识别是极其有必要的。For example, in homework correction, the question types may include multiple-choice questions, fill-in-the-blank questions, true or false questions, calculation questions, etc., and different question types correspond to different answer forms. For example, multiple-choice questions are written in A, B, C, and D. , True or False questions are judged in the form of right and wrong, so it is extremely necessary to carry out type recognition.
较佳地,同样可基于光学字符识别技术(OCR技术)的类型识别模型对所述图像方块集进行类型识别。整个题目类型的识别过程与上述S3相同,最终得到基于不同学科的题型作业图像集,如语文学科作业集下有选择题作业图像集、阅读理解作业图像集等,数学学科作业集下有选择题作业图像集、计算题作业图像集、英语学科作业集下有选择题作业图像集、判断题作业图像集等。Preferably, the type recognition of the image block set can also be performed based on the type recognition model of the optical character recognition technology (OCR technology). The recognition process of the entire question type is the same as the above S3, and finally a collection of image assignments based on different disciplines is obtained. For example, there are multiple-choice homework image collections and reading comprehension homework image collections under the Chinese language homework collection, and choices under the mathematics homework collection There are multiple-choice homework image collections, judgment homework image collections, etc. under the task homework image collection, calculation homework image collection and English subject homework collection.
步骤三、从所述数据库中提取与所述待比对图像集对应的标准信息图像集,对比所述多类型图像集和所述标准信息图像集,得到所述待比对图像集的信息对比结果。Step 3: Extract the standard information image set corresponding to the image set to be compared from the database, compare the multi-type image set and the standard information image set to obtain the information comparison of the image set to be compared result.
详细地,所述从所述数据库的答案存储区域中提取与所述多类型图像集对应的标准信息图像集,包括:根据图像识别技术识别所述多类型图像集的题号得到题号集,根据所述题号集从预构建的数据库中提取与所述题号集相同的标准信息图像集。In detail, the extracting the standard information image set corresponding to the multi-type image set from the answer storage area of the database includes: identifying the question number of the multi-type image set according to the image recognition technology to obtain the question number set, Extracting the same standard information image set as the question number set from a pre-built database according to the question number set.
详细地,所述根据图像识别技术识别所述多类型图像集的题号得到题号集,包括:预构建的投影坐标系,对所述多类型图像集进行按行分割得到多行图像集,采用所述光学字符识别技术识别每个多行图像集内开头的第一个字符,所述第一个字符若为数字字符则保留,所述第一个字符若不是数字字符则剔除,直至识别完成,汇总所有数字字符得到题号集。In detail, the question number set obtained by recognizing the question numbers of the multi-type image set according to the image recognition technology includes: a pre-built projection coordinate system, and performing row-wise segmentation on the multi-type image set to obtain a multi-line image set, The optical character recognition technology is used to recognize the first character at the beginning of each multi-line image set. If the first character is a numeric character, it will be retained, and if the first character is not a numeric character, it will be eliminated until it is recognized. Complete, sum up all the numeric characters to get the question number set.
本申请较佳实施例,如将语文学科作业集下有选择题作业图像集第4题与所述第4题对应题号的标准信息图像集进行字符匹配,若所述选择题作业图像集第4题选择为C,所述标准信息图像集第4题也为C,则匹配成功,若所述选择题作业图像集第4题选择为B,所述标准信息图像集第4题为C,则匹配未成功。In a preferred embodiment of the present application, if the fourth question of the homework image set of Chinese subjects with multiple choice questions is matched with the standard information image set of the corresponding question number of the fourth question, if the homework image set of multiple choice questions is the first If the 4th question is selected as C, and the fourth question of the standard information image set is also C, the matching is successful. If the 4th question of the multiple-choice job image set is selected as B, the fourth question of the standard information image set is C, Then the match was unsuccessful.
以上所述的发明方法,较佳地可基于GPU,所述GPU为图像处理器,是一种专门在个人电脑、工作站和一些移动设备上进行图像运算工作的微处理器。所述GPU的种类有很多,如:泰坦,RTX等。通过GPU可加快所述学生作业批改的速度。The above-mentioned inventive method may preferably be based on a GPU. The GPU is an image processor, which is a microprocessor that specializes in image operations on personal computers, workstations and some mobile devices. There are many types of the GPU, such as Titan, RTX, etc. The GPU can speed up the correction of the student's homework.
可选地,在其他实施例中,基于图像处理的信息比对程序还可以被分割为一个或者多个模块,一个或者多个模块被存储于存储器11中,并由一个或多个处理器(本实施例为处理器12)所执行以完成本申请,本申请所称的模块是指能够完成特定功能的一系列计算机程序指令段,用于描述基于图像处理的信息比对程序在电子设备中的执行过程。Optionally, in other embodiments, the image processing-based information comparison program can also be divided into one or more modules, and the one or more modules are stored in the memory 11 and run by one or more processors ( This embodiment is executed by the processor 12) to complete this application. The module referred to in this application refers to a series of computer program instruction segments that can complete specific functions, and is used to describe the image processing-based information comparison program in an electronic device. The implementation process.
例如,参照图3所示,为本申请基于图像处理的信息比对装置一实施例中的基于图像处理的信息比对程序的程序模块示意图,该实施例中,所述基于图像处理的信息比对程序可以被分割为数据接收及方块切割模块10、分类模块20、类型识别模块30、信息结果输出模块40示例性地:For example, referring to FIG. 3, a schematic diagram of the program modules of the image processing-based information comparison program in an embodiment of the image processing-based information comparison apparatus of this application. In this embodiment, the image processing-based information comparison program The program can be divided into a data receiving and block cutting module 10, a classification module 20, a type recognition module 30, and an information result output module 40. Illustratively:
所述数据接收及分类模块10用于:接收待比对图像集,将所述待比对图像集进行倾斜矫正操作及图像方块切割操作,得到图像方块集。The data receiving and classification module 10 is configured to receive an image set to be compared, and perform a tilt correction operation and an image block cutting operation on the image set to be compared to obtain an image block set.
所述分类模块20用于:将所述图像方块集进行图像分类得到分类后的图像方块集。The classification module 20 is configured to: perform image classification on the image block set to obtain a classified image block set.
所述类型识别模块30用于:根据光学字符识别技术,将所述图像分类后的图像方块集进行类型识别得到多类型图像集。The type recognition module 30 is configured to perform type recognition on the image block set after the image classification according to the optical character recognition technology to obtain a multi-type image set.
所述作业批改结果输出模块40用于:从所述数据库中提取与所述待比对图像集对应的标准信息图像集,对比所述多类型图像集和所述标准信息图像集,得到所述待比对图像集的信息对比结果。The job correction result output module 40 is configured to: extract a standard information image set corresponding to the image set to be compared from the database, compare the multi-type image set and the standard information image set, to obtain the The information comparison result of the image set to be compared.
上述数据接收及方块切割模块10、分类模块20、类型识别模块30、信息结果输出模块40等程序模块被执行时所实现的功能或操作步骤与上述实施例大体相同,在此不再赘述。The functions or operation steps implemented by the program modules such as the data receiving and square cutting module 10, the classification module 20, the type identification module 30, and the information result output module 40 when executed are substantially the same as those in the foregoing embodiment, and will not be repeated here.
此外,本申请实施例还提出一种计算机可读存储介质,所述计算机可读存储介质可以是非易失性,也可以是易失性,所述计算机可读存储介质上存储有基于图像处理的信息比对程序,所述基于图像处理的信息比对程序可被一个或多个处理器执行,以实现如下操作:In addition, the embodiment of the present application also proposes a computer-readable storage medium. The computer-readable storage medium may be non-volatile or volatile. The computer-readable storage medium stores image processing-based An information comparison program, the image processing-based information comparison program can be executed by one or more processors to implement the following operations:
接收待比对图像集,将所述待比对图像集进行倾斜矫正操作及图像方块切割操作,得到图像方块集。The image set to be compared is received, and the image set to be compared is subjected to a tilt correction operation and an image block cutting operation to obtain an image block set.
将所述图像方块集进行图像分类得到分类后的图像方块集。Image classification is performed on the image block set to obtain a classified image block set.
根据光学字符识别技术,将所述图像分类后的图像方块集进行类型识别得到多类型图像集。According to the optical character recognition technology, the image block set after the classification of the image is type-recognized to obtain a multi-type image set.
从所述数据库中提取与所述待比对图像集对应的标准信息图像集,对比所述多类型图像集和所述标准信息图像集,得到所述待比对图像集的信息对比结果。The standard information image set corresponding to the image set to be compared is extracted from the database, and the multi-type image set is compared with the standard information image set to obtain an information comparison result of the image set to be compared.
需要说明的是,上述本申请实施例序号仅仅为了描述,不代表实施例的优劣。并且本文中的术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、装置、物品或者方法不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、装置、物品或者方法所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括该要素的过程、装置、物品或者方法中还存在另外的相同要素。It should be noted that the serial numbers of the foregoing embodiments of the present application are only for description, and do not represent the advantages and disadvantages of the embodiments. And the terms "include", "include" or any other variants thereof in this article are intended to cover non-exclusive inclusion, so that a process, device, article or method including a series of elements not only includes those elements, but also includes those elements that are not explicitly included. The other elements listed may also include elements inherent to the process, device, article, or method. If there are no more restrictions, the element defined by the sentence "including a..." does not exclude the existence of other identical elements in the process, device, article, or method that includes the element.
通过以上的实施方式的描述,本领域的技术人员可以清楚地了解到上述实施例方法可借助软件加必需的通用硬件平台的方式来实现,当然也可以通过硬件,但很多情况下前者是更佳的实施方式。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品存储在如上所述的一个存储介质(如ROM/RAM、磁碟、光盘)中,包括若干指令用以使得一台终端设备(可以是手机,计算机,服务器,或者网络设备等)执行本申请各个实施例所述的方法。Through the description of the above implementation manners, those skilled in the art can clearly understand that the above-mentioned embodiment method can be implemented by means of software plus the necessary general hardware platform, of course, it can also be implemented by hardware, but in many cases the former is better.的实施方式。 Based on this understanding, the technical solution of this application essentially or the part that contributes to the existing technology can be embodied in the form of a software product, and the computer software product is stored in a storage medium (such as ROM/RAM) as described above. , Magnetic disk, optical disk), including several instructions to make a terminal device (can be a mobile phone, a computer, a server, or a network device, etc.) execute the method described in each embodiment of the present application.
以上仅为本申请的优选实施例,并非因此限制本申请的专利范围,凡是利用本申请说明书及附图内容所作的等效结构或等效流程变换,或直接或间接运用在其他相关的技术领域,均同理包括在本申请的专利保护范围内。The above are only the preferred embodiments of the application, and do not limit the scope of the patent for this application. Any equivalent structure or equivalent process transformation made using the content of the description and drawings of the application, or directly or indirectly applied to other related technical fields , The same reason is included in the scope of patent protection of this application.

Claims (20)

  1. 一种基于图像处理的信息比对方法,其中,所述方法包括:An information comparison method based on image processing, wherein the method includes:
    接收待比对图像集,将所述待比对图像集进行倾斜矫正操作及图像方块切割操作,得到图像方块集;Receiving an image set to be compared, and performing a tilt correction operation and an image block cutting operation on the image set to be compared to obtain an image block set;
    将所述图像方块集进行图像分类,并根据光学字符识别技术,将所述图像分类后的图像方块集进行类型识别得到多类型图像集;Performing image classification on the image block set, and performing type recognition on the image block set after the image classification according to optical character recognition technology to obtain a multi-type image set;
    从所述数据库中提取与所述待比对图像集对应的标准信息图像集,对比所述多类型图像集和所述标准信息图像集,得到所述待比对图像集的信息对比结果。The standard information image set corresponding to the image set to be compared is extracted from the database, and the multi-type image set is compared with the standard information image set to obtain an information comparison result of the image set to be compared.
  2. 如权利要求1所述的基于图像处理的信息比对方法,其中,所述将所述待比对图像集进行倾斜矫正操作及图像方块切割操作,得到图像方块集包括:3. The information comparison method based on image processing according to claim 1, wherein said performing a tilt correction operation and an image block cutting operation on the image set to be compared to obtain an image block set comprises:
    构建平面坐标系,将所述待比对图像集内的图像按照所述平面坐标系进行投影,并根据所述平面坐标系的刻度分割所述待比对图像集内的图像,得到多个矩阵块;Construct a plane coordinate system, project the images in the image set to be compared according to the plane coordinate system, and divide the images in the image set to be compared according to the scale of the plane coordinate system to obtain multiple matrices piece;
    依次计算所述多个矩阵块的重心坐标;Sequentially calculating the coordinates of the center of gravity of the plurality of matrix blocks;
    根据预先构建的直线方程和所述重心坐标调整所述多个作业矩阵块的倾斜角度,完成所述倾斜矫正操作;Adjusting the inclination angles of the multiple operation matrix blocks according to the pre-built linear equation and the center of gravity coordinates to complete the inclination correction operation;
    将完成所述倾斜矫正操作的所述待比对图像集映射在所述平面坐标系内;Mapping the image set to be compared on which the tilt correction operation is completed in the plane coordinate system;
    按照预先设定的方块数,对所述待比对图像集进行水平方向、垂直方向的方块划分得到多个图像方块;Dividing the image set to be compared in a horizontal direction and a vertical direction to obtain a plurality of image squares according to the preset number of squares;
    根据光学检测技术判断所述图像方块内是否有文字,保留具有文字的图像方块,得到所述图像方块集。According to the optical detection technology, it is determined whether there are texts in the image blocks, and the image blocks with texts are retained to obtain the image block set.
  3. 如权利要求2所述的基于图像处理的信息比对方法,其中,所述直线方程为:The method for comparing information based on image processing according to claim 2, wherein the linear equation is:
    Y=a+bXY=a+bX
    Y=c+dXY=c+dX
    其中,b=tgδ,
    Figure PCTCN2020099066-appb-100001
    a,c为任意常数,δ是与所述重心坐标对应的倾斜角度,(X,Y)表示在所述平面坐标系的坐标。
    Among them, b=tgδ,
    Figure PCTCN2020099066-appb-100001
    a and c are arbitrary constants, δ is the inclination angle corresponding to the center of gravity coordinate, and (X, Y) represents the coordinates in the plane coordinate system.
  4. 如权利要求3所述的基于图像处理的信息比对方法,其中,所述重心坐标为(P i,P i)。将所述重心坐标X=P i,带入至上述直线方程内得到Y值,通过与P i的误差调节所述倾斜角度δ,完成所述倾斜矫正操作。 The information comparison method based on image processing according to claim 3, wherein the coordinates of the center of gravity are (P i , P i ). The center of gravity coordinate X=P i is brought into the above linear equation to obtain the Y value, and the tilt angle δ is adjusted by the error with P i to complete the tilt correction operation.
  5. 如权利要求1所述的基于图像处理的信息比对方法,其中,所述将所述图像方块集进行图像分类,包括:8. The information comparison method based on image processing according to claim 1, wherein said classifying said image block set comprises:
    将所述图像方块集的数据进行字符切割得到多字符图像集;Performing character cutting on the data of the image block set to obtain a multi-character image set;
    提取所述多字符图像集内的字符特征;Extracting character features in the multi-character image set;
    将所述字符特征与预构建的特征模板库进行模板匹配完成所述图像分类。Perform template matching of the character features with a pre-built feature template library to complete the image classification.
  6. 如权利要求1至5中任意一项所述的基于图像处理的信息比对方法,其中,所述从所述数据库中提取与所述待比对图像集对应的标准信息图像集,对比所述多类型图像集和所述标准信息图像集,包括:The information comparison method based on image processing according to any one of claims 1 to 5, wherein said extracting a standard information image set corresponding to said image set to be compared from said database, and comparing said The multi-type image set and the standard information image set include:
    预构建的投影坐标系,对所述多类型图像集进行按行分割得到多行图像集;A pre-built projection coordinate system, segmenting the multi-type image set by rows to obtain a multi-row image set;
    采用所述光学字符识别技术识别每个多行图像集内开头的第一个字符;Using the optical character recognition technology to recognize the first character at the beginning of each multi-line image set;
    所述第一个字符若为数字字符则保留,所述第一个字符若不是数字字符则剔除,直至识别完成,汇总所有数字字符得到题号集;If the first character is a numeric character, it will be reserved, and if the first character is not a numeric character, it will be removed until the recognition is completed, and all the numeric characters will be collected to obtain a question number set;
    根据所述题号集从所述数据库的答案存储区域中提取与所述题号集相同的标准信息图像集;Extracting from the answer storage area of the database according to the question number set the same standard information image set as the question number set;
    根据所述光学字符识别技术比对所述标准信息图像集和所述多类型图像。The standard information image set and the multi-type image are compared according to the optical character recognition technology.
  7. 一种电子设备,其中,所述电子设备包括存储器和处理器,所述存储器上存储有 可在所述处理器上运行的基于图像处理的信息比对程序,所述基于图像处理的信息比对程序被所述处理器执行时实现如下步骤:An electronic device, wherein the electronic device includes a memory and a processor, the memory stores an information comparison program based on image processing that can be run on the processor, and the information comparison program based on image processing The following steps are implemented when the program is executed by the processor:
    接收待比对图像集,将所述待比对图像集进行倾斜矫正操作及图像方块切割操作,得到图像方块集;Receiving an image set to be compared, and performing a tilt correction operation and an image block cutting operation on the image set to be compared to obtain an image block set;
    将所述图像方块集进行图像分类,并根据光学字符识别技术,将所述图像分类后的图像方块集进行类型识别得到多类型图像集;Performing image classification on the image block set, and performing type recognition on the image block set after the image classification according to optical character recognition technology to obtain a multi-type image set;
    从所述数据库中提取与所述待比对图像集对应的标准信息图像集,对比所述多类型图像集和所述标准信息图像集,得到所述待比对图像集的信息对比结果。The standard information image set corresponding to the image set to be compared is extracted from the database, and the multi-type image set is compared with the standard information image set to obtain an information comparison result of the image set to be compared.
  8. 如权利要求7所述的电子设备,其中,所述将所述待比对图像集进行倾斜矫正操作及图像方块切割操作,得到图像方块集包括:8. The electronic device according to claim 7, wherein said performing a tilt correction operation and an image block cutting operation on the image set to be compared to obtain an image block set comprises:
    构建平面坐标系,将所述待比对图像集内的图像按照所述平面坐标系进行投影,并根据所述平面坐标系的刻度分割所述待比对图像集内的图像,得到多个矩阵块;Construct a plane coordinate system, project the images in the image set to be compared according to the plane coordinate system, and divide the images in the image set to be compared according to the scale of the plane coordinate system to obtain multiple matrices piece;
    依次计算所述多个矩阵块的重心坐标;Sequentially calculating the coordinates of the center of gravity of the plurality of matrix blocks;
    根据预先构建的直线方程和所述重心坐标调整所述多个作业矩阵块的倾斜角度,完成所述倾斜矫正操作;Adjusting the inclination angles of the multiple operation matrix blocks according to the pre-built linear equation and the center of gravity coordinates to complete the inclination correction operation;
    将完成所述倾斜矫正操作的所述待比对图像集映射在所述平面坐标系内;Mapping the image set to be compared on which the tilt correction operation is completed in the plane coordinate system;
    按照预先设定的方块数,对所述待比对图像集进行水平方向、垂直方向的方块划分得到多个图像方块;Dividing the image set to be compared in a horizontal direction and a vertical direction to obtain a plurality of image squares according to the preset number of squares;
    根据光学检测技术判断所述图像方块内是否有文字,保留具有文字的图像方块,得到所述图像方块集。According to the optical detection technology, it is determined whether there are texts in the image blocks, and the image blocks with texts are retained to obtain the image block set.
  9. 如权利要求8所述的电子设备,其中,所述直线方程为:The electronic device according to claim 8, wherein the linear equation is:
    Y=a+bXY=a+bX
    Y=c+dXY=c+dX
    其中,b=tgδ,
    Figure PCTCN2020099066-appb-100002
    a,c为任意常数,δ是与所述重心坐标对应的倾斜角度,(X,Y)表示在所述平面坐标系的坐标。
    Among them, b=tgδ,
    Figure PCTCN2020099066-appb-100002
    a and c are arbitrary constants, δ is the inclination angle corresponding to the center of gravity coordinate, and (X, Y) represents the coordinates in the plane coordinate system.
  10. 如权利要求9所述的电子设备,其中,所述重心坐标为(P i,P i)。将所述重心坐标X=P i,带入至上述直线方程内得到Y值,通过与P i的误差调节所述倾斜角度δ,完成所述倾斜矫正操作。 The electronic device according to claim 9, wherein the coordinates of the center of gravity are (P i , P i ). The center of gravity coordinate X=P i is brought into the above linear equation to obtain the Y value, and the tilt angle δ is adjusted by the error with P i to complete the tilt correction operation.
  11. 如权利要求7中所述的电子设备,其中,所述将所述图像方块集进行图像分类,包括:8. The electronic device according to claim 7, wherein said classifying said image block set into images comprises:
    将所述图像方块集的数据进行字符切割得到多字符图像集;Performing character cutting on the data of the image block set to obtain a multi-character image set;
    提取所述多字符图像集内的字符特征;Extracting character features in the multi-character image set;
    将所述字符特征与预构建的特征模板库进行模板匹配完成所述图像分类。Perform template matching of the character features with a pre-built feature template library to complete the image classification.
  12. 如权利要求7至11中任意一项所述的电子设备,其中,所述从所述数据库中提取与所述待比对图像集对应的标准信息图像集,对比所述多类型图像集和所述标准信息图像集,包括:The electronic device according to any one of claims 7 to 11, wherein said extracting a standard information image set corresponding to said image set to be compared from said database, and comparing said multi-type image set with all The set of standard information images, including:
    预构建的投影坐标系,对所述多类型图像集进行按行分割得到多行图像集;A pre-built projection coordinate system, segmenting the multi-type image set by rows to obtain a multi-row image set;
    采用所述光学字符识别技术识别每个多行图像集内开头的第一个字符;Using the optical character recognition technology to recognize the first character at the beginning of each multi-line image set;
    所述第一个字符若为数字字符则保留,所述第一个字符若不是数字字符则剔除,直至识别完成,汇总所有数字字符得到题号集;If the first character is a numeric character, it will be reserved, and if the first character is not a numeric character, it will be removed until the recognition is completed, and all the numeric characters will be collected to obtain a question number set;
    根据所述题号集从所述数据库的答案存储区域中提取与所述题号集相同的标准信息图像集;Extracting from the answer storage area of the database according to the question number set the same standard information image set as the question number set;
    根据所述光学字符识别技术比对所述标准信息图像集和所述多类型图像。The standard information image set and the multi-type image are compared according to the optical character recognition technology.
  13. 一种计算机可读存储介质,其中,所述计算机可读存储介质上存储有基于图像处理的信息比对程序,所述基于图像处理的信息比对程序可被一个或者多个处理器执行,以 实现如下所述的基于图像处理的信息比对方法的步骤:A computer-readable storage medium, wherein an information comparison program based on image processing is stored on the computer-readable storage medium, and the information comparison program based on image processing can be executed by one or more processors to The steps to implement the information comparison method based on image processing as described below:
    接收待比对图像集,将所述待比对图像集进行倾斜矫正操作及图像方块切割操作,得到图像方块集;Receiving an image set to be compared, and performing a tilt correction operation and an image block cutting operation on the image set to be compared to obtain an image block set;
    将所述图像方块集进行图像分类,并根据光学字符识别技术,将所述图像分类后的图像方块集进行类型识别得到多类型图像集;Performing image classification on the image block set, and performing type recognition on the image block set after the image classification according to optical character recognition technology to obtain a multi-type image set;
    从所述数据库中提取与所述待比对图像集对应的标准信息图像集,对比所述多类型图像集和所述标准信息图像集,得到所述待比对图像集的信息对比结果。The standard information image set corresponding to the image set to be compared is extracted from the database, and the multi-type image set is compared with the standard information image set to obtain an information comparison result of the image set to be compared.
  14. 如权利要求13所述的计算机可读存储介质,其中,所述将所述待比对图像集进行倾斜矫正操作及图像方块切割操作,得到图像方块集包括:15. The computer-readable storage medium according to claim 13, wherein said performing a tilt correction operation and an image block cutting operation on the image set to be compared to obtain an image block set comprises:
    构建平面坐标系,将所述待比对图像集内的图像按照所述平面坐标系进行投影,并根据所述平面坐标系的刻度分割所述待比对图像集内的图像,得到多个矩阵块;Construct a plane coordinate system, project the images in the image set to be compared according to the plane coordinate system, and divide the images in the image set to be compared according to the scale of the plane coordinate system to obtain multiple matrices piece;
    依次计算所述多个矩阵块的重心坐标;Sequentially calculating the coordinates of the center of gravity of the plurality of matrix blocks;
    根据预先构建的直线方程和所述重心坐标调整所述多个作业矩阵块的倾斜角度,完成所述倾斜矫正操作;Adjusting the inclination angles of the multiple operation matrix blocks according to the pre-built linear equation and the center of gravity coordinates to complete the inclination correction operation;
    将完成所述倾斜矫正操作的所述待比对图像集映射在所述平面坐标系内;Mapping the image set to be compared on which the tilt correction operation is completed in the plane coordinate system;
    按照预先设定的方块数,对所述待比对图像集进行水平方向、垂直方向的方块划分得到多个图像方块;Dividing the image set to be compared in a horizontal direction and a vertical direction to obtain a plurality of image squares according to the preset number of squares;
    根据光学检测技术判断所述图像方块内是否有文字,保留具有文字的图像方块,得到所述图像方块集。According to the optical detection technology, it is determined whether there are texts in the image blocks, and the image blocks with texts are retained to obtain the image block set.
  15. 如权利要求14所述的计算机可读存储介质,其中,所述直线方程为:The computer-readable storage medium of claim 14, wherein the linear equation is:
    Y=a+bXY=a+bX
    Y=c+dXY=c+dX
    其中,b=tgδ,
    Figure PCTCN2020099066-appb-100003
    a,c为任意常数,δ是与所述重心坐标对应的倾斜角度,(X,Y)表示在所述平面坐标系的坐标。
    Among them, b=tgδ,
    Figure PCTCN2020099066-appb-100003
    a and c are arbitrary constants, δ is the inclination angle corresponding to the center of gravity coordinate, and (X, Y) represents the coordinates in the plane coordinate system.
  16. 如权利要求15所述的计算机可读存储介质,其中,所述重心坐标为(P i,P i)。将所述重心坐标X=P i,带入至上述直线方程内得到Y值,通过与P i的误差调节所述倾斜角度δ,完成所述倾斜矫正操作。 The computer-readable storage medium of claim 15, wherein the center of gravity coordinates are (P i , P i ). The center of gravity coordinate X=P i is brought into the above linear equation to obtain the Y value, and the tilt angle δ is adjusted by the error with P i to complete the tilt correction operation.
  17. 如权利要求13所述的计算机可读存储介质,其中,所述将所述图像方块集进行图像分类,包括:15. The computer-readable storage medium of claim 13, wherein said classifying said image block set into images comprises:
    将所述图像方块集的数据进行字符切割得到多字符图像集;Performing character cutting on the data of the image block set to obtain a multi-character image set;
    提取所述多字符图像集内的字符特征;Extracting character features in the multi-character image set;
    将所述字符特征与预构建的特征模板库进行模板匹配完成所述图像分类。Perform template matching of the character features with a pre-built feature template library to complete the image classification.
  18. 如权利要求13至17中任意一项所述的计算机可读存储介质,其中,所述从所述数据库中提取与所述待比对图像集对应的标准信息图像集,对比所述多类型图像集和所述标准信息图像集,包括:The computer-readable storage medium according to any one of claims 13 to 17, wherein said extracting a standard information image set corresponding to said image set to be compared from said database, and comparing said multiple types of images The set and the standard information image set include:
    预构建的投影坐标系,对所述多类型图像集进行按行分割得到多行图像集;A pre-built projection coordinate system, segmenting the multi-type image set by rows to obtain a multi-row image set;
    采用所述光学字符识别技术识别每个多行图像集内开头的第一个字符;Using the optical character recognition technology to recognize the first character at the beginning of each multi-line image set;
    所述第一个字符若为数字字符则保留,所述第一个字符若不是数字字符则剔除,直至识别完成,汇总所有数字字符得到题号集;If the first character is a numeric character, it will be reserved, and if the first character is not a numeric character, it will be removed until the recognition is completed, and all the numeric characters will be collected to obtain a question number set;
    根据所述题号集从所述数据库的答案存储区域中提取与所述题号集相同的标准信息图像集;Extracting from the answer storage area of the database according to the question number set the same standard information image set as the question number set;
    根据所述光学字符识别技术比对所述标准信息图像集和所述多类型图像。The standard information image set and the multi-type image are compared according to the optical character recognition technology.
  19. 一种基于图像处理的信息比对装置,其中,所述基于图像处理的信息比对装置包括:An information comparison device based on image processing, wherein the information comparison device based on image processing includes:
    数据接收及方块切割模块,用于:接收待比对图像集,将所述待比对图像集进行倾斜 矫正操作及图像方块切割操作,得到图像方块集;The data receiving and block cutting module is configured to: receive the image set to be compared, and perform a tilt correction operation and an image block cutting operation on the image set to be compared to obtain an image block set;
    分类模块,用于:将所述图像方块集进行图像分类得到分类后的图像方块集;The classification module is configured to: perform image classification on the image block set to obtain a classified image block set;
    类型识别模块,用于:根据光学字符识别技术,将所述图像分类后的图像方块集进行类型识别得到多类型图像集;The type recognition module is configured to: perform type recognition on the image block set after the image classification according to the optical character recognition technology to obtain a multi-type image set;
    作业批改结果输出模块,用于:从所述数据库中提取与所述待比对图像集对应的标准信息图像集,对比所述多类型图像集和所述标准信息图像集,得到所述待比对图像集的信息对比结果。The job correction result output module is used to: extract the standard information image set corresponding to the image set to be compared from the database, compare the multi-type image set and the standard information image set to obtain the The result of comparing the information of the image set.
  20. 如权利要求19所述的基于图像处理的信息比对装置,其中,所述将所述待比对图像集进行倾斜矫正操作及图像方块切割操作,得到图像方块集包括:21. The information comparison device based on image processing according to claim 19, wherein said performing a tilt correction operation and an image block cutting operation on the image set to be compared to obtain an image block set comprises:
    构建平面坐标系,将所述待比对图像集内的图像按照所述平面坐标系进行投影,并根据所述平面坐标系的刻度分割所述待比对图像集内的图像,得到多个矩阵块;Construct a plane coordinate system, project the images in the image set to be compared according to the plane coordinate system, and divide the images in the image set to be compared according to the scale of the plane coordinate system to obtain multiple matrices piece;
    依次计算所述多个矩阵块的重心坐标;Sequentially calculating the coordinates of the center of gravity of the plurality of matrix blocks;
    根据预先构建的直线方程和所述重心坐标调整所述多个作业矩阵块的倾斜角度,完成所述倾斜矫正操作;Adjusting the inclination angles of the multiple operation matrix blocks according to the pre-built linear equation and the center of gravity coordinates to complete the inclination correction operation;
    将完成所述倾斜矫正操作的所述待比对图像集映射在所述平面坐标系内;Mapping the image set to be compared on which the tilt correction operation is completed in the plane coordinate system;
    按照预先设定的方块数,对所述待比对图像集进行水平方向、垂直方向的方块划分得到多个图像方块;Dividing the image set to be compared in a horizontal direction and a vertical direction to obtain a plurality of image squares according to the preset number of squares;
    根据光学检测技术判断所述图像方块内是否有文字,保留具有文字的图像方块,得到所述图像方块集。According to the optical detection technology, it is determined whether there are texts in the image blocks, and the image blocks with texts are retained to obtain the image block set.
PCT/CN2020/099066 2020-01-15 2020-06-30 Image-based information comparison method, apparatus, electronic device, and computer-readable storage medium WO2021143058A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202010041070.7A CN111259888B (en) 2020-01-15 2020-01-15 Image-based information comparison method, device and computer-readable storage medium
CN202010041070.7 2020-01-15

Publications (1)

Publication Number Publication Date
WO2021143058A1 true WO2021143058A1 (en) 2021-07-22

Family

ID=70954047

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/099066 WO2021143058A1 (en) 2020-01-15 2020-06-30 Image-based information comparison method, apparatus, electronic device, and computer-readable storage medium

Country Status (2)

Country Link
CN (1) CN111259888B (en)
WO (1) WO2021143058A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113702398A (en) * 2021-08-25 2021-11-26 北京美立刻医疗器械有限公司 Automatic quality inspection method and device for bracket-free appliance based on visual identification technology
CN118260846A (en) * 2024-05-29 2024-06-28 江西方堂设计工程有限公司 Digital decoration design method and system based on artificial intelligence

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111259888B (en) * 2020-01-15 2024-07-05 平安国际智慧城市科技股份有限公司 Image-based information comparison method, device and computer-readable storage medium
CN113920526A (en) * 2021-09-30 2022-01-11 广东新裕信息科技有限公司 Test paper image processing method, electronic device and storage medium

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106372613A (en) * 2016-09-13 2017-02-01 广州视睿电子科技有限公司 Statistical method and device for paper test paper
CN107798321A (en) * 2017-12-04 2018-03-13 海南云江科技有限公司 A kind of examination paper analysis method and computing device
CN108171297A (en) * 2018-01-24 2018-06-15 谢德刚 A kind of answer card identification method and device
US20180211127A1 (en) * 2015-07-17 2018-07-26 Rakuten, Inc. Character recognition device, character recognition method, and character recognition program
CN110348400A (en) * 2019-07-15 2019-10-18 京东方科技集团股份有限公司 A kind of scoring acquisition methods, device and electronic equipment
CN110647885A (en) * 2019-09-17 2020-01-03 广州光大教育软件科技股份有限公司 Test paper splitting method, device, equipment and medium based on picture identification
CN111259888A (en) * 2020-01-15 2020-06-09 平安国际智慧城市科技股份有限公司 Image-based information comparison method and device and computer-readable storage medium

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP5076744B2 (en) * 2007-08-30 2012-11-21 セイコーエプソン株式会社 Image processing device
CN103177235A (en) * 2013-04-18 2013-06-26 河海大学常州校区 Chinese-sensible code recognition device and Chinese-sensible code recognition method under complicated background
CN105761219B (en) * 2016-02-03 2019-03-15 北京云江科技有限公司 Text image Slant Rectify method and system
CN109685059B (en) * 2018-11-06 2024-06-28 平安科技(深圳)有限公司 Text image labeling method, text image labeling device and computer readable storage medium
CN110110581B (en) * 2019-03-14 2021-06-04 杭州笔声智能科技有限公司 Test paper correcting method and system based on artificial intelligence
CN110288755B (en) * 2019-05-21 2023-05-23 平安银行股份有限公司 Invoice checking method based on text recognition, server and storage medium
CN110443269A (en) * 2019-06-17 2019-11-12 平安信托有限责任公司 A kind of document comparison method and device

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20180211127A1 (en) * 2015-07-17 2018-07-26 Rakuten, Inc. Character recognition device, character recognition method, and character recognition program
CN106372613A (en) * 2016-09-13 2017-02-01 广州视睿电子科技有限公司 Statistical method and device for paper test paper
CN107798321A (en) * 2017-12-04 2018-03-13 海南云江科技有限公司 A kind of examination paper analysis method and computing device
CN108171297A (en) * 2018-01-24 2018-06-15 谢德刚 A kind of answer card identification method and device
CN110348400A (en) * 2019-07-15 2019-10-18 京东方科技集团股份有限公司 A kind of scoring acquisition methods, device and electronic equipment
CN110647885A (en) * 2019-09-17 2020-01-03 广州光大教育软件科技股份有限公司 Test paper splitting method, device, equipment and medium based on picture identification
CN111259888A (en) * 2020-01-15 2020-06-09 平安国际智慧城市科技股份有限公司 Image-based information comparison method and device and computer-readable storage medium

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113702398A (en) * 2021-08-25 2021-11-26 北京美立刻医疗器械有限公司 Automatic quality inspection method and device for bracket-free appliance based on visual identification technology
CN113702398B (en) * 2021-08-25 2024-02-20 北京美立刻医疗器械有限公司 Automatic quality inspection method and device for bracket-free appliance based on visual identification technology
CN118260846A (en) * 2024-05-29 2024-06-28 江西方堂设计工程有限公司 Digital decoration design method and system based on artificial intelligence

Also Published As

Publication number Publication date
CN111259888B (en) 2024-07-05
CN111259888A (en) 2020-06-09

Similar Documents

Publication Publication Date Title
WO2021143058A1 (en) Image-based information comparison method, apparatus, electronic device, and computer-readable storage medium
CN111325110B (en) OCR-based table format recovery method, device and storage medium
WO2021017260A1 (en) Multi-language text recognition method and apparatus, computer device, and storage medium
WO2019104879A1 (en) Information recognition method for form-type image, electronic device and readable storage medium
CN111695439A (en) Image structured data extraction method, electronic device and storage medium
CN112528863A (en) Identification method and device of table structure, electronic equipment and storage medium
US7437001B2 (en) Method and device for recognition of a handwritten pattern
WO2019071662A1 (en) Electronic device, bill information identification method, and computer readable storage medium
CN109685870B (en) Information labeling method and device, labeling equipment and storage medium
CN112699775A (en) Certificate identification method, device and equipment based on deep learning and storage medium
CN103544475A (en) Method and system for recognizing layout types
CN112115111A (en) OCR-based document version management method and system
CN101964048A (en) Character recognition method and system
CN111553334A (en) Questionnaire image recognition method, electronic device, and storage medium
CN113033543A (en) Curved text recognition method, device, equipment and medium
CN112580503A (en) Operation correction method, device, equipment and storage medium
JP2008234291A (en) Character recognition device and character recognition method
CN113806472A (en) Method and equipment for realizing full-text retrieval of character, picture and image type scanning piece
CN114049646B (en) Bank card identification method and device, computer equipment and storage medium
CN103257810A (en) Identification method and identification device of handwritten mathematical formula
US20120281919A1 (en) Method and system for text segmentation
CN113420684A (en) Report recognition method and device based on feature extraction, electronic equipment and medium
CN112396057A (en) Character recognition method and device and electronic equipment
CN115909364A (en) Real-time copybook scoring method based on deep learning technology
CN115147846A (en) Multi-language bill identification method, device, equipment and storage medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20914099

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20914099

Country of ref document: EP

Kind code of ref document: A1

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205 DATED 15/03/2023)

122 Ep: pct application non-entry in european phase

Ref document number: 20914099

Country of ref document: EP

Kind code of ref document: A1