WO2020259060A1 - Test paper information extraction method and system, and computer-readable storage medium - Google Patents

Test paper information extraction method and system, and computer-readable storage medium Download PDF

Info

Publication number
WO2020259060A1
WO2020259060A1 PCT/CN2020/087211 CN2020087211W WO2020259060A1 WO 2020259060 A1 WO2020259060 A1 WO 2020259060A1 CN 2020087211 W CN2020087211 W CN 2020087211W WO 2020259060 A1 WO2020259060 A1 WO 2020259060A1
Authority
WO
WIPO (PCT)
Prior art keywords
text
test paper
image
line
area
Prior art date
Application number
PCT/CN2020/087211
Other languages
French (fr)
Chinese (zh)
Inventor
曾志辉
欧阳一村
许文龙
贺涛
邢军华
Original Assignee
深圳中兴网信科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳中兴网信科技有限公司 filed Critical 深圳中兴网信科技有限公司
Publication of WO2020259060A1 publication Critical patent/WO2020259060A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/25Determination of region of interest [ROI] or a volume of interest [VOI]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/28Quantising the image, e.g. histogram thresholding for discrimination between background and foreground patterns
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • G06V10/46Descriptors for shape, contour or point-related descriptors, e.g. scale invariant feature transform [SIFT] or bags of words [BoW]; Salient regional features
    • G06V10/462Salient features, e.g. scale invariant feature transforms [SIFT]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/14Image acquisition
    • G06V30/148Segmentation of character regions
    • G06V30/153Segmentation of character regions using recognition of characters or words
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/14Image acquisition
    • G06V30/148Segmentation of character regions
    • G06V30/158Segmentation of character regions using character size, text spacings or pitch estimation

Definitions

  • This application relates to the field of electronic teaching technology, for example, to a method, system and computer-readable storage medium for extracting test paper information.
  • the automatic scoring method usually can only analyze test papers with a fixed template, that is, it can only match the test paper with a variety of templates stored in the system, and use the matched template for analysis.
  • the layout and type of many real test papers may not match the fixed template, so it is necessary to provide a solution that can accurately identify and automatically analyze any test paper (regular test paper, general answer sheet, special answer sheet, etc.) , To meet people's increasing electronic marking requirements.
  • This application proposes a method for extracting test paper information, including: preprocessing test paper images to obtain binary images; determining the layout area of the binary image; obtaining text lines of the test paper image according to the layout area; extracting text images from the text lines; The text image is input into the text recognition model to obtain the text information of the test paper image; the corresponding text information and the text line are merged to obtain the target test paper image; the test paper information of the target test paper image is extracted according to the classification label.
  • This application proposes a test paper information extraction system, which includes a memory, a processor, and a computer program stored in the memory and capable of running on the processor.
  • the processor implements the test paper information extraction method of any of the above technical solutions when the processor executes the computer program.
  • This application proposes a computer-readable storage medium that stores a computer program, and when the computer program is executed by a processor, the method for extracting test paper information as any of the above technical solutions is realized.
  • Figure 1 shows a schematic flow chart of a method for extracting test paper information according to an embodiment of the present application
  • Fig. 2 shows a schematic flow chart of a method for extracting test paper information according to another embodiment of the present application
  • Fig. 3 shows a schematic flow chart of a method for extracting test paper information according to another embodiment of the present application
  • FIG. 4 shows a schematic flow chart of a method for extracting test paper information according to another embodiment of the present application
  • Figure 5 shows a test paper image of an embodiment of the present application
  • FIG. 6 shows an analysis result image of the test paper layout area of an embodiment of the present application
  • Figure 7 shows a layout area of Figure 5
  • Figure 8 shows another layout area of Figure 5
  • FIG. 9 shows the text line detection result image of FIG. 7
  • FIG. 10 shows a text information extraction result image of an embodiment of the present application
  • FIG. 11 shows a schematic diagram of constructing a text recognition model according to an embodiment of the present application.
  • Fig. 12 shows a schematic block diagram of a test paper information extraction system according to an embodiment of the present application.
  • FIG. 1 shows a schematic flowchart of the method for extracting test paper information according to an embodiment of the present application.
  • the method includes:
  • S110 Input the text image into the text recognition model to obtain the text information of the test paper image
  • S114 Extract test paper information of the target test paper image according to the classification label.
  • the test paper information extraction method combines image processing algorithms, natural language processing algorithms, and deep learning neural network model technology.
  • the binary image is obtained by preprocessing the test paper image, and the binary image is analyzed to determine the layout area of the binary image, namely Obtain the typesetting information of the test paper, detect the text line of each layout area, traverse the text line of each layout, take the largest circumscribed rectangular area of the text line to cut out the corresponding text image, and input the text image into the text recognition (Optical Character Recognition) , OCR) model is matched, the text information of the test paper image is recognized, the text information and the text line are merged correspondingly, and the target test paper image of the recognized text information is obtained, and the test paper information in the target test paper image is extracted according to different classification tags, for example, Candidate information, test question information, etc., output all test paper information.
  • Optical Character Recognition Optical Character Recognition
  • the typesetting information of the test paper can be automatically identified. Even if the layout and type of the test paper are different, the test paper image can be accurately identified and automatically analyzed to obtain test paper information, which not only realizes efficient and accurate automatic scoring, but also It can also improve the scope of application of the system, and upload the identified test paper information and typesetting information to the database to build a knowledge system, which is conducive to the automatic composition of educators, thereby effectively reducing the workload of educators and satisfying users Various needs.
  • the preprocessing is binarization processing.
  • the binary image can also be smoothed and image tilted according to actual needs.
  • the image tilt processing includes: projecting the binary image so that the edge of the binary image The position generates a corresponding mark on the projected image, determines the position of the oblique image according to the mark, and rotates the position of the oblique image to achieve image correction according to the angle between the edge of the oblique image and the standard horizontal direction or the standard vertical direction.
  • the text behavior uses the image processing function (findcontours function) of the computer vision library (opencv) to identify the rectangular frame with text information in the binary image.
  • Fig. 2 shows a schematic flowchart of a method for extracting test paper information according to another embodiment of the present application. Among them, the method includes:
  • the areas at both ends of the line obtained by the sub-image are the area between the first end of the line and the first edge of the sub-image and the area between the second end of the line of the sub-image and the second edge of the sub-image.
  • S218 Determine the segmentation area of the text area according to the third preset size
  • S220 Whether a separator is detected in the divided area, if the separator is detected in the divided area, go to S216, and if the separator is not detected in the divided area, go to S222;
  • S228 Input the text image into the text recognition model to obtain the text information of the test paper image
  • S230 Correspondingly merge the text information and the text line to obtain the target test paper image
  • the sub-images of the binary image are segmented on one side of the binary image according to the first preset size, all the lines in the sub-image are detected by the straight line detection algorithm, and the lines of all the sub-images are traversed. If the length meets the preset length range, and the area at both ends of the line is blank, the line of the sub-image is used as a binding line.
  • the binary image is divided into the other side of the binary image according to the first preset size
  • the gutter detection is performed again, if there is a gutter in the sub-image, the text area of the binary image is determined according to the gutter, where the first preset size and preset length range can be rationalized according to the layout parameters of the actual test paper Set up.
  • the binary image itself is used as the text area, and the center of the text area is used as the axis.
  • the central area of the text area is determined according to the second preset size, and the separator is detected in the central area.
  • the separator divides the text area to get the layout area. In order to avoid misjudgment problems caused by different test paper layouts, the separator of the divided text area can also be detected again. If there are still separators in the divided text area, the text area will be further divided according to the division symbol. Get a more accurate layout area. If no separator is detected in the central area, the text area is divided according to the third preset size to obtain at least two divided areas, and the separator is detected in each divided area.
  • the separator divides the text area to obtain the layout area. If no separator is detected in the divided area, the text area is regarded as the layout area.
  • the image on the right side of the gutter is taken as the text area; if there is a right gutter, the image on the left side of the gutter is taken as the text area.
  • detecting the separation symbol includes: performing projection processing on the central area or the divided area to obtain the blank area of the binary image; in the case where the width of the blank area is greater than the width threshold, the blank area As a separator.
  • the projection process is performed through the central area or the divided area, and the number of count 0 in the vertical direction can be counted to obtain the projection result array, and the blank area of the binary image is determined according to the projection result array. If the width of the blank area is greater than the width threshold , Use the blank area as the separator, and then divide the text area according to the separator to obtain the layout area, which is convenient for identifying the text information of the test paper image according to the layout area, and realizes efficient and accurate automatic scoring.
  • the width threshold can be rationally set according to the parameters of the conventional test paper layout.
  • detecting the separation symbol includes: performing blurring and/or denoising processing on the central area or the segmented area to obtain lines of the binary image; filtering according to a preset angle range and a preset length threshold Binary image lines to obtain the target line; when the length of the target line is greater than the first preset length, or the length of the target line is greater than the second preset length, and the width of the title area and blank area at both ends of the target line is the same as the length of the target line When the sum is greater than the first preset length, the line of the binary image is used as the separator.
  • the width of the title area and the blank area at both ends of the target line is the width of the title area between the first end of the target line and the first edge of the binary image, and the second end of the target line and the second edge of the binary image.
  • blur and/or denoise processing is performed on the central area or the segmented area, the lines in the binary image are detected, and all the detected lines of the binary image are filtered according to the preset angle range and length threshold to obtain the target Line, if the length of the target line is greater than the first preset length, or while the length of the target line is greater than the second preset length, the sum of the width of the title area and the blank area at both ends of the target line and the length of the target line is greater than the first
  • the line of the binary image is used as a separator, and the text area can be divided according to the separator to obtain the layout area, which is convenient for identifying the text information of the test paper image according to the layout area, and realizes efficient and accurate automation Scoring.
  • the first preset length and the second preset length can be rationally set according to the parameters of the conventional test paper layout.
  • the Hough transform function (hough lines function) of the computer vision library (opencv) is used to identify the lines of the binary image.
  • Fig. 3 shows a schematic flow chart of a method for extracting test paper information according to another embodiment of the present application. Among them, the method includes:
  • S308 Filter the widths of all rectangular boxes according to the preset width range to obtain multiple target widths
  • S312 Select the target width corresponding to the largest number of rectangular boxes as the text line width
  • S324 Input the text image into the text recognition model to obtain the text information of the test paper image
  • S328 Extract test paper information of the target test paper image according to the classification label.
  • the preset condition is that the vertical distance between the center point of the current text box and the center point of the previous text box is less than the first distance threshold, and the horizontal distance between the center point of the current text box and the center point of the previous text box The distance is less than the second distance threshold.
  • the outer edge contour existing in the layout area is identified, the largest circumscribed rectangle of the outer edge contour is taken to form a rectangular frame, the width of all the detected rectangular frames is obtained, and the width of the rectangular frame is filtered according to the preset width range, Obtain multiple target widths, count the number of rectangular boxes corresponding to each of the multiple target widths, select the target width corresponding to the largest number of rectangular boxes as the text line width, determine the text box according to the text line width, and traverse all text Box, if the vertical distance between the center point of the current text box and the center point of the previous text box is less than the first distance threshold, and the horizontal distance between the center point of the current text box and the center point of the previous text box is less than the second distance threshold, Explain that the center points of the above two text boxes are almost on a straight line.
  • the current text box and the previous text box are merged to obtain a text line, so that the text information of the test paper image can be extracted from the text line to realize accurate automatic scoring.
  • Educators can construct a knowledge system based on the recognized text information, which is beneficial for educators to automatically organize test papers, thereby reducing the workload of educators and satisfying multiple needs of users.
  • the first distance threshold and the second distance threshold are the allowable distance error values between the text boxes, which can be set reasonably according to typesetting experience.
  • the method further includes: the vertical distance between the center point of the current rectangular box and the center point of the previous rectangular box is less than a third distance threshold, And when the horizontal distance between the center point of the current rectangular frame and the center point of the previous rectangular frame is less than the fourth distance threshold, the current rectangular frame and the previous rectangular frame are merged.
  • the third distance threshold and the fourth distance threshold are the allowable distance error values between rectangular boxes, which can be set reasonably according to typesetting experience.
  • the method before inputting the text image into the text recognition model, the method further includes: obtaining text data and character data; encoding the text data and character data to obtain a recognition dictionary; and determining the text image set according to the text data ; Construct a text recognition model based on the recognition dictionary and text image collection.
  • the text data is obtained, repeated characters in the text data are excluded, each character in the text data and the character data is encoded starting from 1, to obtain a recognition dictionary, and an image of each character in the text data is obtained according to the text data , Get the text image set.
  • the recognition dictionary and the text image collection the text recognition model is constructed, so that it is convenient to use natural language processing technology to extract multiple types of text information of the test paper, with higher accuracy, faster speed, and improved practicability.
  • Character data includes but is not limited to: Arabic numerals, English letters, punctuation marks, and special characters.
  • drawtext function of PIL (python image processing library) to draw text content on a fixed-size image to obtain an image of the character.
  • DenseNet+CTC dense convolutional network model + time series data classification
  • AlexNet Alex Deep Convolutional Neural Network Model + CTC
  • VGG VGG network structure model + CTC
  • GoogleNet GoogleNet (Google network structure model) + CTC;
  • ResNet deep residual network model + CTC.
  • limiting the number of characters in the recognition dictionary for example, limiting the number of characters to about 4000, can effectively reduce the size of the character recognition model and reduce the amount of system calculation.
  • extracting the test paper information of the target test paper image according to the classification label includes: the classification label includes a title, a big question and a small question; according to the classification key characters respectively corresponding to the title, the big question and the small question , Respectively determine the title text line, the big title text line and the small title text line; extract the test paper information according to the title text line, the big title text line and the small title text line.
  • the classification tags include a title, a big question, and a small title.
  • Each tag has its own classification key characters.
  • the classification key characters are used to identify the starting position and the big question of the title in the target test paper image.
  • the starting position of the title and the starting position of the sub-question are determined to determine the title text line, the main-topic text line, or the sub-title text line corresponding to the classification label, so as to classify the test paper information, thus according to the title text line and the main text
  • the line and subtitle text lines extract different text information to obtain the corresponding test paper information, which is sequentially stored in the database.
  • Use natural language processing technology to extract multiple types of text information of test papers, improve the accuracy of extracting test paper information, effectively reduce the workload of educators, and meet the increasing requirements of electronic scoring, automatic test paper composition, and automatic question storage.
  • the test paper information is composed of a title, a large-question type, and small-question information.
  • the title is used to describe information about the nature of the test question and candidate information, such as information such as test questions for a designated subject at a designated grade and stage.
  • the question type is used to describe the category information of the test questions.
  • the category information of the test questions includes multiple choice questions, calculation questions, applied questions, fill in the blanks, answer questions, multiple choice questions, multiple choice questions, essay questions, non-choice questions, experimental questions, Optional questions, optional exam questions and other question types, sub-question information can be divided into question number, question stem information and score information.
  • the method further includes: performing coordinate information processing on the target test paper image; If the abscissa exceeds the preset coordinate range, or the abscissa of the subtitle text line does not meet the sequence number increasing rule, the subtitle text line is deleted.
  • coordinate information processing is performed on the target test paper image to obtain the coordinates of all text lines. If the abscissa of the subtitle text line exceeds the preset coordinate range, or the abscissa of the subtitle text line does not satisfy the sequence number increasing rule, Deleting the subtitle text line, on the one hand, can locate the position of the text information, on the other hand, the text line is calibrated through the coordinates of the text line to remove the misjudged text line and improve the accuracy of extracting test paper information.
  • test paper information extraction method of another embodiment of the present application includes:
  • S402 Perform layout analysis on the input test paper image to obtain a rectangular area of the binding line and all rectangular areas of the layout;
  • S406 Perform OCR recognition on the text lines of each layout, and merge the results to obtain the final test paper text
  • S410 Extract the serial numbers of candidate sub-questions according to the information of the big questions, and generate a list of question numbers from the serial number features;
  • test paper image (img) obtained by scanning is shown in Figure 5.
  • the layout of the test paper image img is analyzed, and the rectangular area of the gutter is obtained (if there is a gutter, the rectangular area of the gutter is obtained; if there is no gutter, there is no Gutter rectangular area), and all the rectangular areas of the layout, as shown in Figure 6 to Figure 8;
  • Layout separators can be: blank areas, dashed lines, and straight lines that exceed the specified size and width.
  • the length of the line is greater than 2/3 of the width of the image img3, and the lower end of the line is all blank areas.
  • the upper end of the line is the title, and the sum of the width of the title and the length of the line and the width of the blank area at the lower end of the line is greater than that of the image img3 4/5 of the width;
  • step 1.2.2 If the delimiter is not detected in step 1.2.2, then analyze the layout of the three columns;
  • 1.2.3.2 Repeat the steps of 1.2.2.2 to detect the separator layout_line1. If the detection is successful, take the image right_img at 2/3 of the image img2, its length is 1/5 of img2, and the width is the width of img2; also repeat 1.2 .2.2 step, detect the separator layout_line2, if the detection is successful, use the separator layout_line1 and the separator layout_line2 to divide the image img2 into three columns; if the layout_line1 is not detected, the entire img2 is one column.
  • the remaining rectangular boxes are text boxes that may contain text
  • the text line detection result of layout area 1 (FIG. 7) is shown in FIG. 9;
  • each character is coded from 1;
  • test paper text information test paper name, subject, unit, test type, test number area, name area, big question information (serial number, question type, score information, area, etc.);
  • test paper name keywords include: exam, test paper, test, test, simulation, etc.;
  • Subject keywords include: mathematics, Chinese, English, physics, chemistry, biology, geography, Politics, history;
  • test type keywords include: mid-term, final, simulation, competition, etc.
  • the text line area where the keyword is located is the beginning of the test number area, and then expand the area upwards , Is the exam number area, the keywords of exam number include: exam number, student number, admission ticket number, etc.;
  • test paper traverse the first 5 lines of the test paper text. If the test paper has one of the test number keywords in 4.5.1, the text line area where the key word is located is the beginning of the test number area Position, and then expand the area to the right, which is the examination number area.
  • Preset types of big questions multiple-choice questions, calculation questions, application questions, fill-in-the-blank questions, answer questions, single-choice questions, multiple-choice questions, essay questions, non-choice questions, experimental questions, optional questions, selective examination questions, etc.;
  • This big question has a total of ( ⁇ d ⁇ 1,3 ⁇ ) small questions. *Each small question ( ⁇ d ⁇ 1,3 ⁇ ) points.*(Total
  • the matched value is used as the number of small questions, the score of each small question, and the total score of the big question in turn;
  • This big question has a total of ( ⁇ d ⁇ 1,3 ⁇ ) small questions.*(Total
  • sub-question information serial number, question type, score information, area, etc.
  • the remaining serial number is the serial number of the sub-topic below the main question, and the corresponding text line area is used as the starting position of the sub-topic area;
  • the ending position of each question is the starting position of the next question. If it reaches the end of the layout, the end of the layout is the ending position of the question area;
  • the layout analysis results are: the gutter area is represented as: [5,5,214,2330]; the layout area is represented as: [235,5,1505,2330], [1746,5,1559,2330 ];
  • the text line detection result of layout area 1 ( Figure 7) is shown in Figure 9.
  • the text line detection result of the test paper image is subjected to OCR recognition to obtain the text information, and the text information and the text line are merged to obtain the following result:
  • line detection and blank area detection are used to realize test paper layout analysis, which can automatically identify the typesetting information of test papers, and use OCR method based on deep learning convolutional neural network for test paper analysis to analyze test paper images.
  • kind of demand are used to realize test paper layout analysis, which can automatically identify the typesetting information of test papers, and use OCR method based on deep learning convolutional neural network for test paper analysis to analyze test paper images.
  • a test paper information extraction system 500 is proposed, as shown in FIG. 12, including a memory 502, a processor 504, and a computer program stored in the memory 502 and running on the processor 504 When the processor 504 executes the computer program, the method for extracting test paper information in any of the foregoing embodiments is implemented.
  • a computer-readable storage medium on which a computer program is stored, and when the computer program is executed by a processor, the steps of the test paper information extraction method as in any of the above embodiments are implemented.
  • connection can be a fixed connection, a detachable connection, or an integral connection; it can be directly connected or indirectly connected through an intermediate medium.
  • the description of the terms “one embodiment”, “some embodiments”, “specific embodiments”, etc. means that the features, structures, materials or characteristics described in conjunction with the embodiment or examples are included in the application In at least one embodiment or example.
  • the schematic representations of the aforementioned terms do not necessarily refer to the same embodiment or example.
  • the described features, structures, materials or characteristics can be combined in any one or more embodiments or examples in a suitable manner.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Character Input (AREA)

Abstract

Disclosed are a test paper information extraction method and system, and a storage medium. The test paper information extraction method comprises: preprocessing a test paper image to obtain a binary image (S102); determining a layout area of the binary image (S104); acquiring text lines of the test paper image according to the layout area (S106); extracting a text image according to the text lines (S108); inputting the text image into a character recognition model to obtain text information of the test paper image (S110); correspondingly combining the text information and the text lines to obtain a target test paper image (S112); and extracting test paper information of the target test paper image according to a classification tag (S114).

Description

试卷信息提取方法、系统及计算机可读存储介质Test paper information extraction method, system and computer readable storage medium
本申请要求在2019年06年26日提交中国专利局、申请号为201910559124.6的中国专利申请的优先权,该申请的全部内容通过引用结合在本申请中。This application claims the priority of a Chinese patent application filed with the Chinese Patent Office with an application number of 201910559124.6 on June 26, 2019. The entire content of this application is incorporated into this application by reference.
技术领域Technical field
本申请涉及电子教学技术领域,例如,涉及一种试卷信息提取方法、系统及计算机可读存储介质。This application relates to the field of electronic teaching technology, for example, to a method, system and computer-readable storage medium for extracting test paper information.
背景技术Background technique
随着计算机和互联网技术的发展,人们越来越多的使用自动化设备对学生考试试卷进行阅卷。相关技术中,自动阅卷方法通常只能对固定模板的试卷进行分析,即只能把试卷与系统已经存储的多种模板进行匹配,用匹配得到的模板进行分析。但实际操作中,很多真实试卷的版面和类型都不一定与固定模板匹配,因此需要提供一种能够对任意试卷(常规试卷、一般答题卡、专用答题卡等)进行准确识别和自动分析的方案,以满足人们日益增长的电子阅卷要求。With the development of computer and Internet technology, more and more people use automated equipment to mark students' examination papers. In related technologies, the automatic scoring method usually can only analyze test papers with a fixed template, that is, it can only match the test paper with a variety of templates stored in the system, and use the matched template for analysis. However, in actual operation, the layout and type of many real test papers may not match the fixed template, so it is necessary to provide a solution that can accurately identify and automatically analyze any test paper (regular test paper, general answer sheet, special answer sheet, etc.) , To meet people's increasing electronic marking requirements.
发明内容Summary of the invention
本申请至少解决相关技术中存在的上述技术问题。This application at least solves the above-mentioned technical problems existing in related technologies.
本申请提出了一种试卷信息提取方法,包括:对试卷图像进行预处理,得到二进制图像;确定二进制图像的版面区域;根据版面区域获取试卷图像的文本行;根据文本行提取文本图像;将所述文本图像输入文字识别模型,得到试卷图像的文本信息;对应合并文本信息与文本行,得到目标试卷图像;根据分类标签提取目标试卷图像的试卷信息。This application proposes a method for extracting test paper information, including: preprocessing test paper images to obtain binary images; determining the layout area of the binary image; obtaining text lines of the test paper image according to the layout area; extracting text images from the text lines; The text image is input into the text recognition model to obtain the text information of the test paper image; the corresponding text information and the text line are merged to obtain the target test paper image; the test paper information of the target test paper image is extracted according to the classification label.
本申请提出了一种试卷信息提取系统,包括存储器、处理器及存储在存储器上并可在处理器上运行的计算机程序,处理器执行计算机程序时实现上述任一技术方案的试卷信息提取方法。This application proposes a test paper information extraction system, which includes a memory, a processor, and a computer program stored in the memory and capable of running on the processor. The processor implements the test paper information extraction method of any of the above technical solutions when the processor executes the computer program.
本申请提出了一种计算机可读存储介质,存储有计算机程序,计算机程序被处理器执行时实现如上述任一技术方案的试卷信息提取方法。This application proposes a computer-readable storage medium that stores a computer program, and when the computer program is executed by a processor, the method for extracting test paper information as any of the above technical solutions is realized.
附图说明Description of the drawings
图1示出了本申请一个实施例的试卷信息提取方法流程示意图;Figure 1 shows a schematic flow chart of a method for extracting test paper information according to an embodiment of the present application;
图2示出了本申请又一个实施例的试卷信息提取方法流程示意图;Fig. 2 shows a schematic flow chart of a method for extracting test paper information according to another embodiment of the present application;
图3示出了本申请又一个实施例的试卷信息提取方法流程示意图;Fig. 3 shows a schematic flow chart of a method for extracting test paper information according to another embodiment of the present application;
图4示出了本申请又一个实施例的试卷信息提取方法流程示意图;FIG. 4 shows a schematic flow chart of a method for extracting test paper information according to another embodiment of the present application;
图5示出了本申请一个实施例的试卷图像;Figure 5 shows a test paper image of an embodiment of the present application;
图6示出了本申请一个实施例的试卷版面区域分析结果图像;FIG. 6 shows an analysis result image of the test paper layout area of an embodiment of the present application;
图7示出了图5的一个版面区域;Figure 7 shows a layout area of Figure 5;
图8示出了图5的另一个版面区域;Figure 8 shows another layout area of Figure 5;
图9示出了图7的文本行检测结果图像;FIG. 9 shows the text line detection result image of FIG. 7;
图10示出了本申请一个实施例的文本信息提取结果图像;FIG. 10 shows a text information extraction result image of an embodiment of the present application;
图11示出了本申请一个实施例的构建文本识别模型的示意图;FIG. 11 shows a schematic diagram of constructing a text recognition model according to an embodiment of the present application;
图12示出了本申请一个实施例的试卷信息提取系统示意框图。Fig. 12 shows a schematic block diagram of a test paper information extraction system according to an embodiment of the present application.
具体实施方式Detailed ways
下面结合附图和具体实施方式对本申请进行描述。The application will be described below with reference to the drawings and specific implementations.
在下面的描述中阐述了很多细节以便于充分理解本申请,但是,本申请还可以采用其他不同于在此描述的其他方式来实施,因此,本申请的保护范围并不限于下面公开的具体实施例的限制。In the following description, many details are set forth in order to fully understand this application. However, this application can also be implemented in other ways different from those described here. Therefore, the scope of protection of this application is not limited to the specific implementations disclosed below. Limitations of cases.
本申请实施例提出一种试卷信息提取方法,图1示出了本申请一个实施例的试卷信息提取方法流程示意图。其中,该方法包括:An embodiment of the present application proposes a method for extracting test paper information, and FIG. 1 shows a schematic flowchart of the method for extracting test paper information according to an embodiment of the present application. Among them, the method includes:
S102,对试卷图像进行预处理,得到二进制图像;S102, preprocessing the test paper image to obtain a binary image;
S104,确定二进制图像的版面区域;S104: Determine the layout area of the binary image;
S106,根据版面区域获取试卷图像的文本行;S106: Obtain the text line of the test paper image according to the layout area;
S108,根据文本行提取文本图像;S108, extract a text image according to the text line;
S110,将文本图像输入文字识别模型,得到试卷图像的文本信息;S110: Input the text image into the text recognition model to obtain the text information of the test paper image;
S112,对应合并文本信息与文本行,得到目标试卷图像;S112, correspondingly merge the text information and the text line to obtain the target test paper image;
S114,根据分类标签提取目标试卷图像的试卷信息。S114: Extract test paper information of the target test paper image according to the classification label.
本申请提供的试卷信息提取方法,融合图像处理算法、自然语言处理算法和深度学习神经网络模型技术,通过对试卷图像进行预处理,得到二进制 图像,分析二进制图像以确定二进制图像的版面区域,即获取试卷的排版信息,对每个版面区域进行文本行检测,遍历每个版面的文本行,取文本行最大的外接矩形区域抠出对应的文本图像,将文本图像输入到文字识别(Optical Character Recognition,OCR)模型中进行匹配,识别出试卷图像的文本信息,对应合并文本信息与文本行,得到已识别出文本信息的目标试卷图像,根据不同的分类标签提取目标试卷图像中的试卷信息,例如考生信息、考题信息等,输出所有试卷信息。通过上述试卷信息提取方法,可自动识别试卷的排版信息,即使试卷的版面和类型都不同,也能对试卷图像进行准确识别和自动分析,获得试卷信息,不仅实现了高效、精准的自动化阅卷,还能够提升系统的适用范围,并且可将识别出的试卷信息和排版信息上传至数据库,以便构建知识体系,有利于教育工作者进行自动组卷,从而有效降低教育工作者的工作量,满足用户的多种需求。The test paper information extraction method provided in this application combines image processing algorithms, natural language processing algorithms, and deep learning neural network model technology. The binary image is obtained by preprocessing the test paper image, and the binary image is analyzed to determine the layout area of the binary image, namely Obtain the typesetting information of the test paper, detect the text line of each layout area, traverse the text line of each layout, take the largest circumscribed rectangular area of the text line to cut out the corresponding text image, and input the text image into the text recognition (Optical Character Recognition) , OCR) model is matched, the text information of the test paper image is recognized, the text information and the text line are merged correspondingly, and the target test paper image of the recognized text information is obtained, and the test paper information in the target test paper image is extracted according to different classification tags, for example, Candidate information, test question information, etc., output all test paper information. Through the above test paper information extraction method, the typesetting information of the test paper can be automatically identified. Even if the layout and type of the test paper are different, the test paper image can be accurately identified and automatically analyzed to obtain test paper information, which not only realizes efficient and accurate automatic scoring, but also It can also improve the scope of application of the system, and upload the identified test paper information and typesetting information to the database to build a knowledge system, which is conducive to the automatic composition of educators, thereby effectively reducing the workload of educators and satisfying users Various needs.
在一实施例中,预处理为二值化处理,还可以根据实际需求对二进制图像进行平滑处理、图像倾斜处理等操作,其中,图像倾斜处理包括:将二进制图像进行投影,使得二进制图像的边缘位置在所投影的图像上生成相应的标记,根据标记确定倾斜图像的位置,根据倾斜图像边缘与标准水平方向或标准垂直方向相差的角度,将倾斜图像的位置进行旋转实现图像矫正。文本行为利用计算机视觉库(opencv)的图像处理函数(findcontours函数)识别出二进制图像中带有文本信息的矩形框。In an embodiment, the preprocessing is binarization processing. The binary image can also be smoothed and image tilted according to actual needs. The image tilt processing includes: projecting the binary image so that the edge of the binary image The position generates a corresponding mark on the projected image, determines the position of the oblique image according to the mark, and rotates the position of the oblique image to achieve image correction according to the angle between the edge of the oblique image and the standard horizontal direction or the standard vertical direction. The text behavior uses the image processing function (findcontours function) of the computer vision library (opencv) to identify the rectangular frame with text information in the binary image.
图2示出了本申请又一个实施例的试卷信息提取方法流程示意图。其中,该方法包括:Fig. 2 shows a schematic flowchart of a method for extracting test paper information according to another embodiment of the present application. Among them, the method includes:
S202,对试卷图像进行预处理,得到二进制图像;S202, preprocessing the test paper image to obtain a binary image;
S204,根据第一预设尺寸,确定二进制图像的子图像;S204: Determine a sub-image of the binary image according to the first preset size;
S206,检测子图像的线条;S206: Detect lines of sub-images;
S208,选取线条长度满足预设长度范围,且线条两端的区域为空白的子图像的线条作为装订线;S208: Select the line of the sub-image whose line length meets the preset length range and the area at both ends of the line is blank as the binding line;
本实施例中,子图像得到线条两端的区域为线条的第一端与子图像的第一边缘之间的区域以及子图像的线条的第二端与子图像的第二边缘之间的区域。In this embodiment, the areas at both ends of the line obtained by the sub-image are the area between the first end of the line and the first edge of the sub-image and the area between the second end of the line of the sub-image and the second edge of the sub-image.
S210,根据装订线,确定二进制图像的文本区域;S210: Determine the text area of the binary image according to the gutter;
S212,根据第二预设尺寸,确定文本区域的中心区域;S212: Determine the central area of the text area according to the second preset size;
S214,是否在中心区域中检测到分隔符号,在中心区域中检测到分隔符号的情况下,进入S216,在中心区域中未检测到分隔符号的情况下,进入S218;S214, whether a separator is detected in the central area, if the separator is detected in the central area, go to S216, and if the separator is not detected in the central area, go to S218;
S216,根据分隔符号,确定版面区域,进入S224;S216: Determine the layout area according to the separator symbol, and enter S224;
S218,根据第三预设尺寸,确定文本区域的分割区域;S218: Determine the segmentation area of the text area according to the third preset size;
S220,是否在分割区域中检测到分隔符号,在分割区域中检测到分隔符的情况下,进入S216,在分割区域中未检测到分隔符的情况下,进入S222;S220: Whether a separator is detected in the divided area, if the separator is detected in the divided area, go to S216, and if the separator is not detected in the divided area, go to S222;
S222,将文本区域作为版面区域;S222: Use the text area as a layout area;
S224,根据版面区域,获取试卷图像的文本行;S224: Obtain the text line of the test paper image according to the layout area;
S226,根据文本行提取文本图像;S226: Extract a text image according to the text line;
S228,将文本图像输入文字识别模型,得到试卷图像的文本信息;S228: Input the text image into the text recognition model to obtain the text information of the test paper image;
S230,对应合并文本信息与文本行,得到目标试卷图像;S230: Correspondingly merge the text information and the text line to obtain the target test paper image;
S232,根据分类标签提取目标试卷图像的试卷信息。S232: Extract test paper information of the target test paper image according to the classification label.
在该实施例中,按照第一预设尺寸在二进制图像的一侧分割出二进制图像的子图像,使用直线检测算法检测子图像中所有的线条,遍历所有子图像的线条,若子图像的线条的长度满足预设长度范围,同时该线条两端的区域为空白,将该子图像的线条作为装订线,若没有符合条件的线条,则按照第一预设尺寸在二进制图像的另一侧分割出二进制图像的子图像,重新进行装订线的检测,若子图像中存在装订线,则根据装订线确定二进制图像的文本区域,其中,第一预设尺寸和预设长度范围可根据实际试卷的版面参数合理化设置。通过上述技术方案,能够准确识别出试卷图像的装订线,便于后续对试卷版面区域进行进一步分析和识别。In this embodiment, the sub-images of the binary image are segmented on one side of the binary image according to the first preset size, all the lines in the sub-image are detected by the straight line detection algorithm, and the lines of all the sub-images are traversed. If the length meets the preset length range, and the area at both ends of the line is blank, the line of the sub-image is used as a binding line. If there is no line that meets the conditions, the binary image is divided into the other side of the binary image according to the first preset size The sub-image of the image, the gutter detection is performed again, if there is a gutter in the sub-image, the text area of the binary image is determined according to the gutter, where the first preset size and preset length range can be rationalized according to the layout parameters of the actual test paper Set up. Through the above technical solution, the binding line of the test paper image can be accurately identified, which is convenient for further analysis and recognition of the test paper layout area.
若子图像中不存在装订线,则将二进制图像本身作为文本区域,再以文本区域中心为轴,根据第二预设尺寸确定文本区域的中心区域,在中心区域中检测分隔符号,根据检测到的分隔符号分割文本区域,从而得到版面区域。为了避免试卷的版面不同所导致的误判问题,还可以再次检测分割后的文本区域的分隔符号,若分割后的文本区域仍然存在分隔符号,则根据分割符号 对文本区域进行进一步的分割,以获取更加准确的版面区域。若中心区域中未检测到分隔符号,则按照第三预设尺寸分割文本区域,得到至少两个分割区域,在每一个分割区域中检测分隔符号,若在分割区域中检测到分隔符号,则根据分隔符号分割文本区域,从而得到版面区域,若分割区域中未检测到分隔符号,则将文本区域作为版面区域。通过上述实施例,即使试卷的版面和类型都不同,也能够准确识别版面区域,降低误判概率,从而对试卷图像进行准确识别和自动分析,获得试卷信息,不仅实现了高效、精准的自动化阅卷,还能够提升系统的适用范围。If there is no gutter in the sub-image, the binary image itself is used as the text area, and the center of the text area is used as the axis. The central area of the text area is determined according to the second preset size, and the separator is detected in the central area. The separator divides the text area to get the layout area. In order to avoid misjudgment problems caused by different test paper layouts, the separator of the divided text area can also be detected again. If there are still separators in the divided text area, the text area will be further divided according to the division symbol. Get a more accurate layout area. If no separator is detected in the central area, the text area is divided according to the third preset size to obtain at least two divided areas, and the separator is detected in each divided area. If the separator is detected in the divided area, then The separator divides the text area to obtain the layout area. If no separator is detected in the divided area, the text area is regarded as the layout area. Through the above embodiments, even if the layout and type of the test paper are different, the layout area can be accurately identified, and the probability of misjudgment can be reduced, so that the test paper image can be accurately identified and automatically analyzed to obtain test paper information, which not only realizes efficient and accurate automatic scoring , Can also improve the scope of application of the system.
在一实施例中,若存在左装订线,则取装订线右侧区域的图像为文本区域;若存在右装订线,则取装订线左侧区域的图像为文本区域。In one embodiment, if there is a left gutter, the image on the right side of the gutter is taken as the text area; if there is a right gutter, the image on the left side of the gutter is taken as the text area.
在本申请的一个实施例中,可选地,检测分隔符号包括:对中心区域或分割区域进行投影处理,得到二进制图像的空白区域;在空白区域的宽度大于宽度阈值的情况下,将空白区域作为分隔符号。In an embodiment of the present application, optionally, detecting the separation symbol includes: performing projection processing on the central area or the divided area to obtain the blank area of the binary image; in the case where the width of the blank area is greater than the width threshold, the blank area As a separator.
在该实施例中,通过中心区域或分割区域进行投影处理,能够统计垂直方向计数0的个数,得到投影结果数组,根据投影结果数组确定二进制图像的空白区域,若空白区域的宽度大于宽度阈值,将空白区域作为分隔符号,进而能够根据分隔符号对文本区域进行分割,得到版面区域,便于根据版面区域识别试卷图像的文本信息,实现了高效、精准的自动化阅卷。本实施例中,宽度阈值可根据常规试卷版面参数合理化设置。In this embodiment, the projection process is performed through the central area or the divided area, and the number of count 0 in the vertical direction can be counted to obtain the projection result array, and the blank area of the binary image is determined according to the projection result array. If the width of the blank area is greater than the width threshold , Use the blank area as the separator, and then divide the text area according to the separator to obtain the layout area, which is convenient for identifying the text information of the test paper image according to the layout area, and realizes efficient and accurate automatic scoring. In this embodiment, the width threshold can be rationally set according to the parameters of the conventional test paper layout.
在本申请的一个实施例中,可选地,检测分隔符号包括:对中心区域或分割区域进行模糊和/或去噪处理,得到二进制图像的线条;根据预设角度范围和预设长度阈值筛选二进制图像的线条,得到目标线条;在目标线条的长度大于第一预设长度,或目标线条的长度大于第二预设长度,且目标线条两端的标题区域和空白区域的宽度与目标线条的长度之和大于第一预设长度的情况下,将二进制图像的线条作为分隔符号。In an embodiment of the present application, optionally, detecting the separation symbol includes: performing blurring and/or denoising processing on the central area or the segmented area to obtain lines of the binary image; filtering according to a preset angle range and a preset length threshold Binary image lines to obtain the target line; when the length of the target line is greater than the first preset length, or the length of the target line is greater than the second preset length, and the width of the title area and blank area at both ends of the target line is the same as the length of the target line When the sum is greater than the first preset length, the line of the binary image is used as the separator.
本实施例中,目标线条两端的标题区域和空白区域的宽度为目标线条的第一端与二进制图像的第一边缘之间的标题区域的宽度、目标线条的第二端与二进制图像的第二边缘之间的空白区域的宽度。In this embodiment, the width of the title area and the blank area at both ends of the target line is the width of the title area between the first end of the target line and the first edge of the binary image, and the second end of the target line and the second edge of the binary image. The width of the blank area between the edges.
在该实施例中,对中心区域或分割区域进行模糊和/或去噪处理,检测二 进制图像中的线条,根据预设角度范围和长度阈值对检测到的全部二进制图像的线条进行筛选,得到目标线条,若目标线条的长度大于第一预设长度,或在目标线条的长度大于第二预设长度的同时,目标线条两端的标题区域和空白区域的宽度与目标线条的长度之和大于第一预设长度的情况下,将该二进制图像的线条作为分隔符号,进而能够根据分隔符号对文本区域进行分割,得到版面区域,便于根据版面区域识别试卷图像的文本信息,实现了高效、精准的自动化阅卷。In this embodiment, blur and/or denoise processing is performed on the central area or the segmented area, the lines in the binary image are detected, and all the detected lines of the binary image are filtered according to the preset angle range and length threshold to obtain the target Line, if the length of the target line is greater than the first preset length, or while the length of the target line is greater than the second preset length, the sum of the width of the title area and the blank area at both ends of the target line and the length of the target line is greater than the first In the case of preset length, the line of the binary image is used as a separator, and the text area can be divided according to the separator to obtain the layout area, which is convenient for identifying the text information of the test paper image according to the layout area, and realizes efficient and accurate automation Scoring.
本实施例中,第一预设长度和第二预设长度可根据常规试卷版面参数合理化设置。In this embodiment, the first preset length and the second preset length can be rationally set according to the parameters of the conventional test paper layout.
在一实施例中,利用计算机视觉库(opencv)的霍夫变换函数(hough lines函数)识别二进制图像的线条。In one embodiment, the Hough transform function (hough lines function) of the computer vision library (opencv) is used to identify the lines of the binary image.
图3示出了本申请又一个实施例的试卷信息提取方法流程示意图。其中,该方法包括:Fig. 3 shows a schematic flow chart of a method for extracting test paper information according to another embodiment of the present application. Among them, the method includes:
S302,对试卷图像进行预处理,得到二进制图像;S302, preprocessing the test paper image to obtain a binary image;
S304,确定二进制图像的版面区域;S304: Determine the layout area of the binary image;
S306,识别版面区域中的矩形框;S306, identifying a rectangular frame in the layout area;
S308,根据预设宽度范围筛选所有矩形框的宽度,得到多个目标宽度;S308: Filter the widths of all rectangular boxes according to the preset width range to obtain multiple target widths;
S310,统计多个目标宽度中的每个目标宽度对应的矩形框个数;S310: Count the number of rectangular frames corresponding to each target width among the multiple target widths;
S312,选取最大的矩形框个数对应的目标宽度作为文本行宽度;S312: Select the target width corresponding to the largest number of rectangular boxes as the text line width;
S314,根据文本行宽度确定文本框;S314: Determine a text box according to the width of the text line;
S316,当前文本框与前一个文本框是否满足预设条件,若当前文本框与前一个文本框满足预设条件,进入S318,当前文本框与前一个文本框不满足预设条件,进入S320;S316, whether the current text box and the previous text box meet the preset conditions, if the current text box and the previous text box meet the preset conditions, go to S318, and the current text box and the previous text box do not meet the preset conditions, go to S320;
S318,合并当前文本框和前一个文本框,得到一块文本行;S318, merge the current text box and the previous text box to obtain a piece of text line;
S320,当前文本框和前一个文本框分别作为一块文本行;S320, the current text box and the previous text box are respectively used as a piece of text line;
S322,根据文本行,提取文本图像;S322: Extract a text image according to the text line;
S324,将文本图像输入文字识别模型,得到试卷图像的文本信息;S324: Input the text image into the text recognition model to obtain the text information of the test paper image;
S326,对应合并文本信息与文本行,得到目标试卷图像;S326, correspondingly merge the text information and the text line to obtain the target test paper image;
S328,根据分类标签提取目标试卷图像的试卷信息。S328: Extract test paper information of the target test paper image according to the classification label.
在一实施例中,预设条件为当前文本框的中心点与前一个文本框的中心点的垂直距离小于第一距离阈值,且当前文本框的中心点与前一个文本框的中心点的水平距离小于第二距离阈值。In one embodiment, the preset condition is that the vertical distance between the center point of the current text box and the center point of the previous text box is less than the first distance threshold, and the horizontal distance between the center point of the current text box and the center point of the previous text box The distance is less than the second distance threshold.
在该实施例中,识别版面区域中存在的外边缘轮廓,取外边缘轮廓的最大外接矩形,形成矩形框,获取检测到的全部矩形框的宽度,根据预设宽度范围筛选矩形框的宽度,得到多个目标宽度,统计多个目标宽度中的每个宽度对应的矩形框个数,选取最大的矩形框个数对应的目标宽度作为文本行宽度,根据文本行宽度确定文本框,遍历所有文本框,若当前文本框的中心点与前一个文本框的中心点的垂直距离小于第一距离阈值,且当前文本框的中心点与前一个文本框的中心点的水平距离小于第二距离阈值,说明上述两个文本框的中心点几乎在一条直线上,此时合并当前文本框和前一个文本框,得到文本行,以便于根据文本行提取试卷图像的文本信息,实现精准地自动化阅卷,而且教育工作者能够根据识别到的文本信息构建知识体系,有利于教育工作者进行自动组卷,从而降低教育工作者的工作量,满足用户的多种需求。其中,第一距离阈值和第二距离阈值为文本框之间允许距离误差值,可以根据排版经验进行合理设置。In this embodiment, the outer edge contour existing in the layout area is identified, the largest circumscribed rectangle of the outer edge contour is taken to form a rectangular frame, the width of all the detected rectangular frames is obtained, and the width of the rectangular frame is filtered according to the preset width range, Obtain multiple target widths, count the number of rectangular boxes corresponding to each of the multiple target widths, select the target width corresponding to the largest number of rectangular boxes as the text line width, determine the text box according to the text line width, and traverse all text Box, if the vertical distance between the center point of the current text box and the center point of the previous text box is less than the first distance threshold, and the horizontal distance between the center point of the current text box and the center point of the previous text box is less than the second distance threshold, Explain that the center points of the above two text boxes are almost on a straight line. At this time, the current text box and the previous text box are merged to obtain a text line, so that the text information of the test paper image can be extracted from the text line to realize accurate automatic scoring. Educators can construct a knowledge system based on the recognized text information, which is beneficial for educators to automatically organize test papers, thereby reducing the workload of educators and satisfying multiple needs of users. Among them, the first distance threshold and the second distance threshold are the allowable distance error values between the text boxes, which can be set reasonably according to typesetting experience.
在本申请的一个实施例中,可选地,根据矩形框宽度确定文本行宽度之前,还包括:在当前矩形框的中心点与前一个矩形框的中心点的垂直距离小于第三距离阈值,且当前矩形框的中心点与前一个矩形框的中心点的水平距离小于第四距离阈值的情况下,合并当前矩形框和前一个矩形框。In an embodiment of the present application, optionally, before determining the text line width according to the width of the rectangular box, the method further includes: the vertical distance between the center point of the current rectangular box and the center point of the previous rectangular box is less than a third distance threshold, And when the horizontal distance between the center point of the current rectangular frame and the center point of the previous rectangular frame is less than the fourth distance threshold, the current rectangular frame and the previous rectangular frame are merged.
在该实施例中,若当前矩形框的中心点与前一个矩形框的中心点的垂直距离小于第三距离阈值,且当前矩形框的中心点与前一个矩形框的中心点的水平距离小于第四距离阈值,说明上述两个矩形框的中心点几乎在一条直线上而且距离较近,此时合并当前矩形框和前一个矩形框,从而减少识别出的有效矩形框数量,在获取试卷图像的文本行过程中降低系统计算量,提升提取文本信息效率。本实施例中,第三距离阈值和第四距离阈值为矩形框之间允许距离误差值,可以根据排版经验进行合理设置。In this embodiment, if the vertical distance between the center point of the current rectangular frame and the center point of the previous rectangular frame is less than the third distance threshold, and the horizontal distance between the center point of the current rectangular frame and the center point of the previous rectangular frame is less than the first Four distance thresholds, indicating that the center points of the two rectangular boxes are almost on a straight line and the distance is relatively close. At this time, the current rectangular box and the previous rectangular box are merged to reduce the number of effective rectangular boxes recognized. In the text line process, the system calculation is reduced and the efficiency of extracting text information is improved. In this embodiment, the third distance threshold and the fourth distance threshold are the allowable distance error values between rectangular boxes, which can be set reasonably according to typesetting experience.
在本申请的一个实施例中,可选地,将文本图像输入文字识别模型之前,还包括:获取文本数据和字符数据;编码文本数据和字符数据,得到识别词 典;根据文本数据确定文本图像集;根据识别词典和文本图像集构建文字识别模型。In an embodiment of the present application, optionally, before inputting the text image into the text recognition model, the method further includes: obtaining text data and character data; encoding the text data and character data to obtain a recognition dictionary; and determining the text image set according to the text data ; Construct a text recognition model based on the recognition dictionary and text image collection.
在该实施例中,获取文本数据,排除文本数据中的重复字符,从1开始编码文本数据和字符数据中的每个字符,得到识别词典,并根据文本数据获取文本数据中每个字符的图像,得到文本图像集。根据识别词典和文本图像集构建文字识别模型,从而便于使用自然语言处理技术提取试卷的多类文本信息,准确率更高、速度更快,提高实用性。In this embodiment, the text data is obtained, repeated characters in the text data are excluded, each character in the text data and the character data is encoded starting from 1, to obtain a recognition dictionary, and an image of each character in the text data is obtained according to the text data , Get the text image set. According to the recognition dictionary and the text image collection, the text recognition model is constructed, so that it is convenient to use natural language processing technology to extract multiple types of text information of the test paper, with higher accuracy, faster speed, and improved practicability.
在一实施例中,将本地文本语料库和《信息交换用汉字编码字符集》(GB2312)的重叠部分作为文本数据。字符数据包括但不限于:阿拉伯数字、英文字母、标点符号、特殊字符。利用PIL(python图像处理库)的字符处理函数(drawtext函数)在固定尺寸的图像上画出文本内容,得到字符的图像。使用DenseNet+CTC(密集卷积网络模型+时序类数据分类)网络搭建OCR模型,同样的可以使用下列卷积神经网络搭建模型:In one embodiment, the overlapping part of the local text corpus and the "Chinese Character Coded Character Set for Information Exchange" (GB2312) is used as text data. Character data includes but is not limited to: Arabic numerals, English letters, punctuation marks, and special characters. Use the character processing function (drawtext function) of PIL (python image processing library) to draw text content on a fixed-size image to obtain an image of the character. Use DenseNet+CTC (dense convolutional network model + time series data classification) network to build an OCR model, and the following convolutional neural network can also be used to build a model:
LeNet(卷积神经网络模型)+CTC;LeNet (convolutional neural network model) + CTC;
AlexNet(Alex深度卷积神经网络模型)+CTC;AlexNet (Alex Deep Convolutional Neural Network Model) + CTC;
ZF(ZF网络结构模型)+CTC;ZF (ZF network structure model) + CTC;
VGG(VGG网络结构模型)+CTC;VGG (VGG network structure model) + CTC;
GoogleNet(谷歌网络结构模型)+CTC;GoogleNet (Google network structure model) + CTC;
ResNet(深度残差网络模型)+CTC。ResNet (deep residual network model) + CTC.
在一实施例中,在组建识别词典的情况下,限定识别词典中的字符数,例如,限定字符数在4000左右,能够有效减小文字识别模型的大小,减少系统计算量。In one embodiment, in the case of building a recognition dictionary, limiting the number of characters in the recognition dictionary, for example, limiting the number of characters to about 4000, can effectively reduce the size of the character recognition model and reduce the amount of system calculation.
在本申请的一个实施例中,可选地,根据分类标签提取目标试卷图像的试卷信息包括:分类标签包括标题、大题和小题;根据标题、大题和小题分别对应的分类关键字符,分别确定标题文本行、大题文本行和小题文本行;根据标题文本行、大题文本行和小题文本行提取试卷信息。In an embodiment of the present application, optionally, extracting the test paper information of the target test paper image according to the classification label includes: the classification label includes a title, a big question and a small question; according to the classification key characters respectively corresponding to the title, the big question and the small question , Respectively determine the title text line, the big title text line and the small title text line; extract the test paper information according to the title text line, the big title text line and the small title text line.
在该实施例中,分类标签包括标题、大题和小题,每种标签拥有自身的分类关键字符,以文本行为单位,通过分类关键字符分别识别目标试卷图像中标题的起始位置、大题的起始位置和小题的起始位置,确定与分类标签对 应的标题文本行、大题文本行或小题文本行,从而将试卷信息进行了分类,由此根据标题文本行、大题文本行和小题文本行对不同的文本信息进行提取,得到相应的试卷信息,并依次存入数据库。利用自然语言处理技术提取试卷的多类文本信息,提高提取试卷信息的准确度,有效降低教育工作者的工作量,满足了日益增长的电子阅卷、自动组卷、自动题目入库等要求。In this embodiment, the classification tags include a title, a big question, and a small title. Each tag has its own classification key characters. In units of text behavior, the classification key characters are used to identify the starting position and the big question of the title in the target test paper image. The starting position of the title and the starting position of the sub-question are determined to determine the title text line, the main-topic text line, or the sub-title text line corresponding to the classification label, so as to classify the test paper information, thus according to the title text line and the main text The line and subtitle text lines extract different text information to obtain the corresponding test paper information, which is sequentially stored in the database. Use natural language processing technology to extract multiple types of text information of test papers, improve the accuracy of extracting test paper information, effectively reduce the workload of educators, and meet the increasing requirements of electronic scoring, automatic test paper composition, and automatic question storage.
在一实施例中,通常试卷信息是由标题、大题题型以及小题信息组成,标题用于描述试题性质的信息和考生信息,例如指定年级指定阶段指定科目的考试题目等信息。大题题型用于描述试题的类别信息,试题的类别信息包括选择题、计算题、应用题、填空题、解答题、单选题、多选题、问答题、非选择题、实验题、选做题、选考题等题型,小题信息可分为题序号、题干信息和分数信息。In an embodiment, usually the test paper information is composed of a title, a large-question type, and small-question information. The title is used to describe information about the nature of the test question and candidate information, such as information such as test questions for a designated subject at a designated grade and stage. The question type is used to describe the category information of the test questions. The category information of the test questions includes multiple choice questions, calculation questions, applied questions, fill in the blanks, answer questions, multiple choice questions, multiple choice questions, essay questions, non-choice questions, experimental questions, Optional questions, optional exam questions and other question types, sub-question information can be divided into question number, question stem information and score information.
在本申请的一个实施例中,可选地,根据标题文本行、大题文本行和小题文本行提取试卷信息之前,还包括:对目标试卷图像进行坐标信息处理;在小题文本行的横坐标超出预设坐标范围,或小题文本行的横坐标不满足序号递增规则的情况下,删除小题文本行。In an embodiment of the present application, optionally, before extracting the test paper information according to the title text line, the big title text line, and the subtitle text line, the method further includes: performing coordinate information processing on the target test paper image; If the abscissa exceeds the preset coordinate range, or the abscissa of the subtitle text line does not meet the sequence number increasing rule, the subtitle text line is deleted.
在该实施例中,对目标试卷图像进行坐标信息处理,得到所有文本行的坐标,若小题文本行的横坐标超出预设坐标范围,或小题文本行的横坐标不满足序号递增规则,删除小题文本行,一方面,能够定位文本信息的位置,另一方面,通过文本行的坐标对文本行进行校准,以去除误判的文本行,提升提取试卷信息的准确度。In this embodiment, coordinate information processing is performed on the target test paper image to obtain the coordinates of all text lines. If the abscissa of the subtitle text line exceeds the preset coordinate range, or the abscissa of the subtitle text line does not satisfy the sequence number increasing rule, Deleting the subtitle text line, on the one hand, can locate the position of the text information, on the other hand, the text line is calibrated through the coordinates of the text line to remove the misjudged text line and improve the accuracy of extracting test paper information.
如图4所示,本申请的又一个实施例的试卷信息提取方法,包括:As shown in Figure 4, the test paper information extraction method of another embodiment of the present application includes:
S402,对输入的试卷图像进行版面分析,得到装订线矩形区域、所有的版面矩形区域;S402: Perform layout analysis on the input test paper image to obtain a rectangular area of the binding line and all rectangular areas of the layout;
S404,对每个版面进行文本行检测;S404: Perform text line detection on each layout;
S406,对每个版面的文本行进行OCR识别,合并结果得到最终试卷文本;S406: Perform OCR recognition on the text lines of each layout, and merge the results to obtain the final test paper text;
S408,从文本中提取试卷的文本信息;S408: Extract the text information of the test paper from the text;
S410,根据大题信息,提取候选的小题序号,由序号特征生成题号列表;S410: Extract the serial numbers of candidate sub-questions according to the information of the big questions, and generate a list of question numbers from the serial number features;
S412,输出试卷所有信息。S412: Output all information of the test paper.
方法如下:Methods as below:
1.通过扫描获得的试卷图像(img)如图5所示,对试卷图像img进行版面分析,得到装订线矩形区域(若有装订线,则得到装订线矩形区域,若没有装订线,则无装订线矩形区域),以及所有的版面矩形区域,如图6至图8所示;1. The test paper image (img) obtained by scanning is shown in Figure 5. The layout of the test paper image img is analyzed, and the rectangular area of the gutter is obtained (if there is a gutter, the rectangular area of the gutter is obtained; if there is no gutter, there is no Gutter rectangular area), and all the rectangular areas of the layout, as shown in Figure 6 to Figure 8;
1.1检测装订线,如图6所示;1.1 Check the gutter, as shown in Figure 6;
1.1.1从img的左边取其长度的1/5、宽度与img相同的子图part_img;1.1.1 Take 1/5 of the length from the left of the img, and the subimage part_img with the same width as the img;
1.1.2使用直线检测算法检测part_img中的所有直线,得到直线集合line_set,以img的宽度的一半过滤直线,得到直线集合line_set2;1.1.2 Use the line detection algorithm to detect all the lines in part_img, get the line set line_set, filter the lines with half the width of the img, and get the line set line_set2;
1.1.3对直线集合line_set2以x坐标进行降序排序;1.1.3 Sort the line set line_set2 in descending order by x coordinate;
1.1.4遍历直线集合line_set2,若直线line_set[i]满足下面的条件,则该直线就是装订线binding_line;1.1.4 Traverse the line set line_set2, if the line line_set[i] meets the following conditions, the line is the binding_line;
1)直线宽度大于img的宽度的3/4;1) The width of the straight line is greater than 3/4 of the width of the img;
2)直线上顶点到上边缘的区域及直线下顶点到下边缘的区域均为空白区域;2) The area from the vertex to the upper edge of the straight line and the area from the vertex to the lower edge of the straight line are blank areas;
1.1.5若没有符合条件的直线,则从img的右边向左取其长度的1/5、宽度相同的子图part_img,重复1.1.2、1.1.3、1.1.4的步骤。1.1.5 If there is no straight line that meets the conditions, take 1/5 of the length and the same width of the subimage part_img from the right to the left of the img, and repeat the steps of 1.1.2, 1.1.3, and 1.1.4.
1.2检测版面分隔符号;1.2 Check the layout separator;
版面分隔符号可以是:超过指定大小及宽度的空白区域、虚线、直线。Layout separators can be: blank areas, dashed lines, and straight lines that exceed the specified size and width.
1.2.1取试卷正文区域的图像img2;若存在左装订线,则取装订线右边的区域的图像为img2;若存在右装订线,则取装订线左边的区域的图像为img2;若不存在装订线,取试卷图像img为正文区域图像img2;1.2.1 Take the image img2 of the text area of the test paper; if there is a left gutter, take the image of the area on the right of the gutter as img2; if there is a right gutter, take the image of the area on the left of the gutter as img2; if it does not exist Gutter, take the test paper image img as the text area image img2;
1.2.2优先分析双数版面;1.2.2 Prioritize analysis of even-numbered pages;
1.2.2.1取试卷正文区域图像img2的中心区域图像middle_img,长度为正文区域图像img2的长度的1/5,宽度为img2的宽度;1.2.2.1 Take the central area image middle_img of the text area image img2 of the test paper, the length is 1/5 of the length of the text area image img2, and the width is the width of img2;
1.2.2.2检测中心区域图像middle_img中的版面分隔符号,方法如下:1.2.2.2 To detect the layout separator in middle_img of the central area image, the method is as follows:
1.2.2.2.1检测空白区域方法:对中心区域图像middle_img进行二值化处理得到图像binary_img,对binary_img进行垂直投影(垂直方向计数0的个数),得到投影结果数组,若数组中存在宽度大于预设值的区间,则该区间 所在的位置即为版面分隔符号layout_line;1.2.2.2.1 Method of detecting blank area: Binarize the image middle_img in the central area to obtain the image binary_img, and perform vertical projection on the binary_img (count 0 in the vertical direction) to obtain the projection result array. If the width of the array is greater than The interval of the preset value, the position of the interval is the layout_line;
1.2.2.2.2检测线条(直线、虚线)方法:对中心区域图像middle_img进行高斯模糊去噪处理得到图像img3,使用opencv的hough_lines函数检测线条,过滤倾斜角不在[70,110]范围内、长度小于50的线条,得到线条集合line_set3。遍历line_set3,若线条line[i]满足下列任意一条,则该线条所在的位置即为版面分隔符号layout_line:1.2.2.2.2 Method of detecting lines (straight lines, dashed lines): perform Gaussian blur denoising processing on the image middle_img in the center area to obtain the image img3, use the hough_lines function of opencv to detect the lines, filter the inclination angle not within the range of [70, 110], and the length For lines less than 50, the line set line_set3 is obtained. Traverse line_set3, if the line line[i] satisfies any of the following, the position of the line is the layout separator layout_line:
1)线条长度大于图像img3的宽度的4/5;1) The length of the line is greater than 4/5 of the width of the image img3;
2)线条长度大于图像img3的宽度的2/3,且线条下端全为空白区域,同时,直线上端是标题,且标题的宽度、线条的长度,与线条下端空白区域的宽度之和大于图像img3的宽度的4/5;2) The length of the line is greater than 2/3 of the width of the image img3, and the lower end of the line is all blank areas. At the same time, the upper end of the line is the title, and the sum of the width of the title and the length of the line and the width of the blank area at the lower end of the line is greater than that of the image img3 4/5 of the width;
1.2.2.3若检测到版面分隔符号layout_line,则以分隔符号layout_line来分割正文区域图像img2,得到2个区域rect1、rect2,对区域rect1重复1.2.2.1和1.2.2.2的步骤,若检测到版面分隔符号layout_line1,则继续对区域rect2重复1.2.2.1和1.2.2.2的步骤,若检测到版面分隔符号layout_line1的同时检测到分隔符号layout_line2,则以分隔符号layout_line、分隔符号layout_line1、分隔符号layout_line2将图像img2分割为四栏;若没有检测到layout_line1,以分隔符号layout_line将图像img2分割为两栏,分别为版面区域1(图7)和版面区域2(图8);1.2.2.3 If the layout separator layout_line is detected, divide the text area image img2 with the layout separator symbol layout_line to obtain two regions rect1 and rect2. Repeat the steps 1.2.2.1 and 1.2.2.2 for the area rect1. Symbol layout_line1, continue to repeat the steps 1.2.2.1 and 1.2.2.2 for the area rect2. If the layout separator layout_line1 is detected and the separator layout_line2 is detected at the same time, the image img2 will be separated by the separator layout_line, the separator layout_line1 and the separator layout_line2. Divide into four columns; if layout_line1 is not detected, use the separator layout_line to divide the image img2 into two columns, namely layout area 1 (Figure 7) and layout area 2 (Figure 8);
1.2.3若1.2.2的步骤未检测到分隔符号,则再分析三栏的版面;1.2.3 If the delimiter is not detected in step 1.2.2, then analyze the layout of the three columns;
1.2.3.1取正文区域图像img2的长度1/3处的图像left_img,其长度为img2的1/5,宽度为img2的宽度;1.2.3.1 Take the image left_img at 1/3 of the length of the text area image img2, its length is 1/5 of img2, and its width is the width of img2;
1.2.3.2重复1.2.2.2的步骤,检测分隔符号layout_line1,若检测成功,再取图像img2的2/3处的图像right_img,其长度为img2的1/5,宽度为img2的宽度;也重复1.2.2.2步骤,检测分隔符号layout_line2,若检测成功,则以分隔符号layout_line1、分隔符号layout_line2将图像img2分割为三栏;若没有检测到layout_line1,整个img2就是一栏。1.2.3.2 Repeat the steps of 1.2.2.2 to detect the separator layout_line1. If the detection is successful, take the image right_img at 2/3 of the image img2, its length is 1/5 of img2, and the width is the width of img2; also repeat 1.2 .2.2 step, detect the separator layout_line2, if the detection is successful, use the separator layout_line1 and the separator layout_line2 to divide the image img2 into three columns; if the layout_line1 is not detected, the entire img2 is one column.
2.对每个版面进行文本行检测,如图9和图10所示;2. Perform text line detection on each layout, as shown in Figure 9 and Figure 10;
记检测到的版面区域为layout_rects,遍历layout_rects,对layout_rects[i]进行文本行分析;Record the detected layout area as layout_rects, traverse layout_rects, and perform text line analysis on layout_rects[i];
2.1对图片img2进行二值化处理得到图片binary_img2;2.1 Binarize the picture img2 to get the picture binary_img2;
2.2使用opencv的findcontours函数获得图片binary_img2的外边缘轮廓集合contours;2.2 Use opencv's findcontours function to obtain the outer edge contour collection contours of the image binary_img2;
2.3遍历contours,取contours[i]的最大外接矩形,得到矩形框rects;2.3 Traverse contours, take the largest circumscribed rectangle of contours[i], and get rectangular box rects;
2.4合并矩形框:若2个矩形之间的中心点垂直距离小于8,且其中一个矩形的中心点在另外一个矩形的的中心点的水平距离在预设范围之内;2.4 Merge rectangles: if the vertical distance between the center points of two rectangles is less than 8, and the horizontal distance between the center point of one rectangle and the center point of the other rectangle is within the preset range;
2.5计算文本行宽度:取矩形框rects中所有矩形框的宽度heights,去掉异常最大值、最小值,统计在[height,height+C]范围内heights的个数,个数最大值对应的height即为文本行宽度(C为常数,经验值);2.5 Calculate the width of the text line: take the width and heights of all rectangular boxes in the rectangular box rects, remove the abnormal maximum and minimum values, and count the number of heights in the range of [height, height+C]. The height corresponding to the maximum number is Is the width of the text line (C is a constant, empirical value);
2.6以文本行×F(F为大于1的常数,如1.4)为基准,去掉超过文本行×F2.6 Take the text line × F (F is a constant greater than 1, such as 1.4) as the benchmark, and remove the line that exceeds the text × F
的矩形框,剩余的矩形框即为可能存在文本的文本框;, The remaining rectangular boxes are text boxes that may contain text;
2.7以文本行×2的基准,合并矩形框:若2个矩形之间的垂直距离小于8,且水平距离小于文本行×2;2.7 Combine the rectangles based on the text line×2: if the vertical distance between two rectangles is less than 8, and the horizontal distance is less than the text line×2;
2.8从左到右遍历矩形框rects,若当前文本框的中心点与前一个文本框的中心点大致在一条直线上,则把当前文本框合并到前一个文本框中,得到一小块文本行;2.8 Traverse the rectangular box rects from left to right, if the center point of the current text box and the center point of the previous text box are roughly in a straight line, merge the current text box into the previous text box to get a small text line ;
在一实施例中,版面区域1(图7)的文本行检测结果如图9所示;In one embodiment, the text line detection result of layout area 1 (FIG. 7) is shown in FIG. 9;
2.9递归2.8的过程,即可得到整个文本行text_lines。The process of 2.9 recursive 2.8 can get the entire text line text_lines.
3.对每个版面的文本行进行OCR识别,合并得到最终试卷文本paper_text;3. Perform OCR recognition on the text lines of each layout, and merge to obtain the final test paper text paper_text;
3.1遍历每个版面的文本行text_lines[i],取其最大的外接矩形区域max_line_rect,从img2抠出对应的文本图像part_img;3.1 Traverse the text_lines[i] of each layout, take the largest circumscribed rectangle area max_line_rect, and cut out the corresponding text image part_img from img2;
3.2将part_img输入到预训练好的OCR模型中,生成文本信息;3.2 Input part_img into the pre-trained OCR model to generate text information;
3.3合并上述文本信息与文本行,得到最终的试卷文本paper_text,如图10所示;3.3 Combine the above text information and text lines to get the final test paper text paper_text, as shown in Figure 10;
3.4构建OCR模型;3.4 Build an OCR model;
3.4.1模型数据;3.4.1 Model data;
3.4.1.1使用已有文本语料库,生成400万个以10个字符为一组的文本数据text_data;3.4.1.1 Use the existing text corpus to generate 4 million text data text_data with a group of 10 characters;
3.4.1.2对上述文本数据text_data,排除重复字符,得到词典dict1;3.4.1.2 For the above text data text_data, exclude duplicate characters and get the dictionary dict1;
3.4.1.3取词典dict1和GB 2312(国标)字符集的交集作为OCR的识别词典ocr_dict,同时加上阿拉伯数字、英文字母、标点符号、特殊字符,保证字符总数在4000左右,有效减小模型大小;3.4.1.3 Take the intersection of dictionary dict1 and GB 2312 (national standard) character set as OCR recognition dictionary ocr_dict, and add Arabic numerals, English letters, punctuation marks, and special characters to ensure that the total number of characters is around 4000, effectively reducing the size of the model ;
3.4.1.4对识别词典ocr_dict按照升序,从1开始编码每个字符;3.4.1.4 According to the ascending order of the recognition dictionary ocr_dict, each character is coded from 1;
3.4.1.5把文本数据text_data转为识别词典ocr_dict对应的编码表示ocr_index_data;3.4.1.5 Convert the text data text_data to the code representation ocr_index_data corresponding to the recognition dictionary ocr_dict;
3.4.1.6对文本数据text_data使用PIL的drawtext函数,在280*32的图像上画出文本内容,得到图像集ocr_img_data;3.4.1.6 Use PIL's drawtext function for the text data text_data to draw the text content on the 280*32 image to get the image set ocr_img_data;
3.4.1.7随机取图像集ocr_img_data的1/3图片,加上高斯噪声、或图片模糊处理或图片倾斜处理;3.4.1.7 Randomly take 1/3 of the image set ocr_img_data, add Gaussian noise, or image blur or image tilt;
3.4.1.8最终得到训练数据集ocr_img_data、ocr_index_data;3.4.1.8 Finally get the training data set ocr_img_data, ocr_index_data;
3.4.2模型网络;3.4.2 Model network;
使用DenseNet+CTC(密集卷积网络+时序类数据分类)搭建网络,其中DenseNet为5层DenseBlock(网络块),growth rate k(增长率)=4,如图11所示;Use DenseNet+CTC (dense convolutional network + time series data classification) to build a network, where DenseNet is a 5-layer DenseBlock (network block), growth rate k (growth rate) = 4, as shown in Figure 11;
3.4.3模型训练;3.4.3 Model training;
对3.4.1的步骤生成的数据按照9:1的比例划分为训练集、验证集;模型训练最大轮数epochs=50,超过3轮loss(损失)不下降则停止训练;最终模型的训练准确率达到0.993,验证集的准确率达到0.986。The data generated in step 3.4.1 is divided into training set and validation set according to the ratio of 9:1; the maximum number of rounds of model training epochs=50, if the loss (loss) exceeds 3 rounds, the training will stop; the final model training is accurate The rate reached 0.993, and the accuracy rate of the verification set reached 0.986.
4.从试卷文本中提取试卷的文本信息;4. Extract the text information of the test paper from the text of the test paper;
试卷文本信息定义:试卷名称、科目、单元、考试类型、考号区域、姓名区域、大题信息(序号、题型、分数信息、区域等);Definition of test paper text information: test paper name, subject, unit, test type, test number area, name area, big question information (serial number, question type, score information, area, etc.);
4.1试卷名称提取,如图10所示;4.1 The name of the test paper is extracted, as shown in Figure 10;
遍历试卷文本的前5行,若试卷文本的前5行中存在试卷名称关键词中的一个,则该行作为试卷名称,试卷名称关键词包括:考试、试卷、测试、试题、模拟等;Traverse the first 5 lines of the test paper text. If there is one of the test paper name keywords in the first 5 lines of the test paper text, that line will be used as the test paper name. The test paper name keywords include: exam, test paper, test, test, simulation, etc.;
4.2科目提取;4.2 Subject extraction;
遍历试卷文本的前5行,若试卷文本的前5行中存在科目关键词中的一 个,则该关键词作为科目,科目关键词包括:数学、语文、英语、物理、化学、生物、地理、政治、历史;Traverse the first 5 lines of the test paper text. If there is one of the subject keywords in the first 5 lines of the test paper text, the keyword will be used as the subject. Subject keywords include: mathematics, Chinese, English, physics, chemistry, biology, geography, Politics, history;
4.3单元提取;4.3 Unit extraction;
遍历试卷文本的前5行,若试卷文本的前5行中存在表达式(第*单元),则该行作为单元;Traverse the first 5 lines of the text of the test paper, if there is an expression (unit *) in the first 5 lines of the text of the test paper, then this line is used as a unit;
4.4考试类型提取;4.4 Examination type extraction;
遍历试卷文本的前5行,若试卷文本的前5行中存在考试类型关键词中的一个,则该关键词作为考试类型,考试类型关键词包括:期中、期末、模拟、竞赛等。Traverse the first 5 lines of the test paper text. If there is one of the test type keywords in the first 5 lines of the test paper text, this keyword is used as the test type. The test type keywords include: mid-term, final, simulation, competition, etc.
4.5考号区域提取;4.5 Extraction of test number area;
4.5.1若试卷存在装订线,遍历装订线区域内的文本,若试卷存在如下考号关键词中的一个,则该关键词所在的文本行区域就是考号区域的开始位置,再向上扩展区域,即为考号区域,考号关键词包括:考号、学号、准考证号等;4.5.1 If there is a gutter in the test paper, traverse the text in the gutter area. If there is one of the following test number keywords in the test paper, the text line area where the keyword is located is the beginning of the test number area, and then expand the area upwards , Is the exam number area, the keywords of exam number include: exam number, student number, admission ticket number, etc.;
4.5.2若试卷不存在装订线,则遍历试卷文本的前5行,若试卷存在4.5.1中的考号关键词中的一个,则该关键词所在的文本行区域就是考号区域的开始位置,再向右扩展区域,即为考号区域。4.5.2 If there is no gutter in the test paper, traverse the first 5 lines of the test paper text. If the test paper has one of the test number keywords in 4.5.1, the text line area where the key word is located is the beginning of the test number area Position, and then expand the area to the right, which is the examination number area.
4.6姓名区域提取;4.6 Name area extraction;
4.6.1若试卷存在装订线,遍历装订线区域内的文本,若存在关键词(姓名),则该关键词所在的文本行区域就是姓名区域的开始位置,再向上扩展区域,即为姓名区域;4.6.1 If there is a gutter in the test paper, traverse the text in the gutter area. If there is a keyword (name), the text line area where the keyword is located is the beginning of the name area, and then expand the area upwards, that is, the name area ;
4.6.2若试卷不存在装订线,则遍历试卷文本的前5行,若试卷存在4.6.1中的关键词(姓名),则该关键词所在的文本行区域就是姓名区域的开始位置,再向右扩展区域,即为姓名区域;4.6.2 If there is no gutter in the test paper, traverse the first 5 lines of the text of the test paper. If the key word (name) in 4.6.1 exists in the test paper, the text line area where the key word is located is the beginning of the name area. Expand the area to the right, which is the name area;
4.7大题信息提取;4.7 Extraction of major information;
预设大题类型:选择题、计算题、应用题、填空题、解答题、单选题、多选题、问答题、非选择题、实验题、选做题、选考题等;Preset types of big questions: multiple-choice questions, calculation questions, application questions, fill-in-the-blank questions, answer questions, single-choice questions, multiple-choice questions, essay questions, non-choice questions, experimental questions, optional questions, selective examination questions, etc.;
4.7.1识别大题文本行位置;4.7.1 Identify the position of the text line of the big title;
遍历文本行,若当前文本开头能匹配大题关键字符,例如“中文数字” +“大题类型”或者“(”+“大题类型”+“)”或者“大题类型”等,则该文本行为大题所在文本行;Traverse the text line, if the current text can match the key characters of the big title, such as "Chinese number" + "big title type" or "(" + "big title type" + ")" or "big title type", etc., then The text line of the text where the big question is located;
4.7.2取4.7.1的步骤所在的文本行作为该大题的区域起始位置;4.7.2 Take the text line where the step of 4.7.1 is located as the starting position of the area of the big question;
4.7.3取匹配到的“中文数字”作为大题的序号;4.7.3 Take the matched "Chinese number" as the serial number of the big question;
4.7.4取匹配到的“大题类型”作为大题的题型;4.7.4 Take the matched "big question type" as the question type of the big question;
4.7.5取大题文本及下一行文本,匹配如下分数规则,作为大题的分数信息;4.7.5 Take the text of the big question and the next line of text, and match the following score rules as the score information of the big question;
分数规则一:Scoring rule 1:
1)本大题共(\d{1,3})小题.*每小题(\d{1,3})分.*(共|满分)(\d{1,3})分;1) This big question has a total of (\d{1,3}) small questions. *Each small question (\d{1,3}) points.*(Total|Full score)(\d{1,3}) points;
2)本大题共(\d{1,3})小题.*每小题(\d{1,3}\.\d)分.*(共|满分)(\d{1,3}\.\d)分;2) This big question has a total of (\d{1,3}) small questions. *Each small question (\d{1,3}\.\d) points.*(Total|Full score)(\d{1,3 }\.\d) points;
匹配到的数值依次作为大题的小题数量、每小题的分数、大题总分;The matched value is used as the number of small questions, the score of each small question, and the total score of the big question in turn;
分数规则二:Scoring Rule 2:
本大题共(\d{1,3})小题.*(共|满分)(\d{1,3})分;匹配到的数值依次作为大题的小题数量、大题总分。This big question has a total of (\d{1,3}) small questions.*(Total|Full score)(\d{1,3}) points; the matched value is used as the number of small questions and the total score of the big question in turn .
5.从文本中提取小题信息;5. Extract subtopic information from the text;
小题信息定义:序号、题型、分数信息、区域等;Definition of sub-question information: serial number, question type, score information, area, etc.;
5.1按照大题的位置,得到每个大题的文本big_question_texts;5.1 According to the position of the big question, get the text big_question_texts of each big question;
5.2遍历文本big_question_texts,取出满足下列规则的文本行,作为候选小题文本的起始位置区域;5.2 Traverse the text big_question_texts, take out the text line that meets the following rules, as the starting position area of the candidate text;
小题关键字符:“阿拉伯数字”+“、|.”;Key characters of the subtitle: "Arabic numerals" + ", |.";
5.3通过下列特征过滤候选小题;5.3 Filter the candidate questions through the following features;
1)大题起始位置的横坐标big_coordinate_x,若小题坐标的横坐标大于或者小于big_coordinate_x的则删除该小题;1) The abscissa of the starting position of the big question is big_coordinate_x, if the abscissa of the coordinate of the small question is larger or smaller than big_coordinate_x, delete the small question;
2)小题坐标的横坐标若不满足序号递增,则删除该小题;2) If the abscissa of the sub-question coordinates does not satisfy the increasing sequence number, the sub-question will be deleted;
5.3.1剩下的序号即为大题下面的小题序号,对应的文本行区域作为小题区域的起始位置;5.3.1 The remaining serial number is the serial number of the sub-topic below the main question, and the corresponding text line area is used as the starting position of the sub-topic area;
5.3.2每个小题的结束位置为下一小题的起始位置,若到版面的末尾,则 取版面的末尾为小题区域的结束位置;5.3.2 The ending position of each question is the starting position of the next question. If it reaches the end of the layout, the end of the layout is the ending position of the question area;
5.4提取小题分数信息;5.4 Extract sub-question score information;
若小题的文本能匹配到下列规则,则取对应的结果作为分数信息;If the text of the sub-question can match the following rules, take the corresponding result as the score information;
规则一:((\d{1,3})分);Rule 1: ((\d{1,3}) points);
规则二:((\d{1,3}\.\d)分);Rule 2: ((\d{1,3}\.\d) points);
规则三:本小题((共|满分)?)(\d{1,3})分;Rule 3: This sub-question ((total|full score)?)(\d{1,3}) points;
规则四:本小题((共|满分)?)(\d{1,3}\.\d)分。Rule 4: This sub-question ((Total|Full score)?)(\d{1,3}\.\d) points.
6.输出所有试卷信息。6. Output all test paper information.
以图6为例,版面分析结果为:装订线区域表示为:[5,5,214,2330];版面区域表示为:[235,5,1505,2330],[1746,5,1559,2330];Taking Figure 6 as an example, the layout analysis results are: the gutter area is represented as: [5,5,214,2330]; the layout area is represented as: [235,5,1505,2330], [1746,5,1559,2330 ];
其中,版面区域1(图7)的文本行检测结果如图9所示,对试卷图像的文本行检测结果进行OCR识别,得到文本信息,合并文本信息与文本行,得到如下面的结果:Among them, the text line detection result of layout area 1 (Figure 7) is shown in Figure 9. The text line detection result of the test paper image is subjected to OCR recognition to obtain the text information, and the text information and the text line are merged to obtain the following result:
[['______学校2013-2014学年第一学期期中自查试卷',[104,120,363,52]]][['______ School 2013-2014 school year midterm self-examination papers for the first semester', [104,120,363,52]]]
[['七年级_______',[104,120,363,52]]][['Seventh grade_______', [104, 120, 363, 52]]]
[['(考试时间分钟,满分',[104,410,363,32]]][['(Exam time minutes, full marks', [104,410,363,32]]]
[[″,[211,467,1172,230]]][[″, [211,467,1172,230]]]
[['注意事项:用蓝、黑色钢',[85,713,846,33]]][['Caution: Use blue and black steel', [85,713,846,33]]]
[['一、选择题(本大题共9小题,共45.0分)',[77,759,561,30]]][['One. Multiple choice questions (9 sub-questions in this big question, 45.0 points in total)', [77,759,561,30]]]
[['1.设集合A={xr\\^2-4x-3<0},B={x[X-3>0},则A∩B=)',[87,802,906,34]]][['1. Set A={xr\\^2-4x-3<0}, B={x[X-3>0}, then A∩B=)', [87,802,906, 34]]]
[['(-,-',[251,857,90,51]]][['(-,-', [251,857,90,51]]]
[['2.函数V=2x\\^2-e\\^-在[-2,',[84,932,634,34]]][['2. Function V=2x\\^2-e\\^-in [-2,',[84,932,634,34]]]
[['A.',[142,985,31,276]]][['A.', [142,985,31,276]]]
[[″,[193,1291,307,272]],['D',[784,1291,24,272]]][[″, [193,1291,307,272]],['D',[784,1291,24,272]]]
[['3.已知等差数列{a-前9项',[84,1584,817,32]]][['3. Known arithmetic sequence {a-first 9 items', [84,1584,817,32]]]
[['A.100',[142,1629,85,26]]][['A.100', [142, 1629, 85, 26]]]
[['4.将函数V=2sin(X=)的',[83,1683,1041,46]],[″,[628,1683, 38,46]]][['4. Put the function V=2sin(X=)', [83, 1683, 1041, 46]], [", [628, 1683, 38, 46]]]
[['A.',[141,1770,32,46]]][['A.', [141, 1770, 32, 46]]]
[['5.⊿ABC的内角A、B、Ci',[84,1856,1196,40]]][['5.⊿ABC's inner angle A, B, Ci', [84, 1856, 1196, 40]]]
[[″,[1096,1890,12,17]]][[″, [1096,1890,12,17]]]
[['v',[250,1930,34,32]]][['v', [250, 1930, 34, 32]]]
[[″,[1173,1976,295,346]]][[″, [1173, 1976, 295, 346]]]
[['6.函数y=Asim(ox-p)的部',[84,1974,689,32]]][['6. Function y = part of Asim(ox-p)', [84,1974,689,32]]]
[['A.',[141,2032,31,44]]][['A.', [141, 2032, 31, 44]]]
[['ν=2s如/x-',[244,2119,198,44]]][['ν=2s such as /x-', [244,2119,198,44]]]
[['第1页/共4页',[703,2205,141,25]]][['Page 1/A total of 4 pages', [703, 2205, 141, 25]]]
[['C.',[79,209,28,44]]][['C.', [79,209,28,44]]]
[['V=2snr-',[182,295,145,44]]][['V=2snr-', [182,295,145,44]]]
[['7.已知偶函数)在区间[①,',[20,382,1114,51]],[″,[841,382,28,51]]][['7. Known even function) is in the interval [①,',[20,382,1114,51]],[″,[841,382,28,51]]]
[['I2',[200,469,38,17]]][['I2', [200,469,38,17]]]
[[″,[505,469,69,51]],['C.',[720,468,29,52]]][[″, [505,469,69,51]],['C.',[720,468,29,52]]]
[[″,[186,484,68,36]]][[″, [186,484,68,36]]]
[['8.设直线l经过椭圆的一个',[21,556,1388,47]]][['8. Let the line l pass through one of the ellipses', [21,556,1388,47]]]
[[″,[504,643,10,17]],[″,[823,643,14,16]],[″,[1146,642,13,17]]][[″,[504,643,10,17]],[″,[823,643,14,16]],[″,[1146,642,13,17]]]
[['A.',[78,654,31,26]]][['A.', [78,654,31,26]]]
[[″,[504,677,12,16]],[″,[824,677,12,16]],[″,[1146,676,13,13]]][[″,[504,677,12,16]],[″,[824,677,12,16]],[″,[1146,676,13,13]]]
[[″,[1041,719,162,219]],[″,[1306,718,97,220]]][[″,[1041,719,162,219]],[″,[1306,718,97,220]]]
[['9.如图是由圆柱与圆锥组合',[20,717,961,29]]][['9. The figure is a combination of cylinder and cone', [20,717,961,29]]]
[['为()',[79,754,93,36]]][['为()', [79,754,93,36]]]
[['20π',[185,849,46,24]]][['20π', [185,849,46,24]]]
[['24π',[78,893,29,26]]][['24π', [78,893,29,26]]]
[['28π',[183,937,46,23]]][['28π', [183,937,46,23]]]
[['32π',[185,958,46,99]],[″,[1114,958,98,99]]][['32π', [185,958,46,99]],[″,[1114,958,98,99]]]
[['二、填空题(本大题共4小题,共20.0分)',[13,1116,560,30]],[″,[24,1116,21,30]]][['2. Fill in the blanks (4 sub-topics in this big question, 20.0 points in total)', [13,1116,560,30]],[″,[24,1116,21,30]]]
[['10.⊿ABC的内角A,B,C的对边分别为a,b,c,若cosA=-,cosC=',[23,1171,917,49]],[″,[816,1172,21,48]],[″,[950,1171,32,49]],['a=l,则b=',[1002,1171,261,49]]][['10.⊿ABC's inner angles A, B, and C are opposite sides of a, b, c, respectively, if cosA=-, cosC=', [23,1171,917,49]],[″,[816 ,1172,21,48]],[″,[950,1171,32,49]],['a=1, then b=',[1002,1171,261,49]]]
[['11.已知双曲线C:,[23,1259,277,52]],['5--=(a>0,b>)的右项',[280,1259,1119,52]],[″,[329,1259,25,52]]][['11. Known hyperbola C:, [23,1259,277,52]], ['5--=(a>0,b>) the right term', [280,1259,1119,52 ]], [″, [329, 1259, 25, 52]]]
[['一条渐近线交于M,N两点.',[80,1332,883,33]]][['An asymptote intersects at two points M and N.', [80,1332,883,33]]]
[['12.若直线y=x-b是曲线p',[23,1376,1153,32]]][['12. If the straight line y=x-b is the curve p', [23,1376,1153,32]]]
[['13.曲线V=x\\^2-在点/],2处',[23,1432,637,51]],[″,[243,1432,38,51]]][['13. Curve V=x\\^2-at point/], 2', [23,1432,637,51]],[″,[243,1432,38,51]]]
[['三、解答题(本大题共10小题,共120.0分)',[13,1506,588,30]],[″,[23,1506,23,30]],[″,[25,1506,19,30]]][['Three, answer questions (this big question has 10 sub-questions, a total of 120.0 points)', [13,1506,588,30]],[″,[23,1506,23,30]],[″, [25, 1506, 19, 30]]]
[['14.⊿ABC的内角A,B,C的对边分别为a,b,c,已知2cosC(acosB-bcosA)=c.',[23,1550,1057,32]]][['14.⊿The opposite sides of the inner angles A, B, and C of ABC are a, b, and c, respectively. It is known that 2cosC(acosB-bcosA)=c.', [23,1550,1057,32]]]
[['(I求C;',[80,1594,111,30]]][['(I seek C;', [80,1594,111,30]]]
[['lⅡ若c=,⊿ABC的面积',[80,1650,674,40]]][['lⅡIf c=, ⊿Area of ABC', [80,1650,674,40]]]
[[″,[487,1686,12,16]]][[″, [487,1686,12,16]]]
[['15.⊿ABC的内角A,B,C',[23,2041,978,50]]][['15.⊿ABC's inner corners A, B, C', [23,2041,978,50]]]
[['ll)求cosB;',[79,2115,128,32]]][['ll) seek cosB;', [79, 2115, 128, 32]]]
[['2)若a-c=6,⊿ABC的面积',[79,2158,511,32]]][['2) If a-c=6, ⊿Area of ABC', [79, 2158, 511, 32]]]
[['第2页/共4页',[648,2205,142,25]]][['Page 2/A total of 4 pages', [648, 2205, 142, 25]]]
如图10所示,对目标试卷图像的文本进行坐标信息处理后得到如下面的结果:As shown in Figure 10, after processing the coordinate information of the text of the target test paper image, the following results are obtained:
试卷名称:'______学校2013-2014学年第一学期期中自查试卷'Test paper name:'______ School 2013-2014 midterm self-examination paper for the first semester of the school year'
科目:″subject:"
单元:″unit:"
考试类型:'期中'Exam type:'midterm'
考号区域:[60,150,60,200]Exam number area: [60, 150, 60, 200]
姓名区域:[60,1640,60,200]Name area: [60, 1640, 60, 200]
['______学校2013-2014学年第一学期期中自查试卷']['______ School 2013-2014 midterm self-examination paper for the first semester of the school year']
['一、选择题','big',['一、选择题(本大题共9小题,共45.0分)'],[312,756,1428,43],{'total_score':'45.0','number':'9','each_question_score':'5.0'}]['One, multiple-choice questions','big', ['One, multiple-choice questions (9 sub-questions in this big question, 45.0 points in total)'], [312,756,1428,43],{'total_score':' 45.0','number':'9','each_question_score':'5.0'}]
['1.','small',['1.设集合A={xr\\^2-4x-3<0},B={x[X-3>0},则A∩B=)','(-,-'],[322,807,1418,114],{'score':'5.0'}]['1.','small', ['1. Set set A={xr\\^2-4x-3<0}, B={x[X-3>0}, then A∩B=) ','(-,-'], [322,807,1418,114], {'score':'5.0'}]
['2.','small',['2.函数V=2x\\^2-e\\^-在[-2,','A.','D'],[319,929,1421,645],{'score':'5.0'}]['2.','small',['2. Function V=2x\\^2-e\\^-in [-2,','A.','D'], [319,929, 1421, 645], {'score':'5.0'}]
['3.','small',['3.已知等差数列{a-前9项','A.100'],[319,1583,1421,87],{'score':'5.0'}]['3.','small', ['3. Known arithmetic sequence {a-first 9 items','A.100'], [319,1583,1421,87], {'score':' 5.0'}]
['4.','small',['4.将函数V=2sm(X=÷)的','A.'],[318,1678,1422,159],{'score':'5.0'}]['4.','small', ['4. Change the function V=2sm (X=÷)','A.'], [318, 1678, 1422, 159], {'score': '5.0 '}]
['5.','small',['5.⊿ABC的内角A、B、Ci',″,'v'],[319,1845,1421,124],{'score':'5.0'}]['5.','small',['5.⊿ABC's inner corners A, B, Ci', ",'v'], [319,1845,1421,124],{'score':'5.0' }]
['6.','small',['6.函数y=Asin/ox-p)的部',″,'A.','ν=2s如/x-','第1页/共4页'],[[319,1977,1421,350],[1766,214,1523,157]],{'score':'5.0'}]['6.','small', ['6. Function y=Asin/ox-p) part', ",'A.','ν=2s such as /x-',' page 1 / total 4 pages'], [[319,1977,1421,350],[1766,214,1523,157]],{'score':'5.0'}]
['7','small',['7.已知偶函数)在区间[①,','C.','I2',″],[1766,379,1539,162],{'score':'5.0'}]['7','small', ['7. Even function is known) in the interval [①,','C.','I2', "], [1766,379,1539,162], {'score ':'5.0'}]
['8','small',['8.设直线l经过椭圆的一个',″,'A.',″],[1767,550,1538,156],{'score':'5.0'}]['8','small',['8. Let the line l pass through one of the ellipses',",'A.',"],[1767,550,1538,156],{'score':'5.0' }]
['9.','small',['9.如图是由圆柱与圆锥组合',″,'为:)','20尔','B.','28T','327'],[1766,714,1539,399],{'score':'5.0'}]['9.','small',['9. As shown in the figure is a combination of cylinder and cone', ",' is:)', '20 尔','B.', '28T', '327'] , [1766,714,1539,399], {'score':'5.0'}]
['二、填空题','big',['二、填空题(本大题共4小题,共20.0分)'],[1759,1121,1546,38],{'total_score':'20.0','number':'4','each_question_score':'5.0'}]['Two, fill in the blanks','big', ['two, fill in the blanks (4 sub-questions in this big question, 20.0 points in total)'], [1759, 1121, 1546, 38], {'total_score':' 20.0','number':'4','each_question_score':'5.0'}]
['10.','small',['10.⊿ABC的内角A,B,C的对边分别为a,b,c,若cosA=-,cosC=a=l,则b='],[1769,1168,1536,72],{'score':'5.0'}]['10.','small',['10.⊿ABC's inner angles A, B, and C are opposite sides of a, b, c, respectively, if cosA=-, cosC=a=l, then b='] , [1769, 1168, 1536, 72], {'score':'5.0'}]
['11.','small',['11.已知双曲线C:5--=(a>0,b>0)的右项','一条渐近线 交于M,N两点.'],[1769,1249,1536,122],{'score':'5.0'}]['11.','small', ['11. Knowing hyperbola C: 5--= (a>0, b>0) right term',' an asymptote intersects at two points M and N .'], [1769, 1249, 1536, 122], {'score':'5.0'}]
['12.','small',['12.若直线y=x-b是曲线p'],[1769,1380,1536,41],{'scor e':'5.0'}]['12.','small', ['12. If the straight line y=x-b is the curve p'], [1769, 1380, 1536, 41], {'scor e':'5.0'}]
['13.','small',['13.曲线V=x\\^2-在点/],2处'],[1769,1429,1536,66],{'scor e':'5.0'}]['13.','small',['13.Curve V=x\\^2-at point/],2'],[1769,1429,1536,66],{'scor e':' 5.0'}]
['三、解答题','big',['三、解答题(本大题共10小题,共120.0分)'],[1759,1504,1546,40],{'total_score':'120.0','number':'10','each_question_score':'12.0'}]['Three, answer questions','big', ['three, answer questions (this big question has 10 sub-questions, a total of 120.0 points)'], [1759, 1504, 1546, 40], {'total_score':' 120.0','number':'10','each_question_score':'12.0'}]
['14.','small',['14.⊿ABC的内角A,B,C的对边分别为a,b,c,已知2cosCracosB-bcos4)=c.','(I求C;','lⅡ若c=,dABC的面积',″],[1769,1552,1536,486],{'score':'12.0'}]['14.','small',['14.⊿The opposite sides of the inner angles A, B, and C of ABC are a, b, and c respectively. It is known that 2cosCracosB-bcos4)=c.','(I find C ;','LⅡIf c=, the area of dABC',"],[1769,1552,1536,486],{'score':'12.0'}]
['15.','small',['15.⊿ABC的内角A,B,C','ll)求cosB;','2)若a-c=6,⊿ABC的面积','第2页/共4页'],[1769,2046,1536,189],{'score':'12.0'}]['15.','small',['15.⊿ABC's inner angles A, B, C','ll) find cosB;','2) If ac=6, ⊿area of ABC','2 Page/A total of 4 pages'], [1769, 2046, 1536, 189], {'score':'12.0'}]
大题类型信息提取结果如下:The results of extracting information about the type of questions are as follows:
['一','选择题',[312,756,1428,43],{'total_score':'45','number':'9','each_qu estion_score':'5'}]['一','multiple choice question', [312,756,1428,43], {'total_score':'45','number':'9','each_qu estion_score':'5'}]
['1','选择题',[322,807,1418,114],{'score':'5'}]['1','Multiple choice question', [322,807,1418,114], {'score':'5'}]
['2','选择题',[319,929,1421,645],{'score':'5'}]['2','Multiple choice question', [319,929,1421,645], {'score':'5'}]
['3','选择题',[319,1583,1421,87],{'score':'5'}]['3','Multiple choice question', [319,1583,1421,87], {'score':'5'}]
['4','选择题',[318,1678,1422,159],{'score':'5'}]['4','multiple choice question', [318, 1678, 1422, 159], {'score':'5'}]
['5','选择题',[319,1845,1421,124],{'score':'5'}]['5','Multiple choice', [319,1845,1421,124], {'score':'5'}]
['6','选择题',[[319,1977,1421,350],[1766,214,1523,157]],{'score':'5'}]['6','Multiple choice question', [[319,1977,1421,350], [1766,214,1523,157]], {'score':'5'}]
['7','选择题',[1766,379,1539,162],{'score':'5'}]['7','Multiple choice question', [1766,379,1539,162], {'score':'5'}]
['8','选择题',[1767,550,1538,156],{'score':'5'}]['8','Multiple choice question', [1767,550,1538,156], {'score':'5'}]
['9','选择题',[1766,714,1539,399],{'score':'5'}]['9','Multiple choice', [1766,714,1539,399], {'score':'5'}]
['二','填空题',[1759,1121,1546,38],{'total_score':'20','number':'4','each_qu estion_score':'5'}]['Two','fill in the blanks', [1759,1121,1546,38], {'total_score':'20','number':'4','each_qu estion_score':'5'}]
['10','填空题',[1769,1168,1536,72],{'score':'5'}]['10','fill in the blanks', [1769, 1168, 1536, 72], {'score':'5'}]
['11','填空题',[1769,1249,1536,122],{'score':'5'}]['11','fill in the blanks', [1769, 1249, 1536, 122], {'score':'5'}]
['12','填空题',[1769,1380,1536,41],{'score':'5'}]['12','fill in the blanks', [1769, 1380, 1536, 41], {'score':'5'}]
['13','填空题',[1769,1429,1536,66],{'score':'5'}]['13','fill in the blanks', [1769, 1429, 1536, 66], {'score':'5'}]
['三','解答题',[1759,1504,1546,40],{'total_score':'120','number':'10','each_question_score':'12'}]['Three','Question for solution', [1759, 1504, 1546, 40], {'total_score': '120','number': '10','each_question_score': '12'}]
['14','解答题',[1769,1552,1536,486],{'score':'12'}]['14','Problem to solve', [1769, 1552, 1536, 486], {'score':'12'}]
['15','解答题',[1769,2046,1536,189],{'score':'12'}]['15','Problem to solve', [1769, 2046, 1536, 189], {'score':'12'}]
在该实施例中,通过线条检测、空白区域检测的方法来实现试卷版面分析,可自动识别试卷的排版信息,利用专门用于试卷分析的基于深度学习的卷积神经网络的OCR方法对试卷图像进行准确文字识别,使用自然语言处理技术提取试卷的多类文本信息,不仅实现了高效、精准的自动化阅卷,还能够提升系统的适用范围,从而有效降低教育工作者的工作量,满足用户的多种需求。In this embodiment, line detection and blank area detection are used to realize test paper layout analysis, which can automatically identify the typesetting information of test papers, and use OCR method based on deep learning convolutional neural network for test paper analysis to analyze test paper images. Perform accurate text recognition and use natural language processing technology to extract multiple types of text information from test papers. This not only realizes efficient and accurate automatic scoring, but also improves the scope of application of the system, thereby effectively reducing the workload of educators and satisfying the needs of users. Kind of demand.
根据本申请第二方面的实施例,提出了一种试卷信息提取系统500,如图12所示,包括存储器502、处理器504及存储在存储器502上并可在处理器504上运行的计算机程序,处理器504执行计算机程序时实现上述任一实施例的试卷信息提取方法。According to the embodiment of the second aspect of the present application, a test paper information extraction system 500 is proposed, as shown in FIG. 12, including a memory 502, a processor 504, and a computer program stored in the memory 502 and running on the processor 504 When the processor 504 executes the computer program, the method for extracting test paper information in any of the foregoing embodiments is implemented.
根据本申请第三方面的实施例,提出了一种计算机可读存储介质,其上存储有计算机程序,计算机程序被处理器执行时实现如上述任一实施例的试卷信息提取方法的步骤。According to an embodiment of the third aspect of the present application, a computer-readable storage medium is provided, on which a computer program is stored, and when the computer program is executed by a processor, the steps of the test paper information extraction method as in any of the above embodiments are implemented.
在本说明书的描述中,术语“第一”、“第二”仅用于描述的目的,而不能理解为指示或暗示相对重要性,除非另有明确的规定和限定;术语“连接”、“安装”、“固定”等均应做广义理解,例如,“连接”可以是固定连接,也可以是可拆卸连接,或一体地连接;可以是直接相连,也可以通过中间媒介间接相连。对于本领域的普通技术人员而言,可以根据不同情况理解上述术语在本申请中的含义。In the description of this specification, the terms "first" and "second" are only used for descriptive purposes, and cannot be understood as indicating or implying relative importance, unless expressly stipulated and limited otherwise; the terms "connected" and " "Installation" and "fixation" should be understood in a broad sense. For example, "connection" can be a fixed connection, a detachable connection, or an integral connection; it can be directly connected or indirectly connected through an intermediate medium. For those of ordinary skill in the art, the meaning of the above-mentioned terms in this application can be understood according to different situations.
在本说明书的描述中,术语“一个实施例”、“一些实施例”、“具体实施例”等的描述意指结合该实施例或示例描述的特征、结构、材料或特点包含于本申请的至少一个实施例或示例中。在本说明书中,对上述术语的示意性表述 不一定指的是相同的实施例或实例。而且,描述的特征、结构、材料或特点可以在任何的一个或多个实施例或示例中以合适的方式结合。In the description of this specification, the description of the terms "one embodiment", "some embodiments", "specific embodiments", etc. means that the features, structures, materials or characteristics described in conjunction with the embodiment or examples are included in the application In at least one embodiment or example. In this specification, the schematic representations of the aforementioned terms do not necessarily refer to the same embodiment or example. Moreover, the described features, structures, materials or characteristics can be combined in any one or more embodiments or examples in a suitable manner.

Claims (13)

  1. 一种试卷信息提取方法,包括:A method for extracting test paper information, including:
    对试卷图像进行预处理,得到二进制图像;Preprocess the test paper image to obtain a binary image;
    确定所述二进制图像的版面区域;Determining the layout area of the binary image;
    根据所述版面区域,获取所述试卷图像的文本行;Obtaining the text line of the test paper image according to the layout area;
    根据所述文本行,提取文本图像;Extract a text image according to the text line;
    将所述文本图像输入文字识别模型,得到所述试卷图像的文本信息;Input the text image into a text recognition model to obtain the text information of the test paper image;
    对应合并所述文本信息与所述文本行,得到目标试卷图像;Correspondingly merge the text information and the text line to obtain a target test paper image;
    根据分类标签提取所述目标试卷图像的试卷信息。Extract the test paper information of the target test paper image according to the classification label.
  2. 根据权利要求1所述的试卷信息提取方法,其中,所述确定所述二进制图像的版面区域,包括:The method for extracting test paper information according to claim 1, wherein the determining the layout area of the binary image comprises:
    根据第一预设尺寸,确定所述二进制图像的子图像;Determining the sub-image of the binary image according to the first preset size;
    检测所述子图像的线条;Detecting the lines of the sub-image;
    在所述子图像的线条的长度满足预设长度范围,且所述子图像的线条第一端与所述子图像的第一边缘之间的区域以及所述子图像的线条的第二端与所述子图像的第二边缘之间的区域为空白区域的情况下,将所述子图像的线条作为装订线;When the length of the line of the sub-image meets the preset length range, and the area between the first end of the line of the sub-image and the first edge of the sub-image, and the second end of the line of the sub-image and If the area between the second edges of the sub-image is a blank area, use the lines of the sub-image as a binding line;
    根据所述装订线,确定所述二进制图像的文本区域;Determining the text area of the binary image according to the binding line;
    根据第二预设尺寸,确定所述文本区域的中心区域;Determine the central area of the text area according to the second preset size;
    在所述中心区域中检测分隔符号;Detecting a separation symbol in the central area;
    响应于在所述中心区域中检测到分隔符号的检测结果,根据所述分隔符号确定所述版面区域。In response to the detection result of detecting a separator in the central area, the layout area is determined according to the separator.
  3. 根据权利要求2所述的试卷信息提取方法,其中,所述确定所述二进制图像的版面区域,还包括:The method for extracting test paper information according to claim 2, wherein said determining the layout area of the binary image further comprises:
    响应于所述中心区域中未检测到所述分隔符号的检测结果,根据第三预设尺寸确定所述文本区域的分割区域;In response to the detection result that the separation symbol is not detected in the central area, determine the segmentation area of the text area according to a third preset size;
    在所述分割区域中检测所述分隔符号;Detecting the separation symbol in the segmentation area;
    响应于在所述分割区域中检测到所述分隔符号的检测结果,根据所述分隔符号确定所述版面区域;In response to a detection result of detecting the separation symbol in the divided region, determining the layout area according to the separation symbol;
    响应于在所述分割区域中未检测到所述分隔符号的检测结果,将所述文 本区域作为所述版面区域。In response to the detection result that the separation symbol is not detected in the divided area, the text area is taken as the layout area.
  4. 根据权利要求3所述的试卷信息提取方法,其中,所述检测分隔符号,包括:4. The method for extracting test paper information according to claim 3, wherein the detection separator includes:
    对所述中心区域或所述分割区域进行投影处理,得到所述二进制图像的空白区域;Performing projection processing on the central area or the segmented area to obtain the blank area of the binary image;
    在所述空白区域的宽度大于宽度阈值的情况下,将所述空白区域作为所述分隔符号。In a case where the width of the blank area is greater than the width threshold, the blank area is used as the separation symbol.
  5. 根据权利要求3所述的试卷信息提取方法,其中,所述检测分隔符号,包括:对所述中心区域或所述分割区域进行模糊处理和去噪处理中的至少之一,得到所述二进制图像的线条;The method for extracting test paper information according to claim 3, wherein the detecting the separation symbol comprises: performing at least one of blurring and denoising processing on the central area or the segmented area to obtain the binary image Lines
    根据预设角度范围和预设长度阈值筛选所述二进制图像的线条,得到目标线条;Filtering the lines of the binary image according to a preset angle range and a preset length threshold to obtain a target line;
    在所述目标线条的长度大于第一预设长度或所述目标线条的长度大于第二预设长度,且所述目标线条的第一端与所述二进制图像的第一边缘之间的标题区域的宽度、所述目标线条的第二端与所述二进制图像的第二边缘之间的空白区域的宽度与所述目标线条的长度之和大于第一预设长度的情况下,将所述目标线条作为所述分隔符号。The title area between the first end of the target line and the first edge of the binary image when the length of the target line is greater than a first preset length or the length of the target line is greater than a second preset length If the sum of the width of the target line and the blank area between the second end of the target line and the second edge of the binary image and the length of the target line is greater than the first preset length, the target The line serves as the separator.
  6. 根据权利要求1所述的试卷信息提取方法,其中,所述根据所述版面区域,获取所述试卷图像的文本行,包括:The method for extracting test paper information according to claim 1, wherein said obtaining the text line of the test paper image according to the layout area comprises:
    识别所述版面区域中的矩形框;Identifying the rectangular frame in the layout area;
    根据所述矩形框的宽度,确定文本行宽度;Determine the width of the text line according to the width of the rectangular frame;
    根据所述文本行宽度确定文本框;Determining a text box according to the width of the text line;
    在当前文本框的中心点与前一个文本框的中心点的垂直距离小于第一距离阈值,且所述当前文本框的中心点与所述前一个文本框的中心点的水平距离小于第二距离阈值的情况下,合并所述当前文本框和所述前一个文本框,得到一个文本行;The vertical distance between the center point of the current text box and the center point of the previous text box is less than the first distance threshold, and the horizontal distance between the center point of the current text box and the center point of the previous text box is less than the second distance In the case of a threshold, merge the current text box and the previous text box to obtain a text line;
    在当前文本框的中心点与前一个文本框的中心点的垂直距离大于或等于第一距离阈值,或所述当前文本框的中心点与所述前一个文本框的中心点的水平距离大于或等于第二距离阈值的情况下,将所述当前文本框和所述前一 个文本框均作为一个文本行。The vertical distance between the center point of the current text box and the center point of the previous text box is greater than or equal to the first distance threshold, or the horizontal distance between the center point of the current text box and the center point of the previous text box is greater than or When it is equal to the second distance threshold, both the current text box and the previous text box are regarded as one text line.
  7. 根据权利要求6所述的试卷信息提取方法,其中,所述根据所述矩形框的宽度,确定文本行宽度,包括:The method for extracting test paper information according to claim 6, wherein said determining the width of the text line according to the width of the rectangular frame comprises:
    根据预设宽度范围筛选所有所述矩形框的宽度,得到多个目标宽度;Filter the widths of all the rectangular frames according to the preset width range to obtain multiple target widths;
    统计所述多个目标宽度中的每个目标宽度对应的矩形框个数;Counting the number of rectangular frames corresponding to each target width in the plurality of target widths;
    选取最大的所述矩形框个数对应的目标宽度作为文本行宽度。The target width corresponding to the largest number of rectangular frames is selected as the text line width.
  8. 根据权利要求6所述的试卷信息提取方法,所述根据所述矩形框宽度,确定文本行宽度之前,还包括:The method for extracting test paper information according to claim 6, before determining the width of the text line according to the width of the rectangular frame, the method further comprises:
    在当前矩形框的中心点与前一个矩形框的中心点的垂直距离小于第三距离阈值,且所述当前矩形框的中心点与所述前一个矩形框的中心点的水平距离小于第四距离阈值的情况下,合并所述当前矩形框和所述前一个矩形框。The vertical distance between the center point of the current rectangular frame and the center point of the previous rectangular frame is less than the third distance threshold, and the horizontal distance between the center point of the current rectangular frame and the center point of the previous rectangular frame is less than the fourth distance In the case of a threshold, merge the current rectangular frame and the previous rectangular frame.
  9. 根据权利要求1所述的试卷信息提取方法,所述将所述文本图像输入文字识别模型,得到所述试卷图像的文本信息之前,还包括:The method for extracting test paper information according to claim 1, before inputting the text image into a character recognition model to obtain the text information of the test paper image, the method further comprises:
    获取文本数据和字符数据;Get text data and character data;
    编码所述文本数据和所述字符数据,得到识别词典;根据所述文本数据,确定文本图像集;Encoding the text data and the character data to obtain a recognition dictionary; determining a text image set according to the text data;
    根据所述识别词典和所述文本图像集,构建所述文字识别模型。According to the recognition dictionary and the text image set, the text recognition model is constructed.
  10. 根据权利要求1所述的试卷信息提取方法,其中,所述分类标签包括标题、大题和小题;所述根据分类标签提取所述目标试卷图像的试卷信息,包括:The method for extracting test paper information according to claim 1, wherein the classification label includes a title, a big question, and a small question; the extracting test paper information of the target test paper image according to the classification label includes:
    根据所述标题、大题和小题分别对应的分类关键字符,分别确定标题文本行、大题文本行和小题文本行;Determine the title text line, the big title text line and the small title text line respectively according to the classification key characters corresponding to the title, the big title and the small title respectively;
    根据所述标题文本行、所述大题文本行和所述小题文本行,提取所述试卷信息。Extract the test paper information according to the title text line, the big question text line and the small question text line.
  11. 根据权利要求10所述的试卷信息提取方法,所述根据所述标题文本行、所述大题文本行和所述小题文本行,提取所述试卷信息之前,还包括:The method for extracting test paper information according to claim 10, wherein before extracting the test paper information based on the title text line, the big question text line and the small question text line, the method further comprises:
    对所述目标试卷图像进行坐标信息处理;Perform coordinate information processing on the target test paper image;
    在所述小题文本行的横坐标超出预设坐标范围,或所述小题文本行的横坐标不满足序号递增规则的情况下,删除所述小题文本行。When the abscissa of the subtitle text line exceeds the preset coordinate range, or the abscissa of the subtitle text line does not satisfy the sequence number increasing rule, the subtitle text line is deleted.
  12. 一种试卷信息提取系统,包括存储器、处理器及存储在所述存储器上并可在所述处理器上运行的计算机程序,所述处理器执行所述计算机程序时实现如权利要求1至11中任一项所述的试卷信息提取方法。A test paper information extraction system, comprising a memory, a processor, and a computer program stored on the memory and capable of running on the processor. The processor executes the computer program as claimed in claims 1 to 11 Any one of the test paper information extraction methods.
  13. 一种计算机可读存储介质,存储有计算机程序,所述计算机程序被处理器执行时实现如权利要求1至11中任一项所述的试卷信息提取方法。A computer-readable storage medium that stores a computer program, and when the computer program is executed by a processor, realizes the method for extracting test paper information according to any one of claims 1 to 11.
PCT/CN2020/087211 2019-06-26 2020-04-27 Test paper information extraction method and system, and computer-readable storage medium WO2020259060A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910559124.6 2019-06-26
CN201910559124.6A CN110414529A (en) 2019-06-26 2019-06-26 Paper information extracting method, system and computer readable storage medium

Publications (1)

Publication Number Publication Date
WO2020259060A1 true WO2020259060A1 (en) 2020-12-30

Family

ID=68359863

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/087211 WO2020259060A1 (en) 2019-06-26 2020-04-27 Test paper information extraction method and system, and computer-readable storage medium

Country Status (2)

Country Link
CN (1) CN110414529A (en)
WO (1) WO2020259060A1 (en)

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112699881A (en) * 2020-12-31 2021-04-23 北京一起教育科技有限责任公司 Image identification method and device and electronic equipment
CN112749694A (en) * 2021-01-20 2021-05-04 中科云谷科技有限公司 Method and device for identifying image direction and nameplate characters
CN113723323A (en) * 2021-09-02 2021-11-30 北京云蝶智学科技有限公司 Test paper information acquisition method and device
CN114444439A (en) * 2022-04-08 2022-05-06 深圳市壹箭教育科技有限公司 Test question set file generation method and device, electronic equipment and storage medium
CN114550181A (en) * 2022-02-10 2022-05-27 珠海读书郎软件科技有限公司 Method, device and medium for identifying question
CN115565190A (en) * 2022-11-17 2023-01-03 江西风向标智能科技有限公司 Test paper layout analysis method, system, computer and readable storage medium
CN115841677A (en) * 2022-12-21 2023-03-24 长扬科技(北京)股份有限公司 Text layout analysis method and device, electronic equipment and storage medium
CN116758578A (en) * 2023-08-18 2023-09-15 上海楷领科技有限公司 Mechanical drawing information extraction method, device, system and storage medium
CN117217209A (en) * 2023-11-07 2023-12-12 湖南强智科技发展有限公司 Intelligent college examination paper assembling method, system, equipment and storage medium

Families Citing this family (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110414529A (en) * 2019-06-26 2019-11-05 深圳中兴网信科技有限公司 Paper information extracting method, system and computer readable storage medium
CN110866388A (en) * 2019-11-19 2020-03-06 重庆华龙网海数科技有限公司 Publishing PDF layout analysis and identification method based on mixing of multiple neural networks
CN111242131A (en) * 2020-01-06 2020-06-05 北京十六进制科技有限公司 Method, storage medium and device for image recognition in intelligent marking
CN110956173B (en) * 2020-02-18 2020-06-23 江西软云科技股份有限公司 Topic content identification method and device, readable storage medium and computer equipment
CN111382721B (en) * 2020-03-20 2020-12-15 上海松鼠课堂人工智能科技有限公司 Examination room monitoring system based on artificial intelligence
CN113537221A (en) * 2020-04-15 2021-10-22 阿里巴巴集团控股有限公司 Image recognition method, device and equipment
CN111507330B (en) * 2020-04-15 2023-04-25 腾讯科技(深圳)有限公司 Problem recognition method and device, electronic equipment and storage medium
CN112000796A (en) * 2020-08-05 2020-11-27 深圳中兴网信科技有限公司 Answer generation method, electronic device and computer-readable storage medium
CN111968104B (en) * 2020-08-27 2023-04-07 中冶赛迪信息技术(重庆)有限公司 Machine vision-based steel coil abnormity identification method, system, equipment and medium
CN112364679A (en) * 2020-09-04 2021-02-12 联想(北京)有限公司 Image area identification method and electronic equipment
CN112036395A (en) * 2020-09-04 2020-12-04 联想(北京)有限公司 Text classification identification method and device based on target detection
CN112560849B (en) * 2021-01-24 2021-08-20 中天恒星(上海)科技有限公司 Neural network algorithm-based grammar segmentation method and system
CN112801084A (en) * 2021-01-29 2021-05-14 杭州大拿科技股份有限公司 Image processing method and device, electronic equipment and storage medium
CN113221880B (en) * 2021-04-29 2022-08-05 上海勃池信息技术有限公司 OCR layout analysis method based on kini purity
CN113705559A (en) * 2021-08-31 2021-11-26 平安银行股份有限公司 Character recognition method and device based on artificial intelligence and electronic equipment
CN114529918A (en) * 2022-02-10 2022-05-24 珠海读书郎软件科技有限公司 Test paper inversion identification method, device and medium
CN117409428B (en) * 2023-12-13 2024-03-01 南昌理工学院 Test paper information processing method, system, computer equipment and storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120317470A1 (en) * 2011-06-07 2012-12-13 Xerox Corporation Generate-and-test method for column segmentation
CN105373978A (en) * 2015-08-12 2016-03-02 高学 Artificial test paper judgment processing device and artificial test paper judgment processing method based on OCR
CN107798321A (en) * 2017-12-04 2018-03-13 海南云江科技有限公司 A kind of examination paper analysis method and computing device
CN109086261A (en) * 2018-08-07 2018-12-25 深圳迪普乐宁科技有限公司 It gos over examination papers by hand on a kind of paper the method and system of computer system point
CN109685065A (en) * 2018-12-11 2019-04-26 中国科学院自动化研究所 Printed page analysis method, the system of paper automatic content classification
CN110414529A (en) * 2019-06-26 2019-11-05 深圳中兴网信科技有限公司 Paper information extracting method, system and computer readable storage medium

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107748888B (en) * 2017-10-13 2019-11-08 众安信息技术服务有限公司 A kind of image text row detection method and device

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120317470A1 (en) * 2011-06-07 2012-12-13 Xerox Corporation Generate-and-test method for column segmentation
CN105373978A (en) * 2015-08-12 2016-03-02 高学 Artificial test paper judgment processing device and artificial test paper judgment processing method based on OCR
CN107798321A (en) * 2017-12-04 2018-03-13 海南云江科技有限公司 A kind of examination paper analysis method and computing device
CN109086261A (en) * 2018-08-07 2018-12-25 深圳迪普乐宁科技有限公司 It gos over examination papers by hand on a kind of paper the method and system of computer system point
CN109685065A (en) * 2018-12-11 2019-04-26 中国科学院自动化研究所 Printed page analysis method, the system of paper automatic content classification
CN110414529A (en) * 2019-06-26 2019-11-05 深圳中兴网信科技有限公司 Paper information extracting method, system and computer readable storage medium

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112699881A (en) * 2020-12-31 2021-04-23 北京一起教育科技有限责任公司 Image identification method and device and electronic equipment
CN112749694A (en) * 2021-01-20 2021-05-04 中科云谷科技有限公司 Method and device for identifying image direction and nameplate characters
CN113723323A (en) * 2021-09-02 2021-11-30 北京云蝶智学科技有限公司 Test paper information acquisition method and device
CN114550181B (en) * 2022-02-10 2023-01-10 珠海读书郎软件科技有限公司 Method, device and medium for identifying question
CN114550181A (en) * 2022-02-10 2022-05-27 珠海读书郎软件科技有限公司 Method, device and medium for identifying question
CN114444439A (en) * 2022-04-08 2022-05-06 深圳市壹箭教育科技有限公司 Test question set file generation method and device, electronic equipment and storage medium
CN114444439B (en) * 2022-04-08 2022-08-26 深圳市壹箭教育科技有限公司 Test question set file generation method and device, electronic equipment and storage medium
CN115565190A (en) * 2022-11-17 2023-01-03 江西风向标智能科技有限公司 Test paper layout analysis method, system, computer and readable storage medium
CN115841677A (en) * 2022-12-21 2023-03-24 长扬科技(北京)股份有限公司 Text layout analysis method and device, electronic equipment and storage medium
CN115841677B (en) * 2022-12-21 2023-09-05 长扬科技(北京)股份有限公司 Text layout analysis method and device, electronic equipment and storage medium
CN116758578A (en) * 2023-08-18 2023-09-15 上海楷领科技有限公司 Mechanical drawing information extraction method, device, system and storage medium
CN116758578B (en) * 2023-08-18 2023-11-07 上海楷领科技有限公司 Mechanical drawing information extraction method, device, system and storage medium
CN117217209A (en) * 2023-11-07 2023-12-12 湖南强智科技发展有限公司 Intelligent college examination paper assembling method, system, equipment and storage medium

Also Published As

Publication number Publication date
CN110414529A (en) 2019-11-05

Similar Documents

Publication Publication Date Title
WO2020259060A1 (en) Test paper information extraction method and system, and computer-readable storage medium
US11790641B2 (en) Answer evaluation method, answer evaluation system, electronic device, and medium
WO2021073266A1 (en) Image detection-based test question checking method and related device
CN110210413B (en) Multidisciplinary test paper content detection and identification system and method based on deep learning
CN107506762B (en) Score automatic input method based on image analysis
CN107093172B (en) Character detection method and system
CN111597908A (en) Test paper correcting method and test paper correcting device
CN111274239B (en) Test paper structuring processing method, device and equipment
CN105046200B (en) Electronic paper marking method based on straight line detection
CN112446370B (en) Method for identifying text information of nameplate of power equipment
CN111626292B (en) Text recognition method of building indication mark based on deep learning technology
CN112446259A (en) Image processing method, device, terminal and computer readable storage medium
CN111079641A (en) Answering content identification method, related device and readable storage medium
CN111695555B (en) Question number-based accurate question framing method, device, equipment and medium
CN113159014A (en) Objective question reading method, device, equipment and storage medium based on handwritten question numbers
CN106033534B (en) Electronic paper marking method based on straight line detection
CN116824608A (en) Answer sheet layout analysis method based on target detection technology
CN113537227B (en) Structured text recognition method and system
CN111832497B (en) Text detection post-processing method based on geometric features
CN113610068A (en) Test question disassembling method, system, storage medium and equipment based on test paper image
CN112686263A (en) Character recognition method and device, electronic equipment and storage medium
CN116822634A (en) Document visual language reasoning method based on layout perception prompt
KR100795951B1 (en) System for grading examination paper and control method
CN110705610A (en) Evaluation system and method based on handwriting detection and temporary writing capability
CN116612478A (en) Off-line handwritten Chinese character scoring method, device and storage medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20831713

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 31.05.2022)

122 Ep: pct application non-entry in european phase

Ref document number: 20831713

Country of ref document: EP

Kind code of ref document: A1