CN109165652A

CN109165652A - Mark-based test paper grading method

Info

Publication number: CN109165652A
Application number: CN201810797045.4A
Authority: CN
Inventors: 邱俊杰
Original assignee: Jiangsu Golden House Education Development Ltd By Share Ltd
Current assignee: Jiangsu Golden House Education Development Ltd By Share Ltd
Priority date: 2018-07-19
Filing date: 2018-07-19
Publication date: 2019-01-08

Abstract

The present invention relates to a mark-based test paper grading method. On the test paper, a first objective mark is placed before each objective question, and each option of each objective question is provided with a second objective mark and an answer box for the examinee to fill in; the first objective mark differs from the second objective mark. A first region identification point is placed at the start of each subjective question; it differs from both the first objective mark and the second objective mark, and the span between the first region identification point of a subjective question and the next region identification point is the answer range of that subjective question. With this method, examinees do not need to fill in an answer sheet during the examination, which improves examination efficiency and also reduces the probability of errors made when filling in an answer sheet.

Description

Mark-based test paper grading method

Technical field

The present invention relates to test paper grading methods, and in particular to a mark-based test paper grading method.

Background art

Current automatic test paper recognition technology mainly targets answer sheets. Although this approach saves grading time and improves grading accuracy, it has some problems: it wastes paper, and because examinees must fill in marks on the answer sheet, it costs extra time and makes it easy for an examinee to mark an answer in the wrong position.

In recent years, large-scale examinations have graded subjective questions mainly by capturing images of the subjective-question answer sheet and having multiple graders mark them online. This reduces grading errors and shortens grading time, but in practice it suffers from the same problems. Moreover, because a grader can see only the answer inside the designated area of the answer sheet, the grader cannot see the complete answer when an examinee writes the answer in the wrong place or outside the designated area.

At present there is little research on automatic grading techniques that do not rely on answer sheets. The prior art mainly recognizes handwritten digits or letters in order to identify the answers in the objective-question area and grade them automatically. In practice, however, this technique is limited by the recognition accuracy of handwritten digits or letters and is difficult to apply to large-scale examinations.

Summary of the invention

In view of the above technical problems, it is necessary to provide a mark-based test paper grading method with which examinees do not need to fill in an answer sheet during the examination, improving examination efficiency while also reducing the probability of answer-sheet filling errors.

A mark-based test paper grading method, wherein on the test paper a first objective mark is placed before each objective question, each option of each objective question is provided with a second objective mark and an answer box for the examinee to fill in, the first objective mark differs from the second objective mark, a first region identification point is placed at the start of each subjective question, the first region identification point differs from the first objective mark and the second objective mark, and the span between the first region identification point of a subjective question and the next region identification point is the answer range of that subjective question, the method comprising:

Obtaining an image of the test paper and extracting from the image the first objective marks, the second objective marks, the answer mark points, and the first region identification points, wherein an answer mark point is an answer box filled in by the examinee; identifying and distinguishing the extracted first objective marks, second objective marks, answer mark points, and first region identification points; and sorting the identified first objective marks;

For each first objective mark, determining the second objective marks and the answer mark point associated with that first objective mark according to the positional relationship between the first objective mark and the second objective marks and answer mark points; sorting the associated second objective marks; then calculating the distance from the associated answer mark point to each associated second objective mark; recording as the examinee's answer the second objective mark that is nearest and satisfies the proportional relationship of the test paper design; comparing the sequence number of that second objective mark with the standard answer; and judging from the comparison whether the examinee's answer is correct;

After the first region identification points are identified, cropping the image and recognizing the characters of the subjective question from the cropped image, so that the subjective question can be graded.
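As a rough illustration of the cropping step, the sketch below splits a page image into one strip per subjective question. It is not taken from the patent: it assumes the image is a plain list of pixel rows, that each region identification point has already been reduced to a row index, and that the last answer range runs to the bottom of the page. The name `crop_answer_regions` is hypothetical.

```python
def crop_answer_regions(image, point_rows):
    """Split a page image into one strip per subjective question.

    image: list of pixel rows (top row first).
    point_rows: row index of each region identification point, in any order.
    Assumption: the last answer range extends to the bottom of the page.
    """
    # Each answer range starts at one identification point and ends at the next.
    bounds = sorted(point_rows) + [len(image)]
    return [image[top:bottom] for top, bottom in zip(bounds, bounds[1:])]


if __name__ == "__main__":
    page = [[row] * 3 for row in range(10)]  # toy 10-row "image"
    strips = crop_answer_regions(page, [6, 2])
    print([len(strip) for strip in strips])
```

Each returned strip could then be passed to the character recognition steps described for subjective questions.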

With the above mark-based test paper grading method, examinees do not need to fill in an answer sheet during the examination, which improves examination efficiency and also reduces the probability of answer-sheet filling errors.

In another embodiment, recognizing the characters of the subjective question in the step of "after the first region identification points are identified, cropping the image and recognizing the characters of the subjective question from the cropped image, so that the subjective question can be graded" specifically comprises:

Preprocessing the image to be recognized to obtain a target image containing only characters;

Extracting the characters in the target image;

For each extracted character, obtaining the connectivity feature of the character;

For each character, extracting, from all the pixels that make up the character, the pixels with the maximum and minimum abscissa in each row and the pixels with the maximum and minimum ordinate in each column, which together constitute the contour feature of the character;

Recognizing each character according to a pre-established character template library together with the connectivity feature and the contour feature of the character.
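The contour feature described above can be sketched as follows. This is only an illustration under stated assumptions: the character is given as a binary grid (a list of rows, with 1 marking a character pixel), and the feature is returned as a set of `(x, y)` coordinates. The function name `contour_feature` and the representation are hypothetical and are not specified in the patent.

```python
def contour_feature(glyph):
    """Collect the extreme pixels of a binary character image.

    glyph: list of rows of 0/1 values, 1 = character pixel.
    Returns the set of (x, y) pixels that have the minimum or maximum
    abscissa in their row, or the minimum or maximum ordinate in their column.
    """
    contour = set()
    # Leftmost and rightmost character pixel of every row.
    for y, row in enumerate(glyph):
        xs = [x for x, value in enumerate(row) if value]
        if xs:
            contour.add((xs[0], y))
            contour.add((xs[-1], y))
    # Topmost and bottommost character pixel of every column.
    width = len(glyph[0]) if glyph else 0
    for x in range(width):
        ys = [y for y, row in enumerate(glyph) if row[x]]
        if ys:
            contour.add((x, ys[0]))
            contour.add((x, ys[-1]))
    return contour


if __name__ == "__main__":
    filled_square = [[1, 1, 1], [1, 1, 1], [1, 1, 1]]
    print(sorted(contour_feature(filled_square)))
```

For a filled 3 by 3 square, every pixel except the centre one is an extreme of its row or column, so only the centre is left out of the feature.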

In another embodiment, extracting the characters in the target image comprises:

Obtaining the first region in which each character is located according to how the characters are separated in the target image;

In the first region, obtaining the vertex coordinate points given by the maximum and minimum abscissa and ordinate of the pixels that represent the character;

Extracting the image formed by all the pixels in the rectangular region defined by the vertex coordinate points as the character in the first region.
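The rectangle extraction above amounts to cropping a region to the bounding box of its character pixels. The sketch below shows one way to do this, assuming a binary grid (list of rows, 1 = character pixel); `crop_to_bounding_box` is a hypothetical name used only for illustration.

```python
def crop_to_bounding_box(region):
    """Crop a binary region to the rectangle spanned by its character pixels.

    region: list of rows of 0/1 values, 1 = character pixel.
    Returns the sub-image inside the bounding rectangle, or [] if the region
    contains no character pixel.
    """
    points = [(x, y)
              for y, row in enumerate(region)
              for x, value in enumerate(row) if value]
    if not points:
        return []
    xs = [x for x, _ in points]
    ys = [y for _, y in points]
    left, right, top, bottom = min(xs), max(xs), min(ys), max(ys)
    # The four extremes are the vertex coordinates of the rectangle.
    return [row[left:right + 1] for row in region[top:bottom + 1]]


if __name__ == "__main__":
    region = [
        [0, 0, 0, 0],
        [0, 1, 1, 0],
        [0, 0, 1, 0],
        [0, 0, 0, 0],
    ]
    print(crop_to_bounding_box(region))
```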

In another embodiment, obtaining the first region in which each character is located according to how the characters are separated in the target image comprises:

Obtaining dividing lines, each consisting of at least one column of the target image in which every pixel is a background pixel;

Obtaining the second region in which all the characters of the target image are located, according to the widths of the background on the left and right sides of the target image;

In the second region, obtaining the first region in which each character is located according to the minimum character width and the dividing lines.
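One possible reading of these three steps is sketched below. It assumes a binary image of a single text line (list of rows, 1 = character pixel), treats every all-background column as a dividing line, trims the left and right background margins to find the span of all characters, and merges any piece narrower than a minimum width with the piece that follows it. The merge rule, the parameter `min_width`, and the name `split_characters` are assumptions made for illustration; the patent does not spell out how the minimum width is applied.

```python
def split_characters(image, min_width=2):
    """Return (start, end) column ranges, end exclusive, one per character.

    image: list of rows of 0/1 values for one text line, 1 = character pixel.
    min_width: assumed minimum character width in columns.
    """
    if not image or not image[0]:
        return []
    width = len(image[0])
    # A column is a dividing line if every pixel in it is background.
    blank = [all(row[x] == 0 for row in image) for x in range(width)]
    inked = [x for x in range(width) if not blank[x]]
    if not inked:
        return []
    # Second region: the span left after trimming the side margins.
    left, right = inked[0], inked[-1]
    regions, start, end = [], None, None
    for x in range(left, right + 1):
        if not blank[x]:
            if start is None:
                start = x
            end = x + 1
        elif start is not None and end - start >= min_width:
            # Wide enough: close this character at the dividing line.
            regions.append((start, end))
            start = None
        # Otherwise the piece is too narrow and is merged with what follows.
    if start is not None:
        regions.append((start, end))
    return regions


if __name__ == "__main__":
    line = [
        [0, 1, 1, 1, 0, 1, 0, 1, 1, 0],
        [0, 1, 0, 1, 0, 1, 0, 0, 1, 0],
    ]
    print(split_characters(line, min_width=2))
```

In the toy line, the single-column piece at column 5 is narrower than `min_width`, so it is kept together with the piece at columns 7 and 8 rather than being split off as its own character.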

In another embodiment, obtaining the connectivity feature of each extracted character comprises:

Obtaining, within each character, the connected components formed by contiguous pixels representing the character, and the attribute information of each connected component;

Taking all the connected components together with their attribute information as the connectivity feature.

In another embodiment, the attribute information of a connected component comprises at least one of the following: the relative position of each connected component, the number of pixels in each connected component, the strokes contained in each connected component, and the edge gradient value of each connected component.
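A minimal sketch of extracting connected components with two of the listed attributes (pixel count and position) is given below. It assumes a binary character grid, uses 8-connectivity, and reports position as a bounding box relative to the character image; all three choices are assumptions, as is the name `connected_components`. Strokes and edge gradient values are not computed here.

```python
from collections import deque


def connected_components(glyph):
    """Label the connected components of a binary character image.

    glyph: list of rows of 0/1 values, 1 = character pixel.
    Returns a list of dicts with "pixel_count" and "bbox"
    (left, top, right, bottom), in top-to-bottom, left-to-right scan order.
    """
    height = len(glyph)
    width = len(glyph[0]) if glyph else 0
    seen = [[False] * width for _ in range(height)]
    components = []
    for y0 in range(height):
        for x0 in range(width):
            if not glyph[y0][x0] or seen[y0][x0]:
                continue
            # Breadth-first flood fill over the 8 neighbours of each pixel.
            queue = deque([(x0, y0)])
            seen[y0][x0] = True
            pixels = []
            while queue:
                x, y = queue.popleft()
                pixels.append((x, y))
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        nx, ny = x + dx, y + dy
                        if (0 <= nx < width and 0 <= ny < height
                                and glyph[ny][nx] and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((nx, ny))
            xs = [p[0] for p in pixels]
            ys = [p[1] for p in pixels]
            components.append({
                "pixel_count": len(pixels),
                "bbox": (min(xs), min(ys), max(xs), max(ys)),
            })
    return components


if __name__ == "__main__":
    glyph = [
        [1, 0, 0, 1],
        [1, 0, 0, 1],
        [0, 0, 0, 1],
    ]
    for component in connected_components(glyph):
        print(component)
```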

In another embodiment, when the character is a Chinese character, the strokes contained in each connected component are obtained as follows:

Obtaining the horizontal and vertical strokes based on the orientation angle of the straight line formed by the pixels representing the character;

Obtaining the left-falling, right-falling, turning, and dot strokes based on the orientation angle and length of the straight line fitted to the pixels representing the character.
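The patent does not give thresholds or an exact decision rule for this step, so the following is only a hypothetical sketch of how angle and length could separate stroke types for a single fitted line segment. The tolerance `angle_tol`, the length limit `dot_length`, and the function name `classify_stroke` are all assumptions. Turning strokes consist of more than one segment and are not handled here.

```python
from math import atan2, degrees, hypot


def classify_stroke(start, end, angle_tol=15.0, dot_length=4.0):
    """Classify one fitted line segment as a stroke type.

    start, end: (x, y) endpoints in image coordinates (y grows downward).
    angle_tol: assumed tolerance in degrees around 0 and 90.
    dot_length: assumed maximum length of a dot stroke, in pixels.
    """
    dx, dy = end[0] - start[0], end[1] - start[1]
    length = hypot(dx, dy)
    # Orientation in [0, 180), independent of the order of the endpoints.
    angle = degrees(atan2(dy, dx)) % 180.0
    # Horizontal and vertical strokes are decided by angle alone.
    if angle <= angle_tol or angle >= 180.0 - angle_tol:
        return "horizontal"
    if abs(angle - 90.0) <= angle_tol:
        return "vertical"
    # The remaining types use both angle and length.
    if length <= dot_length:
        return "dot"
    # With y pointing down, top-left to bottom-right gives an angle below 90.
    return "right-falling" if angle < 90.0 else "left-falling"


if __name__ == "__main__":
    print(classify_stroke((0, 0), (10, 1)))
    print(classify_stroke((8, 0), (0, 8)))
```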

A computer device, comprising a memory, a processor, and a computer program stored in the memory and executable on the processor, wherein the processor implements the steps of any one of the above methods when executing the program.

A computer-readable storage medium storing a computer program which, when executed by a processor, implements the steps of any one of the above methods.

A processor configured to run a program, wherein the program, when run, performs any one of the above methods.

Brief description of the drawings

Fig. 1 is a flow diagram of a mark-based test paper grading method provided by an embodiment of the present application.

Detailed description of the embodiments

In order to make the objectives, technical solutions, and advantages of the present invention clearer, the present invention is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein merely illustrate the present invention and are not intended to limit it.

Referring to Fig. 1, a mark-based test paper grading method, wherein on the test paper a first objective mark is placed before each objective question, each option of each objective question is provided with a second objective mark and an answer box for the examinee to fill in, the first objective mark differs from the second objective mark, a first region identification point is placed at the start of each subjective question, the first region identification point differs from the first objective mark and the second objective mark, and the span between the first region identification point of a subjective question and the next region identification point is the answer range of that subjective question, the method comprising:

S110: obtaining an image of the test paper and extracting from the image the first objective marks, the second objective marks, the answer mark points, and the first region identification points, wherein an answer mark point is an answer box filled in by the examinee; identifying and distinguishing the extracted first objective marks, second objective marks, answer mark points, and first region identification points; and sorting the identified first objective marks;

S120: for each first objective mark, determining the second objective marks and the answer mark point associated with that first objective mark according to the positional relationship between the first objective mark and the second objective marks and answer mark points; sorting the associated second objective marks; then calculating the distance from the associated answer mark point to each associated second objective mark; recording as the examinee's answer the second objective mark that is nearest and satisfies the proportional relationship of the test paper design; comparing the sequence number of that second objective mark with the standard answer; and judging from the comparison whether the examinee's answer is correct;

S130: after the first region identification points are identified, cropping the image and recognizing the characters of the subjective question from the cropped image, so that the subjective question can be graded.
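The nearest-mark selection for one objective question can be sketched as follows. The sketch assumes that the marks of one question have already been associated, that each mark is reduced to an `(x, y)` centre, and that the options lie on one row so that sorting by x gives their order. The "proportional relationship of the test paper design" is not defined further in the text; here it is read, purely as an assumption, as a limit `max_ratio` on the mark-to-box distance relative to the option spacing. The function name `grade_objective` is hypothetical.

```python
from math import hypot


def grade_objective(option_marks, answer_mark, correct_index, max_ratio=0.5):
    """Return (chosen_index, is_correct) for one objective question.

    option_marks: (x, y) centres of the second objective marks of the question.
    answer_mark: (x, y) centre of the filled answer box.
    correct_index: 1-based position of the correct option in the answer key.
    max_ratio: assumed layout limit, as a fraction of the option spacing.
    chosen_index is None when no option passes the layout check.
    """
    # Sort the options left to right (assumed single-row layout).
    ordered = sorted(option_marks)
    distances = [hypot(answer_mark[0] - x, answer_mark[1] - y)
                 for x, y in ordered]
    best = min(range(len(ordered)), key=distances.__getitem__)
    if len(ordered) > 1:
        spacing = min(ordered[i + 1][0] - ordered[i][0]
                      for i in range(len(ordered) - 1))
        # Reject a filled box that is too far from every option mark.
        if distances[best] > max_ratio * spacing:
            return None, False
    chosen = best + 1  # sequence number of the nearest second objective mark
    return chosen, chosen == correct_index


if __name__ == "__main__":
    options = [(100, 50), (200, 50), (300, 50), (400, 50)]
    print(grade_objective(options, (215, 52), correct_index=2))
```

Returning `None` for a rejected mark leaves room for a manual check instead of silently assigning the closest option.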

Extracting the characters in the target image;

For each extracted character, obtaining the connectivity feature of the character;

In another embodiment, extracting the characters in the target image comprises:

Obtaining the first region in which each character is located, comprising:

The technical features of the embodiments described above may be combined arbitrarily. For brevity, not every possible combination of these technical features has been described; however, as long as a combination involves no contradiction, it should be regarded as within the scope of this specification.

The embodiments described above express only several implementations of the present invention. Their description is relatively specific and detailed, but it should not therefore be construed as limiting the scope of the patent. It should be noted that a person of ordinary skill in the art may make various modifications and improvements without departing from the inventive concept, and these all fall within the scope of protection of the present invention. The scope of protection of this patent shall therefore be determined by the appended claims.

Claims

1. A mark-based test paper grading method, characterized in that on the test paper a first objective mark is placed before each objective question, each option of each objective question is provided with a second objective mark and an answer box for the examinee to fill in, the first objective mark differs from the second objective mark, a first region identification point is placed at the start of each subjective question, the first region identification point differs from the first objective mark and the second objective mark, and the span between the first region identification point of a subjective question and the next region identification point is the answer range of that subjective question, the method comprising:

Obtaining an image of the test paper and extracting from the image the first objective marks, the second objective marks, the answer mark points, and the first region identification points, wherein an answer mark point is an answer box filled in by the examinee; identifying and distinguishing the extracted first objective marks, second objective marks, answer mark points, and first region identification points; and sorting the identified first objective marks;

For each first objective mark, determining the second objective marks and the answer mark point associated with that first objective mark according to the positional relationship between the first objective mark and the second objective marks and answer mark points; sorting the associated second objective marks; then calculating the distance from the associated answer mark point to each associated second objective mark; recording as the examinee's answer the second objective mark that is nearest and satisfies the proportional relationship of the test paper design; comparing the sequence number of that second objective mark with the standard answer; and judging from the comparison whether the examinee's answer is correct;

After the first region identification points are identified, cropping the image and recognizing the characters of the subjective question from the cropped image, so that the subjective question can be graded.

2. The mark-based test paper grading method according to claim 1, characterized in that recognizing the characters of the subjective question in the step of "after the first region identification points are identified, cropping the image and recognizing the characters of the subjective question from the cropped image, so that the subjective question can be graded" specifically comprises:

Extracting the characters in the target image;

For each extracted character, obtaining the connectivity feature of the character;

For each character, extracting, from all the pixels that make up the character, the pixels with the maximum and minimum abscissa in each row and the pixels with the maximum and minimum ordinate in each column, which together constitute the contour feature of the character;

Recognizing each character according to a pre-established character template library together with the connectivity feature and the contour feature of the character.

3. The mark-based test paper grading method according to claim 2, characterized in that extracting the characters in the target image comprises:

In the first region, obtaining the vertex coordinate points given by the maximum and minimum abscissa and ordinate of the pixels that represent the character;

Extracting the image formed by all the pixels in the rectangular region defined by the vertex coordinate points as the character in the first region.

4. The mark-based test paper grading method according to claim 3, characterized in that obtaining the first region in which each character is located according to how the characters are separated in the target image comprises:

Obtaining the second region in which all the characters of the target image are located, according to the widths of the background on the left and right sides of the target image;

In the second region, obtaining the first region in which each character is located according to the minimum character width and the dividing lines.

5. The mark-based test paper grading method according to claim 2, characterized in that obtaining the connectivity feature of each extracted character comprises:

Obtaining, within each character, the connected components formed by contiguous pixels representing the character, and the attribute information of each connected component;

6. The mark-based test paper grading method according to claim 5, characterized in that the attribute information of a connected component comprises at least one of the following: the relative position of each connected component, the number of pixels in each connected component, the strokes contained in each connected component, and the edge gradient value of each connected component.

7. The mark-based test paper grading method according to claim 6, characterized in that when the character is a Chinese character, the strokes contained in each connected component are obtained as follows:

Obtaining the left-falling, right-falling, turning, and dot strokes based on the orientation angle and length of the straight line fitted to the pixels representing the character.

8. A computer device, comprising a memory, a processor, and a computer program stored in the memory and executable on the processor, characterized in that the processor implements the steps of the method of any one of claims 1 to 7 when executing the program.

9. A computer-readable storage medium storing a computer program, characterized in that the program, when executed by a processor, implements the steps of the method of any one of claims 1 to 7.

10. A processor, characterized in that the processor is configured to run a program, wherein the program, when run, performs the method of any one of claims 1 to 7.