WO2014069741A1 - Apparatus and method for automatic scoring - Google Patents

Apparatus and method for automatic scoring

Info

Publication number
WO2014069741A1
Authority
WO
WIPO (PCT)
Prior art keywords
evaluation
scoring
score
automatic scoring
automatic
Prior art date
Application number
PCT/KR2013/005347
Other languages
French (fr)
Korean (ko)
Inventor
윤종철
윤경아
Original Assignee
에스케이텔레콤 주식회사
Priority date
Filing date
Publication date
Application filed by 에스케이텔레콤 주식회사
Priority to CN201380031051.4A (CN104364815A)
Publication of WO2014069741A1
Priority to US14/558,154 (US20150093737A1)

Classifications

    • G: PHYSICS
    • G09: EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09B: EDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
    • G09B7/00: Electrically-operated teaching apparatus or devices working with questions and answers
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06Q: INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00: Systems or methods specially adapted for specific business sectors, e.g. utilities or tourism
    • G06Q50/10: Services
    • G06Q50/20: Education
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06N: COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00: Machine learning
    • G: PHYSICS
    • G09: EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09B: EDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
    • G09B7/00: Electrically-operated teaching apparatus or devices working with questions and answers
    • G09B7/02: Electrically-operated teaching apparatus or devices working with questions and answers of the type wherein the student is expected to construct an answer to the question which is presented or wherein the machine gives an answer to the question presented by a student

Definitions

  • The present invention relates to an automatic scoring technique for automatically scoring a user's answer through machine learning, and more particularly, to an automatic scoring apparatus and method for automatically scoring target data in consideration of the correlations between evaluation areas.
  • With the development of communication technology, it has recently become possible to take language tests and simple level tests over a network; to this end, a server device that provides a test scores the test and provides the scoring result.
  • Conventionally, grading results were provided by having a person grade the answers directly and input the grading data into a server device.
  • However, this scoring method requires a great deal of manpower for scoring and considerable time to check the scoring results, so it is difficult to provide a fast service.
  • To improve on this, automatic scoring systems have recently been developed that score answers through machine learning rather than relying on human graders.
  • Such a conventional automatic scoring system collects examiners' subjective scoring data for a number of existing answers, analyzes the items in each answer that can be evaluated by machine learning (evaluation features, or evaluation qualities), generates a scoring model based on the machine-evaluable items and the examiners' subjective scoring results through machine learning, and then performs automatic scoring by analyzing the similarity of answers against the generated scoring model.
  • However, owing to the characteristics of language pedagogy, the scoring areas are not completely mutually exclusive, and an examiner's scores for the different evaluation areas influence one another.
  • Because the conventional automatic scoring system does not reflect these characteristics, its automatic scores agree poorly with the examiner's scoring results and its accuracy suffers.
  • The present invention is proposed to address these drawbacks and aims to provide an automatic scoring apparatus and method that, when scoring target data containing a user-written answer automatically through machine learning, score the target data in consideration of the correlations between the evaluation areas.
  • The present invention also aims to provide an automatic scoring apparatus and method that generate a correlation model between evaluation areas by reflecting language pedagogical characteristics, evaluation area characteristics, the examiner's answer evaluation characteristics, and the like, and that apply the generated correlation model to compensate for errors in the scoring model of each evaluation area.
  • As a means of solving the problem, the present invention provides an automatic scoring apparatus including: an automatic scoring unit that performs automatic scoring for each evaluation area on scoring target data by applying a pre-generated scoring model for each evaluation area; and a score tuning unit that calculates a final automatic score by adjusting the per-area automatic scores of the scoring target data output from the automatic scoring unit according to a correlation model between evaluation areas.
  • The automatic scoring apparatus may further include at least one of: a scoring model generation unit that generates the scoring model for each evaluation area through machine learning using previously scored data, in which the one or more evaluation areas were evaluated for one or more answers, and one or more evaluation features extracted from those answers; and a correlation model generation unit that generates the correlation model between evaluation areas, which defines the probability of each score occurring across the one or more evaluation areas, based on the previously scored data.
  • The score tuning unit may compare the automatic scores of the evaluation areas, select an abnormal evaluation area whose scoring-correlation deviation from the other evaluation areas is larger than a preset range, and tune the automatic score of the abnormal evaluation area using the correlation model between the evaluation areas.
  • Using the correlation model, the score tuning unit may calculate the occurrence probability of each possible score of the selected abnormal evaluation area based on the automatic scores of the remaining evaluation areas other than the abnormal evaluation area, and may change the automatic score of the abnormal evaluation area to the score having the highest probability.
  • In the corresponding automatic scoring method, the tuning may include: comparing the automatic scores of the evaluation areas and selecting an abnormal evaluation area whose deviation is larger than a preset range; calculating the occurrence probability of each possible score of the selected abnormal evaluation area based on the automatic scores of the remaining evaluation areas other than the abnormal evaluation area; and changing the automatic score of the abnormal evaluation area to the score having the highest probability.
  • Before the automatic scoring is performed, the automatic scoring method may further include at least one of: generating the scoring model for each evaluation area through machine learning using previously scored data, in which the one or more evaluation areas were evaluated for one or more answers, and one or more evaluation features extracted from those answers; and generating the correlation model between evaluation areas, which defines the probability of each score occurring across the one or more evaluation areas, based on the previously scored data.
  • The present invention also provides a computer-readable recording medium on which a program for executing the above-described automatic scoring method is recorded.
  • The present invention relates to a technique for automatically evaluating an answer written by a user in one or more language areas including speaking, listening, and writing.
  • In particular, by creating a correlation model between the evaluation areas that reflects language pedagogical characteristics, evaluation area characteristics, and the examiner's answer evaluation characteristics, the implicit judgment criteria for the evaluation areas can be modeled more realistically.
  • In addition, by applying the generated correlation model between the evaluation areas, the present invention reflects the correlations that can appear between evaluation areas, thereby minimizing errors relative to the examiner's answer evaluation characteristics when performing automatic scoring through the per-area scoring models and increasing the reliability of the evaluation result.
  • FIG. 1 is a diagram illustrating an automatic scoring apparatus according to an exemplary embodiment of the present invention.
  • FIG. 2 is a diagram illustrating a method for performing automatic scoring by applying a correlation model between evaluation areas according to an embodiment of the present invention.
  • FIG. 3 is a diagram illustrating a configuration of an automatic evaluation service system to which an automatic scoring apparatus according to the present invention is applied.
  • FIG. 4 is a diagram illustrating a terminal device to which an automatic scoring method is applied according to an exemplary embodiment of the present invention.
  • FIGS. 5A to 5C are correlation tables between evaluation areas for describing a correlation model between evaluation areas according to an embodiment of the present invention.
  • FIGS. 6 to 8 are diagrams showing an example of an automatic scoring process to which a correlation model between evaluation areas is applied according to an embodiment of the present invention.
  • An "evaluation area" is a grading criterion established to standardize scoring among examiners for a specific evaluation test, and can be defined as a scoring area together with the evaluation content of that scoring area.
  • For example, in the case of a speaking evaluation for a foreign language, the evaluation areas may include fluency, language use, composition, and pronunciation.
  • Here, fluency evaluates the appropriateness of the speech rate and the degree to which natural speech is maintained without hesitation.
  • Language use evaluates the accuracy of expression and the appropriateness of vocabulary usage.
  • Composition evaluates the logical connectivity of the speech and the consistency and cohesion of its content.
  • Pronunciation evaluates the clarity and intelligibility of pronunciation. The present invention aims to implement automatic scoring for one or more such predetermined evaluation areas (see the illustrative representation below).
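As a purely illustrative aid (not part of the patent text), the evaluation areas of such a speaking test could be represented roughly as follows; the pairing of rubric numbers with particular areas and the data structure itself are assumptions, while the 0 to 5 score range follows the embodiment described later.

```python
# Illustrative only: evaluation areas (rubrics) of a foreign-language speaking test.
# The mapping of rubric numbers to specific areas is an assumption for illustration.
EVALUATION_AREAS = {
    "Rubric#1": "fluency: appropriate speech rate, natural speech without hesitation",
    "Rubric#2": "language use: accuracy of expression, appropriate vocabulary",
    "Rubric#3": "composition: logical connectivity, consistency and cohesion of content",
    "Rubric#4": "pronunciation: clarity and intelligibility",
}
SCORES = range(0, 6)  # each evaluation area is scored from 0 to 5 points
```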
  • FIG. 1 is a diagram illustrating a configuration of an automatic scoring apparatus for performing automatic scoring according to an embodiment of the present invention.
  • The automatic scoring apparatus 100 is an apparatus for automatically scoring an answer written by an examinee for a specific problem on the basis of one or more preset evaluation areas.
  • In particular, the automatic scoring apparatus 100 automatically calculates a score for each of one or more evaluation areas of the scoring target data using the per-area scoring models. Subsequently, the automatic scoring apparatus 100 compares the automatic scores of the evaluation areas produced by the per-area scoring models using the previously generated correlation model between the evaluation areas, and tunes the automatic score of any abnormal evaluation area whose score falls outside the preset range.
  • To this end, the automatic scoring apparatus 100 collects reference scoring data for one or more answers, for example, scores for one or more evaluation areas assigned directly by an examiner.
  • In addition, the automatic scoring apparatus 100 may extract one or more evaluation features from the one or more answers.
  • The automatic scoring apparatus 100 may then generate a scoring model for each evaluation area by performing machine learning using the evaluation features of each answer and the previously collected scoring data.
  • The automatic scoring apparatus 100 may automatically score newly input scoring target data for each evaluation area through the generated per-area scoring models.
  • the automatic scoring apparatus 100 may generate a correlation model between evaluation areas in advance using the scoring data.
  • the automatic scoring apparatus 100 may include a scoring model generator 110, a correlation model generator 120, an automatic scoring unit 130, and a score tuning unit 140.
  • the scoring model generator 110, the correlation model generator 120, the automatic scoring unit 130, and the score tuning unit 140 may be implemented in hardware or software, or a combination of hardware and software.
  • For example, the scoring model generator 110, the correlation model generator 120, the automatic scoring unit 130, and the score tuning unit 140 may be implemented as software configured to perform the functions described below, combined with a microprocessor that executes the software.
  • The scoring model generation unit 110 generates a scoring model for each evaluation area through machine learning using per-area scoring data for one or more answers previously scored by an examiner and one or more evaluation features extracted from the pre-scored answers.
  • Specifically, the scoring model generation unit 110 receives the evaluation features extracted from the one or more answers, that is, items that can be evaluated automatically (for example, word count, number of adjectives, grammatical errors, spelling errors, tense agreement, and similarity to a model answer).
  • It then machine-learns these evaluation features against the examiner's per-area scores for the answers, generating for each evaluation area a scoring model that defines the relationship between the evaluation features and the score of that area. In other words, the examiner's subjective evaluation criteria are modeled in terms of one or more automatically evaluable evaluation features (see the illustrative sketch below).
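The patent does not prescribe a particular learning algorithm or feature extractor, so the following is only a minimal sketch under assumptions: the simple text features and the linear least-squares fit stand in for the evaluation features the text lists (word count, number of adjectives, grammar and spelling errors, tense agreement, similarity to a model answer), which in practice would require NLP tooling.

```python
# A minimal sketch, not the patent's implementation: train one scoring model
# (here a weight vector) per evaluation area from examiner-scored answers.
import re
import numpy as np

def extract_features(answer: str, model_answer: str) -> np.ndarray:
    words = answer.split()
    model_words = set(model_answer.split())
    return np.array([
        len(words),                                        # word count
        len(re.findall(r"[.!?]", answer)),                 # rough sentence count
        sum(len(w) for w in words) / max(len(words), 1),   # average word length
        len(set(words) & model_words) / max(len(model_words), 1),  # crude similarity to model answer
        1.0,                                               # bias term
    ])

def train_area_models(answers, model_answer, examiner_scores):
    """examiner_scores: dict mapping each evaluation area to the list of scores
    the examiner gave to the corresponding answers. Returns one weight vector
    (the per-area scoring model) for every evaluation area."""
    X = np.stack([extract_features(a, model_answer) for a in answers])
    return {area: np.linalg.lstsq(X, np.asarray(scores, dtype=float), rcond=None)[0]
            for area, scores in examiner_scores.items()}
```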
  • The correlation model generation unit 120 models the correlations between evaluation areas present in the examiner's scoring data, reflecting language pedagogical characteristics, evaluation area characteristics, the examiner's answer evaluation characteristics, and the like. To this end, the correlation model generation unit 120 analyzes the correlations between evaluation areas using the previously scored data that was used to generate the per-area scoring models, and generates the correlation model.
  • For example, as shown in FIGS. 5A to 5C, the correlation model generation unit 120 may represent the characteristics by which evaluation areas influence one another's scores as an occurrence probability table over the score pairs of two evaluation areas.
  • In the illustrated example, first to fourth evaluation areas are set and each is scored in the range of 0 to 5 points; taking the fourth evaluation area (Rubric #4) as a reference, its score correlations with the other evaluation areas (Rubric #1, #2, #3) are analyzed and illustrated.
  • FIG. 5A illustrates the correlation between the first evaluation area (Rubric #1) and the fourth evaluation area (Rubric #4) as occurrence probabilities for each score pair, FIG. 5B illustrates the correlation between the second evaluation area (Rubric #2) and the fourth evaluation area (Rubric #4), and FIG. 5C illustrates the correlation between the third evaluation area (Rubric #3) and the fourth evaluation area (Rubric #4).
  • Through such a correlation model, the occurrence probability of each score pair between evaluation areas can be checked. For example, referring to FIG. 5C, when the third evaluation area (Rubric #3) is three points, the probability that the fourth evaluation area (Rubric #4) is zero points is 0%, one point 0.2%, two points 5.6%, three points 16.4%, four points 0.4%, and five points 0%. Therefore, when the third evaluation area (Rubric #3) is three points, the score of the fourth evaluation area (Rubric #4) is very likely to be three or two points.
  • Likewise, when the fourth evaluation area (Rubric #4) is three points, the probability that the third evaluation area (Rubric #3) is zero or one point is 0%, two points 2.8%, three points 16.4%, four points 6.6%, and five points 0.6%.
  • Through this correlation model between evaluation areas, an answer that receives a high score in the third evaluation area (Rubric #3) is found to be likely to receive a high score in the fourth evaluation area (Rubric #4), and an answer that receives a low score in the third evaluation area is found to be likely to receive a low score in the fourth evaluation area. This is because, linguistically, the evaluation areas of a particular answer are connected rather than independent of one another (a sketch of how such a table could be estimated follows below).
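A minimal sketch of how such pairwise occurrence probability tables (the format of FIGS. 5A to 5C) could be estimated from examiner-scored answers; the dictionary representation and the rubric names are illustrative assumptions.

```python
# Correlation model as pairwise occurrence-probability tables estimated from
# examiner scoring data (illustrative sketch).
from collections import Counter
from itertools import combinations

RUBRICS = ["Rubric#1", "Rubric#2", "Rubric#3", "Rubric#4"]
SCORES = range(6)  # 0..5 points

def build_correlation_models(scored_answers):
    """scored_answers: one dict of examiner scores per answer,
    e.g. {"Rubric#1": 4, "Rubric#2": 3, "Rubric#3": 3, "Rubric#4": 3}.
    Returns, for every rubric pair, a table (score_a, score_b) -> occurrence probability."""
    n = len(scored_answers)
    models = {}
    for a, b in combinations(RUBRICS, 2):   # four evaluation areas -> six pairwise models
        counts = Counter((ans[a], ans[b]) for ans in scored_answers)
        models[(a, b)] = {(sa, sb): counts[(sa, sb)] / n
                          for sa in SCORES for sb in SCORES}
    return models
```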
  • The automatic scoring unit 130 receives new scoring target data, that is, a test answer written by an examinee that is to be scored, and automatically calculates a score for each of one or more evaluation areas of the scoring target data using the per-area scoring models generated by the scoring model generation unit 110.
  • The score tuning unit 140 tunes the per-area automatic scores of the scoring target data output from the automatic scoring unit 130 using the correlation model between evaluation areas generated by the correlation model generation unit 120.
  • Specifically, the score tuning unit 140 compares the automatic scores of the evaluation areas, selects an abnormal evaluation area whose score deviates from the others by more than a preset range, and may adjust the automatic score of the abnormal evaluation area based on the correlation model between the selected abnormal evaluation area and the remaining evaluation areas.
  • FIG. 2 is a diagram illustrating a method for performing automatic scoring by applying a correlation model between evaluation areas in an automatic evaluation service system according to an exemplary embodiment of the present invention.
  • First, the automatic scoring apparatus 100 collects, in step 1101, one or more pieces of scoring data previously scored by examiners.
  • The one or more pieces of scoring data include the scores that each of one or more examiners assigned to one or more answers for the one or more evaluation areas.
  • Then, in step 1102, the automatic scoring apparatus 100 generates a scoring model for each evaluation area through machine learning based on the collected scoring data. More specifically, the automatic scoring apparatus 100 analyzes, from the answer corresponding to the scoring data, the evaluation features that can be evaluated automatically (for example, word count, number of adjectives, grammatical errors, spelling errors, tense agreement, and similarity to a model answer), and then generates, by machine learning for each evaluation area, a scoring model that calculates the per-area score from the analyzed evaluation features and the at least one piece of scoring data.
  • In step 1103, the automatic scoring apparatus 100 generates a correlation model between evaluation areas, as shown in FIGS. 5A to 5C, based on the per-area scoring data collected earlier.
  • the correlation model between evaluation areas is a structural representation of the correlation between two evaluation areas. For example, if four evaluation areas exist, six correlation models may be generated.
  • For example, the correlation model between the evaluation areas may be implemented in the form of a table defining the occurrence probability of each score pair of the two evaluation areas.
  • Subsequently, in step 1104, the automatic scoring apparatus 100 newly receives scoring target data, that is, an answer to the specific problem prepared by an examinee who took the test.
  • In step 1105, the automatic scoring apparatus 100 calculates an automatic score for each of the one or more evaluation areas of the scoring target data by applying the scoring model generated for each evaluation area. Specifically, one or more evaluation features are extracted from the new scoring target data, and the extracted features are input to each per-area scoring model to calculate the automatic score for that evaluation area (see the illustrative sketch below).
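Assuming per-area scoring models in the weight-vector form of the earlier training sketch, the inference step could look roughly like this; the rounding and clipping to the 0 to 5 scale are illustrative assumptions.

```python
# Illustrative sketch of step 1105: apply each evaluation area's scoring model
# to the evaluation features extracted from the new scoring target data.
import numpy as np

def auto_score(features: np.ndarray, area_models: dict) -> dict:
    return {area: int(np.clip(round(float(features @ w)), 0, 5))
            for area, w in area_models.items()}
```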
  • the automatic scoring scores for the evaluation areas calculated as described above may include errors because the correlations between the evaluation areas are not reflected.
  • the present invention further performs a process of tuning the automatic scoring result using the correlation model described below.
  • In step 1106, the automatic scoring apparatus 100 compares the per-area automatic scores calculated through the automatic scoring and selects an abnormal evaluation area whose correlation spacing from the other areas is outside the preset range.
  • Here, the correlation spacing may be defined as the difference between the scores of two evaluation areas, or as the probability that the automatic scores of the two evaluation areas occur together (see the illustrative sketch below).
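A minimal sketch of these two readings of the correlation spacing; treating unlikeliness as one minus the joint occurrence probability of the correlation model is an assumption, not something the text states.

```python
# Two possible definitions of the correlation spacing between the automatic
# scores of two evaluation areas (illustrative sketch).
def spacing_by_difference(score_a: int, score_b: int) -> int:
    """Absolute difference between the two automatic scores."""
    return abs(score_a - score_b)

def spacing_by_joint_probability(score_a: int, score_b: int, joint_table: dict) -> float:
    """How unlikely the two scores are to occur together, based on the pairwise
    occurrence-probability table of the correlation model (assumed form)."""
    return 1.0 - joint_table.get((score_a, score_b), 0.0)
```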
  • FIG. 6 is an example for explaining the automatic scoring method according to the present invention. The examinee number is information for identifying each examinee, the examiner's subjective scoring results for each examinee's answer are shown on the left, and the automatic scores calculated for the same answers using the per-area scoring models are shown on the right. In this example, scoring is performed on four evaluation areas (Rubric #1 to #4).
  • For the examinee with examinee number "20121102", the score of the first evaluation area (Rubric #1) was calculated as 4 points, the second evaluation area (Rubric #2) as 3 points, the third evaluation area (Rubric #3) as 3 points, and the fourth evaluation area (Rubric #4) as 0 points.
  • In this case, the automatically calculated score of the fourth evaluation area (Rubric #4) is 0 points, which differs greatly from the scores of the other evaluation areas, so the fourth evaluation area (Rubric #4) can be selected as an abnormal evaluation area.
  • For example, the selection of the abnormal evaluation area may be made, for each evaluation area, based on the difference between its own automatic score and the average of the automatic scores of the remaining evaluation areas. That is, an evaluation area whose score differs from the average of the remaining areas' automatic scores by more than a predetermined reference value is selected as an abnormal evaluation area.
  • The selection threshold for the abnormal evaluation area may be determined arbitrarily (see the illustrative sketch below).
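A minimal sketch of this selection rule; the threshold of 2 points is an arbitrary illustration, since the text notes that the criterion may be determined freely.

```python
def select_abnormal_areas(auto_scores: dict, threshold: float = 2.0) -> list:
    """Select evaluation areas whose automatic score differs from the mean of the
    remaining areas' automatic scores by more than the (arbitrary) threshold."""
    abnormal = []
    for area, score in auto_scores.items():
        others = [s for a, s in auto_scores.items() if a != area]
        if abs(score - sum(others) / len(others)) > threshold:
            abnormal.append(area)
    return abnormal

# FIG. 6 example (examinee 20121102): Rubric#4 = 0 vs. a mean of 3.33 for the others.
print(select_abnormal_areas({"Rubric#1": 4, "Rubric#2": 3, "Rubric#3": 3, "Rubric#4": 0}))
# -> ['Rubric#4']
```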
  • Next, the automatic scoring apparatus 100 tunes the automatic score of the selected abnormal evaluation area by applying the correlation model between the evaluation areas. Specifically, the automatic scoring apparatus 100 checks the automatic score of the selected abnormal evaluation area and the automatic scores of the remaining evaluation areas, and calculates, through the correlation model and based on the automatic scores of the remaining evaluation areas, the occurrence probability of each possible score (for example, 0 to 5 points) of the abnormal evaluation area. The automatic scoring apparatus 100 then obtains, for each possible score of the abnormal evaluation area, the sum of the occurrence probabilities with the automatic scores of the remaining evaluation areas, extracts the score with the highest probability sum, and performs score tuning by changing the automatic score of the abnormal evaluation area to that score (see the illustrative sketch below).
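A minimal sketch of the tuning step, assuming pairwise correlation models in the joint-probability form of the earlier sketch; in effect the tuned score is the candidate s that maximizes the sum, over the remaining areas j, of the occurrence probability of the pair (s, automatic score of area j).

```python
SCORES = range(6)  # candidate scores 0..5

def tune_abnormal_area(abnormal, auto_scores, pair_models):
    """Replace the abnormal area's automatic score with the candidate score whose
    summed occurrence probability with the remaining areas' scores is highest."""
    def pair_prob(area_a, s_a, area_b, s_b):
        # Pairwise tables are assumed to be stored once per unordered rubric pair.
        if (area_a, area_b) in pair_models:
            return pair_models[(area_a, area_b)].get((s_a, s_b), 0.0)
        return pair_models[(area_b, area_a)].get((s_b, s_a), 0.0)

    best = max(SCORES, key=lambda s: sum(
        pair_prob(abnormal, s, other, other_score)
        for other, other_score in auto_scores.items() if other != abnormal))
    tuned = dict(auto_scores)
    tuned[abnormal] = best
    return tuned
```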
  • In the example of FIG. 6, the fourth evaluation area is selected as the abnormal evaluation area from the automatic scoring result of the examinee with examinee number "20121102", and the automatic scores of the remaining first, second, and third evaluation areas are 4 points, 3 points, and 3 points, respectively.
  • In this case, the automatic scoring apparatus 100 checks the occurrence probability of each score (0 to 5 points) of the fourth evaluation area when the first evaluation area is 4 points, the occurrence probability of each score of the fourth evaluation area when the second evaluation area is 3 points, and the occurrence probability of each score of the fourth evaluation area when the third evaluation area is 3 points.
  • Then, for each possible score of the fourth evaluation area, the sum of these occurrence probabilities with the automatic scores of the first, second, and third evaluation areas is obtained, and the score of the fourth evaluation area for which the sum is maximal is detected.
  • When the automatic scores of the first to third evaluation areas (Rubric #1 to #3) are 4, 3, and 3, respectively, it can be seen that a score of 3 points in the fourth evaluation area (Rubric #4) has the highest occurrence probability, at 40%.
  • Accordingly, the automatic scoring apparatus 100 changes the automatic score of the fourth evaluation area, which was selected as the abnormal evaluation area, from 0 points to 3 points, as shown in FIG. 8.
  • As shown in FIG. 8, the final automatic scoring result of the automatic scoring apparatus 100 is thereby adjusted to be similar to the examiner's scoring result.
  • In this way, the automatic scoring apparatus 100 may calculate final automatic scoring result data through score tuning and provide final evaluation result information based on the calculated final automatic scoring result data to the examinee.
  • the automatic evaluation apparatus and method according to the present invention can be applied to an automatic evaluation service system based on a network.
  • FIG. 3 is a diagram illustrating a configuration of an automatic evaluation service system to which an automatic evaluation apparatus according to an exemplary embodiment of the present invention is applied.
  • The automatic evaluation service system may include a plurality of terminal devices 20 and an evaluation service server 30 including an automatic scoring device 100_1, connected through a communication network 10.
  • Each of the plurality of terminal devices 20 is a terminal capable of transmitting and receiving various data via the communication network 10 according to a user's key manipulation, and may be one of a tablet PC, a laptop computer, a personal computer (PC), a smart phone, a personal digital assistant (PDA), a smart TV, and a mobile communication terminal.
  • the terminal device 20 is a terminal for performing voice or data communication using the communication network 10, and stores a browser, a program, and a protocol for communicating with the evaluation service server 30 via the communication network 10.
  • the terminal device 20 may be any terminal as long as server-client communication with the evaluation service server 30 is possible, and is a broad concept including all communication computing devices such as notebook computers, mobile communication terminals, and PDAs. Meanwhile, the terminal device 20 is preferably manufactured in a form having a touch screen, but is not necessarily limited thereto.
  • In particular, the plurality of terminal devices 20 refer to terminals that receive the automatic scoring service, and each may be a terminal device of an examinee or a terminal device of an examiner.
  • The plurality of terminal devices 20 interoperate with the evaluation service server 30 through the communication network 10, receive a test answer from an examinee, transmit the test answer to the evaluation service server 30, and may receive from the evaluation service server 30 an automatic evaluation result for the test answer.
  • That is, a terminal device 20 may receive from the evaluation service server 30 the scoring result data automatically scored by applying the correlation model between evaluation areas, and present it to the user.
  • the evaluation service server 30 is a server device that performs an automatic evaluation on an answer transmitted from the terminal device 20 and provides the evaluation result.
  • In particular, the evaluation service server 30 may include an automatic scoring device 100_1 to which the correlation model according to the present invention is applied.
  • the automatic scoring apparatus 100_1 may provide an automatic scoring service in cooperation with a plurality of terminal apparatuses 20 through the communication network 10.
  • the automatic scoring apparatus 100_1 may collect scoring data for each evaluation area from the examiner and store the evaluation data in advance for each evaluation area in the database. At this time, the scoring data and evaluation data for each evaluation area may be directly input from the examiner or may be transmitted through the communication network 10.
  • The automatic scoring device 100_1 generates a scoring model for each evaluation area through machine learning using the collected scoring data and the evaluation features of each evaluation area, and, by comparing the scoring results across evaluation areas, may create a correlation model between evaluation areas that reflects language pedagogical characteristics, evaluation area characteristics, the examiner's answer evaluation characteristics, and the like.
  • When the automatic scoring apparatus 100_1 receives new scoring target data from a terminal device 20, it extracts evaluation features from the new scoring target data. The extracted evaluation features are then input to the generated per-area scoring models to calculate an automatic score for each evaluation area of the new scoring target data.
  • Subsequently, the automatic scoring apparatus 100_1 applies the generated correlation model between the evaluation areas and selects an abnormal evaluation area whose correlation spacing is greater than a predetermined reference value.
  • Then, the automatic scoring apparatus 100_1 calculates, using the correlation model and based on the automatic scores of the remaining evaluation areas other than the selected abnormal evaluation area, the occurrence probability of each possible score of the abnormal evaluation area, compares these occurrence probabilities, and applies the score with the highest probability as the automatic score of the selected abnormal evaluation area.
  • the automatic scoring apparatus 100_1 may provide the terminal apparatus 20 with the final automatic scoring score thus calculated. Since the detailed configuration of the automatic scoring apparatus 100_1 has been described with reference to FIGS. 1 and 2, a redundant description thereof will be omitted.
  • the automatic scoring method according to the present invention may be implemented and used in the form of a program mounted on the terminal device.
  • FIG. 4 is a diagram illustrating a terminal device having a program according to an automatic evaluation method according to an exemplary embodiment of the present invention.
  • the terminal device 40 may include a control unit 210, a communication unit 220, an input unit 230, a storage unit 240, and an output unit 250.
  • The terminal device 40 may be any user information processing device capable of installing and executing the automatic scoring program 100_2 according to the present invention and of performing the automatic scoring method according to the present invention.
  • For example, the terminal device 40 may be a tablet PC, a laptop computer, a personal computer (PC), a smart phone, a personal digital assistant (PDA), a smart TV, a mobile communication terminal, or the like.
  • the controller 210 controls the overall operation of the terminal device 40 and the operation related to the automatic scoring service execution.
  • In particular, the controller 210 executes an application for taking a test according to input test request information, and controls the output unit 250 to display a test question or the like on its screen.
  • Accordingly, the controller 210 receives and processes the answer to the test question, that is, the scoring target data, through the input unit 230, and stores the processed scoring target data in the storage unit 240.
  • Then, the controller 210 executes the automatic scoring program 100_2 to control automatic scoring of the new scoring target data.
  • In addition, the controller 210 controls the output unit 250 to present the final automatic scoring result information to the user through its screen.
  • the communication unit 220 is for transmitting and receiving data through a communication network.
  • the communication unit 220 may transmit and receive data through various communication methods as well as wired and wireless methods.
  • the communication unit 220 may transmit and receive data using one or more communication methods, and for this purpose, the communication unit 220 may include a plurality of communication modules that transmit and receive data according to different communication methods.
  • The input unit 230 may generate a user input signal corresponding to a user's request or information according to the user's operation, and may be implemented by various input means that are currently commercialized or may be commercialized in the future. For example, in addition to general input devices such as a keyboard, a mouse, a joystick, a touch screen, and a touch pad, it may include a gesture input means that detects a user's motion and generates a specific input signal.
  • the input unit 230 may transfer information input from the user to the controller 210. That is, the input unit 230 may receive an answer to a test question, that is, new scoring target data, from an evaluator.
  • the storage unit 240 stores information necessary for the operation of the terminal device 40, and in particular, may store information related to an automatic scoring service.
  • In particular, the automatic scoring program 100_2, programmed to perform the automatic scoring method according to the present invention, may be stored in the storage unit 240.
  • The storage unit 240 may include magnetic media such as a hard disk, a floppy disk, and magnetic tape; optical recording media such as a compact disc read-only memory (CD-ROM) and a digital video disc (DVD); magneto-optical media such as a floptical disk; and ROM, random access memory (RAM), and flash memory.
  • The output unit 250 is a means for allowing the user to recognize the operation result or state of the terminal device 40, and may include, for example, a display unit for visual output through a screen or a speaker for audible output.
  • a screen related to an automatic scoring service driven by the terminal device 40 may be displayed, and a screen for executing the automatic scoring service may be displayed according to a user's request.
  • In particular, the output unit 250 may display an answer to a test question input by an examinee, that is, the scoring target data, or display the automatic scores for the scoring target data on the screen.
  • With the above configuration, the terminal device 40 executes the automatic scoring program 100_2 to calculate an automatic score for each evaluation area of the user's answer, that is, the scoring target data input through the input unit 230, using the per-area scoring models. It then selects, using the correlation model between evaluation areas, an abnormal evaluation area whose score is out of a predetermined range, calculates the occurrence probability of each possible score of the abnormal evaluation area based on the automatic scores of the remaining evaluation areas, and changes the automatic score of the abnormal evaluation area to the score with the highest probability.
  • the terminal device 40 may provide the user with the automatic scoring result finally calculated as described above.
  • program instructions recorded in the automatic scoring program 100_2 may be those specially designed and configured for the present invention or may be known and available to those skilled in computer software.
  • The present invention relates to an automatic scoring apparatus and method in which, when scoring data is scored by one or more evaluation areas, a correlation model between evaluation areas is generated by reflecting language pedagogical characteristics, evaluation area characteristics, the examiner's answer evaluation characteristics, and the like, which makes it possible to model more realistically the implicit judgment criteria that examiners subjectively apply.
  • In addition, the present invention applies the generated correlation model between evaluation areas to select an abnormal evaluation area whose correlation spacing from the other evaluation areas is out of a predetermined range, and corrects its automatic score to the score most likely to occur given the automatic scores of the remaining evaluation areas.
  • Accordingly, the present invention is useful when applied to automatic scoring services, since it performs automatic scoring more similarly to the examiner's scoring by taking the scoring correlations between the evaluation areas into account, and can thereby contribute to the development of the service industry.

Abstract

The present invention relates to an apparatus and method for automatic scoring. According to the present invention: the implicit determination criteria of an examiner can be modeled realistically by generating a correlation model between evaluation regions based on language education characteristics, evaluation region characteristics, and the answer evaluation characteristics of the examiner; one or more evaluation regions of scoring target data are automatically scored by applying a pre-generated scoring model for each evaluation region; and reliable automatic scoring results can be obtained by using the correlation model between evaluation regions to tune the automatic scores of the one or more evaluation regions.

Description

Apparatus and method for automatic scoring
The present invention relates to an automatic scoring technique for automatically scoring a user's answer through machine learning, and more particularly, to an automatic scoring apparatus and method for automatically scoring target data in consideration of the correlations between evaluation areas.
The contents described in this section merely provide background information on the present embodiment and do not constitute prior art.
With the development of communication technology, it has recently become possible to take language tests and simple level tests over a network; to this end, a server device that provides a test scores the test and provides the scoring result. Conventionally, in order to grade the answers to such tests, grading results were provided by having a person grade the answers directly and input the grading data into a server device.
However, this scoring method requires a great deal of manpower for scoring and considerable time to check the scoring results, so it is difficult to provide a fast service.
To improve on this, automatic scoring systems that score answers automatically through machine learning, rather than relying on human graders, have recently been developed. Such a conventional automatic scoring system collects examiners' subjective scoring data for a number of existing answers, analyzes the items in each answer that can be evaluated by machine learning (evaluation features), generates a scoring model based on the machine-evaluable items and the examiners' subjective scoring results through machine learning, and performs automatic scoring by analyzing the similarity of answers against the generated scoring model.
However, owing to the characteristics of language pedagogy, the scoring areas are not completely mutually exclusive, and an examiner's scores for the different evaluation areas influence one another. Because the conventional automatic scoring system does not reflect these characteristics, its automatic scores agree poorly with the examiner's scoring results and its accuracy suffers.
The present invention is proposed to address these drawbacks and aims to provide an automatic scoring apparatus and method that, when scoring target data containing a user-written answer automatically through machine learning, score the target data in consideration of the correlations between the evaluation areas.
The present invention also aims to provide an automatic scoring apparatus and method that generate a correlation model between evaluation areas by reflecting language pedagogical characteristics, evaluation area characteristics, the examiner's answer evaluation characteristics, and the like, and that apply the generated correlation model to compensate for errors in the scoring model of each evaluation area.
As a means of solving the problem, the present invention provides an automatic scoring apparatus including: an automatic scoring unit that performs automatic scoring for each evaluation area on scoring target data by applying a pre-generated scoring model for each evaluation area; and a score tuning unit that calculates a final automatic score by adjusting the per-area automatic scores of the scoring target data output from the automatic scoring unit according to a correlation model between evaluation areas.
The automatic scoring apparatus according to the present invention may further include at least one of: a scoring model generation unit that generates the scoring model for each evaluation area through machine learning using previously scored data, in which the one or more evaluation areas were evaluated for one or more answers, and one or more evaluation features extracted from the answers; and a correlation model generation unit that generates the correlation model between evaluation areas, which defines the probability of each score occurring across the one or more evaluation areas, based on the previously scored data.
In the automatic scoring apparatus according to the present invention, the score tuning unit may compare the automatic scores of the evaluation areas, select an abnormal evaluation area whose scoring-correlation deviation from the other evaluation areas is larger than a preset range, and tune the automatic score of the abnormal evaluation area using the correlation model between the evaluation areas.
Also, in the automatic scoring apparatus according to the present invention, the score tuning unit may use the correlation model to calculate, based on the automatic scores of the remaining evaluation areas other than the abnormal evaluation area, the occurrence probability of each possible score of the selected abnormal evaluation area, and may change the automatic score of the abnormal evaluation area to the score having the highest probability.
In addition, as another means for solving the above-described problem, an embodiment of the present invention provides an automatic scoring method including: performing automatic scoring for each of one or more evaluation areas on scoring target data by applying a pre-generated scoring model for each evaluation area; and tuning the automatic scores of the one or more evaluation areas using a correlation model between evaluation areas.
In the automatic scoring method according to an embodiment of the present invention, the tuning may include: comparing the automatic scores of the evaluation areas and selecting an abnormal evaluation area whose deviation is larger than a preset range; calculating the occurrence probability of each possible score of the selected abnormal evaluation area based on the automatic scores of the remaining evaluation areas other than the abnormal evaluation area; and changing the automatic score of the abnormal evaluation area to the score having the highest probability.
In addition, before the automatic scoring is performed, the automatic scoring method according to an embodiment of the present invention may further include at least one of: generating the scoring model for each evaluation area through machine learning using previously scored data, in which the one or more evaluation areas were evaluated for one or more answers, and one or more evaluation features extracted from the answers; and generating the correlation model between evaluation areas, which defines the probability of each score occurring across the one or more evaluation areas, based on the previously scored data.
In addition, the present invention provides a computer-readable recording medium on which a program for executing the above-described automatic scoring method is recorded.
The present invention relates to a technique for automatically evaluating an answer written by a user in one or more language areas including speaking, listening, and writing. In particular, in evaluating one or more evaluation areas for an answer written by a test taker, a correlation model between the evaluation areas is created that reflects language pedagogical characteristics, evaluation area characteristics, the examiner's answer evaluation characteristics, and the like, so that the implicit judgment criteria for the evaluation areas can be modeled more realistically.
In addition, by applying the generated correlation model between the evaluation areas, the present invention reflects the correlations that can appear between evaluation areas, minimizes errors relative to the examiner's answer evaluation characteristics when performing automatic scoring through the per-area scoring models, and increases the reliability of the evaluation result.
FIG. 1 is a diagram illustrating an automatic scoring apparatus according to an exemplary embodiment of the present invention.
FIG. 2 is a diagram illustrating a method for performing automatic scoring by applying a correlation model between evaluation areas according to an embodiment of the present invention.
FIG. 3 is a diagram illustrating the configuration of an automatic evaluation service system to which an automatic scoring apparatus according to the present invention is applied.
FIG. 4 is a diagram illustrating a terminal device to which an automatic scoring method according to an exemplary embodiment of the present invention is applied.
FIGS. 5A to 5C are correlation tables between evaluation areas for describing a correlation model between evaluation areas according to an embodiment of the present invention.
FIGS. 6 to 8 are diagrams showing an example of an automatic scoring process to which a correlation model between evaluation areas is applied according to an embodiment of the present invention.
Hereinafter, exemplary embodiments of the present invention will be described in detail with reference to the accompanying drawings. However, detailed descriptions of well-known functions or configurations that may obscure the subject matter of the present invention are omitted from the following description and the accompanying drawings. It should also be noted that, throughout the drawings, like elements are denoted by the same reference numerals wherever possible.
The terms and words used in this specification and the claims should not be construed as limited to their ordinary or dictionary meanings; based on the principle that an inventor may appropriately define terms to describe his or her own invention in the best way, they should be interpreted with meanings and concepts consistent with the technical spirit of the present invention. Therefore, the embodiments described in this specification and the configurations shown in the drawings are merely the most preferred embodiments of the present invention and do not represent all of its technical spirit, and it should be understood that various equivalents and modifications could replace them at the time of filing.
이하의 설명에 있어서,"평가영역"은 특정 평가 시험과 관련하여 시험관간의 채점을 정형화하기 위하여 설정된 채점 기준으로서, 채점 영역 및 그 채점 영역의 평가 내용으로 정의될 수 있다. 예를 들어, 외국어에 대한 말하기 평가의 경우, 평가영역은 유창성, 언어사용, 구성력, 발음으로 이루어진 채점 영역을 포함할 수 있다. 여기서, 유창성은 발화속도의 적절성, 머뭇거림이 없이 자연스런 발화 유지 정도를 평가하는 요소이다. 언어사용은 표현의 정확성 및 어휘 사용의 적절성을 평가하는 요소이다. 구성력은 발화의 논리적 연결성 및 발화 내용의 일관성/응집성을 평가하는 요소이다. 발음은 발음의 명확성, 이해 가능 정도를 평가하는 요소이다. 본 발명에서는 이러한 기 설정된 하나 이상의 평가영역에 대한 자동 채점을 구현하고자 한다.In the following description, " evaluation area " is a grading criterion set to standardize in-vitro scoring in relation to a specific evaluation test, and can be defined as a scoring area and evaluation contents of the scoring area. For example, in the case of a speech evaluation for a foreign language, the evaluation area may include a scoring area consisting of fluency, language use, compositional power, and pronunciation. Here, fluency is an element for evaluating the degree of natural ignition without appropriateness and hesitation. Language usage is a factor in evaluating the correctness of expression and the adequacy of vocabulary usage. Constructivity is a factor that evaluates the logical connectivity of speech and the consistency / aggregation of speech content. Pronunciation is a factor that assesses the clarity and comprehension of pronunciation. In the present invention, it is intended to implement automatic scoring of one or more predetermined evaluation areas.
우선, 본 발명의 실시 예에 따른 자동 채점 장치 및 방법에 대해 첨부된 도면을 참조하여 구체적으로 설명하기로 한다.First, an automatic scoring apparatus and method according to an embodiment of the present invention will be described in detail with reference to the accompanying drawings.
도 1은 본 발명의 실시 예에 따른 자동 채점을 수행하기 위한 자동 채점 장치의 구성을 도시한 도면이다.1 is a diagram illustrating a configuration of an automatic scoring apparatus for performing automatic scoring according to an embodiment of the present invention.
도 1을 참조하면, 본 발명의 실시 예에 따른 자동 채점 장치(100)는 본 발명에 따라서 특정 문제에 대하여 피 평가자가 작성한 답안을 기 설정된 하나 이상의 평가영역을 기준으로 자동 채점하기 위한 장치이다. 특히 본 발명에 의한 자동 채점 장치(100)는, 하나 이상의 평가영역 별 채점 모델을 이용하여, 채점 대상 데이터에 대하여 하나 이상의 평가영역 별 점수를 자동으로 산출한다. 이어서 자동 채점 장치(100)는 기 생성된 평가영역 간 상관관계 모델을 이용하여 평가영역 별 채점 모델로 채점한 각 평가영역의 자동 채점 점수를 비교하여, 기 설정된 범위를 벗어나는 점수를 갖는 이상(異常) 평가영역의 자동 채점 점수를 튜닝한다.Referring to FIG. 1, the automatic scoring apparatus 100 according to an exemplary embodiment of the present invention is an apparatus for automatically scoring an answer written by an evaluator for a specific problem based on one or more preset evaluation areas. In particular, the automatic scoring apparatus 100 according to the present invention automatically calculates the scores of one or more evaluation areas for the scoring target data using one or more evaluation area scoring models. Subsequently, the automatic scoring apparatus 100 compares the automatic scoring scores of each evaluation area scored by the scoring model for each evaluation area by using the correlation model between the evaluation areas that have been previously generated, and has an abnormal score having a score outside the preset range. ) Tune the automatic scoring of the evaluation area.
이를 위하여, 자동 채점 장치(100)는 하나 이상의 답안에 대하여 기준이 되는 채점 데이터, 예를 들어, 시험관에 의해 직접 채점된 하나 이상의 평가영역에 대한 채점 데이터를 수집한다. 또한, 자동 채점 장치(100)는 상기 하나 이상의 답안으로부터 하나 이상의 평가자질을 추출할 수 있다. 그리고, 상기 자동 채점 장치(100)는 각 답안 별 평가자질과 기 채점 데이터를 이용하여 기계 학습을 수행함으로써, 평가영역 별 채점 모델을 생성할 수 있다.To this end, the automatic scoring apparatus 100 collects scoring data which is a reference for one or more answers, for example, scoring data for one or more evaluation areas that are directly scored by the examiner. In addition, the automatic scoring device 100 may extract one or more evaluation qualities from the one or more answers. In addition, the automatic scoring apparatus 100 may generate a scoring model for each evaluation area by performing machine learning using the evaluation quality for each answer and the previous scoring data.
Using the generated per-area scoring models, the automatic scoring apparatus 100 may automatically score newly input target data for each evaluation area.
In addition, the automatic scoring apparatus 100 may generate, in advance, a correlation model between evaluation areas using the scoring data.
The automatic scoring apparatus 100 may include a scoring model generator 110, a correlation model generator 120, an automatic scoring unit 130, and a score tuning unit 140. The scoring model generator 110, the correlation model generator 120, the automatic scoring unit 130, and the score tuning unit 140 may be implemented in hardware, in software, or in a combination of hardware and software. For example, they may be implemented as software programmed to perform the functions described below, combined with a microprocessor that executes the software.
The scoring model generator 110 generates a scoring model for each evaluation area through machine learning, using per-area scoring data for one or more answers previously scored by examiners and one or more evaluation features extracted from those answers.
Specifically, the scoring model generator 110 receives evaluation features extracted from one or more answers, that is, items that can be evaluated automatically (for example, word count, number of adjectives, grammatical errors, spelling errors, tense agreement, similarity to a model answer, and so on). The scoring model generator 110 then machine-learns these evaluation features together with the examiners' per-area scoring data for the answers, thereby generating a per-area scoring model that defines the relationship between the evaluation features and the score for each evaluation area. In other words, the examiners' subjective evaluation criteria are modeled on the basis of one or more automatically evaluable features.
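As a rough illustration of this step, the following Python sketch trains one scoring model per evaluation area from automatically extractable features and examiner scores, and then scores a new answer. It is only an illustrative sketch: the patent does not specify the learning algorithm, so ordinary least-squares regression stands in for it, and the feature values, score data, and area names are hypothetical.

from sklearn.linear_model import LinearRegression

# Hypothetical feature vectors extracted from three answers:
# [word count, adjective count, grammar errors, spelling errors, similarity to model answer]
features = [
    [120, 14, 2, 1, 0.82],
    [45, 3, 9, 6, 0.31],
    [98, 10, 4, 2, 0.67],
]
# Hypothetical examiner scores (0 to 5) for the same answers, one list per evaluation area.
examiner_scores = {
    "fluency": [4, 1, 3],
    "language_use": [4, 2, 3],
    "organization": [3, 1, 3],
    "pronunciation": [4, 2, 3],
}
# One model per evaluation area, learned from the features and the examiner scores.
scoring_models = {}
for area, scores in examiner_scores.items():
    model = LinearRegression()
    model.fit(features, scores)
    scoring_models[area] = model
# Scoring a new answer: predict per area, then clamp and round to the 0-5 scale used in the embodiment.
new_features = [[110, 12, 3, 1, 0.74]]
auto_scores = {
    area: int(round(min(5.0, max(0.0, model.predict(new_features)[0]))))
    for area, model in scoring_models.items()
}
print(auto_scores)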
The correlation model generator 120 models the correlations between evaluation areas found in the scoring data produced by examiners, reflecting language-pedagogical characteristics, the characteristics of the evaluation areas, the examiners' answer-evaluation characteristics, and the like. To this end, the correlation model generator 120 analyzes the correlation between evaluation areas using the previously scored data used to generate the per-area scoring models, and generates a correlation model.
For example, as shown in the accompanying FIGS. 5A to 5C, the correlation model generator 120 may define the mutual influence of scoring between evaluation areas as a table of occurrence probabilities for each pair of score ranges. In this embodiment, first to fourth evaluation areas are set and each evaluation area is scored in the range of 0 to 5; the score correlations between the fourth evaluation area (Rubric #4) and the other evaluation areas (Rubric #1, #2, #3) are analyzed as an example. Specifically, FIG. 5A shows the correlation between the first evaluation area (Rubric #1) and the fourth evaluation area (Rubric #4) as the occurrence probability of each score pair, FIG. 5B shows the correlation between the second evaluation area (Rubric #2) and the fourth evaluation area (Rubric #4) as the occurrence probability of each score pair, and FIG. 5C shows the correlation between the third evaluation area (Rubric #3) and the fourth evaluation area (Rubric #4).
Using such a correlation model, the probability of score combinations between evaluation areas can be checked. For example, referring to FIG. 5C, when the third evaluation area (Rubric #3) is 3 points, the probability that the fourth evaluation area (Rubric #4) is 0 points is 0%, 1 point is 0.2%, 2 points is 5.6%, 3 points is 16.4%, 4 points is 0.4%, and 5 points is 0%. Therefore, when the third evaluation area (Rubric #3) is 3 points, the score of the fourth evaluation area (Rubric #4) is very likely to be 3 or 2 points. Likewise, when the fourth evaluation area (Rubric #4) is 3 points, the probability that the third evaluation area (Rubric #3) is 0 or 1 point is 0%, 2 points is 2.8%, 3 points is 16.4%, 4 points is 6.6%, and 5 points is 0.6%. The correlation model between evaluation areas thus shows that an answer that receives a high score in the third evaluation area (Rubric #3) is likely to receive a high score in the fourth evaluation area (Rubric #4) as well, and an answer that receives a low score in the third evaluation area is likely to receive a low score in the fourth evaluation area. This is because the evaluation areas for a given answer are not independent of one another but are linked in language-pedagogical terms.
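A minimal sketch of how such a correlation model could be estimated from examiner scoring data is given below. The pairwise tables of score-pair probabilities mirror the per-score-range occurrence tables of FIGS. 5A to 5C in form only; the rubric names and the toy records are assumptions, not data from the drawings.

from collections import Counter
from itertools import combinations

# Hypothetical examiner scores (0 to 5) for four rubrics over a few answers.
scored_answers = [
    {"Rubric #1": 4, "Rubric #2": 3, "Rubric #3": 3, "Rubric #4": 3},
    {"Rubric #1": 2, "Rubric #2": 2, "Rubric #3": 1, "Rubric #4": 2},
    {"Rubric #1": 5, "Rubric #2": 4, "Rubric #3": 4, "Rubric #4": 4},
    {"Rubric #1": 3, "Rubric #2": 3, "Rubric #3": 3, "Rubric #4": 3},
]

def build_correlation_model(records, areas):
    # Returns {(area_a, area_b): {(score_a, score_b): occurrence probability}}.
    model = {}
    for a, b in combinations(areas, 2):  # four areas give six pairwise tables
        counts = Counter((record[a], record[b]) for record in records)
        total = sum(counts.values())
        model[(a, b)] = {pair: n / total for pair, n in counts.items()}
    return model

areas = ["Rubric #1", "Rubric #2", "Rubric #3", "Rubric #4"]
correlation_model = build_correlation_model(scored_answers, areas)
print(correlation_model[("Rubric #3", "Rubric #4")])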
The automatic scoring unit 130 receives new target data to be scored, that is, a test answer submitted by an examinee, and automatically calculates a score for each of the one or more evaluation areas using the per-area scoring models generated by the scoring model generator 110.
The score tuning unit 140 then tunes the per-area automatic scores for the target data output from the automatic scoring unit 130, using the correlation model between evaluation areas generated by the correlation model generator 120. Specifically, the score tuning unit 140 compares the per-area automatic scores, selects an anomalous evaluation area whose correlation deviation exceeds a preset range, and adjusts the automatic score of the anomalous evaluation area on the basis of the correlation model between the selected anomalous evaluation area and the remaining evaluation areas.
Next, the automatic scoring method according to an embodiment of the present invention, implemented in the automatic scoring apparatus configured as described above, will be described in detail with reference to FIG. 2.
FIG. 2 is a diagram illustrating a method for performing automatic scoring by applying a correlation model between evaluation areas in an automatic evaluation service system according to an embodiment of the present invention.
Referring to FIG. 2, the automatic scoring apparatus 100 according to an embodiment of the present invention collects, in step 1101, one or more sets of scoring data previously scored by examiners. The scoring data includes information about the scores that one or more examiners assigned to one or more answers for each of the one or more evaluation areas.
Then, in step 1102, the automatic scoring apparatus 100 generates a scoring model for each evaluation area through machine learning based on the collected scoring data. More specifically, the automatic scoring apparatus 100 analyzes, from the answers corresponding to the per-area scoring data, evaluation features that can be evaluated automatically (for example, word count, number of adjectives, grammatical errors, spelling errors, tense agreement, similarity to a model answer, and so on). The analyzed evaluation features and the scoring data are then machine-learned for each evaluation area to generate a per-area scoring model that calculates a score for each evaluation area based on the automatically evaluable features.
In addition, in step 1103, the automatic scoring apparatus 100 generates a correlation model between evaluation areas based on the collected per-area scoring data, as shown in the accompanying FIGS. 5A to 5C. A correlation model between evaluation areas structures the correlation between two evaluation areas; for example, when there are four evaluation areas, six correlation models may be generated. Here, each correlation model between two evaluation areas may be implemented in a form that defines the occurrence probability of each score pair between the two areas.
Thereafter, in step 1104, the automatic scoring apparatus 100 newly receives target data to be scored, written by an examinee for a specific question.
When new target data is input, the automatic scoring apparatus 100 calculates, in step 1105, an automatic score for each of the one or more evaluation areas by applying the per-area scoring models to the target data. Specifically, one or more evaluation features are extracted from the new target data, and the extracted features are input to the per-area scoring models to calculate an automatic score for each evaluation area.
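By way of illustration, the following sketch extracts a handful of automatically computable features from a written answer. The concrete features (word count, a crude spelling-suspect count, and vocabulary overlap with a model answer as a similarity proxy) and the helper function name are simplified stand-ins for whatever feature set an actual embodiment would use.

import re

def extract_features(answer, model_answer):
    # Tokenize both texts into lowercase word tokens.
    words = re.findall(r"[A-Za-z']+", answer.lower())
    model_words = set(re.findall(r"[A-Za-z']+", model_answer.lower()))
    word_count = len(words)
    # Naive similarity proxy: overlap of the answer's vocabulary with the model answer.
    overlap = len(set(words) & model_words) / max(1, len(model_words))
    # Very rough spelling-suspect proxy: tokens containing the same letter three times in a row.
    spelling_suspects = sum(1 for w in words if re.search(r"(.)\1\1", w))
    return [word_count, spelling_suspects, round(overlap, 3)]

print(extract_features(
    "The city librarrry is open evry day and I often study there.",
    "The library is open daily and students frequently study there.",
))  # [12, 1, 0.6] for this toy pair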
The per-area automatic scores calculated in this way may contain errors because the correlation between evaluation areas is not yet reflected. For this reason, the present invention further performs a process of tuning the automatic scoring result using the correlation model, as described below.
Specifically, in step 1106, the automatic scoring apparatus 100 compares the per-area automatic scores calculated through the automatic scoring and selects an anomalous evaluation area whose correlation deviation falls outside a preset range. Here, the correlation deviation may be defined as the score difference between two evaluation areas or as the probability that the automatic scores of two evaluation areas occur together.
FIG. 6 is an example for explaining the automatic scoring method according to the present invention. The examinee number identifies each examinee; the examiners' subjective scoring results for each examinee's answer are shown on the left, and the automatic scores calculated for the same answers using the per-area scoring models are shown on the right. Here, scoring is performed for four evaluation areas (Rubric #1 to #4).
For example, when the answer of the examinee with examinee number "20121102" shown in FIG. 6 was automatically scored using the per-area scoring models, the first evaluation area (Rubric #1) received 4 points, the second evaluation area (Rubric #2) 3 points, the third evaluation area (Rubric #3) 3 points, and the fourth evaluation area (Rubric #4) 0 points. In this case, when an anomalous evaluation area whose correlation deviation falls outside the preset range is selected according to step 1106, the fourth evaluation area (Rubric #4), whose automatic score of 0 differs greatly from the scores of the other evaluation areas, can be selected as the anomalous evaluation area. Here, the selection of the anomalous evaluation area may be made, for each evaluation area, based on the difference between its own automatic score and the average of the automatic scores of the remaining evaluation areas. That is, an evaluation area whose score differs from the average of the remaining areas' automatic scores by a preset reference value or more is selected as an anomalous evaluation area. The selection criterion δ for the anomalous evaluation area may be determined arbitrarily.
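A minimal sketch of this selection rule, assuming an arbitrarily chosen deviation threshold δ of 2.5 points, is shown below; an area is flagged when its automatic score differs from the mean of the other areas' scores by δ or more.

def select_anomalous_areas(auto_scores, delta=2.5):
    # Flag every area whose score differs from the mean of the other areas by delta or more.
    anomalous = []
    for area, score in auto_scores.items():
        others = [s for a, s in auto_scores.items() if a != area]
        if abs(score - sum(others) / len(others)) >= delta:
            anomalous.append(area)
    return anomalous

auto_scores = {"Rubric #1": 4, "Rubric #2": 3, "Rubric #3": 3, "Rubric #4": 0}
print(select_anomalous_areas(auto_scores))  # ['Rubric #4'] with delta = 2.5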
Thereafter, in step 1107, the automatic scoring apparatus 100 tunes the automatic score of the selected anomalous evaluation area by applying the correlation model between evaluation areas. Specifically, the automatic scoring apparatus 100 checks the automatic score of the selected anomalous evaluation area and the automatic scores of the remaining evaluation areas, and calculates, through the correlation model, the occurrence probability of each possible score of the anomalous evaluation area (for example, 0 to 5 points) given the automatic scores of the remaining evaluation areas. The automatic scoring apparatus 100 then obtains, for each candidate score of the anomalous evaluation area, the sum of the probabilities of the remaining areas' automatic scores occurring together with that candidate, and extracts the candidate score with the highest total probability. The automatic scoring apparatus 100 can then perform score tuning by changing the automatic score of the selected anomalous evaluation area to the score with the highest probability.
Referring to the example shown in FIG. 6, the fourth evaluation area was selected as the anomalous evaluation area from the automatic scoring result for the examinee with examinee number "20121102", and the automatic scores of the remaining first, second, and third evaluation areas were 4, 3, and 3 points, respectively. In this case, as shown in FIG. 7, the automatic scoring apparatus 100 checks the occurrence probability of each score (0 to 5 points) of the fourth evaluation area when the first evaluation area is 4 points, when the second evaluation area is 3 points, and when the third evaluation area is 3 points. Then, for each candidate score of the fourth evaluation area, the sum of the probabilities of the automatic scores of the first, second, and third evaluation areas occurring is obtained, and the score of the fourth evaluation area for which this sum is maximal is detected. Referring to the example of FIG. 7, when the automatic scores of the first to third evaluation areas (Rubric #1 to #3) are 4, 3, and 3, respectively, a score of 3 in the fourth evaluation area (Rubric #4) has the highest occurrence probability, 40%.
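The tuning step itself can be sketched as follows. The probability values in the toy table are invented for illustration and are not the figures of FIG. 7, but the selection logic follows the procedure described above: for each candidate score of the flagged area, the modelled probabilities of the other areas' scores co-occurring with that candidate are summed, and the candidate with the largest sum is kept.

# Made-up fragment of the pairwise correlation model:
# (other area, that area's automatic score, candidate score for the flagged area) -> probability
pair_probability = {
    ("Rubric #1", 4, 3): 0.15, ("Rubric #1", 4, 4): 0.10,
    ("Rubric #2", 3, 3): 0.14, ("Rubric #2", 3, 2): 0.06,
    ("Rubric #3", 3, 3): 0.16, ("Rubric #3", 3, 2): 0.05,
}

def tune_score(other_scores, score_range=range(6)):
    # For each candidate score of the flagged area, sum the probabilities of the
    # other areas' scores co-occurring with it, and keep the best candidate.
    best_score, best_total = None, -1.0
    for candidate in score_range:
        total = sum(
            pair_probability.get((area, score, candidate), 0.0)
            for area, score in other_scores.items()
        )
        if total > best_total:
            best_score, best_total = candidate, total
    return best_score

others = {"Rubric #1": 4, "Rubric #2": 3, "Rubric #3": 3}
print(tune_score(others))  # 3: the candidate with the largest summed probability in this toy table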
Accordingly, as shown in FIG. 8, the automatic scoring apparatus 100 according to the present invention changes the automatic score of the fourth evaluation area, selected as the anomalous evaluation area, from 0 to 3.
As a result, the final automatic scoring result of the automatic scoring apparatus 100 is adjusted to be similar to the result scored by the examiners, as shown in FIG. 8.
Thereafter, in step 1108, the automatic scoring apparatus 100 calculates final automatic scoring result data through the score tuning and may provide final automatic scoring result information for the calculated data to the examinee.
The automatic scoring apparatus and method according to the present invention can be applied to a network-based automatic evaluation service system.
FIG. 3 is a diagram illustrating the configuration of an automatic evaluation service system to which an automatic scoring apparatus according to an embodiment of the present invention is applied.
Referring to FIG. 3, the automatic evaluation service system may be configured with a plurality of terminal devices 20 connected through a communication network 10 and an evaluation service server 30 including an automatic scoring apparatus 100_1.
The plurality of terminal devices 20 are terminals capable of transmitting and receiving various data via the communication network 10 according to a user's key operations, and each may be any one of a tablet PC, a laptop, a personal computer (PC), a smartphone, a personal digital assistant (PDA), a smart TV, a mobile communication terminal, and the like. In addition, each terminal device 20 is a terminal that performs voice or data communication using the communication network 10, and includes a memory storing a browser, programs, and protocols for communicating with the evaluation service server 30 via the communication network 10, and a microprocessor for executing and controlling various programs. That is, the terminal device 20 may be any terminal capable of server-client communication with the evaluation service server 30, and is a broad concept covering communication computing devices such as notebook computers, mobile communication terminals, and PDAs. The terminal device 20 is preferably manufactured with a touch screen, but is not necessarily limited thereto.
In particular, the plurality of terminal devices 20 according to an embodiment of the present invention are terminals for receiving the automatic scoring service, and each may be a terminal device of an examinee or of an examiner. These terminal devices 20 interwork with the evaluation service server 30 through the communication network 10, receive a test answer from an examinee and transmit it to the evaluation service server 30, and may receive the automatic evaluation result for the test answer from the evaluation service server 30. In particular, they may receive, from the evaluation service server 30, scoring result data automatically scored by applying the correlation model between evaluation areas, and present it to the user.
The evaluation service server 30 is a server device that performs automatic evaluation of an answer transmitted from a terminal device 20 and provides the evaluation result, and may include the automatic scoring apparatus 100_1 to which the correlation model according to the present invention is applied.
The automatic scoring apparatus 100_1 may provide the automatic scoring service in cooperation with the plurality of terminal devices 20 through the communication network 10. The automatic scoring apparatus 100_1 may collect per-area scoring data from examiners and store it in advance in a database for each evaluation area. The per-area scoring data and evaluation data may be input directly by the examiners or transmitted through the communication network 10.
In addition, the automatic scoring apparatus 100_1 generates per-area scoring models through machine learning using the collected per-area scoring data and evaluation features, and also compares the scoring results of the evaluation areas to generate a correlation model between evaluation areas that reflects language-pedagogical characteristics, evaluation area characteristics, the examiners' answer-evaluation characteristics, and the like. When new target data is received from a terminal device 20, the automatic scoring apparatus 100_1 extracts evaluation features from the new target data, inputs the extracted features to the generated per-area scoring models, and calculates per-area automatic scores for the new target data. The automatic scoring apparatus 100_1 then applies the generated correlation model between evaluation areas to select an anomalous evaluation area whose correlation deviation is greater than a preset reference value. The automatic scoring apparatus 100_1 calculates, using the correlation model, the occurrence probability of each possible score of the anomalous evaluation area based on the automatic scores of the remaining evaluation areas, compares these probabilities, and applies the score with the highest probability as the automatic score of the selected anomalous evaluation area. The automatic scoring apparatus 100_1 may provide the final automatic score calculated in this way to the corresponding terminal device 20. Since the detailed configuration of the automatic scoring apparatus 100_1 has been described with reference to FIGS. 1 and 2, redundant description is omitted.
In addition, the automatic scoring method according to the present invention may be implemented and used in the form of a program installed on a terminal device.
FIG. 4 is a diagram illustrating a terminal device equipped with a program implementing the automatic evaluation method according to an embodiment of the present invention.
Referring to FIG. 4, the terminal device 40 may include a controller 210, a communication unit 220, an input unit 230, a storage unit 240, and an output unit 250. The terminal device 40 is a user information processing device capable of installing and executing the automatic scoring program 100_2 according to the present invention and thereby performing the automatic scoring method according to the present invention; any terminal capable of installing and executing programs may be used. For example, the terminal device 40 may be any one of a tablet PC, a laptop computer, a personal computer (PC), a smartphone, a personal digital assistant (PDA), a smart TV, a mobile communication terminal, and the like.
The controller 210 controls the overall operation of the terminal device 40 and operations related to execution of the automatic scoring service. In particular, when the controller 210 receives a user's test-taking request signal from the input unit 230, it executes an application for taking a test according to the input request information and controls the output unit 250 to display test questions and the like on its screen. Accordingly, the controller 210 receives and processes, through the input unit 230, information on the answers to the test questions, that is, the target data to be scored, and stores the processed target data in the storage unit 240. The controller 210 then executes the automatic scoring program 100_2 to automatically score the new data. The controller 210 also controls the output unit 250 to present the final automatic scoring result information to the user on its screen.
The communication unit 220 transmits and receives data through a communication network; it may transmit and receive data not only by wired and wireless methods but also through various other communication methods. In addition, the communication unit 220 may transmit and receive data using one or more communication methods, and for this purpose it may include a plurality of communication modules that transmit and receive data according to different communication methods.
The input unit 230 may generate a user input signal corresponding to a user's request or information according to the user's operation, and may be implemented by various input means that are currently commercialized or may be commercialized in the future, including not only general input devices such as a keyboard, a mouse, a joystick, a touch screen, and a touch pad, but also gesture input means that detect a user's motion and generate a specific input signal. The input unit 230 may transfer information input by the user to the controller 210. That is, the input unit 230 may receive from an examinee the answer to a test question, that is, new target data to be scored.
The storage unit 240 stores information necessary for the operation of the terminal device 40 and, in particular, may store information related to the automatic scoring service, including the automatic scoring program 100_2 programmed to perform the automatic scoring method according to the present invention. The storage unit 240 may include magnetic media such as hard disks, floppy disks, and magnetic tape; optical recording media such as CD-ROM (Compact Disk Read Only Memory) and DVD (Digital Video Disk); magneto-optical media such as floptical disks; and ROM, RAM (Random Access Memory), and flash memory.
The output unit 250 is a means for presenting the operation results or status of the terminal device 40 so that the user can perceive them, and may include, for example, a display unit that outputs information visually on a screen and a speaker that outputs audible sound. In particular, it may display screens related to the automatic scoring service running on the terminal device 40 and, at the user's request, display a screen for executing the automatic scoring service. The output unit 250 may also display the answer to a test question input by the examinee, that is, the target data to be scored, or display the automatic score for the target data on the screen.
That is, the terminal device 40 executes the automatic scoring program 100_2 to calculate, using the per-area scoring models, per-area automatic scores for the user's answer input through the input unit 230, that is, the target data to be scored; then, using the correlation model between evaluation areas, it extracts an anomalous evaluation area whose correlation deviation falls outside a preset range, calculates the occurrence probability of each possible score of the anomalous evaluation area based on the automatic scores of the remaining evaluation areas, and changes the automatic score of the anomalous evaluation area to the score with the highest probability. The terminal device 40 may then provide the user with the final automatic scoring result calculated as described above.
Here, the program instructions recorded in the automatic scoring program 100_2 may be those specially designed and configured for the present invention, or may be those known to and usable by those skilled in the computer software art.
Meanwhile, the embodiments of the present invention disclosed in this specification and the drawings are merely specific examples presented to aid understanding and are not intended to limit the scope of the present invention. It is obvious to those of ordinary skill in the art to which the present invention pertains that other modifications based on the technical idea of the present invention can be practiced in addition to the embodiments disclosed herein.
The present invention relates to an automatic scoring apparatus and method. In scoring evaluation data for one or more evaluation areas, a correlation model between evaluation areas is generated by reflecting language-pedagogical characteristics, evaluation area characteristics, the examiners' answer-evaluation characteristics, and the like, which makes it possible to model more realistically the implicit judgment criteria that examiners apply subjectively.
In addition, the present invention applies the generated correlation model between evaluation areas to select an anomalous evaluation area whose correlation deviation falls outside a preset range and tunes its score to the score most likely to occur given the automatic scores of the remaining evaluation areas; since scoring can thereby be made more similar to the examiners' subjective scoring data, automatic evaluation performance can be improved.
The present invention is thus a useful invention that, when applied to an automatic scoring service, produces the effect of performing automatic scoring more similarly to examiners' scoring of test answers by taking the scoring correlations between evaluation areas into account, and can thereby contribute to the development of the service industry.

Claims (10)

1. An automatic scoring apparatus comprising:
an automatic scoring unit configured to perform automatic scoring of target data for each evaluation area by applying a previously generated scoring model for each evaluation area; and
a score tuning unit configured to calculate a final automatic score by tuning the per-area automatic scores for the target data output from the automatic scoring unit according to a correlation model between evaluation areas.
2. The automatic scoring apparatus of claim 1, further comprising:
a scoring model generator configured to generate the per-area scoring model through machine learning using previously scored data in which one or more evaluation areas were evaluated for one or more answers and one or more evaluation features extracted from the one or more answers.
3. The automatic scoring apparatus of claim 2, further comprising:
a correlation model generator configured to generate, based on the previously scored data, the correlation model between evaluation areas, which defines the probability of each score occurring between the one or more evaluation areas.
4. The automatic scoring apparatus of claim 1, wherein the score tuning unit
compares the per-area automatic scores, selects an anomalous evaluation area whose scoring-correlation deviation from the other evaluation areas is greater than a preset range, and tunes the automatic score of the anomalous evaluation area using the correlation model between evaluation areas.
5. The automatic scoring apparatus of claim 4, wherein the score tuning unit
calculates, using the correlation model, the occurrence probability of each possible score of the selected anomalous evaluation area based on the automatic scores of the remaining evaluation areas excluding the anomalous evaluation area, and
changes the automatic score of the anomalous evaluation area to the score having the highest probability.
6. An automatic scoring method comprising:
performing automatic scoring of target data for each of one or more evaluation areas by applying a previously generated scoring model for each evaluation area; and
tuning the automatic scores for the one or more evaluation areas using a correlation model between evaluation areas.
7. The automatic scoring method of claim 6, wherein the tuning comprises:
comparing the automatic scores between evaluation areas and selecting an anomalous evaluation area having a score whose deviation is greater than a preset range;
calculating the occurrence probability of each possible score of the selected anomalous evaluation area based on the automatic scores of the remaining evaluation areas excluding the anomalous evaluation area; and
changing the automatic score of the anomalous evaluation area to the score having the highest probability.
8. The automatic scoring method of claim 6, further comprising, before performing the automatic scoring:
generating the per-area scoring model through machine learning using previously scored data in which the one or more evaluation areas were evaluated for one or more answers and one or more evaluation features extracted from the one or more answers.
9. The automatic scoring method of claim 8,
further comprising generating, based on the previously scored data, a correlation model between evaluation areas that defines the probability of each score occurring between the one or more evaluation areas.
10. A computer-readable recording medium on which a program is recorded, the program executing:
performing automatic scoring of target data for each of one or more evaluation areas by applying a previously generated scoring model for each evaluation area; and
tuning the automatic scores for the one or more evaluation areas using a correlation model between evaluation areas.
PCT/KR2013/005347 2012-10-31 2013-06-18 Apparatus and method for automatic scoring WO2014069741A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201380031051.4A CN104364815A (en) 2012-10-31 2013-06-18 Apparatus and method for automatic scoring
US14/558,154 US20150093737A1 (en) 2012-10-31 2014-12-02 Apparatus and method for automatic scoring

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
KR10-2012-0122380 2012-10-31
KR1020120122380A KR101616909B1 (en) 2012-10-31 2012-10-31 Automatic scoring apparatus and method

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US14/558,154 Continuation US20150093737A1 (en) 2012-10-31 2014-12-02 Apparatus and method for automatic scoring

Publications (1)

Publication Number Publication Date
WO2014069741A1 true WO2014069741A1 (en) 2014-05-08

Family

ID=50627614

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR2013/005347 WO2014069741A1 (en) 2012-10-31 2013-06-18 Apparatus and method for automatic scoring

Country Status (4)

Country Link
US (1) US20150093737A1 (en)
KR (1) KR101616909B1 (en)
CN (1) CN104364815A (en)
WO (1) WO2014069741A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109767663A (en) * 2019-03-22 2019-05-17 河南城建学院 A kind of linear algebra test question question-setting system
CN113421643A (en) * 2021-07-09 2021-09-21 浙江大学 AI model reliability judgment method, device, equipment and storage medium

Families Citing this family (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107292575A (en) * 2016-03-31 2017-10-24 阿里巴巴集团控股有限公司 Data processing method and device
WO2017190281A1 (en) * 2016-05-04 2017-11-09 汤美 Method and system for online teacher lecturing evaluation
US10516641B2 (en) 2016-06-21 2019-12-24 Pearson Education, Inc. System and method for automated evaluation system routing
US10581953B1 (en) * 2017-05-31 2020-03-03 Snap Inc. Real-time content integration based on machine learned selections
GB201710877D0 (en) 2017-07-06 2017-08-23 Nokia Technologies Oy A method and an apparatus for evaluating generative machine learning model
CN107729936B (en) * 2017-10-12 2020-12-08 科大讯飞股份有限公司 Automatic error correction review method and system
US11449762B2 (en) 2018-02-20 2022-09-20 Pearson Education, Inc. Real time development of auto scoring essay models for custom created prompts
US11443140B2 (en) 2018-02-20 2022-09-13 Pearson Education, Inc. Systems and methods for automated machine learning model training for a custom authored prompt
JP7080759B2 (en) * 2018-07-19 2022-06-06 アルー株式会社 Predicted score providing device, predicted score providing method and predicted score providing program
CN109491915B (en) * 2018-11-09 2022-02-08 网易有道信息技术(杭州)有限公司 Data processing method and device, medium and computing equipment
KR20200082540A (en) 2018-12-29 2020-07-08 김만돌 In-basket for competency assessment
KR20200086601A (en) 2019-01-09 2020-07-17 김만돌 Group discussion for competency assessment
KR20200086602A (en) 2019-01-09 2020-07-17 김만돌 In-basket system for competency assessment
KR20200086600A (en) 2019-01-09 2020-07-17 김만돌 Oral presentation for competency assessment
KR20200086796A (en) 2019-01-10 2020-07-20 김만돌 Manless on-line auto in-basket system for competency assessment
KR20200086798A (en) 2019-01-10 2020-07-20 김만돌 Manless on-line auto role play system for competency assessment
KR20200086799A (en) 2019-01-10 2020-07-20 김만돌 Manless on-line auto group discussion system for competency assessment
KR20200086795A (en) 2019-01-10 2020-07-20 김만돌 Group discussion system for competency assessment
KR20200086797A (en) 2019-01-10 2020-07-20 김만돌 Manless on-line auto oral presentation system for competency assessment
KR20200086794A (en) 2019-01-10 2020-07-20 김만돌 Role play system for competency assessment
KR20200086793A (en) 2019-01-10 2020-07-20 김만돌 Oral presentation system for competency assessment
WO2020166539A1 (en) * 2019-02-15 2020-08-20 日本電気株式会社 Grading support device, grading support system, grading support method, and program recording medium
CN110648058A (en) * 2019-09-17 2020-01-03 广州光大教育软件科技股份有限公司 Reliability analysis method and system based on examination paper reading and amending results and storage medium
CN110516060B (en) * 2019-10-24 2020-02-21 支付宝(杭州)信息技术有限公司 Method for determining answers to questions and question-answering device
KR20210084915A (en) 2019-12-30 2021-07-08 부산대학교 산학협력단 Online Learning Diagnosis Subjective Automatic Scoring System Using Machine Learning Technique and Its Method
CN113128883A (en) * 2021-04-23 2021-07-16 广东电网有限责任公司 GIM file automatic scoring method, device and storage medium
CN113705873B (en) * 2021-08-18 2024-01-19 中国科学院自动化研究所 Construction method of film and television work score prediction model and score prediction method

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20010025202A (en) * 2000-10-14 2001-04-06 조만재 An Intellectual Valuation System
JP2004151757A (en) * 2002-10-28 2004-05-27 Ricoh Co Ltd Sentence evaluating and scoring device, program, and storage medium
KR20050042743A (en) * 2002-09-25 2005-05-10 가부시키가이샤 베네세 코포레이션 Test system and control method thereof

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060014129A1 (en) * 2001-02-09 2006-01-19 Grow.Net, Inc. System and method for processing test reports
US8380491B2 (en) * 2002-04-19 2013-02-19 Educational Testing Service System for rating constructed responses based on concepts and a model answer
US7088949B2 (en) * 2002-06-24 2006-08-08 Educational Testing Service Automated essay scoring
WO2005045786A1 (en) * 2003-10-27 2005-05-19 Educational Testing Service Automatic essay scoring system
US7657220B2 (en) * 2004-05-21 2010-02-02 Ordinate Corporation Adaptive scoring of responses to constructed response questions
WO2006093928A2 (en) * 2005-02-28 2006-09-08 Educational Testing Service Method of model scaling for an automated essay scoring system
KR100919912B1 (en) * 2005-04-05 2009-10-06 에이아이 리미티드 Systems and methods for semantic knowledge assessment, instruction, and acquisition
KR20090001485A (en) 2007-04-18 2009-01-09 주식회사 아이오시스 A self-study system through automatic marking of answers to subjective questions
JP5454357B2 (en) * 2010-05-31 2014-03-26 ソニー株式会社 Information processing apparatus and method, and program
US20120244510A1 (en) * 2011-03-22 2012-09-27 Watkins Jr Robert Todd Normalization and Cumulative Analysis of Cognitive Educational Outcome Elements and Related Interactive Report Summaries

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20010025202A (en) * 2000-10-14 2001-04-06 조만재 An Intellectual Valuation System
KR20050042743A (en) * 2002-09-25 2005-05-10 가부시키가이샤 베네세 코포레이션 Test system and control method thereof
JP2004151757A (en) * 2002-10-28 2004-05-27 Ricoh Co Ltd Sentence evaluating and scoring device, program, and storage medium

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
KANG, WON-SEOG: "Automatic Grading System for Subjective Questions Through Analyzing Question Type", THE JOURNAL OF THE KOREA CONTENTS ASSOCIATION, vol. 11, no. 2, 17 February 2011 (2011-02-17), pages 13 - 21 *
OH, JUNG SEOK ET AL.: "A Descriptive Question Marking System based on Semantic Kernels", KOREAN INSTITUTE OF INFORMATION TECHNOLOGY, vol. 3, no. 4, 1 October 2005 (2005-10-01), pages 95 - 104 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109767663A (en) * 2019-03-22 2019-05-17 河南城建学院 A kind of linear algebra test question question-setting system
CN113421643A (en) * 2021-07-09 2021-09-21 浙江大学 AI model reliability judgment method, device, equipment and storage medium

Also Published As

Publication number Publication date
KR20140055442A (en) 2014-05-09
CN104364815A (en) 2015-02-18
KR101616909B1 (en) 2016-04-29
US20150093737A1 (en) 2015-04-02

Similar Documents

Publication Publication Date Title
WO2014069741A1 (en) Apparatus and method for automatic scoring
CN109523194B (en) Chinese reading ability evaluation method and device and readable storage medium
US10769958B2 (en) Generating high-level questions from sentences
Bahreini et al. Towards real-time speech emotion recognition for affective e-learning
WO2018161917A1 (en) Intelligent scoring method and apparatus, computer device, and computer-readable medium
Li et al. An automated assessment framework for atypical prosody and stereotyped idiosyncratic phrases related to autism spectrum disorder
WO2012115324A1 (en) Conversation management method, and device for executing same
CN111833853B (en) Voice processing method and device, electronic equipment and computer readable storage medium
WO2012026674A2 (en) Method, apparatus and system for learning plan analysis
CN110600033B (en) Learning condition evaluation method and device, storage medium and electronic equipment
WO2021218029A1 (en) Artificial intelligence-based interview method and apparatus, computer device, and storage medium
CN103730032A (en) Method and system for controlling multimedia data
WO2023279692A1 (en) Question-and-answer platform-based data processing method and apparatus, and related device
WO2011074772A2 (en) Grammatical error simulation device and method
WO2023106855A1 (en) Method, system and non-transitory computer-readable recording medium for supporting writing assessment
WO2017131325A1 (en) System and method for verifying and correcting knowledge base
WO2016208941A1 (en) Text preprocessing method and preprocessing system for performing same
WO2009119991A2 (en) Method and system for learning language based on sound analysis on the internet
CN109346108A (en) A kind of review of operations method and system
WO2021137534A1 (en) Method and system for learning korean pronunciation via voice analysis
CN109272983A (en) Bilingual switching device for child-parent education
WO2011049313A2 (en) Apparatus and method for processing documents to extract expressions and descriptions
KR20060087821A (en) System and its method for rating language ability in language learning stage based on l1 acquisition
CN114462428A (en) Translation evaluation method and system, electronic device and readable storage medium
CN114241835A (en) Student spoken language quality evaluation method and device

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 13852182

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 13852182

Country of ref document: EP

Kind code of ref document: A1