WO2022124355A1 - English composition correction result analysis system - Google Patents

English composition correction result analysis system Download PDF

Info

Publication number
WO2022124355A1
WO2022124355A1 PCT/JP2021/045231 JP2021045231W WO2022124355A1 WO 2022124355 A1 WO2022124355 A1 WO 2022124355A1 JP 2021045231 W JP2021045231 W JP 2021045231W WO 2022124355 A1 WO2022124355 A1 WO 2022124355A1
Authority
WO
WIPO (PCT)
Prior art keywords
unit
english composition
storage device
vocabulary level
sentence
Prior art date
Application number
PCT/JP2021/045231
Other languages
French (fr)
Japanese (ja)
Inventor
諭 内田
文勝 本間
圭悟 佐藤
千佳 廣田
真之介 田阪
浩一 西岡
裕司 前田
Original Assignee
株式会社新興出版社啓林館
国立大学法人九州大学
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 株式会社新興出版社啓林館, 国立大学法人九州大学 filed Critical 株式会社新興出版社啓林館
Publication of WO2022124355A1 publication Critical patent/WO2022124355A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/194Calculation of difference between files
    • GPHYSICS
    • G09EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09BEDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
    • G09B19/00Teaching not covered by other main groups of this subclass
    • GPHYSICS
    • G09EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09BEDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
    • G09B19/00Teaching not covered by other main groups of this subclass
    • G09B19/06Foreign languages
    • GPHYSICS
    • G09EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09BEDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
    • G09B7/00Electrically-operated teaching apparatus or devices working with questions and answers
    • G09B7/02Electrically-operated teaching apparatus or devices working with questions and answers of the type wherein the student is expected to construct an answer to the question which is presented or wherein the machine gives an answer to the question presented by a student

Definitions

  • the present invention relates to a technique for analyzing a correction result by comparing an English composition with a correction sentence thereof.
  • a proofreading system that calibrates grammatical errors and spelling mistakes in English composition with a computer has already been put into practical use due to the development of artificial intelligence in recent years, and is being used in the business field as well.
  • the English learning system by the applicant of the present application which is exemplified in Non-Patent Document 1, is an example of a correction system that provides such a correction service.
  • users such as learners and teachers send the English composition of the Japanese sentence being asked to the system and receive the correction sentence by the corrector with excellent English ability from the system. be able to.
  • Patent Document 1 considers the difference in the position of the same word group, the difference in the conjugation form of the word, the difference in the part of speech of the word, etc. between the answer sentence created by the user and the correct answer sentence prepared in advance. Then, the technology for scoring the answer text is disclosed.
  • the present invention has been made in view of the above problems, and provides an English composition correction result analysis system that analyzes a correction result by comparing an English composition with the correction sentence and returns the analysis result to the user. With the goal.
  • the first aspect of the present invention is an English composition correction result analysis system, which includes an input receiving unit, a matching unit, and a result output unit.
  • the input reception unit accepts input of English composition and its correction sentence.
  • the associating unit associates the sentences constituting the English composition and the correction sentence with each other when the input is accepted.
  • the result output unit outputs the result of the mapping performed by the mapping unit.
  • the matching unit associates the sentences with each other so that the distance between the sentences based on the predetermined standard is minimized.
  • a user for example, a learner, a teacher, etc.
  • the second aspect of the present invention is the English composition correction result analysis system according to the first aspect, and the matching unit includes a single sentence and a plurality of sentences included in the English composition and the correction sentence.
  • the sentences are described in ascending order of distance between the single sentence and the plurality of sentences based on the predetermined criteria, until there are no unrelated sentences in either the English composition or the correction sentence. Make a correspondence between each other.
  • the correspondence between sentences can be obtained by a steady method in order, and the user who receives the result of the correspondence can find the correspondence between the sentences including a plurality of sentences and the distance is the minimum. , You can know.
  • the third aspect of the present invention is an English composition correction result analysis system according to the first or second aspect, which further includes a part-of-speech analyzer access unit and a part-of-speech search unit.
  • the part-speech analyzer access unit receives input of a sentence, analyzes the part-speech for each word constituting the input sentence, and accesses the part-speech analyzer that outputs the analysis result.
  • the part-speech search unit selects a set of words and their part-speech between the associated sentences that is in one sentence but not in the other sentence, or in both sentences. The search is performed based on the analysis result obtained by the analyzer access unit accessing the part of speech analyzer. Then, the result output unit also outputs the result of the search performed by the part of speech search unit.
  • part of speech includes a large classification of part of speech such as verbs and nouns, and further, a minor classification of part of speech such as verb tense, noun tense, and noun countability / uncountability (subordinate attributes of major classification). May be included.
  • the part-of-speech analyzer may be an external device.
  • the fourth aspect of the present invention is an English composition correction result analysis system according to any one of the first to third aspects, further comprising a grammar item storage device access unit and a grammar item search unit.
  • the grammar item storage device access unit accesses a grammar item storage device that stores one or more elements including a word and its part of speech in association with each other and the corresponding grammar item.
  • the grammar item search unit stores the grammar item in one sentence but not in the other sentence, or the grammar item in both sentences, among the associated sentences.
  • the device access unit searches based on the stored contents obtained by accessing the grammar item storage device. Then, the result output unit also outputs the result of the search performed by the grammar item search unit.
  • the user can know the grammatical items that are inconsistent or the grammatical items that match among the grammatical items that are in a corresponding relationship with each other.
  • "part of speech” includes a large classification of part of speech such as verbs and nouns, and further, a minor classification of part of speech such as verb tense, noun tense, and noun countability / uncountability (subordinate attributes of major classification). May be included.
  • the grammatical items are, for example, tense, progressive tense, passive voice, subjunctive mood, and the like.
  • the grammar item storage device may be an external storage device.
  • the fifth aspect of the present invention is the English composition correction result analysis system according to the fourth aspect, and the grammar item search unit is not only the grammar item but also the one or more corresponding to the grammar item.
  • the element of is also recognized as a grammatical item in both sentences only when the associated sentences match each other.
  • the sixth aspect of the present invention is an English composition correction result analysis system according to the fourth or fifth aspect, further including an explanatory data storage device and an explanatory data reading unit.
  • the commentary data storage device stores commentary data containing explanations for each grammar item in association with the corresponding grammar item.
  • the explanatory data reading unit reads the explanatory data corresponding to the grammar item in one sentence but not in the other sentence, which is searched by the grammar item search unit, from the explanatory data storage device. Then, the result output unit also outputs the explanatory data read by the explanatory data reading unit.
  • the user can obtain a corresponding explanation when there is a discrepancy in the grammatical items between the English composition and the corrected sentence.
  • the seventh aspect of the present invention is an English composition correction result analysis system according to any one of the first to sixth aspects, which is a vocabulary level storage device access unit, a vocabulary level determination unit, and a vocabulary level representative. It further includes a value calculation unit, a distance calculation unit, an evaluation data storage device, and an evaluation data reading unit.
  • the vocabulary level storage device access unit accesses a vocabulary level storage device that stores a vocabulary level for each word in association with the corresponding word.
  • the vocabulary level determination unit determines the vocabulary level for each word for all words or words satisfying predetermined conditions included in the English composition for which input is accepted, and the vocabulary level storage device access unit determines the vocabulary level for each word. This is done based on the storage content obtained by accessing the storage device.
  • the vocabulary level representative value calculation unit calculates a vocabulary level representative value which is a representative value of the vocabulary level determined by the vocabulary level determination unit.
  • the distance calculation unit calculates the total distance, which is the distance based on the predetermined standard, between the entire English composition for which input has been accepted and the entire correction sentence for which input has been accepted.
  • the evaluation data storage device stores evaluation data including evaluations according to the total distance and the vocabulary level representative value in association with the area for each area of the total distance and the vocabulary level representative value. ..
  • the evaluation data reading unit reads out the evaluation data corresponding to the calculated vocabulary level representative value and the calculated total distance from the evaluation data storage device. Then, the result output unit also outputs the evaluation data read by the evaluation data reading unit.
  • the user can obtain an evaluation based on the degree of difference between the English composition and the corrected sentence and the vocabulary level of the English composition.
  • the vocabulary level storage device may be an external storage device.
  • the representative value is, for example, an average value.
  • the predetermined condition is that, for example, the word is a content word.
  • the eighth aspect of the present invention is an English composition correction result analysis system, which is a vocabulary level storage device access unit, an input reception unit, a vocabulary level determination unit, a vocabulary level representative value calculation unit, and a distance. It includes a calculation unit, an evaluation data storage device, an evaluation data reading unit, and a result output unit.
  • the vocabulary level storage device access unit accesses a vocabulary level storage device that stores a vocabulary level for each word in association with the corresponding word.
  • the input reception unit accepts input of an English composition including a plurality of sentences and its correction sentence.
  • the vocabulary level determination unit determines the vocabulary level for each word for all words or words satisfying predetermined conditions included in the English composition for which input is accepted, and the vocabulary level storage device access unit determines the vocabulary level for each word.
  • the vocabulary level representative value calculation unit calculates a vocabulary level representative value which is a representative value of the vocabulary level determined by the vocabulary level determination unit.
  • the distance calculation unit calculates the total distance, which is the distance based on a predetermined standard, between the entire English composition for which input has been accepted and the entire correction sentence for which input has been accepted.
  • the evaluation data storage device stores evaluation data including evaluations according to the total distance and the vocabulary level representative value in association with the area for each area of the total distance and the vocabulary level representative value. ..
  • the evaluation data reading unit reads out the evaluation data corresponding to the calculated vocabulary level representative value and the calculated total distance from the evaluation data storage device.
  • the result output unit outputs the evaluation data read by the evaluation data reading unit.
  • the user can obtain an evaluation based on the degree of difference between the English composition and the corrected sentence and the vocabulary level of the English composition.
  • the vocabulary level storage device may be an external storage device.
  • the representative value is, for example, an average value.
  • the predetermined condition is that, for example, the word is a content word.
  • an English composition correction result analysis system that analyzes the correction result by comparing the English composition and the correction sentence and returns the analysis result to the user is realized.
  • FIG. 3 is a screen diagram showing an example in which the correspondence between sentences between an English composition and its correction sentence is displayed on the screen of a user terminal using the system of FIG. 1.
  • FIG. 1 is a block diagram illustrating the configuration of an English composition correction result analysis system according to an embodiment of the present invention.
  • This English composition correction result analysis system (abbreviated as "analysis system” as appropriate) 101 is a learner's English composition (also referred to as “learner's English composition” as appropriate) and its correction by the corrector ("corrector English”). It is a system that analyzes the correction result by comparing it with "composition” and returns the analysis result to users such as learners.
  • the analysis system 101 has a computer and is used by a user as a server connected to the network 120 in the illustrated example.
  • the network 120 is, for example, the Internet.
  • the learning system 141 In addition to the analysis system 101, the learning system 141, the corrector terminal 143, the user terminal 151, and the system administrator terminal 153 are connected to the network 120.
  • the user terminal 151 is a computer of a user who uses the learning system 141 and the analysis system 101. In addition to the learner, the learner's parents, teachers, etc. can also be users.
  • the system administrator terminal 153 is an administrator's computer that manages the learning system 141 and the analysis system 101.
  • the corrector terminal 143 is a corrector's computer.
  • the learning system 141 sets a question selected by the learner through the user terminal 151 from the questions of the English composition prepared in advance, receives the English composition answered by the learner, and uses the received English composition as the corrector terminal 143.
  • the learning system 141 further sends the received English composition and the corrected sentence to the user terminal 151 and also sends the received English composition to the analysis system 101 for analysis processing.
  • the content of the question may be a translation of a given Japanese sentence into English, or may be a request for creation of the content, such as "Write within 100 words about your future dreams.”
  • the corrector does not necessarily have to be fluent in Japanese, and may correct based only on the learner's English composition.
  • the user terminal 151 accesses only the learning system 141 and uses the analysis system 101 through the learning system 141.
  • the user terminal 151 may use the learning system 141 by installing a dedicated application, or may learn by accessing the program provided by the learning system 141 in the form of an ASP (application service provider).
  • System 141 may be used.
  • the analysis system 101 and the learning system 141 are individually connected to the network 120 as independent servers, but both may be constructed in a single server. In this case, the exchange of data between the analysis system 101 and the learning system 141 is performed without going through the network 120.
  • the device groups 161, 163, 165 used by the analysis system 101 are connected to the network 120 as a server.
  • the device group 161, 163, 165 includes a part-speech analysis device 161, a grammar item storage device 163, and a vocabulary level storage device 165.
  • the analysis system 101 having a computer includes devices 1 to 17 that are connected to each other so as to exchange data and each of which fulfills an individual function.
  • the device groups 1 to 17 include a device composed of only hardware and a device equivalently realized by reading a program by a computer.
  • the device groups 1 to 17 include an input receiving unit 1, a mapping unit 2, a result output unit 3, a part-of-speech analysis device access unit 4, a part-of-speech search unit 5, a grammar item storage device access unit 6, and a grammar item search.
  • the analysis system 101 associates the sentences included in the two English compositions with each other in order to enable an appropriate contrast between the learner's English composition and the corrector's English composition.
  • the mapping is performed, for example, by using the Levenshtein distance and selecting the other sentence with the shortest distance from one sentence as the corresponding sentence.
  • the Levenshtein distance is a well-known method for assessing the similarity between sentences. The higher the similarity between sentences, the smaller the distance. If the sentences are the same, the distance will be zero.
  • the distance is calculated for various combinations of sentences. The distance is calculated in singular and multiple sentence units. Even if one single sentence corresponds to multiple other sentences, the distance between one single sentence and the other is also calculated so that the mapping is correct. In this way, sentence-based correspondence is made between the learner's English composition and the corrector's English composition.
  • the input reception unit 1 of the analysis system 101 receives the input of the learner's English composition and the corrector's English composition, which is the correction sentence, sent from the learning system 141.
  • the associating unit 2 associates the learner's English composition and the corrector's English composition with each other.
  • the Levenshtein distance is calculated for various combinations of the singular and plural sentences contained in the learner's English composition and the singular and plural sentences included in the corrector's English composition.
  • the sentences are associated with each other in ascending order of the Levenshtein distance. The correspondence is performed until there are no unmatched sentences in either the learner's English composition or the corrector's English composition.
  • the result output unit 3 outputs the result of the association made by the association unit 2 to the learning system 141.
  • the learning system 141 sends the result of the association to the user terminal 151.
  • the user terminal 151 displays the result of the received correspondence on the screen.
  • the user for example, a learner, a teacher, etc.
  • the user can know the correspondence between the learner's English composition and the corrector's English composition.
  • the English composition 1 exemplified in Table 1 is an English composition tentatively assumed as a learner's English composition. Further, the English composition 2 exemplified in Table 2 is an English composition tentatively assumed as a corrector English composition.
  • the sentences included in each English composition are numbered 1, 2, ..., And a, b, c, ... In the order in which they are arranged.
  • a sentence that combines multiple sentences is used as a collection of sentences. to add.
  • An upper limit may be set in consideration of the calculation load as to how many sentences should be considered as the number of sentences constituting the combined sentence. In the examples of Tables 3 and 4, the upper limit is set to "3". That is, the combination of up to three sentences is considered. Further, regarding the combination of a plurality of sentences, in the illustrated example, only consecutive sentences are targeted for combination. In reality, the combination of discontinuous sentences rarely shows a good correspondence between the learner's English composition and the corrector's English composition. Therefore, it can be said that the method of combining only consecutive sentences causes almost no problem in reality, can avoid wasting calculation time, and is a practical method.
  • Table 14 illustrates the final result of the mapping.
  • Table 15 exemplifies the final results including sentence 1 constituting English sentence 1 (learner English composition) and sentence 2 constituting English sentence 2 (corrector English composition) by rearranging the sentences as appropriate. There is. In the example of Table 15, the sentences are rearranged according to the order of English sentences 2. The final results illustrated in Table 15 are output from the result output unit 3 to the learning system 141.
  • FIG. 2 is a screen diagram showing an example in which a learner's English composition and its corrector's English composition are displayed on the screen of the user terminal 151.
  • FIG. 3 is a screen diagram showing an example in which the correspondence between the sentences between the learner's English composition and the corrector's English composition is displayed on the screen of the user terminal 151.
  • the learner's English composition and the corrector's English composition illustrated in FIG. 2 are not the same as the learner's English composition and the corrector's English composition illustrated in Tables 1 and 2.
  • the learning system 141 sends the learner English composition received from the user terminal 151 and the corrector English composition received from the corrector terminal 143 to the user terminal 151.
  • the screen diagram of FIG. 2 shows an example in which the learner's English composition and the corrector's English composition sent from the learning system 141 are displayed side by side on the same screen so that they can be easily compared with each other.
  • the learner's English composition is displayed on the left side, and the corrector's English composition is displayed on the right side.
  • the "English composition analysis” button is displayed at the lower end of the screen.
  • the learning system 141 sends the learner English composition and the corrector English composition to the analysis system 101 for analysis processing.
  • the analysis system 101 starts the analysis of the sent learner's English composition and the corrector's English composition.
  • the analysis system 101 outputs the final result of the mapping illustrated in Table 15 to the learning system 141, the learning system 141 sends the received final result to the user terminal 151.
  • the sentences having a corresponding relationship between the learner's English composition and the corrector's English composition are displayed side by side with each other.
  • the user can clearly grasp the correspondence between the sentences between the learner's English composition and the corrector's English composition.
  • the "number 1", “number 2", and "distance” exemplified in Table 15 are not displayed. In this case, these data may not be included in the final result of the association sent by the analysis system 101 to the learning system 141.
  • the correspondence between sentences is performed until there are no unmatched sentences in either the learner's English composition or the corrector's English composition.
  • the unassociated sentences disappear at the same time.
  • the learner's English composition inserts a sentence that is completely unrelated to the question, the sentence that cannot be associated disappears first in the corrector's English composition, and the sentence that cannot be associated is the learner's English. It may remain in the composition.
  • a sentence that cannot be associated is also displayed in the learner English composition column on the left side, and the corrector English composition column on the right side is "irrelevant to the subject". ", Or a blank is displayed. Strikethrough may be superimposed on the unassociated sentence displayed in the learner English composition column on the left side.
  • Such processing is performed by, for example, the mapping unit 2. Sentences that cannot be associated are not subject to the following part-of-speech analysis and grammatical analysis.
  • the analysis system 101 further performs part-of-speech analysis by using the result of associating sentences with each other.
  • the difference between the part-speech used in one sentence and the part-speech used in the other sentence is logically calculated for the sentences corresponding to each other between the learner's English composition and the corrector's English composition. Then, based on the presence or absence of the difference, the analysis result is returned to the user as whether there is an error in the part of speech.
  • Part of speech analysis includes not only the major classification of part of speech such as verbs and nouns, but also the lower attributes of the major classification, that is, the minor classification of part of speech, for example, the verb time system for verbs and the singular / plural nouns for nouns.
  • Nouns such as countable and uncountable, can also be analyzed. That is, a broadly defined part of speech including a subclass of part of speech may be the subject of analysis. In the following explanation, we will take an example of analyzing such broadly defined part of speech.
  • the part of speech analysis device 161 is a device that, when a sentence is input, analyzes the part of speech for each word constituting the sentence and outputs the analysis result.
  • a part-of-speech analyzer 161 that fulfills such a function is already provided on the Internet as a server. TreeTagger (https://www.cis.uni-muenchen.de/ ⁇ schmid/tools/TreeTagger/) is one example.
  • the part-speech analyzer access unit 4 inputs the sentence sets associated with each other by the associating unit 2 into the part-speech analysis device 161 to obtain the result of analyzing the part of speech word by word for these sentence sets.
  • Which set of sentences to enter can be specified at the same time, for example, when the user requests part-of-speech analysis. For example, when the user clicks on any of the sentence sets displayed in FIG. 3, a pull tab or a small screen appears, and menus such as "part of speech analysis” and "grammar analysis” are displayed. By selecting "part of speech analysis", the user can request the part of speech analysis and specify the set of sentences to be analyzed at the same time.
  • the part-speech search unit 5 is located in one of the pairs of words and their part-speech between the associated sentences based on the analysis result of the part-speech acquired by the part-speech analyzer access unit 4. Search for pairs that are not in the other sentence, or in both sentences.
  • the result output unit 3 outputs the result searched by the part of speech search unit 5 to the learning system 141.
  • the learning system 141 sends the sent search result to the user terminal 151.
  • the user terminal 151 displays the received search result on the screen. As a result, the user who receives the search result can know the disagreement group or the matching group among the pairs of the word and its part of speech among the sentences in the corresponding relationship.
  • the part-of-speech analyzer access unit 4 inputs "text01" and "text02" into the part-speech analyzer 161 to obtain the analysis results exemplified in Tables 17 and 18, respectively.
  • the part of speech analyzer 161 to be used is the above-mentioned TreeTagger as an example.
  • the part-speech symbol is a symbol for identifying a part-speech defined in the part-speech analyzer 161.
  • part of speech means the above-mentioned part of speech in a broad sense.
  • the "meaning of the part-of-speech symbol” expressed in Japanese is a phrase that expresses the meaning of the part-of-speech symbol "VBD" in Japanese.
  • the part-of-speech analyzer access unit 4 separately accesses another server that is associated with the part-of-speech symbol and stores its meaning, or FIG.
  • the part-of-speech symbol meaning data storage device 16 provided as a storage device in the analysis system 101 stores the part-of-speech symbol and its meaning in association with each other, and the part-of-speech analyzer access unit 4 stores this part-of-speech.
  • the data storage device 16 may be accessed separately.
  • the part-of-speech symbol and its meaning data can be stored in the part-of-speech symbol meaning data storage device 16 through the input receiving unit 1 by, for example, the system administrator operating the system administrator terminal 153.
  • the part of speech search unit 5 is found in both the part of speech in "text01", the part of speech in "text02", and both "text01” and “text02” from the analysis results exemplified in Tables 17 and 18.
  • Search for part of speech In the search, "prototype of word” and “meaning of part of speech symbol” are not referred to. However, it refers not only to the “part of speech symbol” corresponding to the part of speech in a broad sense, but also to the "word”. Only when both the "part of speech symbol” and the "word” match, it is judged as "match".
  • FIG. 4 shows an example in which the analysis results of the words constituting each of the sentences corresponding to each other between the learner's English composition and the corrector's English composition and their part of speech are displayed on the screen of the user terminal 151. It is a screen view which shows. In the illustrated example, both sentences are displayed at the top of the screen. At the top of the screen, the meaning of each word and its part-speech symbol is displayed along with the sentence. In each case, the sentences in the learner's English composition are displayed on the left side and the sentences in the corrector's English composition are displayed on the right side for easy comparison. In the middle of the screen, the meanings of words and their part-speech symbols that exist only on one side of the corresponding sentences are displayed.
  • the left side shows the meanings of words and their part of speech symbols only in the sentence in the learner's English composition
  • the right side shows the words and their part of speech only in the sentence in the corrector's English composition.
  • the symbol is displayed.
  • the meanings of the words and their part-speech symbols in both of the corresponding sentences are displayed. From the illustrated screen, the user can clearly understand how the learner's English composition was corrected, where the problem was, what was good, and so on.
  • the analysis system 101 further performs grammatical analysis using the result of associating sentences with each other.
  • the grammar analysis the difference between the grammar item used in one sentence and the grammar item used in the other sentence is calculated for the sentences corresponding to each other between the learner's English composition and the corrector's English composition.
  • a logical operation is performed, and the analysis result is returned to the user as having / without an error in the grammar based on the presence / absence of a difference.
  • the grammatical items to be compared are specified by selecting an appropriate one from a large number of predetermined grammatical items based on the combination of part of speech and the like.
  • Grammar items are, for example, passive voice, verb + to infinitive, present perfect, and so on.
  • the procedure of grammar analysis will be described with reference to FIG.
  • the user terminal 151 requests a grammar analysis by a user's click operation or the like, this request is input to the input receiving unit 1 of the analysis system 101 via the learning system 141 and transmitted to the grammar item storage device access unit 6.
  • the user can specify the set of sentences to be analyzed at the same time. For example, when the user clicks on any of the sentence sets displayed in FIG. 3, a pull tab or a small screen appears, and menus such as "part of speech analysis” and "grammar analysis” are displayed. By selecting "grammar analysis", the user can request grammar analysis and specify a set of sentences to be analyzed at the same time.
  • the grammar item storage device access unit 6 accesses the grammar item storage device 163.
  • the grammar item storage device 163 stores one or more combinations of elements for each word including a word and its part of speech in association with the grammar item represented by the combination.
  • a collection of regular expressions for grammar items has been published as part of the CEFR-J project. (Link: http://cefr-j.org/PDF/sympo2020/CEFRJGP_GRAMMATICAL_ITEM_LIST.pdf).
  • 501 kinds of grammatical items are recorded in this regular expression collection in association with a regular expression of one or more combinations of elements for each corresponding word.
  • the elements for each word include "words", “part of speech symbols”, and “prototypes of words” exemplified in Tables 17 and 18.
  • Regular expression means a well-known expression form that concisely expresses a number of sets of elements using metacharacters.
  • the regular expressions included in the above regular expression collection are exemplified in Table 23 below.
  • the grammar item storage device access unit 6 accesses the grammar item storage device 163 to select a sentence associated with the grammar item storage device 163 from among the combinations of elements stored in the grammar item storage device 163 for each word. Search for the one that matches the combination of elements for each word included in the set, and pick up the corresponding grammar item.
  • the stored contents of the grammar item storage device 163 can also be stored in the storage device of the analysis system 101.
  • the grammar item storage device access unit 6 may access the storage device instead of the grammar item storage device 163.
  • the grammar item search unit 7 is based on the grammar item acquired by the grammar item storage device access unit 6, and the grammar item that is in one sentence but not in the other sentence, or both, between the associated sentences. Search for grammar items in the sentence.
  • the result output unit 3 outputs the result searched by the grammar item search unit 7 to the learning system 141.
  • the learning system 141 sends the sent search result to the user terminal 151.
  • the user terminal 151 displays the received search result on the screen. As a result, the user who receives the search result can know the grammatical items that do not match or the grammatical items that match between the sentences in the corresponding relationship.
  • the grammar item storage device access unit 6 may acquire the analysis result already obtained by the part of speech analysis device access unit 4 from the part of speech analysis device access unit 4 without newly accessing the part of speech analysis device 161. Next, the grammar item storage device access unit 6 expands the analysis results exemplified in Tables 17 and 18 into the original sentence formats of "text01” and "text02” as illustrated in Table 22. That is, the sentences “text01” and “text02” are expanded into the form of "word_part of speech symbol_word prototype" for each word.
  • Table 23 exemplifies two examples of the regular expressions recorded in the above-mentioned regular expression collection.
  • Each regular expression is recorded in association with a grammatical item, and each is described by one or more sets of elements whose contents are "word_part of speech symbol_word prototype". Therefore, the grammar item storage device 163 for storing a regular expression collection represents one or more sets of elements in which each element contains a "word”, a "(broadly defined) part of speech", and a "prototype of a word”. It is equivalent to remembering grammatical items in association with each other.
  • the grammar item storage device access unit 6 accesses the grammar item storage device 163, and for each of the 501 types of regular expressions, the corresponding characters in the expanded sentences of "text01" and "text02" exemplified in Table 22. Explore columns. The results of the search are illustrated in Tables 24 and 25.
  • the data describing the parent-child relationship is stored in the grammar item storage device 163 or analyzed as illustrated in FIG.
  • the grammar item storage device access unit 6 grasps such a parent-child relationship as illustrated in Tables 24 and 25. be able to. In the latter case, the grammar item storage device access unit 6 accesses the parent-child relationship data storage device 17 in addition to the grammar item storage device 163.
  • the parent-child relationship data storage device 17 can store data through the input reception unit 1 by the system administrator operating the system administrator terminal 153.
  • the grammar item search unit 7 sets the regular expression only in "text01", the regular expression only in "text02", and "text01" and "text02". Search for regular expressions on both sides.
  • the search not only the regular expression identification number but also the corresponding character string (“matched phrase” in the table) is referred to. That is, it is judged as "match” only when both the regular expression identification number and the corresponding character string match.
  • it is possible to avoid determining "match” when it should not be determined as “match”, such as when there is a different character string in which only the regular expression identification number matches.
  • Tables 26 to 28 exemplify the results of the search.
  • the regular expression “115" has a parental relationship with the regular expressions "107” and "219”. Therefore, the grammar item search unit 7 may delete the regular expressions "107” and "219” that are related to each other from the regular expressions that exist only in "text02".
  • the search results exemplified in Tables 26 to 28 are sent to the user terminal 151 through the learning system 141.
  • FIG. 5 is a screen diagram showing an example in which the analysis results of the grammatical items of the sentences corresponding to each other between the learner's English composition and the corrector's English composition are displayed on the screen of the user terminal 151. ..
  • both sentences are displayed at the top of the screen.
  • the sentences in the learner's English composition are displayed on the left side, and the sentences in the corrector's English composition are displayed on the right side.
  • the grammar items of each sentence are displayed in the upper part of the screen.
  • the grammar item is displayed together with the corresponding character string (“matched words / phrases” in Tables 24, 25, etc.). In the middle of the screen, grammar items that exist only in one of the corresponding sentences are displayed.
  • the grammar items found only in the sentences in the learner's English composition are displayed on the left side, and the grammar items found only in the sentences in the corrector's English composition are displayed on the right side.
  • the grammar items on both sides of the corresponding sentence are displayed. From the illustrated screen, the user can clearly understand where the grammatical problem was in the learner's English composition, what was good, and so on.
  • this function can be realized by preparing an explanatory data storage device 8 that stores grammatical items and explanatory data such as explanatory videos thereof in association with each other.
  • the explanatory data can be stored in the explanatory data storage device 8 through the input receiving unit 1 by the system administrator operating the system administrator terminal 153.
  • this request is input to the input receiving unit 1 of the analysis system 101 via the learning system 141 and transmitted to the explanation data reading unit 9.
  • the request includes, for example, data specifying sentences such as "text01" and "text02" exemplified in Tables 26 and 27, and data specifying grammatical items in sentences such as numbers [0] and [1]. Is done.
  • the explanation data reading unit 9 acquires the regular expression identification number of the grammar item for which explanation is required from the data included in the request from the data obtained by searching the grammar item search unit 7 exemplified in Tables 26 and 27. , The explanatory data corresponding to the acquired regular expression identification number is acquired from the explanatory data storage device 8.
  • the acquired commentary data is sent from the commentary data reading unit 9 to the user terminal 151 via the result output unit 3 and the learning system 141.
  • the request transmitted from the user terminal 151 to the input receiving unit 1 may include a regular expression identification number that specifies a grammatical item.
  • the commentary data reading unit 9 acquires the commentary data corresponding to the regular expression identification number from the commentary data storage device 8 without referring to the data searched and obtained by the grammar item search unit 7. Can be done.
  • the analysis system 101 can also return to the user an evaluation of the learner's English composition based on both the distance between the learner's English composition and its corrector's English composition and the vocabulary level of the learner's English composition.
  • this function includes a vocabulary level storage device 165, a vocabulary level storage device access unit 11, a vocabulary level determination unit 12, a vocabulary level representative value calculation unit 13, an evaluation data storage device 14, and evaluation data. This can be achieved by providing the reading unit 15.
  • the vocabulary level storage device 165 stores a large number of words in association with each other and the vocabulary level.
  • the vocabulary level for example, the vocabulary level defined in CEFR-J can be used.
  • the CEFR-J project provides vocabulary level data in spreadsheet format that describes the vocabulary levels defined by the CEFR-J for a large number of words (http://www.cefr-j.org/download.html). #cefrj_wordlist).
  • the vocabulary level storage device 165 stores vocabulary level data provided by, for example, CEFR-J.
  • the vocabulary level storage device 165 is already connected to the network 120 as a server, and if available, it can be used. If there is no available vocabulary level storage device 165, a new vocabulary level storage device 165 may be provided and the system administrator may operate the system administrator terminal 153 to store the vocabulary level data.
  • the storage device included in the analysis system 101 may store the vocabulary level data. In this case, the vocabulary level storage device access unit 11 accesses the storage device included in the analysis system 101.
  • the learning system 141 When the user terminal 151 sends an evaluation request to the learning system 141 by the user performing a predetermined operation, the learning system 141 inputs the sent request to the input reception unit 1 of the analysis system 101.
  • the input reception unit 1 conveys the received request to the vocabulary level determination unit 12 and the distance calculation unit 10.
  • a user's predetermined operation is performed by, for example, providing a button for requesting an evaluation on any of the screens illustrated in FIGS. 3 to 5 or all of them, and clicking the button. ..
  • the vocabulary level determination unit 12 Upon receiving the request, the vocabulary level determination unit 12 causes the vocabulary level storage device access unit 11 to access the vocabulary level storage device 165 to acquire the vocabulary level for each word included in the learner's English composition. Since the learner's English composition has already been transmitted from the input reception unit 1 to the correspondence unit 2, etc., the vocabulary level determination unit 12 can acquire the learner's English composition from, for example, the association unit 2.
  • the vocabulary level determination unit 12 may acquire the vocabulary level only for the words satisfying a predetermined condition, instead of acquiring the vocabulary level for all the words included in the learner's English composition. For example, the function words may be omitted from the words, and the vocabulary level may be obtained only for the content words.
  • the data provided by the CEFR-J project also describes the part of speech for each word, and it is possible to distinguish between function words and content words by referring to the part of speech.
  • the vocabulary level for each word acquired by the vocabulary level determination unit 12 is transmitted to the vocabulary level representative value calculation unit 13, so that the representative values are calculated.
  • the representative value is, for example, an average value, a median value, or the like.
  • the distance calculation unit 10 calculates the distance between the learner's English composition and the corrector's English composition.
  • the distance calculated here is not the distance between the learner's English composition and each sentence included in the corrector's English composition, but the distance between the entire learner's English composition and the entire corrector's English composition.
  • the distance is, for example, the Levenshtein distance. Since the distance calculation unit 10 has already received the learner's English composition and the corrector's English composition from the input reception unit 1, it is possible to calculate the distance between them.
  • the evaluation data reading unit 15 reads out the evaluation data corresponding to the calculated vocabulary level representative value and the calculated distance from the evaluation data storage device 14.
  • the evaluation data storage device 14 stores evaluation data according to both the vocabulary level representative value of the learner's English composition and the distance between the learner's English composition and the corrector's English composition.
  • FIG. 6 shows an example of evaluation data.
  • the evaluation space composed of the vocabulary level representative value and the distance is divided into four areas, and individual evaluation data is associated with each area. It is also possible to divide the evaluation space into a larger number of areas. Also, the area is not necessarily limited to a rectangular area.
  • the evaluation data can be stored in the evaluation data storage device 14 through the input receiving unit 1 by the system administrator operating the system administrator terminal 153.
  • the evaluation data reading unit 15 transmits the read evaluation data to the result output unit 3.
  • the result output unit 3 transmits the received evaluation data to the learning system 141, and the learning system 141 sends the received evaluation data to the user terminal 151.
  • the storage content of the vocabulary level storage device 165 may be stored in a storage device (not shown) included in the analysis system 101. Further, when the user requests an evaluation based on the learner's English composition and the corrector's English composition without requesting the correspondence between the sentences, the learning system 141 sets the learner's English composition and its correction.
  • the person's English composition is sent to the input reception unit 1 of the analysis system 101, and the input reception unit 1 transmits these data to the vocabulary level determination unit 12 and the distance calculation unit 10, so that the evaluation data reading unit 15 Can acquire evaluation data and convey it to the result output unit 3.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Educational Administration (AREA)
  • Educational Technology (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Document Processing Apparatus (AREA)
  • Electrically Operated Instructional Devices (AREA)

Abstract

An English composition correction result analysis system according to the present disclosure is provided with an input accepting unit, an associating unit, and a result output unit. The input accepting unit accepts input of an English composition and a corresponding corrected text. The associating unit associates with one another the texts constituting the English composition and the corrected text of which input has been accepted, between the English composition and the corrected text. The result output unit outputs the results of the association performed by the associating unit. Furthermore, the associating unit performs the association between the texts in order of increasing distance, based on predetermined criteria, between words and a plurality of texts included in the English composition and words and a plurality of texts included in the corrected text, until there are no unassociated texts in either the English composition or the corrected text. This configuration makes it possible to obtain an English composition correction result analysis system which analyzes an English composition and a corresponding corrected text by comparing the same with one another, and which returns the analysis result to the user.

Description

英作文添削結果分析システムEnglish composition correction result analysis system
 本発明は、英作文とその添削文とを対比することにより添削結果を分析する技術に関する。 The present invention relates to a technique for analyzing a correction result by comparing an English composition with a correction sentence thereof.
 英作文の文法上の誤り、スペルミスをコンピュータにより校正する校正システムは、近年の人工知能の発展により、すでに実用化され、ビジネスの分野においても利用が進んでいる。これに対し、英語学習者の英作文を、単に文法・スペル上の正誤だけでなく、出題されている問題に対して、正しく適切な英文になっているかという観点で添削するには、優れた英語力を有する添削者の手による添削が、現在においても非常に有効である。非特許文献1に例示する、本願出願人による英語学習システムは、そのような添削サービスを提供する添削システムの一例である。この添削システムでは、学習者、教師等の利用者(ユーザ)は、インターネットを通じて、出題されている日文の英作文をシステムに送付し、システムから優れた英語力を有する添削者による添削文を受け取ることができる。 A proofreading system that calibrates grammatical errors and spelling mistakes in English composition with a computer has already been put into practical use due to the development of artificial intelligence in recent years, and is being used in the business field as well. On the other hand, it is excellent to correct English composition of English learners not only from the correctness in grammar and spelling but also from the viewpoint of whether the English composition is correct and appropriate for the question being asked. Corrections made by a corrector with English proficiency are still very effective. The English learning system by the applicant of the present application, which is exemplified in Non-Patent Document 1, is an example of a correction system that provides such a correction service. In this correction system, users (users) such as learners and teachers send the English composition of the Japanese sentence being asked to the system and receive the correction sentence by the corrector with excellent English ability from the system. be able to.
 しかし、単に添削文を受け取るだけでは、利用者には、英作文のどこを、どのように直すべきなのか、が分かり難い場合があった。このような問題点を解消し、従来の添削システムの付加価値を高めるものとして、添削結果を自動分析して、分析結果を利用者にフィードバックするシステムが待望されていた。なお、特許文献1は、利用者の作成した解答文と、あらかじめ用意された正解文との間における、同一語群の位置の相違・単語の活用形の相違・単語の品詞の相違等を考慮して、解答文の採点を行う技術を開示している。 However, it was sometimes difficult for the user to understand where and how to correct the English composition simply by receiving the correction text. In order to solve such problems and increase the added value of the conventional correction system, a system that automatically analyzes the correction results and feeds back the analysis results to the user has been long-awaited. In addition, Patent Document 1 considers the difference in the position of the same word group, the difference in the conjugation form of the word, the difference in the part of speech of the word, etc. between the answer sentence created by the user and the correct answer sentence prepared in advance. Then, the technology for scoring the answer text is disclosed.
特許第4165898号公報Japanese Patent No. 4165898
 本発明は上記の問題点に鑑みてなされたもので、英作文とその添削文とを対比することにより添削結果を分析し、利用者に分析結果を返す英作文添削結果分析システムを提供することを目的とする。 The present invention has been made in view of the above problems, and provides an English composition correction result analysis system that analyzes a correction result by comparing an English composition with the correction sentence and returns the analysis result to the user. With the goal.
 上記目的を達成するために、本発明のうち第1の態様によるものは、英作文添削結果分析システムであって、入力受付部と、対応付け部と、結果出力部と、を備えている。入力受付部は、英作文とその添削文との入力を受け付ける。対応付け部は、入力を受け付けられた前記英作文と前記添削文との間で、それらを構成する文同士の対応付けを行う。結果出力部は、前記対応付け部が行った前記対応付けの結果を出力する。そして、前記対応付け部は、所定基準に基づく文同士の距離が最小となるように、前記文同士の対応付けを行う。
 この構成によれば、対応付けの結果を受け取るユーザ(例えば、学習者、教師等)は、英作文と添削文との間で、それらを構成する文同士の対応関係を、知ることができる。
In order to achieve the above object, the first aspect of the present invention is an English composition correction result analysis system, which includes an input receiving unit, a matching unit, and a result output unit. The input reception unit accepts input of English composition and its correction sentence. The associating unit associates the sentences constituting the English composition and the correction sentence with each other when the input is accepted. The result output unit outputs the result of the mapping performed by the mapping unit. Then, the matching unit associates the sentences with each other so that the distance between the sentences based on the predetermined standard is minimized.
According to this configuration, a user (for example, a learner, a teacher, etc.) who receives the result of the association can know the correspondence between the English composition and the correction sentence.
 本発明のうち第2の態様によるものは、第1の態様による英作文添削結果分析システムであって、前記対応付け部は、前記英作文に含まれる単数及び複数の文と、前記添削文に含まれる単数及び複数の文との間で、前記所定基準に基づく文同士の距離が小さい順に、前記英作文及び前記添削文のいずれかの中に、対応付けられない文が無くなるまで、前記文同士の対応付けを行う。
 この構成によれば、順序を踏む着実な方法で文同士の対応関係が得られ、しかも、対応付けの結果を受け取るユーザは、複数の文を含めて距離が最小となる文同士の対応関係を、知ることができる。
The second aspect of the present invention is the English composition correction result analysis system according to the first aspect, and the matching unit includes a single sentence and a plurality of sentences included in the English composition and the correction sentence. The sentences are described in ascending order of distance between the single sentence and the plurality of sentences based on the predetermined criteria, until there are no unrelated sentences in either the English composition or the correction sentence. Make a correspondence between each other.
According to this configuration, the correspondence between sentences can be obtained by a steady method in order, and the user who receives the result of the correspondence can find the correspondence between the sentences including a plurality of sentences and the distance is the minimum. , You can know.
 本発明のうち第3の態様によるものは、第1又は第2の態様による英作文添削結果分析システムであって、品詞分析装置アクセス部と、品詞探索部と、をさらに備えている。品詞分析装置アクセス部は、文の入力を受付け、入力された文を構成する単語毎に、その品詞を分析し、その分析結果を出力する品詞分析装置に、アクセスする。品詞探索部は、対応づけられた前記文同士の間で、単語とその品詞との組のうち、一方の文にあって他方の文にない組、又は双方の文にある組を、前記品詞分析装置アクセス部が前記品詞分析装置にアクセスすることにより得られる分析結果に基づいて、探索する。そして、前記結果出力部は、前記品詞探索部が行った探索の結果をも出力する。 The third aspect of the present invention is an English composition correction result analysis system according to the first or second aspect, which further includes a part-of-speech analyzer access unit and a part-of-speech search unit. The part-speech analyzer access unit receives input of a sentence, analyzes the part-speech for each word constituting the input sentence, and accesses the part-speech analyzer that outputs the analysis result. The part-speech search unit selects a set of words and their part-speech between the associated sentences that is in one sentence but not in the other sentence, or in both sentences. The search is performed based on the analysis result obtained by the analyzer access unit accessing the part of speech analyzer. Then, the result output unit also outputs the result of the search performed by the part of speech search unit.
 この構成によれば、ユーザは、対応関係にある文同士の間で、単語とその品詞との組のうち、不一致である組、又は一致している組について、知ることができる。なお、「品詞」は、動詞、名詞などの品詞の大分類を含み、さらには、動詞の時制、名詞の単複、名詞の可算・不可算など、品詞の小分類(大分類の下位の属性)を含めたものであってもよい。また、品詞分析装置は、外部の装置であってもよい。 According to this configuration, the user can know the disagreement group or the matching group among the pairs of words and their part of speech among the sentences having a corresponding relationship. In addition, "part of speech" includes a large classification of part of speech such as verbs and nouns, and further, a minor classification of part of speech such as verb tense, noun tense, and noun countability / uncountability (subordinate attributes of major classification). May be included. Further, the part-of-speech analyzer may be an external device.
 本発明のうち第4の態様によるものは、第1から第3のいずれかの態様による英作文添削結果分析システムであって、文法項目記憶装置アクセス部と、文法項目探索部と、をさらに備えている。文法項目記憶装置アクセス部は、各要素が単語とその品詞とを含む1以上の要素と、対応する文法項目とを、関連付けて記憶する文法項目記憶装置にアクセスする。文法項目探索部は、対応づけられた前記文同士の間で、文法項目のうち、一方の文にあって他方の文にない文法項目、又は双方の文にある文法項目を、前記文法項目記憶装置アクセス部が前記文法項目記憶装置にアクセスすることにより得られる記憶内容に基づいて、探索する。そして、前記結果出力部は、前記文法項目探索部が行った探索の結果をも出力する。 The fourth aspect of the present invention is an English composition correction result analysis system according to any one of the first to third aspects, further comprising a grammar item storage device access unit and a grammar item search unit. ing. The grammar item storage device access unit accesses a grammar item storage device that stores one or more elements including a word and its part of speech in association with each other and the corresponding grammar item. The grammar item search unit stores the grammar item in one sentence but not in the other sentence, or the grammar item in both sentences, among the associated sentences. The device access unit searches based on the stored contents obtained by accessing the grammar item storage device. Then, the result output unit also outputs the result of the search performed by the grammar item search unit.
 この構成によれば、ユーザは、対応関係にある文同士の間で、文法項目のうち、不一致である文法項目、又は一致している文法項目について、知ることができる。なお、「品詞」は、動詞、名詞などの品詞の大分類を含み、さらには、動詞の時制、名詞の単複、名詞の可算・不可算など、品詞の小分類(大分類の下位の属性)を含めたものであってもよい。文法項目は、例えば、時制、進行形、受動態、仮定法などである。また、文法項目記憶装置は、外部の記憶装置であってもよい。 According to this configuration, the user can know the grammatical items that are inconsistent or the grammatical items that match among the grammatical items that are in a corresponding relationship with each other. In addition, "part of speech" includes a large classification of part of speech such as verbs and nouns, and further, a minor classification of part of speech such as verb tense, noun tense, and noun countability / uncountability (subordinate attributes of major classification). May be included. The grammatical items are, for example, tense, progressive tense, passive voice, subjunctive mood, and the like. Further, the grammar item storage device may be an external storage device.
 本発明のうち第5の態様によるものは、第4の態様による英作文添削結果分析システムであって、前記文法項目探索部は、前記文法項目だけでなく、前記文法項目に対応する前記1以上の要素も、対応づけられた前記文同士の間で一致する場合に限り、前記双方の文にある文法項目として認定する。 The fifth aspect of the present invention is the English composition correction result analysis system according to the fourth aspect, and the grammar item search unit is not only the grammar item but also the one or more corresponding to the grammar item. The element of is also recognized as a grammatical item in both sentences only when the associated sentences match each other.
 この構成によれば、対応関係にある文同士の間で、文法項目に関連する要素が異なるのに、文法項目が一致するものと判断される不都合が回避される。 According to this configuration, it is possible to avoid the inconvenience that it is judged that the grammar items match even though the elements related to the grammar items are different between the sentences having a corresponding relationship.
 本発明のうち第6の態様によるものは、第4又は第5の態様による英作文添削結果分析システムであって、解説データ記憶装置と、解説データ読出部と、をさらに備えている。解説データ記憶装置は、文法項目毎の解説を内容とする解説データを、対応する文法項目に関連づけて記憶する。解説データ読出部は、前記文法項目探索部が探索した、一方の文にあって他方の文にない文法項目に対応する解説データを、前記解説データ記憶装置から読み出す。そして、前記結果出力部は、前記解説データ読出部が読み出した前記解説データをも出力する。 The sixth aspect of the present invention is an English composition correction result analysis system according to the fourth or fifth aspect, further including an explanatory data storage device and an explanatory data reading unit. The commentary data storage device stores commentary data containing explanations for each grammar item in association with the corresponding grammar item. The explanatory data reading unit reads the explanatory data corresponding to the grammar item in one sentence but not in the other sentence, which is searched by the grammar item search unit, from the explanatory data storage device. Then, the result output unit also outputs the explanatory data read by the explanatory data reading unit.
 この構成によれば、ユーザは、英作文とその添削文との間で文法項目に不一致があったときに、これに対応する解説を得ることができる。 According to this configuration, the user can obtain a corresponding explanation when there is a discrepancy in the grammatical items between the English composition and the corrected sentence.
 本発明のうち第7の態様によるものは、第1から第6のいずれかの態様による英作文添削結果分析システムであって、語彙レベル記憶装置アクセス部と、語彙レベル判定部と、語彙レベル代表値算定部と、距離算出部と、評価データ記憶装置と、評価データ読出部と、をさらに備えている。語彙レベル記憶装置アクセス部は、単語毎に語彙レベルを、対応する単語と関連付けて記憶する語彙レベル記憶装置にアクセスする。語彙レベル判定部は、入力を受け付けられた前記英作文に含まれる、全ての単語又は所定の条件を満たす単語について、単語毎に語彙レベルの判定を、前記語彙レベル記憶装置アクセス部が前記語彙レベル記憶装置にアクセスすることにより得られる記憶内容に基づいて行う。語彙レベル代表値算定部は、前記語彙レベル判定部によって判定された前記語彙レベルの代表値である語彙レベル代表値を算出する。距離算出部は、入力を受け付けられた前記英作文の全体と入力を受け付けられた前記添削文の全体との間の、前記所定基準に基づく距離である全体距離を算出する。評価データ記憶装置は、前記全体距離と前記語彙レベル代表値とに応じた評価を内容とする評価データを、前記全体距離と前記語彙レベル代表値との区域毎に、当該区域に関連付けて記憶する。評価データ読出部は、算出された前記語彙レベル代表値と、算出された前記全体距離とに対応する評価データを、前記評価データ記憶装置から読み出す。そして、前記結果出力部は、前記評価データ読出部が読み出した前記評価データをも出力する。 The seventh aspect of the present invention is an English composition correction result analysis system according to any one of the first to sixth aspects, which is a vocabulary level storage device access unit, a vocabulary level determination unit, and a vocabulary level representative. It further includes a value calculation unit, a distance calculation unit, an evaluation data storage device, and an evaluation data reading unit. The vocabulary level storage device access unit accesses a vocabulary level storage device that stores a vocabulary level for each word in association with the corresponding word. The vocabulary level determination unit determines the vocabulary level for each word for all words or words satisfying predetermined conditions included in the English composition for which input is accepted, and the vocabulary level storage device access unit determines the vocabulary level for each word. This is done based on the storage content obtained by accessing the storage device. The vocabulary level representative value calculation unit calculates a vocabulary level representative value which is a representative value of the vocabulary level determined by the vocabulary level determination unit. The distance calculation unit calculates the total distance, which is the distance based on the predetermined standard, between the entire English composition for which input has been accepted and the entire correction sentence for which input has been accepted. The evaluation data storage device stores evaluation data including evaluations according to the total distance and the vocabulary level representative value in association with the area for each area of the total distance and the vocabulary level representative value. .. The evaluation data reading unit reads out the evaluation data corresponding to the calculated vocabulary level representative value and the calculated total distance from the evaluation data storage device. Then, the result output unit also outputs the evaluation data read by the evaluation data reading unit.
 この構成によれば、ユーザは、英作文と添削文との相違の度合いと、英作文の語彙レベルとに基づく評価を得ることができる。語彙レベル記憶装置は、外部の記憶装置であってもよい。代表値は、例えば平均値である。また、所定の条件は、例えば、単語が内容語であることである。 According to this configuration, the user can obtain an evaluation based on the degree of difference between the English composition and the corrected sentence and the vocabulary level of the English composition. The vocabulary level storage device may be an external storage device. The representative value is, for example, an average value. Further, the predetermined condition is that, for example, the word is a content word.
 本発明のうち第8の態様によるものは、英作文添削結果分析システムであって、語彙レベル記憶装置アクセス部と、入力受付部と、語彙レベル判定部と、語彙レベル代表値算定部と、距離算出部と、評価データ記憶装置と、評価データ読出部と、結果出力部と、を備えている。語彙レベル記憶装置アクセス部は、単語毎に語彙レベルを、対応する単語と関連付けて記憶する語彙レベル記憶装置にアクセスする。入力受付部は、複数の文を含む英作文とその添削文との入力を受け付ける。語彙レベル判定部は、入力を受け付けられた前記英作文に含まれる、全ての単語又は所定の条件を満たす単語について、単語毎に語彙レベルの判定を、前記語彙レベル記憶装置アクセス部が前記語彙レベル記憶装置にアクセスすることにより得られる記憶内容に基づいて行う。語彙レベル代表値算定部は、前記語彙レベル判定部によって判定された前記語彙レベルの代表値である語彙レベル代表値を算出する。距離算出部は、入力を受け付けられた前記英作文の全体と入力を受け付けられた前記添削文の全体との間の、所定基準に基づく距離である全体距離を算出する。評価データ記憶装置は、前記全体距離と前記語彙レベル代表値とに応じた評価を内容とする評価データを、前記全体距離と前記語彙レベル代表値との区域毎に、当該区域に関連付けて記憶する。評価データ読出部は、算出された前記語彙レベル代表値と、算出された前記全体距離とに対応する評価データを、前記評価データ記憶装置から読み出す。結果出力部は、前記評価データ読出部が読み出した前記評価データを出力する。 The eighth aspect of the present invention is an English composition correction result analysis system, which is a vocabulary level storage device access unit, an input reception unit, a vocabulary level determination unit, a vocabulary level representative value calculation unit, and a distance. It includes a calculation unit, an evaluation data storage device, an evaluation data reading unit, and a result output unit. The vocabulary level storage device access unit accesses a vocabulary level storage device that stores a vocabulary level for each word in association with the corresponding word. The input reception unit accepts input of an English composition including a plurality of sentences and its correction sentence. The vocabulary level determination unit determines the vocabulary level for each word for all words or words satisfying predetermined conditions included in the English composition for which input is accepted, and the vocabulary level storage device access unit determines the vocabulary level for each word. This is done based on the storage content obtained by accessing the storage device. The vocabulary level representative value calculation unit calculates a vocabulary level representative value which is a representative value of the vocabulary level determined by the vocabulary level determination unit. The distance calculation unit calculates the total distance, which is the distance based on a predetermined standard, between the entire English composition for which input has been accepted and the entire correction sentence for which input has been accepted. The evaluation data storage device stores evaluation data including evaluations according to the total distance and the vocabulary level representative value in association with the area for each area of the total distance and the vocabulary level representative value. .. The evaluation data reading unit reads out the evaluation data corresponding to the calculated vocabulary level representative value and the calculated total distance from the evaluation data storage device. The result output unit outputs the evaluation data read by the evaluation data reading unit.
 この構成によれば、ユーザは、英作文と添削文との相違の度合いと、英作文の語彙レベルとに基づく評価を得ることができる。語彙レベル記憶装置は、外部の記憶装置であってもよい。代表値は、例えば平均値である。また、所定の条件は、例えば、単語が内容語であることである。 According to this configuration, the user can obtain an evaluation based on the degree of difference between the English composition and the corrected sentence and the vocabulary level of the English composition. The vocabulary level storage device may be an external storage device. The representative value is, for example, an average value. Further, the predetermined condition is that, for example, the word is a content word.
 以上のように本発明によれば、英作文とその添削文とを対比することにより添削結果を分析し、利用者に分析結果を返す英作文添削結果分析システムが実現する。
 本発明の目的、特徴、局面、及び利点は、以下の詳細な説明と添付図面とによって、より明白となる。
As described above, according to the present invention, an English composition correction result analysis system that analyzes the correction result by comparing the English composition and the correction sentence and returns the analysis result to the user is realized.
The objects, features, aspects, and advantages of the present invention will be made clearer by the following detailed description and accompanying drawings.
本発明の一実施の形態による英作文添削結果分析システムの構成を例示するブロック図である。It is a block diagram which illustrates the structure of the English composition correction result analysis system by one Embodiment of this invention. 図1のシステムを利用するユーザ端末の画面に、英作文とその添削文が表示される例を示す画面図である。It is a screen diagram which shows the example which the English composition and the correction sentence are displayed on the screen of the user terminal which uses the system of FIG. 図1のシステムを利用するユーザ端末の画面に、英作文とその添削文との間の文同士の対応関係が表示される例を示す画面図である。FIG. 3 is a screen diagram showing an example in which the correspondence between sentences between an English composition and its correction sentence is displayed on the screen of a user terminal using the system of FIG. 1. 図1のシステムを利用するユーザ端末の画面に、英作文とその添削文との間で互いに対応する文同士について、それらを構成する各単語とその品詞の分析結果が表示される例を示す画面図である。A screen showing an example in which the analysis results of each word and its part of speech are displayed on the screen of a user terminal using the system of FIG. 1 for sentences corresponding to each other between an English composition and its correction sentence. It is a figure. 図1のシステムを利用するユーザ端末の画面に、英作文とその添削文との間で互いに対応する文同士について、それらの文法項目の分析結果が表示される例を示す画面図である。It is a screen diagram which shows the example which the analysis result of the grammatical item is displayed about the sentence which corresponds to each other between an English composition and the correction sentence on the screen of the user terminal which uses the system of FIG. 図1のシステムが有する評価データ記憶装置に記憶される評価データを例示する説明図である。It is explanatory drawing which illustrates the evaluation data stored in the evaluation data storage apparatus which the system of FIG. 1 has.
 (1.全体構成)
 図1は、本発明の一実施の形態による英作文添削結果分析システムの構成を例示するブロック図である。この英作文添削結果分析システム(「分析システム」と適宜略記する)101は、学習者の英作文(「学習者英作文」とも適宜記載する)と、添削者によるその添削文(「添削者英作文」とも適宜記載する)とを対比することにより添削結果を分析し、学習者等のユーザに分析結果を返すシステムである。分析システム101は、コンピュータを有しており、図示例ではネットワーク120に接続されるサーバとして、ユーザの利用に供される。ネットワーク120は、一例としてインターネットである。
(1. Overall configuration)
FIG. 1 is a block diagram illustrating the configuration of an English composition correction result analysis system according to an embodiment of the present invention. This English composition correction result analysis system (abbreviated as "analysis system" as appropriate) 101 is a learner's English composition (also referred to as "learner's English composition" as appropriate) and its correction by the corrector ("corrector English"). It is a system that analyzes the correction result by comparing it with "composition" and returns the analysis result to users such as learners. The analysis system 101 has a computer and is used by a user as a server connected to the network 120 in the illustrated example. The network 120 is, for example, the Internet.
 ネットワーク120には、分析システム101の他に、学習システム141、添削者端末143、ユーザ端末151、システム管理者端末153が接続される。ユーザ端末151は、学習システム141及び分析システム101を利用するユーザのコンピュータである。学習者のほか、学習者の親、教師などもユーザであり得る。システム管理者端末153は、学習システム141及び分析システム101を管理する管理者のコンピュータである。添削者端末143は、添削者のコンピュータである。学習システム141は、予め準備された英作文の問題の中から、ユーザ端末151を通じて学習者が選択する問題を出題し、学習者が回答する英作文を受け取り、受け取った英作文を添削者端末143を通じて添削者に送付し、さらに、添削者による添削文を受け取る。学習システム141は、さらに、受け取った英作文と添削文とを、ユーザ端末151に送るとともに、分析処理のために分析システム101に送る。出題の内容は、与えられた日本語の文章を英訳させるものであってもよく、「将来の夢について100語以内で書きなさい」といった、内容の創作をも求めるものであっても良い。添削者は、日本語に通じていることを必ずしも要さず、学習者の英作文のみに基づいて添削しても良い。 In addition to the analysis system 101, the learning system 141, the corrector terminal 143, the user terminal 151, and the system administrator terminal 153 are connected to the network 120. The user terminal 151 is a computer of a user who uses the learning system 141 and the analysis system 101. In addition to the learner, the learner's parents, teachers, etc. can also be users. The system administrator terminal 153 is an administrator's computer that manages the learning system 141 and the analysis system 101. The corrector terminal 143 is a corrector's computer. The learning system 141 sets a question selected by the learner through the user terminal 151 from the questions of the English composition prepared in advance, receives the English composition answered by the learner, and uses the received English composition as the corrector terminal 143. It will be sent to the corrector through, and the correction text by the corrector will be received. The learning system 141 further sends the received English composition and the corrected sentence to the user terminal 151 and also sends the received English composition to the analysis system 101 for analysis processing. The content of the question may be a translation of a given Japanese sentence into English, or may be a request for creation of the content, such as "Write within 100 words about your future dreams." The corrector does not necessarily have to be fluent in Japanese, and may correct based only on the learner's English composition.
 ユーザ端末151は、一例として、学習システム141にのみアクセスし、学習システム141を通じて分析システム101を利用する。ユーザ端末151は、専用のアプリケーションをインストールすることにより、学習システム141を利用しても良いし、学習システム141がASP(アプリケーション・サービス・プロバイダ)の形態で提供するプログラムにアクセスすることにより、学習システム141を利用しても良い。また、図示例では、分析システム101と学習システム141とは、独立したサーバとして、個別にネットワーク120に接続されているが、双方が単一のサーバの中に構築されていても良い。この場合には、分析システム101と学習システム141との間のデータのやり取りは、ネットワーク120を介することなく行われる。 As an example, the user terminal 151 accesses only the learning system 141 and uses the analysis system 101 through the learning system 141. The user terminal 151 may use the learning system 141 by installing a dedicated application, or may learn by accessing the program provided by the learning system 141 in the form of an ASP (application service provider). System 141 may be used. Further, in the illustrated example, the analysis system 101 and the learning system 141 are individually connected to the network 120 as independent servers, but both may be constructed in a single server. In this case, the exchange of data between the analysis system 101 and the learning system 141 is performed without going through the network 120.
 ネットワーク120には、さらに、分析システム101が利用する装置群161,163,165がサーバとして接続される。図示例では、装置群161,163,165には、品詞分析装置161、文法項目記憶装置163、語彙レベル記憶装置165が含まれる。 Further, the device groups 161, 163, 165 used by the analysis system 101 are connected to the network 120 as a server. In the illustrated example, the device group 161, 163, 165 includes a part-speech analysis device 161, a grammar item storage device 163, and a vocabulary level storage device 165.
 コンピュータを有する分析システム101は、データをやり取りするように互いに結合し、それぞれが個別の機能を果たす装置群1~17を含んでいる。装置群1~17には、ハードウェアのみで構成される装置と、コンピュータがプログラムを読み込むことにより、等価的に実現される装置とが含まれる。図示例では、装置群1~17には、入力受付部1、対応付け部2、結果出力部3、品詞分析装置アクセス部4、品詞探索部5、文法項目記憶装置アクセス部6、文法項目探索部7、解説データ記憶装置8、解説データ読出部9、距離算出部10、語彙レベル記憶装置アクセス部11、語彙レベル判定部12、語彙レベル代表値算定部13、評価データ記憶装置14、評価データ読出部15、品詞記号意味データ記憶装置16、及び親子関係データ記憶装置17が、含まれる。図1において、分析システム101内の点線の矢印は、入力受付部1から伝えられるデータの流れを表している。これらの装置群1~17,161,163,165の機能については、分析システム101に関する以下の動作説明の中で、逐次説明する。 The analysis system 101 having a computer includes devices 1 to 17 that are connected to each other so as to exchange data and each of which fulfills an individual function. The device groups 1 to 17 include a device composed of only hardware and a device equivalently realized by reading a program by a computer. In the illustrated example, the device groups 1 to 17 include an input receiving unit 1, a mapping unit 2, a result output unit 3, a part-of-speech analysis device access unit 4, a part-of-speech search unit 5, a grammar item storage device access unit 6, and a grammar item search. Unit 7, commentary data storage device 8, commentary data reading unit 9, distance calculation unit 10, grammar level storage device access unit 11, grammar level determination unit 12, vocabulary level representative value calculation unit 13, evaluation data storage device 14, evaluation data A reading unit 15, a part-time symbol meaning data storage device 16, and a parent-child relationship data storage device 17 are included. In FIG. 1, the dotted arrow in the analysis system 101 represents the flow of data transmitted from the input receiving unit 1. The functions of these device groups 1 to 17,161,163,165 will be sequentially described in the following operation description regarding the analysis system 101.
 (2.文同士の対応付け)
 分析システム101は、学習者英作文と添削者英作文との間の適切な対比を可能にするために、2つの英作文にそれぞれ含まれる文同士の対応付けを行う。対応付けは、一例として、レーベンシュタイン(Levenshtein)距離を用いて、一方の文との距離が最も小さい他方の文を、対応する文として選択する、という方法に従って行われる。レーベンシュタイン距離は、文同士の類似性を評価する周知の手法である。文同士の類似性が高いほど、距離は小さくなる。文同士が同一であれば、距離はゼロとなる。最小距離を見いだすために、様々な文同士の組み合わせについて、距離が算出される。距離の算出は、単数及び複数の文単位で行われる。一方の単一の文が他方の複数の文に対応する場合であっても、対応付けが正しく行われるように、一方の単一の文と他方の複数の文との距離も算出される。このようにして、学習者英作文と添削者英作文との間で、文単位の対応付けがなされる。
(2. Correspondence between sentences)
The analysis system 101 associates the sentences included in the two English compositions with each other in order to enable an appropriate contrast between the learner's English composition and the corrector's English composition. The mapping is performed, for example, by using the Levenshtein distance and selecting the other sentence with the shortest distance from one sentence as the corresponding sentence. The Levenshtein distance is a well-known method for assessing the similarity between sentences. The higher the similarity between sentences, the smaller the distance. If the sentences are the same, the distance will be zero. In order to find the minimum distance, the distance is calculated for various combinations of sentences. The distance is calculated in singular and multiple sentence units. Even if one single sentence corresponds to multiple other sentences, the distance between one single sentence and the other is also calculated so that the mapping is correct. In this way, sentence-based correspondence is made between the learner's English composition and the corrector's English composition.
 図1に即して、文同士の対応付けの手順を説明する。まず分析システム101の入力受付部1は、学習システム141から送られる、学習者英作文と、その添削文である添削者英作文との入力を受け付ける。対応付け部2は、入力を受け付けられた学習者英作文と添削者英作文との間で、それらを構成する文同士の対応付けを行う。対応付けのために、学習者英作文に含まれる単数及び複数の文と、添削者英作文に含まれる単数及び複数の文との様々な組み合わせについて、レーベンシュタイン距離が算出される。そうして、レーベンシュタイン距離が小さい順に、文同士の対応付けが行われる。対応付けは、学習者英作文及び添削者英作文のいずれかの中に、対応付けられない文が無くなるまで行われる。結果出力部3は、対応付け部2によってなされた対応付けの結果を、学習システム141に出力する。学習システム141は、対応付けの結果をユーザ端末151に送る。ユーザ端末151は、受け取った対応付けの結果を画面に表示する。それにより、ユーザ(例えば、学習者、教師等)は、学習者英作文と添削者英作文との間で、それらを構成する文同士の対応関係を、知ることができる。 The procedure for associating sentences with each other will be described with reference to FIG. First, the input reception unit 1 of the analysis system 101 receives the input of the learner's English composition and the corrector's English composition, which is the correction sentence, sent from the learning system 141. The associating unit 2 associates the learner's English composition and the corrector's English composition with each other. For mapping, the Levenshtein distance is calculated for various combinations of the singular and plural sentences contained in the learner's English composition and the singular and plural sentences included in the corrector's English composition. Then, the sentences are associated with each other in ascending order of the Levenshtein distance. The correspondence is performed until there are no unmatched sentences in either the learner's English composition or the corrector's English composition. The result output unit 3 outputs the result of the association made by the association unit 2 to the learning system 141. The learning system 141 sends the result of the association to the user terminal 151. The user terminal 151 displays the result of the received correspondence on the screen. As a result, the user (for example, a learner, a teacher, etc.) can know the correspondence between the learner's English composition and the corrector's English composition.
 以下において、表1~表15を参照しつつ、対応付け部2による対応付けの手順の一例を詳細に説明する。表1に例示する英文1は、学習者英作文として仮に想定した英作文である。また、表2に例示する英文2は、添削者英作文として仮に想定した英作文である。各英作文に含まれる文には、並んでいる順に、1,2,・・・、及びa,b,c,・・・を付番している。 In the following, an example of the matching procedure by the matching unit 2 will be described in detail with reference to Tables 1 to 15. The English composition 1 exemplified in Table 1 is an English composition tentatively assumed as a learner's English composition. Further, the English composition 2 exemplified in Table 2 is an English composition tentatively assumed as a corrector English composition. The sentences included in each English composition are numbered 1, 2, ..., And a, b, c, ... In the order in which they are arranged.
Figure JPOXMLDOC01-appb-T000001
Figure JPOXMLDOC01-appb-T000001
Figure JPOXMLDOC01-appb-T000002
Figure JPOXMLDOC01-appb-T000002
 単一の文だけでなく、複数の文を結合した文が類似文としてヒットする可能性があるため、表3及び表4に例示するように、複数の文を結合した文を文の集まりに追加する。結合した文を構成する文の数として、幾つまでを考慮するかについては、計算の負荷を考慮して上限を設けても良い。表3及び表4の例では、上限が「3」に設定されている。すなわち、3つの文までの結合が考慮されている。また、複数の文の結合については、図示例では、連続した文のみが結合の対象とされる。現実には、連続しない文の結合が、学習者英作文と添削者英作文との間で、良好な対応関係を示すことは、殆ど無い。従って、連続した文のみを結合の対象とする方法は、現実に問題を生じることは殆ど無く、計算時間の無駄を回避することもでき、実用的な方法であると言うことができる。 Not only a single sentence but also a sentence that combines multiple sentences may be hit as a similar sentence. Therefore, as illustrated in Tables 3 and 4, a sentence that combines multiple sentences is used as a collection of sentences. to add. An upper limit may be set in consideration of the calculation load as to how many sentences should be considered as the number of sentences constituting the combined sentence. In the examples of Tables 3 and 4, the upper limit is set to "3". That is, the combination of up to three sentences is considered. Further, regarding the combination of a plurality of sentences, in the illustrated example, only consecutive sentences are targeted for combination. In reality, the combination of discontinuous sentences rarely shows a good correspondence between the learner's English composition and the corrector's English composition. Therefore, it can be said that the method of combining only consecutive sentences causes almost no problem in reality, can avoid wasting calculation time, and is a practical method.
Figure JPOXMLDOC01-appb-T000003
Figure JPOXMLDOC01-appb-T000003
Figure JPOXMLDOC01-appb-T000004
Figure JPOXMLDOC01-appb-T000004
 次に、表5及び表6の「レーベンシュタイン距離の表」に例示するように、表3に例示した文の集まりと、表4に例示した文の集まりとの、全ての組み合わせについて、レーベンシュタイン距離が算出される。 Next, as illustrated in "Table of Levenshtein Distances" in Tables 5 and 6, for all combinations of the set of sentences exemplified in Table 3 and the set of sentences exemplified in Table 4, Levenshtein The distance is calculated.
Figure JPOXMLDOC01-appb-T000005
Figure JPOXMLDOC01-appb-T000005
Figure JPOXMLDOC01-appb-T000006
Figure JPOXMLDOC01-appb-T000006
 次に、表5及び表6の「レーベンシュタイン距離の表」に例示する文の組み合わせの中から、レーベンシュタイン距離が最も小さくなる文の組み合わせを、類似文すなわち対応する文として取得する。すなわち、レーベンシュタイン距離が表中で最小の「10」である、英作文1の番号1が「4」の文と、英作文2の番号2が「e」の文との組み合わせが選択される。最小となるレーベンシュタイン距離が複数ある場合には、例えば、それらの中から表中で最上に位置する組み合わせを取得すると良い。表7の「結果の表」は、類似文として選択された文の組み合わせを例示している。 Next, from the combinations of sentences illustrated in the "Table of Levenshtein distances" in Tables 5 and 6, the combination of sentences with the smallest Levenshtein distance is acquired as a similar sentence, that is, a corresponding sentence. That is, a combination of a sentence in which the Levenshtein distance is the smallest "10" in the table, the sentence in which the number 1 in the English composition 1 is "4", and the sentence in which the number 2 in the English composition 2 is "e" is selected. .. When there are a plurality of minimum Levenshtein distances, for example, it is advisable to obtain the combination located at the highest position in the table from among them. The "Results Table" in Table 7 illustrates a combination of sentences selected as similar sentences.
Figure JPOXMLDOC01-appb-T000007
Figure JPOXMLDOC01-appb-T000007
 次に、選択された組み合わせを構成する、英作文1の中の文(番号1が「4」の文)と、英作文2の中の文(番号2が「e」の文)とのいずれかを含む組み合わせを、表5及び表6の「レーベンシュタイン距離の表」から取り除く。その結果、「レーベンシュタイン距離の表」は、表8の通りとなる。 Next, either the sentence in the English composition 1 (the sentence in which the number 1 is "4") or the sentence in the English composition 2 (the sentence in which the number 2 is "e") constituting the selected combination. The combinations containing the above are removed from the "Levenshtein distance table" in Tables 5 and 6. As a result, the "Levenshtein distance table" is as shown in Table 8.
Figure JPOXMLDOC01-appb-T000008
Figure JPOXMLDOC01-appb-T000008
 以下、レーベンシュタイン距離の表から、英作文1と英作文2のいずれかを構成する文がなくなるまで、表7及び表8に例示した手順が繰り返される。まず、表8の「レーベンシュタイン距離の表」において、レーベンシュタイン距離が最小の「12」である、英作文1の番号1が「1+2」の文と、英作文2の番号2が「b」の文との組み合わせが、類似文として選択される。表9の「結果の表」は、その結果、これまでに類似文として選択された文の組み合わせを例示している。 Hereinafter, the procedures illustrated in Tables 7 and 8 are repeated until there are no sentences constituting either English composition 1 or English composition 2 from the Levenshtein distance table. First, in the "Levenshtein distance table" in Table 8, the sentence number 1 of English composition 1 is "1 + 2" and the sentence number 2 of English composition 2 is "b", which is the minimum Levenshtein distance "12". The combination with the sentence of is selected as a similar sentence. The "Table of Results" in Table 9 illustrates the combinations of sentences that have been selected as similar sentences as a result.
Figure JPOXMLDOC01-appb-T000009
Figure JPOXMLDOC01-appb-T000009
 次に、選択された組み合わせを構成する、英作文1の中の文(番号1が「1+2」の文)と、英作文2の中の文(番号2が「b」の文)とのいずれかを含む組み合わせを、表8の「レーベンシュタイン距離の表」から取り除く。その結果、「レーベンシュタイン距離の表」は、表10の通りとなる。 Next, either the sentence in the English composition 1 (the sentence in which the number 1 is "1 + 2") or the sentence in the English composition 2 (the sentence in which the number 2 is "b") constituting the selected combination. The combinations containing the above are removed from the "Levenshtein distance table" in Table 8. As a result, the "Levenshtein distance table" is as shown in Table 10.
Figure JPOXMLDOC01-appb-T000010
Figure JPOXMLDOC01-appb-T000010
 次に、表10の「レーベンシュタイン距離の表」において、レーベンシュタイン距離が最小の「13」である、英作文1の番号1が「3」の文と、英作文2の番号2が「a」の文との組み合わせが、類似文として選択される。表11の「結果の表」は、その結果、これまでに類似文として選択された文の組み合わせを例示している。 Next, in the "Levenshtein distance table" of Table 10, the sentence in which the Levenshtein distance is the minimum "13", the number 1 of the English composition 1 is "3", and the number 2 of the English composition 2 is "a". The combination with the sentence of "" is selected as a similar sentence. The "Table of Results" in Table 11 exemplifies, as a result, a combination of sentences previously selected as similar sentences.
Figure JPOXMLDOC01-appb-T000011
Figure JPOXMLDOC01-appb-T000011
 次に、選択された組み合わせを構成する、英作文1の中の文(番号1が「3」の文)と、英作文2の中の文(番号2が「a」の文)とのいずれかを含む組み合わせを、表10の「レーベンシュタイン距離の表」から取り除く。その結果、「レーベンシュタイン距離の表」は、表12の通りとなる。 Next, either the sentence in the English composition 1 (the sentence in which the number 1 is "3") or the sentence in the English composition 2 (the sentence in which the number 2 is "a") constituting the selected combination. The combinations containing the above are removed from the "Levenshtein distance table" in Table 10. As a result, the "Levenshtein distance table" is as shown in Table 12.
Figure JPOXMLDOC01-appb-T000012
Figure JPOXMLDOC01-appb-T000012
 次に、表12の「レーベンシュタイン距離の表」において、レーベンシュタイン距離が最小の「45」である、英作文1の番号1が「5」の文と、英作文2の番号2が「c+d」の文との組み合わせが、類似文として選択される。表13の「結果の表」は、その結果、これまでに類似文として選択された文の組み合わせを例示している。 Next, in the "Levenshtein distance table" of Table 12, the sentence in which the Levenshtein distance is the minimum "45", the number 1 of the English composition 1 is "5", and the number 2 of the English composition 2 is "c + d". The combination with the sentence of "" is selected as a similar sentence. The "table of results" in Table 13 exemplifies, as a result, a combination of sentences previously selected as similar sentences.
Figure JPOXMLDOC01-appb-T000013
Figure JPOXMLDOC01-appb-T000013
 次に、選択された組み合わせを構成する、英作文1の中の文(番号1が「5」の文)と、英作文2の中の文(番号2が「e+d」の文)とのいずれかを含む組み合わせを、表12の「レーベンシュタイン距離の表」から取り除く。その結果、「レーベンシュタイン距離の表」は、表14の通りとなる。 Next, either the sentence in the English composition 1 (the sentence in which the number 1 is "5") or the sentence in the English composition 2 (the sentence in which the number 2 is "e + d") constituting the selected combination. The combinations containing the above are removed from the "Levenshtein distance table" in Table 12. As a result, the "Levenshtein distance table" is as shown in Table 14.
Figure JPOXMLDOC01-appb-T000014
Figure JPOXMLDOC01-appb-T000014
 表14に例示するように、「レーベンシュタイン距離の表」から、文の組み合わせは、もはや無くなっている。このことは、文同士の対応付け、すなわち類似文の選択が、全て尽くされていることを意味している。従って、表13は、対応付けの最終結果を例示するものとなっている。
 表15は、文の並べ替えを適宜行い、英文1(学習者英作文)を構成する文1、英文2(添削者英作文)を構成する文2を含めた最終的な結果を例示している。表15の例では、文の並べ替えは、英文2の順序に沿うように行われている。表15に例示する最終結果が、結果出力部3から学習システム141に出力される。
As illustrated in Table 14, from the "Levenshtein Distance Table", the sentence combinations are no longer present. This means that the correspondence between sentences, that is, the selection of similar sentences, is all exhausted. Therefore, Table 13 illustrates the final result of the mapping.
Table 15 exemplifies the final results including sentence 1 constituting English sentence 1 (learner English composition) and sentence 2 constituting English sentence 2 (corrector English composition) by rearranging the sentences as appropriate. There is. In the example of Table 15, the sentences are rearranged according to the order of English sentences 2. The final results illustrated in Table 15 are output from the result output unit 3 to the learning system 141.
Figure JPOXMLDOC01-appb-T000015
Figure JPOXMLDOC01-appb-T000015
 図2は、ユーザ端末151の画面に、学習者英作文とその添削者英作文とが表示される例を示す画面図である。また、図3は、ユーザ端末151の画面に、学習者英作文とその添削者英作文との間の文同士の対応関係が表示される例を示す画面図である。図2に例示される学習者英作文と添削者英作文は、表1,2に例示した学習者英作文と添削者英作文とは、同一ではない。 FIG. 2 is a screen diagram showing an example in which a learner's English composition and its corrector's English composition are displayed on the screen of the user terminal 151. Further, FIG. 3 is a screen diagram showing an example in which the correspondence between the sentences between the learner's English composition and the corrector's English composition is displayed on the screen of the user terminal 151. The learner's English composition and the corrector's English composition illustrated in FIG. 2 are not the same as the learner's English composition and the corrector's English composition illustrated in Tables 1 and 2.
 既に述べたように、学習システム141は、ユーザ端末151から受け取った学習者英作文と、添削者端末143から受け取った添削者英作文とを、ユーザ端末151に送る。図2の画面図は、学習システム141から送られた学習者英作文と添削者英作文とが、互いに見比べ易いように同一画面上に並置して表示される例を示している。左側に学習者英作文が表示され、右側に添削者英作文が表示されている。 As already described, the learning system 141 sends the learner English composition received from the user terminal 151 and the corrector English composition received from the corrector terminal 143 to the user terminal 151. The screen diagram of FIG. 2 shows an example in which the learner's English composition and the corrector's English composition sent from the learning system 141 are displayed side by side on the same screen so that they can be easily compared with each other. The learner's English composition is displayed on the left side, and the corrector's English composition is displayed on the right side.
 図2の例では、画面上の下端に「英作文分析」ボタンが表示されている。この「英作文分析」ボタンをユーザがクリックすると、学習システム141は、学習者英作文と添削者英作文とを、分析処理のために分析システム101に送る。それにより、送られた学習者英作文と添削者英作文とについて、分析システム101による分析が開始される。分析システム101が、表15に例示する対応付けの最終結果を学習システム141に出力すると、学習システム141は、受け取った最終結果をユーザ端末151に送る。 In the example of FIG. 2, the "English composition analysis" button is displayed at the lower end of the screen. When the user clicks the "English composition analysis" button, the learning system 141 sends the learner English composition and the corrector English composition to the analysis system 101 for analysis processing. As a result, the analysis system 101 starts the analysis of the sent learner's English composition and the corrector's English composition. When the analysis system 101 outputs the final result of the mapping illustrated in Table 15 to the learning system 141, the learning system 141 sends the received final result to the user terminal 151.
 図3に例示される画面図では、学習者英作文とその添削者英作文との間で、対応関係にある文同士が、互いに横に並ぶ様に表示されている。これにより、ユーザは、学習者英作文とその添削者英作文との間の文同士の対応関係を明瞭に把握することが可能となる。図示例では、表15に例示する「番号1」、「番号2」、「距離」は表示されない。この場合、分析システム101が学習システム141に送る対応付けの最終結果には、これらのデータが含まれていなくてもよい。
 既に述べたように、文同士の対応付けは、学習者英作文及び添削者英作文のいずれかの中に、対応付けられない文が無くなるまで行われる。多くの場合は、学習者英作文及び添削者英作文との双方において、対応付けられない文は同時に無くなる。しかし、学習者英作文が、出題とは全く無関係な文を挿入していた場合等には、対応付けられない文は、添削者英作文において先に無くなり、対応付けられない文が学習者英作文に残ることもあり得る。この場合には、例えば、図3の画面図において、左側の学習者英作文の欄には、対応付けられない文も表示され、その右側の添削者英作文の欄には、「題意に無関係」といった何らかのメッセージ、あるいは空白が表示される。左側の学習者英作文の欄に表示される、対応付けられない文に取り消し線を重ねて表示しても良い。かかる処理は、例えば、対応付け部2によって行われる。なお、対応付けられない文については、以下の品詞分析、文法分析の対象にはされない。
In the screen diagram illustrated in FIG. 3, the sentences having a corresponding relationship between the learner's English composition and the corrector's English composition are displayed side by side with each other. As a result, the user can clearly grasp the correspondence between the sentences between the learner's English composition and the corrector's English composition. In the illustrated example, the "number 1", "number 2", and "distance" exemplified in Table 15 are not displayed. In this case, these data may not be included in the final result of the association sent by the analysis system 101 to the learning system 141.
As already described, the correspondence between sentences is performed until there are no unmatched sentences in either the learner's English composition or the corrector's English composition. In many cases, in both the learner's English composition and the corrector's English composition, the unassociated sentences disappear at the same time. However, if the learner's English composition inserts a sentence that is completely unrelated to the question, the sentence that cannot be associated disappears first in the corrector's English composition, and the sentence that cannot be associated is the learner's English. It may remain in the composition. In this case, for example, in the screen diagram of FIG. 3, a sentence that cannot be associated is also displayed in the learner English composition column on the left side, and the corrector English composition column on the right side is "irrelevant to the subject". ", Or a blank is displayed. Strikethrough may be superimposed on the unassociated sentence displayed in the learner English composition column on the left side. Such processing is performed by, for example, the mapping unit 2. Sentences that cannot be associated are not subject to the following part-of-speech analysis and grammatical analysis.
 (3.品詞分析)
 分析システム101は、さらに、文同士の対応付けの結果を利用して、品詞分析を行う。品詞分析では、学習者英作文と添削者英作文との間で互いに対応する文同士について、一方の文に使われている品詞と、他方の文に使われている品詞との差分を論理演算し、差異の有無に基づいて、品詞に誤り有り/無しとして、ユーザに分析結果を返す。品詞分析には、動詞、名詞などの品詞の大分類だけでなく、大分類の下位の属性、すなわち品詞の小分類、例えば動詞であれば動詞の時制、名詞であれば、名詞の単数・複数、名詞の可算・不可算などを、分析の対象とすることもできる。すなわち、小分類の品詞を含む広義の品詞を分析の対象としても良い。以下の説明では、このような広義の品詞を分析する例を取り上げる。
(3. Part of speech analysis)
The analysis system 101 further performs part-of-speech analysis by using the result of associating sentences with each other. In the part-speech analysis, the difference between the part-speech used in one sentence and the part-speech used in the other sentence is logically calculated for the sentences corresponding to each other between the learner's English composition and the corrector's English composition. Then, based on the presence or absence of the difference, the analysis result is returned to the user as whether there is an error in the part of speech. Part of speech analysis includes not only the major classification of part of speech such as verbs and nouns, but also the lower attributes of the major classification, that is, the minor classification of part of speech, for example, the verb time system for verbs and the singular / plural nouns for nouns. , Nouns, such as countable and uncountable, can also be analyzed. That is, a broadly defined part of speech including a subclass of part of speech may be the subject of analysis. In the following explanation, we will take an example of analyzing such broadly defined part of speech.
 まず、図1に即して、品詞の分析の手順を説明する。ユーザのクリック操作等によりユーザ端末151が品詞の分析を要求すると、この要求は、学習システム141を経由して分析システム101の入力受付部1に入力され、品詞分析装置アクセス部4に伝えられる。品詞分析装置アクセス部4は、要求を受け取ると、品詞分析装置161にアクセスする。品詞分析装置161は、文を入力すると、文を構成する単語毎に、その品詞を分析し、分析結果を出力する装置である。このような機能を果たす品詞分析装置161が、サーバとしてインターネット上に、既に提供されている。TreeTagger(https://www.cis.uni-muenchen.de/~schmid/tools/TreeTagger/)は、その一例である。 First, the procedure for analyzing part of speech will be explained in line with FIG. When the user terminal 151 requests the analysis of the part of speech by the user's click operation or the like, this request is input to the input receiving unit 1 of the analysis system 101 via the learning system 141 and transmitted to the part of speech analyzer access unit 4. Upon receiving the request, the part-of-speech analyzer access unit 4 accesses the part-speech analyzer 161. The part of speech analysis device 161 is a device that, when a sentence is input, analyzes the part of speech for each word constituting the sentence and outputs the analysis result. A part-of-speech analyzer 161 that fulfills such a function is already provided on the Internet as a server. TreeTagger (https://www.cis.uni-muenchen.de/~schmid/tools/TreeTagger/) is one example.
 品詞分析装置アクセス部4は、対応付け部2によって対応付けられた文の組を、品詞分析装置161に入力することにより、これらの文の組について、単語毎に品詞を分析した結果を得る。どの文の組を入力するかは、例えば、ユーザが品詞分析を要求するときに、同時に指定することができる。例えば、図3に表示された文の組のいずれかをユーザがクリックすると、プルタブあるいは小画面が現れて、「品詞分析」、「文法分析」等のメニューが表示される。ユーザは、「品詞分析」を選択することにより、品詞分析の要求と、分析すべき文の組の指定とを、同時に行うことができる。
 品詞探索部5は、品詞分析装置アクセス部4が取得した品詞に関する分析結果に基づいて、対応づけられた前記文同士の間で、単語とその品詞との組のうち、一方の文にあって他方の文にない組、又は双方の文にある組を探索する。結果出力部3は、品詞探索部5が探索した結果を学習システム141に出力する。学習システム141は、送られた探索結果をユーザ端末151に送る。ユーザ端末151は、受け取った探索結果を画面に表示する。それにより、探索結果を受け取るユーザは、対応関係にある文同士の間で、単語とその品詞との組のうち、不一致である組、又は一致している組について、知ることができる。
The part-speech analyzer access unit 4 inputs the sentence sets associated with each other by the associating unit 2 into the part-speech analysis device 161 to obtain the result of analyzing the part of speech word by word for these sentence sets. Which set of sentences to enter can be specified at the same time, for example, when the user requests part-of-speech analysis. For example, when the user clicks on any of the sentence sets displayed in FIG. 3, a pull tab or a small screen appears, and menus such as "part of speech analysis" and "grammar analysis" are displayed. By selecting "part of speech analysis", the user can request the part of speech analysis and specify the set of sentences to be analyzed at the same time.
The part-speech search unit 5 is located in one of the pairs of words and their part-speech between the associated sentences based on the analysis result of the part-speech acquired by the part-speech analyzer access unit 4. Search for pairs that are not in the other sentence, or in both sentences. The result output unit 3 outputs the result searched by the part of speech search unit 5 to the learning system 141. The learning system 141 sends the sent search result to the user terminal 151. The user terminal 151 displays the received search result on the screen. As a result, the user who receives the search result can know the disagreement group or the matching group among the pairs of the word and its part of speech among the sentences in the corresponding relationship.
 以下において、表16~表21を参照しつつ、品詞分析装置アクセス部4及び品詞探索部5による分析及び探索の手順の一例を詳細に説明する。表16に例示する「text01」と「text02」は、学習者英作文と添削者英作文の間で互いに対応付けがなされた文の組である。 In the following, an example of the analysis and search procedure by the part-speech analyzer access unit 4 and the part-speech search unit 5 will be described in detail with reference to Tables 16 to 21. “Text01” and “text02” exemplified in Table 16 are a set of sentences in which the learner's English composition and the corrector's English composition are associated with each other.
Figure JPOXMLDOC01-appb-T000016
Figure JPOXMLDOC01-appb-T000016
 品詞分析装置アクセス部4は、「text01」と「text02」を品詞分析装置161に入力することにより、それぞれ表17及び表18に例示する分析結果を得る。利用する品詞分析装置161は、一例として上述のTreeTaggerである。 The part-of-speech analyzer access unit 4 inputs "text01" and "text02" into the part-speech analyzer 161 to obtain the analysis results exemplified in Tables 17 and 18, respectively. The part of speech analyzer 161 to be used is the above-mentioned TreeTagger as an example.
Figure JPOXMLDOC01-appb-T000017
Figure JPOXMLDOC01-appb-T000017
Figure JPOXMLDOC01-appb-T000018
Figure JPOXMLDOC01-appb-T000018
 表17及び表18に例示するように、「text01」及び「text02」が、単語毎に、「単語」、「品詞記号」、「単語の原型」の3つに分析される。品詞記号は、品詞分析装置161において定められた品詞を識別する記号である。ここで「品詞」は、既に述べた広義の品詞を意味している。日文で表される「品詞記号の意味」は、品詞記号「VBD」などの意味を、日本語で表現した語句である。このような語句が、品詞分析装置161に準備されていない場合には、品詞記号と関連付けてその意味を記憶する別のサーバに、品詞分析装置アクセス部4が別途アクセスするか、あるいは、図1に例示するように、分析システム101の中の記憶装置として設けられる品詞記号意味データ記憶装置16に、品詞記号とその意味とを関連付けて記憶しておいて、品詞分析装置アクセス部4がこの品詞記号意味データ記憶装置16に別途アクセスするとよい。この場合、品詞記号とその意味のデータは、例えば、システム管理者がシステム管理者端末153を操作することにより、入力受付部1を通じて品詞記号意味データ記憶装置16に記憶させることができる。 As illustrated in Tables 17 and 18, "text01" and "text02" are analyzed for each word into three, "word", "part of speech symbol", and "prototype of word". The part-speech symbol is a symbol for identifying a part-speech defined in the part-speech analyzer 161. Here, "part of speech" means the above-mentioned part of speech in a broad sense. The "meaning of the part-of-speech symbol" expressed in Japanese is a phrase that expresses the meaning of the part-of-speech symbol "VBD" in Japanese. When such a phrase is not prepared in the part-of-speech analyzer 161, the part-of-speech analyzer access unit 4 separately accesses another server that is associated with the part-of-speech symbol and stores its meaning, or FIG. As illustrated in the above, the part-of-speech symbol meaning data storage device 16 provided as a storage device in the analysis system 101 stores the part-of-speech symbol and its meaning in association with each other, and the part-of-speech analyzer access unit 4 stores this part-of-speech. Symbol Meaning The data storage device 16 may be accessed separately. In this case, the part-of-speech symbol and its meaning data can be stored in the part-of-speech symbol meaning data storage device 16 through the input receiving unit 1 by, for example, the system administrator operating the system administrator terminal 153.
 次に、品詞探索部5は、表17及び表18に例示する分析結果から、「text01」のみにある品詞、「text02」のみにある品詞、及び「text01」と「text02」との双方にある品詞を探索する。探索では、「単語の原型」、「品詞記号の意味」は参照しない。しかし、広義の品詞に対応する「品詞記号」だけでなく、「単語」をも参照する。「品詞記号」と「単語」の双方が一致して初めて「一致」と判断する。それにより、品詞のみが一致する異なる単語があった場合など、単語と品詞のうちの一方のみが一致する場合、すなわち本来「一致」と判断すべきではない場合に、これを「一致」と判断することを回避できる。このようにして、表17及び表18の間で、(単語、品詞記号)の組が比較され、「text01」のみにある組、「text02」のみにある組、及び「text01」と「text02」との双方にある組が選び出される。表19、表20、及び表21に、探索の結果を例示する。表19、表20、及び表21に例示される探索の結果は、学習システム141を通じてユーザ端末151に送られる。 Next, the part of speech search unit 5 is found in both the part of speech in "text01", the part of speech in "text02", and both "text01" and "text02" from the analysis results exemplified in Tables 17 and 18. Search for part of speech. In the search, "prototype of word" and "meaning of part of speech symbol" are not referred to. However, it refers not only to the "part of speech symbol" corresponding to the part of speech in a broad sense, but also to the "word". Only when both the "part of speech symbol" and the "word" match, it is judged as "match". As a result, when only one of the word and the part of speech matches, that is, when there is a different word that matches only the part of speech, that is, when it should not be judged as "match", this is judged as "match". Can be avoided. In this way, the pairs of (words, part of speech symbols) are compared between Tables 17 and 18, and the pairs that are only in "text01", the pairs that are only in "text02", and "text01" and "text02". The pair in both sides is selected. Table 19, Table 20, and Table 21 illustrate the results of the search. The search results exemplified in Table 19, Table 20, and Table 21 are sent to the user terminal 151 through the learning system 141.
Figure JPOXMLDOC01-appb-T000019
Figure JPOXMLDOC01-appb-T000019
Figure JPOXMLDOC01-appb-T000020
Figure JPOXMLDOC01-appb-T000020
Figure JPOXMLDOC01-appb-T000021
Figure JPOXMLDOC01-appb-T000021
 図4は、ユーザ端末151の画面に、学習者英作文とその添削者英作文との間で互いに対応する文同士について、それらを構成する各単語とその品詞の分析結果が表示される例を示す画面図である。図示の例では、画面最上段に、双方の文が表示されている。画面上段には、各単語とその品詞記号の意味とが、文に沿って表示されている。いずれにおいても、対比し易いように、左側には学習者英作文の中の文、右側には添削者英作文の中の文が表示されている。画面中段には、対応する文同士の一方にのみある単語とその品詞記号の意味が表示されている。対比し易いように、左側には、学習者英作文の中の文にのみある単語とその品詞記号の意味が表示され、右側には添削者英作文の中の文にのみある単語とその品詞記号が表示されている。画面下段には、対応する文同士の双方にある単語とその品詞記号の意味が表示されている。ユーザは、例示される画面から、学習者英作文がどのように添削されたのか、どこに問題があったのか、どこが良かったのか、等を明瞭に理解することができる。 FIG. 4 shows an example in which the analysis results of the words constituting each of the sentences corresponding to each other between the learner's English composition and the corrector's English composition and their part of speech are displayed on the screen of the user terminal 151. It is a screen view which shows. In the illustrated example, both sentences are displayed at the top of the screen. At the top of the screen, the meaning of each word and its part-speech symbol is displayed along with the sentence. In each case, the sentences in the learner's English composition are displayed on the left side and the sentences in the corrector's English composition are displayed on the right side for easy comparison. In the middle of the screen, the meanings of words and their part-speech symbols that exist only on one side of the corresponding sentences are displayed. For easy comparison, the left side shows the meanings of words and their part of speech symbols only in the sentence in the learner's English composition, and the right side shows the words and their part of speech only in the sentence in the corrector's English composition. The symbol is displayed. At the bottom of the screen, the meanings of the words and their part-speech symbols in both of the corresponding sentences are displayed. From the illustrated screen, the user can clearly understand how the learner's English composition was corrected, where the problem was, what was good, and so on.
 (4.文法分析)
 分析システム101は、さらに、文同士の対応付けの結果を利用して、文法分析を行う。文法分析では、学習者英作文と添削者英作文との間で互いに対応する文同士について、一方の文に使われている文法項目と、他方の文に使われている文法項目との差分を論理演算し、差異の有無に基づいて、文法に誤り有り/無しとして、ユーザに分析結果を返す。対比すべき文法項目は、品詞の組み合わせ等に基づいて、予め定められた多数の文法項目の中から適切なものを選び出すことによって特定される。文法項目は、例えば、受動態、動詞+to不定詞、現在完了、などである。
(4. Grammar analysis)
The analysis system 101 further performs grammatical analysis using the result of associating sentences with each other. In the grammar analysis, the difference between the grammar item used in one sentence and the grammar item used in the other sentence is calculated for the sentences corresponding to each other between the learner's English composition and the corrector's English composition. A logical operation is performed, and the analysis result is returned to the user as having / without an error in the grammar based on the presence / absence of a difference. The grammatical items to be compared are specified by selecting an appropriate one from a large number of predetermined grammatical items based on the combination of part of speech and the like. Grammar items are, for example, passive voice, verb + to infinitive, present perfect, and so on.
 まず、図1に即して、文法分析の手順を説明する。ユーザのクリック操作等によりユーザ端末151が文法分析を要求すると、この要求は、学習システム141を経由して分析システム101の入力受付部1に入力され、文法項目記憶装置アクセス部6に伝えられる。一例として、ユーザは文法分析を要求するときに、分析すべき文の組を同時に指定することができる。例えば、図3に表示された文の組のいずれかをユーザがクリックすると、プルタブあるいは小画面が現れて、「品詞分析」、「文法分析」等のメニューが表示される。ユーザは、「文法分析」を選択することにより、文法分析の要求と、分析すべき文の組の指定とを、同時に行うことができる。
 文法項目記憶装置アクセス部6は、要求を受け取ると、文法項目記憶装置163にアクセスする。文法項目記憶装置163は、単語とその品詞とを含む単語毎の要素の1以上の組み合わせと、それが表す文法項目とを関連付けて記憶している。一例として、CEFR-Jプロジェクトの一環として文法項目の正規表現集が公表されている。(リンク先:http://cefr-j.org/PDF/sympo2020/CEFRJGP_GRAMMATICAL_ITEM_LIST.pdf)。この正規表現集には、現在のところ、501種の文法項目が、対応する単語毎の要素の1以上の組み合わせの正規表現と関連付けて収録されている。この正規表現集では、単語毎の要素は、表17及び表18に例示した、「単語」、「品詞記号」、「単語の原型」を内容としている。「正規表現」とは、メタ文字を用いて、何通りもの要素の組を簡潔に表現する、周知の一表現形式を意味する。上記の正規表現集に収録される正規表現は、後述の表23に例示される。文法項目記憶装置アクセス部6は、文法項目記憶装置163にアクセすることにより、文法項目記憶装置163に記憶される単語毎の要素の組み合わせの中から、対応付け部2によって対応付けられた文の組に含まれる単語毎の要素の組み合わせに一致するものを探索し、対応する文法項目を拾い出す。
First, the procedure of grammar analysis will be described with reference to FIG. When the user terminal 151 requests a grammar analysis by a user's click operation or the like, this request is input to the input receiving unit 1 of the analysis system 101 via the learning system 141 and transmitted to the grammar item storage device access unit 6. As an example, when requesting a grammar analysis, the user can specify the set of sentences to be analyzed at the same time. For example, when the user clicks on any of the sentence sets displayed in FIG. 3, a pull tab or a small screen appears, and menus such as "part of speech analysis" and "grammar analysis" are displayed. By selecting "grammar analysis", the user can request grammar analysis and specify a set of sentences to be analyzed at the same time.
Upon receiving the request, the grammar item storage device access unit 6 accesses the grammar item storage device 163. The grammar item storage device 163 stores one or more combinations of elements for each word including a word and its part of speech in association with the grammar item represented by the combination. As an example, a collection of regular expressions for grammar items has been published as part of the CEFR-J project. (Link: http://cefr-j.org/PDF/sympo2020/CEFRJGP_GRAMMATICAL_ITEM_LIST.pdf). At present, 501 kinds of grammatical items are recorded in this regular expression collection in association with a regular expression of one or more combinations of elements for each corresponding word. In this regular expression collection, the elements for each word include "words", "part of speech symbols", and "prototypes of words" exemplified in Tables 17 and 18. "Regular expression" means a well-known expression form that concisely expresses a number of sets of elements using metacharacters. The regular expressions included in the above regular expression collection are exemplified in Table 23 below. The grammar item storage device access unit 6 accesses the grammar item storage device 163 to select a sentence associated with the grammar item storage device 163 from among the combinations of elements stored in the grammar item storage device 163 for each word. Search for the one that matches the combination of elements for each word included in the set, and pick up the corresponding grammar item.
 文法項目記憶装置163の記憶内容は、分析システム101が有する記憶装置に記憶させておくこともできる。この場合には、文法項目記憶装置アクセス部6は、文法項目記憶装置163に代えて、当該記憶装置にアクセスすると良い。 The stored contents of the grammar item storage device 163 can also be stored in the storage device of the analysis system 101. In this case, the grammar item storage device access unit 6 may access the storage device instead of the grammar item storage device 163.
 文法項目探索部7は、文法項目記憶装置アクセス部6が取得した文法項目に基づいて、対応づけられた前記文同士の間で、一方の文にあって他方の文にない文法項目、又は双方の文にある文法項目を探索する。結果出力部3は、文法項目探索部7が探索した結果を学習システム141に出力する。学習システム141は、送られた探索結果をユーザ端末151に送る。ユーザ端末151は、受け取った探索結果を画面に表示する。それにより、探索結果を受け取るユーザは、対応関係にある文同士の間で、不一致である文法項目、又は一致している文法項目について、知ることができる。 The grammar item search unit 7 is based on the grammar item acquired by the grammar item storage device access unit 6, and the grammar item that is in one sentence but not in the other sentence, or both, between the associated sentences. Search for grammar items in the sentence. The result output unit 3 outputs the result searched by the grammar item search unit 7 to the learning system 141. The learning system 141 sends the sent search result to the user terminal 151. The user terminal 151 displays the received search result on the screen. As a result, the user who receives the search result can know the grammatical items that do not match or the grammatical items that match between the sentences in the corresponding relationship.
 以下において、表22~表28を参照しつつ、文法項目記憶装置アクセス部6及び文法項目探索部7による探索の手順の一例を詳細に説明する。学習者英作文と添削者英作文の間で互いに対応付けがなされた文の組として、表16に例示する「text01」と「text02」を、再び取り上げる。文法項目記憶装置アクセス部6は、品詞分析装置アクセス部4と同様に、「text01」と「text02」を品詞分析装置161に入力することにより、それぞれ表17及び表18に例示する分析結果を得る。文法項目記憶装置アクセス部6は、新たに品詞分析装置161にアクセスすることなく、品詞分析装置アクセス部4が既に得ている分析結果を、品詞分析装置アクセス部4から取得しても良い。文法項目記憶装置アクセス部6は、次に、表17及び表18に例示する分析結果を、表22に例示するように、「text01」、「text02」の元の文の形式に展開する。すなわち、「text01」、「text02」の文を、単語毎に「単語_品詞記号_単語の原型」の形式に展開する。 In the following, an example of the search procedure by the grammar item storage device access unit 6 and the grammar item search unit 7 will be described in detail with reference to Tables 22 to 28. As a set of sentences in which the learner's English composition and the corrector's English composition are associated with each other, "text01" and "text02" exemplified in Table 16 are taken up again. Similar to the part-speech analyzer access unit 4, the grammar item storage device access unit 6 inputs "text01" and "text02" to the part-speech analyzer 161 to obtain the analysis results exemplified in Tables 17 and 18, respectively. .. The grammar item storage device access unit 6 may acquire the analysis result already obtained by the part of speech analysis device access unit 4 from the part of speech analysis device access unit 4 without newly accessing the part of speech analysis device 161. Next, the grammar item storage device access unit 6 expands the analysis results exemplified in Tables 17 and 18 into the original sentence formats of "text01" and "text02" as illustrated in Table 22. That is, the sentences "text01" and "text02" are expanded into the form of "word_part of speech symbol_word prototype" for each word.
Figure JPOXMLDOC01-appb-T000022
Figure JPOXMLDOC01-appb-T000022
 表23は、上に述べた正規表現集に収録される正規表現のうちの2例を例示している。各正規表現は、文法項目に関連づけて収録されており、各々が「単語_品詞記号_単語の原型」を内容とする要素の1以上の組により記述されている。従って、正規表現集を記憶する文法項目記憶装置163は、各要素が「単語」、「(広義の)品詞」、「単語の原型」を含んでいる要素の1以上の組と、それが表す文法項目とを、互いに関連付けて記憶していることに等しい。 Table 23 exemplifies two examples of the regular expressions recorded in the above-mentioned regular expression collection. Each regular expression is recorded in association with a grammatical item, and each is described by one or more sets of elements whose contents are "word_part of speech symbol_word prototype". Therefore, the grammar item storage device 163 for storing a regular expression collection represents one or more sets of elements in which each element contains a "word", a "(broadly defined) part of speech", and a "prototype of a word". It is equivalent to remembering grammatical items in association with each other.
Figure JPOXMLDOC01-appb-T000023
Figure JPOXMLDOC01-appb-T000023
 文法項目記憶装置アクセス部6は、文法項目記憶装置163にアクセスし、501種の正規表現のそれぞれについて、表22に例示する「text01」及び「text02」の展開された文の中の該当する文字列を探索する。探索の結果を、表24及び表25に例示する。 The grammar item storage device access unit 6 accesses the grammar item storage device 163, and for each of the 501 types of regular expressions, the corresponding characters in the expanded sentences of "text01" and "text02" exemplified in Table 22. Explore columns. The results of the search are illustrated in Tables 24 and 25.
Figure JPOXMLDOC01-appb-T000024
Figure JPOXMLDOC01-appb-T000024
 表24に例示するように、「text01」については、該当する文字列が見出されたのは、正規表現識別番号「111」及び「45」で識別される正規表現である。該当する文字列(表において「マッチした語句」)は、正規表現「111」については「played」と「I_PP_I」であり、正規表現「45」については、2つの「with」である。記号「<->」は、該当する文字列が複数存在する場合に、それらを並記することを意味している。 As illustrated in Table 24, for "text01", the corresponding character string was found in the regular expression identified by the regular expression identification numbers "111" and "45". The corresponding character strings (“matched words” in the table) are “played” and “I_PP_I” for the regular expression “111” and two “with” for the regular expression “45”. The symbol "<->" means that when there are a plurality of corresponding character strings, they are listed side by side.
Figure JPOXMLDOC01-appb-T000025
Figure JPOXMLDOC01-appb-T000025
 表25に例示するように、「text02」については、該当する文字列が見出されたのは、正規表現識別番号「107」、「115」、「219」及び「45」で識別される正規表現である。該当する文字列は、正規表現「107」については「was」と「I_PP_I」であり、正規表現「115」については、「was playing」と「was」であり、正規表現「219」については、「playing」であり、正規表現「45」については、2つの「with」である。正規表現「107」、「115」、「219」の間には親子関係が存在する。すなわち、正規表現「115」は、正規表現「107」、「219」を含んでおり、それらの親の関係にある。このような親子関係は、正規表現自体の関係として存在するものであることから、親子関係を記述したデータを、文法項目記憶装置163に記憶させておくか、又は図1に例示するように分析システム101内の記憶装置として設けられる親子関係データ記憶装置17に記憶させておくことにより、文法項目記憶装置アクセス部6は、表24及び表25に例示するように、かかる親子関係についても把握することができる。後者の場合には、文法項目記憶装置アクセス部6は、文法項目記憶装置163に加えて、親子関係データ記憶装置17にもアクセスする。親子関係データ記憶装置17には、システム管理者がシステム管理者端末153を操作することにより、入力受付部1を通じてデータを記憶させることができる。 As illustrated in Table 25, for "text02", the corresponding character string was found in the regular expression identification numbers "107", "115", "219" and "45". It is an expression. The corresponding character strings are "was" and "I_PP_I" for the regular expression "107", "was playing" and "was" for the regular expression "115", and "was" for the regular expression "219". It is "playing", and for the regular expression "45", there are two "with". There is a parent-child relationship between the regular expressions "107", "115", and "219". That is, the regular expression "115" includes the regular expressions "107" and "219", and is in a parental relationship with them. Since such a parent-child relationship exists as a relationship of the normal expression itself, the data describing the parent-child relationship is stored in the grammar item storage device 163 or analyzed as illustrated in FIG. By storing in the parent-child relationship data storage device 17 provided as a storage device in the system 101, the grammar item storage device access unit 6 also grasps such a parent-child relationship as illustrated in Tables 24 and 25. be able to. In the latter case, the grammar item storage device access unit 6 accesses the parent-child relationship data storage device 17 in addition to the grammar item storage device 163. The parent-child relationship data storage device 17 can store data through the input reception unit 1 by the system administrator operating the system administrator terminal 153.
 次に、文法項目探索部7は、表24及び表25に例示する探索結果から、「text01」のみにある正規表現、「text02」のみにある正規表現、及び「text01」と「text02」との双方にある正規表現を探索する。探索では、正規表現識別番号だけでなく、該当した文字列(表において「マッチした語句」)をも参照する。すなわち、正規表現識別番号と該当する文字列との双方が一致して初めて「一致」と判断する。それにより、正規表現識別番号のみが一致する異なる文字列があった場合など、本来「一致」と判断すべきではない場合に、これを「一致」と判断することを回避できる。このようにして、表24及び表25の間で、(正規表現識別番号、マッチした語句)の組が比較され、「text01」のみにある組、「text02」のみにある組、及び「text01」と「text02」との双方にある組が選び出される。表26~表28に、探索の結果を例示する。表27において、正規表現「115」は、正規表現「107」、「219」の親の関係にある。このため、文法項目探索部7は、子の関係にある正規表現「107」、「219」を、「text02」のみにある正規表現から削除してもよい。表26~表28に例示される探索の結果は、学習システム141を通じてユーザ端末151に送られる。 Next, from the search results exemplified in Table 24 and Table 25, the grammar item search unit 7 sets the regular expression only in "text01", the regular expression only in "text02", and "text01" and "text02". Search for regular expressions on both sides. In the search, not only the regular expression identification number but also the corresponding character string (“matched phrase” in the table) is referred to. That is, it is judged as "match" only when both the regular expression identification number and the corresponding character string match. As a result, it is possible to avoid determining "match" when it should not be determined as "match", such as when there is a different character string in which only the regular expression identification number matches. In this way, the pairs of (regular expression identification number, matched phrase) are compared between Tables 24 and 25, and the pairs that are found only in "text01", the pairs that are found only in "text02", and the pairs that are found in "text01". And "text02" are both selected. Tables 26 to 28 exemplify the results of the search. In Table 27, the regular expression "115" has a parental relationship with the regular expressions "107" and "219". Therefore, the grammar item search unit 7 may delete the regular expressions "107" and "219" that are related to each other from the regular expressions that exist only in "text02". The search results exemplified in Tables 26 to 28 are sent to the user terminal 151 through the learning system 141.
Figure JPOXMLDOC01-appb-T000026
Figure JPOXMLDOC01-appb-T000026
Figure JPOXMLDOC01-appb-T000027
Figure JPOXMLDOC01-appb-T000027
Figure JPOXMLDOC01-appb-T000028
Figure JPOXMLDOC01-appb-T000028
 図5は、ユーザ端末151の画面に、学習者英作文とその添削者英作文との間で互いに対応する文同士について、それらの文法項目の分析結果が表示される例を示す画面図である。図示の例では、画面最上段に、双方の文が表示されている。対比し易いように、左側には学習者英作文の中の文、右側には添削者英作文の中の文が表示されている。画面上段には、それぞれの文の文法項目が表示されている。文法項目は、該当した文字列(表24、表25などの「マッチした語句」)と併せて表示される。画面中段には、対応する文同士の一方にのみある文法項目が表示されている。対比し易いように、左側には、学習者英作文の中の文にのみある文法項目が表示され、右側には添削者英作文の中の文にのみある文法項目が表示されている。画面下段には、対応する文同士の双方にある文法項目が表示されている。ユーザは、例示される画面から、学習者英作文のどこに文法上の問題があったのか、どこが良かったのか、等を明瞭に理解することができる。 FIG. 5 is a screen diagram showing an example in which the analysis results of the grammatical items of the sentences corresponding to each other between the learner's English composition and the corrector's English composition are displayed on the screen of the user terminal 151. .. In the illustrated example, both sentences are displayed at the top of the screen. For easy comparison, the sentences in the learner's English composition are displayed on the left side, and the sentences in the corrector's English composition are displayed on the right side. The grammar items of each sentence are displayed in the upper part of the screen. The grammar item is displayed together with the corresponding character string (“matched words / phrases” in Tables 24, 25, etc.). In the middle of the screen, grammar items that exist only in one of the corresponding sentences are displayed. For easy comparison, the grammar items found only in the sentences in the learner's English composition are displayed on the left side, and the grammar items found only in the sentences in the corrector's English composition are displayed on the right side. At the bottom of the screen, grammar items on both sides of the corresponding sentence are displayed. From the illustrated screen, the user can clearly understand where the grammatical problem was in the learner's English composition, what was good, and so on.
 文法分析の結果、文法に誤りがあった場合には、分析の結果だけでなく、誤った文法に対応する適切な文法解説を選択して、ユーザに提供することも可能である。例えば、図5に例示する画面において、文法項目の不一致を表示する中段には、文法項目にリンクが張られていて、ユーザが所望の文法項目をクリックすると、その文法項目に対応する解説が画面に表示される。この機能は、図1に例示するように、文法項目とその解説動画などの解説データとを、互いに関連付けて記憶する解説データ記憶装置8を準備することにより、実現可能である。解説データは、システム管理者がシステム管理者端末153を操作することにより、入力受付部1を通じて解説データ記憶装置8に記憶させることができる。 If there is an error in the grammar as a result of the grammar analysis, it is possible to select not only the result of the analysis but also an appropriate grammar explanation corresponding to the incorrect grammar and provide it to the user. For example, in the screen illustrated in FIG. 5, a link is provided to the grammar item in the middle of displaying the discrepancy of the grammar item, and when the user clicks the desired grammar item, the explanation corresponding to the grammar item is displayed on the screen. Is displayed in. As illustrated in FIG. 1, this function can be realized by preparing an explanatory data storage device 8 that stores grammatical items and explanatory data such as explanatory videos thereof in association with each other. The explanatory data can be stored in the explanatory data storage device 8 through the input receiving unit 1 by the system administrator operating the system administrator terminal 153.
 ユーザのクリック操作によりユーザ端末151が文法解説を要求すると、この要求は、学習システム141を経由して分析システム101の入力受付部1に入力され、解説データ読出部9に伝えられる。要求には、例えば、表26、表27に例示する「text01」、「text02」などの文を特定するデータ、及び番号[0]、[1]などの文中の文法項目を特定するデータが含まれる。解説データ読出部9は、要求に含まれるデータから、解説が求められる文法項目の正規表現識別番号を、表26及び表27に例示する文法項目探索部7が探索して得たデータから取得し、取得した正規表現識別番号に該当する解説データを、解説データ記憶装置8から取得する。取得された解説データは、解説データ読出部9から結果出力部3、学習システム141を経由し、ユーザ端末151に送られる。ユーザ端末151から入力受付部1に伝えられる要求には、文法項目を特定する正規表現識別番号が含まれていても良い。この場合には、解説データ読出部9は、文法項目探索部7が探索して得たデータを参照することなく、正規表現識別番号に該当する解説データを、解説データ記憶装置8から取得することができる。 When the user terminal 151 requests a grammar explanation by a user's click operation, this request is input to the input receiving unit 1 of the analysis system 101 via the learning system 141 and transmitted to the explanation data reading unit 9. The request includes, for example, data specifying sentences such as "text01" and "text02" exemplified in Tables 26 and 27, and data specifying grammatical items in sentences such as numbers [0] and [1]. Is done. The explanation data reading unit 9 acquires the regular expression identification number of the grammar item for which explanation is required from the data included in the request from the data obtained by searching the grammar item search unit 7 exemplified in Tables 26 and 27. , The explanatory data corresponding to the acquired regular expression identification number is acquired from the explanatory data storage device 8. The acquired commentary data is sent from the commentary data reading unit 9 to the user terminal 151 via the result output unit 3 and the learning system 141. The request transmitted from the user terminal 151 to the input receiving unit 1 may include a regular expression identification number that specifies a grammatical item. In this case, the commentary data reading unit 9 acquires the commentary data corresponding to the regular expression identification number from the commentary data storage device 8 without referring to the data searched and obtained by the grammar item search unit 7. Can be done.
 (5.英作文の評価)
 分析システム101は、さらに、学習者英作文とその添削者英作文との間の距離、及び学習者英作文の語彙レベルの双方に基づいた学習者英作文に対する評価を、ユーザに返すことも可能である。この機能は、例えば図1に例示するように、語彙レベル記憶装置165、語彙レベル記憶装置アクセス部11、語彙レベル判定部12、語彙レベル代表値算定部13、評価データ記憶装置14、及び評価データ読出部15を設けることにより、実現可能である。語彙レベル記憶装置165は、多数の単語について、単語とその語彙レベルとを、互いに関連付けて記憶する。語彙レベルとして、例えばCEFR-Jに定められる語彙レベルを利用することができる。CEFR-Jプロジェクトは、多数の単語についてCEFR-Jが定める語彙レベルを記述した語彙レベルデータを、スプレッドシートの形式で提供している(http://www.cefr-j.org/download.html#cefrj_wordlist)。
(5. Evaluation of English composition)
The analysis system 101 can also return to the user an evaluation of the learner's English composition based on both the distance between the learner's English composition and its corrector's English composition and the vocabulary level of the learner's English composition. Is. As illustrated in FIG. 1, this function includes a vocabulary level storage device 165, a vocabulary level storage device access unit 11, a vocabulary level determination unit 12, a vocabulary level representative value calculation unit 13, an evaluation data storage device 14, and evaluation data. This can be achieved by providing the reading unit 15. The vocabulary level storage device 165 stores a large number of words in association with each other and the vocabulary level. As the vocabulary level, for example, the vocabulary level defined in CEFR-J can be used. The CEFR-J project provides vocabulary level data in spreadsheet format that describes the vocabulary levels defined by the CEFR-J for a large number of words (http://www.cefr-j.org/download.html). #cefrj_wordlist).
 語彙レベル記憶装置165は、例えばCEFR-Jが提供する語彙レベルデータを記憶している。語彙レベル記憶装置165が、すでにサーバとしてネットワーク120に接続されており、利用可能であれば、それを利用することができる。利用可能な語彙レベル記憶装置165が無ければ、語彙レベル記憶装置165を新たに設け、システム管理者がシステム管理者端末153を操作することにより、語彙レベルデータを記憶させるとよい。語彙レベル記憶装置165をネットワーク120に接続されるサーバとして設ける代わりに、分析システム101が有する記憶装置に、語彙レベルデータを記憶させてもよい。この場合には、語彙レベル記憶装置アクセス部11は、分析システム101が有する記憶装置にアクセスする。 The vocabulary level storage device 165 stores vocabulary level data provided by, for example, CEFR-J. The vocabulary level storage device 165 is already connected to the network 120 as a server, and if available, it can be used. If there is no available vocabulary level storage device 165, a new vocabulary level storage device 165 may be provided and the system administrator may operate the system administrator terminal 153 to store the vocabulary level data. Instead of providing the vocabulary level storage device 165 as a server connected to the network 120, the storage device included in the analysis system 101 may store the vocabulary level data. In this case, the vocabulary level storage device access unit 11 accesses the storage device included in the analysis system 101.
 ユーザが所定の操作をすることにより、ユーザ端末151が学習システム141に評価の要求を送ると、学習システム141は、送られた要求を分析システム101の入力受付部1に入力する。入力受付部1は、受け取った要求を語彙レベル判定部12と距離算出部10に伝える。ユーザの所定の操作は、例えば、図3~図5に例示するいずれかの画面、あるいはそれら全ての画面において、評価を要求するためのボタンが用意されており、これをクリックすることにより行われる。 When the user terminal 151 sends an evaluation request to the learning system 141 by the user performing a predetermined operation, the learning system 141 inputs the sent request to the input reception unit 1 of the analysis system 101. The input reception unit 1 conveys the received request to the vocabulary level determination unit 12 and the distance calculation unit 10. A user's predetermined operation is performed by, for example, providing a button for requesting an evaluation on any of the screens illustrated in FIGS. 3 to 5 or all of them, and clicking the button. ..
 語彙レベル判定部12は、要求を受け取ると、語彙レベル記憶装置アクセス部11に語彙レベル記憶装置165にアクセスさせることにより、学習者英作文に含まれる各単語について、その語彙レベルを取得する。学習者英作文は、すでに入力受付部1から対応付け部2等に伝えられていることから、語彙レベル判定部12は、例えば対応付け部2から学習者英作文を取得することができる。語彙レベル判定部12は、学習者英作文に含まれる全ての単語について、語彙レベルを取得するのではなく、所定の条件を満たす単語についてのみ語彙レベルを取得しても良い。例えば、単語のうち機能語を省き、内容語のみについて、語彙レベルを取得しても良い。CEFR-Jプロジェクトが提供するデータには、単語毎の品詞も記述されており、品詞を参照することにより、機能語と内容語とを弁別することが可能である。 Upon receiving the request, the vocabulary level determination unit 12 causes the vocabulary level storage device access unit 11 to access the vocabulary level storage device 165 to acquire the vocabulary level for each word included in the learner's English composition. Since the learner's English composition has already been transmitted from the input reception unit 1 to the correspondence unit 2, etc., the vocabulary level determination unit 12 can acquire the learner's English composition from, for example, the association unit 2. The vocabulary level determination unit 12 may acquire the vocabulary level only for the words satisfying a predetermined condition, instead of acquiring the vocabulary level for all the words included in the learner's English composition. For example, the function words may be omitted from the words, and the vocabulary level may be obtained only for the content words. The data provided by the CEFR-J project also describes the part of speech for each word, and it is possible to distinguish between function words and content words by referring to the part of speech.
 語彙レベル判定部12が取得した単語毎の語彙レベルは、語彙レベル代表値算定部13に伝えられることにより、それらの代表値が算出される。代表値は、例えば平均値、中央値などである。 The vocabulary level for each word acquired by the vocabulary level determination unit 12 is transmitted to the vocabulary level representative value calculation unit 13, so that the representative values are calculated. The representative value is, for example, an average value, a median value, or the like.
 距離算出部10は、入力受付部1から要求を受け取ると、学習者英作文とその添削者英作文との間の距離を算出する。ここで算出される距離は、学習者英作文と添削者英作文に含まれる文ごとの距離ではなく、学習者英作文の全体と添削者英作文の全体との間の距離である。距離は、例えばレーベンシュタイン距離である。距離算出部10は、既に入力受付部1から学習者英作文とその添削者英作文とを受け取っているので、それらの間の距離を算出することが可能である。 Upon receiving the request from the input reception unit 1, the distance calculation unit 10 calculates the distance between the learner's English composition and the corrector's English composition. The distance calculated here is not the distance between the learner's English composition and each sentence included in the corrector's English composition, but the distance between the entire learner's English composition and the entire corrector's English composition. The distance is, for example, the Levenshtein distance. Since the distance calculation unit 10 has already received the learner's English composition and the corrector's English composition from the input reception unit 1, it is possible to calculate the distance between them.
 評価データ読出部15は、算出された語彙レベル代表値と、算出された距離とに対応する評価データを、評価データ記憶装置14から読み出す。評価データ記憶装置14には、学習者英作文の語彙レベル代表値と、学習者英作文と添削者英作文との間の距離との双方に応じた評価データが記憶されている。図6には、評価データの一例が示されている。図6の例では、語彙レベル代表値と距離とにより構成される評価空間を4区域に分割し、各領域には個別の評価データが関連付けられている。評価空間をさらに多数の区域に分割することも可能である。また、区域は必ずしも矩形の区域に限られない。評価データは、システム管理者がシステム管理者端末153を操作することにより、入力受付部1を通じて評価データ記憶装置14に記憶させることができる。 The evaluation data reading unit 15 reads out the evaluation data corresponding to the calculated vocabulary level representative value and the calculated distance from the evaluation data storage device 14. The evaluation data storage device 14 stores evaluation data according to both the vocabulary level representative value of the learner's English composition and the distance between the learner's English composition and the corrector's English composition. FIG. 6 shows an example of evaluation data. In the example of FIG. 6, the evaluation space composed of the vocabulary level representative value and the distance is divided into four areas, and individual evaluation data is associated with each area. It is also possible to divide the evaluation space into a larger number of areas. Also, the area is not necessarily limited to a rectangular area. The evaluation data can be stored in the evaluation data storage device 14 through the input receiving unit 1 by the system administrator operating the system administrator terminal 153.
 評価データ読出部15は、読み出した評価データを結果出力部3へ伝える。結果出力部3は、受け取った評価データを学習システム141に伝え、学習システム141は、受け取った評価データをユーザ端末151に送る。それにより、ユーザは、英作文と添削文との相違の度合いと、英作文の語彙レベルとに基づく評価を得ることができる。
 語彙レベル記憶装置165の記憶内容は、分析システム101が有する記憶装置(図示略)に記憶させても良い。また、文同士の対応付けの要求無しで、学習者英作文とその添削者英作文とに基づく評価の要求が、ユーザからあった場合には、学習システム141は、学習者英作文とその添削者英作文を、分析システム101の入力受付部1に送り、入力受付部1は、これらのデータを、語彙レベル判定部12、距離算出部10に伝えるようにすることにより、評価データ読出部15は評価データを取得して、結果出力部3に伝えることができる。
The evaluation data reading unit 15 transmits the read evaluation data to the result output unit 3. The result output unit 3 transmits the received evaluation data to the learning system 141, and the learning system 141 sends the received evaluation data to the user terminal 151. Thereby, the user can obtain an evaluation based on the degree of difference between the English composition and the correction sentence and the vocabulary level of the English composition.
The storage content of the vocabulary level storage device 165 may be stored in a storage device (not shown) included in the analysis system 101. Further, when the user requests an evaluation based on the learner's English composition and the corrector's English composition without requesting the correspondence between the sentences, the learning system 141 sets the learner's English composition and its correction. The person's English composition is sent to the input reception unit 1 of the analysis system 101, and the input reception unit 1 transmits these data to the vocabulary level determination unit 12 and the distance calculation unit 10, so that the evaluation data reading unit 15 Can acquire evaluation data and convey it to the result output unit 3.
 本出願は、2020年12月13日に日本国に本出願人により出願された特願2020-206387号に基づくものであり、その全内容は参照により本出願に組み込まれる。 This application is based on Japanese Patent Application No. 2020-206387 filed by the applicant in Japan on December 13, 2020, the entire contents of which are incorporated into this application by reference.
 本発明の特定の実施の形態についての上記説明は、例示を目的として提示したものである。それらは、網羅的であったり、記載した形態そのままに本発明を制限したりすることを意図したものではない。数多くの変形や変更が、上記の記載内容に照らして可能であることは当業者に自明である。 The above description of the specific embodiment of the present invention is presented for the purpose of illustration. They are not intended to be exhaustive or to limit the invention as it is described. It is self-evident to those skilled in the art that numerous modifications and changes are possible in the light of the above description.
 1 入力受付部、 2 対応付け部、 3 結果出力部、 4 品詞分析装置アクセス部、 5 品詞探索部、 6 文法項目記憶装置アクセス部、 7 文法項目探索部、 8 解説データ記憶装置、 9 解説データ読出部、 10 距離算出部、 11 語彙レベル記憶装置アクセス部、 12 語彙レベル判定部、 13 語彙レベル代表値算定部、 14 評価データ記憶装置、 15 評価データ読出部、 16 品詞記号意味データ記憶装置、 17 親子関係データ記憶装置、 101 英作文添削結果分析システム、 120 ネットワーク、 141 学習システム、 143 添削者端末、 151 ユーザ端末、 153 管理者端末、 161 品詞分析装置、 163 文法項目記憶装置、 165 語彙レベル記憶装置。 1 Input reception unit, 2 Correspondence unit, 3 Result output unit, 4 Part code analysis device access unit, 5 Part code search unit, 6 Grammar item storage device access unit, 7 Grammar item search unit, 8 Explanation data storage device, 9 Explanation data Reading unit, 10 distance calculation unit, 11 grammar level storage device access unit, 12 grammar level judgment unit, 13 grammar level representative value calculation unit, 14 evaluation data storage device, 15 evaluation data reading unit, 16 part code symbol meaning data storage device, 17 Parent-child relationship data storage device, 101 English composition correction result analysis system, 120 network, 141 learning system, 143 corrector terminal, 151 user terminal, 153 administrator terminal, 161 part lyrics analysis device, 163 grammar item storage device, 165 grammar level Storage device.

Claims (8)

  1.  英作文とその添削文との入力を受け付ける入力受付部と、
     入力を受け付けられた前記英作文と前記添削文との間で、それらを構成する文同士の対応付けを行う対応付け部と、
     前記対応付け部が行った前記対応付けの結果を出力する結果出力部と、を備え、
     前記対応付け部は、所定基準に基づく文同士の距離が最小となるように、前記文同士の対応付けを行う、英作文添削結果分析システム。
    An input reception section that accepts input between English composition and its corrections,
    A correspondence unit that associates the sentences constituting them between the English composition and the correction sentence for which input is accepted, and
    A result output unit that outputs the result of the association performed by the association unit is provided.
    The correspondence unit is an English composition correction result analysis system that associates sentences with each other so that the distance between sentences based on a predetermined standard is minimized.
  2.  前記対応付け部は、前記英作文に含まれる単数及び複数の文と、前記添削文に含まれる単数及び複数の文との間で、前記所定基準に基づく文同士の距離が小さい順に、前記英作文及び前記添削文のいずれかの中に、対応付けられない文が無くなるまで、前記文同士の対応付けを行う、請求項1に記載の英作文添削結果分析システム。 In the corresponding unit, the English composition includes the single sentence and the plurality of sentences, and the single sentence and the plurality of sentences included in the correction sentence, in ascending order of the distance between the sentences based on the predetermined standard. The English composition correction result analysis system according to claim 1, wherein the sentences are associated with each other until there are no unmatched sentences in either the composition or the correction sentence.
  3.  文の入力を受付け、入力された文を構成する単語毎に、その品詞を分析し、その分析結果を出力する品詞分析装置に、アクセスする品詞分析装置アクセス部と、
     対応づけられた前記文同士の間で、単語とその品詞との組のうち、一方の文にあって他方の文にない組、又は双方の文にある組を、前記品詞分析装置アクセス部が前記品詞分析装置にアクセスすることにより得られる分析結果に基づいて、探索する品詞探索部と、をさらに備え、
     前記結果出力部は、前記品詞探索部が行った探索の結果をも出力する、請求項1又は2に記載の英作文添削結果分析システム。
    The part-of-speech analyzer access unit that accepts the input of a sentence, analyzes the part of speech for each word that composes the input sentence, and accesses the part-of-speech analyzer that outputs the analysis result.
    Among the pairs of words and their part of speech between the associated sentences, the part of speech analyzer access unit selects a set in one sentence that is not in the other sentence, or a set in both sentences. A part-speech search unit for searching based on the analysis result obtained by accessing the part-speech analyzer is further provided.
    The English composition correction result analysis system according to claim 1 or 2, wherein the result output unit also outputs the result of the search performed by the part of speech search unit.
  4.  各要素が単語とその品詞とを含む1以上の要素と、対応する文法項目とを、関連付けて記憶する文法項目記憶装置にアクセスする文法項目記憶装置アクセス部と、
     対応づけられた前記文同士の間で、文法項目のうち、一方の文にあって他方の文にない文法項目、又は双方の文にある文法項目を、前記文法項目記憶装置アクセス部が前記文法項目記憶装置にアクセスすることにより得られる記憶内容に基づいて、探索する文法項目探索部と、をさらに備え、
     前記結果出力部は、前記文法項目探索部が行った探索の結果をも出力する、請求項1から3のいずれかに記載の英作文添削結果分析システム。
    A grammar item storage device access unit that accesses a grammar item storage device that stores one or more elements, each element including a word and its part of speech, and a corresponding grammar item in association with each other.
    Among the grammar items that are associated with each other, the grammar item that is in one sentence but not in the other sentence, or the grammar item in both sentences is selected by the grammar item storage device access unit. Further equipped with a grammar item search unit for searching based on the stored contents obtained by accessing the item storage device.
    The English composition correction result analysis system according to any one of claims 1 to 3, wherein the result output unit also outputs the result of the search performed by the grammar item search unit.
  5.  前記文法項目探索部は、前記文法項目だけでなく、前記文法項目に対応する前記1以上の要素も、対応づけられた前記文同士の間で一致する場合に限り、前記双方の文にある文法項目として認定する、請求項4に記載の英作文添削結果分析システム。 The grammar item search unit has not only the grammar item but also the grammar in both sentences only when the one or more elements corresponding to the grammar item match between the associated sentences. The English composition correction result analysis system according to claim 4, which is certified as an item.
  6.  文法項目毎の解説を内容とする解説データを、対応する文法項目に関連づけて記憶する解説データ記憶装置と、
     前記文法項目探索部が探索した、一方の文にあって他方の文にない文法項目に対応する解説データを、前記解説データ記憶装置から読み出す解説データ読出部と、をさらに備え、
     前記結果出力部は、前記解説データ読出部が読み出した前記解説データをも出力する、請求項4又は5に記載の英作文添削結果分析システム。
    An explanation data storage device that stores explanation data containing explanations for each grammar item in association with the corresponding grammar items.
    The grammar item search unit further includes an explanation data reading unit that reads from the explanation data storage device the explanation data corresponding to the grammar items in one sentence but not in the other sentence.
    The English composition correction result analysis system according to claim 4 or 5, wherein the result output unit also outputs the commentary data read by the commentary data reading unit.
  7.  単語毎に語彙レベルを、対応する単語と関連付けて記憶する語彙レベル記憶装置にアクセスする語彙レベル記憶装置アクセス部と、
     入力を受け付けられた前記英作文に含まれる、全ての単語又は所定の条件を満たす単語について、単語毎に語彙レベルの判定を、前記語彙レベル記憶装置アクセス部が前記語彙レベル記憶装置にアクセスすることにより得られる記憶内容に基づいて行う語彙レベル判定部と、
     前記語彙レベル判定部によって判定された前記語彙レベルの代表値である語彙レベル代表値を算出する語彙レベル代表値算定部と、
     入力を受け付けられた前記英作文の全体と入力を受け付けられた前記添削文の全体との間の、前記所定基準に基づく距離である全体距離を算出する距離算出部と、
     前記全体距離と前記語彙レベル代表値とに応じた評価を内容とする評価データを、前記全体距離と前記語彙レベル代表値との区域毎に、当該区域に関連付けて記憶する評価データ記憶装置と、
     算出された前記語彙レベル代表値と、算出された前記全体距離とに対応する評価データを、前記評価データ記憶装置から読み出す評価データ読出部と、をさらに備え、
     前記結果出力部は、前記評価データ読出部が読み出した前記評価データをも出力する、請求項1から6のいずれかに記載の英作文添削結果分析システム。
    A vocabulary level storage device access unit that accesses a vocabulary level storage device that stores a vocabulary level for each word in association with the corresponding word.
    The vocabulary level storage device access unit accesses the vocabulary level storage device to determine the vocabulary level for each word for all words or words satisfying predetermined conditions included in the English composition for which input is accepted. The vocabulary level judgment unit based on the memory content obtained by
    A vocabulary level representative value calculation unit that calculates a vocabulary level representative value that is a representative value of the vocabulary level determined by the vocabulary level determination unit, and a vocabulary level representative value calculation unit.
    A distance calculation unit that calculates the total distance, which is the distance based on the predetermined standard, between the entire English composition for which input has been accepted and the entire correction sentence for which input has been accepted.
    An evaluation data storage device that stores evaluation data including evaluations according to the total distance and the vocabulary level representative value in association with the area for each area of the total distance and the vocabulary level representative value.
    Further, an evaluation data reading unit for reading the calculated evaluation data corresponding to the vocabulary level representative value and the calculated total distance from the evaluation data storage device is provided.
    The English composition correction result analysis system according to any one of claims 1 to 6, wherein the result output unit also outputs the evaluation data read by the evaluation data reading unit.
  8.  単語毎に語彙レベルを、対応する単語と関連付けて記憶する語彙レベル記憶装置にアクセスする語彙レベル記憶装置アクセス部と、
     複数の文を含む英作文とその添削文との入力を受け付ける入力受付部と、
     入力を受け付けられた前記英作文に含まれる、全ての単語又は所定の条件を満たす単語について、単語毎に語彙レベルの判定を、前記語彙レベル記憶装置アクセス部が前記語彙レベル記憶装置にアクセスすることにより得られる記憶内容に基づいて行う語彙レベル判定部と、
     前記語彙レベル判定部によって判定された前記語彙レベルの代表値である語彙レベル代表値を算出する語彙レベル代表値算定部と、
     入力を受け付けられた前記英作文の全体と入力を受け付けられた前記添削文の全体との間の、所定基準に基づく距離である全体距離を算出する距離算出部と、
     前記全体距離と前記語彙レベル代表値とに応じた評価を内容とする評価データを、前記全体距離と前記語彙レベル代表値との区域毎に、当該区域に関連付けて記憶する評価データ記憶装置と、
     算出された前記語彙レベル代表値と、算出された前記全体距離とに対応する評価データを、前記評価データ記憶装置から読み出す評価データ読出部と、
     前記評価データ読出部が読み出した前記評価データを出力する結果出力部と、を備える、英作文添削結果分析システム。
    A vocabulary level storage device access unit that accesses a vocabulary level storage device that stores a vocabulary level for each word in association with the corresponding word.
    An input reception unit that accepts input of English composition including multiple sentences and its correction sentences,
    The vocabulary level storage device access unit accesses the vocabulary level storage device to determine the vocabulary level for each word for all words or words satisfying predetermined conditions included in the English composition for which input is accepted. The vocabulary level judgment unit based on the memory content obtained by
    A vocabulary level representative value calculation unit that calculates a vocabulary level representative value that is a representative value of the vocabulary level determined by the vocabulary level determination unit, and a vocabulary level representative value calculation unit.
    A distance calculation unit that calculates the total distance, which is the distance based on a predetermined standard, between the entire English composition for which input has been accepted and the entire correction sentence for which input has been accepted.
    An evaluation data storage device that stores evaluation data including evaluations according to the total distance and the vocabulary level representative value in association with the area for each area of the total distance and the vocabulary level representative value.
    An evaluation data reading unit that reads out evaluation data corresponding to the calculated vocabulary level representative value and the calculated total distance from the evaluation data storage device.
    An English composition correction result analysis system including a result output unit that outputs the evaluation data read by the evaluation data reading unit.
PCT/JP2021/045231 2020-12-13 2021-12-09 English composition correction result analysis system WO2022124355A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2020206387A JP7084003B1 (en) 2020-12-13 2020-12-13 English composition correction result analysis system
JP2020-206387 2020-12-13

Publications (1)

Publication Number Publication Date
WO2022124355A1 true WO2022124355A1 (en) 2022-06-16

Family

ID=81973370

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2021/045231 WO2022124355A1 (en) 2020-12-13 2021-12-09 English composition correction result analysis system

Country Status (2)

Country Link
JP (1) JP7084003B1 (en)
WO (1) WO2022124355A1 (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2005345987A (en) * 2004-06-07 2005-12-15 Hitachi Electronics Service Co Ltd Week point learning system and computer software
WO2006134759A1 (en) * 2005-06-15 2006-12-21 Waseda University Sentence evaluating device and sentence evaluating program
JP2017167413A (en) * 2016-03-17 2017-09-21 独立行政法人大学入試センター Marking auxiliary system
JP2020140442A (en) * 2019-02-28 2020-09-03 株式会社沖データ Document correction method and document correction device
JP2020177387A (en) * 2019-04-17 2020-10-29 株式会社Nttドコモ Sentence output device

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2007286409A (en) * 2006-04-18 2007-11-01 Hitachi Ltd Marking assist device
JP5695440B2 (en) * 2011-02-22 2015-04-08 株式会社教育測定研究所 Language analysis system and language analysis method
JP5729058B2 (en) * 2011-03-22 2015-06-03 大日本印刷株式会社 Foreign language teaching material creation system
JP7067751B2 (en) * 2017-09-28 2022-05-16 株式会社デジタル・ナレッジ Teaching material authoring system

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2005345987A (en) * 2004-06-07 2005-12-15 Hitachi Electronics Service Co Ltd Week point learning system and computer software
WO2006134759A1 (en) * 2005-06-15 2006-12-21 Waseda University Sentence evaluating device and sentence evaluating program
JP2017167413A (en) * 2016-03-17 2017-09-21 独立行政法人大学入試センター Marking auxiliary system
JP2020140442A (en) * 2019-02-28 2020-09-03 株式会社沖データ Document correction method and document correction device
JP2020177387A (en) * 2019-04-17 2020-10-29 株式会社Nttドコモ Sentence output device

Also Published As

Publication number Publication date
JP2022093215A (en) 2022-06-23
JP7084003B1 (en) 2022-06-14

Similar Documents

Publication Publication Date Title
Garnier et al. The PHaVE List: A pedagogical list of phrasal verbs and their most frequent meaning senses
US8700382B2 (en) Personal text assistant
US6442523B1 (en) Method for the auditory navigation of text
US9384678B2 (en) System and method for generating questions and multiple choice answers to adaptively aid in word comprehension
Rumsey How to find information: a guide for researchers
US7689407B2 (en) Method of learning a second language through the guidance of pictures
US8332208B2 (en) Information processing apparatus, information processing method, and program
US20130149681A1 (en) System and method for automatically generating document specific vocabulary questions
KR102050266B1 (en) Server, method and computer program for providing english study service
JP2005157524A (en) Question response system, and method for processing question response
US9311297B2 (en) Method and system for outputting information
US20210042300A1 (en) Mapping data resources to requested teaching objectives
JP4018673B2 (en) Multiple choice language test question automatic creation program
Sahu et al. Evaluating performance of different grammar checking tools
Wu et al. Supporting collocation learning with a digital library
JP4827163B2 (en) Test question distribution system
Huang et al. Using collocation clusters to detect and correct English L2 learners’ collocation errors
JP2008090117A (en) Learning system, similar question retrieval method, and program
WO2022124355A1 (en) English composition correction result analysis system
JP5877775B2 (en) Content management apparatus, content management system, content management method, program, and storage medium
Bussmann et al. MathSciNet: A comparative analysis of American Mathematical Society and EBSCO platforms
CN106776533B (en) Method and system for analyzing a piece of text
JP2021096395A (en) Grammar learning system, server device, data retrieval method, and retrieval program
Kao et al. Design and development of the sentence-based collocation recommender with error detection for academic writing
US20220382753A1 (en) Narrowing synonym dictionary results using document attributes

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21903454

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21903454

Country of ref document: EP

Kind code of ref document: A1