JP2010224323A

JP2010224323A - Thesis content evaluation device and program

Info

Publication number: JP2010224323A
Application number: JP2009072947A
Authority: JP
Inventors: Akira Nagata; 亮永田
Original assignee: JAPAN INST FOR EDUCATIONAL MEASUREMENT Inc; JAPAN INSTITUTE FOR EDUCATIONAL MEASUREMENT Inc
Current assignee: JAPAN INST FOR EDUCATIONAL MEASUREMENT Inc; JAPAN INSTITUTE FOR EDUCATIONAL MEASUREMENT Inc
Priority date: 2009-03-24
Filing date: 2009-03-24
Publication date: 2010-10-07
Anticipated expiration: 2029-03-24
Also published as: JP5208827B2

Abstract

<P>PROBLEM TO BE SOLVED: To provide a thesis content evaluation device dispensing with data regulation for giving marks even when changing a problem sentence. <P>SOLUTION: This thesis content evaluation device 10 includes a mutual information volume database 24 stored with a mutual information volume, a reverse appearance frequency database 25 stored with a reverse appearance frequency, a score calculating means 27 for calculating a score expressing a relevancy between each word in the thesis and the problem sentence, by substitution with data extracted from the respective databases 24, 25 in a preliminarily stored mathematical expression, a score table preparing means 28 for preparing a score table, based on a score found in each combination of the words, a content evaluation database 30 stored with a plurality of kinds of reference score tables different respectively in patterns of distribution state, and personal content evaluations corresponding to the reference score tables, and a content evaluation determining means 31 for extracting the reference score table most akin to the score table prepared by the score table preparing means 28, and for determining the content evaluation corresponding to the reference score table as the content evaluation of the thesis. <P>COPYRIGHT: (C)2011,JPO&INPIT

Description

本発明は、論文内容評価装置及び論文内容評価プログラムに係り、更に詳しくは、論文のテーマとなる出題文に基づいて、当該出題文と同一の言語で学習者が作成した論文の内容を自動評価することのできる論文内容評価装置及び論文内容評価プログラムに関する。 The present invention relates to a paper content evaluation apparatus and a paper content evaluation program, and more specifically, automatically evaluates the content of a paper created by a learner in the same language as the question text based on the question text as the theme of the paper. The present invention relates to a paper content evaluation apparatus and a paper content evaluation program.

あるテーマに基づいて学習者が作成した論文の内容を自動的に採点する文章評価採点装置が提案されている（特許文献１参照）。この文章評価採点装置は、次の手順により、出題文と採点対象の小論文との近似度が求められ、当該近似度に基づき小論文の評価採点が行われる。すなわち、予め、新聞記事などの多くの文書を集めて、単語−文書行列が作成され、当該行列を特異値分解することにより、単語と文書の関係を表す行列が求められる。そして、当該行列を用いることで、出題文を構成する単語のベクトルと小論文を構成する単語のベクトルとがそれぞれ文書ベクトルに変換される。その後、コサイン尺度と呼ばれる手法により、両文書ベクトルの近似度が求められて得点対象とされる。この近似度は、各文書ベクトルが近い程、高得点となる。 A sentence evaluation scoring device that automatically scores the content of a paper created by a learner based on a certain theme has been proposed (see Patent Document 1). In this sentence evaluation scoring device, the degree of approximation between the question sentence and the essay to be graded is obtained by the following procedure, and the essay is evaluated based on the degree of approximation. That is, a number of documents such as newspaper articles are collected in advance to create a word-document matrix, and a singular value decomposition is performed on the matrix to obtain a matrix representing the relationship between words and documents. Then, by using the matrix, the word vector constituting the question sentence and the word vector constituting the essay are each converted into a document vector. Thereafter, the degree of approximation of both document vectors is obtained by a technique called a cosine scale, and is scored. The degree of approximation becomes higher as each document vector is closer.

特開２００４−１５１７５７号公報JP 2004-151757 A

しかしながら、前記文章評価採点装置にあっては、当該技術に関する評価を示した論文によると、１００〜２００文字程度の長い問題文でないと上手く採点できないという問題がある。また、前記文章評価採点装置では、小論文の採点結果が、人的な採点結果と必ずしも一致しないという問題もある。すなわち、前記文章評価採点装置では、採点者の観点が全く入らずに、出題文と採点対象の小論文がどの程度近似しているかのみでの採点であり、採点者の感覚に近い採点を行うことができない。例えば、学習者が作成した小論文中の各単語について、出題文中の各単語に対する関連性が高い単語、中程度の単語、低い単語という具合に区別したときに、前記関連性の高い単語及び低い単語が多い一方で前記中程度の単語が少ないようなケースでは、内容の統一感がなく、論述の流れが悪いとして、採点者にマイナスイメージを多く与える可能性がある。ところが、このケースに対し、逆に、前記関連性の高い単語及び低い単語が少ない一方で前記中程度の単語が多いような論文は、前記文章評価採点装置で先のケースの近似度と同一値と算出されても、採点者による採点で、先のケースよりも良い点数と判断されることがある。従って、論文の自動評価を行うためのロジックとして、採点者の主観的な評価基準も何らかの形で反映しないと、採点者の採点結果とずれが生じ得ることになり、採点現場での利用が敬遠される虞がある。ここで、前記特許文献１の文書評価採点装置に採点者の観点を入れるには、基準となる小論文の収集と採点が出題文の種類毎に必要になることから、基礎データの構築の手間が煩雑になり、バリエーションの異なる出題文を簡単に変えることが困難になる。 However, the sentence evaluation scoring device has a problem that according to a paper showing an evaluation related to the technology, the sentence cannot be scored well unless it is a long problem sentence of about 100 to 200 characters. The sentence evaluation scoring apparatus also has a problem that the scoring result of the essay does not necessarily match the human scoring result. In other words, in the sentence evaluation scoring system, scoring is performed based on the degree of approximation between the question text and the essay to be scored, without any scorer's viewpoint, and scoring close to that of the scorer. I can't. For example, when each word in an essay created by a learner is distinguished as a word having a high degree of relevance to each word in the question sentence, a medium word, a low word, etc., the word having a high relevance and a word having a low relevance However, in the case where the number of intermediate words is small, there is a possibility of giving a negative image to the scorer because the content is not unified and the flow of discussion is poor. However, in contrast to this case, a paper with a small number of highly related words and low words but a large number of medium words has the same degree of approximation as the previous case in the sentence evaluation scoring device. May be determined as a better score than the previous case by scoring. Therefore, if the subjective evaluation criteria of the grader is not reflected in some way as the logic for automatic evaluation of the paper, it may be misaligned with the grader's scoring result. There is a risk of being. Here, in order to put the grader's point of view into the document evaluation scoring apparatus of Patent Document 1, collection of a short essay as a reference and scoring are required for each type of question, so it takes time to construct basic data. It becomes complicated and it becomes difficult to easily change question sentences with different variations.

本発明は、このような課題に着目して案出されたものであり、その目的は、出題文を変えたときに採点用のデータ調整を新たに行う必要がなく、出題文を低コストで自由に設定できるとともに、論文評価者の人的な内容評価に近い評価結果を得ることができる論文内容評価装置及び論文内容評価プログラムを提供することにある。 The present invention has been devised by paying attention to such problems, and the purpose thereof is not to newly perform scoring data adjustment when changing the question sentence, and the question sentence can be reduced at low cost. An object of the present invention is to provide a paper content evaluation apparatus and a paper content evaluation program that can be set freely and can obtain an evaluation result close to the human content evaluation of a paper evaluator.

（１）前記目的を達成するため、本発明は、論文のテーマとなる出題文に基づいて、当該出題文と同一の言語で学習者が作成した論文の内容を自動評価する論文内容評価装置において、
一対の単語が同一文書に登場する可能性を表す指標となる相互情報量が単語の組み合わせ毎に記憶された相互情報量データベースと、
多くの文書に出現する一般的な単語ほど数値が低くなる逆出現頻度が単語毎に記憶された逆出現頻度データベースと、
前記相互情報量データベース及び前記逆出現頻度データベースから抽出したデータを予め記憶された数式に代入することで前記論文中の一単語と前記出題文との関連度を表すスコアを求めるスコア算出手段と、
前記スコア算出手段で前記論文中の単語毎にそれぞれ求められたスコアから、当該スコアの分布状況を表すスコア表を作成するスコア表作成手段と、
前記分布状況のパターンがそれぞれ異なる複数種の基準スコア表と当該各基準スコア表に対応した人的な内容評価とが記憶された内容評価データベースと、
前記スコア表作成手段で作成された前記スコア表に近い前記内容評価データベースの基準スコア表を抽出し、当該基準スコア表に対応する内容評価を前記論文の内容評価とする内容評価決定手段とを備える、という構成を採っている。 (1) In order to achieve the above object, the present invention provides a paper content evaluation apparatus that automatically evaluates the content of a paper created by a learner in the same language as the question text based on the question text as the theme of the paper. ,
A mutual information database in which a mutual information amount serving as an index indicating the possibility of a pair of words appearing in the same document is stored for each combination of words;
A reverse appearance frequency database in which the reverse appearance frequency is stored for each word, the numerical value of which is lower for general words appearing in many documents;
Score calculating means for obtaining a score representing the degree of association between one word in the paper and the question sentence by substituting data extracted from the mutual information database and the reverse appearance frequency database into a mathematical formula stored in advance;
From the score obtained for each word in the paper by the score calculation means, a score table creation means for creating a score table representing the distribution status of the score;
A content evaluation database in which a plurality of types of reference score tables having different patterns of distribution status and human content evaluations corresponding to the respective reference score tables are stored,
A content evaluation determination unit that extracts a reference score table of the content evaluation database close to the score table created by the score table creation unit, and uses a content evaluation corresponding to the reference score table as a content evaluation of the paper , Is adopted.

（２）また、前記論文中の各単語の出現数をカウントする機能を含む論文カウント手段を備え、前記スコア算出手段では、前記相互情報量と、前記逆出現頻度と、前記単語の出現数と、前記出題文を構成する単語の種類数とをパラメータとしたスコア演算式が予め記憶され、当該スコア演算式に、前記出題文中の一単語と前記論文中の一単語からなる一対の対象単語について前記相互情報量データベースから抽出された前記相互情報量と、前記一対の対象単語のうち前記論文中の一単語について前記逆出現頻度データベースから抽出された前記逆出現頻度と、前記一対の対象単語のうち前記論文中の一単語について前記論文カウント手段でカウントされた前記出現数と、予めカウントされた前記出題文中の単語の種類数とが代入されることで、前記スコアが算出される、という構成を採っている。
ことが好ましい。 (2) Further, it includes a paper counting means including a function of counting the number of occurrences of each word in the paper, and the score calculation means includes the mutual information amount, the reverse appearance frequency, the number of occurrences of the word, , A score calculation formula using the number of types of words constituting the question sentence as a parameter is stored in advance, and the score calculation formula includes a pair of target words including one word in the question sentence and one word in the paper. The mutual information extracted from the mutual information database, the reverse appearance frequency extracted from the reverse appearance frequency database for one word in the paper among the pair of target words, and the pair of target words Of these, the number of occurrences counted by the paper counting means for one word in the paper and the number of types of words in the question sentence counted in advance are substituted. It adopts a configuration in which the score is calculated.
It is preferable.

（３）更に、前記出題文を構成する単語の種類数をカウントする出題文カウント手段を備え、当該出題文カウント手段でカウントされた単語の種類数を前記スコア演算式に代入する、という構成も併せて採用することができる。 (3) Further, there is a configuration in which a question sentence counting means for counting the number of types of words constituting the question sentence is provided, and the number of word types counted by the question sentence counting means is substituted into the score calculation formula. It can be employed together.

（４）ここで、種々の文書に頻出する単語で一部の処理時に除外対象とするストップワードが記憶されたストップワードデータベースを備え、前記出題文カウント手段は、前記ストップワードデータベースに記憶されたストップワードを除いて前記出題文中の単語の種類数をカウントする、という構成を採るとよい。 (4) Here, it is provided with a stop word database in which stop words that are frequently excluded from various documents and stored as a part of processing are stored, and the question counting unit is stored in the stop word database. A configuration may be adopted in which the number of types of words in the question sentence is counted except for stop words.

（５）また、種々の文書に頻出する単語で一部の処理時に除外対象とするストップワードが記憶されたストップワードデータベースを備え、前記スコア算出手段では、前記ストップワードデータベースに基づき、前記ストップワードを除く前記論文中の一単語と前記ストップワードを除く前記出題文中の一単語との全て組み合わせについて、前記スコアを算出する、という構成を採ることが好ましい。 (5) In addition, a stop word database in which stop words to be excluded at the time of partial processing of words that frequently appear in various documents are stored, and the score calculation means uses the stop word database based on the stop word database. It is preferable that the score is calculated for all combinations of one word in the paper except for and one word in the question sentence except for the stop word.

（６）更に、前記内容評価決定手段では、内容評価対象となる論文の前記スコア表と、前記内容評価データベースに記憶された前記各基準スコア表とがそれぞれ対比され、予め設定されたパターン認識手法により、前記スコア表作成手段で作成されたスコア表に近いパターンの前記基準スコア表が選択される、という構成を採用している。 (6) Furthermore, in the content evaluation determination means, the score table of the article to be content evaluation is compared with each of the reference score tables stored in the content evaluation database, respectively, and a preset pattern recognition method is set. Thus, a configuration is adopted in which the reference score table having a pattern close to the score table created by the score table creating means is selected.

（７）また、本発明は、論文のテーマとなる出題文に基づいて、当該出題文と同一の言語で学習者が作成した論文の内容を自動評価する処理をコンピュータに実行させるためのプログラムにおいて、
一対の単語が同一文書に登場する可能性を表す指標となる相互情報量が単語の組み合わせ毎に記憶された相互情報量データベースと、
多くの文書に出現する一般的な単語ほど数値が低くなる逆出現頻度が単語毎に記憶された逆出現頻度データベースと、
前記相互情報量データベース及び前記逆出現頻度データベースから抽出したデータを予め記憶された数式に代入することで前記論文中の一単語と前記出題文との関連度を表すスコアを求めるスコア算出手段と、
前記スコア算出手段で前記論文の単語毎にそれぞれ求められたスコアから、当該スコアの分布状況を表すスコア表を作成するスコア表作成手段と、
前記分布状況のパターンがそれぞれ異なる複数種の基準スコア表と当該各基準スコア表に対応した論文の内容評価とが記憶された内容評価データベースと、
前記スコア表作成手段で作成された前記スコア表に近い前記内容評価データベースの基準スコア表を抽出し、当該基準スコア表に対応する内容評価を前記論文の内容評価とする内容評価決定手段として前記コンピュータを機能させる、という構成を採っている。 (7) Further, the present invention provides a program for causing a computer to execute a process of automatically evaluating the content of a paper created by a learner in the same language as the question text based on the question text as a theme of the paper. ,
A mutual information database in which a mutual information amount serving as an index indicating the possibility of a pair of words appearing in the same document is stored for each combination of words;
A reverse appearance frequency database in which the reverse appearance frequency is stored for each word, the numerical value of which is lower for general words appearing in many documents;
Score calculating means for obtaining a score representing the degree of association between one word in the paper and the question sentence by substituting data extracted from the mutual information database and the reverse appearance frequency database into a mathematical formula stored in advance;
From the score obtained for each word of the paper by the score calculation means, a score table creation means for creating a score table representing the distribution status of the score;
A content evaluation database in which a plurality of types of reference score tables having different patterns of distribution status and content evaluations of articles corresponding to the respective reference score tables are stored;
Extracting a reference score table of the content evaluation database close to the score table created by the score table creating means, and using the computer as a content evaluation deciding means for making a content evaluation corresponding to the reference score table a content evaluation of the paper The structure is made to function.

本発明によれば、出題文を構成する各単語と学習者が作成した論文の各単語との組み合わせから一つの値を総合的に求めて内容評価するのではなく、各スコアの分布状況を表すスコア表を作成した上で、人的な内容評価に対応して予め記憶されている基準スコア表の中から、論文のスコア表と分布状況のパターンの近い基準スコア表を選択し、当該基準スコア表に紐付いている内容評価が論文の内容評価とされる。つまり、スコア表上で表されたスコアのパターン自体を出題文の内容に関係なく対比することにより、論文の内容評価が決定される。このため、出題文が変わっても、内容評価データベースのデータを変更する必要がなく、そのままのデータで学習者の論文の内容評価がなされることになる。従って、出題文毎に基準スコア表を作成する必要がなく、出題者側での出題文の変更を低コストで簡単に行うことができる。また、既存の論文類から前記スコア表作成手段での手順と同様の手順で基準スコア表を作成するときに、同時に、採点者が前記既存の論文類の内容を人的に評価し、この人的な内容評価を基準スコア表に対応させることにより、論文評価者の人的な採点評価に近い評価結果を得ることができる。また、本発明によれば、出題文が少ない単語数で構成されていても正確な採点が可能になる。 According to the present invention, instead of comprehensively obtaining a single value from the combination of each word constituting the question and each word of the paper created by the learner, the contents are evaluated, and the distribution status of each score is represented. After creating the score table, select a reference score table that is similar to the score table of the paper and the distribution pattern from the reference score tables stored in advance corresponding to human content evaluation, and the reference score The content evaluation linked to the table is the content evaluation of the paper. That is, the content evaluation of a paper is determined by comparing the score pattern itself shown on the score table regardless of the content of the question. For this reason, even if the question sentence changes, it is not necessary to change the data in the content evaluation database, and the content evaluation of the learner's paper is performed with the data as it is. Therefore, it is not necessary to create a reference score table for each question sentence, and the question sentence can be easily changed at a low cost on the questioner side. In addition, when creating a standard score table from existing papers in the same procedure as the score table creating means, at the same time, the scorer personally evaluates the contents of the existing papers, and this person By associating general content evaluation with the reference score table, it is possible to obtain an evaluation result close to the human scoring evaluation of the paper evaluator. Further, according to the present invention, accurate scoring is possible even if the question sentence is composed of a small number of words.

本実施形態に係る論文内容評価装置の概略システム構成図。1 is a schematic system configuration diagram of a paper content evaluation apparatus according to the present embodiment. スコア表のイメージ図。The image figure of a score table. 内容評価データベースのイメージ図。Image of content evaluation database.

以下、本発明の実施形態について図面を参照しながら説明する。 Hereinafter, embodiments of the present invention will be described with reference to the drawings.

図１には、本実施形態に係る論文内容評価装置の概略システム構成図が示されている。この図において、論文内容評価装置１０は、学生等の学習者の英語論文作成教育用として利用され、学習者が作成した英語論文の内容を自動的に評価する装置である。この論文内容評価装置１０は、ＣＰＵ等の演算処理装置、メモリやハードディスク等の記憶装置等からなるコンピュータによって構成され、当該コンピュータを後述する各手段として機能させるためのプログラムがインストールされている。すなわち、本実施形態に係る論文内容評価装置１０は、学習者が保有するパーソナルコンピュータ等の端末１１に対し、インターネット回線等のデータ回線で接続されるサーバとして機能し、学習者が作成した論文を受信すると、当該論文の内容を自動評価した上で当該評価結果を学習者に送信するようになっている。 FIG. 1 shows a schematic system configuration diagram of a paper content evaluation apparatus according to this embodiment. In this figure, a paper content evaluation apparatus 10 is an apparatus that is used for education of English paper preparation by learners such as students and automatically evaluates the content of English papers created by the learner. This paper content evaluation apparatus 10 is constituted by a computer including an arithmetic processing unit such as a CPU and a storage device such as a memory and a hard disk, and a program for causing the computer to function as each unit to be described later is installed. That is, the paper content evaluation apparatus 10 according to the present embodiment functions as a server connected to a terminal 11 such as a personal computer owned by a learner via a data line such as the Internet line, and a paper created by the learner is created. When received, the content of the paper is automatically evaluated and the evaluation result is transmitted to the learner.

具体的に、前記論文内容評価装置１０は、論文のテーマとなる英語の出題文が複数種記憶された出題文データベース１３と、端末１１からの要求に応じて出題文データベース１３から出題文を抽出する出題文抽出手段１４と、当該出題文を端末１１に送信する出題文送信手段１６と、学習者が端末１１で受信した出題文に基づいて英語で論文を作成した後で端末１１に入力された当該論文を受信する受信手段１７と、種々の文書に頻出する単語で一部の処理時に除外対象とするストップワードが記憶されたストップワードデータベース１９と、前記出題文を構成する単語の種類数をカウントする出題文カウント手段２１と、論文中の単語数や文数などをカウントする論文カウント手段２２と、一対の単語が同一文書に登場する可能性を表す指標となる相互情報量が単語の組み合わせ毎に記憶された相互情報量データベース２４と、文書に出現する単語の頻度の逆数を表す逆出現頻度が単語毎に記憶された逆出現頻度データベース２５と、論文中の各一単語と出題文の集合との関連度を表すスコアを求めるスコア算出手段２７と、スコア算出手段２７で論文中の各単語についてそれぞれ求めたスコアから、当該スコアの分布状況を表すスコア表を作成するスコア表作成手段２８と、スコアの分布状況のパターンがそれぞれ異なる多種の基準スコア表と論文の内容評価とが対応して記憶された内容評価データベース３０と、スコア表作成手段２８で作成されたスコア表から、内容評価データベース３０に基づいて学習者が作成した論文の内容評価を決定する内容評価決定手段３１と、内容評価決定手段３１で決定された内容評価をデータ回線経由で学習者の端末１１に送信する評価結果送信手段３３とを備えて構成されている。 Specifically, the paper content evaluation apparatus 10 extracts a question sentence database 13 storing a plurality of types of English question sentences as a theme of the paper and the question sentence database 13 in response to a request from the terminal 11. The question sentence extraction means 14 to be transmitted, the question sentence transmission means 16 to transmit the question sentence to the terminal 11, and the learner creates a paper in English based on the question sentences received by the terminal 11 and is input to the terminal 11. The receiving means 17 for receiving the paper, the stop word database 19 in which stop words to be excluded at the time of processing of some words frequently appearing in various documents, and the number of types of words constituting the question sentence The question sentence counting means 21 for counting the number of words, the paper counting means 22 for counting the number of words in the paper, the number of sentences, etc. A mutual information database 24 in which the mutual information amount is stored for each word combination, a reverse appearance frequency database 25 in which a reverse appearance frequency representing the reciprocal of the frequency of words appearing in the document is stored for each word, and a paper Score calculating means 27 for obtaining a score representing the degree of association between each word in the sentence and the set of questions, and a score representing the distribution status of the score from the scores obtained for each word in the paper by the score calculating means 27 A score table creating means 28 for creating a table, a content evaluation database 30 in which various reference score tables having different patterns of score distribution patterns and content evaluations of articles are stored correspondingly, and a score table creating means 28 Content evaluation determination means 31 for determining content evaluation of a paper created by a learner based on the content evaluation database 30 from the generated score table, It is constructed and a evaluation result transmitting means 33 for transmitting to the terminal 11 of the learner content rating determined by the evaluation determination unit 31 via the data line.

前記出題文データベース１３は、出題者によって決定された論文のテーマを英文で記載した出題文が多数記憶されている。当該出題文は、特に限定されるものではないが、学習者の学年毎やレベル毎の他に、テーマのジャンル別等、種々のカテゴリー毎に設定されている。なお、前記出題文データベース１３は、新たな出題文を逐次追加できる機能を備えていても良い。 The question sentence database 13 stores a large number of question sentences that describe in English the theme of a paper determined by the questioner. The question text is not particularly limited, but is set for each category such as the genre of the theme in addition to the learner's grade and level. Note that the question sentence database 13 may have a function of sequentially adding new question sentences.

前記出題文抽出手段１４は、学習者の端末１１から要求があると、当該要求に応じたカテゴリーの中から、ランダムに、若しくは、一定規則に従って、出題文を抽出し、当該出題文を出題文送信手段１６から端末１１に送信するようになっている。 When a request is received from the learner's terminal 11, the question sentence extraction means 14 extracts a question sentence from a category corresponding to the request at random or according to a certain rule, and extracts the question sentence as a question sentence. Transmission is performed from the transmission means 16 to the terminal 11.

前記ストップワードデータベース１９には、大部分の文書に頻出する単語、例えば、「ａ」、「ａｎ」等の冠詞、「ｉｎ」、「ｏｎ」等の前置詞、「ｉｔ」、「ｔｈａｔ」等の代名詞、「ｗｈｉｃｈ」、「ｗｈｏ」等の疑問詞、「ｉｓ」、「ａｒｅ」のｂｅ動詞、「ｃａｎ」、「ｍａｙ」等の助動詞等の頻出単語が予め記憶されている。 The stop word database 19 includes words that appear frequently in most documents, for example, articles such as “a” and “an”, prepositions such as “in” and “on”, “it”, “that”, and the like. Frequent words such as pronouns, question words such as “who” and “who”, be verbs such as “is” and “are”, and auxiliary verbs such as “can” and “may” are stored in advance.

前記出題文カウント手段２１では、出題文中のスペースを手掛かりにして当該出題文を構成する単語を特定した後で、ストップワードデータベース１９に記憶されたストップワードを除く出題文中の単語の種類数をカウントするようになっている。つまり、出題文中に同じ単語が繰り返して出現する場合でも、当該単語を「１」としてカウントする。 The question counting means 21 counts the number of types of words in the question sentence excluding the stop words stored in the stop word database 19 after identifying the words constituting the question sentence using the space in the question sentence as a clue. It is supposed to be. That is, even when the same word repeatedly appears in the question, the word is counted as “1”.

前記論文カウント手段２２は、論文中のスペースを手掛かりにして当該論文を構成する単語を特定する機能と、当該単語の特定後に論文中の単語の総数をカウントする機能と、前記単語の特定後に論文中の各単語の出現数をカウントする機能と、前記単語の特定後に論文中の単語の種類数をカウントする機能と、論文を構成する文数をカウントする機能とを有している。ここで、論文中の各単語の出現数は、論文中に同一の単語が何回出現するか、各単語別に計数される。また、論文中の単語の種類数は、出題文カウント手段２１でのカウントと同様に、論文に重複して登場する同一の単語をまとめて「１種類」と計数し、論文中に重複して登場しない単語を一単語につき「１種類」と計数することで求められる。更に、論文を構成する文数は、論文中に存在するピリオド、感嘆符、疑問符の何れかを検出し、当該ピリオド等で区切られた一領域を「一文」として論文全体を計数することで求められる。 The paper counting means 22 has a function of identifying words constituting the paper with a clue to a space in the paper, a function of counting the total number of words in the paper after the word is specified, and a paper after the word is specified. A function for counting the number of appearances of each word in the document, a function for counting the number of types of words in the paper after the word is specified, and a function for counting the number of sentences constituting the paper. Here, the number of appearances of each word in the paper is counted for each word, how many times the same word appears in the paper. In addition, the number of types of words in the paper is counted as “one type” by summing up the same words appearing in the paper in duplicate, similarly to the counting by the question counting means 21. It is obtained by counting words that do not appear as “one type” per word. In addition, the number of sentences composing a paper is obtained by detecting any period, exclamation mark, or question mark in the paper, and counting the entire paper as a single sentence divided by the period. It is done.

前記相互情報量データベース２４には、学習者が使用する可能性のある相当数の英単語中の一対の単語同士の前記相互情報量が単語の組み合わせ毎に記憶されている。当該相互情報量は、新聞や雑誌等、様々なジャンルの多くの文書に記載された英単語から、以下のように、コンピュータ等の図示しない演算装置を使って予め求められる。 In the mutual information database 24, the mutual information of a pair of words in a considerable number of English words that the learner may use is stored for each combination of words. The mutual information amount is obtained in advance from English words described in many documents of various genres such as newspapers and magazines using an arithmetic device (not shown) such as a computer as follows.

すなわち、ある単語「ｗ_ｉ」と他の単語「ｗ_ｊ」との相互情報量ＭＩは、次式（１）によって求められる。なお、以下の各数式において、対数は、自然対数に限らず、種々の底を採ることもできる。

ここで、ｐ（ｗ_ｉ，ｗ_ｊ）、ｐ（ｗ_ｉ）、ｐ（ｗ_ｊ）は、次式（２）、（３）で表される。

上式（２）、（３）において、「Ｎ」は、相互情報量を求めるために使用した全文書数である。また、「ｎ_ｉｊ」は、前記全文書の中で、単語「ｗ_ｉ」と単語「ｗ_ｊ」の組み合わせが存在する文書数である。つまり、この「ｎ_ｉｊ」は、ある一文書内に単語「ｗ_ｉ」と単語「ｗ_ｊ」の組み合わせが複数存在していても、一文書で「１」とカウントされる。
例えば、全文書数Ｎが１０００であるとして、その中で、単語「ｗ_ｉ」と単語「ｗ_ｊ」が共に登場している文書数ｎ_ｉｊが５０であれば、
ｐ（ｗ_ｉ，ｗ_ｊ）＝５０／１０００＝０．０５
となる。
なお、「Ｎ」を全文書中にそれぞれ存在する文の総数とし、「ｎ_ｉｊ」を、全文書中で、単語「ｗ_ｉ」と単語「ｗ_ｊ」の組み合わせが存在する文の数としてもよい。
また、ｐ（ｗ_ｉ）、ｐ（ｗ_ｊ）を求める上式（３）の分母は、全文書中に存在する単語の総数を意味し、「ｆ_ｉ」は、全文書中での単語「ｗ_ｉ」の出現数であり、「ｆ_ｊ」は、全文書中での単語「ｗ_ｊ」の出現数である。 That is, the mutual information MI between a certain word “w _i ” and another word “w _j ” is obtained by the following equation (1). In each of the following mathematical formulas, the logarithm is not limited to a natural logarithm, and various bases can be taken.

Here, p (w _i , w _j ), p (w _i ), and p (w _j ) are expressed by the following equations (2) and (3).

In the above formulas (2) and (3), “N” is the total number of documents used for obtaining the mutual information amount. Also, “n _ij ” is the number of documents in which a combination of the word “w _i ” and the word “w _j ” exists in all the documents. That is, “n _ij ” is counted as “1” in one document even if a plurality of combinations of the word “w _i ” and the word “w _j ” exist in one document.
For example, assuming that the total number N of documents is 1000, if the number of documents n _ij in which the word “w _i ” and the word “w _j ” appear together is 50,
p (w _i , w _j ) = 50/1000 = 0.05
It becomes.
Note that “N” is the total number of sentences existing in all the documents, and “n _ij ” is the number of sentences in which the combination of the word “w _i ” and the word “w _j ” exists in all the documents. Good.
Further, the denominator of the above equation (3) for obtaining p (w _i ) and p (w _j ) means the total number of words existing in all documents, and “f _i ” represents the word “ “w _i ” is the number of appearances, and “f _j ” is the number of occurrences of the word “w _j ” in the entire document.

前記逆出現頻度データベース２５に記憶された逆出現頻度は、前述の相互情報量と同様に、新聞や雑誌等、様々なジャンルの多くの文書に記載された英単語から、コンピュータ等の図示しない演算装置を使って予め求められる。具体的に、単語「ｗ_ｉ」の逆出現頻度ｉｄｆ（ｗ_ｉ）は、次式（４）により求められる。

ここで、「Ｎ」は、相互情報量を求める場合と同様、逆出現頻度を求めるために使用した全文書数であり、「ｎ_ｉ」は、当該全文書の中で、単語「ｗ_ｉ」が含まれている文書数である。つまり、この「ｎ_ｉ」は、ある一文書内に単語「ｗ_ｉ」がいくつ存在していても、一文書で「１」とカウントされる。
上式（４）により、逆出現頻度ｉｄｆ（ｗ_ｉ）は、多くの文書に出現する一般的な単語ほど低い数値となる。 The reverse appearance frequency stored in the reverse appearance frequency database 25 is an arithmetic operation (not shown) of a computer or the like from English words described in many documents of various genres such as newspapers and magazines, similar to the mutual information amount described above. Pre-determined using the device. Specifically, the reverse appearance frequency idf (w _i ) of the word “w _i ” is obtained by the following equation (4).

Here, “N” is the total number of documents used for obtaining the reverse appearance frequency, as in the case of obtaining the mutual information amount, and “n _i ” is the word “w _i ” in all the documents. Is the number of documents that contain That is, this “n _i ” is counted as “1” in one document, regardless of how many words “w _i ” exist in one document.
According to the above equation (4), the reverse appearance frequency idf (w _i ) becomes a lower numerical value for general words appearing in many documents.

前記スコア算出手段２７では、学習者が作成した論文中の単語のうち、ストップワードデータベース１９に記憶された単語（ストップワード）を除く単語全てについて、ストップワードを除く出題文中の各単語との関係に基づき求めたスコアがそれぞれ算出される。すなわち、スコア算出手段２７では、前記相互情報量と、前記逆出現頻度と、論文中の単語の出現数と、出題文中の単語の種類数とをパラメータとした次式（５）のスコア演算式が予め記憶され、当該スコア演算式に前記パラメータの各値を代入することで、スコアが算出される。

上式において、「ＭＩＤＦ（ｗ_ｉ，Ｔ）」は、ストップワードを除く出題文中の単語の集合「Ｔ」と、論文中の単語「ｗ_ｉ」とにおけるスコアである。また、「ＭＩ（ｗ_ｉ，ｗ_ｊ）」は、出題文の集合「Ｔ」中の単語「ｗ_ｊ」と論文中の単語「ｗ_ｉ」における前述の相互情報量であり、「｜Ｔ｜」は、ストップワードを除く出題文中の単語の種類数である。更に、「ｉｄｆ（ｗ_ｉ）」は、単語「ｗ_ｉ」における前述の逆出現頻度であり、「ｅ_ｉ」は、単語「ｗ_ｉ」の論文中の出現数である。 In the score calculation means 27, among the words in the paper created by the learner, all the words except the words (stop words) stored in the stop word database 19 are related to the words in the question sentence excluding the stop words. Each score obtained based on the above is calculated. That is, in the score calculation means 27, the score calculation formula of the following equation (5) using the mutual information amount, the reverse appearance frequency, the number of words appearing in the paper, and the number of types of words in the question sentence as parameters. Are stored in advance, and the score is calculated by substituting each value of the parameter into the score calculation formula.

In the above formula, “MIDF (w _i , T)” is a score in a set of words “T” in the question sentence excluding stop words and a word “w _i ” in the paper. In addition, "MI _(w i, w _j)" is the mutual information of the above-mentioned in the word "w _i" of the set and in the article "T" word "w _j" in the question sentence, "| T | "Is the number of types of words in the question sentence excluding stop words. Furthermore, “idf (w _i )” is the above-described reverse appearance frequency of the word “w _i ”, and “e _i ” is the number of occurrences of the word “w _i ” in the paper.

つまり、ここでは、対象となる出題文中の各単語「ｗ_ｊ」と論文中の各単語「ｗ_ｉ」について、それぞれの組み合わせにおける相互情報量「ＭＩ（ｗ_ｉ，ｗ_ｊ）」が相互情報量データベース２４から抽出されるとともに、論文中の単語「ｗ_ｉ」の逆出現頻度「ｉｄｆ（ｗ_ｉ）」が逆出現頻度データベース２５から抽出される。そして、各相互情報量「ＭＩ（ｗ_ｉ，ｗ_ｊ）」及び逆出現頻度「ｉｄｆ（ｗ_ｉ）」とともに、出題文カウント手段２１でカウントされた出題文中の単語の種類数「｜Ｔ｜」と、論文カウント手段２２でカウントされた単語「ｗ_ｉ」の論文中の出現数「ｅ_ｉ」とが、前記スコア演算式に代入され、論文中の単語「ｗ_ｉ」毎に、スコア「ＭＩＤＦ（ｗ_ｉ，Ｔ）」が求められる。上式（５）によれば、このスコアは、論文中の単語「ｗ_ｉ」それぞれについての出題文中の各単語「ｗ_ｊ」との組み合わせ全てにおける各相互情報量が加算されることになり、論文中の単語「ｗ_ｉ」毎に一つの値が求められる。 That is, herein, for each word in the article and the word "w _j" in question statement of interest "w _i", mutual information in each of the combinations "MI (w i, _w _j)" mutual information amount In addition to being extracted from the database 24, the reverse appearance frequency “idf (w _i )” of the word “w _i ” in the paper is extracted from the reverse appearance frequency database 25. Then, together with each mutual information amount “MI (w _i , w _j )” and reverse appearance frequency “idf (w _i )”, the number of types of words in the question sentence counted by the question sentence counting means 21 “| T |” When, the number of occurrences in the article of paper counting means 22 counted word "w _i" and "e _i", but the substituted into the score computing equation, for every word in the article "w _i" score "MIDF (W _i , T) ”is required. According to the above equation (5), this score is obtained by adding each mutual information amount in all combinations with each word “w _j ” in the question sentence for each word “w _i ” in the paper, One value is obtained for each word “w _i ” in the paper.

前記スコア表作成手段２８では、次のようにしてスコア表が作成される。 The score table creating means 28 creates a score table as follows.

このスコア表は、図２に示されるように、横軸がスコア、縦軸が、カウントされた数（単語数）となっている。横軸のスコアは、ゼロから所定の数値幅毎に区分され、各区分の数値幅ｄは、一定且つ均等に予め設定される。ここで、当該数値幅ｄは、所定の出題文に対して予め作成された多くの論文に基づき、次式（６）〜（８）で求められる。ここで、式（６）中「Ｌ」は、論文に拘らず予め設定された一定の区分数（例えば１０区分）であり、「Ｖ」は、各論文の単語毎に上式（５）で予め求められた各論文中の単語毎のスコア（ＭＩＤＦ）の値「ｖ」を要素とする集合を意味する。

In this score table, as shown in FIG. 2, the horizontal axis indicates the score, and the vertical axis indicates the counted number (number of words). The score on the horizontal axis is divided every predetermined numerical width from zero, and the numerical width d of each section is set in advance uniformly and uniformly. Here, the numerical value width d is obtained by the following equations (6) to (8) based on many papers prepared in advance for a predetermined question sentence. Here, “L” in equation (6) is a predetermined number of categories (for example, 10 categories) set in advance regardless of the paper, and “V” is expressed by the above equation (5) for each word of each paper. It means a set whose elements are score (MIDF) value “v” for each word in each paper obtained in advance.

すなわち、式（７）は、求めたスコアの総数を「｜Ｖ｜」としたときに、各論文から求めた単語毎の全てのスコア「ｖ」の平均を求める数式であり、式（８）は、各論文から求めた単語毎の全てのスコア「ｖ」の不偏分散を求める数式である。 That is, Expression (7) is an expression for calculating the average of all scores “v” for each word obtained from each paper, where “| V |” is the total number of obtained scores. Is a mathematical formula for obtaining the unbiased variance of all scores “v” for each word obtained from each paper.

以上のようにして予め定められた前記数値幅ｄのスコア表に、論文中の単語毎にスコア算出手段２７でそれぞれ求めたスコアがスコア表に反映される。つまり、求めた多数のスコアを該当する区分毎にカウントして棒グラフのようにスコア表にグラフ化し、当該グラフパターンが学習者の作成論文のスコア表として記憶される。更に、論文カウント手段２２でカウントされた各数、すなわち、論文を構成する単語の総数、論文中の単語の種類数、及び論文を構成する文数もスコア表に反映される（図２中右端部分）。なお、これら各数については、そのうち何れか又は全てをスコア表に反映させなくても良い。 As described above, the score obtained by the score calculation means 27 for each word in the paper is reflected in the score table in the score table of the numerical value d determined in advance as described above. That is, a large number of obtained scores are counted for each corresponding category and plotted in a score table like a bar graph, and the graph pattern is stored as a score table for a paper created by the learner. Further, each number counted by the paper counting means 22, that is, the total number of words constituting the paper, the number of types of words in the paper, and the number of sentences constituting the paper are also reflected in the score table (the right end in FIG. 2). portion). Note that any or all of these numbers may not be reflected in the score table.

前記内容評価データベース３０は、１種類又は複数種類の出題文に対して多数の学習者が作成した多数の論文それぞれについて、前述のようにして作成した基準スコア表と、各基準スコア表に対応する論文をそれぞれ採点者が人的に評価した結果の評価値とが対応されて記憶されている。つまり、内容評価データベース３０には、スコアの分布状況のパターンが種々異なる基準スコア表が、人的に求められた評価値に対応して記憶されている。この評価値としては、特に限定されるものではないが、１から５の５段階評価や、０点から１００点までの点数評価や、合格、不合格のみの２段階評価等、種々の態様を採用することができる。例えば、評価値を５段階としたときの内容評価データベース３０での記憶データのイメージは、図３に示されている。 The content evaluation database 30 corresponds to the reference score table created as described above and each reference score table for each of a number of papers created by a large number of learners for one or more types of questions. An evaluation value obtained as a result of personal evaluation of each paper by a grader is stored in correspondence. In other words, the content evaluation database 30 stores reference score tables having different patterns of score distribution states corresponding to the evaluation values obtained by humans. Although this evaluation value is not particularly limited, various aspects such as a five-step evaluation from 1 to 5, a score evaluation from 0 to 100 points, a two-step evaluation only for pass and fail, etc. Can be adopted. For example, FIG. 3 shows an image of stored data in the content evaluation database 30 when there are five evaluation values.

前記内容評価決定手段３１では、スコア表作成手段２８で作成された内容評価対象となる論文のスコア表と、内容評価データベース３０に記憶された全ての基準スコア表とがそれぞれ対比され、内容評価対象となる論文のスコア表に対して、スコアの分布状況のパターンが近い内容評価データベース３０の基準スコア表が抽出され、抽出された基準スコア表に対応して記憶された前記評価値が学習者の論文の内容評価とされる。ここで、各スコア表のパターンの近似判断は、サポートベクターマシン（ＳＶＭ）、ナイーブベイズ分類器、コサイン尺度、Ｋ近傍法等、予め設定された公知のパターン認識手法を用いて行われる。なお、ここでは、評価対象のスコア表に対し、スコアの分布状況のパターンが最も近い基準スコア表を抽出する他、評価対象のスコア表に対し、ある程度の範囲内で近い基準スコア表を複数抽出し、それぞれ対応する評価値の中で最も多く存在する評価値を選択し、前記評価値を学習者の論文の内容評価とすることもできる。また、パターン認識手法によって区分けされた各評価値の境界に対して近い代表点を評価値毎に設定し、評価対象となるスコア表が最も近くなる各代表点の基準スコア表を特定し、当該基準スコア表に対応する評価値を論文の内容評価とすることもできる。以上により、論文のスコア表とパターンが近い基準スコア表を抽出できる限り、前記パターン認識手法を用いた種々の手法を採用することができる。 The content evaluation determination unit 31 compares the score table of the paper that is the content evaluation target created by the score table creation unit 28 with all the reference score tables stored in the content evaluation database 30, and the content evaluation target The reference score table of the content evaluation database 30 having a similar score distribution pattern to the score table of the paper to be extracted is extracted, and the evaluation value stored corresponding to the extracted reference score table is the learner's score. The content of the paper is evaluated. Here, the approximate determination of the pattern of each score table is performed using a known pattern recognition method set in advance, such as a support vector machine (SVM), a naive Bayes classifier, a cosine scale, a K neighborhood method, or the like. Here, in addition to extracting the reference score table with the closest score distribution pattern to the score table to be evaluated, multiple reference score tables that are close within a certain range to the score table to be evaluated are extracted. It is also possible to select the evaluation value that exists most frequently among the corresponding evaluation values, and use the evaluation value as the content evaluation of the learner's paper. Also, for each evaluation value, a representative point that is closest to the boundary of each evaluation value divided by the pattern recognition method is set, and a reference score table for each representative point that is closest to the score table to be evaluated is specified, The evaluation value corresponding to the reference score table can be used as the content evaluation of the paper. As described above, various methods using the pattern recognition method can be adopted as long as a reference score table having a pattern similar to the score table of the paper can be extracted.

前記評価結果送信手段３３では、内容評価決定手段３１で決定された評価値を内容評価結果として学習者の端末１１に送信する。 The evaluation result transmission means 33 transmits the evaluation value determined by the content evaluation determination means 31 to the learner's terminal 11 as the content evaluation result.

次に、本実施形態の論文内容評価装置１０における論文の内容評価の流れを説明する。 Next, the flow of the content evaluation of the paper in the paper content evaluation apparatus 10 of the present embodiment will be described.

先ず、学習者が自己の端末１１から、論文内容評価装置１０に出題を要求すると、出題文抽出手段１４で出題文データベース１３から適当な出題文を抽出し、当該出題文が出題文送信手段１６から学習者の端末１１に送信される。学習者は、論文のテーマとなる英語の出題文（例えば、「ＭｙＦａｍｉｌｙ」）について、自己の思考に基づき英作文をしながら、英語の論文の作成を行う。当該論文作成終了後、学習者は、当該論文を自己の端末１１に入力し、論文内容評価装置１０に送信する。そして、受信手段１４で受信された学習者の論文は、論文カウント手段２２で、論文中の総単語数、論文中の各単語の出現数、論文中の単語の種類数、及び論文を構成する文数がカウントされる。これと前後して、出題文については、その構成単語のうち、ストップワードデータベース１９で記憶されたストップワードを除く単語の種類数が出題文カウント手段２１でカウントされる。なお、ここでカウントされる出題文中の単語の種類数は、出題文と対応して予め出題文データベース１３に記憶させておいてもよい。 First, when a learner requests a question to the paper content evaluation apparatus 10 from his / her terminal 11, an appropriate question sentence is extracted from the question sentence database 13 by the question sentence extraction means 14, and the question sentence transmission means 16 To the learner's terminal 11. The learner creates an English paper while writing English based on his / her thoughts on an English question (for example, “My Family”) as the theme of the paper. After completing the paper creation, the learner inputs the paper into his / her terminal 11 and transmits it to the paper content evaluation apparatus 10. The learner's paper received by the receiving means 14 constitutes the total number of words in the paper, the number of appearances of each word in the paper, the number of types of words in the paper, and the paper by the paper counting means 22. The number of sentences is counted. Before and after this, the question sentence counting means 21 counts the number of types of words excluding the stop words stored in the stop word database 19 among the constituent words. The number of types of words in the question sentence counted here may be stored in advance in the question sentence database 13 corresponding to the question sentence.

更に、ストップワードを除く論文中の各一単語について、スコア算出手段２７でスコアがそれぞれ求められる。この際、出題文中の単語と論文中の単語との相互情報量が相互情報量データベース２４から抽出され、論文中の単語の逆出現頻度が逆出現頻度データベース２５から抽出される。そして、出題文中の単語と論文中の単語について、これら一対の単語の相互情報量と、当該一対の単語のうちの論文中の単語の逆出現頻度と、論文カウント手段２２でカウントされた論文中の単語の出現数と、出題文カウント手段２１でカウントされた出題文の単語の種類数とが、前述のスコア演算式に代入され、スコアが求められる。 Further, the score calculation means 27 obtains a score for each word in the paper excluding the stop word. At this time, the mutual information amount of the word in the question sentence and the word in the paper is extracted from the mutual information amount database 24, and the reverse appearance frequency of the word in the paper is extracted from the reverse appearance frequency database 25. Then, for the words in the question and the words in the paper, the mutual information amount of the pair of words, the reverse appearance frequency of the word in the paper of the pair of words, and the paper counted in the paper counting means 22 And the number of types of words of the question sentence counted by the question sentence counting means 21 are substituted into the above-described score calculation formula to obtain a score.

例えば、出題文が「Ｔｈｅｈｏｌｉｄａｙｏｆｔｈｅｆａｍｉｌｙ」であった場合、学習者が作成した論文中の一単語、例えば「Ｍｏｔｈｅｒ」についてスコアを求める場合、次のようになる。先ず、ストップワードデータベース１９により、出題文中で、単語「ｈｏｌｉｄａｙ」及び「ｆａｍｉｌｙ」以外は、ストップワードとして特定される。そして、相互情報量データベース２４から抽出された出題文中のストップワード以外の各単語の各相互情報量、すなわち、単語「ｈｏｌｉｄａｙ」と単語「Ｍｏｔｈｅｒ」の相互情報量と、単語「Ｆａｍｉｌｙ」と単語「Ｍｏｔｈｅｒ」の相互情報量とが加算された上で、逆出現頻度データベース２５から抽出された論文中の単語「Ｍｏｔｈｅｒ」の逆出現頻度と、論文カウント手段２２でカウントされた論文中の単語「Ｍｏｔｈｅｒ」の出現数と、出題文カウント手段２１でカウントされた出題文を構成する単語の種類数「２」とが前述のスコア演算式に代入され、論文中の単語「Ｍｏｔｈｅｒ」について、スコアが求められる。同様に、論文中の他の単語それぞれについて、出題文中のストップワードを除く単語の集合（単語「ｈｏｌｉｄａｙ」と単語「Ｍｏｔｈｅｒ」）におけるスコアがそれぞれ求められる。なお、出題文中及び論文中における各単語の大文字、小文字の情報については、双方同一の文字情報として前述の処理を行うことが好ましい。 For example, when the question sentence is “The holiday of the family”, the score is obtained for one word in the paper created by the learner, for example, “Mother”. First, the stop word database 19 identifies the words other than the words “holiday” and “family” as stop words in the question. Then, the mutual information amount of each word other than the stop word in the question sentence extracted from the mutual information amount database 24, that is, the mutual information amount of the word “holiday” and the word “Mother”, the word “Family” and the word “ After adding the mutual information of “Mother”, the reverse appearance frequency of the word “Mother” in the paper extracted from the reverse appearance frequency database 25 and the word “Mother” in the paper counted by the paper counting means 22 are added. "And the number of types of words constituting the question counted" 2 "counted by the question counting means 21 are substituted into the above-mentioned score calculation formula, and a score is obtained for the word" Mother "in the paper. It is done. Similarly, for each of the other words in the paper, the scores in the set of words (word “holiday” and word “Mother”) excluding the stop word in the question are obtained. In addition, it is preferable to perform the above-described processing on the uppercase and lowercase information of each word in the question sentence and in the paper as the same character information.

このように、評価対象となる論文からは、多数のスコアが求められることになり、当該求められた各スコアは、スコア表作成手段２８により、予め設定された一定の数値幅ｄのスコア表に投票される。また、論文カウント手段２２でカウントされた論文中の総単語数、当該論文の単語の種類数、及び論文を構成する文数もスコア表に反映される。 In this way, a large number of scores are obtained from the paper to be evaluated, and each of the obtained scores is stored in a score table having a predetermined numerical value range d by the score table creating means 28. Voted. The total number of words in the paper counted by the paper counting means 22, the number of types of words in the paper, and the number of sentences constituting the paper are also reflected in the score table.

以上のようにして作成されたスコア表は、内容評価データベース３０に予め記憶された基準スコア表と対比され、内容評価決定手段３１でのパターン認識処理により、スコアの分布状況のパターンが近い内容評価データベース３０の基準スコア表が決定され、当該決定された基準スコア表に対応して記憶された評価値が、学習者が作成した論文の評価値とされる。例えば、学習者が作成した論文のスコア表が、図２に示されるようなパターンとなった場合、前記パターン認識処理により、内容評価データベース３０中に記憶されたスコア表の中で最も近いパターンである図３中最下部の基準スコア表が抽出され、当該基準スコア表に対応して記憶されている評価値「４」が論文の評価値となる。そして、当該評価値は、評価結果として評価結果送信手段３３から学習者の端末１１に送信される。 The score table created as described above is compared with the reference score table stored in advance in the content evaluation database 30, and the content evaluation pattern close to the content evaluation pattern by the content evaluation determination unit 31 is performed. The reference score table of the database 30 is determined, and the evaluation value stored corresponding to the determined reference score table is used as the evaluation value of the paper created by the learner. For example, when the score table of a paper created by the learner has a pattern as shown in FIG. 2, the pattern recognition process uses the pattern closest to the score table stored in the content evaluation database 30. The lowest reference score table in FIG. 3 is extracted, and the evaluation value “4” stored in correspondence with the reference score table is the evaluation value of the paper. And the said evaluation value is transmitted to the learner's terminal 11 from the evaluation result transmission means 33 as an evaluation result.

本発明者らの実験研究によれば、前記実施形態に係る論文内容評価装置１０で多数の論文の内容を評価したところ、同一の論文を多くの採点者が採点した結果に近い結果が得られたことが実証された。 According to the experimental research by the present inventors, when the content of a large number of papers is evaluated by the paper content evaluation apparatus 10 according to the embodiment, a result close to the result of scoring the same paper by many scorers is obtained. It was proved that

従って、このような実施形態によれば、採点者が人的に評価する場合に近い論文の内容評価を自動的に得ることができる。また、学習者が作成した論文からスコア表が作成され、予め用意した基準スコア表のパターンとの対比のみで論文が評価されることになり、基準スコア表は、出題文を統一して作成しなくても良い。このため、出題文に対する数多くの模範論文をセットで用意して予め採点した上で記憶させる等、出題文に対応したデータを用意する必要がなく、装置内の出題文の追加変更削除等を簡単な作業で行うことができる。 Therefore, according to such an embodiment, it is possible to automatically obtain a content evaluation of a paper that is close to a case where a grader evaluates manually. In addition, a score table is created from the paper created by the learner, and the paper is evaluated only in comparison with the pattern of the standard score table prepared in advance. The standard score table is created by unifying the question sentences. It is not necessary. For this reason, it is not necessary to prepare data corresponding to the question text, such as preparing a large number of model papers for the question text as a set, pre-scoring it, and storing it, and it is easy to add, change, delete, etc. It can be done by simple work.

なお、本発明の論文内容評価装置１０は、英語の論文の内容評価に限らず、同様の手段及び手順で、他の言語の出題文に対する同一言語の論文の内容評価にも適用することができる。 The paper content evaluation apparatus 10 of the present invention can be applied not only to content evaluation of English papers but also to content evaluation of papers in the same language with respect to questions in other languages by similar means and procedures. .

また、前記実施形態にあっては、論文内容評価装置１０をサーバ形式としているが、本発明はこれに限らず、コンピュータを、前記相互情報量データベースと、前記逆出現頻度データベースと、前記スコア算出手段と、前記スコア表作成手段と、前記内容評価データベースと、前記内容評価決定手段として機能させるプログラムを用意し、当該プログラムを論文の学習者となる利用者の端末にインストールして、当該端末自体を論文内容評価装置として機能させることもできる。 Moreover, in the said embodiment, although the paper content evaluation apparatus 10 is made into the server format, this invention is not limited to this, The computer is the said mutual information amount database, the said reverse appearance frequency database, and the said score calculation. Means, the score table creation means, the content evaluation database, and a program that functions as the content evaluation determination means, and installs the program on a terminal of a user who becomes a thesis learner, the terminal itself Can be made to function as a paper content evaluation apparatus.

その他、本発明における各種構成は図示構成例に限定されるものではなく、実質的に同様の作用を奏する限りにおいて、種々の変更が可能である。 In addition, various configurations in the present invention are not limited to the illustrated configuration examples, and various modifications are possible as long as substantially the same operation is achieved.

１０論文内容評価装置
１９ストップワードデータベース
２１出題文カウント手段
２２論文カウント手段
２４相互情報量データベース
２５内容評価データベース
２７スコア算出手段
２８スコア表作成手段
３０内容評価データベース
３１内容評価決定手段 DESCRIPTION OF SYMBOLS 10 Paper content evaluation apparatus 19 Stop word database 21 Question sentence count means 22 Paper count means 24 Mutual information database 25 Content evaluation database 27 Score calculation means 28 Score table preparation means 30 Content evaluation database 31 Content evaluation decision means

Claims

In a paper content evaluation device that automatically evaluates the content of a paper created by the learner in the same language as the question text, based on the question text that is the theme of the paper,
A mutual information database in which a mutual information amount serving as an index indicating the possibility of a pair of words appearing in the same document is stored for each combination of words;
A reverse appearance frequency database in which the reverse appearance frequency is stored for each word, the numerical value of which is lower for general words appearing in many documents;
Score calculating means for obtaining a score representing the degree of association between one word in the paper and the question sentence by substituting data extracted from the mutual information database and the reverse appearance frequency database into a mathematical formula stored in advance;
From the score obtained for each word in the paper by the score calculation means, a score table creation means for creating a score table representing the distribution status of the score;
A content evaluation database in which a plurality of types of reference score tables having different patterns of distribution status and human content evaluations corresponding to the respective reference score tables are stored,
A content evaluation determination unit that extracts a reference score table of the content evaluation database close to the score table created by the score table creation unit, and uses a content evaluation corresponding to the reference score table as a content evaluation of the paper An article content evaluation device characterized by

A paper counting means including a function of counting the number of occurrences of each word in the paper,
The score calculation means stores in advance a score calculation formula using the mutual information amount, the reverse appearance frequency, the number of appearances of the word, and the number of types of words constituting the question sentence as parameters. In the arithmetic expression, the mutual information extracted from the mutual information database for a pair of target words consisting of one word in the question sentence and one word in the paper, and among the pair of target words in the paper The reverse appearance frequency extracted from the reverse appearance frequency database for one word, the number of appearances counted by the paper counting means for one word in the paper among the pair of target words, and the pre-counted 2. The paper content evaluation apparatus according to claim 1, wherein the score is calculated by substituting the number of types of words in the question sentence.

2. A question sentence counting unit that counts the number of types of words constituting the question sentence, wherein the number of word types counted by the question sentence counting unit is substituted into the score calculation expression. 2. Article content evaluation device described in 2.

It has a stop word database in which stop words that are frequently excluded from various documents are stored.
4. The article content evaluation apparatus according to claim 3, wherein the question sentence counting means counts the number of types of words in the question sentence excluding stop words stored in the stop word database.

It has a stop word database in which stop words that are frequently excluded from various documents are stored.
The score calculating means calculates the score for all combinations of one word in the paper excluding the stop word and one word in the question sentence excluding the stop word based on the stop word database. The paper content evaluation apparatus according to claim 1, 2, or 3.

In the content evaluation determination means, the score table of the article to be content evaluation is compared with each of the reference score tables stored in the content evaluation database, and the score table is determined by a preset pattern recognition method. 6. The article content evaluation apparatus according to claim 1, wherein the reference score table having a pattern close to the score table created by the creating means is selected.

In a program for causing a computer to execute a process of automatically evaluating the content of a paper created by a learner in the same language as the question text, based on the question text as the theme of the paper,
A mutual information database in which a mutual information amount serving as an index indicating the possibility of a pair of words appearing in the same document is stored for each combination of words;
A reverse appearance frequency database in which the reverse appearance frequency is stored for each word, the numerical value of which is lower for general words appearing in many documents;
Score calculating means for obtaining a score representing the degree of association between one word in the paper and the question sentence by substituting data extracted from the mutual information database and the reverse appearance frequency database into a mathematical formula stored in advance;
From the score obtained for each word of the paper by the score calculation means, a score table creation means for creating a score table representing the distribution status of the score;
A content evaluation database in which a plurality of types of reference score tables having different patterns of distribution status and content evaluations of articles corresponding to the respective reference score tables are stored;
Extracting a reference score table of the content evaluation database close to the score table created by the score table creating means, and using the computer as a content evaluation deciding means for making a content evaluation corresponding to the reference score table a content evaluation of the paper Thesis content evaluation program characterized by making