JP2012088494A

JP2012088494A - Examination question evaluation system and method for controlling the same, program and recording medium

Info

Publication number: JP2012088494A
Application number: JP2010234597A
Authority: JP
Inventors: Osamu Matsuo; 理松尾
Original assignee: Kinki University
Current assignee: Kinki University
Priority date: 2010-10-19
Filing date: 2010-10-19
Publication date: 2012-05-10

Abstract

PROBLEM TO BE SOLVED: To efficiently and objectively evaluate an examination question. SOLUTION: A CPU 2 calculates a modified Ebel index, which expresses the difficulty and necessity of an examination question, based on the difficulty and necessity levels input when the question is prepared. The CPU 2 calculates the percentage of correct answers and the discrimination index of the examination question based on data including the answers of a plurality of examinees to the examination question. On the basis of the calculated modified Ebel index, percentage of correct answers, and discrimination index, the CPU 2 calculates a question evaluation index that expresses the quality of the examination question using a predetermined function, and displays the calculated question evaluation index on a display 6.

Description

The present invention relates to a test question evaluation apparatus for evaluating test questions used in objective tests that are administered repeatedly to a plurality of examinees, and to a control method, a program, and a recording medium therefor.

Objective tests that are administered repeatedly to a plurality of examinees, such as the national examinations for physicians, pharmacists, and nurses and the driver's license examination (tests in which the correct answer is already fixed when the question is set, so that the same score results regardless of who grades it), should encourage sound self-study by examinees and improve their problem-solving ability. Such tests must also exclude excessively difficult and eccentric questions and emphasize the measurement of interpretation and problem-solving ability, so that the correct answer cannot be reached by rote memorization alone. Furthermore, they must be able to distinguish examinees who have studied well and understood the material from those who have not. In objective tests it is therefore important to evaluate each test question and to review and update it appropriately and efficiently (see Patent Documents 1 and 2). Non-Patent Document 1 describes evaluating the test questions of an objective test by the modified Ebel method (see Non-Patent Document 2) or on the basis of the minimum pass level (MPL).

Patent Document 1: JP-A-2005-221895.
Patent Document 2: JP-A-2003-85296.

Non-Patent Document 1: Yutaka Hirano et al., "How to create high-quality test questions and how to evaluate them", Kinki University Medical Journal, Vol. 33, No. 4, pp. 323-328, 2008.
Non-Patent Document 2: Daizo Ushiba et al., "Theory and practice of pass levels for examinations", Medical Education, Vol. 16, pp. 175-182, 1985.

However, evaluating test questions by the modified Ebel method alone or by the minimum pass level alone lacks objectivity and general applicability in some respects, and is therefore insufficient as a method of evaluating test questions.

An object of the present invention is to solve the above problems and to provide a test question evaluation apparatus, a control method therefor, a program, and a recording medium that can evaluate test questions more efficiently and objectively than the prior art.

A test question evaluation apparatus according to a first invention comprises:
input means for inputting predetermined data entered by the writer of a test question;
control means for executing a question writing support process that supports the creation of test questions and a test question evaluation process that evaluates the test questions; and
storage means for storing in advance a modified Ebel index calculation table showing the correspondence between the difficulty and necessity of a test question and a modified Ebel index representing the necessity and difficulty of the test question,
wherein the control means:
in the question writing support process, calculates the modified Ebel index representing the difficulty and necessity of the test question by referring to the modified Ebel index calculation table, based on the difficulty and necessity of the test question input from the input means; and
in the test question evaluation process, calculates, based on data including the answers of a plurality of examinees to the test question, the correct answer rate of the test question and a discrimination index representing the ability of the test question to distinguish higher-scoring examinees among the plurality of examinees from lower-scoring examinees whose results are below those of the higher-scoring examinees, and, based on the calculated modified Ebel index, correct answer rate, and discrimination index, calculates and outputs a question evaluation index representing the quality of the test question using a predetermined function having the modified Ebel index, the correct answer rate, and the discrimination index as parameters.
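The two statistics derived from the answer data can be illustrated by the following minimal sketch. The text does not give their formulas at this point, so the split into upper and lower groups (top and bottom 27% by total score) and the definition of the discrimination index as the difference between the two groups' correct answer rates are assumptions, as are all function and parameter names. The predetermined function that combines these values with the modified Ebel index is not specified here and is therefore not sketched.

```python
from typing import Sequence


def correct_answer_rate(is_correct: Sequence[bool]) -> float:
    """Fraction of examinees who answered the question correctly."""
    if not is_correct:
        raise ValueError("no answers given")
    return sum(is_correct) / len(is_correct)


def discrimination_index(
    is_correct: Sequence[bool],
    total_scores: Sequence[float],
    group_fraction: float = 0.27,  # assumed group size, not taken from the text
) -> float:
    """Upper-group correct rate minus lower-group correct rate (assumed definition)."""
    n = len(is_correct)
    if n == 0 or n != len(total_scores):
        raise ValueError("inputs must be non-empty and of equal length")
    k = min(n, max(1, round(n * group_fraction)))
    # Rank examinees by overall score, best first.
    order = sorted(range(n), key=lambda i: total_scores[i], reverse=True)
    upper, lower = order[:k], order[-k:]
    p_upper = sum(is_correct[i] for i in upper) / k
    p_lower = sum(is_correct[i] for i in lower) / k
    return p_upper - p_lower


# Ten hypothetical examinees: whether each answered this question correctly,
# and each examinee's total score on the whole test.
answers = [True, True, True, False, True, False, False, True, False, False]
scores = [95, 90, 85, 80, 70, 60, 55, 50, 40, 30]
rate = correct_answer_rate(answers)
disc = discrimination_index(answers, scores)
```

A positive value of `disc` means that higher-scoring examinees answered the question correctly more often than lower-scoring ones.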

In the test question evaluation apparatus, the control means further, in the test question evaluation process, classifies the test question into one of a plurality of question evaluation categories based on the calculated question evaluation index, and outputs the resulting question evaluation category.

Also, in the test question evaluation apparatus, the control means further, in the test question evaluation process, stores in the storage means test question data including the content of any test question classified into a question evaluation category whose question evaluation index indicates better quality than a predetermined threshold category among the plurality of question evaluation categories.

Further, in the test question evaluation apparatus, the test question is a multiple choice question, and the control means further, in the test question evaluation process, calculates the selection rate of each option of the multiple choice question based on the data including the answers, and, after classifying the test question into one of the plurality of question evaluation categories, lowers the assigned question evaluation category by one level when any option has a selection rate at or below a predetermined selection rate.
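The per-option selection rate and the one-level downgrade can be sketched as follows. The category labels, their ordering, and the 5% threshold are hypothetical values chosen for illustration; the text only states that a predetermined selection rate is used.

```python
from collections import Counter
from typing import Dict, Sequence

# Hypothetical category labels, ordered from best to worst quality.
CATEGORIES = ["A", "B", "C", "D"]


def selection_rates(answers: Sequence[str], options: Sequence[str]) -> Dict[str, float]:
    """Fraction of examinees who chose each option (0.0 for unchosen options)."""
    if not answers:
        raise ValueError("no answers given")
    counts = Counter(answers)
    return {opt: counts[opt] / len(answers) for opt in options}


def adjust_category(category: str, rates: Dict[str, float], min_rate: float = 0.05) -> str:
    """Lower the category by one level if any option is chosen at or below min_rate."""
    idx = CATEGORIES.index(category)
    if any(rate <= min_rate for rate in rates.values()):
        # Already-worst categories stay where they are.
        idx = min(idx + 1, len(CATEGORIES) - 1)
    return CATEGORIES[idx]


# Twenty hypothetical answers to a five-option question; nobody chose "e".
answers = ["a"] * 10 + ["b"] * 5 + ["c"] * 3 + ["d"] * 2
rates = selection_rates(answers, ["a", "b", "c", "d", "e"])
final_category = adjust_category("B", rates)
```

In this example option "e" is never selected, so a question first classified as "B" is moved down to "C".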

Still further, in the test question evaluation apparatus, the test question is a multiple choice question, and the control means further:
in the question writing support process, receives from the input means a difficulty code representing the difficulty of each option of the multiple choice question; and
in the test question evaluation process, outputs the input difficulty codes together with the question evaluation index.

Also, in the test question evaluation apparatus, the modified Ebel index is the expected correct answer rate used in the modified Ebel method, and the modified Ebel index calculation table is a table that is used in the modified Ebel method and shows the correspondence between the difficulty and necessity of a test question and the expected correct answer rate.

A control method according to a second invention is a control method for a test question evaluation apparatus comprising:
input means for inputting predetermined data entered by the writer of a test question;
control means for executing a question writing support process that supports the creation of test questions and a test question evaluation process that evaluates the test questions; and
storage means for storing in advance a modified Ebel index calculation table showing the correspondence between the difficulty and necessity of a test question and a modified Ebel index representing the necessity and difficulty of the test question,
the control method including:
a step in which the control means, in the question writing support process, calculates the modified Ebel index representing the difficulty and necessity of the test question by referring to the modified Ebel index calculation table, based on the difficulty and necessity of the test question input from the input means; and
a step in which the control means, in the test question evaluation process, calculates, based on data including the answers of a plurality of examinees to the test question, the correct answer rate of the test question and a discrimination index representing the ability of the test question to distinguish higher-scoring examinees among the plurality of examinees from lower-scoring examinees whose results are below those of the higher-scoring examinees, and, based on the calculated modified Ebel index, correct answer rate, and discrimination index, calculates and outputs a question evaluation index representing the quality of the test question using a predetermined function having the modified Ebel index, the correct answer rate, and the discrimination index as parameters.

The control method for the test question evaluation apparatus further includes a step in which the control means, in the test question evaluation process, classifies the test question into one of a plurality of question evaluation categories based on the calculated question evaluation index, and outputs the resulting question evaluation category.

Also, the control method for the test question evaluation apparatus further includes a step in which the control means, in the test question evaluation process, stores in the storage means test question data including the content of any test question classified into a question evaluation category whose question evaluation index indicates better quality than a predetermined threshold category among the plurality of question evaluation categories.

Further, in the control method for the test question evaluation apparatus, the test question is a multiple choice question, and the method further includes a step in which the control means, in the test question evaluation process, calculates the selection rate of each option of the multiple choice question based on the data including the answers, and, after classifying the test question into one of the plurality of question evaluation categories, lowers the assigned question evaluation category by one level when any option has a selection rate at or below a predetermined selection rate.

Still further, in the control method for the test question evaluation apparatus, the test question is a multiple choice question, and the method further includes:
a step in which the control means, in the question writing support process, receives from the input means a difficulty code representing the difficulty of each option of the multiple choice question; and
a step in which the control means, in the test question evaluation process, outputs the input difficulty codes together with the question evaluation index.

Also, in the control method for the test question evaluation apparatus, the modified Ebel index is the expected correct answer rate used in the modified Ebel method, and the modified Ebel index calculation table is a table that is used in the modified Ebel method and shows the correspondence between the difficulty and necessity of a test question and the expected correct answer rate.

A program according to a third invention includes the steps of the above control method for the test question evaluation apparatus.

A computer-readable recording medium according to a fourth invention stores the above program.

According to the test question evaluation apparatus, the control method therefor, the program, and the recording medium of the present invention, a question evaluation index representing the quality of a test question is calculated and output using a predetermined function whose parameters are the modified Ebel index, the correct answer rate, and the discrimination index. The modified Ebel index is calculated from the difficulty and necessity of the test question input when the question is written, and the correct answer rate and discrimination index are calculated from data including the answers to the test question. A question evaluation index that reflects both the question writer's own assessment of the test question and the objective figures obtained after the test has been administered can therefore be obtained automatically. As a result, test questions are evaluated automatically, and their quality can be judged more objectively than in the prior art.

FIG. 1 is a block diagram showing the configuration of a test question evaluation apparatus 1 according to an embodiment of the present invention.
FIG. 2 is a table showing an example of the test question classification table 12 of FIG. 1.
FIG. 3 is a table showing an example of the comment table 13 of FIG. 1.
FIG. 4 is a flowchart of the question writing support process executed by the CPU 2 of FIG. 1.
FIG. 5 is a flowchart of the test question evaluation process executed by the CPU 2 of FIG. 1.
FIG. 6 is a display example of the question list window 100 displayed on the display 6 in step S1 of FIG. 4.
FIG. 7 is a display example of the question input window 200 displayed on the display 6 in step S5 of FIG. 4.
FIG. 8 is a display example on the display 6 in step S18 of FIG. 5.

Embodiments of the present invention are described below with reference to the drawings. In the following embodiments, like components are denoted by the same reference numerals.

FIG. 1 is a block diagram showing the configuration of a test question evaluation apparatus 1 according to an embodiment of the present invention, FIG. 2 is a table showing an example of the test question classification table 12 of FIG. 1, and FIG. 3 is a table showing an example of the comment table 13 of FIG. 1. In FIG. 1, the test question evaluation apparatus 1 is a digital computer such as a personal computer, and is used as a terminal device with which a question writer creates and evaluates test questions for a mock examination conforming to the national examination for physicians. In this embodiment, each test question is a multiple choice question (hereinafter, MCQ) in which one option is selected from five options. The test question evaluation apparatus 1 comprises a CPU (Central Processing Unit) 2, a ROM (Read Only Memory) 3, a RAM (Random Access Memory) 4, a hard disk drive 5, a display 6, an operation input unit 7, a LAN (Local Area Network) interface 8, and an optical disk drive 20. The CPU 2 is connected via a bus to the ROM 3, the RAM 4, the hard disk drive 5, the display 6, the operation input unit 7, the LAN interface 8, and the optical disk drive 20. It controls the operation of each of these components and the overall operation of the test question evaluation apparatus 1, and also executes the processing of the various software programs described later (FIGS. 4 and 5).

In FIG. 1, the display 6 is a display device such as a liquid crystal display (LCD) or a CRT (Cathode Ray Tube) display, and serves to display the operating state of the test question evaluation apparatus 1 and as the display device for various GUI (Graphical User Interface) programs. The operation input unit 7 includes, for example, a pointing device such as a mouse and character input means such as a keyboard. Users of the test question evaluation apparatus 1, such as question writers and examinees, use the operation input unit 7 to enter data, commands, and the like. The LAN interface 8 is connected to a LAN 80. It receives signals and data from devices connected to the LAN 80, such as M examinee terminal devices 30-1 to 30-M, and transmits signals and data from the CPU 2 to those devices, performing bidirectional interface processing for LAN communication such as signal conversion and protocol conversion. The optical disk drive 20 reads and outputs data and programs recorded on a computer-readable recording medium 21 such as a CD-ROM or DVD.

In FIG. 1, the ROM 3 stores in advance the various software programs that are required for the operation of the test question evaluation apparatus 1 and are executed by the CPU 2. The hard disk drive 5 is a storage device with a built-in recording medium, and stores in advance a test question evaluation program 10 executed by the CPU 2, a modified Ebel index calculation table 11, a test question classification table 12, and a comment table 13. The test question evaluation program 10 includes a program for the question writing support process, described in detail later with reference to FIG. 4, and a program for the test question evaluation process, described in detail later with reference to FIG. 5. The hard disk drive 5 also stores N items (N is a positive integer) of test question data 14-1 to 14-N, each generated by the question writing support process (hereinafter referred to as test question data 14 when the items need not be distinguished). In addition, the hard disk drive 5 has the following storage areas: a pool target test question data storage area 15 for storing test question data 14 to be pooled (accumulated); a brush-up target test question storage area 16 for storing test question data 14 that can become pool target questions after partial revision (brush-up); and a storage area 17 for brush-up target test questions requiring major revision, for storing test question data 14 that need major revision and can become candidates for re-administration once they have been thoroughly examined and revised during brush-up.

The modified Ebel index calculation table 11 shows the correspondence between the modified Ebel index and the difficulty (easy, moderate, or difficult) and necessity (questionable, important, or essential) of a test question, which the question writer sets when the test question data 14 is generated. The modified Ebel index is an index representing the difficulty and necessity of the test question, and is set so that it becomes larger as the difficulty decreases and as the necessity increases. For example, the expected correct answer rate used in the modified Ebel method (see Non-Patent Document 2) may be used as the modified Ebel index. In that case, the modified Ebel index calculation table 11 is set as follows.
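The lookup performed against the modified Ebel index calculation table 11 can be sketched as below. The values of table 11 are not reproduced in this text, so every number in the sketch is a placeholder chosen only to satisfy the stated rule that the index grows as difficulty falls and necessity rises; the names `EBEL_TABLE` and `modified_ebel_index` are likewise hypothetical.

```python
# Levels named in the text, ordered from lowest to highest.
NECESSITY_LEVELS = ("questionable", "important", "essential")
DIFFICULTY_LEVELS = ("easy", "moderate", "difficult")

# PLACEHOLDER values, not the actual contents of table 11.
# Keys are (necessity, difficulty) pairs.
EBEL_TABLE = {
    ("essential", "easy"): 0.9,
    ("essential", "moderate"): 0.8,
    ("essential", "difficult"): 0.7,
    ("important", "easy"): 0.8,
    ("important", "moderate"): 0.7,
    ("important", "difficult"): 0.6,
    ("questionable", "easy"): 0.7,
    ("questionable", "moderate"): 0.6,
    ("questionable", "difficult"): 0.5,
}


def modified_ebel_index(necessity: str, difficulty: str) -> float:
    """Look up the index for one (necessity, difficulty) pair."""
    try:
        return EBEL_TABLE[(necessity, difficulty)]
    except KeyError:
        raise ValueError(f"unknown levels: {necessity!r}, {difficulty!r}") from None
```

Keeping the table as data separate from the lookup function mirrors the arrangement in which the table is stored in advance and only referred to by the control means.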

As shown in FIG. 2, the test question classification table 12 shows the correspondence between the question evaluation index, which represents the quality of the test question data 14 and is calculated by the test question evaluation process described in detail later, and the question evaluation category of the test question. As shown in FIG. 3, the comment table 13 shows the correspondence between the question evaluation category of the test question data 14 and the comment displayed on the display 6.
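How the two tables might be used together is illustrated by the sketch below. The contents of FIGS. 2 and 3 are not reproduced in this text, so the thresholds, category labels, and comment strings are all hypothetical, as is the assumption that a larger question evaluation index means better quality.

```python
# Hypothetical stand-in for test question classification table 12:
# (lower bound of the question evaluation index, category), best category first.
CLASSIFICATION_TABLE = [
    (0.8, "A"),
    (0.6, "B"),
    (0.4, "C"),
    (float("-inf"), "D"),
]

# Hypothetical stand-in for comment table 13.
COMMENT_TABLE = {
    "A": "Good question; candidate for pooling.",
    "B": "Usable after minor revision.",
    "C": "Needs major revision before reuse.",
    "D": "Not recommended for reuse.",
}


def classify(evaluation_index: float) -> str:
    """Return the first category whose lower bound the index reaches."""
    for lower_bound, category in CLASSIFICATION_TABLE:
        if evaluation_index >= lower_bound:
            return category
    raise ValueError("evaluation index is not a comparable number")


def comment_for(evaluation_index: float) -> str:
    """Comment to display for a question with the given evaluation index."""
    return COMMENT_TABLE[classify(evaluation_index)]
```

For example, with these placeholder thresholds, an index of 0.5 falls into category "C" and yields that category's comment.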

In FIG. 1, the RAM 4 is composed of SRAM (Static RAM), DRAM (Dynamic RAM), SDRAM (Synchronous DRAM), or the like, and is used as the working area of the CPU 2. When a program such as the test question evaluation program 10 is executed, the RAM 4 stores the executable program needed to carry out the corresponding functions, the data needed to execute it, and temporary data generated during execution.

Next, the question writing support process executed by the CPU 2 of FIG. 1 is described with reference to FIG. 4. FIG. 4 is a flowchart of the question writing support process, which is executed by the CPU 2 of FIG. 1 to support the question writer in creating test questions. When the question writer uses the operation input unit 7 of the test question evaluation apparatus 1 to perform a predetermined operation for generating test question data 14-n (n = 1, 2, ..., N), for example clicking a predetermined icon displayed on the display 6 or entering a predetermined command in a console window displayed on the display 6, the CPU 2 responds by executing the program for the question writing support process included in the test question evaluation program 10.

First, in step S1 of FIG. 4, a question list window 100 is displayed on the display 6. FIG. 6 is a display example of the question list window 100 displayed on the display 6 in step S1 of FIG. 4. As shown in FIG. 6, the question list window 100 includes a "New" button 101 for instructing that test question data 14-n be newly generated, a "Modify question" button 102 for instructing that test question data 14-n already generated and stored on the hard disk drive 5 be modified, and a list 103 of the test question data 14-n stored on the hard disk drive 5. In step S2 of FIG. 4, it is determined whether the question writer has clicked the "New" button 101; if YES, the process proceeds to step S5, and if NO, to step S3. In step S3, it is determined whether the question writer has clicked the "Modify question" button 102; if YES, the process proceeds to step S4, and if NO, it returns to step S2. In step S4, the test question data 14-n selected in the list 103 of the question list window 100 is read from the hard disk drive 5. Then, following step S4 or step S2, a question input window 200 is displayed on the display 6 in step S5.
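The branching of steps S2 to S5 can be summarized by the short sketch below. It models button clicks as a list of event dictionaries and the stored test question data as a dictionary keyed by an identifier; this event model and all names in it are assumptions made for illustration and do not describe the actual GUI program.

```python
from typing import Dict, Iterable


def initial_form_content(events: Iterable[dict], stored: Dict[str, dict]) -> dict:
    """Return what the question input window would show on reaching step S5."""
    # S1: the question list window is displayed (display itself is not modeled).
    for event in events:
        if event.get("type") == "new":
            # S2 YES: go straight to S5 with an empty form.
            return {}
        if event.get("type") == "modify":
            # S3 YES, then S4: read the selected question from storage.
            return dict(stored[event["selected"]])
        # S3 NO: return to S2 and wait for the next event.
    raise RuntimeError("event stream ended before step S5 was reached")


stored_questions = {"14-1": {"stem": "Sample stem", "necessity": "essential"}}
new_form = initial_form_content([{"type": "other"}, {"type": "new"}], stored_questions)
edit_form = initial_form_content([{"type": "modify", "selected": "14-1"}], stored_questions)
```

Returning a copy of the stored record keeps the stored data unchanged until the question writer explicitly saves.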

FIG. 7 is a display example of the question input window 200 displayed on the display 6 in step S5 of FIG. 4. In step S5, the question writer writes the question interactively using the question input window 200 displayed on the display 6 and the operation input unit 7. As shown in FIG. 7, the question input window 200 includes a text box 201 for entering the question text, text boxes 202-1 to 202-5 for entering the answer choice of each option, drop-down lists 203-1 to 203-5 for selecting and entering the difficulty code of each option, a drop-down list 204 for selecting and entering the necessity of the test question, a drop-down list 205 for selecting and entering the difficulty of the test question, and a "Save" button 206. The difficulty code (see Non-Patent Document 1) is a code used for the minimum pass index (see Non-Patent Document 1), and indicates whether an option is the correct answer and how difficult it is. The difficulty code is selected from the following: a difficulty code representing the correct answer (a circled 2; see FIG. 7); a difficulty code (2) representing an incorrect option that an examinee with the minimum ability acceptable for passing could understandably fail to distinguish from the correct answer; a difficulty code (1) representing an incorrect option that such an examinee might or might not take to be the correct answer; and a difficulty code (0) representing an option that would be chosen by an examinee who does not appear to have studied the area related to the test question. The necessity of the test question is selected from questionable, important, and essential. The difficulty of the test question is selected from easy, moderate, and difficult. In the example of FIG. 7, the question text "Which of the following is not appropriate as informed consent?" has been entered in the text box 201, and the following answer choices and difficulty codes have been entered for the options.

[Table 1]
______________________________________________
Option  Answer choice                                                     Difficulty code
______________________________________________
a       Explain the purpose of the treatment.                             0
b       Explain the treatment options.                                    0
c       Explain the risks of the treatment.                               0
d       Obtain a signature on the hospital's liability waiver.            Circled 2 (correct answer)
e       Obtain a signature on the consent form once the patient agrees.   0
______________________________________________

Further, in FIG. 7, "essential" has been selected as the necessity level using the drop-down list 204, and "easy" has been selected as the difficulty level using the drop-down list 205. If the question writer clicked the "Modify question" button in step S3 of FIG. 4 and the selected test question data 14-n was read from the hard disk drive 5 in step S4, then in step S5 the question input window 200 displays the question stem, the answer text and difficulty code of each option, and the necessity level and difficulty level contained in the test question data 14-n that was read.

Following step S5 of FIG. 4, in step S6 it is determined whether the question writer has clicked the "Save" button 206 of the question input window 200. If YES, the process proceeds to step S7; if NO, step S6 is repeated. In step S7, the CPU 2 calculates the modified Ebel index by referring to the modified Ebel index calculation table 11 on the basis of the entered necessity level and difficulty level. In step S8, the CPU 2 generates test question data 14-n containing the entered question stem, the entered answer text and difficulty code of each option, the entered necessity level and difficulty level, and the calculated modified Ebel index, stores it in the hard disk drive 5, and ends the question-writing support process.
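Steps S7 and S8 amount to a table lookup followed by building one record. The following is a minimal sketch in Python, not the implementation described in this publication. The numeric values in `EBEL_TABLE`, the name `QuestionData`, the function `build_question_data`, and the string `"correct"` standing in for the circled 2 are all assumptions made for illustration; the actual contents of the modified Ebel index calculation table 11 are not reproduced in this passage.

```python
from dataclasses import dataclass

# ASSUMPTION: placeholder values only. The real modified Ebel index
# calculation table 11 is not shown in this passage.
EBEL_TABLE = {
    ("essential", "easy"): 0.90, ("essential", "medium"): 0.80, ("essential", "hard"): 0.70,
    ("important", "easy"): 0.80, ("important", "medium"): 0.70, ("important", "hard"): 0.60,
    ("questionable", "easy"): 0.70, ("questionable", "medium"): 0.60, ("questionable", "hard"): 0.50,
}


@dataclass
class QuestionData:
    """Hypothetical stand-in for one item of test question data 14-n."""
    stem: str
    options: list              # list of (answer_text, difficulty_code) pairs
    necessity: str
    difficulty: str
    modified_ebel_index: float


def build_question_data(stem, options, necessity, difficulty):
    """Look up the index (step S7), then bundle everything into one record (step S8)."""
    try:
        index = EBEL_TABLE[(necessity, difficulty)]
    except KeyError:
        raise ValueError(f"unknown necessity/difficulty: {necessity!r}, {difficulty!r}") from None
    return QuestionData(stem, list(options), necessity, difficulty, index)


# The example of FIG. 7 and Table 1; "correct" stands in for the circled 2.
record = build_question_data(
    "Which of the following is not appropriate as informed consent?",
    [
        ("Explain the purpose of the treatment.", "0"),
        ("Explain the treatment options.", "0"),
        ("Explain the risks of the treatment.", "0"),
        ("Obtain a signature on a hospital liability waiver.", "correct"),
        ("Obtain a signed consent form once the patient agrees.", "0"),
    ],
    necessity="essential",
    difficulty="easy",
)
print(record.modified_ebel_index)
```

Keeping the lookup keyed on the (necessity, difficulty) pair mirrors the two drop-down lists 204 and 205, so an unrecognized combination fails loudly instead of producing a silent default.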

As described above, the question-writing support process of FIG. 4 generates test question data 14-n containing the question stem, the answer text and difficulty code of each option, the necessity level and difficulty level, and the modified Ebel index, and stores it in the hard disk drive 5. In FIG. 1, each of the examinee terminal devices 30-1 to 30-M receives the generated test question data 14-n from the hard disk drive 5 via the LAN interface 8 and the LAN 80 and displays it on its own display device. Each examinee answers the test question on one of the examinee terminal devices 30-1 to 30-M, and the examinee terminal devices 30-1 to 30-M transmit data including the answer contents to the test question evaluation apparatus 1 via the LAN 80.

Next, the test question evaluation process executed by the CPU 2 of FIG. 1 is described with reference to FIG. 5. FIG. 5 is a flowchart of the test question evaluation process executed by the CPU 2 of FIG. 1. When the question writer uses the operation input unit 7 of the test question evaluation apparatus 1 to perform a predetermined operation for evaluating the test question data 14-n (for example, clicking a predetermined icon displayed on the display 6, or entering a predetermined command in a console window displayed on the display 6), the CPU 2 responds by executing the program for the test question evaluation process contained in the test question evaluation program 10.

First, in step S10, the CPU 2 receives data including the answers to the test question data 14-n from the examinee terminal devices 30-1 to 30-M, scores each answer on the basis of the received data, and calculates the correct answer rate, the number of examinees selecting each option, the selection rate of each option, and the discrimination index.
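The per-option tallies of step S10 can be sketched as follows. This is an illustrative Python fragment under stated assumptions, not the patented implementation: the function name `option_statistics`, the sample answer list, and the choice of option "d" as the correct answer (taken from Table 1) are introduced here for demonstration only.

```python
from collections import Counter


def option_statistics(answers, option_labels):
    """Return {label: (number_of_selectors, selection_rate)} for one question.

    `answers` holds the option label chosen by each examinee.
    """
    counts = Counter(answers)
    total = len(answers)
    stats = {}
    for label in option_labels:
        n = counts.get(label, 0)
        stats[label] = (n, n / total if total else 0.0)
    return stats


# Hypothetical answers from ten examinees; "d" is the correct option in Table 1.
answers = ["d", "d", "a", "d", "c", "d", "d", "b", "d", "d"]
stats = option_statistics(answers, "abcde")
correct_answer_rate = stats["d"][1]   # share of examinees who chose the correct option
unselected = [label for label, (n, _) in stats.items() if n == 0]
print(stats, correct_answer_rate, unselected)
```

Iterating over the full list of option labels, rather than only the labels that appear in the answers, ensures that an option nobody selected still shows up with a count of zero, which matters for the later check on unselected options.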

Here, the correct answer rate of the test question data 14-n is the proportion of all examinees who answered the test question data 14-n correctly (a value from 0 to 1 inclusive). The discrimination index of the test question data 14-n represents the ability of the question to distinguish higher-performing examinees from lower-performing examinees, that is, examinees whose results rank below those of the higher-performing group. In this embodiment, the discrimination index of the test question data 14-n is the value obtained by subtracting the correct answer rate among the bottom Y% of all examinees from the correct answer rate among the top X% of all examinees; it is a value from -1 to 1 inclusive. A test question with a negative discrimination index may not be an appropriate question, or may be measuring an ability different from the one the administered test is meant to measure. In general, the discrimination index preferably exceeds 0.2. The top proportion X% and the bottom proportion Y% are determined from the score ranking on the entire test that includes the test question data 14-n, and each is set to a predetermined value such as 25%, 15%, or 50%.

For example, when X = Y = 50%, the discrimination index of the test question data 14-n represents the ability to distinguish the higher-performing examinees from the lower-performing examinees ranked immediately below them. When X < 50% and Y < 50%, the two groups are separated by a certain number of examinees who belong to neither group: the higher-performing examinees are those at or above a first threshold rank, and the lower-performing examinees are those at or below a second threshold rank that is lower than the first threshold rank. In that case, the discrimination index of the test question data 14-n represents the ability to distinguish the higher-performing examinees at or above the first threshold rank from the lower-performing examinees at or below the second threshold rank.
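The definition above (correct answer rate of the top X% minus that of the bottom Y%) can be illustrated with a short Python sketch. The function name `discrimination_index`, the rounding of group sizes, the tie handling (Python's stable sort), and the sample data are assumptions for illustration; the publication does not specify how group sizes are rounded or how tied scores are ordered.

```python
def discrimination_index(results, top_pct=25.0, bottom_pct=25.0):
    """Correct answer rate of the top X% minus that of the bottom Y%.

    `results` is a list of (total_test_score, answered_this_question_correctly)
    pairs, one per examinee. Ranking uses the score on the whole test.
    """
    if not results:
        raise ValueError("no examinee results")
    ranked = sorted(results, key=lambda r: r[0], reverse=True)
    # ASSUMPTION: group sizes are rounded and never smaller than one examinee.
    n_top = max(1, round(len(ranked) * top_pct / 100))
    n_bottom = max(1, round(len(ranked) * bottom_pct / 100))

    def correct_rate(group):
        return sum(1 for _, is_correct in group if is_correct) / len(group)

    return correct_rate(ranked[:n_top]) - correct_rate(ranked[-n_bottom:])


# Hypothetical results for eight examinees.
results = [(90, True), (85, True), (80, True), (70, False),
           (60, True), (55, False), (40, False), (30, True)]
d_25 = discrimination_index(results, 25, 25)   # top 2 vs. bottom 2
d_50 = discrimination_index(results, 50, 50)   # top half vs. bottom half
print(d_25, d_50)
```

With these sample data the top two examinees both answered correctly and one of the bottom two did, so the X = Y = 25% index is 1.0 - 0.5 = 0.5; with X = Y = 50% it is 0.75 - 0.5 = 0.25. Both exceed the 0.2 guideline mentioned above.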

Next, in step S11 following step S10, the CPU 2 calculates the question evaluation index using a predetermined function on the basis of the modified Ebel index contained in the test question data 14-n and the calculated correct answer rate and discrimination index. The question evaluation index E is expressed by the following equation.

[Equation 1]
E = f(x, y, z)

Here, x is the modified Ebel index, y is the correct answer rate, z is the discrimination index, and f is a predetermined function that combines the parameters x, y, and z, for example as a linear combination with predetermined coefficients or as a polynomial of degree two or higher. The question evaluation index E calculated in this way represents the quality of the test question. In this embodiment, the function f is set so that the question evaluation index E takes a higher value as the quality of the test question increases.
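As one concrete reading of the linear-combination case, the sketch below computes E as a weighted sum of x, y, and z. The function name `question_evaluation_index` and the coefficient values are purely hypothetical; the publication states only that the coefficients are predetermined and that E should rise with question quality.

```python
def question_evaluation_index(x, y, z, a=0.2, b=0.3, c=0.5):
    """Linear form of E = f(x, y, z).

    x: modified Ebel index, y: correct answer rate, z: discrimination index.
    ASSUMPTION: a, b, c are illustrative weights, not values from the patent.
    """
    return a * x + b * y + c * z


# Hypothetical inputs: x = 0.9, y = 0.7, z = 0.5 gives E of about 0.64.
E = question_evaluation_index(0.9, 0.7, 0.5)
print(round(E, 2))
```

A positive weight on z means a question that separates strong and weak examinees more sharply scores higher, which is consistent with the requirement that E increase with quality; a higher-order form would add product or squared terms in the same way.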

In FIG. 5, in step S12 following step S11, the CPU 2 refers to the question classification table 12 on the basis of the calculated question evaluation index and classifies the test question data 14-n into question evaluation category A, B, or C. The question evaluation categories A, B, and C represent the quality of the test question, with quality decreasing in the order A, B, C. Specifically, test question data 14-n classified into category A is suitable for inclusion in the question pool, and test question data 14-n classified into category B can become eligible for the pool after partial revision (brush-up). Test question data 14-n classified into category C requires more substantial revision than data classified into category B, but can become a candidate for re-administration once it has been carefully examined and revised during brush-up.

Next, in step S13, it is determined whether the test question data 14-n has been classified into question evaluation category C. If YES, the process proceeds to step S16; if NO, it proceeds to step S14. In step S14, it is determined whether there is an option that no examinee selected or an option whose selection rate is below 2%. If YES, the process proceeds to step S15; if NO, it proceeds to step S16. In step S15, the question evaluation category is lowered by one level (from category A to category B, or from category B to category C), and the process proceeds to step S16. In step S16, the test question data 14-n is stored in the pool-target test question data storage area 15 if it was classified into category A, in the brush-up-target test question data storage area 16 if it was classified into category B, and in the storage area 17 for brush-up-target test question data requiring major revision if it was classified into category C. Through step S16, the pool-target test question data 14-n of category A, the test question data 14-n of category B that needs partial revision, and the test question data 14-n of category C that needs more substantial revision are stored in the separate storage areas 15, 16, and 17, respectively. Next, in step S17, the display 6 shows the option labels, difficulty codes, modified Ebel index, correct answer rate, number of examinees selecting each option, selection rates, discrimination index, question evaluation index, and question evaluation category of the test question data 14-n, together with a comment obtained by referring to the comment table 13 on the basis of the question evaluation category, and the test question evaluation process ends. FIG. 8 shows an example of what is displayed on the display 6 in step S18 of FIG. 5. In FIG. 8, the calculated correct answer rate multiplied by 100 (in units of %) is displayed as the "correct answer rate."
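The branching in steps S12 to S16 can be summarized in a few lines of Python. This is a sketch under stated assumptions: the index thresholds in `THRESHOLDS` are invented for illustration (the contents of the question classification table 12 are not given in this passage), and the names `classify`, `final_category`, and `STORAGE_AREAS` are hypothetical. The 2% selection-rate limit is the value stated in step S14.

```python
CATEGORIES = ["A", "B", "C"]            # quality decreases from A to C

# ASSUMPTION: illustrative lower bounds for categories A and B.
THRESHOLDS = [("A", 0.6), ("B", 0.4)]

STORAGE_AREAS = {
    "A": "pool-target storage area 15",
    "B": "brush-up-target storage area 16",
    "C": "major-revision brush-up storage area 17",
}


def classify(index):
    """Step S12: map the question evaluation index to category A, B, or C."""
    for category, lower_bound in THRESHOLDS:
        if index >= lower_bound:
            return category
    return "C"


def final_category(index, selection_rates, min_rate=0.02):
    """Steps S12 to S15: classify, then downgrade once if an option is rarely chosen."""
    category = classify(index)
    if category != "C":                                   # step S13
        # Step S14: an option with zero selectors has rate 0.0, so the
        # "below 2%" test covers the "no selectors" case as well.
        if any(rate < min_rate for rate in selection_rates.values()):
            category = CATEGORIES[CATEGORIES.index(category) + 1]   # step S15
    return category


rates = {"a": 0.1, "b": 0.1, "c": 0.1, "d": 0.7, "e": 0.0}
category = final_category(0.64, rates)
print(category, "->", STORAGE_AREAS[category])            # step S16 picks the area
```

Skipping the downgrade when the category is already C matches the flowchart, where a YES at step S13 goes straight to storage because there is no lower category to move to.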

Conventionally, the subjective evaluation of a test question by the modified Ebel method and the evaluation of the test question based on objective figures obtained after the test, such as the correct answer rate, were performed independently, and the question writer had to combine the two results personally to reach an overall evaluation. According to this embodiment, by contrast, the question evaluation index is calculated from the modified Ebel index, which represents the difficulty level and necessity level set by the question writer when the question is written, and from the correct answer rate and discrimination index obtained after the test is administered, and the test question is evaluated automatically on the basis of that index. A question evaluation index reflecting both the question writer's own assessment of the test question and the objective figures obtained after the test can therefore be obtained automatically. As a result, test questions are evaluated automatically, and their quality can be judged more objectively than with the prior art.

Further, because the question evaluation category of the test question is displayed as shown in FIG. 8, the question writer can use the display to examine and refine, for example, the test question data 14 classified into category B and stored in the brush-up-target test question data storage area 16, and the test question data 14 classified into category C and stored in the storage area 17 for brush-up-target test question data requiring major revision. In addition, test question data 14-n classified into category A (that is, data whose question evaluation index indicates better quality than the threshold category B and which has been judged to be a good question) is automatically stored in the pool-target test question data storage area 15, so an objective test can be assembled more easily than with the prior art. This embodiment can therefore assist in test creation. Furthermore, if the system is configured so that the test question data 14-n classified into category A and stored in the pool-target test question data storage area 15 can be read from the examinee terminal devices 30-1 to 30-M, the examinees' self-study can be promoted.

In the test question evaluation process of FIG. 5, the question evaluation category is lowered by one level when there is an option that no examinee selected or an option whose selection rate is below 2% (steps S14 and S15), so the evaluation of the test question takes the selection rate of each option into account.

In the above embodiment, the test question data 14 contained a multiple-choice question in which one option is selected from five. However, the present invention is not limited to this; the question may be a multiple-choice question in which any number of options is selected from any number of options, or an essay question. In the case of an essay question, however, no difficulty code can be set, and the process of step S18 is executed immediately after the process of step S12 of FIG. 5.

In the above embodiment, the test question data 14 was classified into question evaluation categories A, B, and C on the basis of the question evaluation index. However, the present invention is not limited to this, and the data may be classified into two categories or into four or more categories. For example, a question evaluation category D may be added to represent test questions that are too difficult or too easy because they measure an ability different from the one intended, and that therefore cannot be candidates for brush-up.

Further, the threshold for the number of examinees selecting an option and the threshold for the selection rate in step S14, and the question evaluation index thresholds used in the test question classification table 12 to classify the test question data 14, are not limited to the values given in this embodiment. These values may be adjusted as the test question evaluation process is run repeatedly, so that test questions can be evaluated more accurately.

Furthermore, a minimum pass index (MPI; see Non-Patent Document 1) may be calculated on the basis of the difficulty code of each option set by the question writer in step S5 of the question-writing support process of FIG. 4, and test question data 14 that additionally contains the calculated minimum pass index may be generated. In this case, the minimum pass index is calculated, for example, by the following equation.

[Equation 2]
Minimum pass index = 2 / (sum of the values corresponding to the difficulty codes of all options)

Here, the value corresponding to the difficulty code indicating the correct answer (the circled 2) is 2, the value corresponding to difficulty code (2) is 2, the value corresponding to difficulty code (1) is 1, and the value corresponding to difficulty code (0) is 0.
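Equation 2 and the code values above can be checked with a small Python sketch. The function name `minimum_pass_index` and the string `"correct"` used to stand for the circled 2 are assumptions for illustration; the code-to-value mapping is the one stated in the text.

```python
# Values per difficulty code, as stated above; "correct" stands for the circled 2.
CODE_VALUES = {"correct": 2, "2": 2, "1": 1, "0": 0}


def minimum_pass_index(codes):
    """Equation 2: 2 divided by the sum of the values of all options' codes."""
    total = sum(CODE_VALUES[code] for code in codes)
    if total == 0:
        raise ValueError("no correct option among the codes")
    return 2 / total


# Table 1: options a, b, c, e have code 0 and option d is the correct answer.
mpi_table1 = minimum_pass_index(["0", "0", "0", "correct", "0"])
# A hypothetical harder question with more plausible distractors.
mpi_harder = minimum_pass_index(["2", "1", "0", "correct", "1"])
print(mpi_table1, mpi_harder)
```

For the Table 1 question the sum is 2, so the index is 2 / 2 = 1.0: every distractor is coded 0, meaning a minimally competent examinee is expected to answer correctly. In the second, hypothetical case the sum is 6 and the index falls to 2 / 6, roughly 0.33.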

In the above embodiment, the determined question evaluation category A, B, or C was displayed on the display 6. However, the present invention is not limited to this, and the category may instead be output by other output means, for example printed by a printer.

Further, in the above embodiment, the test question evaluation apparatus 1 was used by a question writer to create and evaluate test questions for a mock examination modeled on the national examination for physicians. However, the present invention is not limited to this, and the apparatus may be used to create and evaluate test questions for objective tests in fields other than medicine, such as the national examination for pharmacists or the national examination for nurses.

Furthermore, in the above embodiment, the test question evaluation program 10 and the tables 11 to 13 and data needed to execute it were stored in the hard disk drive 5 in advance. However, the present invention is not limited to this. The test question evaluation program 10 and the tables 11 to 13 and data needed to execute it may be recorded on a computer-readable recording medium 21 such as a CD-ROM or DVD, read by an optical disk drive 20 that includes a controller such as a computer, and then stored in the hard disk drive 5. Alternatively, the test question evaluation program 10 and the tables 11 to 13 and data needed to execute it may be transferred from a device external to the test question evaluation apparatus 1 and stored in the hard disk drive 5 via the Internet and the LAN interface 8.

Further, in step S5 of the question-writing support process of FIG. 4, the CPU 2 displayed the question input window 200 (see FIG. 7) on the display 6, but the present invention is not limited to this. For example, the CPU 2 may execute a support function that also displays similar template questions according to the question type (in the example of FIG. 7, "Type A," as indicated at the top of the window). The CPU 2 may also execute a guidance function during question writing (for example, when conforming to the core curriculum, a function that replaces the word "female" with the word "girl" when "3 years old" is entered).

Furthermore, in the above embodiment, the correct answer rate and the expected correct answer rate were values from 0 to 1 inclusive, but the present invention is not limited to this, and they may be expressed as percentages.

As described above, the test question evaluation apparatus, its control method, the program, and the recording medium according to the present invention calculate and output a question evaluation index representing the quality of a test question. The index is calculated, using a predetermined function whose parameters are the modified Ebel index, the correct answer rate, and the discrimination index, from the modified Ebel index calculated from the difficulty level and necessity level entered when the question is written and from the correct answer rate and discrimination index calculated from data including the answers to the test question. A question evaluation index reflecting both the question writer's own assessment of the test question and the objective figures obtained after the test can therefore be obtained automatically. As a result, test questions are evaluated automatically, and their quality can be judged more objectively than with the prior art.

1 ... Test question evaluation apparatus,
2 ... CPU,
3 ... ROM,
4 ... RAM,
5 ... Hard disk drive,
6 ... Display,
7 ... Operation input unit,
8 ... LAN interface,
10 ... Test question evaluation program,
11 ... Modified Ebel index calculation table,
12 ... Test question classification table,
13 ... Comment table,
14-1 to 14-N ... Test question data,
15 ... Pool-target test question data storage area,
16 ... Brush-up-target test question data storage area,
17 ... Storage area for brush-up-target test question data requiring major revision,
20 ... Optical disk drive,
21 ... Recording medium,
31-1 to 31-M ... Examinee terminal devices,
80 ... LAN.

Claims

A test question evaluation apparatus comprising:
input means for inputting predetermined data entered by a writer of a test question;
control means for executing a question-writing support process that supports creation of the test question and a test question evaluation process that evaluates the test question; and
storage means for storing in advance a modified Ebel index calculation table showing the correspondence between the difficulty level and necessity level of a test question and a modified Ebel index representing the necessity level and difficulty level of the test question,
wherein the control means:
in the question-writing support process, calculates the modified Ebel index representing the difficulty level and necessity level of the test question by referring to the modified Ebel index calculation table on the basis of the difficulty level and necessity level of the test question input from the input means; and
in the test question evaluation process, calculates, on the basis of data including the contents of answers given to the test question by a plurality of examinees, the correct answer rate of the test question and a discrimination index representing the ability to distinguish, among the plurality of examinees, higher-performing examinees from lower-performing examinees whose results rank below those of the higher-performing examinees, and, on the basis of the calculated modified Ebel index, correct answer rate, and discrimination index, calculates and outputs a question evaluation index representing the quality of the test question using a predetermined function having the modified Ebel index, the correct answer rate, and the discrimination index as parameters.

The test question evaluation apparatus according to claim 1, wherein, in the test question evaluation process, the control means further classifies the test question into one of a plurality of question evaluation categories on the basis of the calculated question evaluation index and outputs the resulting question evaluation category.

The test question evaluation apparatus according to claim 2, wherein, in the test question evaluation process, the control means further stores in the storage means test question data including the contents of a test question classified into a question evaluation category whose question evaluation index indicates better quality than a predetermined threshold category among the plurality of question evaluation categories.

The test question evaluation apparatus according to claim 2 or 3, wherein the test question is a multiple-choice question, and, in the test question evaluation process, the control means further calculates the selection rate of each option of the multiple-choice question on the basis of the data including the answer contents and, when the test question has been classified into one of the plurality of question evaluation categories and there is an option whose selection rate is at or below a predetermined selection rate, lowers the resulting question evaluation category by one level.

The test question evaluation apparatus according to claim 1, wherein the test question is a multiple-choice question, and the control means further:
in the question-writing support process, receives from the input means a difficulty code representing the difficulty of each option of the multiple-choice question; and
in the test question evaluation process, outputs the input difficulty codes together with the question evaluation index.

The test question evaluation apparatus according to any one of the preceding claims, wherein the modified Ebel index is the expected correct answer rate used in the modified Ebel method, and the modified Ebel index calculation table is a table used in the modified Ebel method that shows the correspondence between the difficulty level and necessity level of a test question and the expected correct answer rate.

A control method for a test question evaluation apparatus comprising:
input means for inputting predetermined data entered by a writer of a test question;
control means for executing a question-writing support process that supports creation of the test question and a test question evaluation process that evaluates the test question; and
storage means for storing in advance a modified Ebel index calculation table showing the correspondence between the difficulty level and necessity level of a test question and a modified Ebel index representing the necessity level and difficulty level of the test question,
the control method comprising:
a step in which the control means, in the question-writing support process, calculates the modified Ebel index representing the difficulty level and necessity level of the test question by referring to the modified Ebel index calculation table on the basis of the difficulty level and necessity level of the test question input from the input means; and
a step in which the control means, in the test question evaluation process, calculates, on the basis of data including the contents of answers given to the test question by a plurality of examinees, the correct answer rate of the test question and a discrimination index representing the ability to distinguish, among the plurality of examinees, higher-performing examinees from lower-performing examinees whose results rank below those of the higher-performing examinees, and, on the basis of the calculated modified Ebel index, correct answer rate, and discrimination index, calculates and outputs a question evaluation index representing the quality of the test question using a predetermined function having the modified Ebel index, the correct answer rate, and the discrimination index as parameters.

The control method for a test question evaluation apparatus according to claim 7, further comprising a step in which the control means, in the test question evaluation process, classifies the test question into one of a plurality of question evaluation categories based on the calculated question evaluation index and outputs the resulting question evaluation category.
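As a minimal sketch of the classification step, assuming four categories labelled A to D and illustrative boundary values (neither the labels nor the boundaries appear in the claim), the mapping from index to category could look like this:

```python
from typing import Sequence


def classify(index: float,
             labels: Sequence[str] = ("A", "B", "C", "D"),
             lower_bounds: Sequence[float] = (0.6, 0.3, 0.0)) -> str:
    """Map a question evaluation index to a category, best category first.

    Assumption: labels and lower_bounds are hypothetical; lower_bounds[i] is
    the smallest index that still qualifies for labels[i].
    """
    for label, bound in zip(labels, lower_bounds):
        if index >= bound:
            return label
    # Anything below the last bound falls into the worst category.
    return labels[-1]
```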

9. The control method for a test question evaluation apparatus according to claim 8, further comprising a step in which the control means, in the test question evaluation process, stores in the storage means test question data including the content of each test question classified into a question evaluation category whose question evaluation index indicates higher quality than a predetermined threshold category among the plurality of question evaluation categories.
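The storing step can be illustrated, under stated assumptions, as a filter that keeps only questions in categories strictly better than the threshold category. The category order, the dictionary layout of a question record, and the function name are hypothetical.

```python
from typing import Dict, List

# Assumption: categories ordered from best to worst.
CATEGORY_ORDER = ["A", "B", "C", "D"]


def select_for_storage(questions: List[Dict[str, str]],
                       threshold_category: str = "C") -> List[Dict[str, str]]:
    """Return the questions whose category is better than the threshold."""
    limit = CATEGORY_ORDER.index(threshold_category)
    return [q for q in questions
            if CATEGORY_ORDER.index(q["category"]) < limit]
```

The returned records would then be written to whatever storage the apparatus uses; that part is left out because the claim does not specify it.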

The control method for a test question evaluation apparatus according to claim 8, wherein the test question is a multiple-choice question,
the control method further comprising a step in which the control means, in the test question evaluation process, calculates a selection rate for each option of the multiple-choice question based on the data including the answers and, when classifying the test question into the plurality of question evaluation categories, lowers the classified question evaluation category by one step if any option has a selection rate equal to or less than a predetermined selection rate.
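The option-level check can be sketched as follows. The 5% minimum selection rate, the category labels, and the function names are assumptions made for the example; the claim only states that the category is lowered by one step when some option is selected at or below a predetermined rate.

```python
from collections import Counter
from typing import Dict, Sequence


def selection_rates(answers: Sequence[str],
                    options: Sequence[str]) -> Dict[str, float]:
    """Fraction of examinees who chose each option (0.0 if never chosen)."""
    counts = Counter(answers)
    return {opt: counts[opt] / len(answers) for opt in options}


def adjust_category(category: str,
                    rates: Dict[str, float],
                    min_rate: float = 0.05,
                    order: Sequence[str] = ("A", "B", "C", "D")) -> str:
    """Lower the category by one step if any option is rarely selected.

    Assumption: min_rate and the best-to-worst order are hypothetical. The
    worst category cannot be lowered further and is returned unchanged.
    """
    if any(rate <= min_rate for rate in rates.values()):
        position = list(order).index(category)
        return order[min(position + 1, len(order) - 1)]
    return category
```

An option that almost nobody selects does not function as a plausible distractor, which is the usual reason for treating it as a defect of the question.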

The control method for a test question evaluation apparatus according to claim 1, wherein the test question is a multiple-choice question,
the control method further comprising:
a step in which the control means, in the question-setting support process, receives from the input means a difficulty code representing the difficulty level of each option of the multiple-choice question; and
a step in which the control means, in the test question evaluation process, outputs the input difficulty codes together with the question evaluation index.

12. The control method for a test question evaluation apparatus according to claim 1, wherein the modified Ebel index is the expected correct answer rate used in the modified Ebel method, and
the modified Ebel index calculation table is the table used in the modified Ebel method, showing the correspondence between the difficulty level and necessity level of a test question and the expected correct answer rate.
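For illustration, the calculation table can be modelled as a lookup keyed by necessity level and difficulty level. The level names and every expected correct answer rate below are hypothetical placeholders, not values taken from the modified Ebel method or from this document.

```python
from typing import Dict, Tuple

# Assumption: (necessity level, difficulty level) -> expected correct answer rate.
# All level names and rates are illustrative placeholders.
EBEL_TABLE: Dict[Tuple[str, str], float] = {
    ("essential", "easy"): 0.90,
    ("essential", "medium"): 0.80,
    ("essential", "hard"): 0.70,
    ("important", "easy"): 0.80,
    ("important", "medium"): 0.70,
    ("important", "hard"): 0.60,
    ("supplementary", "easy"): 0.70,
    ("supplementary", "medium"): 0.60,
    ("supplementary", "hard"): 0.50,
}


def modified_ebel_index(necessity: str, difficulty: str) -> float:
    """Look up the expected correct answer rate for a question."""
    try:
        return EBEL_TABLE[(necessity, difficulty)]
    except KeyError:
        raise ValueError(
            f"no table entry for necessity={necessity!r}, "
            f"difficulty={difficulty!r}"
        ) from None
```

Storing the table as data rather than as a formula matches the claim wording, in which the table is stored in advance and merely referred to when a question is prepared.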

A program comprising the steps of the method for controlling a test question evaluation apparatus according to claim 7.

A computer-readable recording medium storing the program according to claim 13.