TW201013549A - Method of estimation of the examinee's ability on the computerized adaptive testing using adaptive network-based fuzzy inference system - Google Patents
Method of estimation of the examinee's ability on the computerized adaptive testing using adaptive network-based fuzzy inference system Download PDFInfo
- Publication number
- TW201013549A TW201013549A TW97136334A TW97136334A TW201013549A TW 201013549 A TW201013549 A TW 201013549A TW 97136334 A TW97136334 A TW 97136334A TW 97136334 A TW97136334 A TW 97136334A TW 201013549 A TW201013549 A TW 201013549A
- Authority
- TW
- Taiwan
- Prior art keywords
- ability
- value
- test
- subject
- fuzzy inference
- Prior art date
Links
Landscapes
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
Description
201013549 九、發明說明: 【發明所屬之技術領域】 . 本發明係關於一種數位學習領域中用以量測受試者 能力之方法,尤其是一種利用調適性網路模糊推論系統評 估電腦化適性測驗受試者能力之方法。 【先前技術】 目刖在學生爿b力測篁方面,傳統分類學生程度好壞 ❹ 的方法,為在同樣的試題基準下,以分數高低來分類學生 學生須將所有试題做完’這樣的方法在鑑別學生程度上 ,必定花費不少時間。因此若採用電腦化適性測驗,提供 適合的題目給受試者作答,便能以較傳統測驗較少的題數 ’準確分類學生的能力。而在數位學習中,對於如何預測 受試者能力在電腦化適性測驗中提供適合作答的試題以 達到最佳的測驗效果是一項重要的研究。例如··最大近似 值估計( Maximum Likelihood Estimation,MLE )及貝氏估 Ο S十(Bayesian Likelihood Estimation,BLE)等,即被提出 用以解決能力評估且有不錯的效果。 在數位學習的領域裡,「電腦化適性測驗」為量測學 . 生能力的方法,最常使用現代測驗裡的試題反應理論( • Item ResP〇nse Theory,IRT)為其架構。而應用 IRT 所發201013549 IX. Description of the invention: [Technical field to which the invention pertains] The present invention relates to a method for measuring a subject's ability in the field of digital learning, in particular, an adaptive network fuzzy inference system for evaluating a computerized fitness test The method of subject ability. [Prior Art] In the aspect of students' ability to test and test students, the traditional method of classifying students is good or bad. In order to classify students with scores under the same test questions, all students must complete all the questions. The method must take a lot of time to identify the students. Therefore, if a computerized fitness test is used to provide a suitable question to the subject, the student's ability can be accurately classified by a number of questions that are less than the traditional test. In digital learning, it is an important study to predict how a subject's ability to provide a suitable test in a computerized fitness test to achieve the best test results. For example, Maximum Likelihood Estimation (MLE) and Bayesian Likelihood Estimation (BLE) are proposed to solve the capability assessment and have good results. In the field of digital learning, the "computerized fitness test" is a measure of abilities. The method of abilities is most often based on the Item ResP〇nse Theory (IRT) in modern tests. And applied by IRT
展的數位學習系統,近年來也越來越盛行,這些以IRT 理論為基礎,再搭配上各自的數位學習系統,均達到不錯 的成效。 例如習知評量系統較為常見的是「調適性網路模糊 201013549The digital learning system of the exhibition has become more and more popular in recent years. These are based on the IRT theory and are matched with their respective digital learning systems, which have achieved good results. For example, the familiar measurement system is more commonly used as "adaptive network blur 201013549
System ’ ANFIS)」,其架構係以模糊推論系統(FuzzySystem ’ ANFIS), whose architecture is based on fuzzy inference systems (Fuzzy
Inference System, FIS)為主體’結合前饋式類神經網路的 監督式學習方法,使得系統充分發揮對於不確定性與不精 確性的處理能力,同時具有自我學習組織能力與歸納推理 功能。 請參照第1圖所示,一般而言,「調適性網路模糊推 論系統(ANFIS)」架構大致包含有一輸入層、一規則層 、一正規化層、一結果推論層及一輸出層等五層。其中該 輸入層(第一層)係將輸入變數映射到模糊集合中,屬於 X的集合分別為1 A和2 A兩個子集合;屬於y的集合 分別為1 B和2 R忐他I;隹厶•甘„丄^ ^ 常使用鐘形函數。該規則層(第二層)係將各輸入變數的 模糊集合和隸屬函數在該輸入層決定後,將各輸入變數間 的模糊集合進行排列組合之配對,輪出值表示規則的觸發 ❹ 強X該正規化層(第二層)係將上-層各節點所得的娃The Inference System (FIS) is the main subject' combined with the feed-forward neural network's supervised learning method, which enables the system to fully utilize the ability to deal with uncertainty and inaccuracy, as well as self-learning organizational ability and inductive reasoning. Referring to Figure 1, in general, the "Adaptive Network Fuzzy Inference System (ANFIS)" architecture generally includes an input layer, a rule layer, a normalization layer, a result inference layer, and an output layer. Floor. Wherein the input layer (the first layer) maps the input variables into the fuzzy set, the set belonging to X is respectively two sub-sets of 1 A and 2 A; the set belonging to y is 1 B and 2 R忐 other I;隹厶•甘„丄^ ^ often uses a bell-shaped function. The rule layer (the second layer) arranges the fuzzy set and the membership function of each input variable after the input layer determines the fuzzy set between the input variables. Pairing of combinations, the round-out value indicates the triggering of the rule ❹ Strong X The normalized layer (the second layer) is the baby obtained from the upper-layer nodes
調適性網路模糊推論系統( ANFIS)」則必須要獲得一組知識,^ )的if-then ruies,其通常可分為語 產生一組模糊(fiizzy 多舌的資訊(linguistic 201013549 inf〇rmation)及數字的資訊(numerical information);前 者為專家提供的知識,通常是透過長期累積的經驗加上大 量的嘗試錯誤而得到的,後者為將系統操作_知識的輸入 輸出對記錄起來,拿來訓練該推論系統,將訓練資料配合 混合式學習規則系統(Hybrid Learning Algorithm )來訓 練模型,以作為該「調適性網路模糊推論系統(ANFIS) 」的運作方式。 ❹ 然而,前述習知評量系統係採用固定的題目,並未 考慮受試者作答值與其能力不符合和能力評估標準差異動 的情形,且對於不同程度的學生作答而言,亦會覺得相對 困難或相對簡單,因此無法有效的預測出學生的能力。再 者’傳統評量系統亦僅收集一組訓練資料’當該組訓練資 料之作答值與文試者能力不符合時,即無法提供較佳的能 力評估準確度。 【發明内容】 ❿ 本發明係提供-種彻㈣性網路模齡論系統評 估電腦化適性測驗受試者能力之方法,以提供較佳的能力 評估準確度,為其主要之發明目的。 . 為達到前述發明目的,本發明所運用之技術手段及 藉由該技術手段所能達到之功效包含有: -種湘調適性網路_推論系統評估電腦化適性 測驗受試者能力之方法包含一模擬訓練資料步驟.、一學習 ,力模型建置步驟及-適性測驗能力評估步驟。該模擬訓 練資料步驟藉由-電腦系統基於試題反應理論以模擬產生 201013549 二=訓練資料’其中第丨練情況為作答值與受試 I能力㈣合’第二_練諸情況為作答值與受試者能 力不符°的清況’第二組訓練資料情況為考慮標準差異動 ^情況’·該學習能力模型建置步驟將前述三組訓練資料進 行訓練’以建置完成—觸性網路模娜論系統模型;該 適性測驗能力評估步驟,彻麵適性網路_推論系統 針對受試者進行施測,以產生一能力估計值,進而完成能 ❹ 力評估作業。藉此,即具有可提升評估能力準確度之功效 〇 【實施方式】 為讓本發明之上述及其他目的、特徵及優點能更明 顯易K,下文特舉本發明之較佳實施例,並配合所附圖式 ’作詳細說明如下: 本發明主要係藉由一電腦系統(未繪示)連接至少 資料庫(未繪示)作為執行架構,並以調適性網路模糊 ❹ 推淪系統(Adaptive-Network-Based Fuzzy Inference System,ANFIS)作為基礎的學習能力新預測模型,再辅 以試題反應理論(Item ReSp〇nse Theory,irt)為依據來 適性化選擇試題,以便利用電腦化適性測驗中試題的鑑別 度、困難度、猜測度及受試者作答試題前的能力作為參數 ,提供更符合受試者能力的試題,進而藉由「調適性網路 模糊推論系統(ANFIS)」推論受試者能力的修正量,故 可提供較好的能力評估準確度。請參照第2圖所示,本發 明利用調適性網路模糊推論系統評估電腦化適性測驗受試 201013549 者能力之方法大致包含一模擬訓練資料步驟S1、—風 能力模型建置步驟S2及一適性測驗能力評估步驟 〇 • 本發明較佳實施例之模擬訓練資料步称S1,主要係 基於表試題反應理論(Item Response Theory,说τ)中 可將學生的能力量化到一3〜+ 3之間’並以三參數對數 形模式的題庫進行施測,學生能力的測量與題目的難产 ❹ 鑑別度、猜測度以及該題作答正確與否有關,故該模^訓 練資料步驟S1主要係藉由該電腦系統模擬以產生訓練資 料’請參照第3圖所示,其詳細步驟流程包含: 一選擇題目步驟S11,該電腦系統較佳係自動選擇五 題訊息量涵蓋全能力範圍的題目供受試者做答,期望能準 確測出受試者初始能力,之後的流程在第i次選擇題目上 ,則依上一次即第i-Ι次流程的能力估計值來作選擇,選 擇之題目為由該資料庫内所預設之題庫裡挑選試題訊 ❹ 息函數/,.(θΛ)最大值的題目,其公式如下: !感=ϋ * =_1.72ay2(l- c >.) [cj + + sargmax/,(彡)ζ· = 1,2,··.,《 其中/代表第ί·次流程;《代表流程總次數;J代表受 試者能力估計值;/,〆)代表試題,·在能力J上所提供的訊 息;/T(谷)代表在能力0上尸、(谷)值的導數;代表能力点 201013549 ^如);%代 ci代表試題/ 在試題/上答_試題反應鮮;_代表 表試題/魏财4代表朗7·_易度; 的猜測度。 採=====崎㈣率侧 = ^ +(l-cr)The Adaptive Network Fuzzy Inference System (ANFIS) must acquire a set of knowledge, ^) if-then ruies, which can usually be divided into words to produce a set of fuzzy (fiizzy multi-lingual information (linguistic 201013549 inf〇rmation) And numerical information; the knowledge provided by the former for the expert is usually obtained through long-term accumulated experience plus a large number of trial errors, which are used to record the input and output pairs of the system operation _ knowledge. The inference system trains the model with the Hybrid Learning Algorithm as the operation mode of the Adaptive Network Fuzzy Inference System (ANFIS). ❹ However, the aforementioned conventional evaluation system It adopts a fixed topic and does not consider the situation in which the subject's answer value differs from its ability incompatibility and ability assessment criteria, and it may be relatively difficult or relatively simple for students of different levels to answer, so it cannot be effective. Predicting the student's abilities. In addition, the 'traditional assessment system only collects a set of training materials' when the group When the answer value of the training material does not match the ability of the test subject, it is impossible to provide better ability to evaluate the accuracy. [Summary of the Invention] ❿ The present invention provides a computerized fitness test for assessing the computerized fitness test. The method of the subject's ability to provide a better ability to assess the accuracy is the primary objective of the invention. To achieve the foregoing object, the technical means utilized by the present invention and the effects achievable by the technical means include There are: - A variety of adaptability network _ inference system to assess computer aptitude test subject ability method includes a simulation training data step, a learning, force model construction steps and - fitness test ability evaluation steps. The simulation training The data step is based on the test-response theory of the computer system to simulate the generation of 201013549. 2=Training data' wherein the first training situation is the answer value and the test I ability (four) combined 'second _ the situation is the answer value and the subject ability The condition of the non-conformity of 'the second group of training data is to consider the standard difference ^ situation' · the learning ability model construction steps will be the above three sets of training materials Line training 'to complete the construction - the tactile network model system model; the fitness test ability evaluation step, the full-featured network _ inference system for the subject to test, to generate an ability estimate, and then complete It is possible to evaluate the operation, thereby having the effect of improving the accuracy of the evaluation capability. [Embodiment] To make the above and other objects, features and advantages of the present invention more apparent, the following is a summary of the present invention. The preferred embodiment is described in detail with reference to the following drawings. The present invention mainly uses a computer system (not shown) to connect at least a database (not shown) as an execution architecture, and adjusts the network with an adaptive network. Ad The Adaptive-Network-Based Fuzzy Inference System (ANFIS) is used as the basis for the new prediction model of learning ability, supplemented by the Item ReSp〇nse Theory (irt) to adapt the test questions to make use of The degree of discrimination, difficulty, guessing, and ability of the subject before answering the test questions in the computerized fitness test are provided as parameters to provide more conformity to the subject. The questions, then by "adaptability network fuzzy inference system (ANFIS)" deduction amount correction capability of the subject, it can provide better ability to assess accuracy. Referring to FIG. 2, the method for evaluating the ability of the computerized fitness test participant 201013549 using the adaptive network fuzzy inference system generally includes a simulation training data step S1, a wind capability model construction step S2, and a suitability. Test ability evaluation step 〇 • The simulated training data step S1 of the preferred embodiment of the present invention is mainly based on the Item Response Theory (τ), which can quantify the student's ability to between 3 and +3. 'And the three-parameter logarithmic pattern of the test bank is tested, the measurement of the student's ability is related to the dystocia, the degree of discrimination, the guessing degree and the correctness of the question. Therefore, the step S1 of the training data is mainly based on The computer system simulates to generate training materials. Please refer to Figure 3, the detailed steps of the process include: 1. Selecting the topic step S11, the computer system preferably automatically selects five questions to cover the full range of topics for the subject. The person answers, expects to accurately measure the initial ability of the subject, and the subsequent process is based on the last i-th order process on the i-th selection question. The ability estimate is used for selection. The title of the selection is the question of selecting the maximum value of the test message information function /, (θΛ) from the question bank preset in the database. The formula is as follows: !Sense=ϋ * =_1 .72ay2(l- c >.) [cj + + sargmax/,(彡)ζ· = 1,2,··., "where / represents the ί·th flow; "represents the total number of processes; J represents the Estimated tester's ability; /, 〆) representative test questions, · information provided on ability J; /T (valley) represents the derivative of corpse, (valley) value at ability 0; representative ability point 201013549 ^ as); % generation ci on behalf of the test questions / on the test questions / answer _ test questions fresh; _ representative table test questions / Wei Cai 4 represents Lang 7 · _ easy degree; the guess. Mining =====Saki (four) rate side = ^ +(l-cr)
其中4為隨機亂數值,㈣4 ; z•代表第,·次流程。 藉此,當機率乃⑻大於或等於亂數值及時,模擬作答 R(6〇為正確,其值為1,反之為失敗,其值為〇。 一計算能力估計值步驟S13,受試者能力估計值谷採 用最大近似值估計(Maximum Likelihood Estimation,MLE )方法計算,其公式如下: S = aigmaxP(Ul)U2i^-,U„ |0)4 is a random chaotic value, (4) 4; z• stands for the first, and the second process. Therefore, when the probability is (8) greater than or equal to the random value in time, the simulation answers R (6〇 is correct, its value is 1, and vice versa, its value is 〇. A calculation ability estimation step S13, subject ability estimation The value valley is calculated by the Maximum Likelihood Estimation (MLE) method, and its formula is as follows: S = aigmaxP(Ul)U2i^-, U„ |0)
argmax]~J^ (6)Ui Q ψ)1^ /=1 argmaxpc,- +(1_C/)_ ,-1.7¾ ㈣) ·)严 其中ί‘ = 1,2,.·.,《 ; «代表流程總次數;¢/代表第;個試 題測驗反應值’ R的值為1 (代表正確反應)或〇 (代表 不正確反應);0代表學生能力值,_3<0<3。藉此,使 得户^,〜…,以的為最大值之❼’即為能力估計值彡。 201013549 一計算能力估計標準差步驟S14,主要係評估標準差 是否小於門檻值S141,其中证(力為能力估計值j之標準 差’若其值小於所設定之門檻值則停止施測,否則就繼續 前述選擇題目步驟S11,其公式如下: 鱗=1>爲才_ 卿)=Argmax]~J^ (6)Ui Q ψ)1^ /=1 argmaxpc,- +(1_C/)_ ,-1.73⁄4 (4)) ·) Strict ί' = 1,2,.·., " ; Represents the total number of processes; ¢ / stands for the first test test response value 'R value of 1 (for correct response) or 〇 (for incorrect response); 0 for student ability value, _3 <0; In this way, the user's ^, ~..., and the maximum value ❼' is the ability estimate 彡. 201013549 A calculation capability estimation standard deviation step S14, mainly to assess whether the standard deviation is less than the threshold value S141, wherein the certificate (the force is the standard deviation of the capability estimation value j) stops the test if its value is less than the set threshold value, otherwise Continuing with the aforementioned selection topic step S11, the formula is as follows: Scale = 1 > _ _ Qing) =
:計算能力估計侯選值步驟S15,係將能力估計值沒 區門$,區間(confidence interval),假設為95%的信賴 品:0信賴區間的範圍則為ΘΛ±1.96><從(力為標準差,在 ,^的所有點視為能力估計候選點,並以0.01為間隔 題目,能力估計候選點依照試題訊息函數從題庫裡挑選 估計候^叶算能力估計值,再計算其標準差,最後在能力 、、點區間其標準差最小的點即為最佳能力估計候選 更詳兮 ’根據前述該模擬訓練資料步驟S1,該電 人St收集三組訓練資料。其中第一組訓練資料情況為 :情^反應理論(Item Resp麵Theory,IRT)流程的 練資料β亦即作答值與受試者能力相符合,該第一組訓 程難易^上次流程能力估計值、本次流程鑑別度、本次流 铲力修正本次流程猜測度、本次流程作答值和本次流程 值).〃量(即本次流程能力估計值一上次流程能力估計 組則練資料情況為作答值與受試者能力不符合 —11 — 201013549 第"細丨練資料是上次絲能力估計值、本次 程作答值二度、本次流程猜測度、本次流 __ , _本认机程能力修正量(即本次流程能力估計值 準差展ΓΓί力估計值)。第三組訓練資料情況為考慮標 :、月況,藉由挑選標準差最小的能力點作為候選 集其他的钱者能力可能值,該第三組訓練資料 程最佳能力估計值、本次流程賴度、本次流程 0 ❹ Γ i本次錄_度、核録作倾和本次流程最: The calculation capability estimation candidate value step S15 is to calculate the capability estimation value without the gate $, the confidence interval, and assume the 95% trust product: the range of the 0 confidence interval is ΘΛ±1.96><from The force is the standard deviation. At all points of ^, it is regarded as the candidate point of ability estimation, and the problem is ranked by 0.01. The candidate of the ability estimation selects the estimated value of the estimated calculation ability of the candidate leaf from the question bank according to the test message function, and then calculates the standard. Poor, finally, the point where the standard deviation is the smallest in the ability and the interval, that is, the best ability estimation candidate is more detailed. According to the aforementioned simulation training data step S1, the electric person St collects three sets of training materials. The first group training The data is as follows: the emotional data of the Item Resp Surface Theory (IRT) process is also consistent with the ability of the subject. The first group of training is difficult ^ the last process capacity estimate, this time The process identification degree, the current flow shovel force correction process guessing degree, the current process answer value and the current process value). 〃 quantity (that is, the current process capability estimation value, the last process capability estimation group, the training data is answer Inconsistent with the ability of the subject—11 — 201013549 The first section is the estimated value of the silk ability, the second time of the process, the guess of the process, the current stream __, _ this machine The capability correction amount (that is, the estimated value of the current process capability is estimated). The third group of training data is considered as the target: the monthly condition, by selecting the ability point with the smallest standard deviation as the candidate set for other money. Possible value of the ability, the third group of training data, the best ability estimate, the current process, the current process 0 ❹ Γ i this time recorded _ degree, the nuclear record for the dump and the most
CiEi: 〇本錢程最錄力估賴選值-上次流 程最佳能力估計值)。 另外’該電腦系統之模擬環境為受試者能力值_3〜 ^之:’每_ (m做為受試者真實能力來模擬作答 測驗’每個能力值會執行若干個―,每-個流程( 收集以上所描狀三種料的三_練資料, 隨著標準差_動,㈣轉受試巧朗推論能力值, ^透過各料同_賴況,使用、·網路模糊推論 系統UNFIS)」來推論出能力修正量,以域能力預測 的準綠度。 本發明較佳實施例之學習能力模型建置步驟S2,主 要係將前述三組解資料進行姆’其推論能力值抑的 公式為: x(j) = x(i-l) + Ax(i~l) M/-1) = /(,〇-I),a(〇s6(〇c(ax〇) 其中i代表^次流m)代表第Μ次流程之能 力估計值U-組訓練資料及第二一練資料)或是上次 —12 — 201013549 最佳能力估計候選值(第三組訓練資料);α(〇代表第^次 流程題目鑑別度;的·)代表第z•次流程題目難易度;c⑺代 表第z次流程題目猜測度;^⑺代表第,·次流程題目作答值 ;Δχ(ί_1)代表第,·次流程能力修正量或最佳能力修正量。 藉此,可進一步使用「調適性網路模糊推論系統(Anfis )」予以模擬/(),該「調適性網路模糊推論系統( )」架構包含: ^CiEi: The most important record of the money course is the estimated value of the last process. In addition, the simulation environment of the computer system is the subject's ability value _3~^: 'every _ (m as the subject's real ability to simulate the answer test) each ability value will be executed a number of -, each - Process (collecting the three _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ In order to infer the capacity correction amount, the quasi-greenness predicted by the domain capability. The learning ability model construction step S2 of the preferred embodiment of the present invention mainly performs the above three sets of solution data to perform the m's inference ability value suppression. The formula is: x(j) = x(il) + Ax(i~l) M/-1) = /(,〇-I), a(〇s6(〇c(ax〇) where i represents ^current m) represents the U-group training data and the second training data of the third process) or the last -12 - 201013549 best ability estimation candidate (third group training data); α (〇 represents the first ^The number of process title discrimination; () represents the difficulty of the z-th process topic; c(7) represents the z-th process problem guess; ^(7) represents the first, the second process Answer value; Δχ (ί_1) on behalf of, the amount of correction-time process capability or the ability to amend the optimum amount. In this way, the "Adjustment Network Fuzzy Inference System (Anfis)" can be further used to simulate / (), the "adaptive network fuzzy inference system ()" architecture contains: ^
不如下 一輸入層(第一層),此層節點為調適性節點 輸入值x(/-l)、α(〇、6(〇及各分為三個鐘型隸屬函數, ><0則為兩個鐘型隸屬函數,該輸入值的鐘型隸屬函數表 1 + Ni~ejNt - — .dJ^i Οχ =μ]·(Νί) = 其中 % e {χ(ζ. - 1),α(ζ·),δ⑺,φ·),〆,·)} ; = 12, 為Μ的隸屬函數個數;4、%、/力為前提部;从〜 …為鐘形(bell-shaped)隸屬函數。 之參數, 一規則層(第二層),此層總共有3χ3χ3><3 規則,可將該輸入層的隸屬函數相乘,作甘Χ2 = 162條 ',其公式如下: ‘、、、發動強度 〇2,ρ=ΤΙμΛΝί) Ν(Instead of the following input layer (first layer), this layer node inputs the values x(/-l), α(〇, 6(〇 and each divided into three bell-type membership functions, ><0 Then, it is a two-valve membership function, and the clock type of the input value belongs to the function table 1 + Ni~ejNt - — .dJ^i Οχ = μ]·(Νί) = where % e {χ(ζ. - 1), α(ζ·), δ(7), φ·), 〆,·)} ; = 12, is the number of membership functions of Μ; 4, %, / force is the premise; from ... ... bell-shaped Membership function. The parameter, a regular layer (the second layer), this layer has a total of 3χ3χ3><3 rules, which can be multiplied by the membership function of the input layer, and the formula is as follows: ',,, Starting strength 〇2, ρ=ΤΙμΛΝί) Ν(
鐘形(bell- 其中 M e{x(i-l)W),6(〇,c(〇,y(i)}; 為'的隸屬函數個數;/7 = 1,2,...,162 shaped)隸屬函數。 —13 — 201013549 一正規化層(第三層),此層將該規則層各節點的結 果正規化後,輸出結果介於0與丨之間,其公式如下: J P 162 iwp 其中%為發動強度,^ = 1,2,..462。 一結果推論層(第四層),此層節點為調適性節點, 為該正規^^層節p乘上後項參數方程式,其公式如下: 〇A =^pfp= Wp (qpX(i) + ^α(/) + SpbQ) + (pC(jl) + u^y(i) + ^ } 其中%,4,\4,^,\為結論部分參數;/? = 1,2,,162。 一輸出層(第五層),此層節點為一個固定節點,將 該結果推論層進行加總,作為輸出值,其公式如下·· 162 〇5 = p—l 其中5為發動強度,= 1,2,...,162。 整體而言,當完成該學習能力模型建置步驟S2後, 最後較佳可進一步使用複合型學習演算法(hybrid leamingrule)修正前述部分參數與結論部分參數。 本發明較佳實施例之適性測驗能力評估步驟,主 要係針對受試者進行制,其較佳可選概題訊息量涵蓋 全能力範®的題目,細最大近似值估計(MaximumBell-shaped (bell-where M e{x(il)W), 6(〇,c(〇,y(i)}; is the number of membership functions of ';7= 1,2,...,162 Shape) membership function. —13 — 201013549 A normalization layer (third layer), this layer normalizes the results of each node of the rule layer, and the output is between 0 and ,, and its formula is as follows: JP 162 iwp Where % is the launching strength, ^ = 1, 2, .. 462. A result inference layer (fourth layer), this layer node is the adaptive node, multiplying the regular parameter section of the normal ^^ layer section by its equation The formula is as follows: 〇A =^pfp= Wp (qpX(i) + ^α(/) + SpbQ) + (pC(jl) + u^y(i) + ^ } where %,4,\4,^, \ is the conclusion part of the parameter; /? = 1,2,,162. An output layer (fifth layer), this layer node is a fixed node, the result inference layer is summed as the output value, the formula is as follows · 162 〇5 = p—l where 5 is the launching strength, = 1, 2, ..., 162. Overall, when the learning ability model is established, step S2 is completed, and finally, composite learning can be further used. The algorithm (hybrid leamingrule) corrects some of the aforementioned parameters Conclusion Part parameters. Adaptive Testing procedure of Example Assessment of the ability of the present invention the preferred embodiment, the main lines were prepared for the subject, which preferably takes the title optional amount of information covering the whole range of capacity ® subject, the maximum fine estimation approximations (Maximum
Likelihood Estimation, MLE)進行開頭能力的測量。請參 照第4圖所示’該適性測驗能力評估步驟S3包含一選擇 題目步驟S31、一模擬受試者作答步驟阳、一推論能力 值步驟S%、-計算能力估計標準差步驟S34及-計算能 201013549 力估計侯選值步驟S35等步驟。其中該選擇題目步 、模擬受試者作答步驟S32、計算能力估計標準 S34 (S34l)及計算能力估計侯選值步驟奶之執 ,係與前述該選擇題目步驟S11、模擬受試者作答二 S12、計算能力估計標準差步驟训及計算能力估: 值步驟S15等步驟相同,不再贅述。 夭遷 另外,該推論能力值步雜S33主要係以「調適 ❹ ❹ 路模糊推論纽(AKFIS)」抑钱魏力修正 ,主要係將能力修正量加上該學f能力模型建置步驟 所求得之推論能力值作為本找適_驗能力評估 S3的能力減值,當能力估計值之鮮差顿設定 檻值S341即可結束該推論能力值步驟S33。藉此,可 照試題訊息函數從題庫巾選出訊息量最大的題目給予 者施測’試題訊息函數可使钱者較触的試題訊^ 和獲得最大值,而試題訊息量總和與能力估計的標準^ 方成倒數_,因此可使钱者能力估計的標準 小,產生最準確的能力估計值。 平王取 為證明本發明數位學f領域中用以量測受試 之方法,其確實具有較佳的能力評估準確度以下^以 Matlab建置「調適性網路模掏推論系統(娜is之 变’且設疋受試者為1〇〇人,並以常態分布來模擬受試 者真實能力作答。假設受試者能力值為(_3〜+ 3)之門 :===二=變作為棋擬試題反應“ 對於母-人的測驗需完成題庫5〇題或能力評估標準差 —15 ~ 201013549 小於門播0.2為止。請參照第5圖所示,係為所選50題 的試題訊息函數總和曲線,在能力值〇時,其試題訊息 量值約11.5 ’在能力值區間,其預估能力會較 ' 準確。 又’模擬受試者人數100人,並統計其平均誤差, 請依序參照第6、7、8、9、10、11及12圖所示,係分別 以本發明所架構之ANFIS比較傳統最大近似值估計( ❹ Maximum Likelihood Estimation, MLE )及貝氏估計(Likelihood Estimation, MLE) performs measurements of initial ability. Please refer to FIG. 4' The fitness test capability evaluation step S3 includes a selection question step S31, a simulated subject answer step yang, an inference ability value step S%, a calculation capability estimation standard deviation step S34, and a calculation Can 201013549 force estimate candidate value step S35 and other steps. The selection topic step, the simulation subject answering step S32, the calculation ability estimation standard S34 (S34l), and the calculation capability estimation candidate value step milk execution, are the same as the foregoing selection topic step S11, and the simulated subject answer 2 S12 The calculation capability estimation standard deviation step training and calculation capability estimation: the value steps S15 and the like are the same, and will not be described again. In addition, the inference ability value step S33 is mainly based on the “Adjustment ❹ 模糊 模糊 推 推 推 ( AK AK AK AK AK AK AK AK 魏 魏 魏 魏 魏 , , , , , , , , , , 魏 魏 魏 魏 魏 魏 魏 魏 魏 魏 魏 魏 魏 魏 魏 魏The inferred ability value is used as the ability impairment of the present evaluation ability evaluation S3, and the inference capacity value step S33 is ended when the capacity estimation value is set to the threshold value S341. In this way, the test message function can be selected from the question bank to select the topic with the largest amount of information to give the tester's test message function to make the tester's test message and the maximum value, and the sum of the test message amount and the ability estimation standard. ^ Fang Cheng countdown _, so the standard of the ability of the money can be estimated to produce the most accurate estimate of the ability. Ping Wang takes the method of measuring the test in the field of digital field of the present invention, which does have better ability to evaluate the accuracy below. The establishment of the adaptive network model inference system by Matlab (the change of Nais 'And the subject is 1 person, and the normal distribution is used to simulate the true ability of the subject. Assuming the subject's ability value is (_3 ~ + 3): === two = change as chess The proposed test response "For the mother-to-person test, complete the question bank 5 questions or the ability evaluation standard deviation - 15 ~ 201013549 is less than the door broadcast 0.2. Please refer to Figure 5, which is the sum of the test questions function of the selected 50 questions. Curve, when the ability value is ,, the amount of test information is about 11.5' in the ability value range, its predictive ability will be 'accurate. 'The number of simulated subjects is 100, and the average error is counted. Please refer to it in order. Figures 6, 7, 8, 9, 10, 11 and 12 show the comparison of the Maximum Likelihood Estimation (MLE) and the Bayesian Estimate with the ANFIS of the present invention.
Bayesian Likelihood Estimation,BLE)。其中 ANFIS0 1 在 受試者能力(一1〜+ 1)間,其誤差小於MLE、BLE和 ANFIS0.5 ; MLE在一開始時誤差會比BLE差,原因是 BLE有趨向先前分配的趨勢,根據常態分布,受試者能 力值為0時,也就是中等能力的考生是分布比率最高的 ,因此BLE會比MLE效果好一些’在受試者能力值為 1或一1時’此時試題訊息量比受試者能力值為〇時少, 〇 .所以ANIFS需做一些題數後,評估效果才會比MLE好 ;BLE評估能力則稍差;另外,在受試者能力值為2或 時,由於試題訊息量較少,ANIFS需做更多的題數 ,評估效果才會比MLE和BLE好;ANIFS0.5或—3時 ,ANFIS評估誤差值更大,且呈現不穩定的情況,需做 • 更多的題數,才能漸趨穩定。 因此’使用「調適性網路模糊推論系统(ANpis) 做模擬受試者能力評估,在試題訊息量大時可提供不錯^ 能力推測效果,且觀察前述模擬實驗結果,若是試題^_ 201013549 量大時,隨著流程的增加,標準差會漸漸下降,最佳能力 估計候選值的範圍會漸漸縮小,流程所收集到的資料就能 涵蓋更多可能性的資料,因此能力評估的效果會較穩定準 . 確。 如上所述,本發明主要係藉由調適性網路模糊推論 系統(ANFIS )為基礎’並以試題反應理論(Item Response Theory,IRT)為依據,而可適性化選擇試題以有 ❹ 效改善評估成效,且利用電腦化適性測驗中試題的鑑別度 、困難度、猜測度及受試者作答試題前的能力作為參數, 再以調適性網路模糊推論系統(ANFIS)推論受試者能力 的修正量,評估受試者作答後的能力。更重要的是,可模 擬受試者以收集三種不同評估情形下的三組訓練資料,透 過各種不同的模糊規則組合情形來推論能力,以提升評估 能力的準確度。 雖然本發明已利用上述較佳實施例揭示,然其並非 〇 用以限定本發明,任何熟習此技藝者在不脫離本發明之精 神和範圍之内,相對上述實施例進行各種更動與修改仍屬 本發明所保護之技術範疇,因此本發明之保護範圍當視後 , 附之申請專利範圍所界定者為準。 —17 — 201013549 【圖式簡單說明】 第1圖:習知「調適性網路模糊推論系統(ANFIS)」 架構示意圖。 第2圖:本發明利用調適性網路模糊推論系統評估電 腦化適性測驗受試者能力之方法的步驟流程圖。 第3圖:本發明利用調適性網路模糊推論系統評估電 腦化適性測驗受試者能力之方法之步驟S1的詳細流程示 意圖。 第4圖:本發明利用調適性網路模糊推論系統評估電 腦化適性測驗受試者能力之方法之步驟S3的詳細流程示 意圖。 第5圖:本發明利用調適性網路模糊推論系統評估電 腦化適性測驗受試者能力之方法所做模擬實驗的試題訊息 函數總和曲線示意圖。 第6〜12圖:本發明利用調適性網路模糊推論系統評 估電腦化適性測驗受試者能力之方法所做模擬實驗的結果 示意圖。 【主要元件符號說明】 (無) —18 —Bayesian Likelihood Estimation, BLE). Among them, ANFIS0 1 is less than MLE, BLE and ANFIS0.5 in the subject's ability (1~+1); MLE will be worse than BLE in the beginning, because BLE tends to be assigned to the previous trend, according to Normal distribution, when the subject's ability value is 0, that is, the middle-capacity candidate has the highest distribution ratio, so BLE will be better than MLE's when the subject's ability value is 1 or 1'. The amount is less than the subject's ability value, 〇. So ANIFS needs to do some number of questions, the evaluation effect will be better than MLE; BLE evaluation ability is slightly worse; in addition, when the subject's ability value is 2 or Because the amount of test questions is small, ANIFS needs to do more questions, and the evaluation effect is better than MLE and BLE. When ANIFS0.5 or ——3, ANFIS evaluates the error value more and it is unstable. Do • More questions to stabilize. Therefore, 'using the adaptive network fuzzy inference system (ANpis) to do the simulation of the subject's ability assessment, can provide a good ^ ability speculation effect when the amount of questions is large, and observe the results of the above simulation experiment, if the test questions ^_ 201013549 large amount As the process increases, the standard deviation will gradually decrease, and the range of best ability estimation candidate values will gradually shrink. The data collected by the process will cover more possibilities, so the effectiveness evaluation will be more stable. As described above, the present invention is mainly based on the adaptive network fuzzy inference system (ANFIS) and is based on the Item Response Theory (IRT), and the test questions can be appropriately selected. ❹ Effectiveness to improve the effectiveness of the assessment, and use the computerized fitness test to identify the degree of difficulty, difficulty, guess and the ability of the subject to answer questions before using the adaptive network fuzzy inference system (ANFIS) to infer the test The ability to correct the ability to assess the subject's ability to answer. More importantly, the subject can be simulated to collect three different assessment scenarios. The three sets of training materials are inferred by various combinations of fuzzy rules to improve the accuracy of the evaluation ability. Although the present invention has been disclosed by the above preferred embodiments, it is not intended to limit the present invention, Various changes and modifications to the above-described embodiments are still within the technical scope of the present invention, and the scope of protection of the present invention is defined by the scope of the patent application, without departing from the spirit and scope of the present invention. —17 — 201013549 [Simple description of the diagram] Figure 1: Schematic diagram of the structure of the Adaptive Network Fuzzy Inference System (ANFIS). Figure 2 is a flow chart showing the steps of the method for assessing the competence of a computerized fitness test subject using an adaptive network fuzzy inference system. Figure 3: Detailed flow diagram of step S1 of the method of the present invention for assessing the ability of a computerized fitness test subject using an adaptive network fuzzy inference system. Figure 4: Detailed flow diagram of step S3 of the method of the present invention for assessing the ability of a computerized fitness test subject using an adaptive network fuzzy inference system. Figure 5: Schematic diagram of the summation of the function of the test message of the simulation experiment using the adaptive network fuzzy inference system to evaluate the competence of the computerized fitness test. Figures 6 to 12: Schematic diagram of the results of a simulation experiment conducted by the present invention using an adaptive network fuzzy inference system to evaluate the ability of a computerized fitness test subject. [Main component symbol description] (none) — 18 —
Claims (1)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
TW97136334A TW201013549A (en) | 2008-09-22 | 2008-09-22 | Method of estimation of the examinee's ability on the computerized adaptive testing using adaptive network-based fuzzy inference system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
TW97136334A TW201013549A (en) | 2008-09-22 | 2008-09-22 | Method of estimation of the examinee's ability on the computerized adaptive testing using adaptive network-based fuzzy inference system |
Publications (1)
Publication Number | Publication Date |
---|---|
TW201013549A true TW201013549A (en) | 2010-04-01 |
Family
ID=44829392
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
TW97136334A TW201013549A (en) | 2008-09-22 | 2008-09-22 | Method of estimation of the examinee's ability on the computerized adaptive testing using adaptive network-based fuzzy inference system |
Country Status (1)
Country | Link |
---|---|
TW (1) | TW201013549A (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
TWI426461B (en) * | 2010-06-08 | 2014-02-11 | Chi Mei Medical Ct | Satisfaction Survey Method and System |
CN103942993A (en) * | 2014-03-17 | 2014-07-23 | 深圳市承儒科技有限公司 | Self-adaptive online assessment system and method based on IRT |
TWI804953B (en) * | 2020-09-08 | 2023-06-11 | 日商斯庫林集團股份有限公司 | Training data creation assistance apparatus, training data creation assistance system and training data creation assistance method |
-
2008
- 2008-09-22 TW TW97136334A patent/TW201013549A/en unknown
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
TWI426461B (en) * | 2010-06-08 | 2014-02-11 | Chi Mei Medical Ct | Satisfaction Survey Method and System |
CN103942993A (en) * | 2014-03-17 | 2014-07-23 | 深圳市承儒科技有限公司 | Self-adaptive online assessment system and method based on IRT |
TWI804953B (en) * | 2020-09-08 | 2023-06-11 | 日商斯庫林集團股份有限公司 | Training data creation assistance apparatus, training data creation assistance system and training data creation assistance method |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Lesko et al. | Discounting the difficult: How high math-identified women respond to stereotype threat | |
Friedman | Self-efficacy and burnout in teaching: The importance of interpersonal-relations efficacy | |
McKellar et al. | Threats and supports to female students’ math beliefs and achievement | |
Higham et al. | Internal mapping and its impact on measures of absolute and relative metacognitive accuracy | |
Hirose | Meticulous Learning Follow-up Systems for Undergraduate Students Using the Online Item Response Theory | |
Zhang et al. | Children's shyness, peer acceptance, and academic achievement in the early school years | |
Song | Uncovering the structural underpinnings of political discussion networks: Evidence from an exponential random graph model | |
Ashenafi et al. | Predicting Student Progress from Peer-Assessment Data. | |
CN114098730A (en) | Cognitive ability testing and training method, device, equipment and medium based on cognitive map | |
Botelho et al. | The prediction of student first response using prerequisite skills | |
Hassan et al. | Use of item analysis to improve the quality of single best answer multiple choice question in summative assessment of undergraduate medical students in Malaysia. | |
TW201013549A (en) | Method of estimation of the examinee's ability on the computerized adaptive testing using adaptive network-based fuzzy inference system | |
Bugden et al. | Identifying children with persistent developmental dyscalculia from a 2‐min test of symbolic and nonsymbolic numerical magnitude processing | |
Gözümi et al. | Use of Smartphones in Class: Examining the Relationship between M-Learning Readiness, Cyberloafing, Nomophobia and Addiction Variables. | |
Preston et al. | Detecting faulty within-item category functioning with the nominal response model | |
Hsiung | Identification of dysfunctional cooperative learning teams based on students' academic achievement | |
Marsh et al. | The sweet spot of clinical intuitions: Predictors of the effects of context on impressions of conduct disorder symptoms. | |
Chang et al. | Development and application of detection indices for measuring guessing behaviors and test-taking effort in computerized adaptive testing | |
Grabiszewski et al. | Effort is not a monotonic function of skills: Results from a global mobile experiment | |
TW201428666A (en) | Method for evaluating learning outcomes of individual concept and computer readable media thereof | |
Khan et al. | An approach for identifying affective states through behavioral patterns in web-based learning management systems | |
Serrano et al. | Plausibility validation of a decision making model using subjects’ explanations of decisions | |
Henriques et al. | A non-exercise based V02max prediction using FRIEND dataset with a neural network | |
Frey et al. | Scaffolding reflection: prompting social constructive metacognitive activity in non-formal learning | |
CN110046147A (en) | It is applicable in user's learning ability value-acquiring method and its application of Adaptable System |