JP6850218B2

JP6850218B2 - Capability estimation device, method and program

Info

Publication number: JP6850218B2
Application number: JP2017138791A
Authority: JP
Inventors: 哲生小林; 南　泰浩; 泰浩南; 佑亮森山
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2017-07-18
Filing date: 2017-07-18
Publication date: 2021-03-31
Anticipated expiration: 2037-07-18
Also published as: JP2019021015A

Description

この発明は、幼児の語彙能力を推定するために用いられる技術に関する。 The present invention relates to techniques used to estimate infant vocabulary.

従来、幼児の語彙の発達状態を調べるために、語彙チェックリストによる親の回答に基づくアンケート調査が利用されることが多かった。これらのチェックリストは、幼児の発達状況を調べるのに有効である（例えば、非特許文献１から３参照。）。 In the past, questionnaire surveys based on parental answers from vocabulary checklists were often used to examine the developmental status of infants' vocabulary. These checklists are useful for investigating the developmental status of infants (see, for example, Non-Patent Documents 1 to 3).

小椋たみ子、綿巻徹、「日本のこどもの語彙発達の基準研究：日本語マッカーサー乳幼児言語発達質問紙から」、京都国際社会福祉センター紀要、発達・療育研究、2008.11、第24号、P.3-42Tamiko Ogura, Toru Watamaki, "Standard Study of Japanese Children's Vocabulary Development: From Japanese MacArthur Infant Language Development Questionnaire", Kyoto International Social Welfare Center Bulletin, Development and Rehabilitation Research, 2008.11, No. 24, P.3- 42 小林哲生、奥村優子、南泰浩、「語彙チェックリストアプリによる幼児語彙発達データ収集の試み」、電子情報通信学会技術研究報告、HCS2015-59、2016Tetsuo Kobayashi, Yuko Okumura, Yasuhiro Minami, "Trial of Collecting Infant Vocabulary Development Data Using Vocabulary Checklist App", IEICE Technical Report, HCS2015-59, 2016 稲葉太一、小椋たみ子、綿巻徹、前田美穂、「言語習得の発達過程におけるパーンセンタイル値の推定方法」、日本行動計量学会第31回大会発表論文抄録集、P.46-49、2003Taichi Inaba, Tamiko Ogura, Toru Watamaki, Miho Maeda, "Estimation Method of Pancentric Value in the Development Process of Language Acquisition", Abstracts of the 31st Annual Meeting of the Japan Society for Behavioral Metrics, P.46-49, 2003

従来の語彙チェックリストによる能力測定方法では、％タイルという指標で同年代の幼児と比べてどれぐらい外れているのかを調べていた。この方法では、平均からずれているという総体的な評価はできるが、その幼児の語彙の能力が絶対的にどの程度であるかということを判定できなかった。 In the conventional method of measuring ability using a vocabulary checklist, the percentage tile index was used to find out how far it was compared to infants of the same age. Although this method could give an overall assessment of deviation from the average, it could not determine the absolute level of the infant's vocabulary.

また、従来の語彙チェックリストによる能力測定方法では、語彙チェックリストの語が多かった。例えば、非特許文献１で述べた語彙チェックリストには700語程度の語が含まれており、２才後半の幼児では、母親が語彙チェックリストによるアンケートに答えるために、１時間程度の時間を要していた。 In addition, in the conventional ability measurement method using the vocabulary checklist, there were many words in the vocabulary checklist. For example, the vocabulary checklist described in Non-Patent Document 1 contains about 700 words, and in an infant in the latter half of 2 years old, it takes about 1 hour for the mother to answer the questionnaire by the vocabulary checklist. I needed it.

この発明の目的は、幼児の語彙の能力が絶対的にどの程度であるかを推定する能力推定装置、方法及びプログラムを提供することである。 The purpose of this invention is the ability estimation apparatus infant's ability vocabulary to estimate whether a degree which absolutely is to provide METHODS and programs.

この発明の一態様による能力推定装置は、複数の語のそれぞれを各齢xの幼児が獲得している確率についての情報が記憶されている記憶部と、能力推定の対象となる対象幼児が複数の語のそれぞれを獲得しているか否かについての情報と、記憶部から読み込んだ複数の語のそれぞれを各齢xの幼児が獲得している確率についての情報とに基づいて、対象幼児がどの位の齢の幼児と同程度の語彙能力を有するかを示す指標である能力齢を求める能力齢計算部と、を備えている。 The ability estimation device according to one aspect of the present invention has a storage unit that stores information about the probability that an infant of each age x has acquired each of a plurality of words, and a plurality of target infants for ability estimation. Which of the target infants is based on information about whether or not each of the words has been acquired and information about the probability that each of the multiple words read from the memory has been acquired by the infant of each age x. It is equipped with an ability age calculation unit that obtains the ability age, which is an index indicating whether or not the child has the same vocabulary ability as an infant of the same age.

幼児の語彙の能力が絶対的にどの程度であるかを推定することができる。 It is possible to estimate the absolute level of the infant's vocabulary .

能力推定装置の例を説明するためのブロック図。A block diagram for explaining an example of a capacity estimation device. 能力推定方法の例を説明するための流れ図。A flow chart for explaining an example of the ability estimation method. ワンワンという語の理解曲線及び発話曲線の例を示す図。The figure which shows the example of the understanding curve and the utterance curve of the word one-one. 実験例の結果を説明するための図。The figure for demonstrating the result of the experimental example. 実験例の結果を説明するための図。The figure for demonstrating the result of the experimental example.

以下、図面を参照してこの発明の一実施形態について説明する。 Hereinafter, an embodiment of the present invention will be described with reference to the drawings.

能力推定装置は、図１に示すように、記憶部１、語選択部２、能力齢計算部３、能力齢補正部４及びクラスタリング部５を例えば備えている。語選択部２は、語選択装置でもある。 As shown in FIG. 1, the ability estimation device includes, for example, a storage unit 1, a word selection unit 2, an ability age calculation unit 3, an ability age correction unit 4, and a clustering unit 5. The word selection unit 2 is also a word selection device.

能力推定方法は、文書検索装置の各部が、図２や後に説明するステップＳ２からステップＳ４の処理を実行することにより例えば実現される。語選択方法は、語選択部２が、ステップＳ２の処理を実行することにより例えば実現される。 The ability estimation method is realized, for example, by each part of the document retrieval device executing the processes of steps S2 to S4 described in FIG. 2 and later. The word selection method is realized, for example, by the word selection unit 2 executing the process of step S2.

＜記憶部１＞
記憶部１には、複数の語と、複数の語のそれぞれを各齢xの幼児が獲得している確率についての情報が記憶されている。具体的には、複数の語のそれぞれを識別するためのIDと紐付けられて、そのそれぞれの語を各齢xの幼児が獲得している確率についての情報が記憶されている。 <Memory unit 1>
The storage unit 1 stores information about a plurality of words and the probability that an infant of each age x has acquired each of the plurality of words. Specifically, it is associated with an ID for identifying each of a plurality of words, and information about the probability that an infant of each age x has acquired each word is stored.

ここで、獲得とは、語を理解できるようになること、又は、語を発話できるようになることを意味する。 Here, acquisition means being able to understand a word or being able to speak a word.

各齢xの平均的な幼児が獲得している確率についての情報の例は、以下に説明する語の理解曲線に関する情報及び語の発話曲線に関する情報の少なくとも一方である。 An example of information about the probability that an average infant of each age x has acquired is at least one of the information about the word comprehension curve and the information about the word utterance curve described below.

ある語の理解曲線は、幼児の齢(例えば月齢)に対しそのある語を理解している幼児の割合を表す曲線である。理解曲線は、養育者が語彙チェックリスト調査で回答したものである。理解曲線は、調査時点で、幼児が該当語を（発話できなくても）理解できるかどうかを判断して答えられたものを月齢ごとに集計して、例えば図３に示すように、横軸に幼児の月齢、縦軸に該当語を理解できる幼児の割合をとり、△としてプロットし、それらをロジスティック曲線で近似したものである。図３は、「ワンワン」という語の理解曲線及び発話曲線の例を示す図である。 The comprehension curve of a word is a curve that represents the ratio of the infant who understands the word to the age of the infant (for example, the age of the moon). The comprehension curve was answered by the caregiver in a vocabulary checklist survey. The comprehension curve is calculated by determining whether the infant can understand the relevant word (even if he / she cannot speak) at the time of the survey, and totaling the answers by age, for example, as shown in FIG. 3, the horizontal axis. The age of the infant and the percentage of infants who can understand the corresponding word are taken on the vertical axis, plotted as Δ, and approximated by a logistic curve. FIG. 3 is a diagram showing an example of an understanding curve and an utterance curve of the word “one-one”.

ある語の発話曲線は、幼児の月齢に対しそのある語を発話できる幼児の割合を表す曲線である。発話曲線も、養育者が語彙チェックリスト調査で回答したものである。発話曲線は、調査時点で、幼児が該当語を発話できるかどうかを判断して答えられたものを月齢ごとに集計して、例えば図３に示すように、横軸に幼児の月齢、縦軸に該当語を発話できる幼児の割合をとり、○としてプロットし、それらをロジスティック曲線で近似したものである。 The utterance curve of a word is a curve representing the ratio of infants who can speak the word to the age of the infant. The utterance curve was also answered by the caregiver in the vocabulary checklist survey. As for the utterance curve, at the time of the survey, it is judged whether or not the infant can speak the corresponding word, and the answers are aggregated for each age. For example, as shown in FIG. 3, the horizontal axis is the infant's age and the vertical axis is the vertical axis. The percentage of infants who can speak the relevant word is taken, plotted as ○, and approximated by a logistic curve.

なお、ロジスティック曲線による近似には、最小二乗法、尤度最大化手法を例えば用いることができる。この場合、非線形な関数の最適化が必要になるので、最急降下法等の方法を用いて最適なパラメータを求める。何れの方法も周知であるため、ここでは詳しい説明は省略する。 For the approximation by the logistic curve, for example, the least squares method and the likelihood maximization method can be used. In this case, it is necessary to optimize the non-linear function, so the optimum parameters are obtained by using a method such as the steepest descent method. Since both methods are well known, detailed description thereof will be omitted here.

ロジスティック関数は、以下の式（１）で表される。式（１）において、xは月齢を表し、f_j(x)は語jの獲得割合を表し、eはネイピア数を表す。a_j,b_j,c_jは、ロジスティック曲線の形状を定めるパラメータである。具体的には、a_jは獲得割合の上限を決めるパラメータであり、b_jはロジスティック曲線のオフセット量を表すパラメータであり、c_jは語jの累積獲得割合の傾きを表すパラメータである。 The logistic function is represented by the following equation (1). In equation (1), x represents the age of the moon, f _j (x) represents the acquisition rate of the word j, and e represents the number of Napiers. a _j , b _j , and c _j are parameters that determine the shape of the logistic curve. Specifically, a _j is a parameter that determines the upper limit of the acquisition rate, b _j is a parameter that represents the offset amount of the logistic curve, and c _j is a parameter that represents the slope of the cumulative acquisition rate of the word j.

a_j,b_j,c_jが定まればロジスティック曲線の形状が定まる。このため、例えば、ある語jの理解曲線を表すロジスティック曲線のパラメータa_j,b_j,c_jが、その語jの理解曲線に関する情報となる。また、例えば、ある語jの発話曲線を表すロジスティック曲線のパラメータa_j,b_j,c_jが、その語jの発話曲線に関する情報となる。 Once a _j , b _j , and c _j are determined, the shape of the logistic curve is determined. Therefore, for example, the parameters a _j , b _j , and c _{j of the} logistic curve representing the comprehension curve of a word j are information about the comprehension curve of the word j. Further, for example, the parameters a _j , b _j , and c _{j of the} logistic curve representing the utterance curve of a word j are information regarding the utterance curve of the word j.

各齢xの平均的な幼児が獲得している確率についての情報の他の例は、Xを0≦X≦100の所定の数として、幼児のX％がその語を獲得するX％獲得齢である。Xは例えば50である。このX％獲得齢が記憶部１に記憶されていてもよい。 Another example of information about the probability that an average infant of each age x is acquiring is the X% age at which X% of the infants acquire the word, where X is a given number of 0 ≤ X ≤ 100. Is. X is, for example, 50. This X% acquisition age may be stored in the storage unit 1.

ある語のX％獲得齢は、幼児のX％がその語を理解する齢である。この場合、ある語のX％獲得齢は、その語の理解曲線から求めることができる。すなわち、その語jの理解曲線のパラメータa_j,b_j,c_jを所与として、x=(a_je^(cjx+bj))/(1+e^(cjx+bj))となるxを求める。求まったxが、その語jのX％獲得齢となる。 The X% acquisition age of a word is the age at which X% of the infant understands the word. In this case, the X% age of a word can be determined from the comprehension curve of the word. That is, the parameter a _j of understanding curve of the word j, b _j, given the c _j, x = a _{^{(a j e (cjx + bj}} )) / (1 + e (cjx + bj)) become x Ask. The obtained x is the X% age of the word j.

また、ある語のX％獲得齢は、幼児のX％がその語を発話する月齢であってもよい。この場合、ある語のX％獲得齢は、その語の発話曲線から求めることができる。すなわち、その語の発話曲線のパラメータa_j,b_j,c_jを所与として、x=(a_je^(cjx+bj))/(1+e^(cjx+bj))となるxを求める。求まったxが、その語のX％獲得齢となる。 Also, the X% acquisition age of a word may be the age at which X% of the infant speaks the word. In this case, the X% age of a word can be determined from the speech curve of that word. _{That is, given the parameters a j} , b _j , and c _j of the utterance curve of the word, find x such that x = (a _j e ^{(cjx + bj)} ) / (1 + e ^{(cjx + bj)} ). .. The obtained x is the X% age of the word.

このように、語のX％獲得齢は、その語の理解曲線又は発話曲線から求めることができるため、語のX％獲得齢に代えて、語の理解曲線に関する情報及び語の発話曲線に関する情報の少なくとも一方が記憶部１に記憶されていてもよい。この場合、後述する語選択部２は、記憶部１を参照して、語の理解曲線に関する情報及び語の発話曲線に関する情報の少なくとも一方から語のX％獲得齢を求め、求まった語のX％獲得齢を、記憶部１から読み込んだ語のX％獲得齢として後述する処理を行う。 In this way, the X% acquisition age of a word can be obtained from the understanding curve or utterance curve of the word. Therefore, instead of the X% acquisition age of the word, information on the understanding curve of the word and information on the utterance curve of the word. At least one of the above may be stored in the storage unit 1. In this case, the word selection unit 2 described later obtains the X% acquisition age of the word from at least one of the information on the word comprehension curve and the information on the word utterance curve with reference to the memory unit 1, and X of the obtained word. The% acquisition age is treated as the X% acquisition age of the word read from the storage unit 1, which will be described later.

もちろん、語の理解曲線に関する情報及び語の発話曲線に関する情報の少なくとも一方に加えて、語のX％獲得齢が記憶部１に記憶されていてもよい。これにより、語のX％獲得齢を求める計算コストを削減することができる。 Of course, in addition to at least one of the information regarding the word comprehension curve and the information regarding the word utterance curve, the X% acquisition age of the word may be stored in the storage unit 1. This can reduce the computational cost of finding the X% age of a word.

上記の説明では、X％獲得齢は、月齢を単位として表されるとしているが、X％獲得齢は、月齢以外の、週齢、日齢等の所定の時間の単位により表されてもよい。 In the above explanation, the X% acquisition age is expressed in units of the moon age, but the X% acquisition age may be expressed in a predetermined time unit such as week age, day age, etc. other than the moon age. ..

記憶部１には、複数の語と対応する各齢xの幼児が獲得している確率についての情報とがX％獲得齢が小さい順に並んで記憶されていてもよい。 The storage unit 1 may store a plurality of words and information about the probability acquired by an infant of each age x corresponding to each other in ascending order of X% acquired age.

＜語選択部２＞
語選択部２には、記憶部１に記憶されている語が入力される。 <Word selection unit 2>
Words stored in the storage unit 1 are input to the word selection unit 2.

語選択部２は、式(1)を確率分布とするフィッシャー情報量を求め、その情報量が大きくかつ幅広くなるようにT個の語を選択することにより簡易語彙チェックリストを作成する。語選択部２からは、T個の語、又は、T個の語により構成される簡易語彙チェックリストが出力される。 The word selection unit 2 obtains the Fisher information amount using the equation (1) as the probability distribution, and creates a simple vocabulary checklist by selecting T words so that the information amount is large and wide. The word selection unit 2 outputs T words or a simple vocabulary checklist composed of T words.

すなわち、語選択部２は、例えば、式（１）を確率分布とする、式（０）に示されるフィッシャー情報量Fが大きくかつ幅広くなるように、記憶部１に記憶されている語を選択する。言い換えれば、語選択部２は、所定の区間で万遍なく語を選択する。ここで、所定の区間とは、語選択の対象となる区間である。 That is, the word selection unit 2 selects words stored in the storage unit 1 so that the Fisher information amount F shown in the equation (0) is large and wide, for example, with the equation (1) as the probability distribution. To do. In other words, the word selection unit 2 selects words evenly in a predetermined section. Here, the predetermined section is a section to be selected as a word.

そのために、語選択部２は、例えば、選択される語に対応するX％獲得齢が予め定められた複数の齢区間のそれぞれに少なくとも１個含まれるように、記憶部１に記憶されている語を選択する（ステップＳ２）。語選択部２は、例えばT個の語を選択する。Tは、所定の正の整数である。選択されたT個の語から構成されるチェックリストを、簡易語彙チェックリストとも呼ぶ。ここで、T≦Mである。例えば、Tは、Mの半分以上の値とする。 Therefore, the word selection unit 2 is stored in the storage unit 1, for example, so that at least one X% acquisition age corresponding to the selected word is included in each of a plurality of predetermined age sections. Select a word (step S2). The word selection unit 2 selects, for example, T words. T is a given positive integer. A checklist consisting of T selected words is also called a simple vocabulary checklist. Here, T ≦ M. For example, T is a value that is at least half of M.

予め定められた複数の齢区間の例は、生後g日からh日までの期間を所定の区間としたとき、生後g日からh日までの日数をM分割した区間である。Mは所定の２以上の整数である。予め定められた複数の齢区間の例は、(1)12ヶ月以下、(2)13-18ヶ月、(3)19-24ヶ月、(4)25-30ヶ月、(5)31-36ヶ月、(6)37-48ヶ月という６個の区間である。この場合、M=6となる。 An example of a plurality of predetermined age sections is a section in which the number of days from g days to h days after birth is divided into M, assuming that the period from g days to h days after birth is a predetermined section. M is a predetermined integer of 2 or more. Examples of multiple pre-determined age sections are (1) 12 months or less, (2) 13-18 months, (3) 19-24 months, (4) 25-30 months, (5) 31-36 months. , (6) There are 6 sections of 37-48 months. In this case, M = 6.

予め定められた複数の齢区間の他の例は、これらのM分割した区間から少なくとも１個の区間を除いた区間である。これらのM分割した区間の中に、X％獲得齢が含まれる語が記憶部１に記憶されていない区間がある場合には、そのX％獲得齢が含まれる語が記憶部１に記憶されていない区間を、これらのM分割した区間から除いてもよい。 Another example of a plurality of predetermined age sections is a section obtained by removing at least one section from these M-divided sections. If there is a section in these M-divided sections in which the word containing the X% acquired age is not stored in the storage unit 1, the word containing the X% acquired age is stored in the storage unit 1. Sections that are not included may be excluded from these M-divided sections.

gとhは、対象とする幼児に応じて適宜設定される正の整数である。例えば、幼児全体を能力推定の対象とし、所定の区間を幼児期全体とする場合は、gを400程度、hを1500程度に設定する。 g and h are positive integers that are appropriately set according to the target infant. For example, if the entire infant is the target of ability estimation and the predetermined section is the entire childhood, g is set to about 400 and h is set to about 1500.

なお、２才児近辺の幼児を対象とする場合は、gを600程度、hを800程度に設定する。このように、対象となる幼児の齢が明確である場合には、その齢の近傍の区間を所定の区間とみなして、その齢の近傍の区間をM分割してもよい。 When targeting infants near the age of two, set g to about 600 and h to about 800. As described above, when the age of the target infant is clear, the section in the vicinity of the age may be regarded as a predetermined section, and the section in the vicinity of the age may be divided into M.

語選択部２は、例えば、各齢区間で％語獲得齢がその齢区間の中央値に近い語を１つ選択する。 The word selection unit 2 selects, for example, one word whose% word acquisition age is close to the median of the age section in each age section.

語選択部２は、所定の区間で万遍なく語を選択することができれば、上記のM分割の方法以外の方法により語を選択してもよい。 The word selection unit 2 may select a word by a method other than the above-mentioned M division method as long as the word can be selected evenly in a predetermined section.

語選択部２による語の選択により生成された簡易語彙チェックリストは、養育者のアンケートに用いられる。すなわち、養育者は、簡易語彙チェックリストに掲載された各語jについて、能力推定の対象となる対象幼児が獲得しているかどうかを判断し、その判断結果を得る。具体的には、j=1,…,Tとして、対象幼児が語jを獲得しているか否かについての情報をt_jとし、対象幼児が語jを獲得している場合にはt_j=1とし、上記対象幼児が語jを獲得していない場合にはt_j=0とする。 The simple vocabulary checklist generated by word selection by the word selection unit 2 is used for the caregiver questionnaire. That is, the caregiver determines whether or not the target infant to be the target of ability estimation has acquired each word j listed in the simple vocabulary checklist, and obtains the determination result. Specifically, let j = 1,…, T, _{and let t j} be the information about whether or not the target infant has acquired the word j, and if the target infant has acquired the word j, t _j = If the target infant has not acquired the word j, set t _j = 0.

この判断結果が、能力齢計算部３に入力される。 This determination result is input to the ability age calculation unit 3.

このように、語選択部２が、記憶部１に記憶されている語から、T個の語を選択して出力する、または、選択したT個の語による簡易語彙チェックリストを作成することで、能力齢の推定の対象となる対象幼児の養育者がアンケートにおいて答える必要がある語の数を減らすことができる。これにより、対象幼児の養育者のアンケートの回答時間を減らすことができる。 In this way, the word selection unit 2 selects and outputs T words from the words stored in the storage unit 1, or creates a simple vocabulary checklist using the selected T words. It is possible to reduce the number of words that the caregiver of the target infant to be estimated for the ability age needs to answer in the questionnaire. As a result, it is possible to reduce the time required to respond to the questionnaire of the caregiver of the target infant.

＜能力齢計算部３＞
能力齢計算部３には、上記の判断結果、言い換えれば、能力推定の対象となる対象幼児が複数の語のそれぞれを獲得しているか否かについての情報が入力される。 <Ability age calculation unit 3>
The ability age calculation unit 3 is input with the above determination result, in other words, information on whether or not the target infant to be the target of the ability estimation has acquired each of the plurality of words.

能力齢計算部３は、入力された、対象幼児が複数の語のそれぞれを獲得しているか否かについての情報と、記憶部１から読み込んだ複数の語のそれぞれを各齢xの幼児が獲得している確率についての情報とに基づいて、対象幼児がどの位の齢の幼児と同程度の語彙能力を有するかを示す指標である能力齢を求める（ステップＳ３）。求まった能力齢は、能力齢補正部４に出力される。能力齢は、月齢、週齢、日齢等の所定の時間の単位により表される。 In the ability age calculation unit 3, the input information about whether or not the target infant has acquired each of the plurality of words and each of the plurality of words read from the storage unit 1 are acquired by the infant of each age x. Based on the information about the probability of doing so, the ability age, which is an index showing how old the target infant has the same vocabulary ability as the infant, is obtained (step S3). The obtained ability age is output to the ability age correction unit 4. Ability age is expressed in units of predetermined time such as moon age, week age, and day age.

例えば、能力齢計算部３は、記憶部１から読み込んだ、語jを各齢xの幼児が獲得している確率についての情報をa_j,b_j,c_jとし、eをネイピア数として、以下の式（２）により定義されるx^-を能力齢とする。 For example, the ability age calculation unit 3 sets the information about the probability that the infant of each age x has acquired the word j read from the memory unit 1 as a _j , b _j , c _j, and e as the Napier number. x is defined by the following equation (2) ^- the the ability age.

このように、最小二乗法により能力齢を求めてもよい。より具体的には、能力齢計算部３は、入力された、対象幼児が複数の語のそれぞれを獲得しているか否かについての情報（１，０の情報）と、記憶部１から読み込んだ複数の語のそれぞれを各齢xの幼児が獲得している確率についての情報との二乗距離が最小となるx^-を能力齢としてもよい。 In this way, the age of ability may be determined by the method of least squares. More specifically, the ability age calculation unit 3 reads the input information (1,0 information) about whether or not the target infant has acquired each of the plurality of words and the storage unit 1. may be the ability of age ^- a plurality of words x squared distance is minimum with information about the probability that the infant has acquired for each age x.

式（２）により定義されるx^-を求めるために、能力齢計算部３は、例えば、g(x)が以下の式（２’）のように定義されるとして、g(x)の１階導関数g'(x)=0となるxの値を求める。この求まったxが、x^-となる。 X defined by equation (2) ^- to determine the, as the ability of age calculator 3, for example, g (x) is defined as the following equation (2 '), g of (x) 1 Find the value of x such that the derivative g'(x) = 0. This Motoma' was x is, x ^- become.

また、式（２）により定義されるx^-を求めるために、能力齢計算部３は、ニュートン法等の０となるxの値を逐次的に求める等の最適化手法を用いてもよい。また、能力齢計算部３は、x_min<x<x_maxの区間で所定の微小間隔Δxずつシフトしすることによりg(x)の値を求めることにより、g(x)を最小にするxの値を求めてもよい。x_minはxの所定の最小値であり、x_maxはxの所定の最大値である。これらの求まったxが、x^-となる。 ^{Further, in order to obtain x −} defined by the equation (2), the ability age calculation unit 3 may use an optimization method such as Newton's method for sequentially obtaining the value of x that becomes 0. Further, the ability age calculation unit 3 minimizes g (x) by obtaining the value of g (x) by shifting by a predetermined minute interval Δx in the interval of _{x min} <x <x _max. You may find the value of. x _min is the predetermined minimum value of x and x _max is the predetermined maximum value of x. These Motoma' was x is, x ^- become.

また、能力齢計算部３は、記憶部１から読み込んだ、語jを各齢xの幼児が獲得している確率についての情報をa_j,b_j,c_jとし、eをネイピア数として、以下の式（３）により定義されるx^-を能力齢としてもよい。 Further, the ability age calculation unit 3 sets the information about the probability that the infant of each age x has acquired the word j read from the memory unit 1 as a _j , b _j , c _j, and e as the Napier number. x is defined by the following equation (3) ^- may be used as capability of age.

このように、能力齢計算部３は、式（３’）の確率分布を用いた、最尤推定法により幼児能力を求めてもよい。言い換えれば、能力齢計算部３は、確率的に近い齢を能力齢として求めてもよい。 In this way, the ability age calculation unit 3 may obtain the infant ability by the maximum likelihood estimation method using the probability distribution of the equation (3'). In other words, the ability age calculation unit 3 may obtain an age that is stochastically close as the ability age.

式（３）により定義されるx^-を求めるために、能力齢計算部３は、例えば、h(x)が以下の式（３''）のように定義されるとして、log h(x)の１階導関数(log h(x))'=∂log h(x)/∂x=0となるxの値を求める。この求まったxが、x^-となる。 X defined by equation (3) ^- in order to determine the, ability age calculator 3, for example, as h (x) is defined as the following equation (3 ''), log h (x) Find the value of x such that the first-order derivative of (log h (x))'= ∂log h (x) / ∂x = 0. This Motoma' was x is, x ^- become.

また、式（３）により定義されるx^-を求めるために、能力齢計算部３は、ニュートン法等の０となるxの値を逐次的に求める等の最適化手法を用いてもよい。また、能力齢計算部３は、x_min<x<x_maxの区間で所定の微小間隔Δxずつシフトしすることによりh(x)の値を求めることにより、h(x)を最大にするxの値を求めてもよい。x_minはxの所定の最小値であり、x_maxはxの所定の最大値である。これらの求まったxが、x^-となる。 ^{Further, in order to obtain x −} defined by the equation (3), the ability age calculation unit 3 may use an optimization method such as Newton's method for sequentially obtaining the value of x that becomes 0. Further, the ability age calculation unit 3 maximizes h (x) by obtaining the value of h (x) by shifting by a predetermined minute interval Δx in the interval of _{x min} <x <x _max. You may find the value of. x _min is the predetermined minimum value of x and x _max is the predetermined maximum value of x. These Motoma' was x is, x ^- become.

このように、能力齢を求めることで、幼児の語彙の能力が絶対的にどの程度であるかを推定することができる。 In this way, by determining the ability age, it is possible to estimate the absolute level of the infant's vocabulary ability.

能力齢計算部３は、最小二乗法と最尤推定法のどちらを用いてもよいが、例えば推定月齢には少なくとも１ヵ月の幅があるため、中心から外れるほど２個の方法のそれぞれで求めた値に差が出る可能性がある。この差を抑えるために、式（１）のロジスティック関数のパラメータを求めるときと、能力齢を推定するときの手法を合わせてもよい。すなわち、能力齢計算部３は、式（１）のロジスティック関数のパラメータを求めるときに用いた、最小二乗法又は最尤推定法で、能力齢を推定してもよい。 The ability age calculation unit 3 may use either the least squares method or the maximum likelihood estimation method, but for example, since the estimated age has a range of at least one month, it is calculated by each of the two methods as it deviates from the center. There may be a difference in the values. In order to suppress this difference, the method for obtaining the parameters of the logistic function in Eq. (1) and the method for estimating the age of ability may be combined. That is, the ability age calculation unit 3 may estimate the ability age by the least squares method or the maximum likelihood estimation method used when obtaining the parameters of the logistic function of the equation (1).

＜能力齢補正部４＞
能力齢補正部４には、能力齢計算部３で求まった能力齢が入力される。 <Ability age correction unit 4>
The ability age obtained by the ability age calculation unit 3 is input to the ability age correction unit 4.

能力齢補正部４は、対象幼児が獲得している語の品詞情報に基づいて能力齢計算部３で求まった能力齢を補正する（ステップＳ３）。補正後の能力齢は、能力推定装置及び方法による最終的な能力推定結果として出力される。 The ability age correction unit 4 corrects the ability age obtained by the ability age calculation unit 3 based on the part-speech information of the word acquired by the target infant (step S3). The corrected age of ability is output as the final result of ability estimation by the ability estimation device and method.

能力齢補正部４による能力齢の補正は、幼児の語の獲得タイプによって語彙獲得の速度が異なるという参考文献１で示されている事実を利用したものである。
〔参考文献１〕小林哲生、南泰浩、杉山弘晃、"語彙爆発の新しい視点：日本語学習児の初期語彙発達に関する縦断データ解析"、日本赤ちゃん学会、ベビーサイエンス、12, 34-49, 2013 The correction of the ability age by the ability age correction unit 4 utilizes the fact shown in Reference 1 that the speed of vocabulary acquisition differs depending on the word acquisition type of the infant.
[Reference 1] Tetsuo Kobayashi, Yasuhiro Minami, Hiroaki Sugiyama, "A New Perspective on Vocabulary Explosion: Longitudinal Data Analysis on Early Vocabulary Development in Japanese Learning Children", Japan Society for Baby Science, Baby Science, 12, 34-49, 2013

能力齢補正部４による能力齢の補正により、能力齢を更に正確に求めることができる。 By correcting the ability age by the ability age correction unit 4, the ability age can be obtained more accurately.

能力齢補正部４による補正の方法の<<例１>><<例２>>を以下に説明する。 << Example 1 >> << Example 2 >> of the correction method by the ability age correction unit 4 will be described below.

<<例１>>
例えば、複数の幼児のそれぞれが獲得している語の品詞情報に基づいて複数の幼児をクラスタリングすることにより複数の幼児は少なくとも１個のグループに分けられており、各グループに対応する補正値が予め定められているとする。 << Example 1 >>
For example, by clustering a plurality of infants based on the part-speech information of the word acquired by each of the plurality of infants, the plurality of infants are divided into at least one group, and the correction value corresponding to each group is set. It is assumed that it is predetermined.

複数の幼児のそれぞれが獲得している語の品詞情報は、養育者が補正用簡易語彙チェックリストに掲載された各語jを、幼児が獲得しているかどうかを判断することにより得られる。補正用簡易語彙チェックリストは、語選択部２により作成された簡易語彙チェックリストとは異なる語彙チェックリストである。 The part-of-speech information of the words acquired by each of the plurality of infants is obtained by the caregiver determining whether or not the infant has acquired each word j listed in the simple vocabulary checklist for correction. The simple vocabulary checklist for correction is a vocabulary checklist different from the simple vocabulary checklist created by the word selection unit 2.

補正用簡易語彙チェックリストを作成するために、例えば、語選択部２は、記憶部１に記憶されている語の中から、式（１）＝０．５となるxが小さい方からN個の語を選択する（ステップＳ２）。言い換えれば、語選択部２は、例えば発話初期に獲得される語をN個選択する。Nは予め定められた正の整数である。例えば、Nは400程度である。補正用簡易語彙チェックリストは、これらの選択された語により構成される。なお、これらの選択された語には、社会語、名詞、述語、閉じた語、その他という品詞情報が定められているとする。社会語とは、例えば、あいさつ、ママ、パパである。閉じた語とは、例えば、代名詞、助詞、助動詞、数詞である。 In order to create a simple vocabulary checklist for correction, for example, the word selection unit 2 has N words from the words stored in the storage unit 1 from the smaller x in equation (1) = 0.5. Select the word (step S2). In other words, the word selection unit 2 selects, for example, N words acquired at the initial stage of utterance. N is a predetermined positive integer. For example, N is about 400. The simple vocabulary checklist for correction consists of these selected words. It is assumed that these selected words are defined with part-of-speech information such as social words, nouns, predicates, closed words, and others. Social languages are, for example, greetings, moms, and dads. Closed words are, for example, pronouns, particles, auxiliary verbs, and numerals.

クラスタリング部５には、補正用簡易語彙チェックリストに基づいて生成された、複数の幼児のそれぞれが獲得している語の品詞情報が入力される。 In the clustering unit 5, the part-of-speech information of the words acquired by each of the plurality of infants, which is generated based on the simple vocabulary checklist for correction, is input.

クラスタリング部５は、複数の幼児のそれぞれが獲得している語の品詞情報に基づいて複数の幼児を、k-means法等のクラスタリング方法によりクラスタリングする。具体的には、幼児kのベクトルv_kが以下のように定義されているとして、このベクトルv_kに基づいて複数の幼児をクラスタリングする。 The clustering unit 5 clusters a plurality of infants by a clustering method such as a k-means method based on the part-of-speech information of the words acquired by each of the plurality of infants. Specifically, as a vector v _k infants k is defined as follows, clustering several infant on the basis of the vector v _k.

ここで、V_k,nは幼児kが獲得している品詞情報が名詞である語の数であり、V_k,sは幼児kが獲得している品詞情報が社会語である語の数であり、V_k,pは幼児kが獲得している品詞情報が述語である語の数であり、V_k,cは幼児kが獲得している品詞情報が閉じた語である語の数であり、V_k,oは幼児kが獲得している品詞情報がその他である語の数である。 Here, V _{k, n} is the number of words whose part of speech information acquired by infant k is a nomenclature, and V _{k, s} is the number of words whose part of speech information acquired by infant k is a social language. Yes, V _{k, p} is the number of words whose part of speech information acquired by infant k is a predicate, and V _{k, c} is the number of words whose part of speech information acquired by infant k is closed. Yes, V _{k, o} is the number of words for which the part-speech information acquired by the infant k is other.

クラスタリングにより幼児は少なくとも１個のグループに分けられる。例えば、クラスタ数を３とする。クラスタ数を３とすることにより、「発話初期に（他の品詞よりも）名詞を発話する」「発話初期に（他の品詞よりも）社会語を発話する」「その他」という３個のグループに幼児を分けることができる。もちろん、クラスタ数は３以外でもよい。例えば、クラスタ数は５程度としてもよい。 Infants are divided into at least one group by clustering. For example, the number of clusters is 3. By setting the number of clusters to 3, three groups of "speak a noun (than other part of speech) at the beginning of utterance", "speak a social language (more than other part of speech) at the beginning of utterance", and "other". Infants can be divided into. Of course, the number of clusters may be other than 3. For example, the number of clusters may be about 5.

クラスタリング部５は、グループごとに、そのグループに含まれる幼児の能力齢と実齢との差の平均値を計算し、この計算された平均値をそのグループの補正値とする。補正値は、正の値の場合もあるし、負の値となる場合もある。幼児の能力齢は、能力齢計算部３のステップＳ３の処理により得ることができる。クラスタリング部５により生成された各グループについての情報及び各グループの補正値は、能力齢補正部４に出力される。 The clustering unit 5 calculates the average value of the difference between the ability age and the actual age of the infants included in the group for each group, and uses this calculated average value as the correction value of the group. The correction value may be a positive value or a negative value. The ability age of an infant can be obtained by the process of step S3 of the ability age calculation unit 3. The information about each group generated by the clustering unit 5 and the correction value of each group are output to the ability age correction unit 4.

以上のクラスタリング部５の処理が、能力齢補正部４の処理に先だって予め行われているとする。 It is assumed that the above-mentioned processing of the clustering unit 5 is performed in advance of the processing of the ability age correction unit 4.

また、養育者が補正用簡易語彙チェックリストに掲載された各語jを、対象幼児が獲得しているかどうかを判断することにより、対象幼児が獲得している語の品詞情報が予め得られているとする。 In addition, the part-of-speech information of the words acquired by the target infant can be obtained in advance by the caregiver determining whether or not each word j listed in the simple vocabulary checklist for correction has been acquired by the target infant. Suppose you are.

能力齢補正部４には、対象幼児が獲得している語の品詞情報と、クラスタリング部５により生成された各グループについての情報及び各グループの補正値と、能力齢計算部３により計算された能力齢とが入力される。この各グループについての情報には、各グループを表すクラスタが含まれているとする。各グループを表すクラスタは、例えばその各グループに含まれる幼児の式（４）により定義されるベクトルの平均値である。 In the ability age correction unit 4, the part-of-speech information of the words acquired by the target infant, the information about each group generated by the clustering unit 5, the correction value of each group, and the ability age calculation unit 3 calculated. The ability age is entered. It is assumed that the information about each group includes clusters representing each group. The cluster representing each group is, for example, the average value of the vectors defined by the formula (4) of the infant included in each group.

この場合、能力齢補正部４は、対象幼児が獲得している語の品詞情報に基づいて、対象幼児が属するグループを特定し、特定されたグループに対応する補正値に基づいて対象幼児の能力齢を補正する。 In this case, the ability age correction unit 4 identifies the group to which the target infant belongs based on the part-speech information of the word acquired by the target infant, and the ability of the target infant based on the correction value corresponding to the specified group. Correct age.

具体的には、能力齢補正部４は、対象幼児が獲得している語の品詞情報に基づいて、
式（４）により定義される、対象幼児のベクトルを求め、求まった対象幼児のベクトルに最も距離が近いクラスタを有するグループを特定し、特定されたグループを対象幼児が属するグループとする。そして、能力齢補正部４は、能力齢計算部３により計算された能力齢から、特定されたグループに対応する補正値を減算する。減算された値は、補正後の能力齢として出力される。 Specifically, the ability age correction unit 4 is based on the part-speech information of the word acquired by the target infant.
The vector of the target infant defined by the equation (4) is obtained, the group having the cluster closest to the obtained vector of the target infant is specified, and the specified group is defined as the group to which the target infant belongs. Then, the ability age correction unit 4 subtracts the correction value corresponding to the specified group from the ability age calculated by the ability age calculation unit 3. The subtracted value is output as the corrected age.

なお、クラスタリング部５は、予め定められた複数の齢区間の中のある齢区間に実齢が含まれる幼児を式（４）により定義されるベクトルに基づいてクラスタリングすることによりグループ分けをして、上記と同様の処理を行うことにより、そのある齢区間の各グループに対応する補正値を求めてもよい。クラスタリング部５は、この処理を、予め定められた複数の齢区間のそれぞれに対して行うことにより、各予め定められた齢区間の各グループに対応する補正値を求めることができる。 In addition, the clustering unit 5 groups infants whose actual age is included in a certain age section among a plurality of predetermined age sections by clustering them based on the vector defined by the equation (4). , The correction value corresponding to each group of the certain age section may be obtained by performing the same processing as described above. By performing this process for each of the plurality of predetermined age sections, the clustering unit 5 can obtain a correction value corresponding to each group of each predetermined age section.

予め定められた齢区間の例は、(1)12ヶ月以下、(2)13-18ヶ月、(3)19-24ヶ月、(4)25-30ヶ月、(5)31-36ヶ月、(6)37-48ヶ月という６個の区間である。もちろん、これ以外の区間を予め定められた齢区間としてもよい。 Examples of predetermined age intervals are (1) 12 months or less, (2) 13-18 months, (3) 19-24 months, (4) 25-30 months, (5) 31-36 months, ( 6) There are 6 sections of 37-48 months. Of course, a section other than this may be a predetermined age section.

この場合、能力齢補正部４は、対象幼児の実齢に基づいて、予め定められた複数の齢区間の中の対象幼児の実齢が含まれる齢区間を特定する。また、能力齢補正部４は、対象幼児が獲得している語の品詞情報に基づいて、特定された齢区間において、対象幼児が属するグループを特定する。そして、能力齢補正部４は、特定されたグループに対応する補正値を選択し、その選択された補正値に基づいて対象幼児の能力齢を補正する。例えば、能力齢補正部４は、対象幼児の能力齢からその選択された補正値を減算する。減算された値は、補正後の能力齢として出力される。 In this case, the ability age correction unit 4 specifies an age section including the actual age of the target infant among a plurality of predetermined age sections based on the actual age of the target infant. In addition, the ability age correction unit 4 identifies the group to which the target infant belongs in the specified age section based on the part-of-speech information of the word acquired by the target infant. Then, the ability age correction unit 4 selects a correction value corresponding to the specified group, and corrects the ability age of the target infant based on the selected correction value. For example, the ability age correction unit 4 subtracts the selected correction value from the ability age of the target infant. The subtracted value is output as the corrected age.

なお、クラスタリング部５は、各グループの予め定められた複数の齢区間のそれぞれごとに補正値を求めてもよい。予め定められた齢区間の例は、(1)12ヶ月以下、(2)13-18ヶ月、(3)19-24ヶ月、(4)25-30ヶ月、(5)31-36ヶ月、(6)37-48ヶ月という６個の区間である。もちろん、これ以外の区間を予め定められた齢区間としてもよい。 The clustering unit 5 may obtain a correction value for each of a plurality of predetermined age sections of each group. Examples of predetermined age intervals are (1) 12 months or less, (2) 13-18 months, (3) 19-24 months, (4) 25-30 months, (5) 31-36 months, ( 6) There are 6 sections of 37-48 months. Of course, a section other than this may be a predetermined age section.

例えば、クラスタリング部５は、あるグループ内のある齢区間に含まれる実齢を有する幼児の能力齢と実齢との差の平均値を計算し、この計算された平均値をそのグループのその齢区間に対応する補正値とする。クラスタリング部５は、この処理を、各グループの各予め定められた齢区間に対して行うことにより、各グループの各予め定められた齢区間に対応する補正値を得ることができる。 For example, the clustering unit 5 calculates the average value of the difference between the ability age and the actual age of an infant having an actual age included in a certain age section in a certain group, and uses this calculated average value as the age of the group. The correction value corresponds to the section. By performing this process on each predetermined age section of each group, the clustering unit 5 can obtain a correction value corresponding to each predetermined age section of each group.

この場合、能力齢補正部４は、上記と同様に、対象幼児が獲得している語の品詞情報に基づいて、対象幼児が属するグループを特定する。そして、能力齢補正部４は、特定されたグループの、対象幼児の実齢が含まれる齢区間に対応する補正値を選択し、その選択された補正値に基づいて対象幼児の能力齢を補正する。例えば、能力齢補正部４は、対象幼児の能力齢からその選択された補正値を減算する。減算された値は、補正後の能力齢として出力される。 In this case, the ability age correction unit 4 identifies the group to which the target infant belongs based on the part-of-speech information of the words acquired by the target infant, as described above. Then, the ability age correction unit 4 selects a correction value corresponding to the age section including the actual age of the target infant in the specified group, and corrects the ability age of the target infant based on the selected correction value. To do. For example, the ability age correction unit 4 subtracts the selected correction value from the ability age of the target infant. The subtracted value is output as the corrected age.

<<例２>>
幼児が獲得している語の中の品詞情報に含まれる名詞の語の個数が、一般的な幼児が獲得している語の中の平均的な個数よりも多く、かつ、幼児が獲得している語の中の品詞情報に含まれる名詞と社会語の比率における名詞が占める割合が、一般的な幼児が獲得している語の中の品詞情報に含まれる名詞と社会語の比率における名詞が占める平均的な割合よりも高い場合には、その幼児の獲得している語彙数は一般的な幼児の獲得している語彙数よりも多く、また、幼児が獲得している語の中の品詞情報に含まれる社会語の個数が、一般的な幼児が獲得している語の中の平均的な個数よりも多く、かつ、幼児が獲得している語の中の品詞情報に含まれる名詞と社会語の比率における社会語が占める割合が、一般的な幼児が獲得している語の中の品詞情報に含まれる名詞と社会語の比率における社会語が占める平均的な割合よりも高い場合には、その幼児の獲得している語彙数は一般的な幼児の獲得している語彙数よりも少ない、という参考文献１に示された事実が知られている。<<例２>>では、この事実に基づき、能力齢補正部４は、対象幼児が獲得している語の中の、品詞情報が名詞の語の個数が多いほど大きく、品詞情報が社会語の語の個数が多いほど小さい値に基づいて補正値を得て、得られた補正値を用いて能力齢を補正する。 << Example 2 >>
The number of nomenclature words contained in the part-of-speech information in the words acquired by the infant is larger than the average number of words acquired by the general infant, and the infant acquires it. The ratio of nomen in the ratio of nomen to social words contained in the part-of-speech information in the word is the ratio of nomen to the part-of-speech information in the words acquired by general infants. If it is higher than the average percentage, the infant has more vocabulary than the average infant has, and the part of the word in the word that the infant has acquired. The number of social words included in the information is larger than the average number of words acquired by a general infant, and the number of nomenclature contained in the part word information in the words acquired by the infant When the ratio of social words in the ratio of social words is higher than the average ratio of social words in the ratio of nomenclature and social words contained in the part lyrics information in the words acquired by general infants. Is known to be the fact shown in Reference 1 that the number of vocabulary acquired by the infant is smaller than the number of vocabulary acquired by the general infant. In << Example 2 >>, based on this fact, the ability age correction unit 4 has the larger the number of noun words in the words acquired by the target infant, and the part of speech information is the social language. As the number of words in the word increases, a correction value is obtained based on a smaller value, and the obtained correction value is used to correct the ability age.

αを、幼児が獲得している語の中の、品詞情報が名詞の語の個数が多いほど大きく、品詞情報が社会語の語の個数が多いほど小さい値を取る値とする。例えば、α＝名詞の数／（名詞の数＋社会語の数）である。もちろん、幼児が獲得している語の中の、品詞情報が名詞の語の個数が多いほど大きく、品詞情報が社会語の語の個数が多いほど小さい値を取る値であれば、α＝名詞の数／（社会語の数）、α＝名詞の数−社会語の数等であってもよい。 Let α be a value in which the part-speech information takes a larger value as the number of noun words increases, and the part-speech information takes a smaller value as the number of social language words increases among the words acquired by the infant. For example, α = number of nouns / (number of nouns + number of social words). Of course, if the part-speech information of the words acquired by the infant is larger as the number of noun words is larger and the part-speech information is smaller as the number of social words is larger, α = noun. / (Number of social words), α = number of nouns-number of social words, etc.

<<例１>>と同様に、複数の幼児の養育者が、補正用簡易語彙チェックリストに掲載された各語jを、幼児が獲得しているかどうかを判断することにより、複数の幼児のそれぞれが獲得している語の品詞情報が得られているとする。この得られている複数の幼児のそれぞれが獲得している語の品詞情報が、能力齢補正部４に入力される。 Similar to << Example 1 >>, multiple infant caregivers determine whether or not each infant has acquired each word j listed in the simple vocabulary checklist for correction. It is assumed that the part-of-speech information of the word acquired by each is obtained. The part-of-speech information of the words acquired by each of the obtained plurality of infants is input to the ability age correction unit 4.

そして、能力齢補正部４は、ある幼児kのαの値をα_kとして、ある幼児kの能力齢x_k ^-と複数の幼児の実齢の平均値d_kとの差分x_k ^--d_k=vα_k-uとおいて、最小二乗法によりこの式を満たす尤もらしいv,uを推定する。 The ability of age correcting unit 4, as the value of alpha _k of alpha certain infants k, there infant k ability age x _k ^- difference x _k and the average value d _k of the plurality of infant actual age ^- -d _{With k} = vα _k -u, we estimate plausible v, u that satisfy this equation by the least squares method.

なお、x_k ^--d_kとα_kとに相関が大きくない場合、例えば最小二乗法でuとvによりモデル化できない場合、能力齢補正部４は<<例２>>による補正を行わなくてもよい。x_k ^--d_k=vα_k-uの式を満たす尤もらしいv,uを最小二乗法でモデル化できない場合とは、例えば、５％や１％で、最小二乗法の検定を行った場合、ｐ値が閾値以上になる場合などのことである。 Incidentally, x _k ^- if correlation between -d _k and alpha _k is not large, when, for example, can not be modeled by u and v by the least square method, the ability of age correcting unit 4 without correction by << Example 2 >> You may. _{^{_{_{x k - -d k = vα k}}}} -u plausible v satisfying equation, and if you can not modeled in the method of least squares u is, for example, 5% or 1%, in the case of performing the assay method of least squares , When the p-value is equal to or greater than the threshold value.

また、対象幼児の養育者が、補正用簡易語彙チェックリストに掲載された各語jを、幼児が獲得しているかどうかを判断することにより、対象幼児が獲得している語の品詞情報が得られているとする。この得られている対象幼児が獲得している語の品詞情報が、能力齢補正部４に入力される。 In addition, the caregiver of the target infant determines whether or not the infant has acquired each word j listed in the simple vocabulary checklist for correction, so that the part-of-speech information of the word acquired by the target infant can be obtained. It is assumed that it has been done. The part-of-speech information of the word acquired by the obtained target infant is input to the ability age correction unit 4.

以上の処理が、能力齢補正部４による補正の処理に先だって予め行われているとする。 It is assumed that the above processing is performed in advance of the correction processing by the ability age correction unit 4.

能力齢補正部４は、入力された対象幼児が獲得している語の品詞情報に基づいて、対象幼児のαの値を計算し、計算されたαの値に基づいてvα-uを計算し、この計算された値を補正値とする。そして、能力齢補正部４は、この補正値を用いて能力齢を補正する。例えば、能力齢補正部４は、能力齢からこの補正値を減算する。減算された値は、補正後の能力齢として出力される。 The ability age correction unit 4 calculates the α value of the target infant based on the input part-speech information of the word acquired by the target infant, and calculates vα-u based on the calculated α value. , Let this calculated value be the correction value. Then, the ability age correction unit 4 corrects the ability age using this correction value. For example, the ability age correction unit 4 subtracts this correction value from the ability age. The subtracted value is output as the corrected age.

なお、能力齢補正部４は、予め定められた複数の齢区間のそれぞれごとにv,nを求めてもよい。予め定められた齢区間の例は、(1)12ヶ月以下、(2)13-18ヶ月、(3)19-24ヶ月、(4)25-30ヶ月、(5)31-36ヶ月、(6)37-48ヶ月という６個の区間である。もちろん、これ以外の区間を予め定められた齢区間としてもよい。 The ability age correction unit 4 may obtain v and n for each of a plurality of predetermined age sections. Examples of predetermined age intervals are (1) 12 months or less, (2) 13-18 months, (3) 19-24 months, (4) 25-30 months, (5) 31-36 months, ( 6) There are 6 sections of 37-48 months. Of course, a section other than this may be a predetermined age section.

例えば、能力齢補正部４は、予め定められた複数の齢区間の中のある齢区間に実齢が含まれる幼児kのx_k ^-,d_k,α_kを用いて、x_k ^--d_k=vα_k-uの式を満たす尤もらしいv,uを例えば最小二乗法により求める。能力齢補正部４は、この処理を、予め定められた複数の齢区間の中の各齢区間に対して行うことにより、予め定められた複数の齢区間の中の各齢区間に対応するv,uを得ることができる。 For example, the ability of age correcting unit 4, x _k infants k that includes Jitsuyowai in a certain age interval of a plurality of age interval defined in advance ^-, with d _k, alpha _k, x _k ^- -d _{Find the plausible v, u that satisfy the equation k} = vα _k -u, for example, by the least squares method. By performing this process for each age section in the plurality of predetermined age sections, the ability age correction unit 4 corresponds to each age section in the plurality of predetermined age sections v. , U can be obtained.

この場合、能力齢補正部４は、対象幼児の実齢に基づいて、予め定められた複数の齢区間の中の対象幼児の実齢が含まれる齢区間を特定し、その特定された齢空間に対応するv,uを用いて、上記と同様にして、補正値を求め、求まった補正値を用いて能力齢を補正する。 In this case, the ability age correction unit 4 identifies an age section including the actual age of the target infant among a plurality of predetermined age sections based on the actual age of the target infant, and the specified age space. Using v and u corresponding to, the correction value is obtained in the same manner as above, and the ability age is corrected using the obtained correction value.

また、能力齢補正部４は、<<例１>>で説明した各グループ毎にv,nを求めてもよい。予め定められた齢区間の例は、(1)12ヶ月以下、(2)13-18ヶ月、(3)19-24ヶ月、(4)25-30ヶ月、(5)31-36ヶ月、(6)37-48ヶ月という６個の区間である。もちろん、これ以外の区間を予め定められた齢区間としてもよい。 Further, the ability age correction unit 4 may obtain v and n for each group described in << Example 1 >>. Examples of predetermined age intervals are (1) 12 months or less, (2) 13-18 months, (3) 19-24 months, (4) 25-30 months, (5) 31-36 months, ( 6) There are 6 sections of 37-48 months. Of course, a section other than this may be a predetermined age section.

例えば、能力齢補正部４は、あるグループに含まれる幼児kのx_k ^-,d_k,α_kを用いて、x_k ^--d_k=vα_k-uの式を満たす尤もらしいv,uを例えば最小二乗法により求める。能力齢補正部４は、この処理を、各グループに対して行うことにより、各グループに対応するv,uを得ることができる。 For example, the ability of age correcting unit 4, x _k infants k included in a group ^-, d _k, with _{_{^{_{α k, x k - -d k}}}} = vα k -u plausible v satisfying equation, u Is obtained by, for example, the least squares method. By performing this process for each group, the ability age correction unit 4 can obtain v and u corresponding to each group.

この場合、能力齢補正部４は、対象幼児が獲得している語の品詞情報に基づいて、式（４）により定義される、対象幼児のベクトルを求め、求まった対象幼児のベクトルに最も距離が近いクラスタを有するグループを特定する。そして、能力齢補正部４は、その特定されたグループに対応するv,uを用いて、上記と同様にして、補正値を求め、求まった補正値を用いて能力齢を補正する。 In this case, the ability age correction unit 4 obtains the vector of the target infant defined by the equation (4) based on the part-speech information of the word acquired by the target infant, and is the closest to the obtained vector of the target infant. Identify groups that have clusters close to each other. Then, the ability age correction unit 4 obtains a correction value in the same manner as described above using v and u corresponding to the specified group, and corrects the ability age using the obtained correction value.

［実験例］
非特許文献２で示した2688個の語彙チェックリストを用いて予め収集した940人のチェック結果から、語毎の獲得曲線のロジスティック関数(1)を求めた。このロジスティック関数から、上記の手法により、幼児の能力齢を推定した。評価に用いた幼児は200人である。実験例では、gを400、hを1500と設定した。 [Experimental example]
The logistic function (1) of the acquisition curve for each word was obtained from the check results of 940 people collected in advance using the 2688 vocabulary checklists shown in Non-Patent Document 2. From this logistic function, the ability age of infants was estimated by the above method. The number of infants used for the evaluation was 200. In the experimental example, g was set to 400 and h was set to 1500.

推定した能力齢と幼児が実際に持つ能力の分散を比較することで、上記の簡易語彙チェックリストによるテストの推定精度を評価することを考える。しかし、幼児k が実際に持つ能力は未知であるため直接比較できない。そこで、実際の能力との差の分散を求める代わりに幼児kの実齢d_kとの差の分散を調べる。この分散もテストが正確になるほど小さくなる。区間の分割数Mを変えながら作成した簡易語彙チェックリストによるテストで推定した能力齢x_kと実齢d_kの差の分散を図４に示す。ここでは、能力齢及び実齢として、日齢を用いている。図４の、左から２列目及び５列目の語数Tは簡易語彙チェックリストに掲載された語の数を表す。この分散は、モデルの適合度を表しており，小さくなるほど幼児の能力推定に適したモデルになっている。簡易語彙チェックリストの語数と分散の関係を比較すると分散は語数によらずほぼ一定の値となるが分かる。この結果から、30，40程度の語でも十分な推定結果を示すことが分かる。 It is considered to evaluate the estimation accuracy of the test by the above simple vocabulary checklist by comparing the estimated ability age and the variance of the ability actually possessed by the infant. However, since the actual ability of infant k is unknown, it cannot be compared directly. Therefore, instead of finding the variance of the difference from the actual ability, the variance of the difference between the infant k and the actual age d _k is examined. This variance also decreases as the test becomes more accurate. FIG. 4 shows the variance of the difference between the _{ability age x k} and the actual _{age d k} estimated by the test using the simple vocabulary checklist created while changing the number of divisions M of the interval. Here, the age of the day is used as the ability age and the actual age. In FIG. 4, the number of words T in the second and fifth columns from the left represents the number of words listed in the simple vocabulary checklist. This variance represents the goodness of fit of the model, and the smaller it is, the more suitable the model is for estimating the ability of infants. Comparing the relationship between the number of words and the variance in the simple vocabulary checklist, it can be seen that the variance is an almost constant value regardless of the number of words. From this result, it can be seen that even words of about 30 or 40 show sufficient estimation results.

以下、能力齢の補正による推定精度の向上について説明する。 Hereinafter, the improvement of the estimation accuracy by the correction of the ability age will be described.

まず、クラスタリングを用いる場合には、全体の語数クラスタリングの数（タイプ数）を２とした。補正を求める品詞を調べる品詞テストには，2688語のうち，幼児が平均的に最初に覚える400語を利用した。幼児の簡易語彙チェックリストには40語を用いた。図５に実施した結果を示す。能力齢の補正前の分散である補正前分散は、（１）式により推定した能力齢を表す。能力齢の補正後の分散である補正分散（１）は、上記手法により補正された能力齢と実齢の差の分散を示す。ここでは、能力齢及び実齢として、日齢を用いている。この分散の値が小さくなることが、正当な評価をしていることに相当する。図５を見ると、上記の手法による補正により分散値が小さくなっている。より具体的には、上記の手法では、トータル440語テストが必要であるが、図４の語数Tが473のところよりも分散が小さくなっており、上記の手法による補正の有効性を示している。 First, when clustering is used, the total number of words clustering (number of types) is set to 2. Of the 2688 words, the 400 words that infants remember first on average were used in the part-speech test to find the part-speech for correction. 40 words were used for the infant's simple vocabulary checklist. FIG. 5 shows the results of the implementation. The pre-correction variance, which is the variance before correction of the ability age, represents the ability age estimated by Eq. (1). The corrected variance (1), which is the variance after correction of the ability age, shows the variance of the difference between the ability age and the actual age corrected by the above method. Here, the age of the day is used as the ability age and the actual age. A small value of this variance corresponds to a legitimate evaluation. Looking at FIG. 5, the variance value is reduced by the correction by the above method. More specifically, the above method requires a total of 440 word tests, but the variance is smaller than when the number of words T in FIG. 4 is 473, indicating the effectiveness of the correction by the above method. There is.

つぎに、クラスタリングを用いる場合と同条件で、能力齢の補正を行った結果を図５の補正分散（２）に示す。この場合でも、分散の削減が優位に行えることが確認できる。 Next, the result of correcting the ability age under the same conditions as when clustering is used is shown in the corrected variance (2) of FIG. Even in this case, it can be confirmed that the reduction of variance can be advantageous.

［変形例等］
上記の実施形態では、幼児の語彙獲得の例で説明したが、能力齢の推定の対象は幼児、言い換えれば母国語を習得中の未就学児でなくてもよい。 [Modification example, etc.]
In the above embodiment, the example of vocabulary acquisition of an infant has been described, but the target of estimating the age of ability does not have to be an infant, in other words, a preschooler who is learning a mother tongue.

上記の説明では、理解曲線及び発話曲線を、月齢を用いて定義したが、理解曲線及び発話曲線は、月齢に代えて日齢、週齢等の所定の時間を表す指標を用いて定義されてもよい。 In the above explanation, the comprehension curve and the utterance curve are defined by using the age of the moon, but the comprehension curve and the utterance curve are defined by using an index representing a predetermined time such as the age of the day and the age of the week instead of the age of the moon. May be good.

能力齢推定装置は、能力齢補正部４及びクラスタリング部５を備えていなくてもよい。この場合、能力齢推定装置は、能力齢計算部３により計算された能力齢を最終的な推定結果として出力する。 The ability age estimation device may not include the ability age correction unit 4 and the clustering unit 5. In this case, the ability age estimation device outputs the ability age calculated by the ability age calculation unit 3 as the final estimation result.

上記各装置及び方法において説明した処理は、記載の順にしたがって時系列に実行されるのみならず、処理を実行する装置の処理能力あるいは必要に応じて並列的にあるいは個別に実行されてもよい。 The processes described in each of the above devices and methods are not only executed in chronological order according to the order of description, but may also be executed in parallel or individually as required by the processing capacity of the device that executes the processes.

また、上記各装置における各種の処理機能をコンピュータによって実現してもよい。その場合、各装置が有すべき機能の処理内容はプログラムによって記述される。そして、このプログラムをコンピュータで実行することにより、上記各装置における各種の処理機能がコンピュータ上で実現される。 Further, various processing functions in each of the above devices may be realized by a computer. In that case, the processing content of the function that each device should have is described by the program. Then, by executing this program on the computer, various processing functions in each of the above devices are realized on the computer.

この処理内容を記述したプログラムは、コンピュータで読み取り可能な記録媒体に記録しておくことができる。コンピュータで読み取り可能な記録媒体としては、例えば、磁気記録装置、光ディスク、光磁気記録媒体、半導体メモリ等どのようなものでもよい。 The program describing the processing content can be recorded on a computer-readable recording medium. The computer-readable recording medium may be, for example, a magnetic recording device, an optical disk, a photomagnetic recording medium, a semiconductor memory, or the like.

また、このプログラムの流通は、例えば、そのプログラムを記録したＤＶＤ、ＣＤ−ＲＯＭ等の可搬型記録媒体を販売、譲渡、貸与等することによって行う。さらに、このプログラムをサーバコンピュータの記憶装置に格納しておき、ネットワークを介して、サーバコンピュータから他のコンピュータにそのプログラムを転送することにより、このプログラムを流通させてもよい。 Further, the distribution of this program is performed, for example, by selling, transferring, renting, or the like a portable recording medium such as a DVD or a CD-ROM in which the program is recorded. Further, the program may be distributed by storing the program in the storage device of the server computer and transferring the program from the server computer to another computer via a network.

このようなプログラムを実行するコンピュータは、例えば、まず、可搬型記録媒体に記録されたプログラムもしくはサーバコンピュータから転送されたプログラムを、一旦、自己の記憶部に格納する。そして、処理の実行時、このコンピュータは、自己の記憶部に格納されたプログラムを読み取り、読み取ったプログラムに従った処理を実行する。また、このプログラムの別の実施形態として、コンピュータが可搬型記録媒体から直接プログラムを読み取り、そのプログラムに従った処理を実行することとしてもよい。さらに、このコンピュータにサーバコンピュータからプログラムが転送されるたびに、逐次、受け取ったプログラムに従った処理を実行することとしてもよい。また、サーバコンピュータから、このコンピュータへのプログラムの転送は行わず、その実行指示と結果取得のみによって処理機能を実現する、いわゆるＡＳＰ（Application Service Provider）型のサービスによって、上述の処理を実行する構成としてもよい。なお、プログラムには、電子計算機による処理の用に供する情報であってプログラムに準ずるもの（コンピュータに対する直接の指令ではないがコンピュータの処理を規定する性質を有するデータ等）を含むものとする。 A computer that executes such a program first temporarily stores, for example, a program recorded on a portable recording medium or a program transferred from a server computer in its own storage unit. Then, when the process is executed, the computer reads the program stored in its own storage unit and executes the process according to the read program. Further, as another embodiment of this program, a computer may read the program directly from a portable recording medium and execute processing according to the program. Further, every time the program is transferred from the server computer to this computer, the processing according to the received program may be executed sequentially. In addition, the above processing is executed by a so-called ASP (Application Service Provider) type service that realizes the processing function only by the execution instruction and result acquisition without transferring the program from the server computer to this computer. May be. The program shall include information used for processing by a computer and equivalent to the program (data that is not a direct command to the computer but has a property of defining the processing of the computer, etc.).

また、コンピュータ上で所定のプログラムを実行させることにより、各装置を構成することとしたが、これらの処理内容の少なくとも一部をハードウェア的に又は人手により実現することとしてもよい。 Further, although each device is configured by executing a predetermined program on a computer, at least a part of these processing contents may be realized by hardware or manually.

この発明は、例えば、幼児を対象とした語彙能力を調べる様々な幼児語彙テストの作成に役立つ。 The present invention is useful, for example, in creating various infant vocabulary tests that test vocabulary proficiency in infants.

１記憶部
２語選択部
３能力齢計算部
４能力齢補正部
５クラスタリング部 1 Memory unit 2 Word selection unit 3 Ability age calculation unit 4 Ability age correction unit 5 Clustering unit

Claims

A memory unit that stores information about the probability that an infant of each age x has acquired each of multiple words.
Information on whether or not the target infant to be targeted for ability estimation has acquired each of a plurality of words, and the probability that each of the plurality of words read from the above storage unit has been acquired by an infant of each age x. Based on the information in, the ability age calculation unit that calculates the ability age, which is an index showing how old the target infant has the same vocabulary ability as the infant,
Capability estimation device including.

The capacity estimation device according to claim 1.
It further includes an ability age correction unit that corrects the ability age obtained by the ability age calculation unit based on the part-speech information of the word acquired by the target infant.
Capacity estimation device.

The capacity estimation device according to claim 1.
In the storage unit, X is set to a predetermined number of 0 ≤ X ≤ 100, and each word and the X% acquisition age at which X% of the infant acquires each of the above words are stored.
A word selection unit that selects words stored in the storage unit so that the X% acquired age corresponding to the selected word is included in at least one in each of a plurality of predetermined age sections.
The plurality of words are words selected by the word selection unit.
Capacity estimation device.

The capacity estimation device according to claim 1.
Let 1, ..., T be the plurality of words, j = 1, ..., T, let t _j be the information about whether or not the target infant has acquired the word j, and let the target infant acquire the word j. If so, set t _j = 1, and if the target infant has not acquired the word j, set t _j = 0, and provide information on the probability that the infant of each age x has acquired the word j. and _{_{_{a j, b j, c j}}} , the e as Napier's constant, the capacity of age calculation section of which is x defined by the following equation ^- is referred to as the ability of age,

Capacity estimation device.

Capacity estimation device.

Assuming that the memory unit stores information about the probability that an infant of each age x has acquired each of a plurality of words.
The ability calculation unit acquires information on whether or not the target infant whose ability is to be estimated has acquired each of a plurality of words, and each of the plurality of words read from the above storage unit is acquired by an infant of each age x. Ability age calculation step to determine the ability age, which is an index showing how old the target infant has the same vocabulary ability as the infant, based on the information about the probability of having
Ability estimation method including.

Program for causing a computer to function claims 1 as either capacity estimation equipment each part of the 5.