JP5604344B2

JP5604344B2 - Vocabulary explosion time detection device, method, and program

Info

Publication number: JP5604344B2
Application number: JP2011060851A
Authority: JP
Inventors: 哲生小林; 泰浩南; 弘晃杉山
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2011-03-18
Filing date: 2011-03-18
Publication date: 2014-10-08
Anticipated expiration: 2031-03-18
Also published as: JP2012196254A

Description

本発明は、語彙爆発時期検出装置、方法、及びプログラムに係り、特に、幼児が語彙爆発時期にさしかかったか否かを検出する語彙爆発時期検出装置、方法、及びプログラムに関する。 The present invention relates to a vocabulary explosion time detection device, method, and program, and more particularly, to a vocabulary explosion time detection device, method, and program for detecting whether or not an infant has reached a vocabulary explosion time.

ヒトの言語発達は「人間とは何か」を考える上で重要な科学的知見や示唆を提供し得るものでありながら、現状としては未解決の問題が多いため、言語発達に関する測定技術の進展や商業上でのサービス展開はほとんど見られないのが現状である。特に、音声認知や語彙獲得、文法操作などの基本能力の中でも、語彙獲得に関する科学技術はほとんど進展が見られていない。しかし、健やかな発達を緩やかに後押しする教育や、言語発達遅滞を含む発達障害に関する早期発見・支援などの必要性を考えると、本分野での技術開発は重要な意味をもつと考えられる。 Although human language development can provide important scientific knowledge and suggestions for thinking about `` what is human beings '', there are many unsolved problems at present, so progress in measurement technology related to language development There is almost no commercial service development. In particular, there has been little progress in science and technology related to vocabulary acquisition, among basic abilities such as speech recognition, vocabulary acquisition, and grammar manipulation. However, technological development in this field is considered to be important in view of the need for education that moderately supports healthy development and the early detection and support of developmental disabilities including language development delays.

幼児の言語発達の中でも特に特徴的で且つ個人性を捉える上で重要な現象のひとつは、語彙爆発（またはボキャブラリー・スパート）である。これは、発達心理学者が２０世紀中頃から注目してきた現象であり、１歳後半に起こるとされる語彙学習速度の急激な変化のことを指す。基本的には、幼児は１歳の誕生日前後に初語を発するようになるが、しばらくは非常に緩やかな速度で単語を覚えていくことになる。しかし１歳半以降になると、急激に単語を発するようになるため、その劇的な変化を「爆発」や「スパート」と呼んできた。語彙爆発は多くの親が意識的に気づくほど劇的な変化を伴うため、心理学の分野だけでなく育児産業の関係者にもよく知られている。 One of the most distinctive and important phenomena in infant language development is the vocabulary explosion (or vocabulary spurt). This is a phenomenon that development psychologists have been paying attention to since the middle of the 20th century, and refers to a rapid change in vocabulary learning speed that occurs in the second half of the year. Basically, toddlers begin to utter their first words around their first birthday, but for a while they learn words at a very moderate rate. However, after the age of one and a half years, the words suddenly start to utter, so the dramatic change has been called “explosion” and “spurt”. The vocabulary explosion is so dramatic that many parents consciously notice it, so it is well known not only in the field of psychology but also in the childcare industry.

従来、発達心理学の分野では、語彙チェックリスト（親の回答に基づくアンケート調査）を用いた大規模集団データで語彙爆発の現象を複数の言語で確認してきた。月齢毎に集団データの平均値をプロットすると、ゆるやかな上昇を示す２次曲線になり、その変曲点が１８−２０ヶ月頃に現れることを見出してきた。こうした集団データから、語彙爆発が多くの子どもでみられる一般的な現象であるとみなしてきた。 Conventionally, in the field of developmental psychology, the phenomenon of vocabulary explosion has been confirmed in multiple languages using large-scale group data using a vocabulary checklist (questionnaire survey based on parents' answers). It has been found that when the average value of the population data is plotted for each age, it becomes a quadratic curve showing a gradual increase, and its inflection point appears around 18-20 months. From these collective data, we have regarded vocabulary explosion as a common phenomenon seen in many children.

この語彙爆発について、語彙爆発が個人毎にいつ起こるのか、また、語彙爆発時期（語彙爆発が開始される時期）をどのように検出及び推定するのかということに関して、従来、主に以下の４つの手法が提案されている。 With regard to this vocabulary explosion, the following four main points have been hitherto related to when the vocabulary explosion occurs for each individual and how to detect and estimate the vocabulary explosion time (the time when the vocabulary explosion starts). A method has been proposed.

１つ目は、特に計算などせずグラフを描き、目視で判定する目視法である。２つ目は、５０語覚えた時点を語彙爆発時期と定義する５０語達成基準法である。３つ目は、ある特定の期間（例えば３週間）で達成基準（例えば３０語以上）を満たした時期を語彙爆発時期にするという特定期間達成基準法である。４つめは、時間軸に沿った語彙獲得データの速度成分をロジスティック回帰式に近似させ、その変曲点を語彙爆発時期とするロジスティック回帰近似法である（非特許文献１参照）。 The first is a visual method in which a graph is drawn without any particular calculation and is visually determined. The second is the 50-word achievement standard method that defines the time when 50 words are learned as the vocabulary explosion time. The third is a specific period achievement standard method in which a period when an achievement standard (for example, 30 words or more) is satisfied in a specific period (for example, three weeks) is set as a vocabulary explosion period. The fourth is a logistic regression approximation method in which the velocity component of vocabulary acquisition data along the time axis is approximated to a logistic regression equation, and the inflection point is the vocabulary explosion time (see Non-Patent Document 1).

Ganger, J., & Brent, M. R. (2004). Reexamining the vocabulary spurt. Developmental Psychology, Vol. 40, No. 4, 621-632.Ganger, J., & Brent, M. R. (2004). Reexamining the vocabulary spurt. Developmental Psychology, Vol. 40, No. 4, 621-632.

しかしながら、１つ目の手法は、現象の有無をある程度確認可能であるが、語彙爆発時期を正確に判定する場合には不向きである、という問題がある。 However, the first method can confirm the presence or absence of the phenomenon to some extent, but has a problem that it is not suitable for accurately determining the vocabulary explosion time.

また、２つ目の手法は、実証データに基づいた基準ではあるが、英語圏の中流階層の非常に少ないサンプルに基づく基準であったため、多くの文化圏の様々な子どもに当てはまる保証はない、という問題がある。また、語彙爆発の個人差が全く想定されていない、という問題もある。 The second method is based on empirical data, but based on a very small sample of English-speaking middle classes, there is no guarantee that it will apply to various children in many cultural zones. There is a problem. Another problem is that no individual differences in vocabulary explosion are assumed.

また、３つ目の手法は、ある特定の時間範囲で語彙獲得速度の変化を検出可能であるが、一義的で恣意的な達成基準の設定は、個人間の語彙獲得速度を考慮に入れていないため、個人によっては語彙爆発時期を完全に見誤る可能性がある、という問題がある。 The third method can detect changes in vocabulary acquisition speed over a specific time range, but the unique and arbitrary achievement criteria setting takes into account the vocabulary acquisition speed between individuals. There is a problem that some individuals may misunderstand the vocabulary explosion time completely.

また、４つめの手法は、個人毎にデータを近似させることで、個人間の語彙獲得速度がたとえ異なっていても対応はできるものの、幼児の語彙発達の特徴を正確に捉えきれていないため、ロジスティック回帰の近似精度が低く、語彙爆発の存在自体も確認できない場合が多い、という問題がある。また、この手法では、ある程度蓄積されたデータを遡って解析するタイプの推定法であるため、子どもが発達していく中で、いわばリアルタイムに語彙爆発時期を検出したい場合には利用できない、という問題もある。 The fourth method is to approximate the data for each individual, even if the vocabulary acquisition speed between individuals is different, but it can not handle the characteristics of infant vocabulary development accurately, There is a problem that the approximation accuracy of logistic regression is low and the existence of the vocabulary explosion itself cannot often be confirmed. In addition, since this method is a type of estimation method that retroactively analyzes data accumulated to some extent, it cannot be used when the vocabulary explosion time is detected in real time as the child develops. There is also.

本発明は上記問題点に鑑みてなされたものであり、語彙爆発以降のデータが多くない場合でも、個人差も考慮してリアルタイムに語彙爆発の時期にさしかかっているか否かを検出することができる語彙爆発時期検出装置、方法、及びプログラムを提供することを目的とする。 The present invention has been made in view of the above problems, and even when there is not a lot of data after the vocabulary explosion, it is possible to detect whether the timing of the vocabulary explosion is approaching in real time in consideration of individual differences. An object of the present invention is to provide a vocabulary explosion time detection device, method, and program.

上記目的を達成するために、本発明の語彙爆発時期検出装置は、幼児が新しい単語を発話するようになった日齢と、前記日齢までに前記幼児が発話するようになった単語の累積数との関係を示す複数のデータのうち、前記日齢が大きい方から所定個のデータを除いたデータの推移を直線で近似する近似手段と、前記近似手段で近似された直線と前記所定個のデータとの差分、及び予め定めた閾値に基づいて、前記複数のデータ内に、前記幼児の語彙爆発時期を示すデータが含まれるか否かを判定する判定手段と、を含んで構成されている。 In order to achieve the above-mentioned object, the vocabulary explosion timing detection device of the present invention provides an age at which an infant begins to speak a new word, and a cumulative number of words that the infant has spoken before the date. Among a plurality of data indicating the relationship with the number, approximation means for approximating a transition of data excluding the predetermined number of data from the larger age, a straight line approximated by the approximation means and the predetermined number Determination means for determining whether or not the data indicating the vocabulary explosion time of the infant is included in the plurality of data based on a difference from the data and a predetermined threshold value. Yes.

本発明の語彙爆発時期検出装置によれば、近似手段が、幼児が新しい単語を発話するようになった日齢と、その日齢までに幼児が発話するようになった単語の累積数との関係を示す複数のデータのうち、日齢が大きい方から所定個のデータを除いたデータの推移を直線で近似する。そして、判定手段が、近似手段で近似された直線と所定個のデータとの差分、及び予め定めた閾値に基づいて、複数のデータ内に、幼児の語彙爆発時期を示すデータが含まれるか否かを判定する。 According to the vocabulary explosion time detection apparatus of the present invention, the approximation means is a relationship between the age when the infant began to speak a new word and the cumulative number of words the infant began speaking before that age. The transition of data obtained by removing a predetermined number of data from the larger age among the plurality of data indicating the above is approximated by a straight line. Whether or not the data indicating the vocabulary explosion timing of the infant is included in the plurality of data based on the difference between the straight line approximated by the approximating means and a predetermined number of data and a predetermined threshold value. Determine whether.

このように、幼児の語彙発達の特徴を捉えて、幼児が新しい単語を発話するようになった日齢と単語の累積数との関係を示すデータを、日齢が大きい方から所定個のデータを除いて直線近似し、直線と所定個のデータとの差分に基づいて、語彙爆発の有無を判定するため、個人差も考慮してリアルタイムに語彙爆発の時期にさしかかっているか否かを検出することができる。 In this way, data indicating the relationship between the age at which the infant began to speak a new word and the cumulative number of words by capturing the characteristics of the vocabulary development of the infant, a predetermined number of data from the older age In order to determine whether or not there is a vocabulary explosion based on the difference between the straight line and a predetermined number of data, detect whether or not the vocabulary explosion is approaching in real time, taking into account individual differences. be able to.

また、前記判定手段は、前記所定個のデータの前記日齢が小さい方のデータから１つずつ加算しながら前記差分を算出し、前記差分が前記閾値を超えたときに、該差分の算出に用いられたデータの中で最大の日齢を、前記幼児の語彙爆発時期として検出することができる。 In addition, the determination unit calculates the difference while adding one by one from the data with the smaller age of the predetermined number of data, and when the difference exceeds the threshold, the determination unit calculates the difference. The maximum age among the data used can be detected as the vocabulary explosion time of the infant.

また、本発明の語彙爆発時期検出方法は、近似手段と、判定手段とを含む語彙爆発時期検出装置における語彙爆発時期検出方法であって、前記近似手段は、幼児が新しい単語を発話するようになった日齢と、前記日齢までに前記幼児が発話するようになった単語の累積数との関係を示す複数のデータのうち、前記日齢が大きい方から所定個のデータを除いたデータの推移を直線で近似し、前記判定手段は、前記近似手段で近似された直線と前記所定個のデータとの差分、及び予め定めた閾値に基づいて、前記複数のデータ内に、前記幼児の語彙爆発時期を示すデータが含まれるか否かを判定する方法である。 The vocabulary explosion time detection method according to the present invention is a vocabulary explosion time detection method in a vocabulary explosion time detection apparatus including an approximation means and a determination means, wherein the approximation means allows an infant to speak a new word. Of a plurality of data indicating the relationship between the age of the child and the cumulative number of words that the infant has spoken before the age, data obtained by removing a predetermined number of data from the larger age The determining means approximates the transition of the child with a predetermined threshold based on a difference between the straight line approximated by the approximating means and the predetermined number of data, and a predetermined threshold. This is a method for determining whether or not data indicating the vocabulary explosion time is included.

また、本発明の語彙爆発時期検出プログラムは、コンピュータを、上記語彙爆発時期検出装置を構成する各手段として機能させるためのプログラムである。 The vocabulary explosion time detection program of the present invention is a program for causing a computer to function as each means constituting the vocabulary explosion time detection device.

以上説明したように、本発明の語彙爆発時期検出装置、方法、及びプログラムによれば、幼児の語彙発達の特徴を捉えて、幼児が新しい単語を発話するようになった日齢と単語の累積数のデータを、日齢が大きい方から所定個のデータを除いて直線近似し、直線と所定個のデータとの差分に基づいて、語彙爆発の有無を判定するため、個人差も考慮してリアルタイムに語彙爆発の時期にさしかかっているか否かを検出することができる、という効果が得られる。 As described above, according to the vocabulary explosion timing detection apparatus, method, and program of the present invention, the age and cumulative number of words at which an infant begins to speak a new word by capturing the characteristics of the infant's vocabulary development. In order to determine the presence or absence of a vocabulary explosion based on the difference between the number of data and the predetermined number of data from the older age, by linear approximation, and taking into account individual differences The effect of being able to detect whether or not the time of the vocabulary explosion is approaching in real time can be obtained.

本実施の形態の語彙爆発時期検出装置の機能的構成を示すブロック図である。It is a block diagram which shows the functional structure of the vocabulary explosion time detection apparatus of this Embodiment. 入力画面の一例を示す図である。It is a figure which shows an example of an input screen. 入力データセットの一例を示す図である。It is a figure which shows an example of an input data set. 語彙爆発時期の検出を説明するための図である。It is a figure for demonstrating the detection of a vocabulary explosion time. 検出結果の出力例を示す図である。It is a figure which shows the example of an output of a detection result. 本実施の形態の語彙爆発時期検出装置における語彙爆発時期検出処理ルーチンの内容を示すフローチャートである。It is a flowchart which shows the content of the vocabulary explosion time detection processing routine in the vocabulary explosion time detection apparatus of this Embodiment.

以下、図面を参照して本発明の実施の形態を詳細に説明する。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.

図１に示すように、本実施の形態に係る語彙爆発時期検出装置１０は、種々のデータの入力を受け付ける入力部１２と、語彙爆発の有無を判定する演算部１４と、検出結果を出力する出力部１６と、を備えている。 As shown in FIG. 1, the vocabulary explosion timing detection apparatus 10 according to the present embodiment outputs an input unit 12 that receives input of various data, a calculation unit 14 that determines the presence or absence of a vocabulary explosion, and a detection result. And an output unit 16.

入力部１２は、既知のキーボード、マウス、記憶装置などの入力器により実現され、入力データを受け付ける。 The input unit 12 is realized by an input device such as a known keyboard, mouse, or storage device, and receives input data.

ここで、幼児の語彙爆発の時期を判定するために、どういったデータを参照するかがまずは問題となる。幼児の発話を全てデジタルビデオレコーダーなどの電子メディアで記録可能であれば、それを分析するのが最も高精度な方法といえるが、データ取得にかかるコストは膨大で、かつ幼児の曖昧な発話データを自動で認識し単語レベルで分析する工学的技術もまだ存在しないので、実現は大変難しい。一方、所定期間毎に（例えば、３ヶ月に１度）アンケートに回答してもらい、幼児が新たに発話した単語数の変化を把握する方法もある。この場合、所定期間が長ければ、語彙爆発の時期にさしかかったか否かをリアルタイムに把握するのは困難である。また、所定期間が短ければ、アンケートの回答者（幼児の親）への負担が増大する。従って、現実的には、データを記録する親への負担を軽減しつつ、かつ細かい時間ポイントでデータ取得が可能な方法が望ましい。 Here, in order to determine the timing of the infant's vocabulary explosion, what kind of data is referred to first becomes a problem. If it is possible to record all of the infant's utterances with electronic media such as a digital video recorder, it can be said that the most accurate method is to analyze it, but the cost of data acquisition is enormous and the infant's ambiguous utterance data Since there is no engineering technology that automatically recognizes and analyzes at the word level, it is very difficult to realize. On the other hand, there is also a method in which a questionnaire is answered every predetermined period (for example, once every three months) to grasp a change in the number of words newly spoken by an infant. In this case, if the predetermined period is long, it is difficult to grasp in real time whether or not the vocabulary explosion time has come. In addition, if the predetermined period is short, the burden on the respondent of the questionnaire (the parent of the infant) increases. Therefore, in practice, it is desirable to have a method capable of acquiring data at fine time points while reducing the burden on the parent who records the data.

そこで、本実施の形態では、ウェブ日誌法を利用したデータ取得を適用する。この方法は、幼児が単語を新たに学習（発話）した場合に、ウェブ上の特定のサイトに携帯電話やパーソナルコンピュータからネットワークを介してアクセスし、その日の日誌と共に、幼児が覚えた単語を記録するものである（非特許文献２「小林哲生、永田昌明（２００９）、「ウェブを用いた幼児言語発達研究：大規模縦断データ収集の試み」、言語処理学会第１５回年次大会論文集、ｐ．５３４−５３７．」、非特許文献３「小林哲生、永田昌明（２０１０年３月）、「ウェブ上で収集した幼児語彙発達データの信頼性検証」、言語処理学会第１６回年次大会論文集、ｐ．４０３−４０６．」参照）。この方法の有効性は科学的に検証されている点で非常によい。 Therefore, in the present embodiment, data acquisition using the web diary method is applied. In this method, when an infant learns a new word (speaks), a specific site on the web is accessed via a network from a mobile phone or a personal computer, and the word that the infant remembers is recorded along with the diary of the day. (Non-Patent Document 2 “Tetsuo Kobayashi, Masaaki Nagata (2009),“ Infant Language Development Research Using the Web: Trial of Large-Scale Longitudinal Data Collection ”, Proc. 15th Annual Conference of the Language Processing Society, pp. 534-537., Non-Patent Document 3, “Tetsuo Kobayashi, Masaaki Nagata (March 2010),“ Reliability Verification of Infant Vocabulary Development Data Collected on the Web ”, 16th Annual Conference of the Language Processing Society of Japan See the collection of papers, pages 403-406.). The effectiveness of this method is very good in that it has been scientifically verified.

また、この方法によるデータ取得の利点は、親にとっても比較的容易に記録できる方式でありながら、記録年月日（幼児が新たな単語を覚えた年月日）と幼児の生年月日との差から、幼児が新たな単語を覚えた日齢を算出可能な点である。このように取得されたデータを用いることによって、本実施の形態の語彙爆発時期検出装置１０により、語彙爆発時期にさしかかっているか否かを日齢単位で検出可能になる。 In addition, the advantage of data acquisition by this method is that it is relatively easy for parents to record, but the date of recording (the date when the infant learned a new word) and the date of birth of the infant From the difference, it is possible to calculate the age at which the infant learned a new word. By using the data acquired in this way, the vocabulary explosion time detection device 10 according to the present embodiment can detect whether or not the vocabulary explosion time is approaching, in units of days.

例えば、図２に示すような入力画面５０を入力インターフェースとして入力部１２に設け、データ入力を行う。図２の入力画面５０には、日付入力領域５２と、単語入力領域５４と、生年月日表示領域５６と、登録修正ボタン５８とが設けられている。 For example, an input screen 50 as shown in FIG. 2 is provided in the input unit 12 as an input interface to input data. In the input screen 50 of FIG. 2, a date input area 52, a word input area 54, a date of birth display area 56, and a registration correction button 58 are provided.

日付入力領域５２は、直接入力やプルダウンメニューからの選択により、幼児が新しい単語を発話した日付（単語獲得年月日）を入力可能となっている。また、入力画面５０を開いた際に、その日の日付が初期値として入力されるようにしてもよい。単語入力領域５４には、直接入力により、幼児が新たに覚えた単語の発話及び意味を入力可能となっている。生年月日表示領域５６は、予め登録された幼児の生年月日が表示される。生年月日が未登録の場合、または登録済みの生年月日を修正する場合には、登録修正ボタン５８を押下することにより、生年月日入力画面を表示させ、生年月日の入力を受け付ける。 In the date input area 52, the date (word acquisition date) when the infant spoke a new word can be input by direct input or selection from a pull-down menu. Further, when the input screen 50 is opened, the date of the day may be input as an initial value. In the word input area 54, the utterance and meaning of a word newly learned by the infant can be input by direct input. The date of birth display area 56 displays the date of birth of the infant registered in advance. When the date of birth is not registered, or when the registered date of birth is to be corrected, by pressing the registration correction button 58, the date of birth input screen is displayed and the input of the date of birth is accepted.

このように入力されたデータを受け付けることにより、図３に示すような、いつ（例：２００９年９月１２日）、どんな単語（例：わんわん）をどんな意味（例：犬）で発話したかを表す、生年月日、単語獲得年月日、発話、及び意味で構成されたデータセットが取得される。なお、予め記憶装置に蓄積されたデータセットを取得する形式としてもよい。 By accepting the data entered in this way, as shown in Fig. 3, when (eg: September 12, 2009), what word (eg: doggie) and what meaning (eg: dog) was spoken A data set composed of the date of birth, the date of word acquisition, the utterance, and the meaning is obtained. It should be noted that the data set stored in the storage device in advance may be acquired.

演算部１４は、ＣＰＵ（Central Processing Unit）と、ＲＡＭ（Random Access Memory）と、後述する語彙爆発時期検出処理ルーチンを実行するためのプログラムを記憶したＲＯＭ（Read Only Memory）とを備えたコンピュータで構成されている。演算部１４は、機能的には、単語獲得日齢算出部２０と、直線近似部２２と、変化点検出部２４と、を含んだ構成で表すことができる。なお、変化点検出部２４が、本発明の判定手段の一例である。 The calculation unit 14 is a computer that includes a CPU (Central Processing Unit), a RAM (Random Access Memory), and a ROM (Read Only Memory) that stores a program for executing a vocabulary explosion time detection processing routine described later. It is configured. The calculation unit 14 can be functionally represented by a configuration including a word acquisition age calculation unit 20, a straight line approximation unit 22, and a change point detection unit 24. The change point detection unit 24 is an example of the determination unit of the present invention.

単語獲得日齢算出部２０は、入力部１２から入力されたデータセットの単語獲得年月日と生年月日との差から、それぞれの単語が生後何日目に獲得されたかを示す「獲得日齢」を算出する。例えば、単語獲得年月日が「２００９年９月１２日」、生年月日が「２００８年９月１２日」であれば、獲得日齢＝２００９年９月１２日−２００８年９月１２日＝３６５日齢、と算出することができる。算出された各単語の獲得日齢を昇順に並べ、小さい方から１，２，３，・・・と整数系列を割り当て、累積単語数（何番目に覚えた単語か）を算出する。これにより、獲得日齢と累積単語数との組からなるデータセットが生成される。 The word acquisition age calculation unit 20 indicates the date of acquisition of each word from the difference between the word acquisition date and the date of birth in the data set input from the input unit 12. Calculate age. For example, if the word acquisition date is “September 12, 2009” and the date of birth is “September 12, 2008”, the age of acquisition = September 12, 2009-September 12, 2008 = 365 days old. The calculated acquisition ages of the words are arranged in ascending order, and an integer series of 1, 2, 3,... Is assigned in ascending order to calculate the cumulative number of words (the most remembered word). Thereby, the data set which consists of a set of acquisition age and the number of accumulation words is generated.

なお、入力部１２において直接、獲得日齢と累積単語数との組からなるデータセットを取得する形式としてもよい。この場合、演算部１４において、単語獲得日齢算出部２０の構成を省略することができる。 In addition, it is good also as a format which acquires in the input part 12 the data set which consists of a set of acquisition age and the cumulative number of words directly. In this case, the configuration of the word acquisition age calculation unit 20 can be omitted in the calculation unit 14.

直線近似部２２は、獲得日齢と累積単語数との組からなるデータセットを、１つの直線で近似する。実データを用いた事前検証では，語彙爆発前の語彙学習速度は８０％以上の精度で直線近似できることがわかっている。そこで、累積単語数をｙ軸、獲得日齢をｘ軸とする座標系に各データをプロットし、プロットされた各データポイントのノルムが最小になるような直線を求める。このとき、獲得日齢が大きい方からｋ個のデータポイントを除外し、残りのデータポイントを用いて直線近似を行う。 The straight line approximation unit 22 approximates a data set composed of a combination of the acquired age and the cumulative number of words with one straight line. In prior verification using actual data, it is known that the vocabulary learning speed before the vocabulary explosion can be linearly approximated with an accuracy of 80% or more. Therefore, each data is plotted on a coordinate system in which the cumulative number of words is the y-axis and the acquired age is the x-axis, and a straight line that minimizes the norm of each plotted data point is obtained. At this time, k data points are excluded from those having a larger acquired age, and linear approximation is performed using the remaining data points.

変化点検出部２４では、直線近似部２２により求めた直線と、直線近似部２２で近似直線を求めるときに除外したｋ個のデータポイントとの差が、所定の閾値以上となったときに語彙爆発が起こったと判定する。具体的には、図４に示すように、累積単語数の時系列をｙ_ｉ、獲得日齢をｘ_ｉとし（１≦ｉ≦Ｉ）、ｉ＝ｎ＋ｋまでのプロットが得られたとする。このプロットされたデータポイントの内ｎ番目までのデータポイント（図中白丸）を利用し、下記（１）式により直線近似を仮定し、直線のパラメータａ及びｂを求める。 The change point detection unit 24 uses the vocabulary when the difference between the straight line obtained by the straight line approximation unit 22 and the k data points excluded when the straight line approximation unit 22 obtains the approximate straight line exceeds a predetermined threshold. Determine that an explosion has occurred. Specifically, as shown in FIG. 4, it is assumed that the time series of the cumulative number of words is y _i , the acquired age is x _i (1 ≦ i ≦ I), and plots up to i = n + k are obtained. Using up to nth data points (white circles in the figure) among the plotted data points, straight line approximation is assumed by the following equation (1), and straight line parameters a and b are obtained.

ここでは，ノルムとして二乗ノルムを利用する。次に、この直線と除外したｋ個のデータポイント（図中黒丸）との差分ｄｉｓｔを、下記（２）式により計算する。 Here, the square norm is used as the norm. Next, a difference dist between the straight line and the excluded k data points (black circles in the figure) is calculated by the following equation (2).

なお、ｋ個全てを積算した差分ｄｉｓｔ値を一度に求めるのではなく、まずｉ＝ｎ＋１のときの差分ｄｉｓｔと閾値ｄとを比較し、ｉを１ずつ増やしたときのデータポイントと直線との差分を順番にｄｉｓｔに加算しながら閾値ｄと比較する処理を繰り返すそして、差分ｄｉｓｔが初めて閾値ｄを超えたときに、語彙爆発の時期にさしかかったと判定し、語彙爆発が有ることを示す情報、及び差分ｄｉｓｔが初めて閾値ｄを超えたときのｘ_ｉを語彙爆発の開始日として出力する。なお、ｋ個のデータポイント全てを積算しても差分ｄｉｓｔが閾値ｄを超えなかった場合には、語彙爆発が無い（まだ語彙爆発の時期に到達していない）ことを示す情報を出力する。 In addition, the difference dist value obtained by accumulating all k pieces is not obtained at a time, but the difference dist when i = n + 1 is first compared with the threshold d, and the data point and the straight line when i is increased by 1 are compared. Repeating the process of comparing the threshold value d while sequentially adding the differences to the dist, and when the difference dist exceeds the threshold value d for the first time, it is determined that the timing of the vocabulary explosion has come, and information indicating that there is a vocabulary explosion, And x _i when the difference dist exceeds the threshold d for the first time is output as the start date of the vocabulary explosion. If the difference dist does not exceed the threshold value d even after accumulating all k data points, information indicating that there is no vocabulary explosion (the vocabulary explosion time has not yet been reached) is output.

出力部１６は、ディスプレイ、プリンタ、磁気ディスクなどで実装され、演算部１４での演算結果が出力される。例えば、図５に示すような出力インターフェースに演算結果を出力する。語彙爆発が検出された場合は、その日時を合わせて表示してもよい。また、入力部１２にて所定の日時の入力を受け付け、その日時と検出された語彙爆発の開始日とを比較し、入力された日時が語彙爆発の開始日より前であれば、当該日時において語彙爆発がないことを示す情報を表示し、語彙爆発の開始日であれば、当該日時において語彙爆発にさしかかったことを示す情報を表示してもよい。なお、図５の例では、語彙爆発の開始日は、日齢にて表示されている。 The output unit 16 is mounted with a display, a printer, a magnetic disk, or the like, and the calculation result of the calculation unit 14 is output. For example, the calculation result is output to an output interface as shown in FIG. If a vocabulary explosion is detected, the date and time may be displayed together. Further, the input unit 12 accepts input of a predetermined date and time, compares the date and time with the detected vocabulary start date, and if the input date and time is before the vocabulary start date, Information indicating that there is no vocabulary explosion may be displayed, and if it is the start date of the vocabulary explosion, information indicating that the vocabulary explosion has been reached at that date may be displayed. In the example of FIG. 5, the start date of the vocabulary explosion is displayed in age.

次に、図６を参照して、本実施の形態の語彙爆発時期検出装置１０において実行される語彙爆発時期検出処理ルーチンについて説明する。 Next, a vocabulary explosion time detection processing routine executed in the vocabulary explosion time detection apparatus 10 of the present embodiment will be described with reference to FIG.

ステップ１００で、生年月日、単語獲得年月日、発話、及び意味で構成されたデータセットを取得する。ここでは、ｎ＋ｋ個のデータが取得されたものとする。 In step 100, a data set composed of date of birth, word acquisition date, utterance, and meaning is obtained. Here, it is assumed that n + k pieces of data have been acquired.

次に、ステップ１０２で、上記ステップ１００で取得したデータセットの単語獲得年月日と生年月日との差から、それぞれの単語の獲得日齢を算出する。そして、算出された各単語の獲得日齢を昇順に並べ、小さい方から１，２，３，・・・と整数系列を割り当て、累積単語数を算出する。これにより、獲得日齢と累積単語数との組からなるデータセットを生成する。 Next, in step 102, the acquisition age of each word is calculated from the difference between the date of acquisition and the date of birth of the data set acquired in step 100 above. Then, the calculated acquisition ages of the words are arranged in ascending order, and an integer series of 1, 2, 3,... Is assigned from the smallest to calculate the cumulative number of words. As a result, a data set composed of a set of the acquired age and the cumulative number of words is generated.

次に、ステップ１０４で、上記ステップ１０２で生成した獲得日齢と累積単語数との組からなるデータセットを、累積単語数をｙ軸、獲得日齢をｘ軸とする座標系にプロットし、プロットされたデータポイントのうち、ｎ番目までのデータポイントを用いて、ノルムが最小になるような直線（ｙ＝ａｘ＋ｂ）を求める。 Next, in step 104, the data set consisting of the combination of the acquired age and the cumulative number of words generated in step 102 is plotted on a coordinate system with the cumulative number of words on the y-axis and the acquired age on the x-axis. A straight line (y = ax + b) having a minimum norm is obtained by using up to nth data points among the plotted data points.

次に、ステップ１０６で、変数ｊに１をセットし、次に、ステップ１０８で、上記（１）式に従って、上記ステップ１０４で求めた直線と、（ｎ＋１）番目から（ｎ＋ｊ）番目までのデータポイントとの差分ｄｉｓｔを算出する。 Next, in step 106, 1 is set to the variable j. Next, in step 108, the straight line obtained in step 104 and the data from (n + 1) th to (n + j) th in accordance with the above equation (1). The difference dist from the point is calculated.

次に、ステップ１１０で、上記ステップ１０８で算出した差分ｄｉｓｔが閾値ｄを超えたか否かを判定する。差分ｄｉｓｔ＞閾値ｄの場合には、ステップ１１２へ移行して、データポイント（ｘ_ｎ＋ｊ，ｙ_ｎ＋ｊ）を変化点として検出する。一方、差分ｄｉｓｔ≦閾値ｄの場合には、ステップ１１４へ移行して、ｊ＝ｋとなったか否かを判定する。ｊ≠ｋの場合には、ステップ１１６へ移行して、ｊを１インクリメントして、ステップ１０８へ戻る。一方、ｊ＝ｋの場合には、ｋ個のデータポイント全てを積算しても差分ｄｉｓｔが閾値ｄを超えなかったことを示しているため、ステップ１１８へ移行して、変化点なしを検出結果として出力する。 Next, in step 110, it is determined whether or not the difference dist calculated in step 108 has exceeded a threshold value d. When difference dist> threshold d, the process proceeds to step 112, and the data point (x _{n + j} , y _{n + j} ) is detected as a change point. On the other hand, if difference dist ≦ threshold d, the process proceeds to step 114 to determine whether j = k. If j ≠ k, the process proceeds to step 116, j is incremented by 1, and the process returns to step 108. On the other hand, in the case of j = k, it indicates that the difference dist does not exceed the threshold value d even if all the k data points are integrated, so the process proceeds to step 118 and the detection result indicating no change point is obtained. Output as.

次に、ステップ１２０で、上記ステップ１１２で変化点が検出された場合には、語彙爆発にさしかかっていることを示す情報、及び変化点（ｘ_ｎ＋ｊ，ｙ_ｎ＋ｊ）から得られる語彙爆発の開始日ｘ_ｎ＋ｊを出力する。また、上記ステップ１１８で変化点が検出されていない場合には、語彙爆発が無いことを示す情報を出力して、処理を終了する。 Next, in step 120, if a change point is detected in step 112, information indicating that the vocabulary explosion is about to be reached, and the start date of the vocabulary explosion obtained from the change point (x _{n + j} , y _{n + j} ) _{xn + j} is output. If no change point is detected in step 118, information indicating that there is no vocabulary explosion is output, and the process ends.

実際に、本実施の形態の手法で、ｋ＝５、ｄ＝１．０という値を使って実験を行った。本手法で１５名分の実データで検証を行ったところ，約５３％のデータで語彙爆発日の検出に成功した。つまり，この結果は，語彙学習速度がある１つの変化点で変化することを科学的にも意味していると思われる。 Actually, an experiment was performed using the values of k = 5 and d = 1.0 by the method of the present embodiment. When we verified the actual data for 15 people with this method, we succeeded in detecting the vocabulary explosion date with about 53% of data. In other words, this result seems to mean scientifically that the vocabulary learning speed changes at one change point.

以上説明したように、本実施の形態の語彙爆発時期検出装置によれば、幼児の語彙発達の特徴を捉えて、単語の獲得日齢と累積単語数とのデータセットを、日齢が大きい方から所定個のデータを除いて直線近似し、直線と所定個のデータとの差分が閾値を超える変化点を検出することで語彙爆発の有無を判定するため、個人差も考慮してリアルタイムに語彙爆発の時期にさしかかっているか否かを検出することができる。 As described above, according to the vocabulary explosion timing detection device of the present embodiment, the data set of the acquired age of words and the cumulative number of words is captured for those who have a large age based on the characteristics of infant vocabulary development. In order to determine the presence or absence of a vocabulary explosion by detecting a change point where the difference between the straight line and the predetermined number of data exceeds the threshold, the vocabulary is taken into consideration in real time in consideration of individual differences. It is possible to detect whether or not an explosion is about to occur.

このように、幼児の語彙爆発を正確に且つ迅速に検出することの効果として、（１）語彙爆発前後で変わる発達段階に即した教育の実施、（２）個人の語彙学習速度や特徴に合わせたオーダーメード型教育の実施、（３）言語発達遅滞などの発達障害児の早期発見および支援教育プログラムの開発、などが挙げられる。また語彙発達データの取得の時点からウェブなどで一元的に管理すれば、より効果的な幼児教育や育児支援が可能となり、少子高齢化社会を支えるＩＣＴ技術として、社会および産業に大きな効果をもたらす可能性がある。 As described above, the effects of accurately and quickly detecting the vocabulary explosion of infants are as follows: (1) Implementation of education according to the developmental stage that changes before and after the vocabulary explosion; (2) According to the individual vocabulary learning speed and characteristics (3) Early detection of children with developmental disabilities such as delayed language development and development of support education programs. Moreover, if it is managed centrally on the web from the time of acquisition of vocabulary development data, more effective early childhood education and childcare support will be possible, and it will have a great effect on society and industry as an ICT technology that supports an aging society with fewer children. there is a possibility.

また、本発明は、上記実施の形態に限定されるものではなく、この発明の要旨を逸脱しない範囲内で様々な変形や応用が可能である。 The present invention is not limited to the above-described embodiment, and various modifications and applications are possible without departing from the gist of the present invention.

また、上述の語彙爆発時期推定装置は、内部にコンピュータシステムを有しているが、「コンピュータシステム」は、ＷＷＷシステムを利用している場合であれば、ホームページ提供環境（あるいは表示環境）も含むものとする。 The above-described vocabulary explosion time estimation apparatus has a computer system inside, but the “computer system” includes a homepage provision environment (or display environment) if a WWW system is used. Shall be.

また、本願明細書中において、プログラムが予めインストールされている実施形態として説明したが、当該プログラムを、コンピュータ読み取り可能な記録媒体に格納して提供することも可能である。 In the present specification, the embodiment has been described in which the program is installed in advance. However, the program can be provided by being stored in a computer-readable recording medium.

１０語彙爆発時期検出装置
１２入力部
１４演算部
１６出力部
２０単語獲得日齢算出部
２２直線近似部
２４変化点検出部 DESCRIPTION OF SYMBOLS 10 Vocabulary explosion time detection apparatus 12 Input part 14 Calculation part 16 Output part 20 Word acquisition age calculation part 22 Straight line approximation part 24 Change point detection part

Claims

Of a plurality of data indicating the relationship between the age at which an infant started speaking a new word and the cumulative number of words that the infant started speaking before the age, the one with the larger age Approximating means for approximating a transition of data obtained by removing a predetermined number of data from a straight line,
Based on the difference between the straight line approximated by the approximating means and the predetermined number of data, and a predetermined threshold value, whether or not the data indicating the infant vocabulary explosion time is included in the plurality of data. Determination means for determining;
Vocabulary explosion time detection device including.

The determination means calculates the difference while adding one by one from the smaller data of the predetermined number of data, and is used for calculating the difference when the difference exceeds the threshold value. The lexical explosion timing detection device according to claim 1, wherein the maximum age in the data is detected as the lexical explosion timing of the infant.

A vocabulary explosion time detection method in a vocabulary explosion time detection device including an approximation means and a determination means,
The approximating means includes, among a plurality of data indicating a relationship between an age at which an infant comes to speak a new word and a cumulative number of words at which the infant comes to speak before the age, Approximate the transition of data excluding a predetermined number of data from those with the largest age, with a straight line,
The determination means includes data indicating the vocabulary explosion timing of the infant in the plurality of data based on a difference between the straight line approximated by the approximation means and the predetermined number of data, and a predetermined threshold. A method to detect when a vocabulary explosion occurs.

The determination means calculates the difference while adding one by one from the smaller data of the predetermined number of data, and is used for calculating the difference when the difference exceeds the threshold value. 4. The vocabulary explosion timing detection method according to claim 3, wherein the maximum age in the data is detected as the vocabulary explosion timing of the infant.

A vocabulary explosion time detection program for causing a computer to function as each means constituting the vocabulary explosion time detection device according to claim 1.