JPH10207349A

JPH10207349A - Card type language learning machine and voice comparing system

Info

Publication number: JPH10207349A
Application number: JP967297A
Authority: JP
Inventors: Katsuyasu Shimazaki; 勝康島崎
Original assignee: Mamiya OP Co Ltd
Current assignee: Mamiya OP Co Ltd
Priority date: 1997-01-22
Filing date: 1997-01-22
Publication date: 1998-08-07

Abstract

PROBLEM TO BE SOLVED: To objectively and exactly judge the uttering of learning person. SOLUTION: When a magnetic data reading mechanism part 2 reads the model uttering of magnetic tape 1a, a model uttering feature extracting means 3 extracts the respective features of accent and intonation from this read model uttering. On the other hand, the uttering of learning person is collected from a microphone 4. A learning person uttering feature extracting means 5 extracts the respective features of accent and intonation in the uttering of learning person. A pattern matching means 6 performs pattern matching between the respective extracted features in model uttering and learning person uttering. While using three kinds of LED 8a, 8b and 8c of display panel, for example, a matching display means 7 displays the result of pattern matching.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は語学学習を行うため
のカード式語学学習機および音声比較システムに関し、
特に学習者の発声と模範発声とを比較する機能を有する
カード式語学学習機および音声比較システムに関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a card type language learning machine for performing language learning and a voice comparison system.
In particular, the present invention relates to a card-type language learning machine and a voice comparison system having a function of comparing a student's utterance with a model utterance.

【０００２】[0002]

【従来の技術】従来、カード式語学学習機としては、例
えば実用新案登録第３０２３８７３号に見られるよう
に、プログラムカードと呼ばれる磁気カードをレールに
沿って搬送し、磁気カードに貼られた磁気テープに記録
された模範発声を読み取って、スピーカから出力するよ
うにしたものがある。2. Description of the Related Art Conventionally, as a card type language learning machine, a magnetic card called a program card is conveyed along a rail as shown in Utility Model Registration No. 3023873, and a magnetic tape attached to the magnetic card is used. There is a device that reads a model utterance recorded in a speaker and outputs it from a speaker.

【０００３】このようなカード式語学学習機による学習
の手順は、磁気カードに記録された模範発声をスピーカで聴く。学習者自身がマイクに向かって発声し、磁気カードに
記録する。[0003] In the learning procedure by such a card-type language learning machine, a model utterance recorded on a magnetic card is listened to by a speaker. The learner himself speaks into the microphone and records it on a magnetic card.

【０００４】磁気カードを再生して、模範発声と学習
者自身との発声を聴き比べる。というのが一般的である。そして、学習者は、自分の発
声が模範発声に近くなるように〜を繰り返して練習
する。[0004] By reproducing the magnetic card, the model utterance is compared with the utterance of the learner himself. It is common. Then, the learner practices by repeating ~ to make his or her utterance closer to the model utterance.

【０００５】[0005]

【発明が解決しようとする課題】しかし、従来のカード
式語学学習機では、学習者が自分の発声と模範発声とを
比較するのみなので、両者の違いを客観的にかつ正確に
判断することができなかった。However, in the conventional card type language learning machine, since the learner only compares his / her own utterance with the model utterance, it is not possible to objectively and accurately judge the difference between the two. could not.

【０００６】これに対し、一般のパソコン用の語学教材
ソフトでは、発音の波形をＣＲＴ等に表示して模範発声
と学習者の発声とを比較できるようにしたものがある。
しかし、発声の要素としては、発音以外にも、アクセン
トやイントネーションも含まれている。すなわち、これ
らの要素の調和がとれたときに正しい発声となる。よっ
て、従来のように発音の波形を比べるだけでは、十分と
は言えなかった。[0006] On the other hand, some language teaching software for general personal computers displays a pronunciation waveform on a CRT or the like so that a model utterance can be compared with a student's utterance.
However, utterance elements include accent and intonation as well as pronunciation. That is, when these elements are harmonized, a correct utterance is obtained. Therefore, it is not enough to compare the waveforms of the sounds as in the related art.

【０００７】本発明はこのような点に鑑みてなされたも
のであり、学習者の発声を客観的にかつ正確に判断する
ことのできるカード式語学学習機および音声比較システ
ムを提供することを目的とする。The present invention has been made in view of the above points, and has as its object to provide a card-type language learning machine and a voice comparison system capable of objectively and accurately determining the utterance of a learner. And

【０００８】[0008]

【課題を解決するための手段】本発明では上記課題を解
決するために、磁気カードを使用して語学学習を行うた
めのカード式語学学習機において、模範発声が録音され
た模範発声データ領域を有する磁気カードの磁気データ
を読み取る磁気データ読み取り機構部と、前記学習者発
声を集音するマイクと、前記磁気カードの模範発声のア
クセントおよびイントネーションの各特徴を抽出する模
範発声特徴抽出手段と、前記学習者発声のアクセントお
よびイントネーションの各特徴を抽出する学習者発声特
徴抽出手段と、前記抽出された模範発声の各特徴と前記
学習者発声の各特徴とをパターンマッチングするパター
ンマッチング手段と、前記パターンマッチングの結果を
表示するマットング表示手段と、を有することを特徴と
するカード式語学学習機が提供される。According to the present invention, in order to solve the above-mentioned problems, in a card-type language learning machine for performing language learning using a magnetic card, an exemplary utterance data area in which an exemplary utterance is recorded is stored. A magnetic data reading mechanism for reading the magnetic data of the magnetic card, a microphone for collecting the learner's utterance, an exemplary utterance feature extracting means for extracting each characteristic of accent and intonation of the exemplary utterance of the magnetic card, Learner utterance feature extraction means for extracting each feature of the accent and intonation of the learner utterance; pattern matching means for pattern matching each of the extracted model utterance features and each of the learner utterance features; And a matting display means for displaying a result of the matching.習機 is provided.

【０００９】このようなカード式語学学習機では、磁気
データ読み取り機構部により、磁気カードの磁気データ
を読み取り、模範発声を読み取る。この読み取られた模
範発声から、模範発声特徴抽出手段がそのアクセントお
よびイントネーションの各特徴を抽出する。一方、マイ
クからは、学習者発声が集音される。この集音された学
習者発声から、学習者発声特徴抽出手段がそのアクセン
トおよびイントネーションの各特徴を抽出する。In such a card-type language learning machine, the magnetic data reading mechanism reads magnetic data of a magnetic card and reads model speech. From the read model utterance, the model utterance feature extracting unit extracts each feature of the accent and intonation. On the other hand, learners' utterances are collected from the microphone. From the collected learner utterances, the learner utterance feature extraction means extracts each feature of the accent and intonation.

【００１０】パターンマッチング手段は、抽出された模
範発声の各特徴と学習者発声の各特徴とをパターンマッ
チングする。そして、マッチング表示手段が、パターン
マッチングの結果を表示する。これにより、学習者発声
と模範発声との違いが客観的にかつ正確に評価され、そ
れが一目で確認できる。The pattern matching means performs pattern matching between each feature of the extracted model utterance and each feature of the learner utterance. Then, the matching display means displays the result of the pattern matching. Thereby, the difference between the learner utterance and the model utterance is objectively and accurately evaluated, and it can be confirmed at a glance.

【００１１】[0011]

【発明の実施の形態】以下、本発明の一形態を図面を参
照して説明する。図１は本形態のカード式語学学習機の
機能の概念を示す図である。磁気カード１の磁気テープ
１ａには、少なくとも模範発声の磁気データが記録され
ている。磁気データ読み取り機構部２は、この磁気テー
プ１ａの模範発声を読み取る。模範発声特徴抽出手段３
は、この読み取られた模範発声からアクセントおよびイ
ントネーションの各特徴を抽出する。一方、マイク４か
らは、学習者発声が集音される。学習者発声特徴抽出手
段５は、学習者発声のアクセントおよびイントネーショ
ンの各特徴を抽出する。DESCRIPTION OF THE PREFERRED EMBODIMENTS One embodiment of the present invention will be described below with reference to the drawings. FIG. 1 is a diagram showing the concept of the function of the card type language learning machine of the present embodiment. On the magnetic tape 1a of the magnetic card 1, at least magnetic data of the model utterance is recorded. The magnetic data reading mechanism 2 reads the model utterance of the magnetic tape 1a. Model utterance feature extraction means 3
Extracts the features of accent and intonation from the read model utterance. On the other hand, learners' utterances are collected from the microphone 4. The learner utterance feature extraction means 5 extracts each feature of the accent and intonation of the learner utterance.

【００１２】パターンマッチング手段６は、抽出された
模範発声の各特徴と学習者発声の各特徴とをパターンマ
ッチングする。マッチング表示手段７は、例えば表示パ
ネル８の３種類のＬＥＤ８ａ，８ｂ，８ｃを使用して、
パターンマッチングの結果を表示する。すなわち、マッ
チングの度合いが悪ければＬＥＤ８ａを点灯させ、良け
ればＬＥＤ８ｂを点灯させ、非常に良ければＬＥＤ８ｃ
を点灯させる。The pattern matching means 6 performs pattern matching between each feature of the extracted model utterance and each feature of the learner utterance. The matching display means 7 uses, for example, three types of LEDs 8a, 8b, 8c of the display panel 8,
Displays the result of pattern matching. That is, if the degree of matching is bad, the LED 8a is turned on, if it is good, the LED 8b is turned on, and if it is very good, the LED 8c is turned on.
Lights up.

【００１３】図２は本形態のカード式語学学習機の外観
構成を示す斜視図である。カード式語学学習機１０の上
側ケース１２には、後述する様々な操作を行うための操
作部１３、ケース内部の後述のスピーカ３４からの音声
を出力する音声出力部１４、磁気カード１６用の搬送読
み取り機構部１５等が設けられている。FIG. 2 is a perspective view showing the external configuration of the card type language learning machine of the present embodiment. The upper case 12 of the card-type language learning machine 10 includes an operation unit 13 for performing various operations described later, a sound output unit 14 for outputting a sound from a speaker 34 described later inside the case, and a transport for the magnetic card 16. A reading mechanism 15 and the like are provided.

【００１４】搬送読み取り機構部１５は、そのレールに
磁気カード１６を挿入することにより、図の矢印方向に
磁気カード１６を搬送しながら、磁気テープ１６ａに発
声データを録音したり、読み取ったりする。The transport reading mechanism 15 inserts the magnetic card 16 into the rail, thereby recording and reading utterance data on the magnetic tape 16a while transporting the magnetic card 16 in the direction of the arrow in the figure.

【００１５】操作部１３には、生徒モードボタン２１、
先生モードボタン２２、録音ボタン２３、マッチングス
イッチ２４、ボリューム２５、マイク孔２６、表示部２
７が設けられている。生徒モードボタン２１は、学習者
の発声を録音したり再生したりするときに押すボタンで
ある。この生徒モードボタン２１を録音ボタン２３と同
時に押すことにより、磁気カード１６の磁気テープ１６
ａに学習者の発声を録音することができる。また、生徒
モードボタン２１のみが押されている場合には、磁気カ
ード１６の磁気テープ１６ａに録音された学習者の発声
が、音声出力部１４から出力される。The operation unit 13 includes a student mode button 21,
Teacher mode button 22, Record button 23, Matching switch 24, Volume 25, Microphone hole 26, Display unit 2
7 are provided. The student mode button 21 is a button that is pressed when recording or reproducing a student's utterance. When the student mode button 21 is pressed at the same time as the recording button 23, the magnetic tape 16 of the magnetic card 16 is pressed.
The utterance of the learner can be recorded in a. When only the student mode button 21 is pressed, the learner's utterance recorded on the magnetic tape 16 a of the magnetic card 16 is output from the audio output unit 14.

【００１６】一方、先生モードボタン２２は、磁気テー
プ１６ａに録音された模範発声を聴くときに押すボタン
である。マッチングスイッチ２４は、模範発声と学習者
発声とのパターンマッチングを行うためのスイッチであ
る。このマッチングスイッチ２４がオンになった状態で
先生モードボタン２２が押され、磁気カード１６が読み
取られると、磁気テープ１６ａの模範発声が読み取ら
れ、後述する手順によって模範発声の特徴が抽出され、
内部のメモリに格納される。一方、マッチングスイッチ
２４がオンになった状態で生徒モードボタン２１が押さ
れ、マイク孔２６を介して後述のマイク３５から学習者
発声が集音されると、後述する手順によって学習者発声
の特徴が抽出される。そして、先に格納された模範発声
と学習者発声の特徴が、後述する手法によりパターンマ
ッチングされ、その結果、両者の類似度が表示部２７に
表示される。On the other hand, the teacher mode button 22 is a button to be pressed when listening to the model utterance recorded on the magnetic tape 16a. The matching switch 24 is a switch for performing pattern matching between the model utterance and the learner utterance. When the teacher mode button 22 is pressed with the matching switch 24 turned on and the magnetic card 16 is read, the model utterance of the magnetic tape 16a is read, and the characteristics of the model utterance are extracted by the procedure described later.
Stored in internal memory. On the other hand, when the student mode button 21 is pressed in a state where the matching switch 24 is turned on and a learner's utterance is collected from a microphone 35 to be described later through the microphone hole 26, the characteristics of the learner's utterance are described by a procedure described later. Is extracted. Then, the features of the model utterance and the learner utterance stored earlier are subjected to pattern matching by a method described later, and as a result, the similarity between the two is displayed on the display unit 27.

【００１７】図３は表示部２７の構成の一例を示す図で
ある。この表示部２７には、例えば３つの絵柄の表示
欄、すなわち、丸表示欄２７ａ、２重丸表示欄２７ｂ、
花丸表示欄２７ｃが設けられている。各表示欄２７ａ，
２７ｂ，２７ｃには、内部に図示されていないＬＥＤが
設けられており、そのＬＥＤが点灯することにより、各
絵柄が表示される。ここでは、パターンマッチングの結
果、模範発声と学習者発声の特徴のマッチングの度合い
が低い場合には、丸表示欄２７ａが点灯し、中程度の場
合には２重丸表示欄２７ｂが、高い場合には花丸表示欄
２７ｃが点灯する。FIG. 3 is a diagram showing an example of the configuration of the display unit 27. The display section 27 includes, for example, display fields for three pictures, that is, a circle display field 27a, a double circle display field 27b,
A flower circle display field 27c is provided. Each display column 27a,
LEDs (not shown) are provided inside 27b and 27c, and when the LEDs are turned on, each picture is displayed. Here, as a result of the pattern matching, when the degree of matching between the model utterance and the feature of the learner utterance is low, the circle display field 27a is turned on. When the degree is medium, the double circle display field 27b is high. , The flower circle display field 27c is turned on.

【００１８】図４はカード式語学学習機１０内部のハー
ドウェアの構成を示すブロック図である。制御回路３１
は、例えばロジック回路で構成されている。制御回路３
１は、生徒モードボタン２１、先生モードボタン２２、
録音ボタン２３、マッチングスイッチ２４、図示されて
いないセンサ等の状態に応じて、搬送読み取り機構部１
５を駆動して、カード式語学学習機１０全体の動作制御
を行う。FIG. 4 is a block diagram showing a hardware configuration inside the card type language learning machine 10. As shown in FIG. Control circuit 31
Is composed of, for example, a logic circuit. Control circuit 3
1 is a student mode button 21, a teacher mode button 22,
The transport reading mechanism 1 according to the state of the recording button 23, the matching switch 24, a sensor (not shown), and the like.
5 to control the operation of the entire card type language learning machine 10.

【００１９】音声処理回路３２は、例えばロジック回路
で構成されている。音声処理回路３２は、後述するマッ
チング処理回路、プリアンプ回路、メモリ、周波数変調
回路等を有している。すなわち、音声処理回路３２は、
マイク３５で集音された学習者発声を読み取り、その特
徴（アクセントおよびイントネーション）を抽出する。
その一方で、音声処理回路３２は、制御回路３１からの
指令に応じて、磁気ヘッド１５１によって磁気カード１
６の模範発声データを読み取り、模範発声の特徴（アク
セントおよびイントネーション）を抽出する。そして、
学習者発声の特徴と模範発声の特徴をパターンマッチン
グして、その結果を制御回路３１に送る。The audio processing circuit 32 is constituted by, for example, a logic circuit. The audio processing circuit 32 includes a later-described matching processing circuit, a preamplifier circuit, a memory, a frequency modulation circuit, and the like. That is, the audio processing circuit 32
The learner's utterance collected by the microphone 35 is read, and its features (accent and intonation) are extracted.
On the other hand, the audio processing circuit 32 controls the magnetic card 1 by the magnetic head 151 in response to a command from the control circuit 31.
6 to read the model utterance data and extract the features (accents and intonation) of the model utterance. And
The feature of the learner utterance and the feature of the model utterance are subjected to pattern matching, and the result is sent to the control circuit 31.

【００２０】パターンマッチングの結果を受けた制御回
路３１は、学習者発声の特徴と模範発声の特徴の類似度
を表示部２７によって表示する。また、音声処理回路３
２は、磁気カード１６の模範発声データを読み取ると、
それを増幅してパワーアンプ３３に送る。音声処理回路
３２から送られた模範発声をボリューム２５の調節度合
いに応じて増幅し、スピーカ３４に出力する。なお、マ
イク３５で集音された学習者の発声も、磁気カード１６
に録音したり、スピーカ３４から出力することもでき
る。The control circuit 31 receiving the result of the pattern matching displays the similarity between the feature of the learner's utterance and the feature of the model utterance on the display unit 27. Also, the audio processing circuit 3
2 reads the model utterance data of the magnetic card 16,
It is amplified and sent to the power amplifier 33. The model utterance sent from the audio processing circuit 32 is amplified according to the degree of adjustment of the volume 25 and output to the speaker 34. Note that the learner's utterance collected by the microphone 35 is
, Or output from the speaker 34.

【００２１】図５は音声処理回路３２内のマッチング処
理回路の具体的な構成を示すブロック図である。また、
図６は音声信号入力から特徴パターン抽出までのマッチ
ング処理回路の機能を示す図である。図５のマッチング
処理回路４０には、例えば図６に示すような波形を持つ
模範発声または学習者発声の音声信号５１が入力され
る。入力された音声信号５１は、アクセント検出回路４
１およびイントネーション検出回路４２に送られる。FIG. 5 is a block diagram showing a specific configuration of the matching processing circuit in the audio processing circuit 32. Also,
FIG. 6 is a diagram showing functions of a matching processing circuit from input of an audio signal to extraction of a feature pattern. The matching processing circuit 40 shown in FIG. 5 receives, for example, a model utterance or a learner utterance voice signal 51 having a waveform as shown in FIG. The input audio signal 51 is input to the accent detection circuit 4.
1 and sent to the intonation detection circuit 42.

【００２２】アクセント検出回路４１は、音声信号５１
からアクセントの変化を検出する回路であり、整流器、
平滑ＬＰＦ（ローパスフィルタ）等で構成されている。
アクセント検出回路４１では、音声信号５１の波形を整
流した後、数十Ｈｚ程度のカットオフ周波数を持つＬＰ
Ｆで平滑し、図６のグラフ５２に示すような信号を得
る。ここで、グラフ５２の横軸は時間を、縦軸は検出さ
れた音声振幅の電圧レベルをそれぞれ示している。アク
セントの認識は、音声の強弱の変化が合っていればよい
ので、音声の強弱の大きさまで合っている必要はない。
したがって、アクセントの特徴パターン抽出は、音声の
強弱のレベル圧縮やパターンをモデル化する等の方法で
認識率の向上を図るようにする。The accent detection circuit 41 has a voice signal 51
Is a circuit that detects changes in accent from the rectifier,
It is composed of a smooth LPF (low-pass filter) or the like.
In the accent detection circuit 41, after rectifying the waveform of the audio signal 51, the LP having a cutoff frequency of about several tens Hz
The signal is smoothed by F to obtain a signal as shown in a graph 52 of FIG. Here, the horizontal axis of the graph 52 indicates time, and the vertical axis indicates the voltage level of the detected audio amplitude. The recognition of the accent only needs to match the strength of the voice, so it is not necessary to match the strength of the voice.
Therefore, in the extraction of the accent feature pattern, the recognition rate is improved by a method such as level compression of the voice level or modeling of the pattern.

【００２３】一方、イントネーション検出回路４２は、
音声信号５１からイントネーション波形、すなわち、音
声周波数の高低の変化を検出する回路であり、ＬＰＦ、
零交差波検出回路、Ｆ／Ｖ（周波数／電圧）変換回路等
で構成されている。イントネーション検出回路４２で
は、音声信号５１の基本ピッチ周波数の変化の検出によ
って、図６のグラフ５３のような信号を得る。基本ピッ
チ周波数は、成人の男性で１２５Ｈｚ、女性で２５０Ｈ
ｚ程度である。ここでは、高い方の周波数である２５０
Ｈｚにイントネーションの変化分αを加え、（２５０＋
α）Ｈｚのカットオフ周波数を持つＬＰＦで検出する。
グラフ５３では、処理の都合上Ｆ／Ｖ変換がなされてお
り、縦軸は電圧レベルとなっているが、電圧レベルの変
化は基本ピッチの変化に対応している。On the other hand, the intonation detection circuit 42
This circuit detects an intonation waveform from the audio signal 51, that is, a change in the level of the audio frequency, and includes a LPF,
It is composed of a zero-crossing wave detection circuit, an F / V (frequency / voltage) conversion circuit, and the like. In the intonation detection circuit 42, a signal as shown by a graph 53 in FIG. 6 is obtained by detecting a change in the basic pitch frequency of the audio signal 51. Basic pitch frequency is 125Hz for adult men and 250H for women
about z. Here, the higher frequency, 250
Hz plus the intonation change α, (250+
α) It is detected by an LPF having a cutoff frequency of Hz.
In the graph 53, the F / V conversion is performed for the sake of processing, and the vertical axis indicates the voltage level, but the change in the voltage level corresponds to the change in the basic pitch.

【００２４】一般的に、イントネーションの変化態様は
少ないため、アクセントの場合より容易にモデル化が可
能である。また、発声が途切れたときに検出できなかっ
た基本ピッチは、その前後を滑らかにパターン化する。In general, since the variation of intonation is small, modeling is easier than in the case of accent. The basic pitch that could not be detected when the utterance was interrupted is smoothly patterned before and after the basic pitch.

【００２５】このようにしてアクセントとイントネーシ
ョンの波形が検出されると、図５のマルチプレクサ４３
が両者を直列にしてＡ／Ｄ変換回路４４に送る。Ａ／Ｄ
変換回路４４は、アナログ信号をディジタル信号に変換
する。When the accent and intonation waveforms are detected in this manner, the multiplexer 43 shown in FIG.
Sends them to the A / D conversion circuit 44 in series. A / D
The conversion circuit 44 converts an analog signal into a digital signal.

【００２６】特徴パターン抽出回路４５は、検出された
アクセントおよびイントネーションの波形からそれぞれ
特徴パターンを抽出する。この特徴パターンの抽出方法
としては、一般に、線型予測係数、ＰＡＲＣＯＲ係数、
帯域フィルタ出力、零交差波係数、エネルギー、自己相
関関数等に基づく方法がある。これらの何れを用いても
本形態の実行が可能である。アクセントおよびイントネ
ーションの各特徴パターンを、それぞれ図６の表５４お
よび表５５に示す。The feature pattern extraction circuit 45 extracts a feature pattern from each of the detected accent and intonation waveforms. As a method for extracting the feature pattern, generally, a linear prediction coefficient, a PARCOR coefficient,
There is a method based on band filter output, zero crossing wave coefficient, energy, autocorrelation function, and the like. This embodiment can be executed by using any of these. The respective characteristic patterns of accent and intonation are shown in Tables 54 and 55 of FIG. 6, respectively.

【００２７】図５の特徴パターン抽出回路４５で抽出さ
れた各特徴パターンは、切換スイッチ４６を介して、模
範発声記憶回路４７またはパターンマッチング回路４８
に送られる。すなわち、抽出された特徴パターンが模範
発声の信号のものであれば、制御回路３１側からの指令
により切換スイッチ４６が切り換えられて、模範発声記
憶回路４７に送られ、そこで一時的に記憶される。一
方、抽出された特徴パターンが学習者発声の信号のもの
であれば、パターンマッチング回路４８に直接送られ
る。Each characteristic pattern extracted by the characteristic pattern extraction circuit 45 shown in FIG.
Sent to That is, if the extracted feature pattern is a model utterance signal, the changeover switch 46 is switched by a command from the control circuit 31 and sent to the model utterance storage circuit 47, where it is temporarily stored. . On the other hand, if the extracted feature pattern is a signal of a learner's utterance, it is sent directly to the pattern matching circuit 48.

【００２８】パターンマッチング回路４８は、模範発声
記憶回路４７に記憶された模範発声のアクセントおよび
イントネーションと、学習者発声のアクセントおよびイ
ントネーションとをそれぞれ比較し、両者の類似の度合
いを計算する。ここでは、パターンマッチングとしてＤ
Ｐ（ダイナミックプログラミング）マッチングを使用す
る。The pattern matching circuit 48 compares the accent and intonation of the model utterance stored in the model utterance storage circuit 47 with the accent and intonation of the learner's utterance, and calculates the degree of similarity between the two. Here, the pattern matching is D
Use P (dynamic programming) matching.

【００２９】次に、ＤＰマッチングを用いた本形態のパ
ターンマッチングの具体例について説明する。図７は模
範発声と学習者発声の各アクセントの特徴パターンの比
較方法を示す図である。ここで、模範発声については、
図６で示した表５４の特徴パターンを使用する。この模
範発声の特徴パターンは、図に示すように、サンプリン
グ時間ｔ₀〜ｔ₁₂の間に納まっている。一方、学習者発
声は、これよりもテンポが速く、短い時間間隔ｔ₀〜ｔ
₉の間に納まっている。ＤＰマッチングによれば、模範
発声と学習者発声とで、できるだけ近い値をとるデータ
どうしの誤差を計算していく。ただし、前後が交差して
図の矢印が交差するような計算の仕方は禁止される。Next, a specific example of pattern matching of the present embodiment using DP matching will be described. FIG. 7 is a diagram showing a method of comparing the feature patterns of each accent of the model utterance and the learner utterance. Here, about the model utterance,
The feature pattern of Table 54 shown in FIG. 6 is used. Characteristic pattern of this exemplary utterance, as shown, are accommodated between the sampling time t ₀ ~t _12. On the other hand, learner utterance, this faster tempo than the short time interval t ₀ ~t
Fits between _nine . According to the DP matching, an error between data having values as close as possible between the model utterance and the learner utterance is calculated. However, calculation methods in which the front and rear cross and the arrows in the figure cross are prohibited.

【００３０】この方法で、各データどうしの誤差を計算
し、それらの絶対値の合計（以後、「距離」と呼ぶ）を
計算すると、ここでは０となる。すなわち、図７の例で
は、テンポは異なっても、模範発声と学習者発声のアク
セントは非常に類似していることが分かる。When the error between each data is calculated by this method and the sum of their absolute values (hereinafter, referred to as “distance”) is calculated, it becomes 0 here. That is, in the example of FIG. 7, it can be seen that the accents of the model utterance and the learner utterance are very similar even if the tempo is different.

【００３１】パターンマッチング回路４８は、同様の方
法によって、模範発声と学習者発声のイントネーション
についても計算を行う。そして、各特徴パターンのマッ
チングが終了すると、その類似判定結果を制御回路３１
に送る。The pattern matching circuit 48 calculates the intonation between the model utterance and the learner utterance in the same manner. When the matching of each feature pattern is completed, the similarity determination result is sent to the control circuit 31.
Send to

【００３２】これを受けた制御回路３１は、それに応じ
た表示を表示部２７で行う。例えば、アクセントとイン
トネーションの各距離の平均が９以上ならば、図３で示
した丸表示欄２７ａを点灯させる。また、５以上８以下
ならば、２重丸表示欄２７ｂを点灯させ、４以下ならば
花丸表示欄２７ｃを点灯させる。The control circuit 31 which has received the instruction causes the display unit 27 to perform a display corresponding thereto. For example, if the average of each distance between the accent and the intonation is 9 or more, the circle display field 27a shown in FIG. 3 is turned on. If the number is 5 or more and 8 or less, the double circle display section 27b is turned on. If the number is 4 or less, the flower circle display section 27c is turned on.

【００３３】なお、アクセントとイントネーションを別
個に表示したい場合には、図３の各表示欄２７ａ，２７
ｂ，２７ｃに色の異なるＬＥＤを１対ずつ設け、それぞ
れを上述のような点数配分に応じて点灯させればよい。When it is desired to display accents and intonations separately, display columns 27a and 27 in FIG.
A pair of LEDs having different colors may be provided for b and 27c, and each of them may be turned on according to the above-mentioned point distribution.

【００３４】図８はマッチング処理回路４０による手順
を示すフローチャートである。〔Ｓ１〕本体の操作部１３が先生モードになっているか
否かを判断し、なっていればステップＳ２に進み、なっ
ていなければステップＳ７に進む。〔Ｓ２〕図５で示した切換スイッチ４６を模範発声記憶
回路４７側に切り換える。〔Ｓ３〕模範発声のアクセントおよびイントネーション
を検出する。〔Ｓ４〕検出した信号のＡ／Ｄ変換を行う。〔Ｓ５〕模範発声のアクセントおよびイントネーション
の各特徴パターンを抽出する。FIG. 8 is a flowchart showing a procedure performed by the matching processing circuit 40. [S1] It is determined whether or not the operation unit 13 of the main unit is in the teacher mode. If yes, the process proceeds to step S2, and if not, the process proceeds to step S7. [S2] The changeover switch 46 shown in FIG. 5 is switched to the model utterance storage circuit 47 side. [S3] The accent and intonation of the model utterance are detected. [S4] A / D conversion of the detected signal is performed. [S5] The accent and intonation feature patterns of the model utterance are extracted.

【００３５】〔Ｓ６〕抽出したパターンを模範発声記憶
回路４７に登録する。〔Ｓ７〕切換スイッチ４６をパターンマッチング回路４
８側に切り換える。〔Ｓ８〕学習者発声のアクセントおよびイントネーショ
ンを検出する。〔Ｓ９〕検出した信号のＡ／Ｄ変換を行う。〔Ｓ１０〕学習者発声のアクセントおよびイントネーシ
ョンの各特徴パターンを抽出する。〔Ｓ１１〕模範発声および学習者発声の各特徴パターン
のパターンマッチングを行う。〔Ｓ１２〕両者の類似度を表示させる。[S6] The extracted pattern is registered in the model utterance storage circuit 47. [S7] Set the changeover switch 46 to the pattern matching circuit 4
Switch to 8 side. [S8] The accent and intonation of the learner's utterance are detected. [S9] A / D conversion of the detected signal is performed. [S10] Each feature pattern of the accent and intonation of the learner's utterance is extracted. [S11] Pattern matching of each characteristic pattern of the model utterance and the learner utterance is performed. [S12] The similarity between the two is displayed.

【００３６】このように、本形態では、模範発声および
学習者発声のアクセントおよびイントネーションを比較
して、その類似の度合いを表示するようにしたので、学
習者発声と模範発声との違いを客観的にかつ正確に評価
でき、それを一目で確認することができる。As described above, in the present embodiment, the degree of similarity is displayed by comparing the accent and intonation of the model utterance and the learner's utterance, so that the difference between the learner's utterance and the model utterance is objectively determined. And can be evaluated accurately and at a glance.

【００３７】また、本形態では、パターンマッチングの
方法として、ＤＰマッチングを用いるようにしたので、
学習者発声と模範発声とのテンポが異なっても、正確に
マッチングを行うことができる。In this embodiment, DP matching is used as a pattern matching method.
Even if the learner utterance and the model utterance have different tempos, accurate matching can be performed.

【００３８】なお、本形態では、カード式語学学習機１
０を使用する例を示したが、音声入力ボードを取り付け
ることにより、通常のパソコン等でも本形態の機能の実
行が可能である。このとき、パターンマッチングの処理
はソフトウェアで、また、類似度の表示はモニタ等で行
う。In this embodiment, the card type language learning machine 1
Although the example using 0 is shown, the function of the present embodiment can be executed by a normal personal computer or the like by attaching a voice input board. At this time, the pattern matching process is performed by software, and the similarity is displayed on a monitor or the like.

【００３９】[0039]

【発明の効果】以上説明したように本発明では、模範発
声からそのアクセントおよびイントネーションの各特徴
を抽出する一方、マイクから集音された学習者発声から
そのアクセントおよびイントネーションの各特徴を抽出
し、抽出された模範発声の各特徴と学習者の発声の各特
徴とをパターンマッチングし、そのパターンマッチング
の結果を表示するようにしたので、学習者の発声と模範
発声との違いを客観的にかつ正確に評価することがで
き、それを一目で確認することが可能となる。As described above, according to the present invention, each feature of the accent and intonation is extracted from the model utterance, while each feature of the accent and intonation is extracted from the learner utterance collected from the microphone. Each feature of the extracted model utterance and each feature of the learner's utterance were subjected to pattern matching, and the results of the pattern matching were displayed, so that the difference between the learner's utterance and the model utterance was objectively and Accurate evaluation can be made, and it can be confirmed at a glance.

[Brief description of the drawings]

【図１】本形態のカード式語学学習機の機能の概念を示
す図である。FIG. 1 is a diagram showing a concept of a function of a card-type language learning machine of the present embodiment.

【図２】本形態のカード式語学学習機の外観構成を示す
斜視図である。FIG. 2 is a perspective view showing an external configuration of a card-type language learning machine of the present embodiment.

【図３】表示部の構成の一例を示す図である。FIG. 3 is a diagram illustrating an example of a configuration of a display unit.

【図４】カード式語学学習機内部のハードウェアの構成
を示すブロック図である。FIG. 4 is a block diagram showing a hardware configuration inside the card type language learning machine.

【図５】音声処理回路内のマッチング処理回路の具体的
な構成を示すブロック図である。FIG. 5 is a block diagram showing a specific configuration of a matching processing circuit in the audio processing circuit.

【図６】音声信号入力から特徴パターン抽出までのマッ
チング処理回路の機能を示す図である。FIG. 6 is a diagram illustrating functions of a matching processing circuit from input of an audio signal to extraction of a feature pattern.

【図７】模範発声と学習者発声の各アクセントの特徴パ
ターンの比較方法を示す図である。FIG. 7 is a diagram showing a method of comparing feature patterns of accents of a model utterance and a learner utterance.

【図８】マッチング処理回路による手順を示すフローチ
ャートである。FIG. 8 is a flowchart illustrating a procedure performed by a matching processing circuit.

[Explanation of symbols]

１磁気カード２磁気データ読み取り機構部３模範発声特徴抽出手段４マイク５学習者発声特徴抽出手段６パターンマッチング手段７マッチング表示手段８表示パネル１０カード式語学学習機１３操作部１６磁気カード１６ａ磁気テープ２１生徒モードボタン２２先生モードボタン２３録音ボタン２４マッチングスイッチ２７表示部３１制御回路３２音声処理回路３５マイク４０マッチング処理回路 DESCRIPTION OF SYMBOLS 1 Magnetic card 2 Magnetic data reading mechanism part 3 Model utterance feature extraction means 4 Microphone 5 Learner utterance feature extraction means 6 Pattern matching means 7 Matching display means 8 Display panel 10 Card language learning machine 13 Operation part 16 Magnetic card 16a Magnetic tape 21 Student Mode Button 22 Teacher Mode Button 23 Record Button 24 Matching Switch 27 Display 31 Control Circuit 32 Audio Processing Circuit 35 Microphone 40 Matching Processing Circuit

Claims

[Claims]

1. A card-type language learning machine for performing language learning using a magnetic card, comprising: a magnetic data reading mechanism for reading magnetic data of a magnetic card having a model utterance data area in which a model utterance is recorded; A microphone that collects learner utterances; a model utterance feature extraction unit that extracts each feature of accent and intonation of the model utterance of the magnetic card; a learner utterance that extracts each feature of accent and intonation of the learner utterance A card comprising: a feature extracting unit; a pattern matching unit that performs pattern matching between the extracted model utterance feature and the learner utterance feature; and a matching display unit that displays a result of the pattern matching. Expression language learning machine.

2. The card-type language learning machine according to claim 1, wherein the pattern matching is DP (Dynamic Programming) matching.

3. The card-type language learning machine according to claim 1, wherein the matching display means is configured to display a picture corresponding to the degree of similarity of the matching.

4. A voice comparison system for comparing a model utterance with a learner utterance, comprising: a model utterance data storage unit storing a model utterance; a microphone for collecting a learner utterance; and an accent of the model utterance. Model utterance feature extraction means for extracting each feature of the learner and intonation; learner utterance feature extraction means for extracting each feature of the learner's utterance accent and intonation; the extracted model utterance feature and the learner A voice comparison system comprising: a pattern matching unit that performs pattern matching with an utterance feature; and a matching display unit that displays a result of the pattern matching.