JP2008134455A

JP2008134455A - Speech data reproduction system, speech data reproduction program and utterance practice device

Info

Publication number: JP2008134455A
Application number: JP2006320648A
Authority: JP
Inventors: Junichi Takami; 淳一鷹見; Michiaki Yokota; 道昭横田; Shunichi Toyoshima; 俊一豊島; Masatoshi Takara; 昌利高良
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 2006-11-28
Filing date: 2006-11-28
Publication date: 2008-06-12

Abstract

<P>PROBLEM TO BE SOLVED: To solve the following problem: it is difficult to efficiently practice telephone responses alone. <P>SOLUTION: Speech data stored in a device is classified into the other side (response part) and the own side (practice part) and managed. A flag indicating which part, the response part or the practice part, reproduction of the speech data is to be started from, is prepared, and the starting order is controlled by this flag. <P>COPYRIGHT: (C)2008,JPO&INPIT

Description

本発明は、音声を発したり音を発したりする動作を個人練習するための装置及びプログラムに関する。
より詳細には、電話応対等の練習を個人で行うための装置及びプログラムに関する。 The present invention relates to an apparatus and a program for personally practicing an action of making a sound or a sound.
More particularly, the present invention relates to an apparatus and a program for personal practice of telephone reception and the like.

財団法人日本電信電話ユーザ協会（http://www.jtua.or.jp/）が主催する「電話応対（ＣＴＩ）コンクール」という競技がある。お客様対応担当者の電話応対の技能を研鑽する目的で催されている。
このコンクールは、お客様がどう答えるかが予め課題として決まっているが、そのような応対になるためにどのような受け答えをすればよいのかを、出場選手自身が考え、自分の言葉で応対しなければならない。
一例として、図２９に平成１８年度のコンクールにおける課題を示す。応対者が出場選手自身に相当する。この内容を自分で考え、また自然な応対になるように話さなければならない。 There is a competition called “Telephone Answering (CTI) Competition” organized by Nippon Telegraph and Telephone User Association (http://www.jtua.or.jp/). It is held for the purpose of studying the telephone reception skills of customer service representatives.
In this contest, how to respond to the customer is determined as an issue in advance, but the competitors themselves must think about what kind of response they should make in order to become such a response, and respond in their own words. I must.
As an example, FIG. 29 shows problems in the 2006 competition. The respondent is equivalent to the participating player himself. You must think about this content yourself and speak in a natural way.

なお、電話応対練習ではないが、英会話の練習システムに関する先行技術文献として、特許文献１を示す。
特開２００５−３１２０７号公報 In addition, although it is not telephone answering practice, patent document 1 is shown as prior art literature regarding the practice system of English conversation.
JP 2005-31207 A

さて、電話応対コンクールのために練習することを考える。
もし、人手に頼るとなると、練習者自身以外に、応対者と、練習者の出来の良し悪しを採点する採点者の、合計二名が最低限必要になる。時間計測も必要であるので、時間計測を採点者が兼ねない場合は三名の人手が必要になる。
つまり、コンクール出場を目指す者は、最低二名の協力者を集める必要があった。
当然、人手に頼るのは時間的制約等が大きく、練習者の都合だけではなかなか練習ができない。 Now, consider practicing for the telephone reception competition.
If you rely on human resources, you need at least two people in total, a responder and a grader who scores the quality of the practitioner, in addition to the practitioner himself. Since time measurement is also required, three people are required if the grader does not have time measurement.
In other words, those who wanted to participate in the competition needed to gather at least two collaborators.
Naturally, relying on human resources is very time-consuming, so it is difficult to practice just by the convenience of the practitioner.

本発明はかかる点に鑑みてなされたものであり、練習者一人だけで電話応対等の練習を実現する、音声データ再生装置及び音声データ再生プログラムを提供することを目的とする。 The present invention has been made in view of the above points, and an object of the present invention is to provide an audio data reproducing apparatus and an audio data reproducing program that enable only one trainer to practice telephone answering.

上記課題を解決するための本発明は、複数の音声データ群と、再生音声データを管理するリファレンスパターン管理テーブルと、再生音声データが自分側から開始するか相手側から開始するかを示すフラグとよりなるデータを読み込んで、音声データを再生すると共に、複数の音声データ群をリファレンスパターン管理テーブルの記述に基づいて表示する音声データ装置にかかわるものである。 The present invention for solving the above problems includes a plurality of audio data groups, a reference pattern management table for managing reproduced audio data, and a flag indicating whether the reproduced audio data starts from its own side or from the other side. The present invention relates to an audio data device that reads data comprising the above and reproduces audio data and displays a plurality of audio data groups based on the description of the reference pattern management table.

特許文献１等にあるような、予めお手本が作成済みのシステムとは異なり、電話応対コンクールの場合はお手本が存在しない。つまり、お手本は練習者自身が作成する必要がある。そこで、お手本を簡単に作成し、そのお手本から容易に応対者の部分だけを抜粋して練習ができるようにするために、お手本の音声データを練習者側と応答者側とで明確に区分する仕組みを採用した。 Unlike a system in which a model has been created in advance as in Patent Document 1 and the like, there is no model in the case of a telephone reception competition. In other words, the practitioner himself must create the model. Therefore, in order to make it easy to create a model and to easily extract only the respondent's part from the model, the voice data of the model is clearly divided between the trainer side and the responder side. Adopted a mechanism.

本発明により、装置に蓄積される音声データを相手側（応答パート）と自分側（練習パート）に分けてリファレンスパターンと入力パターンの２系統に分類して管理し、入力パターンをリファレンスパターンに近づけるように反復練習を行うことにより、一人での電話応対練習を効率よく実現可能な、音声データ装置、プログラム及び発話練習装置を提供できる。 According to the present invention, the voice data stored in the apparatus is divided into the other side (response part) and the own side (practice part) and managed by classifying them into two systems of a reference pattern and an input pattern, and the input pattern is brought close to the reference pattern. As described above, it is possible to provide a voice data device, a program, and an utterance practice device capable of efficiently realizing a telephone answering practice alone by performing repeated practice.

以下、本発明の実施の形態を、図１〜図２８を参照して説明する。 Hereinafter, embodiments of the present invention will be described with reference to FIGS.

［外観］
図１は、本発明の実施の形態にかかる、電話応対練習支援プログラム（以下「練習支援ソフト」と略）を起動した時の状態を示す図である。なお、図１は各部分の名称を表示する関係上、後述するリファレンスパターン表示エリア１５１、入力パターン表示エリア１５２及び詳細表示エリア１５３が白地になっているが、実際は図２、図３、図４、図５に示すように中間色で表示される。例えば灰色地である。
練習支援ソフト１０１はマイクロソフト社のＷｉｎｄｏｗｓ（登録商標）が稼動するパソコンにて実行できるアプリケーションプログラムである。練習支援ソフト１０１はその性格上、パソコンにオーディオインターフェースが装備されていることを要する。後述するように、使用者等が電話応対の練習等で発声した音声を録音したり、録音済み音声データファイルを再生して、上達の度合い等を確認する等の作業において、パソコンは音声の録音及び再生機能を要するからである。 [appearance]
FIG. 1 is a diagram showing a state when a telephone answering practice support program (hereinafter abbreviated as “practice support software”) according to an embodiment of the present invention is activated. In FIG. 1, the reference pattern display area 151, the input pattern display area 152, and the detailed display area 153, which will be described later, are white because of displaying the names of the respective parts. As shown in FIG. 5, it is displayed in an intermediate color. For example, gray ground.
The practice support software 101 is an application program that can be executed on a personal computer running Windows (registered trademark) of Microsoft Corporation. The practice support software 101 requires that the personal computer is equipped with an audio interface due to its nature. As will be described later, PCs record voices when users record voices during telephone reception exercises, play recorded voice data files, and check their progress. This is because a reproduction function is required.

練習支援ソフト１０１はタスクファイルを読み込み、表示画面１０２上にその内容を展開して表示する。
そして、表示した内容に対し、「リファレンス再生」ボタン１０３及び「入力再生」ボタン１０４を押すことにより、音声の再生を実行する。
タスクファイルは、複数の音声データファイルと、その再生順番を示すテーブルデータ等よりなる。タスクファイルの詳細は後述する。 The practice support software 101 reads the task file and expands and displays the contents on the display screen 102.
Then, by pressing the “reference playback” button 103 and the “input playback” button 104 with respect to the displayed contents, the audio is played back.
The task file includes a plurality of audio data files, table data indicating the reproduction order, and the like. Details of the task file will be described later.

表示画面１０２中には大きく三つの領域がある。
一つは、リファレンスパターン表示エリア１５１である。
リファレンスパターン表示エリア１５１は、電話応対のお手本の音声データが表示される領域である。
もう一つは、入力パターン表示エリア１５２である。
入力パターン表示エリア１５２は、電話応対の練習を行った時の音声データが表示される領域である。
最後に、詳細表示エリア１５３がある。
詳細表示エリア１５３は、リファレンスパターン表示エリア１５１及び入力パターン表示エリア１５２の、現在の再生位置にかかる音声データを、時間軸を拡大して表示すると共に、後述する類似度スコア表示を行う領域である。また、後述するテキスト表示モードが有効になっている時は、会話の内容を示す文章（テキスト）が表示される。 There are three main areas in the display screen 102.
One is a reference pattern display area 151.
The reference pattern display area 151 is an area where voice data of a model for telephone reception is displayed.
The other is an input pattern display area 152.
The input pattern display area 152 is an area in which audio data when a telephone answering practice is performed is displayed.
Finally, there is a detailed display area 153.
The detail display area 153 is an area for displaying the audio data relating to the current reproduction position in the reference pattern display area 151 and the input pattern display area 152 while enlarging the time axis and displaying a similarity score which will be described later. . In addition, when a text display mode to be described later is valid, a sentence (text) indicating the content of the conversation is displayed.

「タスク新規作成」ボタン１０５は、タスクファイルを新規に作成するためのボタンである。
「タスク読み込み」ボタン１０６は、既存のタスクファイルを読み込むためのボタンである。
「タスク書き出し」ボタン１０７は、作成した音声データ等からタスクファイルを作成するためのボタンである。
タスクファイル名表示欄１０８は、読み込んだタスクファイルの名前が表示される領域である。 The “create new task” button 105 is a button for creating a new task file.
The “task read” button 106 is a button for reading an existing task file.
The “task export” button 107 is a button for creating a task file from the created audio data or the like.
The task file name display field 108 is an area where the name of the read task file is displayed.

「録音」ボタン１０９は、電話応対のお手本である、リファレンスパターンを録音するためのボタンである。
リファレンスパターンを録音すると、リファレンスパターン表示エリア１５１にその内容が表示される。
「連続」チェックボックス１１０は、リファレンスパターンの録音を連続して行うか否かを選択するチェックボックスである。
「リファレンス再生」ボタン１０３は、録音したリファレンスパターンを再生するためのボタンである。 The “record” button 109 is a button for recording a reference pattern, which is an example of telephone reception.
When the reference pattern is recorded, the content is displayed in the reference pattern display area 151.
The “continuous” check box 110 is a check box for selecting whether to continuously record the reference pattern.
The “reference reproduction” button 103 is a button for reproducing the recorded reference pattern.

図２は、空のパートを幾つか作成した状態の図である。
図１の「パート追加」ボタン１１１を押すと、リファレンスパターン表示エリア１５１と入力パターン表示エリア１５２に次々と四角が表示される。この四角がパートである。
淡色にて表示されるパートは練習パート２０２である。練習パート２０２は、実際の表示画面では例えば白地にて表示される。これは、電話応対をするオペレータ、つまり練習支援ソフト１０１の使用者自身の発話部分である。
濃色にて表示されるパートは応答パート２０３である。応答パート２０３は、実際の表示画面では例えば群青地にて表示される。これは、電話応対をする相手の発話部分である。つまり、電話口の向こう側にいる顧客である。 FIG. 2 shows a state in which some empty parts are created.
When the “part addition” button 111 in FIG. 1 is pressed, squares are displayed one after another in the reference pattern display area 151 and the input pattern display area 152. This square is the part.
The part displayed in light color is the practice part 202. The practice part 202 is displayed on, for example, a white background on the actual display screen. This is an utterance portion of the operator who answers the telephone, that is, the user of the practice support software 101.
The part displayed in dark color is the response part 203. The response part 203 is displayed in, for example, an ultramarine background on the actual display screen. This is the utterance part of the other party answering the call. In other words, it is a customer behind the phone.

再び図１に戻る。
「パート追加」ボタン１１１は、カーソルで選択されているパートの直後に新たにパートを追加するためのボタンである。
「パート挿入」ボタン１１２は、カーソルで選択されているパートの直前に新たにパートを挿入するためのボタンである。
「パート削除」ボタン１１３は、カーソルで選択されているパートを削除するためのボタンである。
「ＭＰ３作成」ボタン１１４は、作成したリファレンスパターンからＭＰ３（MPEG Audio Layer-3：映像データ圧縮方式のＭＰＥＧ−１で利用される音声圧縮方式の一つ。）ファイルを生成するためのボタンである。これは携帯型音楽再生機器等に当該ファイルを取り込んで練習するためのものである。 Returning again to FIG.
The “part addition” button 111 is a button for newly adding a part immediately after the part selected by the cursor.
The “part insertion” button 112 is a button for inserting a new part immediately before the part selected by the cursor.
The “part delete” button 113 is a button for deleting the part selected by the cursor.
The “Create MP3” button 114 is a button for generating an MP3 (MPEG Audio Layer-3: one of audio compression methods used in the MPEG-1 video data compression method) file from the created reference pattern. . This is for practicing by taking the file into a portable music player or the like.

「練習開始」ボタン１１５は、電話応対の練習を行うためのボタンである。入力パターン表示エリア１５２内の表示に対応して、応答パートの再生と、練習パートの録音を交互に実行する。
「入力再生」ボタン１０４は、録音した練習パターンを再生する。
「応答速度」変更欄１１６は、練習において入力パターン内の応答パートの再生速度を変更するための数値変更欄である。 The “practice start” button 115 is a button for practicing telephone reception. Corresponding to the display in the input pattern display area 152, playback of the response part and recording of the practice part are executed alternately.
The “input playback” button 104 plays back the recorded practice pattern.
The “response speed” change field 116 is a numerical value change field for changing the playback speed of the response part in the input pattern in practice.

「評価」ボタン１１７は、詳細表示エリア１５３においてリファレンスパターン詳細表示エリア１５４中のパートと入力パターン詳細表示エリア１５６中のパートとの比較を行い、その結果を表示するためのボタンである。
「比較再生」ボタン１１８は、リファレンスパターン詳細表示エリア１５４中のパートと入力パターン詳細表示エリア１５６中のパートの時間軸が対応する部分を同時に再生するためのボタンである。この時間軸の対応関係は、時間対応付けエリア１５５に表示されている結果から求められる。
「自動評価」チェックボックス１１９は、パートの選択操作が行われ、詳細表示エリア１５３においてリファレンスパターン詳細表示エリア１５４中のパートと入力パターン詳細表示エリア１５６中のパートが書き変えられた時点で自動的に「評価」ボタン１１７を押した場合と同じ動作を実行するためのチェックボックスである。
「テキスト表示」チェックボックス１２０は、詳細表示エリア１５３に現在のパートのテキストを表示するためのチェックボックスである。このチェックボックスをチェック（マウスでクリックして「レ」表示をさせる）すると、詳細表示エリア１５３全体がテキスト表示画面に切り替わる。 The “evaluation” button 117 is a button for comparing the part in the reference pattern detail display area 154 with the part in the input pattern detail display area 156 in the detail display area 153 and displaying the result.
The “comparison playback” button 118 is a button for simultaneously playing back the part corresponding to the time axis of the part in the reference pattern detail display area 154 and the part in the input pattern detail display area 156. This time axis correspondence is obtained from the result displayed in the time association area 155.
The “automatic evaluation” check box 119 is automatically selected when a part selection operation is performed and the part in the reference pattern detail display area 154 and the part in the input pattern detail display area 156 are rewritten in the detail display area 153. This is a check box for executing the same operation as when the “evaluation” button 117 is pressed.
The “text display” check box 120 is a check box for displaying the text of the current part in the detail display area 153. When this check box is checked (clicking with the mouse to display “Re”), the entire detail display area 153 is switched to the text display screen.

「メトロノーム」チェックボックス１２１は、練習支援ソフト１０１の付加機能である、メトロノーム音を発生するためのチェックボックスである。このチェックボックスをチェックすると、練習における練習パートの録音のときに、その横にあるテンポ設定欄１２５にて設定したテンポのクリック音を発生する。なお、クリック音はリファレンスパターンの録音及び入力パターンの練習においては録音されない。 The “metronome” check box 121 is a check box for generating a metronome sound, which is an additional function of the practice support software 101. If this check box is checked, a click sound of the tempo set in the tempo setting field 125 next to the practice part is recorded during practice. The click sound is not recorded during recording of the reference pattern and practice of the input pattern.

タイマチェックボックス１２２は、練習中にリファレンスパターン中の練習パートの再生時間から逆算して、残り何分何秒かを表示する機能を選択するチェックボックスである。
電話応対の会話は早過ぎても遅すぎても良くない。つまり、お手本の会話のタイミングを忠実に再現できることが望ましい。そこで、このようなタイマ機能を設け、練習をする使用者が会話のタイミングを体に覚え込ませるのである。
「制限時間」設定欄１２３は、練習の際の全体の経過時間を規定する目安となる時間を設定する。つまり、総経過時間を設定する。この制限時間を設定すると、総経過時間が設定値の９０％を越えた時点で、タイマ表示欄１２４のバックが黄色に変化する。そして、総経過時間が設定値を越えた時点で、タイマ表示欄１２４のバックが赤色に変化する。
タイマ表示欄１２４は、二つの経過時間を示す。
スラッシュ（「/」)を境に左側の表示欄は、現在再生中或は録音中のパート内の経過時間を表示する。
スラッシュ（「/」)を境に右側の表示欄は、動作モードによって表示内容が異なる。
リファレンスパターン及び入力パターンの再生時においては、各パターンの先頭から現在再生中の位置までの累積経過時間を表示する。
リファレンスパターンにおける１パートのみの録音時においては、録音中のパートの経過時間に、当該パートを除く全パートの再生時間の総和を表示する。
リファレンスパターンにおける連続モードの録音時においては、録音中のパートの経過時間に、当該パートより前方のパートの再生時間の総和を表示する。
練習実行時においては、録音或は再生中のパートの経過時間に、当該パートより前方のパートの再生時間の総和を表示する。
現在再生或は録音中のパートの経過時間は、タスク制御部９０８が、録音の際にはＡ／Ｄ変換部９１０の、再生の際にはＤ／Ａ変換部９１１の、それら内部にある図示しないリングバッファを見て、音声データをバイト単位で計数することで、正確な経過時間を得ている。 The timer check box 122 is a check box for selecting a function for displaying the remaining minutes and seconds by calculating backward from the playback time of the practice part in the reference pattern during practice.
The telephone conversation can be too early or too late. In other words, it is desirable that the timing of the model conversation can be faithfully reproduced. Therefore, such a timer function is provided so that the user who is practicing remembers the timing of the conversation.
The “time limit” setting field 123 sets a time that serves as a guideline for defining the total elapsed time during practice. That is, the total elapsed time is set. When this time limit is set, the back of the timer display field 124 turns yellow when the total elapsed time exceeds 90% of the set value. When the total elapsed time exceeds the set value, the back of the timer display field 124 changes to red.
The timer display column 124 shows two elapsed times.
The display column on the left side of the slash (“/”) indicates the elapsed time in the part currently being played back or recorded.
The display content on the right side of the slash (“/”) boundary differs depending on the operation mode.
When the reference pattern and the input pattern are reproduced, the accumulated elapsed time from the beginning of each pattern to the position currently being reproduced is displayed.
When only one part is recorded in the reference pattern, the total playback time of all parts except the part is displayed as the elapsed time of the part being recorded.
During continuous mode recording in the reference pattern, the total playback time of the part ahead of the part is displayed as the elapsed time of the part being recorded.
When practicing, the total playback time of the part ahead of the part is displayed as the elapsed time of the part being recorded or played.
The elapsed time of the part currently being played back or recorded is indicated by the task control unit 908 inside the A / D conversion unit 910 during recording and the D / A conversion unit 911 during playback. The exact elapsed time is obtained by looking at the ring buffer and counting the audio data in bytes.

「終了」ボタン１３０は、練習支援ソフト１０１を終了させるためのボタンである。
時間軸変更ボタン１２６、１２７及び１２８は夫々、リファレンスパターン表示エリア１５１、入力パターン表示エリア１５２及び詳細表示エリア１５３の波形表示の時間軸の幅を変更する。 The “end” button 130 is a button for ending the practice support software 101.
The time axis change buttons 126, 127, and 128 change the width of the time axis of the waveform display in the reference pattern display area 151, the input pattern display area 152, and the detailed display area 153, respectively.

ここで、練習支援ソフト１０１の概略を説明する。
練習支援ソフト１０１は、練習者が単独で電話応対を練習するために、予め自分でお手本を録音する。
そして、そのお手本の自分側を練習する。
お手本は、電話応対者側と電話応答者側の音声データの集合体よりなる。会話は交互に行われるため、電話応対者側の発話の後に電話応答者側の発話が続き、その後は電話応対者側、電話応答者側…と、音声データが配置される。
この、お手本における抽象的な概念を「リファレンスパターン」と呼ぶ。
そこで、音声データを、電話応対者側と電話応答者側と交互に配置するテーブルを用意する。
これが、リファレンスパターン管理テーブルである。リファレンスパターン管理テーブルは、リファレンスパターンの概念の中心である。 Here, an outline of the practice support software 101 will be described.
The practice support software 101 records a model in advance so that the practitioner can practice the telephone answering alone.
Then practice your side of the example.
The model consists of a collection of voice data on the telephone responder side and the telephone responder side. Since the conversations are alternately performed, the utterance on the telephone responder side follows the utterance on the telephone responder side, and thereafter, voice data is arranged on the telephone responder side, the telephone responder side, and so on.
This abstract concept in the model is called a “reference pattern”.
Therefore, a table is prepared in which voice data is alternately arranged on the telephone responder side and the telephone responder side.
This is the reference pattern management table. The reference pattern management table is the center of the concept of reference patterns.

お手本を基に練習をする際には、電話応答者側、つまり相手の音声データはお手本から持ち込み、電話応対者側、つまり自分はお手本を聞いて応対の練習をする。練習の結果は録音して、後から聞き直し、改善点を自己判断する。
お手本を基に練習をするために、リファレンスパターンとは別の抽象的な概念として、「入力パターン」を導入する。
リファレンスパターン管理テーブルの内容を複写し、練習パートの部分を空にする。この対応関係を、リファレンスパターン管理テーブルと同様に記述するテーブルを用意する。これが入力パターン管理テーブルである。
練習は、入力パターン管理テーブルの記述に基づき、複写された応答パートを再生し、空になった練習パートに録音を行う。 When practicing on the basis of a model, the voice responder side, that is, the other party's voice data is brought from the model, and the telephone answerer side, that is, oneself listens to the model and practices the answering. Record the results of the practice, listen to them later, and self-judge improvement points.
In order to practice on the basis of an example, an “input pattern” is introduced as an abstract concept different from the reference pattern.
Copy the contents of the reference pattern management table and empty the practice part. A table describing this correspondence is prepared in the same manner as the reference pattern management table. This is the input pattern management table.
In practice, the copied response part is reproduced based on the description of the input pattern management table, and recording is performed on the practice part which has become empty.

これより、図１から図５迄を用いて、練習支援ソフト１０１の基本的な使い方の概要を説明する。
先ず、練習支援ソフト１０１を起動すると、利用履歴にタスクファイルが存在していれば、それを読み込んで図３のように表示する。利用履歴がないか、あっても当該タスクファイルが存在していなければ、初期状態として図１のように表示する。
図１の状態から、「パート挿入」ボタン１１２或は「パート追加」ボタン１１１を押すと、図２のように練習パートと応答パートが次々と追加される。
図２の状態から「録音」ボタン１０９を押して録音を行うと、リファレンスパターンの録音が行われ、リファレンスパターン表示エリア１５１が図３のような状態になる。なお、リファレンスパターンの録音が完遂すると、入力パターン表示エリア１５２中の応答パートにはリファレンスパターン中の応答パートと同じ内容が表示される。
なお、図３ではカーソルが１番目のパート（練習パート）を指し示しているので、リファレンスパターン中の第１番目のパートの音声データが、リファレンスパターン詳細表示エリア１５４に表示されている。
図３の状態から「練習」ボタンを押して練習を開始すると、入力パターン表示エリア１５２中の練習パートについては録音が、入力パターン表示エリア１５２中の応答パートについては再生が行われ、その結果、図４のような状態になる。
なお、図４ではカーソルが１番目のパート（練習パート）を指し示しているので、リファレンスパターン中の第１番目のパートの音声データが、リファレンスパターン詳細表示エリア１５４に表示されている。また、入力パターン中の第１番目のパートの音声データが、入力パターン詳細表示エリア１５６に表示されている。
図４の状態から「評価」ボタン１１７を押すと、評価が実行され、図５のような表示状態になる。
図５ではカーソルが１番目のパート（練習パート）を指し示しているので、リファレンスパターン中の第１番目のパートの音声データと、入力パターン中の第１番目のパートの音声データを比較している。
その結果、時間対応付け表示エリア１５５には、双方の音声データの時間軸の対応付けが細かい線にて表示される。
また、双方の音声データの類似度が計算され、その結果が類似度スコア表示エリア１５７に表示される。 Hereafter, the outline of the basic usage of the practice support software 101 will be described with reference to FIGS.
First, when the practice support software 101 is activated, if a task file exists in the usage history, it is read and displayed as shown in FIG. If there is no usage history or the task file does not exist, the initial state is displayed as shown in FIG.
When the “insert part” button 112 or the “add part” button 111 is pressed from the state of FIG. 1, practice parts and response parts are added one after another as shown in FIG.
When recording is performed by pressing the “record” button 109 from the state of FIG. 2, the reference pattern is recorded, and the reference pattern display area 151 is in a state as shown in FIG. When the recording of the reference pattern is completed, the same content as the response part in the reference pattern is displayed in the response part in the input pattern display area 152.
In FIG. 3, since the cursor points to the first part (practice part), the audio data of the first part in the reference pattern is displayed in the reference pattern detail display area 154.
When the “practice” button is pressed from the state of FIG. 3 to start the practice, the practice part in the input pattern display area 152 is recorded, and the response part in the input pattern display area 152 is played back. It becomes a state like 4.
In FIG. 4, since the cursor points to the first part (practice part), the audio data of the first part in the reference pattern is displayed in the reference pattern detail display area 154. Further, the audio data of the first part in the input pattern is displayed in the input pattern detail display area 156.
When the “evaluation” button 117 is pressed from the state of FIG. 4, the evaluation is executed and the display state as shown in FIG. 5 is obtained.
In FIG. 5, since the cursor points to the first part (practice part), the audio data of the first part in the reference pattern is compared with the audio data of the first part in the input pattern. .
As a result, in the time association display area 155, the association of the time axes of both audio data is displayed with fine lines.
In addition, the similarity between both audio data is calculated, and the result is displayed in the similarity score display area 157.

図６はリファレンスパターン表示エリア１５１と入力パターン表示エリア１５２を一部拡大した図である。
リファレンスパターン表示エリア１５１と入力パターン表示エリア１５２は、灰色の背景色にパートが表示される。
練習パートは淡色の下地に濃色の波形が表示される。例えばクリーム色の下地に群青色の波形である。
応答パートは練習パートとは反対に、濃色の下地に淡色の波形が表示される。例えば群青色の下地にクリーム色の波形である。
各々のパートに表示されている波形は、対応する音声データファイルの中身を可視化したものである。表示波形の縦軸は各時刻における振幅値に対応し、横軸は、相対的な時間に対応する。
各パートの左下には注目される色として、例えば赤色にて番号が表示されている。これがパート番号である。
入力パターン表示エリア１５２中の練習パートにおいて「ｎｏｄａｔａ」と表示されている（「ｎｏｄａｔａ」表示６０２）のは、対応する音声データファイルがないことを示す。「ｎｏｄａｔａ」表示パートは、練習支援ソフトのデフォルト値である、一定の幅で表示される。
リファレンスパターン表示エリア１５１と入力パターン表示エリア１５２の１番目のパートは、灰色地に表示される他のパートと異なり、その周囲が淡色（例えば黄色）に表示されている。これがカーソル表示である。
カーソルは、後述する録音／練習／再生を開始する際の基準位置や評価の対象となるパートの位置を示す。
カーソルとは別に、現在録音中或は再生中のパートは、注目される色で色分けして表示される。例えば、録音の場合は赤色に、再生の場合は緑色に、夫々色分けして表示する。
カーソルの変更は、録音／練習／再生等を行っていない状態において、リファレンスパターン表示エリア１５１或は入力パターン表示エリア１５２の中に表示されているパートをマウスで左クリックすることにより、行う。つまり、カーソルの位置はユーザの発意によってのみ変更される。 FIG. 6 is a partially enlarged view of the reference pattern display area 151 and the input pattern display area 152.
In the reference pattern display area 151 and the input pattern display area 152, parts are displayed on a gray background color.
In the practice part, a dark waveform is displayed on a light background. For example, it is an ultramarine waveform on a cream base.
In contrast to the practice part, the response part displays a light waveform on a dark background. For example, it is a cream-colored waveform on an ultramarine base.
The waveform displayed in each part is a visualization of the contents of the corresponding audio data file. The vertical axis of the display waveform corresponds to the amplitude value at each time, and the horizontal axis corresponds to relative time.
A number is displayed in the lower left of each part, for example, in red as a noticed color. This is the part number.
“No data” displayed in the practice part in the input pattern display area 152 (“no data” display 602) indicates that there is no corresponding audio data file. The “no data” display part is displayed with a certain width, which is the default value of the practice support software.
The first part of the reference pattern display area 151 and the input pattern display area 152 is displayed in a light color (for example, yellow), unlike other parts displayed in gray. This is the cursor display.
The cursor indicates the reference position when starting recording / practice / playback, which will be described later, and the position of the part to be evaluated.
Apart from the cursor, the part currently being recorded or reproduced is displayed in different colors according to the color of interest. For example, it is displayed in red for recording and green for reproduction.
The cursor is changed by left-clicking the part displayed in the reference pattern display area 151 or the input pattern display area 152 with the mouse when recording / practice / playback is not performed. That is, the position of the cursor is changed only by the user's intention.

［データ構造と表示内容と内部の機能との関係］
図７は、電話応対練習支援装置の内部データの構造を示す。
図８は、電話応対練習支援装置の内部データと、リファレンスパターン表示エリア１５１及び入力パターン表示エリア１５２の表示内容との対応関係を示す。
図９は、電話応対練習支援装置の機能ブロック図を示す。 [Relationship between data structure, display contents, and internal functions]
FIG. 7 shows the internal data structure of the telephone answering practice support apparatus.
FIG. 8 shows a correspondence relationship between internal data of the telephone answering practice support device and display contents of the reference pattern display area 151 and the input pattern display area 152.
FIG. 9 shows a functional block diagram of the telephone answering practice support apparatus.

先ず、図７と図９とを併用して説明する。
練習支援ソフト１０１はパソコンに読み込まれ実行されることで、電話応対練習支援装置９０１を構成する。
周知のハードディスク装置やフラッシュメモリ等よりなる不揮発性ストレージ９０２に格納されているタスクファイル９０３は、図７に示されている各々のデータのうち、音声データ７０２とリファレンスパターン管理テーブル７０１とテキストデータ７０４が格納されている。タスクファイル９０３は圧縮アーカイブファイルであり、複数のファイルを圧縮して一つのファイルを形成するものである。一例としては、ＬＨＡ（http://ja.wikipedia.org/wiki/LHA）等がある。
音声データ７０２は、音声データファイル９０４の集合体である。リファレンスパターン表示エリア１５１及び入力パターン表示エリア１５２の各パートに対応する音声データファイルであり、一例としてはｗａｖファイルである。なお、タスクファイル９０３にはリファレンスパターン管理テーブル７０１に対応する音声データファイルのみ格納される。 First, FIG. 7 and FIG. 9 will be described together.
The practice support software 101 is loaded into a personal computer and executed to constitute a telephone answering practice support apparatus 901.
A task file 903 stored in a non-volatile storage 902 made up of a known hard disk device, flash memory, or the like includes an audio data 702, a reference pattern management table 701, and text data 704 among the respective data shown in FIG. Is stored. The task file 903 is a compressed archive file, and a plurality of files are compressed to form one file. An example is LHA (http://en.wikipedia.org/wiki/LHA).
The audio data 702 is a collection of audio data files 904. It is an audio data file corresponding to each part of the reference pattern display area 151 and the input pattern display area 152, and as an example is a wav file. Note that only the audio data file corresponding to the reference pattern management table 701 is stored in the task file 903.

リファレンスパターン管理テーブル７０１は、リファレンスパターン表示エリア１５１に表示される各パートの音声データファイル９０４のファイル名が列挙されている。なお、タスクファイル９０３に格納される際には、第１パートのファイル名が記載される行、つまり先頭行に記載されているファイル名の後に、半角スペースを一つ挟んで、表示順フラグ７０５の「０」或は「１」が記載された後、プレーンテキストファイルの形式にて書き出される。
表示順フラグ７０５は、リファレンスパターン及び入力パターンが練習パートから始まるのか応答パートから始まるのかを示すフラグである。「０」で練習パートから、「１」で応答パートから始まることを示す。 The reference pattern management table 701 lists the file names of the audio data files 904 for each part displayed in the reference pattern display area 151. When the file is stored in the task file 903, the display order flag 705 includes a line where the file name of the first part is described, that is, the file name described in the first line, and a single space. After “0” or “1” is written, it is written in the form of a plain text file.
The display order flag 705 is a flag indicating whether the reference pattern and the input pattern start from the practice part or the response part. “0” indicates that the practice part starts, and “1” indicates that the response part starts.

リファレンスパターン管理テーブル７０１をタスクファイル９０３に格納する際には、第１パートのファイル名が記載される行、つまり先頭行に記載されているファイル名の後に、半角スペースを一つ挟んで、表示順フラグ７０５の「０」或は「１」の記載を伴って、各パートの音声データファイル９０４のファイル名をテキストファイルに列挙出力する。このテキストファイルの中身の一例を次に示す。一行目だけ、ファイル名の後にスペース一つ挟んで、「０」が記されていることから、最初のパートは練習パートであることがわかる。 When the reference pattern management table 701 is stored in the task file 903, a line with the file name of the first part, that is, the file name described in the first line, followed by one half-width space is displayed. Along with the description of “0” or “1” in the order flag 705, the file names of the audio data files 904 of each part are listed and output in a text file. An example of the contents of this text file is shown below. In the first line, “0” is written with a space after the file name, so that the first part is a practice part.

ref061101153035005.wav 0
ref061101153036006.wav
ref061101153037007.wav
ref061101153038008.wav ref061101153035005.wav 0
ref061101153036006.wav
ref061101153037007.wav
ref061101153038008.wav

入力パターン管理テーブル７０３は、入力パターン表示エリア１５２に表示される各パートの音声データファイル９０４のファイル名が列挙されている。なお、タスクファイル９０３には格納されない。電話応対練習支援装置９０１がタスクファイル９０３を読み込むと、電話応対練習支援装置９０１はリファレンスパターン管理テーブル７０１を図示しないリファレンスパターン管理テーブルテキストファイルからＲＡＭ９０５に読み込んだ後、リファレンスパターン管理テーブル７０１から入力パターン管理テーブル７０３を生成する。つまり、入力パターン管理テーブル７０３はタスクファイル９０３読み込みの度毎に新規に生成される。
テキストデータ７０４はプレーンテキストである。これは各パートの台詞が記述されている。 The input pattern management table 703 lists the file names of the audio data files 904 for each part displayed in the input pattern display area 152. It is not stored in the task file 903. When the telephone answering practice support apparatus 901 reads the task file 903, the telephone answering practice support apparatus 901 reads the reference pattern management table 701 from the reference pattern management table text file (not shown) into the RAM 905, and then inputs the input pattern from the reference pattern management table 701. A management table 703 is generated. That is, the input pattern management table 703 is newly generated every time the task file 903 is read.
Text data 704 is plain text. This describes the lines of each part.

図７に示されている各々のデータのうち、音声データ７０２は個別の音声データファイル９０４として、不揮発性ストレージ９０２に格納されている。それ以外の、リファレンスパターン管理テーブル７０１、入力パターン管理テーブル７０３、テキストデータ７０４、表示順フラグ７０５は、ＲＡＭ９０５に格納されている。また、ＲＡＭ９０５には後述する動作制御にかかわるカウンタ変数等が格納される。
ＧＵＩ（Graphical User Interface：ユーザに対する情報の表示にグラフィックを多用し、大半の基礎的な操作をマウスなどのポインティングデバイスにて行うことができるユーザインターフェース。）は、その機能として操作部９０６と表示部９０７に大別できる。操作部９０６と表示部９０７は表示画面１０２を構成する。
操作部９０６がユーザの操作を受けると、その制御信号がタスク制御部９０８に伝達される。
タスク制御部９０８は電話応対練習支援装置９０１の中核をなす機能を提供する。ユーザの操作を操作部９０６から受けて、その必要に応じて以下のような動作をする。
一つは、後述する録音／練習／再生の各動作のためにＲＡＭ９０５内のデータを読み込んで、音声信号処理部９１２を稼動させることである。
もう一つは、不揮発性ストレージ９０２内の音声データファイル９０４のサイズを取得して、後述するタイマ部９０９とＡ／Ｄ変換部９１０及びＤ／Ａ変換部９１１から録音や再生の正確な経過時間等を計算することである。
更には、各種動作に応じて表示部９０７の表示制御を行わせることである。 Of each data shown in FIG. 7, the audio data 702 is stored in the nonvolatile storage 902 as an individual audio data file 904. The other reference pattern management table 701, input pattern management table 703, text data 704, and display order flag 705 are stored in the RAM 905. The RAM 905 stores counter variables related to operation control described later.
A GUI (Graphical User Interface: a user interface that uses a lot of graphics to display information to the user and can perform most basic operations with a pointing device such as a mouse) functions as an operation unit 906 and a display unit. It can be roughly divided into 907. The operation unit 906 and the display unit 907 constitute the display screen 102.
When the operation unit 906 receives a user operation, the control signal is transmitted to the task control unit 908.
The task control unit 908 provides a function that forms the core of the telephone answering practice support device 901. A user operation is received from the operation unit 906, and the following operation is performed as necessary.
One is to read the data in the RAM 905 and operate the audio signal processing unit 912 for recording / practice / playback operations to be described later.
The other is that the size of the audio data file 904 in the non-volatile storage 902 is acquired, and the accurate elapsed time of recording and playback from the timer unit 909, the A / D conversion unit 910, and the D / A conversion unit 911 which will be described later And so on.
Furthermore, display control of the display unit 907 is performed according to various operations.

音声信号処理部９１２は、Ａ／Ｄ変換部９１０及びＤ／Ａ変換部９１１と不揮発性ストレージ９０２との間で、ディジタルオーディオ信号の授受と信号処理を行う。つまり、録音と再生、そして必要に応じて後述する話速変換処理を行う。
録音であれば、Ａ／Ｄ変換部９１０から得られるディジタルオーディオ信号を、不揮発性ストレージ９０２内のタスク制御部９０８によって指定された音声データファイル９０４に記録する。
再生であれば、不揮発性ストレージ９０２内のタスク制御部９０８によって指定された音声データファイル９０４からディジタルオーディオ信号を生成し、Ｄ／Ａ変換部９１１に供給する。 The audio signal processing unit 912 performs transmission / reception of digital audio signals and signal processing between the A / D conversion unit 910 and the D / A conversion unit 911 and the nonvolatile storage 902. That is, recording and playback, and speech speed conversion processing described later are performed as necessary.
When recording, the digital audio signal obtained from the A / D conversion unit 910 is recorded in the audio data file 904 designated by the task control unit 908 in the nonvolatile storage 902.
For reproduction, a digital audio signal is generated from the audio data file 904 specified by the task control unit 908 in the nonvolatile storage 902 and supplied to the D / A conversion unit 911.

タイマ部９０９はミリ秒単位の細かなタイマ機能を提供する。
Ａ／Ｄ変換部９１０はマイクロホン９１３から入力されるアナログ音声信号をディジタル信号に変換し、音声信号処理部９１２に供給する。
Ｄ／Ａ変換部９１１は音声信号処理部９１２から出力されるディジタル信号をアナログ音声信号に変換し、スピーカ９１４に出力する。
Ａ／Ｄ変換部９１０及びＤ／Ａ変換部９１１には内部にリングバッファを内蔵し、このリングバッファにディジタル音声データが読み書きされる。タスク制御部９０８はリングバッファの読み出し開始位置を取得して、タイマ部９０９から得られる正確な時間情報と共に、正確な再生時間或は録音時間を計算する。
表示部９０７はタスク制御部９０８の指示を受け、各種表示を行う。主に、
リファレンスパターン表示エリア１５１及び入力パターン表示エリア１５２中の各パートの表示、
タイマの表示、
テキストの表示、
ボタンの操作可能／不可の表示切替等を行う。 The timer unit 909 provides a fine timer function in milliseconds.
The A / D conversion unit 910 converts the analog audio signal input from the microphone 913 into a digital signal and supplies the digital signal to the audio signal processing unit 912.
The D / A conversion unit 911 converts the digital signal output from the audio signal processing unit 912 into an analog audio signal and outputs the analog audio signal to the speaker 914.
The A / D conversion unit 910 and the D / A conversion unit 911 incorporate a ring buffer therein, and digital audio data is read from and written to the ring buffer. The task control unit 908 obtains the reading start position of the ring buffer, and calculates an accurate reproduction time or recording time together with accurate time information obtained from the timer unit 909.
The display unit 907 receives various instructions from the task control unit 908. mainly,
Display of each part in the reference pattern display area 151 and the input pattern display area 152;
Timer display,
Text display,
Performs display switching for enabling / disabling buttons.

音声データファイル９０４のファイル名は、必ずユニークなファイル名になるように工夫されている。具体的には、「年月日」＋「時刻」＋「９９９迄のカウント値」にて構成される。
リファレンスパターンにおいて新規に音声データファイル９０４のファイル名を生成するには、リファレンスパターン用の音声データファイル９０４であることを示す”ｒｅｆ”という文字列に、ファイル名生成の際の日時、そして練習支援ソフト１０１の内部のファイル名生成用カウンタのカウント値をインクリメントさせた値を付加する。
入力パターンにおいて新規に音声データファイル９０４のファイル名を生成するには、入力パターン用の音声データファイル９０４であることを示す”ｏｂｊ”という文字列に、ファイル名生成の際の日時、そして練習支援ソフト１０１の内部のファイル名生成用カウンタのカウント値をインクリメントさせた値を付加する。
今、２００６年１１月１日１５時３０分３５秒であったとする。そして、練習支援ソフト１０１の内部のファイル名生成用カウンタのカウント値が「００３」であったとする。この時点で、リファレンスパターン用の音声データファイル９０４を生成すると、ref061101153035004.wavというファイル名が生成される。
同様に、今、２００６年１１月１日１５時３０分３５秒であったとする。そして、練習支援ソフト１０１の内部のファイル名生成用カウンタのカウント値が「００３」であったとする。この時点で、入力パターン用の音声データファイル９０４を生成すると、obj061101153035004.wavというファイル名が生成される。
つまり、１秒間でカウンタ値を１０００回繰り返さない限りは、人間の操作で同一の音声データファイル９０４を生成することはほぼ不可能である。 The file name of the audio data file 904 is devised so that it is always a unique file name. Specifically, it is composed of “year / month / day” + “time” + “count value up to 999”.
In order to generate a new file name of the audio data file 904 in the reference pattern, the character string “ref” indicating the audio data file 904 for the reference pattern, the date and time when generating the file name, and practice support A value obtained by incrementing the count value of the file name generation counter inside the software 101 is added.
In order to newly generate the file name of the audio data file 904 in the input pattern, the date and time when generating the file name and the practice support are added to the character string “obj” indicating the audio data file 904 for the input pattern. A value obtained by incrementing the count value of the file name generation counter inside the software 101 is added.
Suppose that it was 15:30:35 on November 1, 2006. Assume that the count value of the file name generation counter in the practice support software 101 is “003”. At this point, when the audio data file 904 for the reference pattern is generated, the file name ref061101153035004.wav is generated.
Similarly, it is now assumed that it was 15:30:35 on November 1, 2006. Assume that the count value of the file name generation counter in the practice support software 101 is “003”. At this time, when the audio data file 904 for the input pattern is generated, the file name obj061101153035004.wav is generated.
That is, unless the counter value is repeated 1000 times per second, it is almost impossible to generate the same audio data file 904 by human operation.

図８は、リファレンスパターンには全パートに音声データファイル９０４が対応付けされているが、入力パターンには応答パートのみ音声データファイル９０４が対応付けされていることを示す。タスクファイル９０３を読み込んだ直後、或は新規にリファレンスパターンの録音を完遂した直後が、この状態である。
リファレンスパターン表示エリア１５１の応答パートに対応する音声データファイル９０４は、入力パターン表示エリア１５２の同じ番号の応答パートに対応する音声データファイル９０４と同じ内容になる。このため、リファレンスパターンの応答パートの音声データファイル９０４から、入力パターンの応答パートの音声データファイル９０４を生成すべく、複写する処理を行う。具体的には、以下のようなＭＳ−ＤＯＳ（登録商標）コマンドに相当する処理を行う。
>copy ref061101153035004.wav obj061101153035004.wav
つまり、”ｒｅｆ”より始まるファイル名を”ｏｂｊ”に始まるファイル名に変換して、内容をそっくりそのままコピーする。
なお、この処理の詳細な動作は後述する。 FIG. 8 shows that the audio data file 904 is associated with all the parts in the reference pattern, but only the response part is associated with the input pattern. This state is immediately after reading the task file 903 or immediately after recording a new reference pattern.
The audio data file 904 corresponding to the response part in the reference pattern display area 151 has the same contents as the audio data file 904 corresponding to the response part having the same number in the input pattern display area 152. For this reason, a copy process is performed to generate an audio data file 904 of the response part of the input pattern from the audio data file 904 of the response part of the reference pattern. Specifically, processing corresponding to the following MS-DOS (registered trademark) command is performed.
> copy ref061101153035004.wav obj061101153035004.wav
That is, a file name starting with “ref” is converted into a file name starting with “obj”, and the contents are copied as they are.
The detailed operation of this process will be described later.

［初期動作］
図１０は練習支援ソフト１０１を起動した直後の処理の流れを示すフローチャートである。主にタスク制御部９０８が主体となって行う処理である。
練習支援ソフト１０１を起動すると（Ｓ１００１）、タスク制御部９０８は、Ｗｉｎｄｏｗｓ（登録商標）のレジストリ等に格納されているデータファイルの履歴情報から、タスクファイル９０３の履歴が存在するか否か、見る（Ｓ１００２）。
もし、タスクファイル９０３の履歴が履歴情報にあった場合には、そのファイル名は実在するか否か、確認する（Ｓ１００３）。
タスクファイル９０３の履歴がないか、あってもファイルが実在しない場合は、デフォルト設定を行う（Ｓ１００４）。
履歴にあるタスクファイル９０３が実在していれば、タスク制御部９０８は、当該タスクファイル９０３を読み込み、音声データファイル９０４については不揮発性ストレージ９０２内の所定のディレクトリに展開し、それ以外のデータはＲＡＭ９０５に読み込む。この時、タスクファイル９０３内に含まれている図示しないリファレンス管理テーブルテキストファイルはＲＡＭ９０５に取り込まれ、リファレンスパターン管理テーブル７０１に変換される。そして、この動作に伴い、リファレンスパターン管理テーブル７０１のレコード数からパートの総数を得て、これを変数Ｎに設定する（Ｓ１００５）。この時点で、ステップＳ１００４或はＳ１００５によって、リファレンスパターン管理テーブル７０１が生成されたこととなる。
次に、タスク制御部９０８は、リファレンスパターン管理テーブル７０１から入力パターン管理テーブル７０３を生成する処理を行う（Ｓ１００６）。
リファレンスパターン管理テーブル７０１と入力パターン管理テーブル７０３が揃ったら、タスク制御部９０８は表示部９０７を制御して、各テーブル毎にパートを各々の表示エリアに表示する処理を行い（Ｓ１００８）、一連の処理を終了する（Ｓ１００９）。
ステップＳ１００９の直後の画面表示状態が、図１及び図３である。
タスクファイル９０３の読み込みを行わない場合（Ｓ１００４）は、図１の画面表示状態になる。つまり、読み込もうとするタスクファイル９０３が存在しないので、空白の練習パートを一つだけ設けて表示する。
タスクファイル９０３の読み込みが行われた場合（Ｓ１００５）は、図３の画面表示状態になる。
つまり、読み込もうとするタスクファイル９０３が存在するので、リファレンスパターン表示エリア１５１には所定のパート数が表示され、入力パターン表示エリア１５２には応答パートだけがリファレンスパターン表示エリア１５１の表示内容と同じになる。 [Initial operation]
FIG. 10 is a flowchart showing the flow of processing immediately after starting the practice support software 101. This process is mainly performed by the task control unit 908.
When the practice support software 101 is activated (S1001), the task control unit 908 checks whether the history of the task file 903 exists from the history information of the data file stored in the Windows (registered trademark) registry or the like. (S1002).
If the history of the task file 903 is in the history information, it is confirmed whether or not the file name actually exists (S1003).
If the task file 903 does not exist or does not exist, default setting is performed (S1004).
If the task file 903 in the history exists, the task control unit 908 reads the task file 903, expands the audio data file 904 to a predetermined directory in the non-volatile storage 902, and stores other data. Read into the RAM 905. At this time, a reference management table text file (not shown) included in the task file 903 is taken into the RAM 905 and converted into a reference pattern management table 701. With this operation, the total number of parts is obtained from the number of records in the reference pattern management table 701, and this is set as a variable N (S1005). At this point, the reference pattern management table 701 has been generated in step S1004 or S1005.
Next, the task control unit 908 performs processing for generating the input pattern management table 703 from the reference pattern management table 701 (S1006).
When the reference pattern management table 701 and the input pattern management table 703 are prepared, the task control unit 908 controls the display unit 907 to perform a process of displaying a part in each display area for each table (S1008). The process ends (S1009).
The screen display state immediately after step S1009 is shown in FIGS.
When the task file 903 is not read (S1004), the screen display state shown in FIG. That is, since there is no task file 903 to be read, only one blank practice part is provided and displayed.
When the task file 903 is read (S1005), the screen display state shown in FIG.
That is, since there is a task file 903 to be read, a predetermined number of parts are displayed in the reference pattern display area 151, and only the response part is displayed in the input pattern display area 152 in the same manner as the display contents of the reference pattern display area 151. Become.

図１１はデフォルト設定の詳細な動作を示すフローチャートである。図１０のステップＳ１００４に相当する。主にタスク制御部９０８が主体となって行う処理である。
処理を開始すると（Ｓ１１０１）、タスク制御部９０８は規定値としてＲＡＭ９０５内の表示順フラグ７０５を「０」に設定する（Ｓ１１０２）。
次に、タスク制御部９０８はリファレンスパターン管理テーブル７０１の第１番目のパートの為の、ユニークなファイル名を新規に生成する（Ｓ１１０３）。そして、生成したファイル名をリファレンスパターン管理テーブル７０１の第１番目のパートに格納して（Ｓ１１０４）、処理を終了する（Ｓ１１０５）。 FIG. 11 is a flowchart showing the detailed operation of default setting. This corresponds to step S1004 in FIG. This process is mainly performed by the task control unit 908.
When the process is started (S1101), the task control unit 908 sets the display order flag 705 in the RAM 905 to “0” as a specified value (S1102).
Next, the task control unit 908 newly generates a unique file name for the first part of the reference pattern management table 701 (S1103). The generated file name is stored in the first part of the reference pattern management table 701 (S1104), and the process is terminated (S1105).

図１２は入力パターン管理テーブル７０３を生成する処理のフローチャートである。図１０のステップＳ１００６に相当する。主にタスク制御部９０８が主体となって行う処理である。
処理を開始すると（Ｓ１２０１）、タスク制御部９０８は最初にカウンタを構成する変数ｉに１を設定する（Ｓ１２０２）。なお、これ以降のフローチャートにおいて、変数が幾つか現れるが、これら変数は全てＲＡＭ９０５内にタスク制御部９０８によって設けられる変数である。特に、カウンタを構成する変数は各々のフローチャートの説明においてのみ有効なローカル変数である。
これ以降はループ処理である。
先ず、カウンタ変数ｉが示す番号のパートは応答パートか否か、判定する（Ｓ１２０３）。
例えば、表示順フラグ７０５が「０」（練習パートから開始）の場合は、奇数パートが練習パートであり、偶数パートが応答パートである。
逆に、表示順フラグ７０５が「１」（応答パートから開始）の場合は、奇数パートが応答パートであり、偶数パートが練習パートである。
すなわち、カウンタ変数ｉが偶数か奇数か、表示順フラグ７０５が０か１かで、その判定結果が変わる。
判定の結果、ｉ番目のパートが練習パートであれば、タスク制御部９０８は入力パターン管理テーブル７０３用にユニークなファイル名を生成する（Ｓ１２０４）。
判定の結果、ｉ番目のパートが応答パートであれば、タスク制御部９０８は、リファレンスパターン管理テーブル７０１のｉ番目の音声データファイル９０４から、入力パターン管理テーブル７０３のｉ番目の音声データファイル９０４を生成すべく、コピーする処理を行う（Ｓ１２０５）。先に示したcopyコマンドと等しい処理である。
ステップＳ１２０４或はＳ１２０５のいずれの場合においても、この時点で、入力パターン管理テーブル７０３用の音声データファイル９０４のファイル名が得られている。そこで、タスク制御部９０８は次に、入力パターン管理テーブル７０３のｉ番目のレコードに、当該音声データファイル９０４のファイル名を格納する（Ｓ１２０６）。
そして、カウンタ変数ｉを１加算（インクリメント）する（Ｓ１２０７）。
次に、インクリメントされた変数ｉの番号のパートが、リファレンスパターン管理テーブル７０１内にあるか否か見て（Ｓ１２０８）、あればループ処理を繰り返し（Ｓ１２０３）、なければ終了する（Ｓ１２０９）。 FIG. 12 is a flowchart of processing for generating the input pattern management table 703. This corresponds to step S1006 in FIG. This process is mainly performed by the task control unit 908.
When the process is started (S1201), the task control unit 908 first sets 1 to a variable i constituting the counter (S1202). In the following flowcharts, some variables appear. All of these variables are variables provided by the task control unit 908 in the RAM 905. In particular, the variables constituting the counter are local variables that are valid only in the description of each flowchart.
Subsequent processing is loop processing.
First, it is determined whether or not the part with the number indicated by the counter variable i is a response part (S1203).
For example, when the display order flag 705 is “0” (starting from the practice part), the odd part is the practice part and the even part is the response part.
Conversely, when the display order flag 705 is “1” (starting from the response part), the odd-numbered part is the response part and the even-numbered part is the practice part.
That is, the determination result changes depending on whether the counter variable i is an even number or an odd number, and the display order flag 705 is 0 or 1.
As a result of the determination, if the i-th part is a practice part, the task control unit 908 generates a unique file name for the input pattern management table 703 (S1204).
As a result of the determination, if the i-th part is a response part, the task control unit 908 changes the i-th audio data file 904 of the input pattern management table 703 from the i-th audio data file 904 of the reference pattern management table 701. In order to generate, a copy process is performed (S1205). The process is the same as the copy command shown above.
In either case of step S1204 or S1205, the file name of the audio data file 904 for the input pattern management table 703 is obtained at this point. Therefore, the task control unit 908 next stores the file name of the audio data file 904 in the i-th record of the input pattern management table 703 (S1206).
Then, the counter variable i is incremented by 1 (S1207).
Next, it is checked whether or not the incremented part number i is in the reference pattern management table 701 (S1208). If so, the loop process is repeated (S1203), and if not, the process ends (S1209).

図１３は表示画面１０２を用いた状態遷移図である。
録音／練習／再生を行っていない状態（停止状態画面１３０１）において、リファレンスパターン表示エリア１５１の第１パートをマウスで右クリックすると、メニューウィンドウ２５０３が表示される。このウィンドウにて「練習パートから開始」するのか、「応答パートから開始」するのかをマウスにて指定すると、その指定内容によって表示順フラグ７０５が書き換えられると共に、リファレンスパターン表示エリア１５１と入力パターン表示エリア１５２の再表示処理が行われる。
録音／練習／再生を行っていない状態において、リファレンスパターン表示エリア１５１或は入力パターン表示エリア１５２に表示されている任意のパートをマウスで左クリックすると、当該パートにカーソル表示が移動する。 FIG. 13 is a state transition diagram using the display screen 102.
When the first part of the reference pattern display area 151 is right-clicked with the mouse in a state where recording / practice / playback is not performed (stop state screen 1301), a menu window 2503 is displayed. In this window, if “start from practice part” or “start from response part” is specified with the mouse, the display order flag 705 is rewritten according to the specified contents, and the reference pattern display area 151 and the input pattern display are displayed. A redisplay process of the area 152 is performed.
When the recording / practice / playback is not being performed, if any part displayed in the reference pattern display area 151 or the input pattern display area 152 is left-clicked with the mouse, the cursor display moves to the part.

図１４はリファレンスパターン表示エリア１５１における表示処理である。
図１５は入力パターン表示エリア１５２における表示処理である。
両者共、実質的な処理内容は全く同じであるので、ここでは図１４のリファレンスパターン表示エリア１５１の表示処理について説明する。主にタスク制御部９０８が主体となって行う処理である。
処理を開始すると（Ｓ１４０１）、タスク制御部９０８はカウンタ変数ｉを指定した値に設定する（Ｓ１４０２）。指定した値とは、そのときのシチュエーションに応じて変わる。図１２の場合では１になる。後述する録音／練習／再生処理の場合では、録音／練習／再生を完了した直後のパートの番号が代入される。
次に、タスク制御部９０８はリファレンスパターン管理テーブル７０１のｉ番目のパートに記述されている音声データファイル９０４は不揮発性ストレージ９０２内に実在するか否かを調べる（Ｓ１４０３）。もしあれば、表示部９０７を制御して、その音声データファイル９０４を読み込み、波形表示する（Ｓ１４０４）。なければ、空表示、すなわち「ｎｏｄａｔａ」表示を行う（Ｓ１４０５）。なお、このステップＳ１４０４及びＳ１４０５では、変数ｉと表示順フラグ７０５とを確認して、練習パートであれば白地に群青表示を、応答パートであれば群青地に白表示を行う。
次に、タスク制御部９０８は、表示部９０７による表示処理の結果、当該パートの表示が表示エリアをはみ出たか否かを確認する（Ｓ１４０６）。はみ出ていなければ、変数ｉをインクリメントし（Ｓ１４０７）、パート最大数Ｎと比較して、Ｎを超えていないか否か確認する（Ｓ１４０８）。Ｎを超えていなければ、再び表示処理を繰り返す（Ｓ１４０３）。Ｎを超えたか、或はステップＳ１４０６で表示エリアをはみ出てしまった場合は、処理を終了する（Ｓ１４０９）。 FIG. 14 shows display processing in the reference pattern display area 151.
FIG. 15 shows display processing in the input pattern display area 152.
Since both have substantially the same processing contents, here, the display processing of the reference pattern display area 151 in FIG. 14 will be described. This process is mainly performed by the task control unit 908.
When the process is started (S1401), the task control unit 908 sets the counter variable i to a specified value (S1402). The specified value changes according to the situation at that time. In the case of FIG. In the case of recording / practice / playback processing to be described later, the part number immediately after the completion of recording / practice / playback is substituted.
Next, the task control unit 908 checks whether or not the audio data file 904 described in the i-th part of the reference pattern management table 701 actually exists in the nonvolatile storage 902 (S1403). If there is, the display unit 907 is controlled to read the audio data file 904 and display the waveform (S1404). If not, a blank display, ie, “no data” display is performed (S1405). In steps S1404 and S1405, the variable i and the display order flag 705 are confirmed, and if it is a practice part, a group blue display is displayed on a white background, and if it is a response part, a group blue display is displayed.
Next, the task control unit 908 confirms whether or not the display of the part exceeds the display area as a result of the display processing by the display unit 907 (S1406). If not, the variable i is incremented (S1407), and compared with the maximum number N of parts, it is confirmed whether or not N is exceeded (S1408). If N is not exceeded, the display process is repeated again (S1403). If N has been exceeded or the display area has been exceeded in step S1406, the process is terminated (S1409).

図１６はパート挿入ボタンを押したときの処理を示すフローチャートである。主にタスク制御部９０８が主体となって行う処理である。
パート挿入ボタンが押されると（Ｓ１６０１）、タスク制御部９０８は、リファレンスパターン管理テーブル７０１の、現在カーソルが示しているパート以降の内容を一つ後方にずらす（Ｓ１６０２）。図１６のステップＳ１６０２においては、現在のカーソル位置を示す変数をｋとしている。そして、リファレンスパターン管理テーブル７０１のｊ番目の内容をＲｅｆ（ｊ）と表現している。この処理の詳細は、
Ｒｅｆ（Ｎ＋１）を新規に作成し、
Ｒｅｆ（Ｎ＋１）にＲｅｆ（Ｎ）を代入し、
Ｒｅｆ（Ｎ）にＲｅｆ（Ｎ−１）を代入し、
Ｒｅｆ（Ｎ−１）にＲｅｆ（Ｎ−２）を代入し、…そして、
Ｒｅｆ（ｋ＋１）にＲｅｆ（ｋ）を代入する、という処理の流れになる。
次に、入力パターン管理テーブル７０３の、現在カーソルが示しているパート以降の内容を一つ後方にずらす（Ｓ１６０３）。図１６のステップＳ１６０３においては、現在のカーソル位置を示す変数をｋとしている。そして、入力パターン管理テーブル７０３のｊ番目の内容をＯｂｊ（ｊ）と表現している。この処理の詳細は、
Ｏｂｊ（Ｎ＋１）を新規に作成し、
Ｏｂｊ（Ｎ＋１）にＯｂｊ（Ｎ）を代入し、
Ｏｂｊ（Ｎ）にＯｂｊ（Ｎ−１）を代入し、
Ｏｂｊ（Ｎ−１）にＯｂｊ（Ｎ−２）を代入し、…そして、
Ｏｂｊ（ｋ＋１）にＯｂｊ（ｋ）を代入する、という処理の流れになる。
つまり、ステップＳ１６０２及びＳ１６０３は、挿入したい位置を空ける処理である。
空ける処理の後は、実際にデータを挿入する処理である。タスク制御部９０８は、リファレンスパターン用にユニークなファイル名を生成して、リファレンスパターン管理テーブル７０１のｋ番目に書き込む（Ｓ１６０４）。同様に、入力パターン用にユニークなファイル名を生成して、入力パターン管理テーブル７０３のｋ番目に書き込む（Ｓ１６０５）。
そして、パートの数を保持している変数Ｎをインクリメントする（Ｓ１６０６）。
最後に、入力パターンのデータ更新処理を行って（Ｓ１６０７）、終了する（Ｓ１６０８）。 FIG. 16 is a flowchart showing processing when the part insertion button is pressed. This process is mainly performed by the task control unit 908.
When the part insertion button is pressed (S1601), the task control unit 908 shifts the contents after the part indicated by the current cursor in the reference pattern management table 701 backward by one (S1602). In step S1602 of FIG. 16, a variable indicating the current cursor position is set to k. The jth content of the reference pattern management table 701 is expressed as Ref (j). For details on this process,
Create a new Ref (N + 1)
Substituting Ref (N) for Ref (N + 1),
Substituting Ref (N−1) for Ref (N),
Substituting Ref (N-2) for Ref (N-1), and ...
The processing flow is to substitute Ref (k) for Ref (k + 1).
Next, the contents after the part indicated by the cursor in the input pattern management table 703 are shifted backward by one (S1603). In step S1603 in FIG. 16, a variable indicating the current cursor position is set to k. The jth content of the input pattern management table 703 is expressed as Obj (j). For details on this process,
Obj (N + 1) is newly created,
Substitute Obj (N) for Obj (N + 1)
Substitute Obj (N-1) for Obj (N),
Substitute Obj (N-2) for Obj (N-1), and ...
The processing flow is to substitute Obj (k) for Obj (k + 1).
That is, steps S1602 and S1603 are processes for freeing a position to be inserted.
After the vacant process, data is actually inserted. The task control unit 908 generates a unique file name for the reference pattern and writes it in the kth of the reference pattern management table 701 (S1604). Similarly, a unique file name for the input pattern is generated and written in the kth of the input pattern management table 703 (S1605).
Then, the variable N holding the number of parts is incremented (S1606).
Finally, input pattern data update processing is performed (S1607), and the process ends (S1608).

図１７はパート削除ボタンを押したときの処理を示すフローチャートである。主にタスク制御部９０８が主体となって行う処理である。
パート削除ボタンが押されると（Ｓ１７０１）、タスク制御部９０８は、リファレンスパターン管理テーブル７０１の、現在カーソルが示しているパートの内容である音声データファイル９０４を削除する（Ｓ１７０２）。このとき、当該削除対象の音声データファイル９０４が実在していなければ、処理をスキップする。同様に、入力パターン管理テーブル７０３の、現在カーソルが示しているパートの内容である音声データファイル９０４を削除する（Ｓ１７０３）。このとき、当該削除対象の音声データファイル９０４が実在していなければ、処理をスキップする。
次に、リファレンスパターン管理テーブル７０１の、現在カーソルが示しているパート以降の内容を一つ前方にずらす（Ｓ１７０４）。図１７のステップＳ１７０４においては、現在のカーソル位置を示す変数をｋとしている。そして、リファレンスパターン管理テーブル７０１のｊ番目の内容をＲｅｆ（ｊ）と表現している。この処理の詳細は、
Ｒｅｆ（ｋ＋１）にＲｅｆ（ｋ）を代入し、
Ｒｅｆ（ｋ＋２）にＲｅｆ（ｋ＋１）を代入し、…そして、
Ｒｅｆ（Ｎ−１）にＲｅｆ（Ｎ）を代入する、という処理の流れになる。
同様に、入力パターン管理テーブル７０３の、現在カーソルが示しているパート以降の内容を一つ前方にずらす（Ｓ１７０５）。図１７のステップＳ１７０５においては、現在のカーソル位置を示す変数をｋとしている。そして、入力パターン管理テーブル７０３のｊ番目の内容をＯｂｊ（ｊ）と表現している。この処理の詳細は、
Ｏｂｊ（ｋ＋１）にＯｂｊ（ｋ）を代入し、
Ｏｂｊ（ｋ＋２）にＯｂｊ（ｋ＋１）を代入し、…そして、
Ｏｂｊ（Ｎ−１）にＯｂｊ（Ｎ）を代入する、という処理の流れになる。
そして、パートの数を保持している変数Ｎをデクリメントする（Ｓ１７０６）。
最後に、入力パターンのデータ更新処理を行って（Ｓ１７０７）、終了する（Ｓ１７０８）。 FIG. 17 is a flowchart showing the processing when the part delete button is pressed. This process is mainly performed by the task control unit 908.
When the part deletion button is pressed (S1701), the task control unit 908 deletes the audio data file 904 that is the content of the part currently indicated by the cursor in the reference pattern management table 701 (S1702). At this time, if the audio data file 904 to be deleted does not actually exist, the process is skipped. Similarly, the audio data file 904 that is the content of the part currently indicated by the cursor is deleted from the input pattern management table 703 (S1703). At this time, if the audio data file 904 to be deleted does not actually exist, the process is skipped.
Next, the contents after the part indicated by the cursor in the reference pattern management table 701 are shifted forward by one (S1704). In step S1704 in FIG. 17, a variable indicating the current cursor position is set to k. The jth content of the reference pattern management table 701 is expressed as Ref (j). For details on this process,
Substituting Ref (k) for Ref (k + 1),
Substitute Ref (k + 1) for Ref (k + 2), and ...
The processing flow is to substitute Ref (N) for Ref (N-1).
Similarly, the contents after the part indicated by the cursor in the input pattern management table 703 are shifted forward by one (S1705). In step S1705 of FIG. 17, a variable indicating the current cursor position is set to k. The jth content of the input pattern management table 703 is expressed as Obj (j). For details on this process,
Substituting Obj (k) for Obj (k + 1)
Substitute Obj (k + 1) for Obj (k + 2), and ...
The processing flow is to substitute Obj (N) for Obj (N-1).
Then, the variable N holding the number of parts is decremented (S1706).
Finally, input pattern data update processing is performed (S1707), and the process ends (S1708).

図１８は、入力パターンのデータ更新処理を示すフローチャートである。図１６のステップＳ１６０７及び図１７のステップＳ１７０７に相当する。主にタスク制御部９０８が主体となって行う処理である。
処理を開始すると（Ｓ１８０１）、タスク制御部９０８は、カウンタ変数ｉに現在のカーソル位置ｋを代入する（Ｓ１８０２）。
次に、タスク制御部９０８はカウンタ値ｉとパート数Ｎとを比較して（Ｓ１８０３）、ｉがパート数Ｎ以下であれば、ｉ番目のパートが応答パートなのか練習パートなのかを確認する（Ｓ１８０４）。
応答パートであれば、入力パターン管理テーブル７０３のｉ番目が参照している音声データファイル９０４に、リファレンスパターン管理テーブル７０１のｉ番目が参照している音声データファイル９０４の内容を上書きコピーする（Ｓ１８０５）。この処理により、入力パターン管理テーブル７０３のｉ番目が参照している音声データファイル９０４のファイル名はそのままで、中身はリファレンスパターン管理テーブル７０１のｉ番目が参照している音声データファイル９０４と同じになる。
練習パートであれば、入力パターン管理テーブル７０３のｉ番目が参照している音声データファイル９０４を削除する（Ｓ１８０６）。なお、音声データファイル９０４が実在しない場合は、処理をスキップする。
ステップＳ１８０５及びＳ１８０６のいずれの場合でも、処理を完遂した後は、カウンタｉをインクリメント（Ｓ１８０７）して、再びステップＳ１８０３にてパート数Ｎと比較される。
ステップＳ１８０３にて、カウンタｉがパート数Ｎを超えたら、終了となる（Ｓ１８０８）。 FIG. 18 is a flowchart showing input pattern data update processing. This corresponds to step S1607 in FIG. 16 and step S1707 in FIG. This process is mainly performed by the task control unit 908.
When the process is started (S1801), the task control unit 908 substitutes the current cursor position k for the counter variable i (S1802).
Next, the task control unit 908 compares the counter value i with the number of parts N (S1803), and if i is equal to or less than the number of parts N, it confirms whether the i-th part is a response part or a practice part. (S1804).
If it is a response part, the contents of the audio data file 904 referred to by the i-th reference pattern management table 701 are overwritten and copied to the audio data file 904 referred to by the i-th input pattern management table 703 (S1805). ). By this processing, the file name of the audio data file 904 referred to by the i-th in the input pattern management table 703 remains the same, and the content is the same as the audio data file 904 referred to by the i-th in the reference pattern management table 701. Become.
If it is a practice part, the audio data file 904 referred to by the i-th in the input pattern management table 703 is deleted (S1806). If the audio data file 904 does not exist, the process is skipped.
In either case of steps S1805 and S1806, after completing the processing, the counter i is incremented (S1807) and compared with the number N of parts again in step S1803.
If the counter i exceeds the number N of parts in step S1803, the process ends (S1808).

以上、図１６、図１７及び図１８を通じて、リファレンスパターン管理テーブルと入力パターン管理テーブルとのパート数は必ず同じ数になることがわかる。 As described above, it can be seen from FIG. 16, FIG. 17 and FIG. 18 that the number of parts in the reference pattern management table and the input pattern management table is always the same.

［録音］
図１９及び図２０は、リファレンスパターンの録音を行う際の処理を示すフローチャートである。主にタスク制御部９０８が主体となって行う処理である。
図１の「録音」ボタン１０９を押すと（Ｓ１９０１）、タスク制御部９０８は、最初にカウンタ変数ｊにカーソル位置を示す指定数ｋを代入する（Ｓ１９０２）。
これ以降はループ処理である。
先ず、ｊとパート数Ｎを比較する（Ｓ１９０３）。ｊがＮ以下であれば、次に「テキスト表示」チェックボックス１２０にチェックが入っているか否か（テキスト表示モードであるか否か）を確認する（Ｓ１９０４）。テキスト表示モードであれば、テキストデータ７０４のｊ番目のパートのテキストを表示する（Ｓ１９０５）。
次に、リファレンスパターン表示エリア１５１中のｊ番目のパートを赤く表示して（Ｓ１９０６）、録音を開始する（Ｓ１９０７）。
録音中は、タスク制御部９０８は操作部９０６を通じて、キーボードのエスケープキー（ＥＳＣキー）が押されたか否か（Ｓ１９０８）、マウスの右クリックが行われたか否か（Ｓ１９０９）をチェックし続ける（Ｓ１９０９のＮからＳ１９０８へ）。
ＥＳＣキーが押されたら（Ｓ１９０８のＹ）、録音を終了すると共に当該パートの赤色表示を止めて（Ｓ１９１０）、ｊ番目以降のパートの再表示処理を行う（Ｓ１９１１）。すなわち、図１４に示したパート表示処理を、ｉにｊを代入して行うものである。そして、終了する（Ｓ１９１２）。
マウスの右クリックが行われたら（Ｓ１９０９のＹ）、ｊ番目のパートの録音を終了すると共に、当該パートの赤色表示を止める（Ｓ２０１３）。そして、ｊ番目以降のパートの再表示処理を行う（Ｓ２０１４）。すなわち、図１４に示したパート表示処理を、ｉにｊを代入して行うものである。
次に、「連続」チェックボックス１１０にチェックが入っているか（連続録音モードであるか否か）を確認する（Ｓ２０１５）。連続モードでなければここで処理は終了する（Ｓ２０１６）が、連続モードであれば、次のパートの録音を行うので、ｊをインクリメントして（Ｓ２０１７）、処理を繰り返す（Ｓ１９０３）。カウンタｊがパート数Ｎを超えたら、終了となる（Ｓ１９１８）。 [recording]
FIG. 19 and FIG. 20 are flowcharts showing processing when recording a reference pattern. This process is mainly performed by the task control unit 908.
When the “record” button 109 in FIG. 1 is pressed (S1901), the task control unit 908 first substitutes the designated number k indicating the cursor position into the counter variable j (S1902).
Subsequent processing is loop processing.
First, j is compared with the number N of parts (S1903). If j is less than or equal to N, it is checked whether or not the “text display” check box 120 is checked (whether or not the text display mode is set) (S1904). If it is in the text display mode, the text of the j-th part of the text data 704 is displayed (S1905).
Next, the j-th part in the reference pattern display area 151 is displayed in red (S1906), and recording is started (S1907).
During recording, the task control unit 908 continues to check whether an escape key (ESC key) on the keyboard has been pressed (S1908) and whether a right mouse click has been performed (S1909) through the operation unit 906 (S1909). N from S1909 to S1908).
When the ESC key is pressed (Y in S1908), the recording is terminated and the red display of the part is stopped (S1910), and the jth and subsequent parts are displayed again (S1911). That is, the part display process shown in FIG. 14 is performed by substituting j for i. Then, the process ends (S1912).
When the right mouse click is performed (Y in S1909), the recording of the j-th part is terminated and the red display of the part is stopped (S2013). Then, the j-th and subsequent parts are redisplayed (S2014). That is, the part display process shown in FIG. 14 is performed by substituting j for i.
Next, it is confirmed whether or not the “continuous” check box 110 is checked (whether or not it is the continuous recording mode) (S2015). If it is not the continuous mode, the process ends here (S2016). If it is the continuous mode, the next part is recorded, so j is incremented (S2017), and the process is repeated (S1903). If the counter j exceeds the number N of parts, the process ends (S1918).

［練習］
図２１は表示画面１０２を用いた状態遷移図である。
通常の表示状態（停止状態画面２１０１）において、「練習開始」ボタン１１５を押すと、発話練習開始ウィンドウ２１０２が表示される。
発話練習開始ウィンドウ２１０２の「開始」ボタンを押すと、入力パターン表示エリア１５２の表示内容にしたがって、練習パートへの録音（練習）と、応答パートの再生が繰り返される（練習状態画面２１０３）。 [practice]
FIG. 21 is a state transition diagram using the display screen 102.
When the “practice start” button 115 is pressed in the normal display state (stop state screen 2101), an utterance practice start window 2102 is displayed.
When a “start” button in the utterance practice start window 2102 is pressed, recording (practice) to the practice part and reproduction of the response part are repeated according to the display contents of the input pattern display area 152 (practice state screen 2103).

図２２及び図２３は入力パターンにおける練習を行う際の処理を示すフローチャートである。主にタスク制御部９０８が主体となって行う処理である。
図１の「練習開始」ボタン１１５を押すと（Ｓ２２０１）、タスク制御部９０８は最初にカウンタ変数ｊにカーソル位置を示す指定数ｋを代入する（Ｓ２２０２）。
これ以降はループ処理である。
先ず、ｊとパート数Ｎを比較する（Ｓ２２０３）。ｊがＮ以下であれば、次に「テキスト表示」チェックボックス１２０にチェックが入っているか否か（テキスト表示モードであるか否か）を確認する（Ｓ２２０４）。テキスト表示モードであれば、テキストデータ７０４のｊ番目のパートのテキストを表示する（Ｓ２２０５）。
次に、ｊ番目のパートは応答パートであるか否かを確認する（Ｓ２２０６）。
応答パートであれば、ｊ番目のパートを緑色に表示して（Ｓ２２０７）、ｊ番目のパートの再生を開始する（Ｓ２２０８）。
再生中は、キーボードのエスケープキー（ＥＳＣキー）が押されたか否か（Ｓ２２０９）、当該パートの再生が終了したか否か（Ｓ２３１０）をチェックしつつ、ループする（Ｓ２３１０のＮからＳ２２０９へ）。
ＥＳＣキーが押されたら（Ｓ２２０９のＹ）、当該パートの再生を中断すると共に当該パートの緑色表示を止めて（Ｓ２２１１）、終了する（Ｓ２２１２）。
当該パートの再生が終了したら（Ｓ２３１０のＹ）、カウンタｊをインクリメントして（Ｓ２３１３）、ループ処理を継続する（Ｓ２２０３）。
応答パートの次には練習パートが位置している（Ｓ２２０６のＮ）。練習パートであれば、当該ｊ番目のパートを赤く表示して（Ｓ２３１４）、録音を開始する（Ｓ２３１５）。ここで「録音」というのは、練習パートにかかる音声データファイル９０４に対する録音である。
録音中は、キーボードのエスケープキー（ＥＳＣキー）が押されたか否か（Ｓ２３１６）、マウスの右クリックが行われたか否か（Ｓ２３１７）をチェックしつつ、ループする（Ｓ２３１７のＮからＳ２３１６へ）。
ＥＳＣキーが押されたら（Ｓ２３１６のＹ）、録音を終了すると共に当該パートの赤色表示を止めて（Ｓ２３１８）、ｊ番目以降のパートの再表示処理を行う（Ｓ２３１９）。すなわち、図１４に示したパート表示処理を、ｉにｊを代入して行うものである。そして、終了する（Ｓ２３２０）。
マウスの右クリックが行われたら（Ｓ２３１７のＹ）、ｊ番目のパートの録音を終了すると共に、当該パートの赤色表示を止める（Ｓ２３２１）。そして、ｊ番目以降のパートの再表示処理を行う（Ｓ２３２２）。すなわち、図１４に示したパート表示処理を、ｉにｊを代入して行うものである。そして、カウンタｊをインクリメントして（Ｓ２３１３）、ループ処理を継続する（Ｓ２２０３）。
カウンタｊがパート数Ｎを超えたら終了となる（Ｓ２２２３）。 22 and 23 are flowcharts showing the processing when practicing the input pattern. This process is mainly performed by the task control unit 908.
When the “practice start” button 115 in FIG. 1 is pressed (S2201), the task control unit 908 first substitutes the designated number k indicating the cursor position into the counter variable j (S2202).
Subsequent processing is loop processing.
First, j is compared with the number N of parts (S2203). If j is less than or equal to N, it is checked whether or not the “text display” check box 120 is checked (whether or not it is in the text display mode) (S2204). In the text display mode, the text of the j-th part of the text data 704 is displayed (S2205).
Next, it is confirmed whether or not the j-th part is a response part (S2206).
If it is a response part, the jth part is displayed in green (S2207), and playback of the jth part is started (S2208).
During playback, it loops (from N in S2310 to S2209) while checking whether the escape key (ESC key) on the keyboard has been pressed (S2209) and whether playback of the part has been completed (S2310). .
When the ESC key is pressed (Y in S2209), the reproduction of the part is interrupted and the green display of the part is stopped (S2211), and the process is ended (S2212).
When the reproduction of the part is completed (Y in S2310), the counter j is incremented (S2313), and the loop process is continued (S2203).
The practice part is located next to the response part (N in S2206). If it is a practice part, the j-th part is displayed in red (S2314), and recording is started (S2315). Here, “recording” is recording with respect to the audio data file 904 related to the practice part.
During recording, it loops (from N in S2317 to S2316) while checking whether the escape key (ESC key) on the keyboard has been pressed (S2316) and whether the right mouse click has been performed (S2317). .
When the ESC key is pressed (Y in S2316), the recording is terminated and the red display of the part is stopped (S2318), and the j-th and subsequent parts are displayed again (S2319). That is, the part display process shown in FIG. 14 is performed by substituting j for i. Then, the process ends (S2320).
When the right click of the mouse is performed (Y in S2317), the recording of the j-th part is terminated and the red display of the part is stopped (S2321). Then, the j-th and subsequent parts are redisplayed (S2322). That is, the part display process shown in FIG. 14 is performed by substituting j for i. Then, the counter j is incremented (S2313) and the loop processing is continued (S2203).
If the counter j exceeds the number N of parts, the process ends (S2223).

［応答速度変更］
練習の際に、「応答速度」変更欄１１６の数値を変更することにより、応答パートの再生速度を変更することができる。いわゆる話速変換と呼ばれる機能である。数値による速度変更情報は操作部９０６からタスク制御部９０８に伝達される。タスク制御部９０８は音声信号処理部に制御命令を発する。音声信号処理部９１２では目的の（これから再生しようとする応答パートの）音声データファイル９０４の再サンプリング処理を施すことにより、再生時のピッチを変更せずに再生速度を変更する。 [Change response speed]
During practice, the playback speed of the response part can be changed by changing the value in the “response speed” change column 116. This function is called so-called speech speed conversion. The numerical speed change information is transmitted from the operation unit 906 to the task control unit 908. The task control unit 908 issues a control command to the audio signal processing unit. The audio signal processing unit 912 changes the reproduction speed without changing the pitch at the time of reproduction by performing a resampling process on the target audio data file 904 (of the response part to be reproduced in the future).

［再生］
図２４は、リファレンスパターン及び入力パターンの再生処理を示すフローチャートである。主にタスク制御部９０８が主体となって行う処理である。
「リファレンス再生」ボタン１０３或は「入力再生」ボタン１０４を押すと（Ｓ２４０１）、最初にカウンタ変数ｊにカーソル位置を示す指定数ｋを代入する（Ｓ２４０２）。
これ以降はループ処理である。
先ず、ｊとパート数Ｎを比較する（Ｓ２４０３）。ｊがＮ以下であれば、次に「テキスト表示」チェックボックス１２０にチェックが入っているか否か（テキスト表示モードであるか否か）を確認する（Ｓ２４０４）。テキスト表示モードであれば、テキストデータ７０４のｊ番目のパートのテキストを表示する（Ｓ２４０５）。
次に、ｊ番目のパートを緑色に表示して（Ｓ２４０６）、ｊ番目のパートの再生を開始する（Ｓ２４０７）。
再生中は、キーボードのエスケープキー（ＥＳＣキー）が押されたか否か（Ｓ２４０８）、当該パートの再生が終了したか否か（Ｓ２４０９）をチェックしつつ、ループする（Ｓ２４０９のＮからＳ２４０８へ）。
ＥＳＣキーが押されたら（Ｓ２４０８のＹ）、当該パートの再生を中断すると共に当該パートの緑色表示を止めて（Ｓ２４１０）、終了する（Ｓ２４１１）。
当該パートの再生が終了したら（Ｓ２４０９のＹ）、カウンタｊをインクリメントして（Ｓ２４１２）、ループ処理を継続する（Ｓ２４１３）。 [Regeneration]
FIG. 24 is a flowchart showing the reproduction process of the reference pattern and the input pattern. This process is mainly performed by the task control unit 908.
When the “reference reproduction” button 103 or the “input reproduction” button 104 is pressed (S2401), a designated number k indicating the cursor position is first substituted into the counter variable j (S2402).
Subsequent processing is loop processing.
First, j is compared with the number N of parts (S2403). If j is equal to or less than N, it is next checked whether or not the “text display” check box 120 is checked (whether or not it is in the text display mode) (S2404). If it is the text display mode, the text of the j-th part of the text data 704 is displayed (S2405).
Next, the jth part is displayed in green (S2406), and playback of the jth part is started (S2407).
During playback, a loop is performed while checking whether the escape key (ESC key) on the keyboard has been pressed (S2408) and whether playback of the part has ended (S2409) (from N in S2409 to S2408). .
When the ESC key is pressed (Y in S2408), the reproduction of the part is interrupted and the green display of the part is stopped (S2410), and the process is ended (S2411).
When the reproduction of the part is completed (Y in S2409), the counter j is incremented (S2412), and the loop process is continued (S2413).

［差し替え］
練習を行った結果、入力パターン上のある特定のパートに録音した音声がリファレンスパターン上のものよりも出来が良かった場合は、当該パートを選択して（マウスの左クリック）カーソル位置を更新した後、「差し替え」ボタン１３１を押す（マウスの左クリック）。すると、当該リファレンスパターン上のパートが指し示す音声データファイル９０４は、対応する入力パターン上のパートが指し示す音声データファイル９０４によって上書きコピーされる。
リファレンスパターン上にて選択したパートの音声データがref061101153035005.wavで、対応する入力パターン上のパートの音声データファイル９０４がobj061101154035000.wavである場合、以下のＭＳ−ＤＯＳコマンドを実行したことと動作は等しい。
>copy obj061101154035000.wav ref061101153035005.wav
つまり、リファレンスパターン管理テーブル７０１内の当該パートのレコードに格納されているファイル名はそのままで、中身は入力パターン管理テーブル７０３内の当該パートのレコードに格納されているファイル名の音声データファイル９０４の中身に挿げ替えられる。 [Replace]
As a result of practice, if the sound recorded in a specific part on the input pattern is better than that on the reference pattern, the part is selected (left mouse click) and the cursor position is updated. Then, the “replace” button 131 is pressed (left mouse click). Then, the audio data file 904 indicated by the part on the reference pattern is overwritten and copied by the audio data file 904 indicated by the part on the corresponding input pattern.
When the audio data of the part selected on the reference pattern is ref061101153035005.wav and the audio data file 904 of the part on the corresponding input pattern is obj061101154035000.wav, the following MS-DOS command is executed and the operation is equal.
> copy obj061101154035000.wav ref061101153035005.wav
That is, the file name stored in the record of the part in the reference pattern management table 701 is kept as it is, and the content is the audio data file 904 having the file name stored in the record of the part in the input pattern management table 703. Can be replaced with the contents.

［カット／コピー／ペースト］
電話応対では、応答する相手の会話内容が「ええ」「はい」等の相槌である場合が多い。このような相槌に相当する箇所は、同一の音声データを流用したい。そこで、指定したパートの内容に対し、切り取り（カット：指定した箇所の内容を削除して内部バッファ領域に転送する）／コピー（指定した箇所の内容を削除せずに内部バッファ領域に複写する）／貼り付け（ペースト：内部バッファ領域に保持されている内容を指定した箇所に複写する）を実現できると、便利である。
図２５（ａ）及び（ｂ）は、カット／コピー／ペーストのメニューウィンドウを表示した状態を示す図である。
図２５（ａ）において、指定したパートを右クリックすることで、メニューウィンドウ２５０２が現れる。
メニューウィンドウ２５０２には、上から順に「コピー」、「切り取り」、「貼り付け」と列挙され、更に「ファイル読み込み」、「ファイル書き出し」という項目が続く。
図２５（ａ）では、内部バッファ領域に音声データファイル９０４の内容が保持されていないので、「貼り付け」の項目の文字が薄い表示になっており、マウスの左クリックによる指定ができなくなっている。これ以外の項目の文字は濃い表示になっており、マウスの左クリックによる指定ができる。
図２５（ａ）の状態で、「コピー」を選択すると、内部バッファ領域に音声データファイル９０４の内容が複写され、それと共にメニューウィンドウ２５０１の表示がなくなる。
図２５（ａ）の状態で、「切り取り」を選択すると、内部バッファ領域に音声データファイル９０４の内容が複写され、音声データファイル９０４は削除され、「ｎｏｄａｔａ」表示になる。そして、メニューウィンドウ２５０１の表示がなくなる。
内部バッファ領域に音声データファイル９０４の内容が保持されている状態で、「貼り付け」を選択すると、指定したパートの音声データファイル９０４の中身が、内部バッファ領域に保持されている音声データの内容によって上書きされる。なお、このときに「貼り付け」対象のパートが「ｎｏｄａｔａ」表示である場合は、音声データファイル９０４の実体がないので、当該パートのファイル名にて新たに音声データファイル９０４が作成され、そこに内部バッファ領域に保持されている音声データの内容が転送（貼り付け）される。
なお、「ファイル読み込み」「ファイル書き出し」の項目は、指定したパートの音声データファイルを単独で読み込んだり書き出したりするために用いる機能である。 [Cut / Copy / Paste]
In the telephone answering, the conversation content of the responding partner is often a combination of “Yes” and “Yes”. I would like to use the same audio data for the parts corresponding to such conflicts. Therefore, cut (cut: delete the contents of the specified location and transfer it to the internal buffer area) / copy (copy the contents of the specified location to the internal buffer area without deleting it). It is convenient if / paste (paste: copying the content held in the internal buffer area to a specified location) can be realized.
FIGS. 25A and 25B are views showing a state where a cut / copy / paste menu window is displayed.
In FIG. 25A, a menu window 2502 appears by right-clicking the designated part.
In the menu window 2502, “copy”, “cut”, and “paste” are listed in order from the top, and items “file read” and “file write” continue.
In FIG. 25A, since the contents of the audio data file 904 are not held in the internal buffer area, the characters of the “Paste” item are dimmed and cannot be specified by left clicking the mouse. Yes. Characters of other items are darkly displayed and can be specified by left-clicking the mouse.
When “copy” is selected in the state of FIG. 25A, the contents of the audio data file 904 are copied to the internal buffer area, and at the same time, the menu window 2501 disappears.
When “Cut” is selected in the state of FIG. 25A, the contents of the audio data file 904 are copied to the internal buffer area, the audio data file 904 is deleted, and “no data” is displayed. Then, the menu window 2501 disappears.
When “Paste” is selected while the contents of the audio data file 904 are held in the internal buffer area, the contents of the audio data file 904 of the specified part are the contents of the audio data held in the internal buffer area. Overwritten by At this time, when the part to be “pasted” is displayed as “no data”, since the audio data file 904 does not exist, a new audio data file 904 is created with the file name of the part, The contents of the audio data held in the internal buffer area are transferred (pasted) there.
The items “file read” and “file write” are functions used to read and write the audio data file of the specified part independently.

図２５（ｂ）は、リファレンスパターン表示エリア１５１の第１パートにおいてマウスの右クリックを行った状態を示す。
図２５（ａ）と異なる点は、メニューウィンドウ２５０２と異なり、メニューウィンドウ２５０３には図１３で示した表示順指定項目が追加されている点である。このメニューウィンドウ２５０２は、リファレンスパターン表示エリア１５１の第１パートにおいてのみ表示されるものである。 FIG. 25B shows a state where the right mouse click is performed in the first part of the reference pattern display area 151.
25A differs from the menu window 2502 in that the display order designation item shown in FIG. 13 is added to the menu window 2503. This menu window 2502 is displayed only in the first part of the reference pattern display area 151.

［評価］
「評価」ボタン１１７を押すと、カーソルにて指定されている練習パートについて、公知のＤＰマッチング処理アルゴリズムを用いて、時間軸対応関係表示及び類似度表示を行う。
練習支援ソフト１０１が録音／練習／再生を行っていない状態では、カーソル位置のパートについて、カーソル位置のパートが練習パートである場合は、音声信号処理部９１２がタスク制御部９０８の指示を受けて、比較対象となる音声データファイル９０４を読み込み、評価処理を行う。評価結果は表示部９０７によって時間対応付け表示エリア１５５及び類似度スコア表示エリア１５７に表示される。
「自動評価」チェックボックス１１９にチェックを付すと、練習支援ソフト１０１が再生を行っている状態で、カーソル位置のパートについて、カーソル位置のパートが練習パートである場合は、評価処理を逐次行う。
評価のグラフィカル表示が図５である。 [Evaluation]
When the “evaluation” button 117 is pressed, a time axis correspondence display and a similarity display are performed for the practice part designated by the cursor using a known DP matching processing algorithm.
When the practice support software 101 is not recording / practice / playing, when the part at the cursor is the practice part, the audio signal processing unit 912 receives an instruction from the task control unit 908. The audio data file 904 to be compared is read and an evaluation process is performed. The evaluation result is displayed on the time association display area 155 and the similarity score display area 157 by the display unit 907.
When the “automatic evaluation” check box 119 is checked, when the practice support software 101 is playing back and the part at the cursor position is the practice part, the evaluation process is sequentially performed.
A graphical display of the evaluation is shown in FIG.

［ＭＰ３出力］
近年、フラッシュメモリ或は超小型ハードディスク装置を記憶媒体として用いている、ＭＰ３プレーヤが市場に広く流通している。携帯電話においてもＭＰ３プレーヤ機能を備える機種が広く流通している。（以下、これらを「ＭＰ３プレーヤ」と包括して略す。）このような機器に、本実施形態の練習支援ソフト１０１におけるリファレンスパターンの音声内容をＭＰ３形式で出力し、ＭＰ３プレーヤにて再生すれば、外出中でも個人練習ができる。 [MP3 output]
In recent years, MP3 players using a flash memory or a micro hard disk device as a storage medium have been widely distributed in the market. Also in mobile phones, models with MP3 player functions are widely distributed. (Hereafter, these are collectively abbreviated as “MP3 player”.) If the audio content of the reference pattern in the practice support software 101 of the present embodiment is output to such a device in the MP3 format and reproduced by the MP3 player. , Personal practice on the go.

図２６は、「ＭＰ３作成」ボタン１１４を押すと現れる、ＭＰ３作成メニューウィンドウを示す。
ＭＰ３作成メニューウィンドウ２６０１には、ＭＰ３の出力形態を指定するためのラジオボタン等が設けられている。
ＭＰ３出力形式ラジオボタン２６０２は、ＭＰ３の出力形式を指定するラジオボタンである。「全てのパートをモノラル形式で作成」「練習パート／応答パート別にステレオ形式で作成」「練習パートを無音にしたモノラル形式で作成」のいずれか一つが選択できる。
合図音指定ラジオボタン２６０３は、ＭＰ３出力形式ラジオボタン２６０２にて「練習パートを無音にしたモノラル形式で作成」を選択した際に、更にその出力オプションとして、合図音を入れるか否か、入れるならどのような形式で入れるかを指定するラジオボタンである。「合図音を入れない」「一定間隔の合図音を入れる」「一定分割の合図音を入れる」のいずれか一つが選択できる。
合図音間隔指定欄２６０４は、合図音指定ラジオボタン２６０３にて「一定間隔の合図音を入れる」を選択した際に、合図音の間隔を秒数で指定する指定欄である。
合図音数指定欄２６０５は、合図音指定ラジオボタン２６０３にて「一定分割の合図音を入れる」を選択した際に、分割数を指定する指定欄である。
「作成」ボタン２６０６は、ＭＰ３作成を実行するためのボタンである。 FIG. 26 shows an MP3 creation menu window that appears when the “Create MP3” button 114 is pressed.
The MP3 creation menu window 2601 is provided with radio buttons and the like for designating the output form of MP3.
The MP3 output format radio button 2602 is a radio button for designating an MP3 output format. You can select one of “Create all parts in monaural format”, “Create in stereo format for each practice part / response part” and “Create practice part in mono format with silence”.
The signal sound designation radio button 2603 indicates whether or not a signal sound is to be added as an output option when “Create a practice part in monaural format with silence” is selected with the MP3 output format radio button 2602. This is a radio button that specifies what format to use. Any one of “Do not insert a signal sound”, “Enter a signal sound at regular intervals”, and “Enter a signal sound at regular intervals” can be selected.
A cue sound interval designation field 2604 is a designation field for designating a cue sound interval in seconds when “insert a cue sound at a fixed interval” is selected with a cue sound designation radio button 2603.
A signal number designation field 2605 is a designation field for designating the number of divisions when “insert a predetermined number of signal sounds” is selected with the signal sound designation radio button 2603.
A “Create” button 2606 is a button for executing MP3 creation.

図２７及び図２８は、ＭＰ３出力処理の流れを示すフローチャートである。主にタスク制御部９０８が主体となって行う処理である。
図２６の「作成」ボタン２６０６を押すと（Ｓ２７０１）、最初にカウンタ変数ｉに初期値１を設定する（Ｓ２７０２）。
次に、ＭＰ３出力形式ラジオボタンの判定処理を行う。「全てのパートをモノラル形式で作成」が選択されているか（Ｓ２７０３）、「練習パート／応答パート別にステレオ形式で作成」が選択されているか（Ｓ２７０４）を見る。ステップＳ２７０３及びＳ２７０４のいずれの条件にも当てはまらない場合は、「練習パートを無音にしたモノラル形式で作成」を選択したこととなる（Ｓ２７０４のＮ）。 27 and 28 are flowcharts showing the flow of MP3 output processing. This process is mainly performed by the task control unit 908.
When the “Create” button 2606 in FIG. 26 is pressed (S2701), an initial value 1 is first set to the counter variable i (S2702).
Next, MP3 output format radio button determination processing is performed. It is checked whether “Create all parts in monaural format” is selected (S2703) or “Create in stereo format for each practice part / response part” is selected (S2704). If neither of the conditions in steps S2703 and S2704 is satisfied, it means that “create a practice part in monaural form with silence” is selected (N in S2704).

「全てのパートをモノラル形式で作成」が選択されている場合（Ｓ２７０３のＹ）、カウンタｉとパート数Ｎとを比較して（Ｓ２７０５）、パート数Ｎ以内なら（Ｓ２７０５のＹ）、音声信号処理部９１２を制御して、リファレンスパターンのｉ番目のパートをモノラル出力し、ＭＰ３形式に変換する（Ｓ２７０６）。その後、カウンタｉをインクリメントし（Ｓ２７０７）、処理を繰り返す（Ｓ２７０５）。カウンタｉがパート数Ｎを超えたら（Ｓ２７０５のＮ）、処理を終了する（Ｓ２７０８）。 When “Create all parts in monaural” is selected (Y in S2703), the counter i is compared with the number N of parts (S2705). If the number is within N (Y in S2705), the audio signal The processing unit 912 is controlled to output monaurally the i-th part of the reference pattern and convert it to the MP3 format (S2706). Thereafter, the counter i is incremented (S2707), and the process is repeated (S2705). If the counter i exceeds the number N of parts (N in S2705), the process is terminated (S2708).

「練習パート／応答パート別にステレオ形式で作成」が選択されている場合（Ｓ２７０４のＹ）、カウンタｉとパート数Ｎとを比較する（Ｓ２７０９）。パート数Ｎ以内なら（Ｓ２７０９のＹ）、次にｉ番目のパートは応答パートか否か確認する（Ｓ２７１０）。応答パートなら、音声信号処理部９１２を制御して、当該パートを右チャンネルにて出力し、ＭＰ３変換する（Ｓ２７１１）。練習パートなら、当該パートを左チャンネルにて出力し、ＭＰ３変換する（Ｓ２７１２）。
その後、カウンタｉをインクリメントし（Ｓ２７１３）、処理を繰り返す（Ｓ２７０９）。カウンタｉがパート数Ｎを超えたら（Ｓ２７０９のＮ）、処理を終了する（Ｓ２７１４）。 When “Create in stereo format for each practice part / response part” is selected (Y in S2704), the counter i is compared with the number N of parts (S2709). If the number of parts is within N (Y in S2709), it is checked whether the i-th part is a response part (S2710). If it is a response part, the audio signal processing unit 912 is controlled, the part is output on the right channel, and is converted to MP3 (S2711). If it is a practice part, the part is output on the left channel and converted to MP3 (S2712).
Thereafter, the counter i is incremented (S2713), and the process is repeated (S2709). If the counter i exceeds the number N of parts (N in S2709), the process is terminated (S2714).

「練習パートを無音にしたモノラル形式で作成」を選択されている場合（Ｓ２７０４のＮ）、カウンタｉとパート数Ｎとを比較する（Ｓ２８１５）。パート数Ｎ以内なら（Ｓ２８１５のＹ）、次にｉ番目のパートは応答パートか否か確認する（Ｓ２８１６）。応答パートなら、音声信号処理部９１２を制御して、当該パートをモノラルにて出力し、ＭＰ３変換する（Ｓ２８１７）。その後、カウンタｉをインクリメントし（Ｓ２８１８）、処理を繰り返す（Ｓ２８１５）。カウンタｉがパート数Ｎを超えたら（Ｓ２８１５のＮ）、処理を終了する（Ｓ２８１９）。
ステップＳ２８１６において当該パートが練習パートなら（Ｓ２８１６のＮ）、次に合図音指定ラジオボタン２６０３の判定処理を行う。「合図音を入れない」が選択されているか（Ｓ２８２０）、「一定間隔の合図音を入れる」が選択されているか（Ｓ２８２１）を見る。ステップＳ２８２０及びＳ２８２１のいずれの条件にも当てはまらない場合は、「一定分割の合図音を入れる」を選択したこととなる（Ｓ２８２１のＮ）。 When “Create practice part in monaural form with silence” is selected (N in S2704), the counter i is compared with the number N of parts (S2815). If the number of parts is within N (Y in S2815), it is checked whether the i-th part is a response part (S2816). If it is a response part, the audio signal processing unit 912 is controlled to output the part in monaural and MP3 conversion (S2817). Thereafter, the counter i is incremented (S2818), and the process is repeated (S2815). If the counter i exceeds the number of parts N (N in S2815), the process is terminated (S2819).
If it is determined in step S2816 that the part is a practice part (N in S2816), the process of determining the cue sound designation radio button 2603 is performed. It is checked whether “Do not insert signal sound” is selected (S2820) or “Input signal sound at regular intervals” is selected (S2821). If neither of the conditions in steps S2820 and S2821 is satisfied, it means that “insert a predetermined divided sound signal” is selected (N in S2821).

「合図音を入れない」が選択されている場合（Ｓ２８２０）、ｉ番目のパートの音声データファイルの再生時間を得る。そして、音声信号処理部９１２を制御して、その時間に相当する無音データを出力し、ＭＰ３変換する（Ｓ２８２２）。 When “Do not insert cue sound” is selected (S2820), the playback time of the audio data file of the i-th part is obtained. Then, the audio signal processing unit 912 is controlled to output silence data corresponding to the time and MP3 conversion (S2822).

「一定間隔の合図音を入れる」が選択されている場合（Ｓ２８２１）、ｉ番目のパートの音声データファイルの再生時間を得る。そして、音声信号処理部９１２を制御して、その時間だけ指定間隔の合図音を出力し、ＭＰ３変換する（Ｓ２８２３）。 When “insert signal at regular intervals” is selected (S2821), the playback time of the audio data file of the i-th part is obtained. Then, the audio signal processing unit 912 is controlled to output a signal sound at a specified interval for that time, and MP3 conversion is performed (S2823).

「一定分割の合図音を入れる」を選択されている場合（Ｓ２８２１のＮ）、ｉ番目のパートの音声データファイルの再生時間を得て、指定された分割数から、合図音を出す時間間隔を算出する（Ｓ２８２４）。そして、音声信号処理部９１２を制御して、当該音声データファイルの再生時間だけ、算出した間隔の合図音を出力し、ＭＰ３変換する（Ｓ２８２５）。 When “insert a fixed-division signal sound” is selected (N in S2821), the playback time of the audio data file of the i-th part is obtained, and the time interval for outputting the signal sound from the specified number of divisions is set. Calculate (S2824). Then, the audio signal processing unit 912 is controlled to output a signal sound at the calculated interval for the reproduction time of the audio data file, and MP3 conversion is performed (S2825).

ＭＰ３出力を行った後は（Ｓ２８１７、Ｓ２８２２、Ｓ２８２３、Ｓ２８２５）、カウンタｉをインクリメントして（Ｓ２８１８）、カウンタｉがパート数を越えるまで処理を繰り返し（Ｓ２８１５）、パート数Ｎを越えたら終了する（Ｓ２８１９）。 After performing MP3 output (S2817, S2822, S2823, S2825), the counter i is incremented (S2818), and the processing is repeated until the counter i exceeds the number of parts (S2815). (S2819).

本実施形態には、以下のような応用例が考えられる。
（１）上述の実施形態では、音声データを各パート毎に単一の音声データファイルとして管理していたが、単一の音声データファイルにすることもできる。また、音声データを格納するリレーショナルデータベースを用いることもできる。 The following application examples can be considered in the present embodiment.
(1) In the above-described embodiment, audio data is managed as a single audio data file for each part. However, a single audio data file may be used. A relational database that stores voice data can also be used.

（２）上述の実施形態はあくまで相手が一人の場合だった。本発明の技術思想を発展させて、相手が複数の場合においても、本実施形態に多少の改変を施すだけで、複数相手の会話練習ソフトを実現することができる。
具体的には、各パート毎に応答パートか練習パートかを示すフラグを持たせる。
この応用形態では、例えば舞台等の演劇やドラマ等の出演者が、自宅等で台詞を覚える練習をする際に極めて好適である。リファレンスパターンに監督等が台本を読み上げて作成したタスクファイルを、複数の出演者に渡す。出演者は本ソフトを手持ちのパソコンで稼動させ、自分のパート、すなわち練習パートを指定して、台詞の練習ができるのである。 (2) In the above-described embodiment, there is only one partner. By developing the technical idea of the present invention, even when there are a plurality of opponents, it is possible to realize the conversation practice software for a plurality of opponents by making some modifications to this embodiment.
Specifically, each part has a flag indicating whether it is a response part or a practice part.
In this application mode, for example, a performer such as a stage or a drama, for example, is extremely suitable when practicing to learn dialogue at home or the like. A task file created by a director reading a script as a reference pattern is given to multiple performers. Performers can run this software on their own personal computer, specify their own part, that is, the practice part, and practice the dialogue.

（３）本発明の技術思想を応用して、記録するデータを音声データに留まらず、ＭＩＤＩデータを記録再生できるようにすれば、楽器練習にも応用できる。テキストデータの代わりに、楽譜や、ギター或はベースの運指を示すタブラチュア譜を表示すれば、なお便利である。楽器練習においては、評価機能がリズムのずれを明確に表示することとなるので、これもまた練習に大変効果的である。 (3) If the technical idea of the present invention is applied so that the data to be recorded is not limited to audio data but MIDI data can be recorded and reproduced, it can be applied to musical instrument practice. Instead of text data, it is more convenient to display a score or a tablature score indicating a guitar or bass fingering. In musical instrument practice, the evaluation function clearly displays rhythm shifts, which is also very effective for practice.

（４）反復練習を行うために、練習の対象範囲を指定して、繰り返し練習を行うように機能を追加することもできる。 (4) In order to perform repetitive practice, it is possible to add a function to specify the target range of practice and perform repetitive practice.

（５）相槌に相当する応答パートは専用の音声データファイルをランダムに再生する機能を追加することもできる。
予め、″ｒｅｓ″から始まる、相槌に相当する音声データファイルを複数用意する。
そして、リファレンスパターンの応答パートについて、マウスの右クリックにて「相槌」の指定を行う機能を追加する。「相槌」の指定がなされた当該応答パートは、″ｒｅｆ″から始まる音声データファイル名が、用意してある″ｒｅｓ″から始まる音声データファイル名にランダムに変換される。
この機能により、リファレンスパターンの作成作業の手間が軽減される。 (5) A response part corresponding to the conflict can be added with a function of randomly reproducing a dedicated audio data file.
In advance, a plurality of audio data files corresponding to each other starting with “res” are prepared.
Then, for the response part of the reference pattern, a function for specifying “conformity” by right-clicking the mouse is added. In the response part for which “consideration” is designated, the audio data file name starting with “ref” is randomly converted into the prepared audio data file name starting with “res”.
This function reduces the labor of creating a reference pattern.

本実施形態においては、電話応対練習支援プログラム及びこれを実行することによって実現される電話応対練習支援装置を開示した。
装置に蓄積される音声データを相手側（応答パート）と自分側（練習パート）に分類して管理することにより、一人での電話応対練習を効率よく実現できる。 In the present embodiment, a telephone answering practice support program and a telephone answering practice support apparatus realized by executing the program are disclosed.
By classifying and managing the voice data stored in the device into the other party (response part) and one's side (practice part), it is possible to efficiently implement telephone answering practice alone.

お手本（リファレンスパターン）及び練習（入力パターン）の内容を、音声データの波形を表示することにより、現在データとして格納されている内容がどのようになっているかが視覚的に明確にわかる。
また、音声データの波形に応じて表示幅が変わると共に、音声データがない場合は「ｎｏｄａｔａ」表記と共に規定の幅で表示することで、今どの音声データが存在しているか否かが一目で判別できる。 The contents of the model (reference pattern) and practice (input pattern) can be clearly seen visually by displaying the waveform of the audio data so that the contents currently stored as data are displayed.
In addition, the display width changes according to the waveform of the audio data, and when there is no audio data, it is displayed with a specified width together with the “no data” notation so that it can be determined at a glance which audio data is present. Can be determined.

データに表示順フラグを内包させることにより、応答パート、練習パートのいずれから開始する場合にも対応できる。
また、表示順フラグを変更する機能を持つことにより、データ入力の途中から、応答パート、練習パートのいずれから開始する場合にも変更できる。 By including the display order flag in the data, it is possible to cope with the case where the response part or the practice part is started.
In addition, by having a function of changing the display order flag, it is possible to change when starting from either the response part or the practice part in the middle of data input.

リファレンスパターンの録音はマウスのボタンを押しながら連続録音ができるので、リファレンスパターンの録音作業が非常に効率よくできる。
録音や練習の際には、マウスのボタンを押すことで音声データの録音が完了したパートから、録音であればリファレンスパターン表示エリア、練習であれば入力パターン表示エリアの再表示を行うことで、どこまで録音或は練習をしたのか、その進捗状況が一目で判ると共に、録音した音声データの時間幅が視覚的にわかる。 Since the reference pattern can be recorded continuously while pressing the mouse button, the reference pattern can be recorded very efficiently.
When recording or practicing, from the part where audio data recording has been completed by pressing the mouse button, by redisplaying the reference pattern display area for recording and redisplaying the input pattern display area for practice, You can see at a glance how far you have recorded or practiced, and you can visually see the duration of the recorded audio data.

リファレンスパターンの録音を終了すると、入力パターンの応答パートにも同一内容の音声データファイルを複写して、入力パターン表示テーブルに表示される。このように構成することで、練習のために用意するデータが自動的に作成されるので、リファレンスパターンの録音完了後、すぐに練習に取り掛かることができる。
練習の際には応答パートを再生し、再生を終了したら即座に練習パートの録音を開始する。こうすることにより、一人での電話応対練習を効率よく実現できる。 When the recording of the reference pattern is finished, the audio data file having the same contents is copied to the response part of the input pattern and displayed on the input pattern display table. By configuring in this way, data prepared for practice is automatically created, so that the practice can be started immediately after the recording of the reference pattern is completed.
When practicing, the response part is played, and when the playback ends, recording of the practice part starts immediately. By doing so, it is possible to efficiently practice telephone reception alone.

練習の結果、出来の良かった入力パターン中の練習パートをリファレンスパターンへ差し替える処理を行う機能を設けることにより、自己練習でありながら効率良くスキルアップを図ることができる。 As a result of the practice, by providing a function for performing a process of replacing the practice part in the input pattern which is good as a result of the practice, it is possible to improve the skill efficiently while performing self practice.

リファレンスパターンの音声データを一つの音声データとして出力する機能を設けることにより、ノートパソコンが持ち込めないような環境下においても、ＭＰ３プレーヤ等携帯型音楽再生機器にて聴取しながら練習をすることもできる。
更に、音声データを出力する際に、練習パートを含めるか含めないかを選択できるように構成することにより、練習の習熟度に応じて適切な音声データを出力でき、練習が効率よくできる。 By providing a function that outputs the audio data of the reference pattern as a single audio data, it is possible to practice while listening to a portable music player such as an MP3 player even in an environment where a notebook computer cannot be brought in. .
Furthermore, when voice data is output, it is possible to select whether or not to include a practice part, so that appropriate voice data can be output according to the proficiency level of practice and practice can be performed efficiently.

以上、本発明の実施形態例について説明したが、本発明は上記実施形態例に限定されるものではなく、特許請求の範囲に記載した本発明の要旨を逸脱しない限りにおいて、他の変形例、応用例を含むことは言うまでもない。 The embodiment of the present invention has been described above. However, the present invention is not limited to the above-described embodiment, and other modifications may be made without departing from the gist of the present invention described in the claims. It goes without saying that application examples are included.

本発明の一実施の形態による電話応対練習支援プログラムを起動した時の画面表示を示す図である。It is a figure which shows the screen display when starting the telephone reception practice assistance program by one embodiment of this invention. 空のパートを幾つか作成した状態の画面表示を示す図である。It is a figure which shows the screen display of the state which produced some empty parts. リファレンスパターンの録音を完了した状態の画面表示を示す図である。It is a figure which shows the screen display of the state which completed recording of the reference pattern. 入力パターンの練習を行った状態の画面表示を示す図である。It is a figure which shows the screen display of the state which practiced the input pattern. 評価を実行した状態を示す図である。It is a figure which shows the state which performed evaluation. リファレンスパターン表示エリアと入力パターン表示エリアを一部拡大した図である。FIG. 4 is a partially enlarged view of a reference pattern display area and an input pattern display area. 電話応対練習支援装置の内部データの構造を示す模式図である。It is a schematic diagram which shows the structure of the internal data of a telephone reception practice assistance apparatus. 電話応対練習支援装置の内部データと、リファレンスパターン表示エリア及び入力パターン表示エリアの表示内容との対応関係を示す模式図である。It is a schematic diagram which shows the correspondence of the internal data of a telephone reception practice assistance apparatus, and the display content of a reference pattern display area and an input pattern display area. 電話応対練習支援装置の機能ブロック図である。It is a functional block diagram of a telephone reception practice support device. 練習支援ソフトを起動した直後の処理の流れを示すフローチャートである。It is a flowchart which shows the flow of a process immediately after starting practice assistance software. デフォルト設定の詳細な動作を示すフローチャートである。It is a flowchart which shows the detailed operation | movement of a default setting. 入力パターン管理テーブルを生成する処理を示すフローチャートである。It is a flowchart which shows the process which produces | generates an input pattern management table. 表示画面を用いた状態遷移図である。It is a state transition diagram using a display screen. リファレンスパターン表示エリアにおける表示処理を示すフローチャートである。It is a flowchart which shows the display process in a reference pattern display area. 入力パターン表示エリアにおける表示処理を示すフローチャートである。It is a flowchart which shows the display process in an input pattern display area. パート挿入ボタンを押したときの処理を示すフローチャートである。It is a flowchart which shows a process when a part insertion button is pushed. パート削除ボタンを押したときの処理を示すフローチャートである。It is a flowchart which shows a process when a part deletion button is pushed. 入力パターンのデータ更新処理を示すフローチャートである。It is a flowchart which shows the data update process of an input pattern. リファレンスパターンの録音を行う際の処理を示すフローチャートである。It is a flowchart which shows the process at the time of recording a reference pattern. リファレンスパターンの録音を行う際の処理を示すフローチャートである。It is a flowchart which shows the process at the time of recording a reference pattern. 表示画面を用いた状態遷移図である。It is a state transition diagram using a display screen. 入力パターンにおける練習を行う際の処理を示すフローチャートである。It is a flowchart which shows the process at the time of practicing in an input pattern. 入力パターンにおける練習を行う際の処理を示すフローチャートである。It is a flowchart which shows the process at the time of practicing in an input pattern. リファレンスパターン及び入力パターンの再生処理を示すフローチャートである。It is a flowchart which shows the reproduction | regeneration process of a reference pattern and an input pattern. カット／コピー／ペーストのメニューウィンドウを表示した状態を示す図である。It is a figure which shows the state which displayed the menu window of cut / copy / paste. ＭＰ３作成メニューウィンドウを示す図である。It is a figure which shows a MP3 creation menu window. ＭＰ３出力処理の流れを示すフローチャートである。It is a flowchart which shows the flow of MP3 output processing. ＭＰ３出力処理の流れを示すフローチャートである。It is a flowchart which shows the flow of MP3 output processing. 財団法人日本電信電話ユーザ協会が主催する電話応対コンクールの、平成１８年度の課題である。This is a 2006 issue of the telephone reception contest sponsored by the Nippon Telegraph and Telephone User Association.

Explanation of symbols

１０１…練習支援ソフト、１０２…表示画面、１０３…「リファレンス再生」ボタン、１０４…「入力再生」ボタン、１０５…「タスク新規作成」ボタン、１０６…「タスク読み込み」ボタン、１０７…「タスク書き出し」ボタン、１０８…タスクファイル名表示欄、１０９…「録音」ボタン、１１０…「連続」チェックボックス、１１１…「パート追加」ボタン、１１２…「パート挿入」ボタン、１１３…「パート削除」ボタン、１１４…「ＭＰ３作成」ボタン、１１５…「練習開始」ボタン、１１６…「応答速度」変更欄、１１７…「評価」ボタン、１１８…「比較再生」ボタン、１１９…「自動評価」チェックボックス、１２０…「テキスト表示」チェックボックス、１２１…「メトロノーム」チェックボックス、１２２…タイマチェックボックス、１２３…「制限時間」設定欄、１２４…タイマ表示欄、１２５…テンポ設定欄、１２６…時間軸変更ボタン、１２７…時間軸変更ボタン、１２８…時間軸変更ボタン、１３０…「終了」ボタン、１３１…「差し替え」ボタン、１５１…リファレンスパターン表示エリア、１５２…入力パターン表示エリア、１５３…詳細表示エリア、２０２…練習パート、２０３…応答パート、７０１…リファレンスパターン管理テーブル、７０２…音声データ、７０３…入力パターン管理テーブル、７０４…テキストデータ、７０５…表示順フラグ、９０１…電話応対練習支援装置、９０２…不揮発性ストレージ、９０３…タスクファイル、９０４…音声データファイル、９０５…ＲＡＭ、９０６…操作部、９０７…表示部、９０８…タスク制御部、９０９…タイマ部、９１０…Ａ／Ｄ変換部、９１１…Ｄ／Ａ変換部、９１２…音声信号処理部、９１３…マイクロホン、９１４…スピーカ、２５０２…メニューウィンドウ、２５０３…メニューウィンドウ、２６０１…ＭＰ３作成メニューウィンドウ、２６０２…ＭＰ３出力形式ラジオボタン、２６０３…合図音指定ラジオボタン、２６０４…合図音間隔指定欄、２６０５…合図音数指定欄、２６０６…「作成」ボタン DESCRIPTION OF SYMBOLS 101 ... Practice support software, 102 ... Display screen, 103 ... "Reference reproduction" button, 104 ... "Input reproduction" button, 105 ... "Create new task" button, 106 ... "Read task" button, 107 ... "Export task" Button 108, Task file name display field 109, “Record” button, 110, “Continuous” check box, 111, “Add part” button, 112, “Insert part” button, 113, “Delete part” button, 114 ... "MP3 creation" button, 115 ... "Practice start" button, 116 ... "Response speed" change column, 117 ... "Evaluation" button, 118 ... "Comparison playback" button, 119 ... "Automatic evaluation" check box, 120 ... "Text display" check box, 121 ... "Metronome" check box, 122 ... Timer check 123, “Time limit” setting field, 124, Timer display field, 125, Tempo setting field, 126, Time axis change button, 127, Time axis change button, 128, Time axis change button, 130, “End” Buttons 131 131 “replace” button 151 reference pattern display area 152 input pattern display area 153 detail display area 202 practice part 203 response part 701 reference pattern management table 702 audio data 703 ... Input pattern management table, 704 ... Text data, 705 ... Display order flag, 901 ... Telephone answering practice support device, 902 ... Non-volatile storage, 903 ... Task file, 904 ... Audio data file, 905 ... RAM, 906 ... Operation unit, 907 ... display unit, 908 ... task Control unit, 909 ... Timer unit, 910 ... A / D conversion unit, 911 ... D / A conversion unit, 912 ... Audio signal processing unit, 913 ... Microphone, 914 ... Speaker, 2502 ... Menu window, 2503 ... Menu window, 2601 ... MP3 creation menu window, 2602 ... MP3 output format radio button, 2603 ... cue sound designation radio button, 2604 ... cue sound interval designation field, 2605 ... cue sound number designation field, 2606 ... "Create" button

Claims

Multiple audio data groups;
A reference pattern management table showing a playback procedure of the plurality of audio data groups;
A playback audio data group consisting of a flag indicating whether the playback start is yourself or the other party, and
The reference pattern management table of the reproduced audio data group is read, and a reference pattern display area for displaying the plurality of audio data groups in time series of the reproduction procedure is displayed based on the description of the reference pattern management table and the flag. A display unit;
An audio data reproduction apparatus comprising: a reproduction unit that reproduces the plurality of audio data groups based on the reference pattern management table of the reproduction audio data group.

The display unit further includes:
A plurality of exemplary practice parts, each having a width of a length relatively indicating the playback time of the plurality of audio data;
A plurality of responses, each having a width of a length indicating a reproduction time of a plurality of audio data, wherein at least one of a shape, a pattern, and a color is distinguished from the model practice part in a different display form Part,
2. The audio data reproducing apparatus according to claim 1, wherein a reference pattern display area is displayed in which the plurality of model practice parts and response parts are listed and displayed in time series.

The reproduced audio data group further includes:
An input pattern management table showing a reproduction procedure of the plurality of audio data groups together with the reference pattern management table;
The display unit further includes:
The reference pattern management table, the input pattern management table, and the flag of the reproduction audio data group are read, and the plurality of audio data groups are reproduced based on the description of the reference pattern management table, the input pattern management table, and the flag. 2. The audio data reproducing apparatus according to claim 1, wherein an input pattern display area for displaying the procedures in time series is displayed.

Computer
Multiple audio data groups;
A reference pattern management table showing a playback procedure of the plurality of audio data groups;
A playback audio data group consisting of a flag indicating whether the playback start is yourself or the other party, and
The reference pattern management table of the reproduced audio data group is read, and a reference pattern display area for displaying the plurality of audio data groups in time series of the reproduction procedure is displayed based on the description of the reference pattern management table and the flag. A display unit;
An audio data reproduction program for functioning as a reproduction unit that reproduces the plurality of audio data groups based on the reference pattern management table of the reproduction audio data group.

The display unit further includes:
A plurality of exemplary practice parts, each having a width of a length relatively indicating the playback time of the plurality of audio data;
A plurality of responses, each having a width of a length indicating a reproduction time of a plurality of audio data, wherein at least one of a shape, a pattern, and a color is distinguished from the model practice part in a different display form Part,
5. The audio data reproducing program according to claim 4, wherein a reference pattern display area in which the plurality of model practice parts and response parts are listed and displayed in time series is displayed.

The reproduced audio data group further includes:
An input pattern management table showing a reproduction procedure of the plurality of audio data groups together with the reference pattern management table;
The display unit further includes:
The reference pattern management table, the input pattern management table, and the flag of the reproduction audio data group are read, and the plurality of audio data groups are reproduced based on the description of the reference pattern management table, the input pattern management table, and the flag. 5. The audio data reproducing program according to claim 4, wherein an input pattern display area for displaying the procedures in time series is displayed.

A reference pattern management table showing a procedure for alternately reproducing a plurality of audio data groups classified into a practice part belonging to the own side and a response part belonging to the other side;
An input pattern management table showing a procedure for alternately reproducing a plurality of audio data groups classified into a recording part recorded corresponding to the practice part belonging to the self side and a response part belonging to the counterpart side;
According to the description of the input pattern management table, a playback unit for playing back a response part belonging to the counterpart,
A practice apparatus comprising: a recording unit that adds sound data created by recording to the recording part in accordance with the description of the input pattern management table.

The practice apparatus according to claim 7, further comprising a similarity calculation unit that calculates a similarity of a speech waveform between the speech data belonging to the practice part and the corresponding speech data belonging to the recording part.

The control unit follows the description of the input pattern management table,
In response to the reproduction unit having finished reproducing the audio data belonging to the response part, the recording unit is instructed to record the audio data belonging to the recording part, and
8. The practice apparatus according to claim 7, wherein during the recording of the recording part, in response to a predetermined control signal, the reproducing unit is instructed to reproduce audio data belonging to the response part.

Furthermore, while the control unit controls the reproduction unit according to the description of the reference pattern management table or the input pattern management table and performs reproduction of the audio data group, The practice apparatus according to claim 7, further comprising a time display unit for displaying a reproduction time of the video and a total reproduction time from the top of the reference pattern management table or the input pattern management table.

Furthermore, while the control unit controls the recording unit according to the description of the reference pattern management table and performs recording of the practice part and the response part, the playback time of the currently recorded part and the 8. The practice apparatus according to claim 7, further comprising a time display unit for displaying a total recording time from the top of the reference pattern management table.

Further, the control unit controls the recording unit and the playback unit according to the description of the input pattern management table, and while recording the recording part and playing the response part, 8. The practice apparatus according to claim 7, further comprising a time display unit for displaying a reproduction time of a part or a currently reproduced part and a total elapsed time from the top of the input pattern management table.

The voice data creation unit for creating a single voice data file by connecting voice data belonging to the practice part and the response part according to the description of the reference pattern management table. The practice device described.

The voice data creation unit
A first audio data file creation mode that connects all part audio in monaural;
A second audio data file creation mode in which the practice part and the response part are divided and connected to the left and right of the stereo;
14. The practice apparatus according to claim 13, further comprising a third voice data file creation mode in which the voice of the practice part is connected to the response part instead of silence.

The practice device according to claim 7, wherein the playback unit can change a playback speed of audio data belonging to the response part.

The practice device according to claim 7, wherein the control unit has a function of designating one of the recording parts and overwriting voice data on the corresponding practice part.

8. The control unit according to claim 7, wherein the control unit has a function of designating an arbitrary part on the reference pattern management table and the input pattern management table and copying audio data to another arbitrary part. Exercise equipment.

Furthermore, when practicing based on the said input pattern management table, it has a text display part which displays the text of the audio | voice to input in parallel with the recording of the said recording part, The characterized by the above-mentioned. Practice equipment.

8. The practice apparatus according to claim 7, wherein the control unit has a function of designating whether the head of the reference pattern management table is the practice part or the response part.