JP2011107503A

JP2011107503A - Voice scenario setting program and voice scenario setting device

Info

Publication number: JP2011107503A
Application number: JP2009263772A
Authority: JP
Inventors: Takaya Tsunomori; 隆也角森
Original assignee: Fujitsu Advanced Engineering Ltd
Current assignee: Fujitsu Advanced Engineering Ltd
Priority date: 2009-11-19
Filing date: 2009-11-19
Publication date: 2011-06-02
Anticipated expiration: 2029-11-19
Also published as: JP5331657B2

Abstract

PROBLEM TO BE SOLVED: To provide a voice scenario setting program and a voice scenario setting device that make it easy to set a voice scenario suited to a worker's proficiency level.

SOLUTION: A user's voice input record is obtained. Based on a table that associates voice input records with proficiency levels, the proficiency level corresponding to the user's voice input record is selected. A computer is made to function as an automatic scenario setting means 105 that automatically sets a voice scenario according to the user's proficiency level.

COPYRIGHT: (C)2011, JPO&INPIT

Description

The present invention relates to a voice scenario setting program and a voice scenario setting device, and more particularly to a voice scenario setting program and a voice scenario setting device for setting a voice scenario used in a voice system.

In recent years, some factories, distribution sites, and similar workplaces have begun entering data by means of voice recognition technology in order to improve work efficiency. A voice system that uses voice recognition technology employs, for example, a headset, a communication device that integrates an earphone and a microphone. In such a system (hereinafter referred to as a voice system), work proceeds through voice instructions to the worker (voice output) and voice input from the worker (voice input).

A voice system performs voice output to the worker and accepts voice input from the worker by using a voice scenario, which describes the procedure of the conversation with the worker, and a voice grammar (conversion grammar) for recognizing speech. Specifically, the voice grammar is used as a voice dictionary for converting voice data into text data. Conventionally, voice systems have realized voice scenarios by developing programs in VoiceXML or the like (see, for example, Patent Document 1).

VoiceXML is an XML-based language for building user interfaces that operate a computer by voice instructions. In a voice system, the voice grammars also had to be prepared individually, hard-coded in the program in VoiceXML or the like.

Thus, a voice system is realized through program development and is built by the development side. It was therefore technically difficult for the operation side to change the voice scenarios or voice grammars of a voice system. Moreover, when the operation side individually commissioned the development side to develop the voice system in order to change a voice scenario or voice grammar, considerable cost and time were required.

Conventionally, in a voice system, an administrator judged each worker's proficiency at the work and set a voice scenario suited to the individual worker. When no voice scenario suited a worker, the worker was trained to be able to use a specific voice scenario. A voice guidance device that determines the user's degree of familiarity from, for example, the number of times the user has accessed the system, and changes the voice guidance accordingly, is already known (see, for example, Patent Document 2).

Patent Document 1: JP 2005-321730 A
Patent Document 2: JP 2001-22370 A

A worker's proficiency at the work gradually improves as the worker continues to use the voice system. In a conventional voice system, the voice scenario therefore had to be set again as the worker's proficiency improved. However, the rate of improvement differs greatly between individuals. Consequently, in a conventional voice system, the administrator must assess the improvement in proficiency of each worker and reset, for each worker, a voice scenario that matches that worker's proficiency.

Assessing each worker's improvement in proficiency and resetting a matching voice scenario for each worker is not easy, and it imposes a heavy workload on the administrator.

An embodiment of the present invention has been made in view of the above points, and its object is to provide a voice scenario setting program and a voice scenario setting device with which a voice scenario suited to a worker's proficiency can be set easily.

To solve the above problem, an embodiment of the present invention is a voice scenario setting program that causes a computer to function as automatic scenario setting processing means which acquires a user's voice input record, selects the proficiency level corresponding to that voice input record based on a table that associates voice input records with proficiency levels, and automatically sets a voice scenario according to the user's proficiency level.
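The selection step described in this paragraph can be illustrated with a minimal sketch. It assumes, purely for illustration, that the voice input record is summarized as a recognition success rate and that the table maps minimum rates to three proficiency levels. The thresholds, the names `LEVEL_TABLE`, `SCENARIO_FILES`, `select_level`, and `set_scenario`, and the scenario file names are hypothetical and do not appear in this publication.

```python
from typing import Dict, List, Tuple

# Hypothetical table associating a voice input record with a proficiency level.
# Each row is (minimum recognition success rate, level), ordered from highest.
LEVEL_TABLE: List[Tuple[float, str]] = [
    (0.90, "advanced"),
    (0.70, "intermediate"),
    (0.00, "beginner"),
]

# Hypothetical scenario file prepared for each proficiency level.
SCENARIO_FILES: Dict[str, str] = {
    "beginner": "scenario_beginner.vxml",
    "intermediate": "scenario_intermediate.vxml",
    "advanced": "scenario_advanced.vxml",
}


def select_level(success_rate: float) -> str:
    """Return the level of the first table row whose threshold is met."""
    for minimum_rate, level in LEVEL_TABLE:
        if success_rate >= minimum_rate:
            return level
    return "beginner"  # fallback for out-of-range (negative) input


def set_scenario(success_rate: float) -> str:
    """Pick the scenario file that matches the user's proficiency level."""
    return SCENARIO_FILES[select_level(success_rate)]
```

Keeping the thresholds in a table rather than in code mirrors the table-driven approach of the paragraph: changing the level boundaries would not require changing the selection logic.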

Applying the components or expressions of an embodiment of the present invention, or any combination of those components, to a method, a device, a system, a computer program, a recording medium, a data structure, or the like is also effective as an aspect of the present invention.

As described above, according to an embodiment of the present invention, a voice scenario suited to a worker's proficiency can be set easily.

A hardware configuration diagram of an example of the voice scenario generation device of this embodiment.
A block diagram of an example of the voice scenario generation device.
An image diagram of an example of an inventory confirmation scenario.
A configuration diagram of an example of the scenario master.
A flowchart of an example of the processing procedure of the scenario automatic generation processing unit.
A flowchart of an example of the input/output scenario description process.
A flowchart of an example of the processing scenario description process.
A flowchart of an example of the branch scenario description process.
A flowchart of an example of the end scenario description process.
A configuration diagram of an example of a 3-digit code master table.
An image diagram showing the procedure for generating a voice grammar from the 3-digit code master table.
A configuration diagram (1/6) showing a specific example of the scenario master.
A configuration diagram (2/6) showing a specific example of the scenario master.
A configuration diagram (3/6) showing a specific example of the scenario master.
A configuration diagram (4/6) showing a specific example of the scenario master.
A configuration diagram (5/6) showing a specific example of the scenario master.
A configuration diagram (6/6) showing a specific example of the scenario master.
An example screen image displaying the file names of the generated scenario files.
A configuration diagram of an example showing the contents of a generated scenario file.
A configuration diagram of an example showing the contents of a generated scenario file.
A configuration diagram of an example showing the contents of a generated scenario file.
A configuration diagram of an example showing the contents of a generated scenario file.
A configuration diagram of an example showing the contents of a generated scenario file.
A configuration diagram of an example showing the contents of a generated scenario file.
A configuration diagram of an example showing the contents of a generated scenario file.
A configuration diagram of an example showing the contents of a generated scenario file.
A configuration diagram of an example showing the contents of a generated scenario file.
A configuration diagram of an example showing the contents of a generated scenario file.
A block diagram of an example of the voice scenario setting device.
A configuration diagram of an example of the voice utterance ability evaluation table.
A configuration diagram of an example of the voice hearing ability evaluation table.
A configuration diagram of an example of the voice response time evaluation table.
An image diagram of an example of beginner, intermediate, and advanced voice scenarios.

Next, modes for carrying out the present invention are described on the basis of the following embodiments, with reference to the drawings. The embodiments first describe a voice scenario generation device that generates voice scenarios, and then a voice scenario setting device that sets voice scenarios.

(Voice scenario generation device 1)
FIG. 1 is a hardware configuration diagram of an example of the voice scenario generation device of this embodiment. The voice scenario generation device 1 may be a stand-alone device, or it may be connected to user terminals for data communication via a network such as the Internet or a LAN.

The voice scenario generation device 1 includes an input device 11, an output device 12, a drive device 13, an auxiliary storage device 14, a main storage device 15, an arithmetic processing device 16, and an interface device 17, which are connected to one another by a bus B.

The input device 11 is a keyboard, a mouse, or the like, and is used to input various signals. The output device 12 is a display device or the like, and is used to display various windows, data, and so on. The interface device 17 is a modem, a LAN card, or the like, and is used to connect to a network.

The voice scenario generation program of this embodiment is at least part of the various programs that control the voice scenario generation device 1. The program is provided, for example, by distribution of a recording medium 18 or by download from a network. Various types of recording media can be used as the recording medium 18 on which the voice scenario generation program is recorded, including media that record information optically, electrically, or magnetically, such as a CD-ROM, a flexible disk, or a magneto-optical disk, and semiconductor memories that record information electrically, such as a ROM or a flash memory.

When the recording medium 18 on which the voice scenario generation program is recorded is set in the drive device 13, the program is installed from the recording medium 18 into the auxiliary storage device 14 via the drive device 13. A voice scenario generation program downloaded from a network is installed into the auxiliary storage device 14 via the interface device 17. The auxiliary storage device 14 stores the installed voice scenario generation program together with the necessary files, data, and so on.

When the voice scenario generation program is started, the main storage device 15 reads the program from the auxiliary storage device 14 and stores it. The arithmetic processing device 16 then carries out the various processes described later in accordance with the voice scenario generation program stored in the main storage device 15.

FIG. 2 is a block diagram of an example of the voice scenario generation device. The voice scenario generation device 1 includes a scenario master 21, a grammar master 22, a scenario automatic generation processing unit 23, a grammar automatic generation processing unit 24, and a grammar condition designation receiving unit 25. Although FIG. 2 shows the scenario master 21 and the grammar master 22 inside the voice scenario generation device 1, they may instead be provided outside it (for example, on a DB server).

In the scenario master 21, the execution steps of the voice scenario 26 to be generated are registered, for example, in table form. The scenario automatic generation processing unit 23 sequentially reads the records representing the execution steps of the voice scenario from the scenario master 21 and automatically generates the voice scenario 26 from the records it has read. The scenario automatic generation processing unit 23 generates a scenario file for each record read and builds the voice scenario 26 from one or more scenario files. When multiple voice scenarios 26 are to be generated, the scenario master 21 holds, for each voice scenario 26, its execution steps in table form. Details of the scenario automatic generation processing unit 23, the scenario master 21, and the voice scenario 26 are given later.

The grammar condition designation receiving unit 25 accepts a designation of the generation conditions for the voice grammar 27, for example from an administrator operating the voice scenario generation device 1. The grammar master 22 holds one or more tables for converting voice words input by voice into output items (converted words). The output items are used in the scenario control processing performed in the voice system.

The grammar automatic generation processing unit 24 then reads the records registered in the grammar master 22 in accordance with the generation conditions for the voice grammar 27 accepted by the grammar condition designation receiving unit 25, and automatically generates the voice grammar 27 from the records it has read. Details of the grammar condition designation receiving unit 25, the grammar automatic generation processing unit 24, the grammar master 22, and the voice grammar 27 are given later.

The following description uses the inventory confirmation scenario shown in FIG. 3 as an example of an automatically generated voice scenario 26. The inventory confirmation scenario of FIG. 3 includes steps related to inventory confirmation and steps related to end confirmation. It also includes voice instructions 31 to 35 from the voice system, data processing 36, and worker utterances 41 to 43.

For the inventory confirmation scenario of FIG. 3, the administrator registers a scenario master 21 such as the one shown in FIG. 4. FIG. 4 is a configuration diagram of an example of the scenario master. The scenario master 21 shown in FIG. 4 has the items "No.", "output message", "processing category", "input storage variable", "branch No. 1 to n", "dictionary name", "data processing name", "instruction parameter", and "return value".

The item "No." identifies an execution step of the scenario. When the scenario branches, execution transitions to the execution step whose "No." is registered in "branch No. 1 to n". In the example of FIG. 4, the scenario starts from the execution step "No. 01"; if the worker inputs "yes", execution transitions to the execution step "No. 02", and if the worker inputs "no", it transitions to the execution step "No. 91".

The item "output message" is the message that the voice system outputs by voice to the worker. In the example of FIG. 4, the execution step "No. 01" causes the voice system to output the message "Do you want to start work?" to the worker.

The item "processing category" gives the category of the execution step: "input/output", "processing", "branch", or "end". "Input/output" indicates an execution step that performs voice input or voice output. "Processing" indicates an execution step that performs a database operation. "Branch" indicates an execution step that decides a branch based on a processing result. "End" indicates an execution step that ends the voice scenario 26.

In the example of FIG. 4, the execution step "No. 01" performs voice output from the voice system to the worker and, in response to that output, voice input from the worker to the voice system.

The execution step "No. 03" executes the database operation "ZaikoSearch.sql". The execution step "No. 04" transitions to the execution step "No. 05" if the result of the execution step "No. 03" shows that the address exists, and to the execution step "No. 02" if the address does not exist. The execution step "No. 93" ends the voice scenario 26 being executed.

The item "input storage variable" is the name of the variable that stores the voice word input by voice from the worker. In the example of FIG. 4, the voice word (input value) that the worker inputs in response to the output message of "No. 01" is stored in the input storage variable "Outou01".

The item "branch No. 1 to n" gives the branch conditions and branch destinations of the voice scenario. In the example of FIG. 4, the execution step "No. 01" transitions to the execution step "No. 02" or "No. 91" depending on the content of the worker's voice input. The execution step "No. 04" transitions to the execution step "No. 05" or "No. 02" depending on the result of the execution step "No. 03".

The item "dictionary name" is the file name of the voice grammar 27 used to convert the voice word input by the worker into text. In the example of FIG. 4, the execution step "No. 01" recognizes the content of the voice word input by the worker using the voice grammar 27 named "common word dictionary".

The item "data processing name" is the name of the data processing to execute when data must be searched or updated. In the example of FIG. 4, the execution step "No. 03" executes the database operation whose data processing name is "ZaikoSearch.sql".

The item "instruction parameter" is the parameter passed to a database operation when it is executed. In the example of FIG. 4, the execution step "No. 03" passes the value stored in the input storage variable "Outou02" as a parameter to the database operation "ZaikoSearch.sql".

The item "return value" is the name of the input storage variable that stores the result of the database operation. In the example of FIG. 4, the execution step "No. 03" stores the result of the database operation "ZaikoSearch.sql" in the input storage variable "kekka03".
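The items of the scenario master described above can be pictured as one record type per execution step. The following is a minimal sketch under stated assumptions: the class name `ScenarioStep`, the field names, the helper `next_step`, and the branch condition keys ("yes", "no", "found", "not_found") are hypothetical labels chosen for illustration, while the values for steps 01, 03, and 04 are those given in the description of FIG. 4.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class ScenarioStep:
    """One record (execution step) of the scenario master."""
    no: str                                      # item "No."
    category: str                                # item "processing category"
    output_message: str = ""                     # item "output message"
    input_variable: Optional[str] = None         # item "input storage variable"
    branches: Dict[str, str] = field(default_factory=dict)  # condition -> next "No."
    dictionary_name: Optional[str] = None        # item "dictionary name"
    data_process_name: Optional[str] = None      # item "data processing name"
    instruction_parameter: Optional[str] = None  # item "instruction parameter"
    return_variable: Optional[str] = None        # item "return value"


# Values follow the description of FIG. 4; the condition keys are hypothetical.
STEP_01 = ScenarioStep(
    no="01", category="input/output",
    output_message="Do you want to start work?",
    input_variable="Outou01",
    branches={"yes": "02", "no": "91"},
    dictionary_name="common word dictionary",
)
STEP_03 = ScenarioStep(
    no="03", category="processing",
    data_process_name="ZaikoSearch.sql",
    instruction_parameter="Outou02",
    return_variable="kekka03",
)
STEP_04 = ScenarioStep(
    no="04", category="branch",
    branches={"found": "05", "not_found": "02"},
)


def next_step(step: ScenarioStep, outcome: str) -> Optional[str]:
    """Return the "No." to transition to, or None if no branch matches."""
    return step.branches.get(outcome)
```

For example, `next_step(STEP_01, "no")` yields `"91"`, matching the transition described for a worker who answers "no" at step 01.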

(Scenario automatic generation processing unit 23)
FIG. 5 is a flowchart of an example of the processing procedure of the scenario automatic generation processing unit. When, for example, a request to generate the inventory confirmation scenario, as an example of a voice scenario 26, is received from an administrator operating the voice scenario generation device 1, the process proceeds to step S1, and the scenario automatic generation processing unit 23 acquires from the scenario master 21 the first record of the table of FIG. 4, which corresponds to the inventory confirmation scenario. The first record of the table of FIG. 4 corresponds to the first execution step of the inventory confirmation scenario.

Proceeding to step S2, the scenario automatic generation processing unit 23 checks the item "processing category" of the record acquired in step S1. Proceeding to step S3, the unit determines whether the item "processing category" checked in step S2 is "input/output". If it is, the scenario automatic generation processing unit 23 proceeds to step S4 and performs the input/output scenario description process described later. In the input/output scenario description process, the scenario file corresponding to the first execution step is written in VoiceXML according to the acquired record. In the table of FIG. 4, the item "processing category" of the record acquired in step S1 is "input/output", so the process proceeds to step S4, performs the input/output scenario description process, and then proceeds to step S9.

In step S9, the scenario automatic generation processing unit 23 acquires from the scenario master 21 the next record of the table of FIG. 4 corresponding to the inventory confirmation scenario, and then returns to step S2. In the table of FIG. 4, the item "processing category" of the second record, acquired in step S9, is "input/output", so the process proceeds to step S4, performs the input/output scenario description process, and then proceeds to step S9.

In step S9, the scenario automatic generation processing unit 23 acquires from the scenario master 21 the next record of the table of FIG. 4 corresponding to the inventory confirmation scenario, and then returns to step S2. In the table of FIG. 4, the item "processing category" of the third record, acquired in step S9, is "processing".

Because the item "processing category" checked in step S2 is not "input/output", the scenario automatic generation processing unit 23 proceeds to step S5 and determines whether the item "processing category" checked in step S2 is "processing".

If the item "processing category" checked in step S2 is "processing", the scenario automatic generation processing unit 23 proceeds to step S6 and performs the processing scenario description process described later. In the processing scenario description process, the scenario file corresponding to the third execution step is written in VoiceXML according to the acquired record. In the table of FIG. 4, the item "processing category" of the record acquired in step S9 is "processing", so the process proceeds to step S6, performs the processing scenario description process, and then proceeds to step S9.

In step S9, the scenario automatic generation processing unit 23 acquires from the scenario master 21 the next record of the table of FIG. 4 corresponding to the inventory confirmation scenario, and then returns to step S2. In the table of FIG. 4, the item "processing category" of the fourth record, acquired in step S9, is "branch".

Because the item "processing category" checked in step S2 is not "input/output", the scenario automatic generation processing unit 23 proceeds to step S5. Because the item is not "processing" either, the unit proceeds to step S7 and determines whether the item "processing category" checked in step S2 is "branch".

If the item "processing category" checked in step S2 is "branch", the scenario automatic generation processing unit 23 proceeds to step S8 and performs the branch scenario description process described later. In the branch scenario description process, the scenario file corresponding to the fourth execution step is written in VoiceXML according to the acquired record. In the table of FIG. 4, the item "processing category" of the record acquired in step S9 is "branch", so the process proceeds to step S8, performs the branch scenario description process, and then proceeds to step S9.

In step S9, the scenario automatic generation processing unit 23 acquires from the scenario master 21 the next record of the table of FIG. 4 corresponding to the inventory confirmation scenario, and then returns to step S2. In the table of FIG. 4, the item "processing category" of the fifth, sixth, and seventh records acquired in step S9 is "input/output". The processing performed when the item "processing category" of the record acquired in step S9 is "input/output" has already been described, so its description is omitted here.

ステップＳ９では、シナリオ自動生成処理部２３が、シナリオマスタ２１から在庫確認シナリオに該当する図４のテーブルの８番目のレコードを取得したあと、ステップＳ２の処理に戻る。図４のテーブルでは、ステップＳ９で取得した８番目のレコードの項目「処理区分」が「終了」である。 In step S9, the scenario automatic generation processing unit 23 acquires the eighth record in the table of FIG. 4 corresponding to the inventory confirmation scenario from the scenario master 21, and then returns to step S2. In the table of FIG. 4, the item “processing category” of the eighth record acquired in step S9 is “end”.

ステップＳ２で確認した項目「処理区分」が「入出力」でない為、シナリオ自動生成処理部２３はステップＳ５に進む。また、ステップＳ２で確認した項目「処理区分」が「処理」でない為、シナリオ自動生成処理部２３はステップＳ７に進む。ステップＳ２で確認した項目「処理区分」が「分岐」でない為、シナリオ自動生成処理部２３はステップＳ２で確認した項目「処理区分」が「終了」であると判定する。 Since the item “processing category” confirmed in step S2 is not “input/output”, the scenario automatic generation processing unit 23 proceeds to step S5. Further, since the item “processing category” confirmed in step S2 is not “processing”, the scenario automatic generation processing unit 23 proceeds to step S7. Since the item “processing category” confirmed in step S2 is not “branch” either, the scenario automatic generation processing unit 23 determines that the item “processing category” confirmed in step S2 is “end”.

ステップＳ１０に進み、シナリオ自動生成処理部２３は、後述の終了シナリオ記述処理を行う。なお、終了シナリオ記述処理では、取得したレコードに応じて、８番目の実行ステップに対応するシナリオファイルを生成し、在庫確認シナリオの生成を終了するようにＶｏｉｃｅＸＭＬで記述する。 In step S10, the automatic scenario generation processing unit 23 performs an end scenario description process described later. In the end scenario description process, a scenario file corresponding to the eighth execution step is generated according to the acquired record, and described in VoiceXML so as to end the generation of the inventory confirmation scenario.
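
The record-by-record dispatch described above can be illustrated with a minimal Python sketch. It is not the implementation of the scenario automatic generation processing unit 23: the handler names, the dictionary keys `no` and `category`, and the returned labels are all hypothetical, and each handler merely stands in for the corresponding description process that would emit VoiceXML.

```python
from typing import Callable, Dict, List

# Hypothetical stand-ins for the four description processes.
# Each returns a label where the real process would write a VoiceXML file.
def describe_io(record: dict) -> str:
    return f"io-{record['no']}"

def describe_process(record: dict) -> str:
    return f"process-{record['no']}"

def describe_branch(record: dict) -> str:       # corresponds to step S8
    return f"branch-{record['no']}"

def describe_end(record: dict) -> str:          # corresponds to step S10
    return f"end-{record['no']}"

HANDLERS: Dict[str, Callable[[dict], str]] = {
    "input/output": describe_io,
    "processing": describe_process,
    "branch": describe_branch,
    "end": describe_end,
}

def generate_scenario(records: List[dict]) -> List[str]:
    """Produce one scenario file (here only a label) per execution step."""
    files = []
    for record in records:                      # S9: take the next record
        category = record["category"]           # S2: check the processing category
        files.append(HANDLERS[category](record))
        if category == "end":                   # generation stops after the end step
            break
    return files
```

A table lookup replaces the chain of comparisons in steps S2, S5, and S7; the outcome is the same because the four categories are mutually exclusive.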

（入出力シナリオ記述処理） (Input/output scenario description process)
図６は入出力シナリオ記述処理の一例のフローチャートである。シナリオ自動生成処理部２３はステップＳ２１に進み、取得したレコードの項目「出力メッセージ」に登録されているメッセージを音声出力ワードとしてＶｏｉｃｅＸＭＬで定義する。なお、音声出力ワードは音声システムから作業者へ音声出力されるメッセージである。 FIG. 6 is a flowchart of an example of the input/output scenario description process. The scenario automatic generation processing unit 23 proceeds to step S21 and defines, in VoiceXML, the message registered in the item “output message” of the acquired record as a voice output word. The voice output word is a message output by voice from the voice system to the worker.

ステップＳ２２に進み、シナリオ自動生成処理部２３は、作業者から音声入力があった場合、発話された音声ワードを、取得したレコードの項目「入力格納変数」に登録されている名称の入力格納変数へセットするようＶｏｉｃｅＸＭＬで定義する。 Proceeding to step S22, the scenario automatic generation processing unit 23 defines in VoiceXML that, when there is a voice input from the worker, the spoken voice word is set in the input storage variable whose name is registered in the item “input storage variable” of the acquired record.

ステップＳ２３に進み、シナリオ自動生成処理部２３は入力格納変数へセットされた音声ワードを、取得したレコードの項目「辞書名」で表される音声グラマー２７を使用してテキスト変換するようＶｏｉｃｅＸＭＬで定義する。 Proceeding to step S23, the scenario automatic generation processing unit 23 defines in VoiceXML that the voice word set in the input storage variable is converted to text using the voice grammar 27 indicated by the item “dictionary name” of the acquired record.

ステップＳ２４に進み、シナリオ自動生成処理部２３はテキスト変換された音声入力の内容によって、取得したレコードの項目「分岐Ｎｏ．１〜ｎ」で表される音声シナリオの分岐条件及び分岐先からシナリオ遷移先を決定するようＶｏｉｃｅＸＭＬで定義する。 Proceeding to step S24, the scenario automatic generation processing unit 23 defines in VoiceXML that the scenario transition destination is determined, according to the content of the text-converted voice input, from the branch conditions and branch destinations of the voice scenario indicated by the items “branch No. 1 to n” of the acquired record.

例えば図４に示す「Ｎｏ．０１」の実行ステップでは、分岐条件「はい」に対する分岐先として「Ｎｏ．０２」の実行ステップが登録され、分岐条件「いいえ」に対する分岐先として「Ｎｏ．９１」の実行ステップが登録されている。したがって、シナリオ自動生成処理部２３は、テキスト変換された音声入力の内容が「はい」であれば「Ｎｏ．０２」の実行ステップをシナリオ遷移先として決定する。なお、シナリオ自動生成処理部２３は入力格納変数へセットされた音声ワードをテキスト変換できなかった場合の分岐条件及び分岐先も登録する。 For example, in the execution step “No. 01” shown in FIG. 4, the execution step “No. 02” is registered as the branch destination for the branch condition “Yes”, and the execution step “No. 91” is registered as the branch destination for the branch condition “No”. Accordingly, if the content of the text-converted voice input is “Yes”, the scenario automatic generation processing unit 23 determines the execution step “No. 02” as the scenario transition destination. The scenario automatic generation processing unit 23 also registers a branch condition and a branch destination for the case where the voice word set in the input storage variable could not be converted to text.
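
As a rough illustration of steps S21 to S24, the following Python sketch emits a small VoiceXML document for one input/output execution step. It is a sketch under stated assumptions, not the markup shown in FIGS. 19 to 27: the record keys (`output_message`, `input_variable`, `dictionary`, `branches`, `nomatch_target`), the file names, and the choice of `<field>`, `<filled>`, `<if>`, `<goto>`, and `<nomatch>` to express the steps are all hypothetical.

```python
from xml.sax.saxutils import escape, quoteattr

def describe_io_step(record: dict) -> str:
    """Return VoiceXML text for one input/output execution step (illustrative)."""
    var = record["input_variable"]
    out = [
        '<vxml version="2.0" xmlns="http://www.w3.org/2001/vxml">',
        "  <form>",
        # S22: the recognized word is stored in the named input storage variable.
        f"    <field name={quoteattr(var)}>",
        # S21: the output message becomes the prompt spoken to the worker.
        f"      <prompt>{escape(record['output_message'])}</prompt>",
        # S23: the voice grammar named by the record is used for recognition.
        f"      <grammar src={quoteattr(record['dictionary'])}/>",
        # Branch taken when the utterance could not be converted to text.
        "      <nomatch>",
        f"        <goto next={quoteattr(record['nomatch_target'])}/>",
        "      </nomatch>",
        "      <filled>",
    ]
    # S24: one conditional transition per registered branch condition.
    for condition, target in record["branches"]:
        cond = quoteattr(f"{var} == '{condition}'")
        out.append(f"        <if cond={cond}><goto next={quoteattr(target)}/></if>")
    out += ["      </filled>", "    </field>", "  </form>", "</vxml>"]
    return "\n".join(out)
```

For the “No. 01” step above, `branches` would hold the pairs for “はい” and “いいえ”, each pointing at the file generated for execution step No. 02 or No. 91. The sketch assumes branch conditions contain no quotation marks.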

（処理シナリオ記述処理） (Processing scenario description process)
図７は処理シナリオ記述処理の一例のフローチャートである。シナリオ自動生成処理部２３はステップＳ３１に進み、取得したレコードの項目「データ処理名」にセットされたデータベース操作処理へ、取得したレコードの項目「指示パラメータ」にセットされたパラメータを引き渡して実行するようＶｏｉｃｅＸＭＬで定義する。 FIG. 7 is a flowchart of an example of the processing scenario description process. The scenario automatic generation processing unit 23 proceeds to step S31 and defines in VoiceXML that the parameters set in the item “instruction parameter” of the acquired record are passed to the database operation process set in the item “data processing name” of the acquired record, and that the process is executed.

また、ステップＳ３２に進み、シナリオ自動生成処理部２３はデータベース操作処理の結果を、取得したレコードの項目「戻り値」にセットされた入力格納変数へセットするようＶｏｉｃｅＸＭＬで定義する。 Proceeding to step S32, the scenario automatic generation processing unit 23 defines in VoiceXML that the result of the database operation process is set in the input storage variable specified in the item “return value” of the acquired record.

（分岐シナリオ記述処理） (Branch scenario description process)
図８は分岐シナリオ記述処理の一例のフローチャートである。シナリオ自動生成処理部２３はステップＳ３５に進み、取得したレコードの項目「分岐Ｎｏ．１〜ｎ」にセットされた分岐条件及び分岐先と、取得したレコードの項目「戻り値」の入力格納変数にセットされたデータベース操作処理の結果とに応じて、シナリオ遷移先を決定するようＶｏｉｃｅＸＭＬで定義する。 FIG. 8 is a flowchart of an example of the branch scenario description process. The scenario automatic generation processing unit 23 proceeds to step S35 and defines in VoiceXML that the scenario transition destination is determined according to the branch conditions and branch destinations set in the items “branch No. 1 to n” of the acquired record and the result of the database operation process set in the input storage variable specified in the item “return value” of the acquired record.

（終了シナリオ記述処理） (End scenario description process)
図９は終了シナリオ記述処理の一例のフローチャートである。シナリオ自動生成処理部２３はステップＳ４１に進み、実行中の音声シナリオ２６を終了するようＶｏｉｃｅＸＭＬで定義する。 FIG. 9 is a flowchart of an example of the end scenario description process. The scenario automatic generation processing unit 23 proceeds to step S41 and defines in VoiceXML that the voice scenario 26 being executed is terminated.

（グラマー自動生成処理部２４） (Grammar automatic generation processing unit 24)
ここでは、グラマー自動生成処理部２４の処理の一例として、操作者が在庫アドレスを音声入力する際に使用する音声グラマー２７の生成処理について説明する。グラマー条件指定受付部２５は、例えば音声シナリオ生成装置１を操作する管理者から音声グラマー２７の生成条件として、使用するテーブルのテーブル名，出力項目，検索項目及び出力する音声グラマー２７のファイル名の指定を受け付ける。使用するテーブルはグラマーマスタ２２に登録されているテーブルである。 Here, as an example of the processing of the grammar automatic generation processing unit 24, the process of generating the voice grammar 27 used when an operator inputs an inventory address by voice is described. The grammar condition designation receiving unit 25 accepts, for example from the administrator who operates the voice scenario generation device 1, the generation conditions of the voice grammar 27: the table name of the table to be used, the output item, the search item, and the file name of the voice grammar 27 to be output. The table to be used is a table registered in the grammar master 22.

操作者は図１０に示すような３桁コードマスタテーブルのテーブル名「３桁コードマスタ」，出力項目「読上げ３桁コード」，検索項目「検索３桁コード」を生成条件として指定したとする。図１０は３桁コードマスタテーブルの一例の構成図である。図１０に示す３桁コードマスタテーブルは、正式３桁コード，読上げ３桁コード，３つの検索３桁コードを項目として有する。 Assume that the operator has specified, as generation conditions, the table name “3-digit code master” of the 3-digit code master table shown in FIG. 10, the output item “read-out 3-digit code”, and the search item “search 3-digit code”. FIG. 10 is a configuration diagram of an example of the 3-digit code master table. The 3-digit code master table shown in FIG. 10 has, as items, an official 3-digit code, a read-out 3-digit code, and three search 3-digit codes.

そして、グラマー自動生成処理部２４はグラマー条件指定受付部２５が受け付けた音声グラマー２７の生成条件に従い、グラマーマスタ２２に登録されている図１０に示すような３桁コードマスタテーブルを読み出す。そして、グラマー自動生成処理部２４は読み出した３桁コードマスタテーブルから図１１に示すような手順で音声グラマー２７を自動生成する。 The grammar automatic generation processing unit 24 then reads the 3-digit code master table shown in FIG. 10, which is registered in the grammar master 22, in accordance with the generation conditions of the voice grammar 27 accepted by the grammar condition designation receiving unit 25. From the 3-digit code master table it has read, the grammar automatic generation processing unit 24 automatically generates the voice grammar 27 by the procedure shown in FIG. 11.

図１１は３桁コードマスタテーブルから音声グラマーを生成する手順を表したイメージ図である。グラマー自動生成処理部２４は３桁コードマスタテーブルからレコードを順次読み出す。グラマー自動生成処理部２４は読み出したレコードに基づき、項目「読上げ３桁コード」に対応する３つの「検索３桁コード」を取得する。 FIG. 11 is a conceptual diagram showing the procedure for generating a voice grammar from the 3-digit code master table. The grammar automatic generation processing unit 24 reads records sequentially from the 3-digit code master table. Based on each record it reads, the grammar automatic generation processing unit 24 acquires the three “search 3-digit codes” corresponding to the item “read-out 3-digit code”.

グラマー自動生成処理部２４は項目「読上げ３桁コード」を出力項目とし、項目「検索３桁コード」を検索項目とした上、出力項目と検索項目とを対応付ける３つのレコードを生成する。具体的に、グラマー自動生成処理部２４は項目「読上げ３桁コード」に対応する３つの「検索３桁コード」から、出力項目「イチゼロイチ」及び検索項目「イチゼロイチ」を対応付けるレコード、出力項目「イチゼロイチ」及び検索項目「ワンゼロワン」を対応付けるレコード、出力項目「イチゼロイチ」及び検索項目「イチレイイチ」を対応付けるレコードを生成する。 The grammar automatic generation processing unit 24 takes the item “read-out 3-digit code” as the output item and the item “search 3-digit code” as the search item, and generates three records that associate the output item with a search item. Specifically, from the three “search 3-digit codes” corresponding to the item “read-out 3-digit code”, the grammar automatic generation processing unit 24 generates a record associating the output item “イチゼロイチ” (ichi-zero-ichi) with the search item “イチゼロイチ”, a record associating the output item “イチゼロイチ” with the search item “ワンゼロワン” (wan-zero-wan), and a record associating the output item “イチゼロイチ” with the search item “イチレイイチ” (ichi-rei-ichi).

結果として、グラマー自動生成処理部２４は図１１に示すような音声グラマー２７を自動生成できる。なお、音声グラマー２７における検索項目は音声入力された音声ワードを検索キーとして音声グラマー２７を検索するときに利用される。検索キーに一致する検索項目が存在すれば、音声システムは音声入力された音声ワードを、検索キーに一致した検索項目に対応する出力項目（変換語）に変換できる。 As a result, the grammar automatic generation processing unit 24 can automatically generate an audio grammar 27 as shown in FIG. Note that the search item in the audio grammar 27 is used when searching the audio grammar 27 using the input voice word as a search key. If there is a search item that matches the search key, the voice system can convert the voice word input by voice into an output item (converted word) corresponding to the search item that matches the search key.
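
The expansion of one master-table record into several grammar records, and the later lookup by search key, can be sketched in Python as follows. This is a minimal illustration: the function names and the column names `readout`, `search_1`, `search_2`, `search_3` are hypothetical, and a real voice grammar 27 would be written to a file in the format the voice system expects.

```python
from typing import Dict, List, Optional, Tuple

def build_grammar_records(
    rows: List[Dict[str, str]], output_item: str, search_items: List[str]
) -> List[Tuple[str, str]]:
    """Expand each master-table row into (output item, search item) records."""
    records = []
    for row in rows:                       # records are read sequentially
        for item in search_items:          # one grammar record per search code
            records.append((row[output_item], row[item]))
    return records

def convert(records: List[Tuple[str, str]], spoken: str) -> Optional[str]:
    """Return the output item whose search item matches the spoken word."""
    for output, search in records:
        if search == spoken:
            return output
    return None                            # no matching search item

# One row modelled on the example in the text (column names are assumed).
rows = [{
    "readout": "イチゼロイチ",
    "search_1": "イチゼロイチ",
    "search_2": "ワンゼロワン",
    "search_3": "イチレイイチ",
}]
grammar = build_grammar_records(rows, "readout", ["search_1", "search_2", "search_3"])
```

With this data, each of the three spoken variants resolves to the single read-out form “イチゼロイチ”, which is the conversion behaviour described above.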

なお、グラマー自動生成処理部２４は、グラマーマスタ２２に登録されている複数種類のマスタテーブルを組み合わせて、１つの音声グラマー２７を生成してもよい。例えばグラマー自動生成処理部２４は苗字マスタテーブル及び名前マスタテーブルがグラマーマスタ２２に登録されている場合、苗字マスタテーブル及び名前マスタテーブルから苗字名前に対応する（フルネームに対応する）音声グラマー２７を生成してもよい。 Note that the grammar automatic generation processing unit 24 may generate a single voice grammar 27 by combining several kinds of master tables registered in the grammar master 22. For example, when a surname master table and a given-name master table are registered in the grammar master 22, the grammar automatic generation processing unit 24 may generate, from those two tables, a voice grammar 27 for surname plus given name (that is, for full names).

（シナリオマスタ２１及び音声シナリオ２６の具体例） (Specific examples of the scenario master 21 and the voice scenario 26)
図１２〜図１７はシナリオマスタの具体例を表した構成図である。シナリオ自動生成処理部２３は図１２〜図１７に示すシナリオマスタ２１から音声シナリオの実行ステップを表すレコードを順次読み出し、読み出したレコード毎にシナリオファイルを生成し、１つ以上のシナリオファイルにより音声シナリオ２６を生成する。なお、図１２〜図１７に示したシナリオマスタ２１は項目が多いため、図１２〜図１７に分割して記載している。 FIGS. 12 to 17 are configuration diagrams showing a specific example of the scenario master. The scenario automatic generation processing unit 23 sequentially reads records representing the execution steps of a voice scenario from the scenario master 21 shown in FIGS. 12 to 17, generates a scenario file for each record it reads, and forms the voice scenario 26 from the one or more scenario files. Because the scenario master 21 has many items, it is shown divided across FIGS. 12 to 17.

図１８は、生成されたシナリオファイルのファイル名を表示した一例の画面イメージである。例えばシナリオ自動生成処理部２３は生成したシナリオファイルのファイル名を出力装置１２等に表示して管理者に確認させる。生成されたシナリオファイルの内容は例えば図１９〜図２７のようになる。図１９〜図２７のシナリオファイルはＶｏｉｃｅＸＭＬで記述されている。 FIG. 18 is an example screen image displaying the file names of the generated scenario files. For example, the scenario automatic generation processing unit 23 displays the file names of the generated scenario files on the output device 12 or the like so that the administrator can check them. The contents of the generated scenario files are, for example, as shown in FIGS. 19 to 27. The scenario files in FIGS. 19 to 27 are written in VoiceXML.

（音声シナリオ設定装置１００） (Voice scenario setting device 100)
音声シナリオ設定装置１００のハードウェア構成図は図１に示した音声シナリオ生成装置１のハードウェア構成図と同様である。音声シナリオ設定装置１００は、スタンドアローンの形態でも良いし、インターネットやＬＡＮなどのネットワーク経由でユーザ端末にデータ通信可能に接続された形態でもよい。 The hardware configuration of the voice scenario setting device 100 is the same as that of the voice scenario generation device 1 shown in FIG. 1. The voice scenario setting device 100 may be stand-alone, or may be connected to user terminals via a network such as the Internet or a LAN so that data communication is possible.

本実施例の音声シナリオ設定プログラムは、音声シナリオ設定装置１００を制御する各種プログラムの少なくとも一部である。音声シナリオ設定プログラムは例えば記録媒体１８の配布やネットワークからのダウンロードなどによって提供される。 The voice scenario setting program of this embodiment is at least a part of various programs for controlling the voice scenario setting device 100. The voice scenario setting program is provided by, for example, distribution of the recording medium 18 or downloading from a network.

音声シナリオ設定プログラムを記録した記録媒体１８は、ＣＤ−ＲＯＭ、フレキシブルディスク、光磁気ディスク等の様に情報を光学的，電気的或いは磁気的に記録する記録媒体、ＲＯＭ、フラッシュメモリ等の様に情報を電気的に記録する半導体メモリ等、様々なタイプの記録媒体を用いることができる。 Various types of recording media can be used as the recording medium 18 on which the voice scenario setting program is recorded, such as a recording medium that records information optically, electrically, or magnetically (a CD-ROM, a flexible disk, a magneto-optical disk, and the like) or a semiconductor memory that records information electrically (a ROM, a flash memory, and the like).

音声シナリオ設定プログラムを記録した記録媒体１８がドライブ装置１３にセットされると、音声シナリオ設定プログラムは記録媒体１８からドライブ装置１３を介して補助記憶装置１４にインストールされる。なお、ネットワークからダウンロードされた音声シナリオ設定プログラムは、インターフェース装置１７を介して、補助記憶装置１４にインストールされる。補助記憶装置１４はインストールされた音声シナリオ設定プログラムを格納すると共に、必要なファイル，データ等を格納する。 When the recording medium 18 on which the voice scenario setting program is recorded is set in the drive device 13, the voice scenario setting program is installed from the recording medium 18 into the auxiliary storage device 14 via the drive device 13. A voice scenario setting program downloaded from a network is installed in the auxiliary storage device 14 via the interface device 17. The auxiliary storage device 14 stores the installed voice scenario setting program as well as necessary files, data, and the like.

主記憶装置１５は、音声シナリオ設定プログラムの起動時に補助記憶装置１４から音声シナリオ設定プログラムを読み出して格納する。そして、演算処理装置１６は主記憶装置１５に格納された音声シナリオ設定プログラムに従って、後述するような各種処理を実現している。 The main storage device 15 reads and stores the voice scenario setting program from the auxiliary storage device 14 when the voice scenario setting program is activated. The arithmetic processing unit 16 implements various processes as described later in accordance with the voice scenario setting program stored in the main storage unit 15.

図２８は音声シナリオ設定装置の一例のブロック図である。音声シナリオ設定装置１００は、音声入力ＤＢ１０１，音声入力実績収集処理部１０２，音声入力実績ログＤＢ１０３，実績評価ＤＢ１０４，習熟レベル選択処理部１０５，習熟レベル変換テーブル１０６及び音声シナリオ設定テーブル１０７を有する。なお、図２８では音声シナリオ設定装置１００内に音声入力ＤＢ１０１，実績評価ＤＢ１０４，習熟レベル変換テーブル１０６及び音声シナリオ設定テーブル１０７を有しているが、音声シナリオ設定装置１００外（例えばＤＢサーバなど）に設けてもよい。 FIG. 28 is a block diagram of an example of the voice scenario setting device. The voice scenario setting device 100 includes a voice input DB 101, a voice input record collection processing unit 102, a voice input record log DB 103, a performance evaluation DB 104, a proficiency level selection processing unit 105, a proficiency level conversion table 106, and a voice scenario setting table 107. In FIG. 28, the voice input DB 101, the performance evaluation DB 104, the proficiency level conversion table 106, and the voice scenario setting table 107 are inside the voice scenario setting device 100, but they may instead be provided outside it (for example, on a DB server).

音声入力ＤＢ１０１は、作業者から音声システムへ音声入力された音声入力データを作業者別に格納している。音声入力実績収集処理部１０２は、音声入力ＤＢ１０１に格納されている作業者別の音声入力データから、作業者別の音声入力実績をログ情報として収集して音声入力実績ログＤＢ１０３に格納する。 The voice input DB 101 stores voice input data voice-input from the worker to the voice system for each worker. The voice input record collection processing unit 102 collects the voice input record for each worker as log information from the voice input data for each worker stored in the voice input DB 101 and stores it in the voice input record log DB 103.

音声入力実績には、音声認識率，音声適正認識回数，音声読上げ速度，再確認回数，応答時間が含まれる。音声認識率は、作業者の音声入力に対する認識成功率（例えば音声シナリオ２６で想定されている音声ワードに認識できた回数／作業者の音声入力の回数）である。 The voice input results include the voice recognition rate, the number of times of proper voice recognition, the voice reading speed, the number of reconfirmations, and the response time. The voice recognition rate is a recognition success rate (for example, the number of times that a voice word assumed in the voice scenario 26 can be recognized / the number of times of voice input by the worker) for the worker's voice input.

音声適正認識回数は、推奨される音声ワードで認識された回数である。推奨される音声ワードは例えば音声グラマー２７の出力項目に格納されている音声ワードである。具体的には、例えば音声ワード「トウキョウ」が推奨される音声ワードであると判定され、音声ワード「トオキョオ」が推奨される音声ワードであると判定されない。 The number of proper voice recognitions is the number of times an input was recognized as a recommended voice word. A recommended voice word is, for example, a voice word stored in the output item of the voice grammar 27. Specifically, for example, the voice word “トウキョウ” (toukyou) is determined to be a recommended voice word, whereas the voice word “トオキョオ” (tookyoo) is not.

また、音声読上げ速度は作業者が音声システムの使用時に設定している音声出力の速度である。再確認回数は作業者が音声出力を再確認する為の「ヘルプ」や「もう一度」を音声入力した回数である。応答時間は音声出力に対する音声入力までの経過時間である。 The voice reading speed is the voice output speed set by the operator when using the voice system. The number of reconfirmations is the number of times that the operator inputs “help” and “again” for reconfirming the voice output. The response time is the elapsed time until the voice input with respect to the voice output.

実績評価ＤＢ１０４は、習熟レベル選択処理部１０５が習熟レベル選択処理に利用する実績評価テーブルを格納している。実績評価テーブルは、図２９に示すような音声発話能力評価テーブル，図３０に示すような音声ヒアリング能力評価テーブル，図３１に示すような音声応答時間評価テーブルを格納している。 The performance evaluation DB 104 stores a performance evaluation table used by the skill level selection processing unit 105 for the skill level selection process. The performance evaluation table stores a voice utterance ability evaluation table as shown in FIG. 29, a voice hearing ability evaluation table as shown in FIG. 30, and a voice response time evaluation table as shown in FIG.

図２９は音声発話能力評価テーブルの一例の構成図である。音声発話能力評価テーブルは音声認識率と音声適正認識回数とのマトリックスを作成し、それぞれに該当する評価点を登録したものである。 FIG. 29 is a configuration diagram of an example of a speech utterance ability evaluation table. The speech utterance ability evaluation table is a table in which a matrix of speech recognition rates and appropriate speech recognition times is created, and the corresponding evaluation points are registered.

図３０は音声ヒアリング能力評価テーブルの一例の構成図である。音声ヒアリング能力評価テーブルは音声読上げ速度と再確認回数とのマトリックスを作成し、それぞれに該当する評価点を登録したものである。図３１は音声応答時間評価テーブルの一例の構成図である。音声応答時間評価テーブルは応答時間に対応する係数を登録したものである。 FIG. 30 is a configuration diagram of an example of a voice hearing ability evaluation table. The voice hearing ability evaluation table is a table in which a matrix of the voice reading speed and the number of reconfirmations is created, and the corresponding evaluation points are registered. FIG. 31 is a configuration diagram of an example of a voice response time evaluation table. The voice response time evaluation table is a table in which coefficients corresponding to response times are registered.

習熟レベル選択処理部１０５は、音声入力実績ログＤＢ１０３に格納されている作業者別の音声入力実績と実績評価ＤＢ１０４に格納されている実績評価テーブルとに基づいて作業者毎に評価点を算出する。評価点の算出には、以下の式（１）を利用する。 The proficiency level selection processing unit 105 calculates an evaluation score for each worker based on the per-worker voice input records stored in the voice input record log DB 103 and the performance evaluation tables stored in the performance evaluation DB 104. The following equation (1) is used to calculate the evaluation score.

評価点＝（音声発話能力の評価点＋音声ヒアリング能力の評価点）×応答時間に対応する係数・・・（１） Evaluation point = (Evaluation point of voice utterance ability + Evaluation point of voice hearing ability) × Coefficient corresponding to response time (1)

習熟レベル変換テーブル１０６は評価点と習熟レベルとを対応付けている。習熟レベル選択処理部１０５は習熟レベル変換テーブル１０６を利用して、評価点を習熟レベルに変換する。そして、習熟レベル選択処理部１０５は習熟レベルと作業者とを対応付けて音声シナリオ設定テーブル１０７に格納する。習熟レベルは作業者と音声シナリオ２６とを対応付けるものである。例えば習熟レベルを初級，中級，上級に分けた場合、習熟レベルは作業者と、初級，中級，上級の音声シナリオ２６の何れかと、を対応付ける。例えば音声シナリオ設定テーブル１０７は、音声システムで行うシナリオ制御処理において、作業者の音声シナリオ２６を選択する為に利用される。 The proficiency level conversion table 106 associates evaluation points with proficiency levels. The proficiency level selection processing unit 105 uses the proficiency level conversion table 106 to convert the evaluation points into proficiency levels. Then, the proficiency level selection processing unit 105 stores the proficiency level and the worker in the voice scenario setting table 107 in association with each other. The proficiency level associates the worker with the voice scenario 26. For example, when the proficiency level is divided into a beginner level, an intermediate level, and an advanced level, the proficiency level associates an operator with one of the beginner level, intermediate level, and advanced voice scenarios 26. For example, the voice scenario setting table 107 is used to select the worker's voice scenario 26 in the scenario control process performed in the voice system.

次に、習熟レベル選択処理部１０５の処理の具体例について説明する。ここでは、図３２に示すような初級，中級，上級の音声シナリオ２６を準備しているものとする。図３２は初級，中級，上級の音声シナリオの一例のイメージ図である。 Next, a specific example of processing of the proficiency level selection processing unit 105 will be described. Here, it is assumed that the beginner, intermediate, and advanced voice scenarios 26 as shown in FIG. 32 are prepared. FIG. 32 is an image diagram of an example of a beginner, intermediate, and advanced voice scenario.

習熟レベル選択処理部１０５は、音声入力実績ログＤＢ１０３から例えば１ヶ月の作業者Ａの音声入力実績として、音声認識率：８３％，音声適正認識回数：１２００回，音声読上げ速度：レベル８，再確認回数：３０回，応答時間：０．４を取得する。 The proficiency level selection processing unit 105 acquires from the voice input record log DB 103, as worker A's voice input record for, say, one month, the following: voice recognition rate: 83%, number of proper voice recognitions: 1200, voice reading speed: level 8, number of reconfirmations: 30, and response time: 0.4.

習熟レベル選択処理部１０５は取得した音声認識率：８３％，音声適正認識回数：１２００回と図２９の音声発話能力評価テーブルとに基づき、音声発話能力の評価点「２」を得る。また、習熟レベル選択処理部１０５は取得した音声読上げ速度：レベル８，再確認回数：３０回と図３０の音声ヒアリング能力評価テーブルとに基づき、音声ヒアリング能力の評価点「３」を得る。 The proficiency level selection processing unit 105 obtains the evaluation point “2” of the speech utterance ability based on the acquired speech recognition rate: 83%, the appropriate speech recognition count: 1200 times and the speech utterance ability evaluation table of FIG. Further, the proficiency level selection processing unit 105 obtains an evaluation point “3” of the voice hearing ability based on the acquired voice reading speed: level 8, the number of reconfirmations: 30 times and the voice hearing ability evaluation table of FIG.

さらに、習熟レベル選択処理部１０５は取得した応答時間：０．４と図３１の音声応答時間評価テーブルとに基づき、応答時間に対応する係数「１．５」を得る。習熟レベル選択処理部１０５は式（１）を利用して、音声発話能力の評価点「２」，音声ヒアリング能力の評価点「３」及び応答時間に対応する係数「１．５」から、評価点「７．５」を算出する。 Further, the proficiency level selection processing unit 105 obtains a coefficient “1.5” corresponding to the response time based on the acquired response time: 0.4 and the voice response time evaluation table of FIG. The proficiency level selection processing unit 105 evaluates from the evaluation point “2” of the speech utterance ability, the evaluation point “3” of the speech hearing ability, and the coefficient “1.5” corresponding to the response time using the expression (1). The point “7.5” is calculated.
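
The arithmetic of equation (1) for worker A can be checked with a few lines of Python. The function and parameter names are hypothetical; the three input values are the ones obtained above from the tables of FIGS. 29 to 31, whose contents are not reproduced here.

```python
def evaluation_score(
    speech_points: float, hearing_points: float, response_coefficient: float
) -> float:
    """Equation (1): (utterance points + hearing points) x response-time coefficient."""
    return (speech_points + hearing_points) * response_coefficient

# Worker A: utterance ability 2, hearing ability 3, response-time coefficient 1.5.
score_a = evaluation_score(2, 3, 1.5)   # (2 + 3) * 1.5 = 7.5
```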

習熟レベル変換テーブル１０６は評価点「３未満」を習熟レベル「初級」、評価点「４以上〜６未満」を習熟レベル「中級」、評価点「７以上」を習熟レベル「上級」に対応付けているものとする。習熟レベル選択処理部１０５は習熟レベル変換テーブル１０６及び評価点「７．５」から、作業者Ａの使用する音声シナリオ２６として上級の音声シナリオ２６を選択する。 Assume that the proficiency level conversion table 106 associates an evaluation score of “less than 3” with the proficiency level “beginner”, a score of “4 or more and less than 6” with “intermediate”, and a score of “7 or more” with “advanced”. From the proficiency level conversion table 106 and the evaluation score “7.5”, the proficiency level selection processing unit 105 selects the advanced voice scenario 26 as the voice scenario 26 to be used by worker A.

なお、中級及び上級の音声シナリオ２６を使用する作業者については、一度、習熟レベルが上がると落ちにくい評価基準とするため、習熟レベル変換テーブル１０６の内容を習熟レベルに応じて異ならせてもよい。例えば初級の音声シナリオ２６を使用する作業者の習熟レベル変換テーブル１０６は、評価点「３未満」を習熟レベル「初級」、評価点「４以上〜６未満」を習熟レベル「中級」、評価点「７以上」を習熟レベル「上級」に対応付ける。中級の音声シナリオ２６を使用する作業者の習熟レベル変換テーブル１０６は評価点「２未満」を習熟レベル「初級」、評価点「３以上〜６未満」を習熟レベル「中級」、評価点「７以上」を習熟レベル「上級」に対応付ける。上級の音声シナリオ２６を使用する作業者の習熟レベル変換テーブル１０６は評価点「２未満」を習熟レベル「初級」、評価点「３以上〜４未満」を習熟レベル「中級」、評価点「５以上」を習熟レベル「上級」に対応付ける。 For workers who use the intermediate or advanced voice scenario 26, the contents of the proficiency level conversion table 106 may be varied according to the current proficiency level, so that a level, once raised, does not easily drop. For example, the proficiency level conversion table 106 for a worker using the beginner voice scenario 26 associates a score of “less than 3” with “beginner”, “4 or more and less than 6” with “intermediate”, and “7 or more” with “advanced”. The table for a worker using the intermediate voice scenario 26 associates a score of “less than 2” with “beginner”, “3 or more and less than 6” with “intermediate”, and “7 or more” with “advanced”. The table for a worker using the advanced voice scenario 26 associates a score of “less than 2” with “beginner”, “3 or more and less than 4” with “intermediate”, and “5 or more” with “advanced”.
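
The level-dependent conversion tables can be sketched in Python as a pair of lower bounds per current level. The names are hypothetical, and one assumption goes beyond the text: the ranges given above leave some scores unassigned (for example, between 3 and 4 for a beginner), and this sketch places such scores in the lower of the two neighbouring levels.

```python
from typing import Dict, Tuple

# (minimum score for "intermediate", minimum score for "advanced"),
# keyed by the worker's current proficiency level.
THRESHOLDS: Dict[str, Tuple[float, float]] = {
    "beginner": (4, 7),
    "intermediate": (3, 7),
    "advanced": (3, 5),
}

def select_level(score: float, current_level: str) -> str:
    """Convert an evaluation score to a level using the current level's table."""
    intermediate_min, advanced_min = THRESHOLDS[current_level]
    if score >= advanced_min:
        return "advanced"
    if score >= intermediate_min:
        return "intermediate"
    return "beginner"       # includes scores in the unassigned gaps (assumption)
```

With these bounds, a score of 5.5 keeps an advanced worker at “advanced” but would place a beginner at “intermediate”, which is the intended resistance to demotion.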

このように、中級及び上級の音声シナリオ２６を使用する作業者については、一度、習熟レベルが上がると落ちにくい評価基準とすることにより、一時的な要因（体調不良や周囲の騒音などの不利な環境下）での習熟レベルの低下に対処できる。 In this way, for workers who use the intermediate or advanced voice scenario 26, adopting evaluation criteria under which a proficiency level, once raised, does not easily drop makes it possible to cope with a decline in the proficiency level caused by temporary factors (unfavorable conditions such as poor physical condition or ambient noise).

（まとめ） (Summary)
以上、本実施例の音声シナリオ生成装置１によれば、管理者がシナリオマスタ２１及びグラマーマスタ２２を登録することで、音声シナリオを自動生成できるので、音声システムの構築が容易になり、音声システムの開発における開発コストの削減と、開発期間の短縮とを実現できる。結果、本実施例の音声シナリオ生成装置１によれば、音声システムの実運用開始までの対応時間を短縮することが可能である。 As described above, according to the voice scenario generation device 1 of this embodiment, a voice scenario can be generated automatically once the administrator registers the scenario master 21 and the grammar master 22. This makes the voice system easier to build and reduces both the development cost and the development period of the voice system. As a result, the voice scenario generation device 1 of this embodiment can shorten the time required before the voice system enters actual operation.

また、本実施例の音声シナリオ設定装置１００によれば、作業者の習熟度を自動的に判定するため、管理者による作業者の習熟度の判断が不要となる。さらに、本実施例の音声シナリオ生成装置１及び音声シナリオ設定装置１００によれば、シナリオマスタ２１及びグラマーマスタ２２の試作を繰り返しながら、音声システムを運用できるため、運用に即したシナリオマスタ２１及びグラマーマスタ２２を生成でき、音声認識率の高い音声システムの提供が容易となる。 Further, according to the voice scenario setting device 100 of the present embodiment, the proficiency level of the worker is automatically determined, so that it is not necessary for the administrator to determine the proficiency level of the worker. Furthermore, according to the voice scenario generation device 1 and the voice scenario setting device 100 of this embodiment, the voice system can be operated while repeating the trial production of the scenario master 21 and the grammar master 22, so that the scenario master 21 and the grammar suitable for the operation are used. The master 22 can be generated, and it becomes easy to provide a voice system with a high voice recognition rate.

本実施例では音声シナリオ生成装置１及び音声シナリオ設定装置１００を別々に説明したが、一つの装置としてもよいし、音声システムの機能として含ませてもよい。 In this embodiment, the voice scenario generation device 1 and the voice scenario setting device 100 have been described separately, but they may be combined into a single device or included as functions of the voice system.

本発明は、以下に記載する付記のような構成が考えられる。
（付記１）
コンピュータを、
ユーザの音声入力実績を取得し、前記音声入力実績と習熟レベルとを対応付けるテーブルに基づいて、ユーザの前記音声入力実績に対応する前記習熟レベルを選択し、ユーザの前記習熟レベルに応じた音声シナリオを自動設定するシナリオ自動設定処理手段
として機能させる為の音声シナリオ設定プログラム。
（付記２）
前記シナリオ自動設定処理手段は、ユーザの音声入力実績として、音声発話能力を評価する為の音声入力実績，音声ヒアリング能力を評価する為の音声入力実績，音声応答能力を評価する為の音声入力実績を取得し、前記音声発話能力を評価する為の音声入力実績と評価情報とを対応付けるテーブルと、前記音声ヒアリング能力を評価する為の音声入力実績と評価情報とを対応付けるテーブルと、前記音声応答能力を評価する為の音声入力実績と評価情報とを対応付けるテーブルとに基づいて、ユーザの前記音声入力実績に対応する前記習熟レベルを選択する付記１記載の音声シナリオ設定プログラム。
（付記３）
前記シナリオ自動設定処理手段は、ユーザの現在の前記習熟レベルに応じて、ユーザの前記音声入力実績に対応する前記習熟レベルを異ならせ、前記習熟レベルが下がりにくくする付記１又は２記載の音声シナリオ設定プログラム。
（付記４）
前記音声発話能力を評価する為の音声入力実績は、音声認識率及び音声適正認識回数であり、前記音声ヒアリング能力を評価する為の音声入力実績は、音声読上げ速度及び再確認回数であり、前記音声応答能力を評価する為の音声入力実績、音声出力に対する応答時間である付記２記載の音声シナリオ設定プログラム。
（付記５）
音声シナリオを自動設定する音声シナリオ設定装置であって、
ユーザの音声入力実績を取得し、前記音声入力実績と習熟レベルとを対応付けるテーブルに基づいて、ユーザの前記音声入力実績に対応する前記習熟レベルを選択し、ユーザの前記習熟レベルに応じた音声シナリオを自動設定するシナリオ自動設定処理手段
を有する音声シナリオ設定装置。 The present invention may have the following configurations as described below.
(Appendix 1)
Computer
A voice scenario corresponding to the user's proficiency level is selected by acquiring the user's voice input record, selecting the proficiency level corresponding to the user's voice input record based on a table associating the voice input record and the proficiency level. Voice scenario setting program for functioning as an automatic scenario setting processing means for automatically setting
(Appendix 2)
The scenario automatic setting processing means includes a voice input record for evaluating voice utterance ability, a voice input record for evaluating voice hearing ability, and a voice input record for evaluating voice response ability as a user's voice input record. And a table associating the speech input performance and the evaluation information for evaluating the speech utterance ability, a table associating the speech input record and the evaluation information for evaluating the speech hearing ability, and the speech response ability The voice scenario setting program according to supplementary note 1, wherein the learning level corresponding to the voice input record of the user is selected based on a table for associating the voice input record for evaluating the voice and the evaluation information.
(Appendix 3)
The audio scenario according to appendix 1 or 2, wherein the scenario automatic setting processing unit varies the proficiency level corresponding to the voice input performance of the user according to the current proficiency level of the user, and makes the proficiency level difficult to decrease. Configuration program.
(Appendix 4)
The speech input performance for evaluating the speech utterance ability is a speech recognition rate and the number of proper speech recognitions, and the speech input performance for evaluating the speech hearing ability is a speech reading speed and the number of reconfirmations, The voice scenario setting program according to appendix 2, which is a voice input result for evaluating voice response capability and a response time for voice output.
(Appendix 5)
A voice scenario setting device for automatically setting a voice scenario, comprising scenario automatic setting processing means for acquiring a user's voice input record, selecting the proficiency level corresponding to that voice input record based on a table associating voice input records with proficiency levels, and automatically setting a voice scenario corresponding to the user's proficiency level.
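
The selection flow described in appendices 1 to 4 can be illustrated with a minimal sketch. Everything concrete in it is an assumption made for illustration and does not come from the specification: the three-level scale, the threshold values in the tables, the use of only one metric per ability, the rule of taking the lowest of the three scores, and the names `VoiceInputRecord`, `evaluate`, `select_level`, and `set_scenario`. The "difficult to decrease" behaviour of appendix 3 is modelled here as lowering the level by at most one step, and only when the evaluated level is two or more steps below the current level.

```python
from dataclasses import dataclass


@dataclass
class VoiceInputRecord:
    """Hypothetical subset of the metrics named in appendix 4."""
    recognition_rate: float   # utterance ability: voice recognition rate (0..1)
    reconfirmations: int      # hearing ability: number of reconfirmations
    response_time_s: float    # response ability: response time to a voice output


# Assumed evaluation tables. Rows are checked in order; the first match wins.
UTTERANCE_TABLE = [(0.95, 3), (0.85, 2)]  # (minimum recognition rate, score)
HEARING_TABLE = [(1, 3), (3, 2)]          # (maximum reconfirmations, score)
RESPONSE_TABLE = [(2.0, 3), (4.0, 2)]     # (maximum response time in s, score)

# Assumed voice scenario setting table: proficiency level -> scenario name.
SCENARIO_TABLE = {1: "beginner", 2: "intermediate", 3: "advanced"}


def score_at_least(table, value):
    """Return the score of the first row whose threshold the value reaches."""
    for threshold, score in table:
        if value >= threshold:
            return score
    return 1


def score_at_most(table, value):
    """Return the score of the first row whose limit the value stays within."""
    for limit, score in table:
        if value <= limit:
            return score
    return 1


def evaluate(record):
    """Combine the three ability scores; taking the minimum is an assumption."""
    return min(
        score_at_least(UTTERANCE_TABLE, record.recognition_rate),
        score_at_most(HEARING_TABLE, record.reconfirmations),
        score_at_most(RESPONSE_TABLE, record.response_time_s),
    )


def select_level(current, evaluated):
    """Raise the level freely, but make it hard to lower (assumed rule)."""
    if evaluated >= current:
        return evaluated
    if current - evaluated >= 2:
        return current - 1  # lower by one step only
    return current          # a one-step shortfall keeps the current level


def set_scenario(current_level, record):
    """Return the new proficiency level and the scenario assigned to it."""
    level = select_level(current_level, evaluate(record))
    return level, SCENARIO_TABLE[level]
```

For example, under these assumed tables a user at level 3 whose latest record evaluates to level 2 keeps the level 3 scenario, while a record that evaluates to level 1 moves the user down to level 2 rather than directly to level 1.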

The present invention is not limited to the specifically disclosed embodiments, and various modifications and changes can be made without departing from the scope of the claims. The scenario automatic setting processing means described in the claims corresponds to the proficiency level selection processing unit 105.

Description of Symbols

1 Voice scenario generation device
11 Input device
12 Output device
13 Drive device
14 Auxiliary storage device
15 Main storage device
16 Arithmetic processing device
17 Interface device
18 Recording medium
21 Scenario master
22 Grammar master
23 Scenario automatic generation processing unit
24 Grammar automatic generation processing unit
25 Grammar condition designation receiving unit
26 Voice scenario
27 Voice grammar
31-35 Voice instruction
36 Data processing
41-43 Utterance
100 Voice scenario setting device
101 Voice input DB
102 Voice input record collection processing unit
103 Voice input record log DB
104 Performance evaluation DB
105 Proficiency level selection processing unit
106 Proficiency level conversion table
107 Voice scenario setting table
B Bus

Claims

A voice scenario setting program for causing a computer to function as scenario automatic setting processing means for acquiring a user's voice input record, selecting the proficiency level corresponding to that voice input record based on a table associating voice input records with proficiency levels, and automatically setting a voice scenario corresponding to the user's proficiency level.

The voice scenario setting program according to claim 1, wherein the scenario automatic setting processing means acquires, as the user's voice input record, a voice input record for evaluating voice utterance ability, a voice input record for evaluating voice hearing ability, and a voice input record for evaluating voice response ability, and selects the proficiency level corresponding to the user's voice input record based on a table associating the voice input record for evaluating voice utterance ability with evaluation information, a table associating the voice input record for evaluating voice hearing ability with evaluation information, and a table associating the voice input record for evaluating voice response ability with evaluation information.

The voice scenario setting program according to claim 1, wherein the scenario automatic setting processing means varies the proficiency level corresponding to the user's voice input record according to the user's current proficiency level, so that the proficiency level is difficult to decrease.

A voice scenario setting device for automatically setting a voice scenario, comprising scenario automatic setting processing means for acquiring a user's voice input record, selecting the proficiency level corresponding to that voice input record based on a table associating voice input records with proficiency levels, and automatically setting a voice scenario corresponding to the user's proficiency level.