JP2010182191A

JP2010182191A - Business form input device, business form input system, business form input method, and program

Info

Publication number: JP2010182191A
Application number: JP2009026461A
Authority: JP
Inventors: Shuhei Maekawa; 周平前川
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2009-02-06
Filing date: 2009-02-06
Publication date: 2010-08-19

Abstract

PROBLEM TO BE SOLVED: To improve the accuracy and efficiency of business form input operations that use voice recognition.
SOLUTION: When a keyword stored in a recognition keyword database 204 is recognized in input voice data, a recognition event determining section 103 selects the event (action) corresponding to the keyword. A business form screen changing section 104 instructs a change of the business form screen or of the selected entry field according to the event, and a business form display section 105 updates the screen displayed on an operator terminal. A language model selecting section 303 refers to a business form meta-information language model-corresponding database 302 and selects the language model 301 required for input to the changed business form screen. Data input to the business form using the selected language model 301 is stored in a business form database 202.
COPYRIGHT: (C)2010, JPO&INPIT

Description

The present invention relates to a form input technique for entering information, such as the content of a conversation, into a form by using voice recognition, for example in telephone reception work at a call center.

In telephone reception work at a call center, an operator must enter the content of the call, such as the customer's inquiries and requests, into a form while talking with the customer, and must also respond to the customer appropriately. The call history can instead be entered after the call has ended, but when there is a large amount of information to enter, input errors occur and the work becomes a burden.
Various form input techniques using voice recognition have therefore been developed.

For example, the call center system described in Patent Document 1 receives the user's voice, recognizes it, and displays the recognition result on the operator's display device. The operator repeats the utterance while referring to the displayed result, the repeated voice is also recognized, and whichever of the two recognition results has the higher recognition rate is selected as the final result.
The call center system described in Patent Document 2 acquires voice recognition information on the content of an operator's call, refers to determination information on tone and phrases stored in a database, and determines, for example, whether the operator's response is appropriate.
In the conventional call center system described in Patent Document 3, a server recognizes the telephone conversation and outputs it to a display device as character data, together with past response information stored in the server, so that the person answering the call can respond appropriately by referring to the character data.
Techniques have also been developed in which a plurality of language models (dictionaries) are prepared for voice recognition, or in which a specialized language model is prepared.

Patent Document 1: Japanese Patent Laid-Open No. 10-322450
Patent Document 2: JP 2008-211271 A
Patent Document 3: JP 2005-110034 A

In call center systems such as those above, the result of recognizing a telephone conversation can be displayed and stored as character information. However, it is difficult to direct that input to the right place when there are many forms or a complex form with many input fields, and input errors occur. It is also difficult for the operator to switch forms and voice recognition language models manually while answering a call, and switching errors occur.
The present invention has been made in view of the above. Its object is to recognize what the customer and the operator say by using voice recognition, and to automatically change form fields dynamically and select an appropriate language model according to the content of the utterance, thereby improving recognition accuracy and improving the efficiency and accuracy of input to the form.

To achieve the above object, the present invention is a form input device that supports data input to a form, comprising: form meta information language model correspondence data storage means for associating meta information of the form with a language model and storing the association as form meta information language model correspondence data; keyword data storage means for associating the meta information of the form, a keyword, and event information and storing the association as keyword data; voice recognition means for recognizing voice data; event execution means for executing an event based on the keyword data when the keyword is recognized in the voice data; language model selection means for selecting, when the form to be input is changed by the event, the language model corresponding to the meta information of the changed form based on the form meta information language model correspondence data; and form input means for determining input information of the form using the selected language model.

The form input device of the present invention stores in advance a language model corresponding to the meta information of a form (form template), and a keyword and event information corresponding to the meta information of the form. The device recognizes input voice data. When a keyword is recognized in the voice data, the device executes an event, such as changing the form screen, based on the event information corresponding to the keyword, and then accepts input or performs automatic input using the language model associated with the relevant input field (meta information) of the changed form.
A restriction on the input information of the form can also be stored as event information.
Because the present invention selects the language model and fills in the form automatically, without manual form input, manual language model switching, or manual form switching, it reduces the form input workload and improves accuracy.

The second invention is a form input method comprising the steps of: associating meta information of a form with a language model and storing the association as form meta information language model correspondence data; associating the meta information of the form, a keyword, and event information and storing the association as keyword data; recognizing voice data; executing an event based on the keyword data when the keyword is recognized in the voice data; selecting, when the form to be input is changed by the event, the language model corresponding to the meta information of the changed form based on the form meta information language model correspondence data; and determining input information of the form using the selected language model.

The third invention is a form input system having an operator terminal capable of voice communication with a customer terminal, and a server capable of exchanging data with the operator terminal via a network. The server comprises: form meta information language model correspondence data storage means for associating meta information of a form with a language model and storing the association as form meta information language model correspondence data; keyword data storage means for associating the meta information of the form, a keyword, and event information and storing the association as keyword data; voice recognition means for recognizing voice data received from the customer terminal and the operator terminal; event execution means for executing an event based on the keyword data when the keyword is recognized in the voice data; means for changing the form to be input according to the event and displaying it on the operator terminal; language model selection means for selecting the language model corresponding to the meta information of the changed form based on the form meta information language model correspondence data; and form input means for determining input information of the form using the selected language model.
The fourth invention is a program that causes a computer to operate as the form input device of the first invention.

According to the present invention, form input using voice recognition can be performed easily and quickly, and the accuracy of input to the form improves.

FIG. 1 is a hardware configuration diagram of the form input system 1.
FIG. 2 is a diagram showing the functional block configuration and data flow of the form input system 1.
FIG. 3 is a diagram showing an example of the meta information of a form.
FIG. 4 is a diagram showing an example of the form meta information language model correspondence database.
FIG. 5 is a diagram showing an example of the recognition keyword database.
FIG. 6 is a sequence diagram showing the flow of the data registration process.
FIG. 7 is a sequence diagram showing the flow of the form input process.
FIG. 8 is a diagram showing an example of a form screen.
FIG. 9 is a hardware configuration diagram of the form input system 11.
FIG. 10 is a diagram showing the functional block configuration and data flow of the form input system 11.
FIG. 11 is a sequence diagram showing the flow of the data registration process.
FIG. 12 is a diagram showing an example of the meta information of a form.
FIG. 13 is a diagram showing an example of the utterance determination database 402.
FIG. 14 is a sequence diagram showing the flow of the form input process.
FIG. 15 is a diagram showing an example of a form screen.

Preferred embodiments of the form input device according to the present invention are described in detail below with reference to the accompanying drawings. In the following description and the accompanying drawings, components having substantially the same functional configuration are given the same reference numerals, and redundant description is omitted.

FIG. 1 is a hardware configuration diagram of the form input system. The form input system 1 includes a server 3, an operator terminal 5, a form manager terminal 7, and a customer terminal 9.
The customer terminal 9 is a terminal device installed at the customer 8 that is capable of voice communication via the public line network 12, such as a telephone, a mobile phone, or a computer. As shown in FIG. 1, the description below takes as an example the case where the customer 8 connects from the customer terminal 9 to the public line network 12 and makes an inquiry to the operator terminal 5 through a PBX (Private Branch eXchange) 13. The server 3, the operator terminal 5, and the form manager terminal 7 can exchange data via the network 14. The PBX 13 is, for example, an IP-PBX. It can build a telephone network on the network 14 to which the server 3, one or more operator terminals 5, and the form manager terminal 7 are connected, and thereby enables telephone calls between the customer 8 and the operator 4.

The operator terminal 5 is a computer or the like, and includes a control unit 51, a storage unit 52, an input unit 53, an output unit 54, and a communication unit 55. The control unit 51 is a central processing unit (CPU), a microprocessor, or the like, and controls each unit of the operator terminal 5. The storage unit 52 is a storage device such as a nonvolatile memory, a volatile memory, or a hard disk. The input unit 53 comprises input devices such as a keyboard and a mouse, and voice input devices such as a headset and a microphone. The output unit 54 comprises a display device, an audio output device such as a speaker, and the like. The communication unit 55 is a device that exchanges data with the server 3 via the network 14 and performs voice communication with the customer terminal 9.

The form manager terminal 7 is a computer or the like installed at the form manager 6. The form manager terminal 7 includes a control unit 71, a storage unit 72, an input unit 73, an output unit 74, and a communication unit 75, whose functions and operations are the same as those of the control unit 51, storage unit 52, input unit 53, output unit 54, and communication unit 55 of the operator terminal 5 described above.

The server 3 is a server computer and corresponds to the form input device. The server 3 includes a control unit 31, a storage unit 32, an input unit 33, an output unit 34, and a communication unit 35. The control unit 31 is a central processing unit (CPU), a microprocessor, or the like; it controls each unit in the server 3 and executes the processing described later. The storage unit 32 is a storage device such as a nonvolatile memory, a volatile memory, or a hard disk. The input unit 33 is an input device such as a keyboard or a mouse. The output unit 34 is a display device or the like. The communication unit 35 is a device that exchanges data with the operator terminal 5 and the form manager terminal 7 via the network 14.

FIG. 2 is a diagram showing the functional block configuration of the server and the flow of data. The functions of the server 3 comprise a form creation unit 81, a form management unit 82, and a voice recognition dictionary management unit 83. The form creation unit 81 includes a voice input unit 101, a voice recognition unit 102, a recognition event determination unit 103, a form screen change unit 104, a form display unit 105, and a form input unit 106. The form management unit 82 includes a form meta information database 201, a form database 202, a form management data input unit 203, and a recognition keyword database 204. The voice recognition dictionary management unit 83 includes language models 301, a form meta information language model correspondence database 302, and a language model selection unit 303.

The voice input unit 101 accepts voice (utterance) data input from the respective input units of the operator terminal 5 and the customer terminal 9. The voice recognition unit 102 recognizes the input voice data using a language model (dictionary) 301. The language model 301 used here is selected by the language model selection unit 303 using the form meta information language model correspondence data stored in the form meta information language model correspondence database 302.

The recognition event determination unit 103 determines the action (event) to perform by using the information in the form meta information database 201 and the recognition keyword database 204. The form screen change unit 104 dynamically changes the form display data according to instructions from the recognition event determination unit 103 and the like. The form display unit 105 displays the form on the operator terminal 5 or the like. The form input unit 106 determines the input information, for example by accepting input to the form from the operator terminal 5, and stores it in the form database 202.

The form management data input unit 203 accepts the form templates and their meta information, and the voice recognition keywords and their corresponding event information, that the form manager 6 enters from the form manager terminal 7. The form meta information database 201 stores the form templates and their meta information. The form database 202 stores the data entered into forms. The recognition keyword database 204 stores the keywords corresponding to the meta information and the event (action) information corresponding to each keyword.

The language models 301 are a plurality of dictionaries that the form input unit 106 uses when determining form input information; the voice recognition unit 102 also uses them when performing voice recognition. The form meta information language model correspondence database 302 stores information indicating the correspondence between form meta information and the language model used for input to that form. The language model selection unit 303 selects the language model 301 suited to voice recognition and form input.

The information stored in each database of the server 3 is described below.
FIG. 3 is a diagram showing part of the form meta information stored in the form meta information database 201. The form meta information is data about a form and describes each input field and other element on the form screen. A form screen contains input fields such as single-selection combo boxes, multiple-selection check boxes, date/time edit fields, numeric edit fields, free-text fields, and push buttons. The example meta information shown in FIG. 3 is written in XML. For example, part 601 in FIG. 3 defines the attribute "Combobox" and the attribute value "23/10/2008 13:00" for the form field ID "accidentdate".
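As a rough illustration of how XML meta information of this kind could be read, the following Python sketch parses a small document into a per-field dictionary. The element and attribute names (`form`, `field`, `id`, `attribute`, `value`), the form ID `repair_request`, and the second field `memo` are assumptions made for this example; the actual XML layout of FIG. 3 is not reproduced in the text. Only the field ID "accidentdate", the attribute "Combobox", and its value come from the description.

```python
import xml.etree.ElementTree as ET

# Hypothetical XML layout; only the "accidentdate" values come from the description.
FORM_META_XML = """
<form id="repair_request">
  <field id="accidentdate" attribute="Combobox" value="23/10/2008 13:00"/>
  <field id="memo" attribute="FreeText" value=""/>
</form>
"""


def load_form_meta(xml_text):
    """Return {form field ID: {"attribute": ..., "value": ...}} for every field element."""
    root = ET.fromstring(xml_text.strip())
    return {
        field.get("id"): {
            "attribute": field.get("attribute"),
            "value": field.get("value"),
        }
        for field in root.iter("field")
    }


form_meta = load_form_meta(FORM_META_XML)
print(form_meta["accidentdate"])
```

Keying the result by form field ID mirrors the way the later tables are described as being looked up by that same ID.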

FIG. 4 is a diagram showing an example of the form meta information language model correspondence data stored in the form meta information language model correspondence database 302. The data shown in FIG. 4 consists of form field IDs from the meta information and the language model (dictionary name) corresponding to each, and is expressed as rows of a relational database table. For example, part 641 in FIG. 4 indicates that the language model 301 for "date and time" corresponds to the form field ID "accidentdate" of part 601 in FIG. 3. Using this correspondence data, the language model selection unit 303 selects, for each form field (form input field), the most suitable language model 301 for entering data into that field.
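The lookup from form field ID to language model can be sketched with a small relational table. This is a minimal sketch, not the patent's implementation: the table name `field_language_model`, the column names, the dictionary name `datetime`, and the fallback to a `general` dictionary for unknown fields are all assumptions made for illustration.

```python
import sqlite3

# In-memory stand-in for the form meta information language model correspondence database 302.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE field_language_model ("
    "field_id TEXT PRIMARY KEY, dictionary_name TEXT NOT NULL)"
)
conn.executemany(
    "INSERT INTO field_language_model VALUES (?, ?)",
    [("accidentdate", "datetime")],  # corresponds to part 641: accidentdate -> date and time
)


def select_language_model(conn, field_id, default="general"):
    """Return the dictionary name for a form field, or a default (an assumption) if none is registered."""
    row = conn.execute(
        "SELECT dictionary_name FROM field_language_model WHERE field_id = ?",
        (field_id,),
    ).fetchone()
    return row[0] if row else default


print(select_language_model(conn, "accidentdate"))
```

The description does not say what happens for a field without a registered dictionary, so the default here is only one possible choice.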

FIG. 5 is a diagram showing an example of the recognition keyword data stored in the recognition keyword database 204. The data shown in FIG. 5 consists of a form field ID, the keyword to be recognized, an attribute, and the event (action) corresponding to the keyword, and is expressed as rows of a relational database table. For example, part 651 in FIG. 5 defines that when the keyword "occurrence date and time" is recognized for the form field ID "accidentdate", the attribute of the form field is "Combobox" and the action is "calendar selection". When a keyword stored in the recognition keyword database 204 is found in the data recognized by the voice recognition unit 102, the recognition event determination unit 103 determines the action stored in the recognition keyword database 204, and the form screen change unit 104 performs that action, for example by changing the form screen.
Thus, for the form field "accidentdate", part 641 in FIG. 4 is defined and stored as form meta information language model correspondence data, and part 651 in FIG. 5 as keyword data.
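The keyword table and the event determination can be sketched as follows. This is an illustrative sketch only: the `KeywordEntry` class, the function name `determine_events`, and the use of plain substring matching are assumptions, and the keyword is written in an English rendering ("occurrence date and time", 発生日時 in the source) so the example stays readable.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class KeywordEntry:
    """One row of the recognition keyword data (compare part 651 of FIG. 5)."""
    field_id: str
    keyword: str
    attribute: str
    event: str


KEYWORD_TABLE = [
    KeywordEntry("accidentdate", "occurrence date and time", "Combobox", "calendar selection"),
]


def determine_events(recognized_text, table):
    """Return every table row whose keyword appears in the recognized text."""
    return [entry for entry in table if entry.keyword in recognized_text]


hits = determine_events(
    "The occurrence date and time of the failure is October 28, isn't it?",
    KEYWORD_TABLE,
)
for hit in hits:
    print(hit.field_id, hit.event)
```

A real recognizer would report keyword hits directly rather than rely on substring search over a transcript; the substring test is just the simplest stand-in.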

Similarly, part 602 of the meta information in FIG. 3 corresponds to part 642 of the form meta information language model correspondence database 302 shown in FIG. 4 and to part 652 of the recognition keyword database 204 shown in FIG. 5. Likewise, part 603 corresponds to parts 643 and 653, part 604 to parts 644 and 654, and part 605 to parts 645 and 655.
In this way, the form input field and action corresponding to each keyword are defined, and the language model 301 for input to each input field of the form is defined.

Next, the operation of the form input system 1 is described.
Here, the operator 4 is a call center operator, and the case where the customer 8 makes an inquiry to the call center from the customer terminal 9 is taken as an example.
First, before call center work begins, the form manager 6 adds, updates, and deletes form meta information. FIG. 6 is a sequence diagram showing the flow of the data registration work that the form manager 6 performs from the form manager terminal 7.
The form manager terminal 7 accepts input of form meta information (step S1001) and transmits it to the server 3. The server 3 registers the form meta information in the form meta information database 201 (step S1002), then refers to the form field IDs in the registered meta information to look up the language models 301 used by the form and prepares them for selection.
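The server-side part of this registration step can be sketched as below. The class `FormServer`, the form ID `repair_request`, the field `memo`, and the dictionary name `datetime` are hypothetical names introduced for illustration; the description only states that the meta information is stored and that the language models for its field IDs are looked up and prepared.

```python
# Hypothetical catalogue: form field ID -> language model (dictionary) name.
AVAILABLE_MODELS = {"accidentdate": "datetime"}


class FormServer:
    """Minimal in-memory stand-in for the registration side of server 3."""

    def __init__(self, available_models):
        self.available_models = available_models
        self.form_meta_db = {}      # stand-in for form meta information database 201
        self.prepared_models = {}   # form ID -> {field ID: dictionary name}

    def register_form_meta(self, form_id, field_ids):
        """Store the meta information (step S1002) and prepare the models its fields need."""
        self.form_meta_db[form_id] = list(field_ids)
        self.prepared_models[form_id] = {
            field_id: self.available_models[field_id]
            for field_id in field_ids
            if field_id in self.available_models
        }
        return self.prepared_models[form_id]


server = FormServer(AVAILABLE_MODELS)
prepared = server.register_form_meta("repair_request", ["accidentdate", "memo"])
print(prepared)
```

Fields with no registered dictionary are simply skipped in this sketch; how the actual system treats them is not stated.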

Next, the form manager terminal 7 accepts input of the recognition keyword data shown in FIG. 5 (step S1004) and transmits it to the server 3. The server 3 stores the keyword information in the recognition keyword database 204 (step S1005).
Next, the case where call center work has started and a telephone inquiry is made from the customer terminal 9 to the operator terminal 5 is described as an example.

FIG. 7 is a sequence diagram showing the flow of form input work in this system.
When the customer 8 and the operator 4 begin talking on the telephone, for example exchanging inquiries and answers, the voice data (utterance information) input from the input unit 53 of the operator terminal 5, such as a voice input device, is transmitted to the voice input unit 101 of the server 3 (step S1201). The voice recognition unit 102 of the server 3 recognizes the voice data (step S1202) and determines whether a keyword stored in the recognition keyword database 204 has been uttered (step S1203).

If a keyword stored in advance in the recognition keyword database 204 has been uttered ("Yes" in step S1203), the recognition event determination unit 103 selects the event (action) corresponding to the keyword according to the data in the recognition keyword database 204 (step S1204). Next, it is determined whether there is an action to perform, such as the operator moving the focus on the form (for example, moving the mouse to an input field) to change the input field (step S1205). If there is an action ("Yes" in step S1205), the form screen change unit 104 issues an instruction to change the form screen or the selected input field according to the event (step S1206), and the form display unit 105 updates the screen displayed on the output unit 54 of the operator terminal 5. The language model selection unit 303 selects and prepares the language model 301 needed for input to the changed form screen or input field (step S1207).

On receiving the form screen change instruction from the server 3, the operator terminal 5 changes the form screen displayed on the output unit 54 (step S1208) and accepts form input from the operator 4 using the language model 301 prepared for that form. It is then determined whether the conversation between the customer 8 and the operator 4 has ended (step S1209). If it has, the form data (the form template and the entered information) is transmitted to the form input unit 106 of the server 3 (step S1210), and the form input unit 106 registers the form data in the form database 202 (step S1211).

If no registered keyword is present in the utterance in step S1203, or if no event corresponding to the keyword exists in step S1205, the recognition data obtained by voice recognition is registered as is in the form database 202 (step S1211).
If the conversation has not ended in step S1209, voice recognition of the utterances continues, and the operations from step S1201 to step S1211 are repeated until the conversation ends.
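
The branching described above can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the function name `run_form_input`, the dictionary layouts, and the use of plain substring matching in place of voice recognition are all assumptions, and the operator action check of step S1205 is reduced to "the keyword has an event". The sample entry borrows the "accidentdate" field from the example in this description.

```python
def run_form_input(utterances, keyword_db, model_db):
    """Process recognized utterances one by one (hypothetical sketch).

    keyword_db: keyword -> {"field_id": ..., "event": ...}  (stands in for database 204)
    model_db:   field_id -> language model name            (stands in for database 302)
    """
    form_db = []          # stands in for the form database 202
    screen_changes = []   # (field_id, event) pairs sent to the operator terminal
    active_model = None   # language model prepared for the current field

    for text in utterances:
        # S1203: does the utterance contain a registered keyword?
        hit = next((k for k in keyword_db if k in text), None)
        event = keyword_db[hit]["event"] if hit is not None else None

        if hit is None or event is None:
            # No keyword or no event: register the recognized text as is (S1211)
            form_db.append(text)
            continue

        # S1204, S1206: select the event and request a screen / field change
        field_id = keyword_db[hit]["field_id"]
        screen_changes.append((field_id, event))
        # S1207: prepare the language model tied to the changed field
        active_model = model_db.get(field_id)

    return form_db, screen_changes, active_model


keyword_db = {"発生日時": {"field_id": "accidentdate", "event": "calendar selection"}}
model_db = {"accidentdate": "date and time"}
form_db, changes, model = run_form_input(
    ["こんにちは", "故障の発生日時は１０月２８日ですね"], keyword_db, model_db
)
print(form_db, changes, model)
```

The loop ends when the list of utterances is exhausted, which plays the role of the end-of-conversation check in step S1209.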

For example, suppose that the customer 8 asks the call center to repair a broken machine and the call center operator 4 says, "So the failure occurred on October 28, correct?" (「故障の発生日時は１０月２８日ですね」). In this utterance, the keyword "occurrence date and time" (「発生日時」) is stored in the recognition keyword database 204 shown in FIG. 5; its form field ID is "accidentdate", its attribute is "Combobox", and it is associated with the event (action) "calendar selection".
A keyword has thus been uttered in step S1203, and an event corresponding to that keyword exists in step S1205. The server 3 prepares a form screen containing the form field ID "accidentdate" (step S1206) and, based on the form meta information language model correspondence database 302 shown in FIG. 4, selects and prepares the "date and time" language model 301 corresponding to the form field ID "accidentdate" (step S1207).

The operator terminal 5 displays the form screen containing the form field ID "accidentdate" (step S1208).
FIG. 8 shows an example of a form displayed on the operator terminal 5. Form field 701 in FIG. 8 is the input field for "occurrence date and time". The server 3 can perform voice recognition using the recognized keyword and the language model 301 selected on its basis (here, the "date and time" language model) and automatically set the occurrence date in the input field. In this case, for example, "October 28, 2008" is automatically entered as the "occurrence date and time" in input field 701, as shown in FIG. 8. Alternatively, because the event information associated with the form field ID "accidentdate" is "calendar selection", the operator 4 can enter in input field 701 a day selected from the calendar displayed on the operator terminal 5.
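
As a rough illustration of the automatic entry into field 701, the sketch below pulls a month and day out of the recognized text. A regular expression stands in for the "date and time" language model, which the description does not specify in detail. The function name `extract_occurrence_date` and the rule that the missing year is taken from the date of the call are assumptions made only for this example.

```python
import re
from datetime import date

# Hypothetical stand-in for the "date and time" language model:
# matches patterns such as "10月28日" in the recognized text.
DATE_PATTERN = re.compile(r"(\d{1,2})月(\d{1,2})日")

# Map full-width digits to ASCII digits before matching.
FULLWIDTH_DIGITS = str.maketrans("０１２３４５６７８９", "0123456789")


def extract_occurrence_date(recognized_text, call_date):
    """Return the date mentioned in the text, or None if none is found.

    The year is not spoken in the example utterance, so it is assumed
    to be the year of the call (an assumption, not from the source).
    """
    normalized = recognized_text.translate(FULLWIDTH_DIGITS)
    match = DATE_PATTERN.search(normalized)
    if match is None:
        return None
    month, day = int(match.group(1)), int(match.group(2))
    try:
        return date(call_date.year, month, day)
    except ValueError:
        # e.g. "2月30日": not a valid calendar date
        return None


value = extract_occurrence_date("故障の発生日時は１０月２８日ですね", date(2008, 11, 4))
print(value)  # value to place in the "accidentdate" field
```

If no date is recognized, the field would be left for the operator to fill in, for instance through the calendar selection mentioned above.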

When the call center operator 4 places the focus on the symptom field 702 shown in FIG. 8 using the input unit 53, such as a mouse, the server 3 can select the language model 301 for "symptom", recognize the voice data in which the customer 8 describes the symptom, and register the recognized data in the form database 202 as form data.
Furthermore, if, during input to the work field 703, the customer 8 says to the operator 4 "You do not need to do any work" (「作業していただく必要はないです」) or "No problem" (「問題ないです」), the server 3 recognizes "no problem" (「問題ない」) as a keyword registered in the keyword data 655 shown in FIG. 5 and automatically performs the action of making the combo box of the work field 703 "not enterable".
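
The input restriction on the work field can be pictured with the small sketch below. The class `FormField`, the field ID "work", and the table `RESTRICTING_KEYWORDS` are hypothetical names chosen for illustration; only the keyword 「問題ない」 and the "not enterable" behavior come from the description.

```python
from dataclasses import dataclass


@dataclass
class FormField:
    field_id: str
    value: str = ""
    enabled: bool = True

    def set_value(self, value):
        """Accept input only while the field is enabled."""
        if not self.enabled:
            return False
        self.value = value
        return True


# Keyword -> ID of the field to disable (the ID "work" is an assumption).
RESTRICTING_KEYWORDS = {"問題ない": "work"}


def apply_restrictions(recognized_text, fields):
    """Disable every field whose restricting keyword occurs in the text."""
    disabled = []
    for keyword, field_id in RESTRICTING_KEYWORDS.items():
        if keyword in recognized_text and field_id in fields:
            fields[field_id].enabled = False
            disabled.append(field_id)
    return disabled


fields = {"work": FormField("work")}
print(apply_restrictions("問題ないです", fields))   # fields that became "not enterable"
print(fields["work"].set_value("部品交換"))          # input is now rejected
```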

As described above, the form input system 1 recognizes the conversation between the customer 8 and the operator 4 in real time and dynamically changes the form screen based on the keywords uttered in it. From the language models 301 stored in association with keywords, it selects the language model 301 required for form input, and it uses the selected language model 301 to fill in the form field associated with the keyword automatically or to perform another predetermined action. Uttered speech can also be recognized and used as input information for the form, and the input information and input items of the form can be restricted. As a result, the operator's input work is reduced, the time required for form input is shortened, and the accuracy of form input improves.

In this embodiment the server 3 is a single server computer, but the processing can also be distributed across multiple server computers connected by a network. For example, among the functions of the server 3, the processing related to voice recognition, namely that performed by the voice input unit 101, the voice recognition unit 102, and the voice recognition dictionary management 83, may be performed by a voice recognition server and the remaining functions by a form management server, with the two servers exchanging data to carry out the processing described above.

Next, a second embodiment of the present invention will be described.
FIG. 9 is a hardware configuration diagram of the form input system 11. Its configuration is almost the same as that of the form input system 1 shown in FIG. 1; the difference is that an operator manager terminal 21 is added to the network.
The operator manager terminal 21 is a computer used by the operator manager 22 and has a control unit 91, a storage unit 92, an input unit 93, an output unit 94, and a communication unit 95. These have the same functions and operate in the same way as the control unit 51, storage unit 52, input unit 53, output unit 54, and communication unit 55 of the operator terminal 5 described above.

FIG. 10 shows the functional block configuration and data flow of the form input system 11. It differs from the functional block configuration of the form input system 1 shown in FIG. 2 in that an operator management unit 84 is added. The operator management unit 84 has an operator management data input unit 401, an utterance determination database 402, and an utterance determination unit 403.
The operator management data input unit 401 accepts input such as keyword information from the operator manager terminal 21 and stores it in the utterance determination database 402. Based on the data stored in the utterance determination database 402, the utterance determination unit 403 performs events (actions) such as form input or form screen changes in response to the operator's utterances and the like.

Next, the operation of the form input system 11 will be described.
Before call center operations begin, the form manager 6 and the operator manager 22 add, update, and delete data such as form meta information and keywords. FIG. 11 is a sequence diagram showing the flow of the data registration work. It is the same as the data registration work shown in FIG. 6 except for the data registration performed at the operator manager terminal 21.
The form meta information entered in step S1001 is transmitted to the operator manager terminal 21, and the operator manager terminal 21 stores data associating the meta information, keywords, and actions as the utterance determination database 402.

FIG. 12 shows an example of form meta information, and FIG. 13 shows an example of the data in the utterance determination database.
The utterance determination data shown in FIG. 13 consists of a form field ID from the meta information and the corresponding speaker (the speaker of the keyword), recognized keyword, and event (action). The example meta information shown in FIG. 12 is written in XML. For example, the utterance determination data 835 shown in FIG. 13 defines that, if the operator's utterance "This is a habitual complainer, isn't it" (「クレーマーですね」) is recognized during input to the form field ID "comment", the action is "notify the operator manager, then add a caution to the comment input field".
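
One way to picture a record of the utterance determination data is the sketch below. The four attributes follow the description (form field ID, speaker, keyword, event); the class name `UtteranceRule`, the speaker labels, and the lookup function `find_rule` are assumptions. The first rule mirrors utterance determination data 835. The second is modeled on the "urgency" example given for data 836, but its exact keyword and action wording are guesses. For brevity the lookup ignores which field is currently being entered.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class UtteranceRule:
    field_id: str   # form field ID from the meta information
    speaker: str    # who must utter the keyword: "operator" or "customer"
    keyword: str    # keyword to be recognized
    action: str     # event (action) to perform


RULES = [
    UtteranceRule(
        "comment", "operator", "クレーマーですね",
        "notify the operator manager, then add a caution to the comment input field",
    ),
    # Keyword and action text below are assumed for illustration.
    UtteranceRule("urgency", "customer", "訴えてやる", "set urgency to highest"),
]


def find_rule(speaker: str, recognized_text: str) -> Optional[UtteranceRule]:
    """Return the first rule whose speaker and keyword both match."""
    for rule in RULES:
        if rule.speaker == speaker and rule.keyword in recognized_text:
            return rule
    return None


rule = find_rule("operator", "お客様はクレーマーですね")
print(rule.field_id if rule else None)
```

Keying the rule on the speaker as well as the keyword means the same phrase can trigger an action when the operator says it and be ignored when the customer says it.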

In the data shown in FIGS. 12 and 13, the meta information 801, 802, 803, 804, 805, and 806 shown in FIG. 12 corresponds to the utterance determination data 831, 832, 833, 834, 835, and 836 shown in FIG. 13, respectively.

FIG. 14 is a sequence diagram showing the flow of form input work in the form input system 11. It is almost the same as the form input work of the form input system 1 shown in FIG. 7, with work at the operator manager terminal 21 added.
The voice recognition unit 102 of the server 3 recognizes the voice data (step S1202) and determines whether a keyword stored in the recognition keyword database 204 or the utterance determination database 402 has been uttered (step S3001).

If a keyword stored in advance in the recognition keyword database 204 or the utterance determination database 402 is uttered ("Yes" in step S3001), the recognition event determination unit 103 selects the event (action) corresponding to that keyword (step S3002). If the event requires notifying the operator manager 22 and action by the operator manager 22, the operator manager terminal 21 is notified, and input of actions or instructions from the operator manager terminal 21, such as instructions to the operator, is accepted (step S3003). For example, if the operator manager 22 then acts on the form input, the form screen changing unit 104 instructs a change of the form screen or of the input field selection according to that action (step S1206). The subsequent work is the same as in FIG. 7.
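
The hand-off to the operator manager in steps S3002, S3003, and S1206 might be organized along the lines of the sketch below. The function `dispatch_event`, the `needs_manager` flag, and the two callbacks are hypothetical. How events that need no manager involvement reach step S1206 is not spelled out here, so passing them straight to the form change is an assumption.

```python
def dispatch_event(event, notify_manager, change_form):
    """Route a selected event (hypothetical sketch).

    event:          {"action": str, "needs_manager": bool}
    notify_manager: callable(event) -> instruction string or None (S3003)
    change_form:    callable(instruction) applying the form change (S1206)
    """
    if event["needs_manager"]:
        # S3003: notify the manager terminal and wait for an instruction
        instruction = notify_manager(event)
        if instruction is None:
            return None          # the manager took no action on the form
        change_form(instruction)  # S1206, driven by the manager's action
        return instruction

    # Assumed path: events without manager involvement go directly to S1206
    change_form(event["action"])
    return event["action"]


applied = []
dispatch_event(
    {"action": "notify the operator manager", "needs_manager": True},
    notify_manager=lambda e: "add a caution to the comment input field",
    change_form=applied.append,
)
print(applied)
```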

FIG. 15 shows an example of a form input screen. For example, if, while entering data in the form shown in FIG. 15, the operator 4 says "The customer is a habitual complainer, isn't he" (「お客様はクレーマーですね」) and this is recognized by the voice recognition unit 102, the recognition event determination unit 103 can notify the operator manager terminal 21, accept a comment entered by the operator manager 22, and display it in the comment input field 911 shown in FIG. 15.
In this case, for example, the utterance of the operator manager 22 can be recognized as voice data using the language model 301 associated with the comment input field 911 in the form meta information language model correspondence database 302, and the recognition data can be entered in the form and displayed.
Also, if the customer 8 says to the operator 4 "I will sue your company" (「お前の会社を訴えてやる」) and this is recognized by the voice recognition unit 102, "highest" (「最高」) can be entered automatically in the combo box of the form field "urgency" according to the utterance determination data 836 shown in FIG. 13.

In the second embodiment, the operator manager 22 can enter the data that associates keywords in the conversation between the customer 8 and the operator 4 with actions related to form input. Therefore, even when the operator 4 has little experience, the judgment and handling of the experienced operator manager 22 can be reflected in the form input work, which improves the accuracy of the form input work and reduces the workload on the operator 4.

It is also possible to create a program that implements the form input processing by the server 3 described above and have a general-purpose computer load the program to realize the form input device. The program may be recorded on a recording medium such as a CD-ROM or distributed over a network.
Preferred embodiments of the form input system according to the present invention have been described above with reference to the accompanying drawings, but the present invention is not limited to these examples. It is clear that a person skilled in the art could conceive of various changes or modifications within the scope of the technical idea disclosed in this application, and these are naturally understood to belong to the technical scope of the present invention.

Thus, according to the present invention, in work that creates history data accompanied by spoken utterances, an administrator prepares input data templates, voice recognition keywords, and language models in advance and defines the relationships among them, so that the invention can be applied to uses that simplify input through voice recognition without burdening general users. For example, when meeting minutes must be prepared, the meeting organizer can register a minutes template, language models, and utterance keywords in advance, and minutes can be created by voice recognition from the participants' utterances.

DESCRIPTION OF SYMBOLS
1 ......... Form input system
3 ......... Server
5 ......... Operator terminal
7 ......... Form manager terminal
9 ......... Customer terminal
101 ......... Voice input unit
102 ......... Voice recognition unit
103 ......... Recognition event determination unit
104 ......... Form screen changing unit
105 ......... Form display unit
106 ......... Form input unit
201 ......... Form meta information database
202 ......... Form database
203 ......... Form management database
204 ......... Recognition keyword database
301 ......... Language model
302 ......... Form meta information language model correspondence database
303 ......... Language model selection unit

Claims

A form input device for supporting data input work on a form, comprising:
form meta information language model correspondence data storage means for associating meta information of the form with a language model and storing the association as form meta information language model correspondence data;
keyword data storage means for associating the meta information of the form, a keyword, and event information and storing the association as keyword data;
voice recognition means for recognizing voice data;
event execution means for executing an event based on the keyword data when the keyword is recognized in the voice data;
language model selection means for selecting, when the form to be input is changed by the event, a language model corresponding to the meta information of the changed form based on the form meta information language model correspondence data; and
form input means for determining input information of the form using the selected language model.

The form input device according to claim 1, wherein the event execution means notifies the form input means of a restriction on input information when a predetermined keyword is recognized, and
the form input means restricts input of the input information of the form.

The form input device according to claim 1, wherein the event execution means notifies an administrator when a predetermined keyword is recognized, and
the form input means recognizes voice data of the administrator using the selected language model and accepts it as input information of the form.

A form input system comprising:
an operator terminal capable of voice communication with a customer terminal; and
a server capable of transmitting and receiving data to and from the operator terminal via a network,
wherein the server comprises:
form meta information language model correspondence data storage means for associating meta information of a form with a language model and storing the association as form meta information language model correspondence data;
keyword data storage means for associating the meta information of the form, a keyword, and event information and storing the association as keyword data;
voice recognition means for recognizing voice data received from the customer terminal and the operator terminal;
event execution means for executing an event based on the keyword data when the keyword is recognized in the voice data;
means for changing the form to be input according to the event and displaying it on the operator terminal;
language model selection means for selecting a language model corresponding to the meta information of the changed form based on the form meta information language model correspondence data; and
form input means for determining input information of the form using the selected language model.

A form input method comprising the steps of:
associating meta information of a form with a language model and storing the association as form meta information language model correspondence data;
associating the meta information of the form, a keyword, and event information and storing the association as keyword data;
recognizing voice data;
executing an event based on the keyword data when the keyword is recognized in the voice data;
selecting, when the form to be input is changed by the event, a language model corresponding to the meta information of the changed form based on the form meta information language model correspondence data; and
determining input information of the form using the selected language model.

A program for causing a computer to operate as:
form meta information language model correspondence data storage means for associating meta information of a form with a language model and storing the association as form meta information language model correspondence data;
keyword data storage means for associating the meta information of the form, a keyword, and event information and storing the association as keyword data;
voice recognition means for recognizing voice data;
event execution means for executing an event based on the keyword data when the keyword is recognized in the voice data;
language model selection means for selecting, when the form to be input is changed by the event, a language model corresponding to the meta information of the changed form based on the form meta information language model correspondence data; and
form input means for determining input information of the form using the selected language model.