JPH11312073A

JPH11312073A - System for recognizing meaning

Info

Publication number: JPH11312073A
Application number: JP10117524A
Authority: JP
Inventors: Masatomo Yazaki; 昌朋矢崎; Toshiaki Gomi; 俊明五味; Kenji Yamamoto; 健司山本; Masahide Noda; 政秀野田
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1998-04-27
Filing date: 1998-04-27
Publication date: 1999-11-09
Anticipated expiration: 2018-04-27
Also published as: JP4036528B2; US5991719A; EP0953896B1; EP0953896A1

Abstract

PROBLEM TO BE SOLVED: To provide a meaning recognizing system for recognizing meaning, and communicating the voice input instruction of user's language as an instruction independent of language to an application(AP), and for loading the AP, and constructing a user I/F capable of voice input in user's language. SOLUTION: A voice signal from a voice input device 10 is recognized by a voice recognizing part 30, and converted into a meaning number independent of language by a recognition word and meaning number converting part 40, and transferred to an AP 50. The meaning number is included in one part of the processing content in the AP 50, and the processing is executed, based on the meaning number. Also, in loading the AP 50, the meaning number is obtained, and a registered word corresponding to the meaning number used by the AP 50 is obtained from an outside description file so that a meaning number and registered word list 41 can be generated, and the recognition word and meaning number converting part 40 is updated. The registered word is communicated to a voice recognizing part 30, and a voice recognition engine 31 is tuned up so that a customized user interface can be constructed.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、音声入力が可能な
アプリケーションに対するインタフェースを提供するシ
ステムであって、利用者による音声によるアプリケーシ
ョンに対する入力指示を、その意味を認識してアプリケ
ーションに通知するシステムに関する。[0001] 1. Field of the Invention [0002] The present invention relates to a system for providing an interface for an application capable of voice input, and a system for recognizing the meaning of a user's input instruction to the application by voice and notifying the application. .

【０００２】[0002]

【従来の技術】近年のコンピュータシステムの高性能
化、マルチメディア処理技術の発達により高度なヒュー
マンマシンインタフェースの提供が可能となり、人間と
直接対話できるようなアプリケーションの開発環境が整
いつつある。2. Description of the Related Art In recent years, the advancement of computer systems and the development of multimedia processing technology have made it possible to provide advanced human-machine interfaces, and an application development environment that allows direct interaction with humans is being prepared.

【０００３】従来技術における音声認識技術を使用した
アプリケーションは、大別して音声入力部と音声認識部
と応用プログラム部分を備えている。音声入力部は、マ
イクロフォン等の音声入力装置を介して肉声を音声信号
に変換する。その他には音声信号を記録したファイルな
どからデータを読み込む場合もある。音声認識部は、入
力された音声信号が表わす単語を、登録された音声信号
パターンとのテンプレートマッチングなどにより認識
し、応用プログラムに対して認識単語を通知する。応用
プログラムは、音声認識部により通知された認識単語を
利用してその後の処理を実行する。An application using the speech recognition technology in the prior art is roughly divided into a speech input unit, a speech recognition unit, and an application program. The voice input unit converts a real voice into a voice signal via a voice input device such as a microphone. Alternatively, data may be read from a file in which an audio signal is recorded. The voice recognition unit recognizes a word represented by the input voice signal by template matching with a registered voice signal pattern, and notifies the application program of the recognized word. The application program executes subsequent processing using the recognized word notified by the voice recognition unit.

【０００４】上記従来の音声認識アプリケーションの特
徴として、以下の点が挙げられる。第一番目の特徴とし
て、応用プログラムが、音声認識対象単語ごとに１対１
に対応した処理プログラムとして記述されている点が挙
げられる。つまり、音声認識部は、登録された認識対象
単語からマッチングした結果を応用プログラムに通知す
るため、アプリケーション内部では、その後の処理を認
識対象単語ごとにあらかじめ必要な処理に分類して個別
に記述しておく必要がある。The features of the above-mentioned conventional speech recognition application are as follows. The first feature is that the application program has a one-to-one correspondence
Is described as a processing program corresponding to. In other words, the speech recognition unit notifies the application program of the result of matching from the registered recognition target words, so that the application internally classifies the subsequent processing into necessary processing in advance for each recognition target word and individually describes the processing. Need to be kept.

【０００５】第二番目の特徴として、アプリケーション
が、登録認識対象単語の言語に依存している点が挙げら
れる。つまり、音声認識部で登録されている認識対象単
語以外は認識できないため、応用プログラムは登録され
た言語を前提として記述されており、言語依存のものと
なっている。A second feature is that the application depends on the language of the word to be registered and recognized. In other words, since words other than the recognition target words registered in the speech recognition unit cannot be recognized, the application program is described based on the registered language, and is language-dependent.

【０００６】[0006]

【発明が解決しようとする課題】現在、各種アプリケー
ションの開発は、開発対象であるアプリケーションの高
度化および複雑化、市場ニーズの変化の多様化、商品の
国際化などに伴い、開発工数の低減、開発期間の短縮、
多言語への対応が求められている。しかし、上記紹介し
た従来の音声認識アプリケーションは以下のような問題
があった。At present, the development of various applications is accompanied by the sophistication and complexity of the applications to be developed, the diversification of changes in market needs, the internationalization of products, and the like. Shorten development time,
Multilingual support is required. However, the conventional speech recognition application introduced above has the following problems.

【０００７】従来の音声認識アプリケーションの第一番
目の特徴として挙げた、音声認識対象単語ごとに応用ア
プリケーションが記述されている点に関し、開発工数の
増加、認識対象単語の変更・追加などのチューニングの
困難性の問題があった。つまり、処理内容を認識対象単
語ごとにあらかじめ必要な処理に分類して個別に記述し
ておく必要があるため、認識対象単語の変更・追加は、
応用プログラムの関連する箇所すべてにその影響が及ぶ
こととなり、記述内容を変更する必要があった。[0007] Regarding the first feature of the conventional speech recognition application, in which an application application is described for each speech recognition target word, tuning for increasing the number of development steps, changing / adding the recognition target word, and the like. There was a problem of difficulty. In other words, it is necessary to classify the processing content into necessary processes for each recognition target word in advance and describe them separately.
This affected all relevant parts of the application program, and the description had to be changed.

【０００８】従来の音声認識アプリケーションの第二番
目の特徴として挙げた、アプリケーションが、登録認識
対象単語の言語に依存している点に関し、開発工数の増
加、多言語への対応の困難性という問題があった。つま
り、アプリケーション内部に言語依存部分があるので、
システム利用者の使用言語、使用方言などに合わせて単
語の表現方法を変えることは、言語依存部分すべてにそ
の影響が及ぶこととなり、記述内容を変更するローカラ
イズ作業の必要があった。The second feature of the conventional speech recognition application is that the application depends on the language of the word to be registered / recognized, which causes problems such as an increase in development man-hours and difficulty in coping with multiple languages. was there. In other words, there is a language-dependent part inside the application,
Changing the word expression in accordance with the system user's language, dialect, etc. would affect all language-dependent parts, and required localization work to change the description.

【０００９】従来の音声認識アプリケーションにおいて
も認識対象単語テーブルの更新、追加により認識対象単
語の更新、追加登録が可能であるが、認識対象単語テー
ブルの更新、追加登録は、応用プログラムの関連する箇
所すべてにその影響が及ぶこととなり、また、他の言語
への対応のため認識対象単語テーブルごと入れ替える方
法によると、結局、言語依存性から、応用プログラム自
体も、入れ替えた認識対象単語テーブルの言語に対応す
るようにローカライズしたものを別途開発して入れ替え
る必要があった。In a conventional speech recognition application, it is possible to update and add a word to be recognized by updating and adding a word table to be recognized. All of them are affected, and according to the method of replacing the recognition target word table for the support of other languages, the application program itself is eventually changed to the language of the replaced recognition target word table due to language dependency. It was necessary to develop and replace a localized version to correspond.

【００１０】本発明は、上記従来の音声認識システムの
問題点に鑑み、言語依存の音声認識部とアプリケーショ
ンの関係を相互に独立とし、利用者のユーザインタフェ
ースとして利用者のもっとも使用に適した言語、方言で
の使用環境を提供し、かつ、アプリケーション側では言
語非依存の意味認識を中心とするシステムとすることに
より、音声認識関連アプリケーションの多言語への容易
な対応を可能とすることを目的とする。また、認識対象
単語テーブルの更新・変更が、容易かつアプリケーショ
ンの応用プログラム記述部分への影響を最小限である意
味認識システムを提供することを目的とする。The present invention has been made in view of the above-mentioned problems of the conventional speech recognition system, and makes the relationship between the language-dependent speech recognition unit and the application mutually independent, so that a language most suitable for the user as a user interface can be used. The purpose of this system is to provide an environment for using dialects, and to make it easy for speech recognition-related applications to handle multiple languages by providing a system that focuses on language-independent semantic recognition on the application side. And It is another object of the present invention to provide a meaning recognition system in which updating and changing of a recognition target word table is easy and has minimal influence on an application program description portion of an application.

【００１１】[0011]

【課題を解決するための手段】上記課題を解決するため
に本発明にかかる意味認識システムは、入力となる音声
信号を受け付ける音声信号入力部と、入力された音声信
号を基に対応する単語を認識する音声認識部と、単語の
意味を表わす意味番号とその意味番号に属する登録単語
のリストである意味番号・登録単語リストを備えた認識
単語・意味番号変換部と、アプリケーション用インタフ
ェースと、意味番号をデータとして扱うアプリケーショ
ンを備え、前記音声信号入力部に入力された音声信号を
基に前記音声認識部により対応する単語を認識し、前記
認識した単語を前記認識単語・意味番号変換部により対
応する意味番号に変換し、前記アプリケーション用イン
タフェースを介して前記変換した意味番号をアプリケー
ションに通知することを特徴とする。In order to solve the above-mentioned problems, a semantic recognition system according to the present invention comprises a voice signal input unit for receiving a voice signal to be input, and a corresponding word based on the input voice signal. A speech recognition unit for recognizing, a recognition word / meaning number conversion unit having a meaning number representing the meaning of the word and a meaning number / registration word list which is a list of registered words belonging to the meaning number, an application interface, and a meaning An application that handles a number as data is provided, the corresponding word is recognized by the voice recognition unit based on the voice signal input to the voice signal input unit, and the recognized word is supported by the recognized word / semantic number conversion unit. And converts the converted meaning number to the application via the application interface. And wherein the door.

【００１２】かかる構成により、ユーザインタフェース
として利用者にとりもっとも便利な言語、方言のものを
提供することができ、かつ、言語依存の音声認識部の登
録単語を言語非依存の意味番号に変換してアプリケーシ
ョンに通知することができ、アプリケーションを言語非
依存のものとして記述ができる。With this configuration, it is possible to provide a user interface in the most convenient language and dialect for the user, and to convert a registered word of the language-dependent speech recognition unit into a language-independent meaning number. The application can be notified and the application can be described as language independent.

【００１３】次に、前記意味認識システムは、前記認識
単語・意味番号変換部の意味番号・登録単語リストの内
容を更新する意味番号・登録単語リスト更新部を備える
ことが好ましい。Next, the meaning recognition system preferably includes a meaning number / registration word list updating unit for updating the meaning number / registration word list of the recognition word / meaning number conversion unit.

【００１４】かかる構成により、意味番号・登録単語リ
ストを追加・変更することにより、意味認識システムを
チューンナップすることができる。次に、前記意味認識
システムは、意味を体系的に分類して各々の意味単位を
コード化した意味番号と各々の意味番号に属する登録単
語とを体系的に記述した単語・意味体系テーブルを記録
した外部記述ファイルを備えることが好ましい。With this configuration, the meaning recognition system can be tuned up by adding / changing the meaning number / registered word list. Next, the semantic recognition system records a word / semantic system table that systematically describes the semantic numbers that systematically classify the meanings and codes each semantic unit, and the registered words belonging to each semantic number. It is preferable to provide the external description file described above.

【００１５】かかる構成により、単語とその単語に対応
する意味単位との関係を体系的に表わした単語・意味体
系テーブルを、アプリケーション本体から分離可能な構
成とすることができ、利用者の使用環境に応じて外部記
述ファイルを交換することでアプリケーションをローカ
ライズするための基本となるデータを用意することがで
きる。With this configuration, a word / semantic system table that systematically represents the relationship between a word and a semantic unit corresponding to the word can be configured to be separable from the application main body, and the user's usage environment By exchanging external description files according to the above, basic data for localizing an application can be prepared.

【００１６】次に、前記アプリケーションは、処理デー
タの少なくとも一部に意味番号を含んだアプリケーショ
ンであって、前記認識単語・意味番号変換部が、前記ア
プリケーション用インタフェースを介して前記アプリケ
ーションに含まれた意味番号を取得する意味番号取得部
と、前記外部記述ファイルの単語・意味体系テーブル中
から前記取得した意味番号に対応する登録単語を抽出し
て前記意味番号・登録単語リストを生成する意味番号・
登録単語リスト生成部とを備えることが好ましい。Next, the application is an application that includes a meaning number in at least a part of processing data, and the recognition word / meaning number conversion unit is included in the application via the application interface. A meaning number acquiring unit for acquiring a meaning number, a meaning number for extracting a registered word corresponding to the acquired meaning number from the word / semantic system table of the external description file, and generating the meaning number / registered word list.
Preferably, a registered word list generation unit is provided.

【００１７】かかる構成により、アプリケーションを言
語非依存の意味番号に基づいて記述し、意味認識システ
ムのローカライズにあたっては、アプリケーションで使
用されている意味番号に対応するローカル言語の単語を
自動的に抽出することができ、ローカライズ作業工数の
大幅な低減が可能となる。With this configuration, an application is described based on a language-independent semantic number, and when localizing the semantic recognition system, words in a local language corresponding to the semantic number used in the application are automatically extracted. And the number of man-hours for localization work can be significantly reduced.

【００１８】次に、前記アプリケーションは、処理デー
タの少なくとも一部に意味番号を含んだアプリケーショ
ンであって、前記アプリケーション自身に含まれた意味
番号に基づいて前記外部記述ファイルの単語・意味体系
テーブル中から対応する登録単語を抽出する処理ステッ
プと、前記意味番号と登録単語を前記認識単語・意味番
号変換部に通知する処理ステップとを備え、前記認識単
語・意味番号変換部が、前記アプリケーションから取得
した意味番号と登録単語に基づいて前記意味番号・登録
単語リストを生成する意味番号・登録単語リスト生成部
を備えることが好ましい。Next, the application is an application that includes a meaning number in at least a part of the processing data, and is based on the meaning number included in the application itself. And a processing step of notifying the meaning number and the registered word to the recognition word / meaning number conversion unit, wherein the recognition word / meaning number conversion unit obtains the corresponding registration word from the application. It is preferable to include a meaning number / registered word list generation unit that generates the meaning number / registered word list based on the obtained meaning number and the registered word.

【００１９】かかる構成により、アプリケーションを言
語非依存の意味番号で記述し、意味認識システムのロー
カライズにあたっては、アプリケーション側から使用さ
れている意味番号に対応するローカル言語の単語を通知
させることができ、ローカライズ作業工数の大幅な低減
が可能となる。With this configuration, an application can be described with a language-independent semantic number, and in localizing the semantic recognition system, a word in a local language corresponding to the semantic number used by the application can be notified. The man-hour for localization work can be greatly reduced.

【００２０】次に、前記認識単語・意味番号変換部が、
前記生成した前記意味番号・登録単語リストに挙げられ
た登録単語を前記音声認識部に通知する登録単語通知部
を備え、前記音声認識部が、前記通知された登録単語に
基づいて音声認識エンジンの内容を更新する音声認識単
語更新部を備えることが好ましい。Next, the recognition word / meaning number conversion unit:
A registered word notifying unit for notifying the voice recognition unit of a registered word listed in the generated meaning number / registered word list, wherein the voice recognition unit includes a voice recognition engine based on the notified registered word. It is preferable to include a speech recognition word updating unit that updates the content.

【００２１】かかる構成により、アプリケーションで利
用するために音声認識部で認識が必要な単語の特定とそ
の単語認識に必要な音声認識エンジンの内容を自動生成
することができ、ローカライズ作業工数の大幅な低減が
可能となる。With this configuration, it is possible to specify a word that needs to be recognized by the voice recognition unit for use in an application and automatically generate the content of the voice recognition engine required for the word recognition. Reduction is possible.

【００２２】次に、前記意味認識システムは、利用者が
使用する言語を指定する使用言語指定部と、前記外部記
述ファイルとして複数の言語ごとの外部記述ファイル
と、前記音声認識部の音声認識エンジンとして複数の言
語ごとの音声認識エンジンとを備え、前記意味番号・登
録単語リスト生成部は、前記使用言語指定部により指定
された言語の外部記述ファイルを基に意味番号・登録単
語リストを生成し、前記音声認識部は、前記使用言語指
定部により指定された言語の音声認識エンジンを使用
し、利用者に対して前記アプリケーションを複数の言語
による入力指示ができるマルチリンガル対応のユーザイ
ンタフェースを提供することが好ましい。Next, the semantic recognition system includes a language specification unit for specifying a language used by a user, an external description file for each of a plurality of languages as the external description file, and a voice recognition engine of the voice recognition unit. And a speech recognition engine for each of a plurality of languages, and the meaning number / registered word list generating unit generates a meaning number / registered word list based on an external description file of the language specified by the working language specifying unit. The voice recognition unit uses a voice recognition engine of a language specified by the language-in-use specification unit, and provides a multilingual user interface capable of instructing a user to input the application in a plurality of languages. Is preferred.

【００２３】かかる構成により、利用者の使用言語、使
用表現に合わせたカスタマイズ作業が容易に実行でき
る。また複数言語の外部記述ファイルと音声認識エンジ
ンを指定することにより複数言語の意味番号・登録単語
リストの自動生成、音声認識エンジン内容の更新をする
ことができ、アプリケーションを、同時に複数の言語に
よる音声入力指示ができるマルチリンガル対応とするこ
とができる。ここで言語には方言も含むことができ、日
本語、英語などの言語に加え、関西弁、東北弁など各国
語の各方言も含むことができる。With this configuration, a customizing operation can be easily performed in accordance with the user's language and expression. Also, by specifying an external description file in multiple languages and a speech recognition engine, it is possible to automatically generate a list of meaning numbers and registered words in multiple languages and update the contents of the speech recognition engine. Multilingual support that allows input instructions can be provided. Here, the language may include a dialect, and in addition to languages such as Japanese and English, may include dialects of various languages such as the Kansai dialect and the Tohoku dialect.

【００２４】次に、前記意味認識システムは、前記外部
記述ファイルを提供する外部記述ファイルサーバと、前
記アプリケーションを提供するアプリケーションサーバ
と、前記音声信号入力部と前記音声認識部と前記認識単
語・意味番号変換部と前記アプリケーション用インタフ
ェースとを備えた意味認識システムクライアントを備
え、前記外部記述ファイルサーバと前記アプリケーショ
ンサーバと前記意味認識システムクライアントがコンピ
ュータネットワークを介して接続されることが好まし
い。Next, the semantic recognition system includes an externally described file server that provides the externally described file, an application server that provides the application, the voice signal input unit, the voice recognition unit, and the recognized word / meaning. Preferably, a semantic recognition system client including a number conversion unit and the application interface is provided, and the externally described file server, the application server, and the semantic recognition system client are connected via a computer network.

【００２５】かかる構成により、外部記述ファイルをネ
ットワーク上に配置された外部記述ファイルサーバとし
たクライアントサーバ構成で構築することができ、各言
語ごとの外部記述ファイルを意味認識システム個別に用
意する必要がなくなる。With this configuration, it is possible to construct an externally described file in a client server configuration as an externally described file server arranged on a network, and it is necessary to prepare an externally described file for each language separately for the semantic recognition system. Disappears.

【００２６】次に、本発明にかかる意味認識処理プログ
ラムを記録したコンピュータ読み取り可能な記録媒体
は、入力となる音声信号を受け付ける音声信号入力処理
ステップと、前記入力処理された音声信号を基に対応す
る単語を認識する音声認識処理ステップと、意味を表わ
す意味番号とその意味番号に属する登録単語のリストで
ある意味番号・登録単語リスト生成する意味番号・登録
単語リスト生成処理ステップと、前記音声認識処理ステ
ップにより認識した単語を前記生成した意味番号・登録
単語リストにより対応する意味番号に変換する認識単語
・意味番号変換処理ステップと、前記変換された意味番
号を前記アプリケーションに通知する意味番号通知処理
ステップとを備えた意味認識処理プログラムを記録した
ことを特徴とする。Next, a computer-readable recording medium on which the semantic recognition processing program according to the present invention is recorded has an audio signal input processing step of receiving an input audio signal, and a computer-readable recording medium corresponding to the input processed audio signal. Voice recognition processing step of recognizing a word to be performed; a meaning number representing a meaning and a meaning number / registered word list generating step of generating a meaning number / registered word list as a list of registered words belonging to the meaning number; A recognition word / meaning number conversion processing step of converting the word recognized by the processing step into a corresponding meaning number based on the generated meaning number / registered word list, and a meaning number notifying process of notifying the converted meaning number to the application And recording a semantic recognition processing program having steps.

【００２７】かかる構成により、コンピュータシステム
を利用して、ユーザインタフェースとして利用者にとり
もっとも便利な言語、方言が使用できる意味認識システ
ムを提供することができ、かつ、言語依存の音声認識部
の登録単語を言語非依存の意味番号に変換してアプリケ
ーションに通知することができ、アプリケーションを言
語非依存のものとして記述ができる。With this configuration, it is possible to provide a semantic recognition system in which a user can use the most convenient language and dialect as a user interface by using a computer system, and furthermore, a registered word of a language-dependent speech recognition unit. Can be converted to a language-independent meaning number and notified to the application, and the application can be described as language-independent.

【００２８】次に、前記意味認識処理プログラムは、前
記意味番号・登録単語リストの内容を更新する意味番号
・登録単語リスト更新処理ステップを備えることが好ま
しい。Next, it is preferable that the semantic recognition processing program includes a meaning number / registered word list updating processing step of updating the contents of the meaning number / registered word list.

【００２９】かかる構成により、意味番号・登録単語リ
ストを追加・変更する処理を行うことができ、意味認識
システムによる処理をチューンナップすることができ
る。次に、前記意味認識処理プログラムは、意味を体系
的に分類して各々の意味単位をコード化した意味番号と
各々の意味番号に属する登録単語とを体系的に記述した
単語・意味体系テーブルを記録した外部記述ファイルを
備えたものであることが好ましい。With this configuration, it is possible to perform processing for adding / changing the meaning number / registered word list, and to tune up the processing by the meaning recognition system. Next, the semantic recognition processing program generates a word / semantic system table that systematically describes the semantic numbers in which the semantic units are coded by systematically classifying the semantic units and the registered words belonging to the respective semantic numbers. It is preferable to have a recorded external description file.

【００３０】かかる構成により、前記意味認識処理にお
いて、単語とその単語に対応する意味単位との関係を体
系的に表わした単語・意味体系テーブルを、アプリケー
ション本体から分離可能な構成とすることができ、利用
者の使用環境に応じて外部記述ファイルを交換すること
でアプリケーションをローカライズするための基本とな
るデータを用意することができる。With this configuration, in the semantic recognition processing, a word / semantic system table that systematically represents the relationship between a word and a semantic unit corresponding to the word can be configured to be separable from the application main body. By exchanging external description files according to the usage environment of the user, basic data for localizing an application can be prepared.

【００３１】次に、前記意味認識処理プログラムは、前
記アプリケーションが処理データの少なくとも一部に意
味番号を含んだアプリケーションであって、前記アプリ
ケーションに含まれた意味番号を取得する意味番号取得
処理ステップとを備え、前記意味番号・登録単語リスト
生成処理ステップが、前記外部記述ファイルの単語・意
味体系テーブル中から前記取得した意味番号に対応する
登録単語を抽出して前記意味番号・登録単語リストを生
成する処理ステップと、前記生成した意味番号・登録単
語リストに挙げられた登録単語を前記音声認識処理ステ
ップに通知する登録単語通知処理ステップを備え、前記
音声認識処理ステップが、前記通知された登録単語に基
づいて音声認識エンジンの内容を更新する音声認識単語
更新処理ステップを備えることが好ましい。Next, the semantic recognition processing program includes a semantic number acquiring step of acquiring the semantic number included in the application, wherein the application is an application including a semantic number in at least a part of processing data; Wherein the meaning number / registered word list generation processing step extracts the registered word corresponding to the acquired meaning number from the word / semantic system table of the external description file to generate the meaning number / registered word list And a registered word notification processing step of notifying the registered word listed in the generated meaning number / registered word list to the voice recognition processing step, wherein the voice recognition processing step includes the notified registered word. Speech recognition word update processing step of updating the content of the speech recognition engine based on the It is preferably provided.

【００３２】かかる構成により、アプリケーションを言
語非依存の意味番号で記述し、意味認識処理ステップの
ローカライズにあたっては、アプリケーションで使用さ
れている意味番号に対応するローカル言語の単語を自動
的に抽出することができ、ローカライズ作業工数の大幅
な低減が可能となる。With this configuration, an application is described with a language-independent semantic number, and a local language word corresponding to the semantic number used in the application is automatically extracted when localizing the semantic recognition processing step. And the man-hour for localization work can be significantly reduced.

【００３３】次に、前記意味認識処理プログラムは、利
用者が使用する言語を指定する利用言語指定処理ステッ
プと、前記外部記述ファイルとして複数の言語ごとの外
部記述ファイルと、前記音声認識処理ステップで利用す
る音声認識エンジンとして複数の言語ごとの音声認識エ
ンジンとを備え、前記意味番号・登録単語リスト生成処
理ステップにおいて、前記利用言語指定処理ステップに
より指定された言語の外部記述ファイルを基に意味番号
・登録単語リストを生成し、前記音声認識処理ステップ
において、前記利用言語指定処理ステップにより指定さ
れた言語の音声認識エンジンを使用し、利用者に対して
前記アプリケーションを複数の言語による入力指示がで
きるマルチリンガル対応のユーザインタフェースを提供
するものであることが好ましい。Next, the semantic recognition processing program includes a use language specification processing step for specifying a language used by a user, an external description file for each of a plurality of languages as the external description file, and a speech recognition processing step. A speech recognition engine for each of a plurality of languages is used as a speech recognition engine to be used. In the meaning number / registered word list generation processing step, the meaning number is determined based on an external description file of the language specified in the use language designation processing step. A registered word list is generated, and in the voice recognition processing step, an instruction to input the application in a plurality of languages can be given to the user by using a voice recognition engine of a language specified by the language specification processing step. Provide a multilingual user interface. It is preferred.

【００３４】かかる構成により、意味認識システム利用
者の使用言語、使用表現に合わせたカスタマイズ作業が
容易に実行できる。また、複数の外部記述ファイルを指
定することにより複数言語の意味番号・登録単語リスト
の自動生成、音声認識エンジンの内容の更新をすること
ができ、アプリケーションを、複数の言語による音声入
力指示ができるマルチリンガル対応とすることができ
る。With this configuration, a customizing operation can be easily performed according to the language and expression used by the user of the semantic recognition system. Also, by specifying a plurality of external description files, it is possible to automatically generate a meaning number / registered word list in a plurality of languages and update the content of a speech recognition engine, and to instruct an application to input speech in a plurality of languages. It can be multilingual.

【００３５】次に、前記意味認識処理プログラムが、前
記外部記述ファイルを提供する外部記述ファイル提供処
理ステップを備えたネットワーク上に配された外部記述
ファイルサーバ用の処理プログラムと、前記アプリケー
ションを提供するアプリケーション提供処理ステップを
備えたネットワーク上に配されたアプリケーションサー
バ用の処理プログラムと、前記外部記述ファイルサーバ
用の処理プログラムと前記アプリケーションサーバ用の
処理プログラムに対してネットワーク上で通信する意味
認識システムクライアント用処理プログラムを備えたも
のであることが好ましい。Next, the semantic recognition processing program provides a processing program for an externally described file server, which is provided on a network, including an externally described file providing processing step of providing the externally described file, and the application. A processing program for an application server provided on a network having an application providing processing step, a processing program for the externally described file server, and a semantic recognition system client for communicating over the network with the processing program for the application server It is preferable that the program is provided with a processing program.

【００３６】かかる構成により、外部記述ファイルとア
プリケーションをネットワーク上に配置されたサーバか
ら提供するクライアントサーバ構成とした意味認識シス
テムを構築することができる。With this configuration, it is possible to construct a semantic recognition system having a client-server configuration in which an externally described file and an application are provided from a server arranged on a network.

【００３７】[0037]

【発明の実施の形態】以下、本発明の実施形態にかかる
意味認識システムについて、図面を参照しながら説明す
る。DESCRIPTION OF THE PREFERRED EMBODIMENTS Hereinafter, a semantic recognition system according to an embodiment of the present invention will be described with reference to the drawings.

【００３８】（実施形態１）実施形態１にかかる意味認
識システムは、利用者に対して音声入力が可能なユーザ
インタフェースを提供し、アプリケーションに対して利
用者の入力内容を個々の単語が持つ本来の意味に着目し
て意味認識を行うことにより言語非依存として伝えるア
プリケーション用インタフェースを提供するシステムで
ある。本実施形態１では、アプリケーションは画面上に
現われるオブジェクトを音声指示により移動制御する処
理を含んだゲームとし、利用者は日本語を利用する者と
し、ユーザーインタフェースとして日本語環境が構築さ
れる例を挙げる。(Embodiment 1) The semantic recognition system according to Embodiment 1 provides a user interface capable of voice input to a user, and the original content of each word corresponding to the input content of the user to an application is provided. This is a system that provides an application interface that communicates as language-independent by performing semantic recognition by focusing on the meaning of. In the first embodiment, the application is a game including a process of controlling the movement of an object appearing on the screen by voice instructions, the user is a person who uses Japanese, and a Japanese environment is constructed as a user interface. I will.

【００３９】本実施形態１にかかる意味認識システムの
全体構成の概略と本システムによる処理流れの全体像を
図面を参照しつつ説明する。図１は、本実施形態１にか
かる意味認識システムの概略構成図を示している。図２
は、本システムによる処理流れの全体像を処理ステップ
として表わしたものである。An outline of the overall configuration of the semantic recognition system according to the first embodiment and an overall image of the processing flow of the system will be described with reference to the drawings. FIG. 1 is a schematic configuration diagram of the meaning recognition system according to the first embodiment. FIG.
Represents the overall image of the processing flow of the present system as processing steps.

【００４０】図１に示すように、本実施形態１にかかる
意味認識システムは、大別して、音声入力デバイス１
０、音声信号ファイル２０、音声認識部３０、認識単語
・意味番号変換部４０、アプリケーション５０を備えて
いる。なお、図示していないがシステム全体の制御処理
に必要なコントローラ、メモリなどは装備している。As shown in FIG. 1, the semantic recognition system according to the first embodiment is roughly divided into a voice input device 1
0, a voice signal file 20, a voice recognition unit 30, a recognized word / meaning number conversion unit 40, and an application 50. Although not shown, a controller, a memory, and the like necessary for control processing of the entire system are provided.

【００４１】音声入力デバイス１０は、マイクロフォ
ン、電話など入力音声を電気信号に変換するデバイスで
あればよい。音声信号ファイル２０は、あらかじめ入力
音声が編集・記録されたファイルである。音声認識部３
０は、音声認識エンジン３１を含み、利用者の使用する
言語に応じて音声認識エンジン３１は入れ替え可能であ
る。認識単語・意味番号変換部４０は、音声認識で認識
された単語とその単語の意味を表わす意味番号との対応
を表わす意味番号・登録単語リスト４１と、意味番号・
登録単語リスト４１の内容を追加・変更などを行う意味
番号・登録単語リスト変更部４２とを備えている。ここ
で、意味番号とは、言葉の意味を体系的に分類した同じ
意味を持つ言葉の集合体である意味単位をコード化した
ものである。意味番号は、言語に依存しない個々の単語
が持つ本来の意味内容に対して与えられた中間コードで
あり、言語が異なってもその言語が意図する意味内容は
変わらないことに着目して導入するものである。この意
味番号は、当該言語体系に属する単語を網羅するものと
しても良いが、システムリソースと実効性を考慮して、
ゲームなどアプリケーションとして使用が予定されてい
る分野において使用があり得る言葉を網羅するものであ
っても良い。The voice input device 10 may be any device that converts an input voice into an electric signal, such as a microphone or a telephone. The audio signal file 20 is a file in which input audio is edited and recorded in advance. Voice recognition unit 3
0 includes the speech recognition engine 31, and the speech recognition engine 31 can be switched according to the language used by the user. The recognized word / meaning number conversion unit 40 includes a meaning number / registered word list 41 indicating a correspondence between a word recognized by speech recognition and a meaning number indicating the meaning of the word,
A meaning number / registered word list changing unit 42 for adding / changing the contents of the registered word list 41 is provided. Here, the semantic number codes a semantic unit, which is a set of words having the same meaning, which systematically classifies the meaning of the words. The semantic number is an intermediate code given to the original semantic content of each word that does not depend on the language, and is introduced by paying attention to the fact that the meaning intended by the language does not change even if the language is different Things. This semantic number may cover words belonging to the language system, but in consideration of system resources and effectiveness,
It may cover words that can be used in a field that is expected to be used as an application such as a game.

【００４２】認識単語・意味番号変換部４０とアプリケ
ーション５０の間には、利用者の音声入力内容などを伝
えるアプリケーション用インタフェースが提供される。
アプリケーション５０は、処理内容の一部に意味番号を
含んだものである。An application interface is provided between the recognition word / meaning number conversion unit 40 and the application 50 for transmitting the voice input contents of the user.
The application 50 includes a semantic number as a part of the processing content.

【００４３】意味認識システムの処理の流れの全体像は
以下の通りである。まず、利用者にはユーザーインタフ
ェースとして提供されているマイクロフォンなどの音声
入力デバイス１０を介して、アプリケーション５０に入
力したい指示内容を自らの言葉で入力する。ここでは例
として日本語で「止まれ」と入力する。次に、入力され
た「止まれ」という音声は電気信号に変換され、音声認
識部３０に入力される。また、音声入力デバイス１０に
よる音声入力に代え、あらかじめ入力する言葉を編集・
記録した音声信号ファイル２０から対応する音声信号を
音声認識部３０に入力しても良い（図２ステップＳ１０
１）。The overall image of the processing flow of the semantic recognition system is as follows. First, the user inputs, in his / her own words, an instruction to be input to the application 50 via the voice input device 10 such as a microphone provided as a user interface. Here, "stop" is input in Japanese as an example. Next, the input “stop” voice is converted into an electric signal and input to the voice recognition unit 30. In addition, instead of the voice input by the voice input device 10, the words to be input in advance can be edited / edited.
A corresponding audio signal may be input from the recorded audio signal file 20 to the audio recognition unit 30 (step S10 in FIG. 2).
1).

【００４４】次に、音声認識部３０は与えられた音声信
号を認識する。音声認識部３０には音声認識エンジン３
１があり、ここでは例として日本語の音声認識エンジン
が搭載されているものとする。音声認識部３０は、通常
の音声認識処理である形態素処理、テンプレートマッチ
ング処理などの処理を実行し、入力された単語を認識す
る。ここでは「止まれ」という音声が「トマレ」という
単語データとして認識される。（図２ステップＳ１０
２）。Next, the speech recognition section 30 recognizes the given speech signal. The voice recognition engine 30 includes a voice recognition engine 3
In this example, it is assumed that a Japanese speech recognition engine is mounted. The speech recognition unit 30 executes processing such as morphological processing and template matching processing, which are normal speech recognition processing, and recognizes an input word. Here, the voice of "stop" is recognized as word data of "Tomare". (Step S10 in FIG. 2)
2).

【００４５】次に、ステップＳ１０２で認識された認識
単語データを認識単語・意味番号変換部４０において対
応する意味番号に変換する。認識単語・意味番号変換部
４０は、意味番号・登録単語リスト４１を参照して、認
識単語データに対応する意味番号をサーチする。ここで
は認識単語データ「トマレ」を意味番号“１”に変換す
る（図２ステップＳ１０３）。Next, the recognized word data recognized in step S102 is converted into a corresponding meaning number by the recognized word / meaning number conversion unit 40. The recognized word / meaning number conversion unit 40 refers to the meaning number / registered word list 41 and searches for a meaning number corresponding to the recognized word data. Here, the recognition word data “Tomare” is converted into the meaning number “1” (Step S103 in FIG. 2).

【００４６】ここで、意味番号・登録単語リストの例を
図３と図４に示す。図３に挙げたリストは、要素メンバ
として、“シンボル番号”、“意味番号”、“登録単
語”に加え、単語の付属情報として“カテゴリ”、“単
語の説明”を要素メンバに持つ構造体を配列として構成
したものである。シンボル番号は、各エントリごとにユ
ニークに割り当てる番号であり、登録単語とは、認識対
象となる文字列であり、カテゴリとは、登録単語の分類
をコード化したものであり、単語に対する説明とは、登
録単語に対する説明である。カテゴリと単語に対する説
明は特に付与されないときは、例えば“ＮＵＬＬ”が割
り当てられている。Here, examples of the meaning number / registered word list are shown in FIGS. The list shown in FIG. 3 is a structure having, as element members, “symbol number”, “semantic number”, and “registered word”, and “category” and “word explanation” as word accessory information as element members. As an array. The symbol number is a number uniquely assigned to each entry, the registered word is a character string to be recognized, the category is a coded classification of the registered word, and the description for the word is , A description for a registered word. When no explanation is given to a category and a word, for example, “NULL” is assigned.

【００４７】図４に挙げたリストは、要素メンバとし
て、図３に示した要素メンバに加え、リンクリストを管
理する要素メンバ“次のテーブルのアドレス”を加え、
個々のエントリの挿入削除を容易に行えるような構造の
ものである。In the list shown in FIG. 4, in addition to the element members shown in FIG. 3, as an element member, an element member for managing a linked list “address of the next table” is added.
The structure is such that individual entries can be easily inserted and deleted.

【００４８】次に、ステップＳ１０３で得られた意味番
号を、アプリケーション用インタフェースを介してアプ
リケーション５０に与える（図２ステップＳ１０４）。
アプリケーション５０は、その処理内容の一部に意味番
号を含んでおり、入力された意味番号に応じた処理が実
行される。アプリケーション５０は画面上に現われるオ
ブジェクトを音声指示により移動制御する処理を含んだ
ゲームであり、ここでは例として、意味番号“１”に対
してオブジェクトの移動を停止する処理が行われる（図
２ステップＳ１０５）。つまり、ここで挙げた例では、
利用者の「止まれ」というマイクロフォンからの音声指
示により、画面上に現われるオブジェクトが移動を停止
する処理が行われる。Next, the meaning number obtained in step S103 is given to the application 50 via the application interface (step S104 in FIG. 2).
The application 50 includes a meaning number in a part of the processing content, and the processing corresponding to the input meaning number is executed. The application 50 is a game including a process of controlling the movement of an object appearing on the screen by a voice instruction. In this example, the process of stopping the movement of the object is performed for the meaning number “1” (step in FIG. 2). S105). In other words, in the example given here,
In response to a voice instruction from the microphone saying “stop” by the user, a process of stopping the movement of the object appearing on the screen is performed.

【００４９】この意味認識処理（図２ステップＳ１０１
〜ステップＳ１０５）は、利用者のゲーム進行に合わせ
て適宜繰り返されて進められる。以上が、本実施形態１
にかかる意味認識システム構成の概略と本システムによ
る処理流れの全体像である。This meaning recognition process (step S101 in FIG. 2)
Steps S105 to S105) are repeated as appropriate in accordance with the progress of the game by the user. The above is the first embodiment.
1 shows an outline of a configuration of a meaning recognition system according to the first embodiment and an overall image of a processing flow by the present system.

【００５０】以上のように、入力デバイス１０からの音
声入力により、アプリケーション５０に対してその指示
内容を伝えることができる。本実施形態１にかかる意味
認識システムは、利用者に対して音声入力のユーザイン
タフェースを提供し、アプリケーションに対して言語に
依存しない意味内容を伝えるアプリケーション用インタ
フェースを提供することができる。As described above, the instruction content can be transmitted to the application 50 by the voice input from the input device 10. The semantic recognition system according to the first embodiment can provide a user interface for voice input to a user, and can provide an interface for an application that conveys language-independent semantic contents to an application.

【００５１】なお、本実施形態１では、一例として、日
本語による処理を挙げたが、他の言語への対応もシステ
ムのチューンナップにより可能なことは言うまでもな
い。つまり、音声認識部３０の音声認識エンジン３１を
使用したい言語の音声認識エンジンに入れ替え、認識単
語・意味番号変換部４０の意味番号・登録単語リスト４
１を使用したい言語の意味番号・登録単語リストに入れ
替えることで対応できる。ここで、意味番号・登録単語
リストの意味番号は、使用言語が異なっている場合で
も、同じ意味内容に対して同じ意味番号が付されている
ので言語非依存であり、アプリケーション５０には、同
じ意味内容の音声入力指示に対して同じ意味番号が伝え
られることとなり、アプリケーション５０は同じ意味番
号に対して同じ処理を実行する。このように、アプリケ
ーションに対して利用者の音声入力内容を個々の単語が
持つ本来の意味に着目して意味認識を行うことにより言
語非依存として伝えるアプリケーション用インタフェー
スを提供することができる。In the first embodiment, the processing in Japanese is given as an example. However, it goes without saying that the system can be adapted to other languages by tune-up of the system. That is, the speech recognition engine 31 of the speech recognition unit 30 is replaced with the speech recognition engine of the language to be used, and the meaning number / registered word list 4
This can be dealt with by replacing 1 with the meaning number / registered word list of the language you want to use. Here, the meaning numbers of the meaning number / registered word list are language-independent because the same meaning number is assigned to the same meaning content even if the language used is different, and the application 50 has the same meaning number. The same meaning number is transmitted to the voice input instruction of the meaning content, and the application 50 executes the same processing for the same meaning number. As described above, it is possible to provide an application interface that conveys the voice input content of the user to the application as language-independent by performing meaning recognition by focusing on the original meaning of each word.

【００５２】また、認識単語・意味番号変換部４０の意
味番号・登録単語リスト変更部４２により、意味番号・
登録単語リスト４１の内容を追加・変更することができ
る。例えば、図４に示したリストの説明で述べたよう
に、“次のテーブルのアドレス”の要素メンバを利用す
ることにより、図５に示すようにエントリの挿入、削除
などが容易にできる。図５は、意味番号・登録単語リス
ト４１中に、認識単語「ウエヘトベ」と意味番号“１
５”からなる認識単語・意味番号のセットを１０１番目
のエントリとして追加した例を示している。この意味番
号・登録単語リスト変更部４２により、意味認識システ
ムを利用環境に合わせてチューンナップすることが可能
である。The meaning number / registration word list changing unit 42 of the recognition word / meaning number conversion unit 40 causes the meaning number
The contents of the registered word list 41 can be added or changed. For example, as described in the description of the list shown in FIG. 4, by using the element members of "address of next table", entry insertion and deletion can be easily performed as shown in FIG. FIG. 5 shows that in the meaning number / registered word list 41, the recognition word “uehetob” and the meaning number “1”
In this example, a set of a recognition word and a meaning number consisting of 5 ″ is added as the 101st entry. By this meaning number / registration word list changing unit, the meaning recognition system can be tuned up according to the use environment. It is possible.

【００５３】（実施形態２）実施形態２にかかる意味認
識システムについて図面を参照しながら説明する。本実
施形態２は、実施形態１と同様、利用者に対する音声入
力によるユーザーインタフェースと、アプリケーション
に対する利用者の音声入力内容を言語非依存として伝え
るアプリケーション用インタフェースとを提供するシス
テムであるが、本実施形態２にかかる意味認識システム
の特徴は、アプリケーションをロードするとともにアプ
リケーション側から情報を得て、当該アプリケーション
を操作するためのユーザーインタフェースをカスタマイ
ズして構築する点である。(Second Embodiment) A semantic recognition system according to a second embodiment will be described with reference to the drawings. The second embodiment is a system that provides a user interface by voice input to the user and an application interface for transmitting the user's voice input to the application as language-independent, as in the first embodiment. A feature of the semantic recognition system according to mode 2 is that the application is loaded and information is obtained from the application, and a user interface for operating the application is customized and constructed.

【００５４】本実施形態２は、実施形態１と同様、アプ
リケーションは画面上に現われるオブジェクトを音声指
示により移動制御する処理を含んだゲームとするが、利
用者は英語を使用する者とし、ユーザーインタフェース
として英語環境が構築される例を挙げる。In the second embodiment, as in the first embodiment, the application is a game including a process of controlling the movement of an object appearing on the screen by voice instruction. As an example, an English environment is constructed.

【００５５】本実施形態２にかかる意味認識システムの
全体構成の概略と本システムによる処理流れの全体像を
図面を参照しつつ説明する。図６は、本実施形態２にか
かる意味認識システムの概略構成図を示している。図７
は、本システムによる処理流れの全体像を処理ステップ
として表わしたものである。An outline of the overall configuration of the semantic recognition system according to the second embodiment and an overall image of the processing flow of this system will be described with reference to the drawings. FIG. 6 is a schematic configuration diagram of the meaning recognition system according to the second embodiment. FIG.
Represents the overall image of the processing flow of the present system as processing steps.

【００５６】図６に示すように、本実施形態２にかかる
意味認識システムは、大別して、音声入力デバイス１
０、音声信号ファイル２０（図示せず）、音声認識部３
０、認識単語・意味番号変換部４０、アプリケーション
５０に加え、外部記述ファイル６０を備えている。な
お、図示していないが、システム全体の制御処理に必要
なコントローラ、メモリなどは装備している。また、実
施形態１で説明したものと同様の要素には同じ番号を割
り当てている。As shown in FIG. 6, the semantic recognition system according to the second embodiment is roughly divided into
0, voice signal file 20 (not shown), voice recognition unit 3
0, an external description file 60 in addition to the recognition word / meaning number conversion unit 40 and the application 50. Although not shown, a controller, a memory, and the like necessary for control processing of the entire system are provided. Elements similar to those described in the first embodiment are assigned the same numbers.

【００５７】アプリケーション５０は、処理データの少
なくとも一部に意味番号を含んだアプリケーションであ
る。外部記述ファイル６０は、アプリケーション本体か
らは分離された外部ファイルであって、ハードディスク
やＣＤ−ＲＯＭなどアクセス可能な二次記憶媒体に格納
されておれば良く、プログラムと一体化している必要は
ない。外部記述ファイル６０に記録されているファイル
の内容は、意味を体系的に分類して各々の意味単位をコ
ード化した意味番号と各々の意味番号に属する登録単語
とを体系的に記述したテーブルである。この単語・意味
体系テーブルは、当該言語体系に属する単語を網羅する
登録単語群と対応する意味番号群を備えているものとし
ても良いが、システムリソースと実効性を考慮して、ゲ
ームなどアプリケーションとして使用が予定されている
分野において使用があり得る単語を網羅するものであっ
ても良い。The application 50 is an application in which at least a part of the processing data includes a meaning number. The external description file 60 is an external file separated from the application itself, and may be stored in an accessible secondary storage medium such as a hard disk or a CD-ROM, and need not be integrated with the program. The content of the file recorded in the external description file 60 is a table that systematically describes the meaning numbers that systematically classify the meanings and encodes each meaning unit and the registered words belonging to each meaning number. is there. This word / semantic system table may include a registered word group covering the words belonging to the language system and a semantic number group corresponding to the registered word group. It may cover words that can be used in the field where the use is planned.

【００５８】図８に外部記述ファイルの一例を示す。
“State-name”とは、単語を登録する際のカテゴリを記
述するための名前を指定する属性であり、“Language-I
D”とは、単語の言語を表わす属性であり、基本言語Ｉ
Ｄ、サブ言語ＩＤなどを備えている。“Dialect”と
は、言語の方言に関する属性であり、デフォルトは標準
語(Standard)とする。“VCMD”とは、音声命令(Voice c
ommand)を記述するキーワードであり、意味番号・登録
単語リスト４１の各エントリとなる意味番号・登録単語
のセットである。“＃”はその行がコメントであること
を示している。FIG. 8 shows an example of the external description file.
“State-name” is an attribute that specifies a name for describing a category when a word is registered, and “Language-I”
D "is an attribute representing the language of the word.
D, sub-language ID, and the like. “Dialect” is an attribute related to the dialect of the language, and the default is a standard language (Standard). “VCMD” is a voice command (Voice c
ommand), which is a set of a meaning number and a registered word which are each entry of the meaning number / registered word list 41. “#” Indicates that the line is a comment.

【００５９】認識単語・意味番号変換部４０は、意味番
号取得部４３と意味番号・登録単語リスト生成部４４と
登録単語通知部４５を備えている。意味番号取得部４３
は、アプリケーション用インタフェースを介してアプリ
ケーション５０に記述された意味番号を取得するもので
ある。意味番号・登録単語リスト生成部４４は、外部記
述ファイル６０から取得した意味番号に対応する全ての
登録単語を抽出して意味番号・登録単語リスト４１を生
成するものである。登録単語通知部４５は、認識単語・
意味番号変換部４０が生成した意味番号・登録単語リス
ト４１に挙げられた登録単語を音声認識部３０に通知す
るものである。The recognized word / meaning number conversion unit 40 includes a meaning number acquiring unit 43, a meaning number / registered word list generating unit 44, and a registered word notifying unit 45. Meaning number acquisition unit 43
Is to acquire the meaning number described in the application 50 via the application interface. The meaning number / registered word list generation unit 44 extracts all registered words corresponding to the meaning number obtained from the external description file 60 and generates the meaning number / registered word list 41. The registered word notification unit 45 outputs the recognized word
The registered words listed in the meaning number / registered word list 41 generated by the meaning number conversion unit 40 are notified to the speech recognition unit 30.

【００６０】音声認識部３０は、利用者の使用言語に対
応した音声認識エンジン３１と音声認識単語更新部３２
とを備えている。音声認識単語更新部３２は、登録単語
通知部４５から通知された登録単語に基づいて音声認識
エンジン３１の内容を更新するものである。The speech recognition unit 30 includes a speech recognition engine 31 and a speech recognition word updating unit 32 corresponding to the language used by the user.
And The speech recognition word update unit 32 updates the content of the speech recognition engine 31 based on the registered word notified from the registered word notification unit 45.

【００６１】本実施形態２にかかる意味認識システムの
処理の流れの全体像は以下の通りである。まず、意味認
識システムにアプリケーション５０をロードする。ロー
ドはＣＤ−ＲＯＭドライブなどのＩＯ装置、ネットワー
クからのダウンロード（図示せず）などにより行う（図
７ステップＳ２０１）。ここでは一例として、ロードさ
れたアプリケーション５０は、画面上に現われたオブジ
ェクトの移動制御を伴うゲームであるとし、処理プログ
ラムの一部にオブジェクトの移動命令を意味番号に基づ
いた記述が含まれているものとする。The overall processing flow of the semantic recognition system according to the second embodiment is as follows. First, the application 50 is loaded into the semantic recognition system. Loading is performed by an IO device such as a CD-ROM drive, download from a network (not shown), or the like (step S201 in FIG. 7). Here, as an example, it is assumed that the loaded application 50 is a game with movement control of an object appearing on the screen, and a description based on a meaning number of an object movement instruction is included in a part of the processing program. Shall be.

【００６２】次に、認識単語・意味番号変換部４０の意
味番号取得部４３は、アプリケーション用インタフェー
スを介してアプリケーション５０に記述された意味番号
を取得する（図７ステップＳ２０２）。ここでは、例え
ば意味番号“１”が取得されたとする。Next, the meaning number acquiring unit 43 of the recognized word / meaning number converting unit 40 acquires the meaning number described in the application 50 via the application interface (step S202 in FIG. 7). Here, it is assumed that the meaning number “1” has been acquired, for example.

【００６３】ステップＳ２０２で取得された意味番号は
意味番号・登録単語リスト生成部４４に渡される。意味
番号・登録単語リスト生成部４４は、例えば、図９に示
すように、取得された意味番号を意味番号・登録単語リ
スト４１に展開して保持する（図７ステップＳ２０
３）。The meaning number obtained in step S202 is passed to the meaning number / registered word list generation unit 44. The meaning number / registered word list generation unit 44, for example, as shown in FIG. 9, expands the acquired meaning number into the meaning number / registered word list 41 and holds it (step S20 in FIG. 7).
3).

【００６４】次に、意味番号・登録単語リスト生成部４
４は、外部記述ファイル６０の単語・意味体系テーブル
を参照して、意味番号・登録単語リスト４１中の意味番
号に対応する登録単語を抽出して意味番号・登録単語リ
スト４１を生成する（図７ステップＳ２０４）。生成さ
れた意味番号・登録単語リスト４１の一例を図１０に示
す。このように、アプリケーションで使用する命令群が
意味番号である中間コードから利用者の使用する言語に
よる命令群として生成される。ここでは英語による命令
群が生成される。図１０からも分かるように、個々の意
味番号から、その意味番号に対応する外部記述ファイル
６０にある登録単語はすべて抽出されてリストに展開さ
れる。例えば「止まれ」の意味を表わす意味番号“１”
に対して、“STOP”,“HALT”など「止まれ」という直
接の意を持つ英語単語に加え、“STAY”，“DON´T MOV
E”などオブジェクトに対して同様の動作をもたらす類
義語も対応させても良い。Next, the meaning number / registered word list generation unit 4
4 extracts the registered word corresponding to the meaning number in the meaning number / registered word list 41 with reference to the word / semantic system table of the external description file 60 to generate the meaning number / registered word list 41 (FIG. 7 steps S204). An example of the generated meaning number / registered word list 41 is shown in FIG. As described above, the instruction group used in the application is generated as the instruction group in the language used by the user from the intermediate code having the meaning number. Here, an instruction group in English is generated. As can be seen from FIG. 10, all registered words in the external description file 60 corresponding to the respective meaning numbers are extracted from the respective meaning numbers, and are expanded into a list. For example, the meaning number “1” representing the meaning of “stop”
In addition to "STOP", "HALT" and other English words with a direct meaning of "stop", "STAY", "DON'T MOV"
A synonym such as "E", which gives a similar action to the object, may also be used.

【００６５】次に、認識単語・意味番号変換部４０の登
録単語通知部４５は、ステップＳ２０４で生成した意味
番号・登録単語リスト４１に挙げられた登録単語を音声
認識部３０に通知する（図７ステップＳ２０５）。ここ
では図１０に示された“STOP”,“HALT”などの登録単
語が通知される。Next, the registered word notifying unit 45 of the recognized word / meaning number conversion unit 40 notifies the speech recognition unit 30 of the registered words listed in the meaning number / registered word list 41 generated in step S204 (FIG. 7 Step S205). Here, registered words such as “STOP” and “HALT” shown in FIG. 10 are notified.

【００６６】次に、音声認識部３０は、ステップＳ２０
５で通知された登録単語に基づき、音声認識単語更新部
３２により音声認識エンジン３１の内容を更新する。つ
まり、音声認識エンジンの利用する音声信号の認識パタ
ーンの内容を更新し、ステップＳ２０５で通知された登
録単語を利用者の使用する言語により音声認識できるユ
ーザーインタフェースを構築する（図７ステップＳ２０
６）。Next, the voice recognition unit 30 proceeds to step S20.
Based on the registered word notified in step 5, the content of the speech recognition engine 31 is updated by the speech recognition word updating unit 32. That is, the content of the recognition pattern of the voice signal used by the voice recognition engine is updated, and a user interface capable of voice recognition of the registered word notified in step S205 in the language used by the user is constructed (step S20 in FIG. 7).
6).

【００６７】以上のステップＳ２０１〜ステップＳ２０
６の処理を実行することにより、ロードしたアプリケー
ションで使用する利用者入力命令群を、言語非依存の中
間コードである意味番号から利用者の使用する言語によ
る命令群に自動的に変換して音声入力によるユーザーイ
ンタフェースを構築することができる。The above steps S201 to S20
By executing the processing in step 6, the user input instructions used in the loaded application are automatically converted from the semantic numbers, which are language-independent intermediate codes, to the instructions in the language used by the user, and the A user interface based on input can be constructed.

【００６８】以上のステップＳ２０１〜ステップＳ２０
６によるユーザーインタフェースのカスタマイズ処理の
後、実施形態１で示した図２の処理ステップを実行する
ことにより、利用者は自らの言語を音声で入力してゲー
ムなどのアプリケーションを楽しむことができる。ま
た、アプリケーションに応じてそのアプリケーションで
使用する意味番号・登録単語リスト４１を生成するの
で、主メモリの使用量を最適化して小さく抑えることが
できる。また、アプリケーションは利用者入力命令を意
味番号を中心とした言語非依存の中間コードで記述でき
るため、アプリケーション開発者の他言語へのローカラ
イズ作業工数を不要とすることができる。The above steps S201 to S20
After the user interface customization processing by the user 6, the user can input his / her own language by voice and enjoy applications such as games by executing the processing steps of FIG. 2 described in the first embodiment. Further, since the meaning number / registered word list 41 used in the application is generated according to the application, the amount of use of the main memory can be optimized and reduced. In addition, since the application can describe the user input command in a language-independent intermediate code centered on the semantic number, the man-hour for localizing the application developer to another language can be eliminated.

【００６９】なお、上記説明では、例として英語による
音声入力ユーザーインタフェース構築例を挙げたが、他
の言語であっても、意味認識システムが使用予定されて
いる環境に応じて、その基本構成として各言語ごとの基
本となる外部記述ファイルと音声認識エンジンを備える
ことにより、同様にユーザーインタフェースを構築でき
ることは言うまでもない。また、上記説明において、認
識単語・意味番号変換部が、アプリケーションに記述さ
れた意味番号を取得する意味番号取得部と、意味番号・
登録単語リスト生成部とを備えた構成としたが、意味番
号取得部と意味番号・登録単語リスト生成部に代え、ア
プリケーションが、アプリケーション自身に記述された
意味番号に基づいて前記外部記述ファイルの単語・意味
体系テーブル中から対応する登録単語を抽出する処理ス
テップと、前記意味番号と登録単語を前記認識単語・意
味番号変換部に通知する処理ステップとを備え、認識単
語・意味番号変換部が通知された意味番号と登録単語に
基づいて意味番号・登録単語リストを生成する意味番号
・登録単語リスト生成部を備える構成であっても良い。In the above description, an example of constructing a voice input user interface in English has been given as an example. However, even in other languages, the basic configuration of the semantic recognition system may be changed according to the environment in which the semantic recognition system is to be used. It goes without saying that a user interface can be similarly constructed by providing a basic external description file and a speech recognition engine for each language. Further, in the above description, the recognition word / meaning number conversion unit obtains a meaning number described in the application,
The registered word list generating unit is provided, but instead of the semantic number acquiring unit and the semantic number / registered word list generating unit, the application uses the words of the externally described file based on the semantic numbers described in the application itself. A processing step of extracting a corresponding registered word from the semantic system table; and a processing step of notifying the meaning number and the registered word to the recognition word / meaning number conversion unit. A configuration may be provided that includes a meaning number / registered word list generation unit that generates a meaning number / registered word list based on the given meaning number and registered word.

【００７０】（実施形態３）実施形態３にかかる意味認
識システムについて図面を参照しながら説明する。本実
施形態３は、実施形態１および実施形態２と同様、利用
者に対する音声入力によるユーザーインタフェースと、
アプリケーションに対する利用者の音声入力内容を言語
非依存として伝えるアプリケーション用インタフェース
とを提供するシステムであるが、本実施形態３にかかる
意味認識システムの特徴は、ユーザーインタフェースの
対象言語を利用者の指定により自由に選択、更新がで
き、さらに、複数の言語を同時に扱うこともできる点で
ある。意味認識システムの想定される使用環境として、
利用者が複数の言語を使用する場合や異なる言語を話す
複数人が同時に使用する可能性がある場合に適してい
る。本実施形態３では特に、複数の言語を同時に扱うこ
とのできるユーザーインタフェースの構築について述べ
る。(Embodiment 3) A semantic recognition system according to Embodiment 3 will be described with reference to the drawings. In the third embodiment, as in the first and second embodiments, a user interface by voice input to the user is provided.
This is a system that provides an application interface that conveys the user's voice input to the application as language-independent. The feature of the semantic recognition system according to the third embodiment is that the target language of the user interface is specified by the user. It can be freely selected and updated, and can handle multiple languages simultaneously. As an assumed usage environment of the semantic recognition system,
It is suitable when the user uses a plurality of languages or when there is a possibility that a plurality of people who speak different languages may use the languages at the same time. In the third embodiment, particularly, the construction of a user interface capable of simultaneously handling a plurality of languages will be described.

【００７１】本実施形態３は、実施形態１と同様、アプ
リケーションは画面上に現われるオブジェクトを音声指
示により移動制御する処理を含んだゲームとするが、利
用者は日本語を使用する者と英語を使用する者を想定
し、ユーザーインタフェースとして日本語入力環境と英
語入力環境が構築される例を挙げる。In the third embodiment, similar to the first embodiment, the application is a game including a process of controlling the movement of an object appearing on the screen by voice instruction. Assuming users, an example is given in which a Japanese input environment and an English input environment are constructed as user interfaces.

【００７２】本実施形態３にかかる意味認識システムの
全体構成の概略と本システムによる処理流れの全体像を
図面を参照しつつ説明する。図１１は、本実施形態３に
かかる意味認識システムの概略構成図を示している。図
１２は、本システムによる処理流れの全体像を処理ステ
ップとして表わしたものである。An outline of the entire configuration of the meaning recognition system according to the third embodiment and an overall image of the processing flow by the present system will be described with reference to the drawings. FIG. 11 is a schematic configuration diagram of a semantic recognition system according to the third embodiment. FIG. 12 shows an overall image of the processing flow of the present system as processing steps.

【００７３】図１１は図６とほぼ同様の構成であるが、
本実施形態３にかかる意味認識システムは、使用言語指
定部７０を備えており、利用者は使用したい言語を指定
する。外部記述ファイル６０は言語ごとにファイルが用
意されており、日本語外部記述ファイル６０ａ、英語外
部記述ファイル６０ｂ、ドイツ語外部記述ファイル６０
ｃなどが備えられている。また、音声認識部３０の音声
認識エンジン３１も言語ごとに用意されており、日本語
音声認識エンジン３１ａ、英語音声認識エンジン３１
ｂ、ドイツ語音声認識エンジン３１ｃなどが備えられて
いる。なお、図１、図６と同様の要素に対しては同じ番
号を割り当てており、ここではその説明は省略する。ま
た、図１１中の各要素間の矢印は、便宜上、後述する処
理ステップＳ３０１における言語指定のデータのなどの
やり取りを中心に記し、その他は省略した。FIG. 11 has a configuration substantially similar to that of FIG.
The semantic recognition system according to the third embodiment includes a used language specifying unit 70, and a user specifies a desired language. The external description file 60 is prepared for each language, and includes a Japanese external description file 60a, an English external description file 60b, and a German external description file 60.
c etc. are provided. Also, a speech recognition engine 31 of the speech recognition unit 30 is prepared for each language, and a Japanese speech recognition engine 31a and an English speech recognition engine 31 are provided.
b, a German speech recognition engine 31c and the like. The same elements as those in FIGS. 1 and 6 are assigned the same numbers, and description thereof will be omitted. The arrows between the elements in FIG. 11 focus on the exchange of language-designated data and the like in processing step S301 to be described later for convenience, and others are omitted.

【００７４】本実施形態３にかかる意味認識システムの
処理の流れの全体像は以下の通りである。実施形態２で
図７などをもって説明した部分と重複する説明は簡単に
記す。まず、利用者は使用したい言語を使用言語指定部
７０により指定する（図１２ステップＳ３０１）。ここ
では、日本語と英語の２つの言語を指定する。この使用
言語指定部７０により指定入力された言語情報は、言語
ＩＤ情報として登録単語・意味番号変換部４０、音声認
識部３０に通知される。ここでは、日本語外部記述ファ
イル６０ａと英語外部記述ファイル６０ｂが用意され、
日本語音声認識エンジン３１ａと英語音声認識エンジン
３１ｂが用意される。The overall processing flow of the semantic recognition system according to the third embodiment is as follows. The description that overlaps with the portion described with reference to FIG. 7 and the like in the second embodiment will be briefly described. First, the user specifies a language to be used by the used language specifying unit 70 (step S301 in FIG. 12). Here, two languages, Japanese and English, are designated. The language information specified and input by the used language specifying unit 70 is notified to the registered word / meaning number conversion unit 40 and the speech recognition unit 30 as language ID information. Here, a Japanese external description file 60a and an English external description file 60b are prepared,
A Japanese speech recognition engine 31a and an English speech recognition engine 31b are prepared.

【００７５】次に、意味認識システムにアプリケーショ
ン５０をロードする（図１２ステップＳ３０２）。次
に、認識単語・意味番号変換部４０の意味番号取得部４
３は、アプリケーション用インタフェースを介してアプ
リケーション５０に記述された意味番号を取得する（図
１２ステップＳ３０３）。ここでは例えば、意味番号
“１”などが取得される。Next, the application 50 is loaded into the semantic recognition system (step S302 in FIG. 12). Next, the meaning number acquisition unit 4 of the recognition word / meaning number conversion unit 40
3 acquires the meaning number described in the application 50 via the application interface (step S303 in FIG. 12). Here, for example, the meaning number “1” is acquired.

【００７６】ステップＳ３０３で取得された意味番号は
意味番号・登録単語リスト生成部４４に渡される。意味
番号・登録単語リスト生成部４４は、実施形態２と同
様、図９に示すように、取得された意味番号を意味番号
・登録単語リスト４１に展開して保持する（図１２ステ
ップＳ３０４）。The meaning number obtained in step S303 is passed to the meaning number / registered word list generation unit 44. Similar to the second embodiment, the meaning number / registered word list generation unit 44 expands the acquired meaning number into the meaning number / registered word list 41 and holds it, as shown in FIG. 9 (step S304 in FIG. 12).

【００７７】次に、意味番号・登録単語リスト生成部４
４は、ステップＳ３０１において通知された言語ＩＤに
基づいて指定された外部記述ファイルを参照する。ここ
では、日本語外部記述ファイル６０ａ、英語外部記述フ
ァイル６０ｂが参照される。各外部記述ファイル６０の
単語・意味体系テーブルを参照して、意味番号・登録単
語リスト４１中の意味番号に対応する登録単語を抽出し
て意味番号・登録単語リスト４１を生成する（図１２ス
テップＳ３０５）。生成された意味番号・登録単語リス
ト４１の一例を図１３に示す。このように、アプリケー
ションで使用する命令群が意味番号である中間コードか
ら利用者が指定した複数の言語による命令群として生成
される。ここでは日本語による命令群と英語による命令
群が生成される。Next, the meaning number / registered word list generation unit 4
No. 4 refers to the external description file specified based on the language ID notified in step S301. Here, the Japanese external description file 60a and the English external description file 60b are referred to. Referring to the word / semantic system table of each external description file 60, the registered words corresponding to the meaning numbers in the meaning number / registered word list 41 are extracted to generate the meaning number / registered word list 41 (step in FIG. 12). S305). An example of the generated meaning number / registered word list 41 is shown in FIG. As described above, the instruction group used in the application is generated as an instruction group in a plurality of languages specified by the user from the intermediate code having the meaning number. Here, an instruction group in Japanese and an instruction group in English are generated.

【００７８】次に、認識単語・意味番号変換部４０の登
録単語通知部４５は、ステップＳ３０５で生成した意味
番号・登録単語リスト４１に挙げられた登録単語を言語
ＩＤとともに音声認識部３０に通知する（図１２ステッ
プＳ３０６）。Next, the registered word notifying section 45 of the recognized word / meaning number conversion section 40 notifies the speech recognition section 30 of the registered words listed in the meaning number / registered word list 41 generated in step S305 together with the language ID. (Step S306 in FIG. 12).

【００７９】次に、音声認識部３０は、ステップＳ３０
６で通知された登録単語と言語ＩＤに基づき、音声認識
単語更新部３２により音声認識エンジン３１の内容を更
新する。つまり、ここでは、日本語音声認識エンジン３
１ａと、英語音声認識エンジン３１ｂがチューンナップ
される（図１２ステップＳ３０７）。Next, the voice recognition unit 30 proceeds to step S30.
The content of the speech recognition engine 31 is updated by the speech recognition word updating unit 32 based on the registered word and the language ID notified in step 6. In other words, here, the Japanese speech recognition engine 3
1a and the English speech recognition engine 31b are tuned up (step S307 in FIG. 12).

【００８０】以上のステップＳ３０１〜ステップＳ３０
７の処理を実行することにより、ロードしたアプリケー
ションで使用する利用者入力命令群を、利用者の使用す
る複数の言語による命令群に自動的に変換して複数言語
の音声入力によるマルチリンガルなユーザーインタフェ
ースを構築することができる。Steps S301 to S30
7. By executing the processing in step 7, the user input instructions used in the loaded application are automatically converted into the instructions in a plurality of languages used by the user, and the multilingual user is input by voice input in a plurality of languages. An interface can be built.

【００８１】以上のステップによりユーザインタフェー
スをカスタマイズした後、アプリケーション５０に対す
る音声入力による操作は、実施形態１で説明した処理と
同様の処理を実行し、ゲームなどのアプリケーションを
マルチリンガル環境で楽しむことができる。After the user interface is customized by the above steps, the operation by voice input to the application 50 executes the same processing as the processing described in the first embodiment, and enjoys the application such as a game in a multilingual environment. it can.

【００８２】なお、上記説明では、例として日本語と英
語による音声入力ユーザーインタフェース構築例を挙げ
たが、他の言語であっても、同様に適用できることは言
うまでもない。また、言語には方言も含まれるものと
し、日本語、英語などの言語に加え、関西弁、東北弁な
ど各国語の各方言に対しても同様に適応可能である。In the above description, an example of constructing a voice input user interface in Japanese and English is given as an example, but it goes without saying that the present invention can be similarly applied to other languages. The language includes dialects, and is applicable to dialects of various languages such as Kansai dialect and Tohoku dialect in addition to languages such as Japanese and English.

【００８３】（実施形態４）実施形態３にかかる意味認
識システムについて図面を参照しながら説明する。本実
施形態３は、実施形態１〜３に示した意味認識システム
をクライアントサーバ構成で構築した例である。(Embodiment 4) A semantic recognition system according to Embodiment 3 will be described with reference to the drawings. The third embodiment is an example in which the semantic recognition system shown in the first to third embodiments is constructed in a client-server configuration.

【００８４】図１４は、システムの全体概略構成を示し
ている。図１４に示すように、ネットワーク上にはアプ
リケーションサーバ８０、外部記述ファイルサーバ９０
が配置されている。アプリケーションサーバ８０にはア
プリケーション５０が各種登録されており、クライアン
ト１００の要求によりアプリケーション５０を提供す
る。外部記述ファイルサーバ９０は各国言語、各方言ご
との外部記述ファイル６０が登録されており、図６、図
１１に示した意味認識システムの外部記述ファイル６０
がネットワーク上でリモートアクセスできる構成となっ
ている。つまり、意味認識システムクライアント１００
からの意味番号による参照要求に対して、意味番号に対
応する登録認識単語を通知する構成である。FIG. 14 shows the overall schematic configuration of the system. As shown in FIG. 14, an application server 80 and an externally described file server 90 are provided on a network.
Is arranged. Various applications 50 are registered in the application server 80, and the application 50 is provided by a request from the client 100. In the external description file server 90, an external description file 60 for each language and each dialect is registered, and the external description file 60 of the semantic recognition system shown in FIGS.
Can be remotely accessed on the network. That is, the semantic recognition system client 100
In response to a reference request with a meaning number from, a registered recognition word corresponding to the meaning number is notified.

【００８５】意味認識システムクライアント１００に
は、図示していないが、構成に応じて、音声入力デバイ
ス１０、音声信号ファイル２０、音声認識部３０、音声
認識エンジン３１、音声認識単語更新部３２、認識単語
・意味番号変換部４０、意味番号・登録単語リスト４
１、意味番号・登録単語リスト変更部４２、意味番号取
得部４３、意味番号・登録単語リスト生成部４４、登録
単語通知部４５、使用言語指定部７０を備えている。Although not shown, the semantic recognition system client 100 has a voice input device 10, a voice signal file 20, a voice recognition unit 30, a voice recognition engine 31, a voice recognition word updating unit 32, a recognition word Word / meaning number conversion unit 40, meaning number / registered word list 4
1, a meaning number / registered word list changing unit 42, a meaning number acquiring unit 43, a meaning number / registered word list generating unit 44, a registered word notifying unit 45, and a language designation unit 70.

【００８６】また、図示していないが、サーバ、クライ
アントともに、ネットワーク接続のための通信インタフ
ェースを備えている。ネットワーク１１０は、データを
通信できるものであれば良く、ローカルエリアネットワ
ーク、インターネットなどのネットワーク網であり、専
用線、公衆回線、有線、無線を問わない。Although not shown, both the server and the client have a communication interface for network connection. The network 110 may be any network that can communicate data, and is a network such as a local area network or the Internet, and may be a dedicated line, a public line, a wired line, or a wireless line.

【００８７】以上のように構成することにより、外部記
述ファイルをネットワーク上に配置された外部記述ファ
イルサーバとしたクライアントサーバ構成で構築するこ
とができ、各言語ごとの外部記述ファイルを意味認識シ
ステム個別に用意する必要がなくなる。With the above configuration, it is possible to construct an externally described file in a client server configuration as an externally described file server arranged on a network, and to convert the externally described file for each language into a semantic recognition system. There is no need to prepare for.

【００８８】（実施形態５）本発明にかかる意味認識シ
ステムは、意味認識システムを実現する処理ステップを
備えたプログラムとして記述し、コンピュータ読み取り
可能な記録媒体に記録して提供することにより、各種コ
ンピュータを用いて構築することができる。本発明にか
かる意味認識システムを実現する処理ステップを備えた
プログラムを記録した記録媒体は、図１５に図示した記
録媒体の例に示すように、ＣＤ−ＲＯＭ１２３やフレキ
シブルディスク１２４等の可搬型記録媒体１２２だけで
なく、ネットワーク上にある記録装置内の記録媒体１２
１や、コンピュータのハードディスクやＲＡＭ等の記録
媒体１２６のいずれであっても良く、プログラム実行時
には、プログラムはコンピュータ１２５上にローディン
グされ、主メモリ上で実行される。(Embodiment 5) The semantic recognition system according to the present invention is described as a program having processing steps for realizing the semantic recognition system, and is provided by being recorded on a computer-readable recording medium and provided. Can be constructed. As shown in the example of the recording medium shown in FIG. 15, a recording medium storing a program having processing steps for realizing the meaning recognition system according to the present invention is a portable recording medium such as a CD-ROM 123 or a flexible disk 124. 122 as well as the recording medium 12 in the recording device on the network.
1 or a recording medium 126 such as a hard disk of a computer or a RAM. When the program is executed, the program is loaded on the computer 125 and executed on the main memory.

【００８９】[0089]

【発明の効果】本発明にかかる意味認識システムによれ
ば、利用者に対して利用者の使用する言語による音声入
力のユーザインタフェースを提供し、利用者は自らの言
語の音声入力によりゲームなどのアプリケーションを楽
しむことができる。また、アプリケーションに対して言
語に依存しない意味内容を伝えるアプリケーション用イ
ンタフェースを提供することができ、アプリケーション
は利用者入力命令を意味番号を中心とした言語非依存の
中間コードで記述できるため、アプリケーション開発者
の他言語へのローカライズ作業工数を不要とすることが
できる。According to the semantic recognition system of the present invention, the user is provided with a user interface for voice input in a language used by the user, and the user can input a voice or the like in a language of his / her own language for a game or the like. You can enjoy the application. In addition, it is possible to provide an application interface that conveys language-independent semantic contents to the application, and the application can describe user input commands with language-independent intermediate code centered on semantic numbers. Localization work for the user in another language can be eliminated.

【００９０】また、本発明にかかる意味認識システムに
よれば、アプリケーションに応じてその都度アプリケー
ションで使用する意味番号・登録単語リストのみを生成
するので使用する主メモリの容量を小さく抑えることが
でき、音声認識の速度が速くなる。Further, according to the semantic recognition system according to the present invention, since only the meaning number / registered word list used in the application is generated each time depending on the application, the capacity of the main memory used can be reduced. Speech recognition speed increases.

【００９１】また、本発明にかかる意味認識システムに
よれば、ロードしたアプリケーションで使用する利用者
入力命令群を、利用者の使用する複数の言語による命令
群に自動的に変換して複数言語の音声入力によるマルチ
リンガルなユーザーインタフェースを構築することがで
きる。Further, according to the semantic recognition system of the present invention, a user input command group used in a loaded application is automatically converted into a command group in a plurality of languages used by the user, and a plurality of languages are used. A multilingual user interface by voice input can be constructed.

【００９２】さらに、本発明にかかる意味認識システム
のうち、外部記述ファイルをネットワーク上に配置され
た外部記述ファイルサーバとしたクライアントサーバ構
成で構築することにより、各言語ごとの外部記述ファイ
ルを意味認識システム個別に用意する必要がなくなり、
利用形態の多様化とシステムの低価格化が実現できる。Further, in the semantic recognition system according to the present invention, by constructing the externally described file in a client server configuration as an externally described file server arranged on a network, the externally described file for each language is semantically recognized. There is no need to prepare individual systems,
It is possible to diversify the usage form and reduce the price of the system.

[Brief description of the drawings]

【図１】本発明の実施形態１にかかる意味認識システ
ムの概略構成図FIG. 1 is a schematic configuration diagram of a semantic recognition system according to a first embodiment of the present invention;

【図２】本発明の実施形態１にかかる意味認識システ
ムの処理流れの全体像を示したフローチャートFIG. 2 is a flowchart showing an overall processing flow of the semantic recognition system according to the first embodiment of the present invention;

【図３】意味番号・登録単語リストの例[Figure 3] Example of meaning number / registered word list

【図４】意味番号・登録単語リストの例Fig. 4 Example of meaning number / registered word list

【図５】意味番号・登録単語リストへの登録単語・意
味番号セット追加例[Fig. 5] Example of adding registered word / meaning number set to the meaning number / registered word list

【図６】本発明の実施形態２にかかる意味認識システ
ムの概略構成図FIG. 6 is a schematic configuration diagram of a semantic recognition system according to a second embodiment of the present invention;

【図７】本発明の実施形態２にかかる意味認識システ
ムの処理流れの全体像を示したフローチャートFIG. 7 is a flowchart showing an overall processing flow of the semantic recognition system according to the second embodiment of the present invention;

【図８】外部記述ファイル５０の例FIG. 8 shows an example of an external description file 50.

【図９】意味番号・登録単語リスト４１へ取得した意
味番号を展開した例FIG. 9 is an example in which the acquired meaning numbers are expanded to the meaning number / registered word list 41.

【図１０】生成された意味番号・登録単語リスト４１
の例FIG. 10: Generated meaning number / registered word list 41
Example

【図１１】本発明の実施形態３にかかる意味認識シス
テムの概略構成図FIG. 11 is a schematic configuration diagram of a semantic recognition system according to a third embodiment of the present invention;

【図１２】本発明の実施形態３にかかる意味認識シス
テムの処理流れの全体像を示したフローチャートFIG. 12 is a flowchart showing an overall processing flow of the semantic recognition system according to the third embodiment of the present invention;

【図１３】生成された意味番号・登録単語リストの例FIG. 13 shows an example of a generated meaning number / registered word list.

【図１４】本発明の実施形態４にかかる意味認識シス
テムの概略構成図FIG. 14 is a schematic configuration diagram of a semantic recognition system according to a fourth embodiment of the present invention.

【図１５】記録媒体の例を示す図FIG. 15 illustrates an example of a recording medium.

[Explanation of symbols]

１０音声入力デバイス２０音声信号ファイル３０音声認識部３１音声認識エンジン３２音声認識単語更新部４０認識単語・意味番号変換部４１意味番号・登録単語リスト４２意味番号・登録単語リスト変更部４３意味番号取得部４４意味番号・登録単語リスト生成部４５登録単語通知部５０アプリケーション６０外部記述ファイル７０使用言語指定部８０アプリケーションサーバ９０外部記述ファイルサーバ１００クライアント１１０ネットワーク１２１回線先のハードディスク等の記録媒体１２２ＣＤ−ＲＯＭやフレキシブルディスク等の可搬
型記録媒体１２３ＣＤ−ＲＯＭ１２４フレキシブルディスク１２５コンピュータ１２６コンピュータ上のＲＡＭ／ハードディスク等の
記録媒体Reference Signs List 10 voice input device 20 voice signal file 30 voice recognition unit 31 voice recognition engine 32 voice recognition word update unit 40 recognition word / meaning number conversion unit 41 meaning number / registered word list 42 meaning number / registered word list changing unit 43 acquisition of meaning number Unit 44 Meaning number / registered word list generation unit 45 Registered word notification unit 50 Application 60 External description file 70 Language specification unit 80 Application server 90 External description file server 100 Client 110 Network 121 Recording medium such as hard disk at connection destination 122 CD- Portable recording medium such as ROM and flexible disk 123 CD-ROM 124 Flexible disk 125 Computer 126 Recording medium such as RAM / hard disk on computer

───────────────────────────────────────────────────── フロントページの続き (72)発明者山本健司神奈川県川崎市中原区上小田中４丁目１番１号富士通株式会社内 (72)発明者野田政秀神奈川県川崎市中原区上小田中４丁目１番１号富士通株式会社内 ──────────────────────────────────────────────────続き Continuing on the front page (72) Inventor Kenji Yamamoto 4-1-1, Kamiodanaka, Nakahara-ku, Kawasaki-shi, Kanagawa Prefecture Inside Fujitsu Limited (72) Inventor Masahide Noda 4-1-1, Kamiodanaka, Nakahara-ku, Kawasaki-shi, Kanagawa Prefecture No. 1 Inside Fujitsu Limited

Claims

[Claims]

1. A system for providing a user interface capable of voice input to a user and providing an application interface for transmitting an input content of the user to an application, wherein the input audio signal is received. A voice signal input unit, a voice recognition unit for recognizing a corresponding word based on the input voice signal, a meaning number representing the meaning of the word, and a meaning number / registered word list which is a list of registered words belonging to the meaning number A recognition word / meaning number conversion unit, an application interface, and an application that handles the meaning number as data, and a word corresponding to the speech recognition unit based on the speech signal input to the speech signal input unit. Recognize and convert the recognized word into the corresponding meaning number by the recognized word / meaning number conversion unit , Meaning recognition system and notifies the meaning number the converted via the interface application to application.

2. The meaning recognition system according to claim 1, further comprising a meaning number / registration word list updating unit for updating the meaning number / registration word list of the recognition word / meaning number conversion unit.

3. An external description file in which a word / semantic system table in which a systematic description of systematically classifying meanings and encoding of each semantic unit and registered words belonging to each semantic number is recorded. The semantic recognition system according to claim 1, comprising:

4. The application in which at least a part of the processing data includes a meaning number, wherein the recognition word / meaning number conversion unit converts the meaning described in the application via the application interface. A meaning number acquiring unit for acquiring a number, a meaning number for extracting a registered word corresponding to the acquired meaning number from the word / semantic system table of the external description file, and generating the meaning number / registered word list.
The semantic recognition system according to claim 3, further comprising a registered word list generation unit.

5. The application in which the processing data includes a meaning number in at least a part of the processing data, and based on the meaning number described in the application itself, a word / semantic table of the external description file is used. A processing step of extracting a corresponding registered word; and a processing step of notifying the meaning number and the registered word to the recognition word / meaning number conversion unit, wherein the recognition word / meaning number conversion unit obtains from the application. 4. The meaning recognition system according to claim 3, further comprising a meaning number / registered word list generation unit that generates the meaning number / registered word list based on the meaning number and the registered word.

6. The registered word / semantic number conversion unit includes a registered word notifying unit that notifies the registered words included in the generated meaning number / registered word list to the voice recognition unit.
The semantic recognition system according to claim 4, wherein the voice recognition unit includes a voice recognition word updating unit that updates the content of a voice recognition engine based on the notified registered word.

7. A use language designation unit for designating a language used by a user, an external description file for each of a plurality of languages as the external description file, and a speech for each of a plurality of languages as a speech recognition engine of the speech recognition unit. A recognition engine, wherein the semantic number / registered word list generating unit generates a semantic number / registered word list based on an external description file of the language specified by the working language specifying unit, and the speech recognition unit includes: 7. The meaning according to claim 6, wherein a multi-lingual compliant user interface is provided that allows a user to input an instruction in a plurality of languages by using a speech recognition engine of a language specified by the language-in-use specifying unit. Recognition system.

8. An externally described file server that provides the externally described file, an application server that provides the application, the voice signal input unit, the voice recognition unit, the recognized word / meaning number conversion unit, and the application server. The semantic recognition system according to claim 6, further comprising a semantic recognition system client having an interface, wherein the externally described file server, the application server, and the semantic recognition system client are connected via a computer network.

9. A program describing processing steps for providing a user interface capable of voice input to a user and realizing a system for providing an application interface for transmitting a user input to an application. A voice signal input processing step of receiving a voice signal to be input, a voice recognition processing step of recognizing a corresponding word based on the input voice signal, a meaning number representing a meaning, and a registration belonging to the meaning number. A meaning number / registered word list generating processing step of generating a meaning number / registered word list which is a list of words, and converting the word recognized by the voice recognition processing step to a meaning number corresponding to the generated meaning number / registered word list. Recognized words to be converted
A computer-readable recording medium recording a semantic recognition processing program comprising a semantic number conversion processing step and a semantic number notification processing step of notifying the converted meaning number to the application.