JP2000075887A

JP2000075887A - Device, method and system for pattern recognition

Info

Publication number: JP2000075887A
Application number: JP10246401A
Authority: JP
Inventors: Koji Asano; 康治浅野; Katsuki Minamino; 活樹南野; Hitoshi Honda; 等本田; Osamu Hamada; 修浜田
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 1998-08-31
Filing date: 1998-08-31
Publication date: 2000-03-14

Abstract

PROBLEM TO BE SOLVED: To increase the recognition object patterns and to reduce the network load. SOLUTION: In the pattern recognition system, a terminal 10, which is provided with a small scale dictionary 15 that becomes a reference for pattern recognition, and a server 20, which is provided with a large scale dictionary 24 having a larger scale than the dictionary 15, are connected by a network. The terminal 10 conducts a recognition process for the pattern inputted into an input section 11 using the dictionary 15. If the recognition result is not appropriate, recognition is conducted by the dictionary 24 of the server 20.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、パターン認識を利
用した装置及び方法に関し、特にネットワークで接続さ
れた複数の機器におけるパターン認識装置、方法及びシ
ステムに関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an apparatus and method using pattern recognition, and more particularly to an apparatus, method and system for pattern recognition in a plurality of devices connected by a network.

【０００２】[0002]

【従来の技術】一般に、パターン認識を行う場合には、
照合する際に基準になるパターンを登録したパターン辞
書が大きくなるほど、より詳細な照合を行うことがで
き、認識精度を高めたり、またより多くの種類のパター
ンの中からの認識を行うことが可能となる。2. Description of the Related Art Generally, when performing pattern recognition,
The larger the pattern dictionary in which the reference patterns are registered for matching, the more detailed matching can be performed, the higher the recognition accuracy, and the more types of patterns can be recognized. Becomes

【０００３】しかし一方で、パターン辞書を大きくする
と、認識処理に必要となる演算量や記憶容量が大きくな
る。[0003] On the other hand, when the pattern dictionary is made large, the amount of calculation and storage capacity required for the recognition process become large.

【０００４】そのため、通常の装置においても、認識処
理のレスポンスが悪化することがさけられない。[0004] Therefore, even in a normal apparatus, the response of the recognition processing cannot be deteriorated.

【０００５】また、特に携帯を想定した小型端末など演
算量や記憶容量の制約が厳しい場合には、大規模なパタ
ーン辞書を利用したパターン認識を端末単体で実現する
のは困難であった。[0005] In addition, especially when the calculation amount and the storage capacity are severely restricted, such as in a small terminal that is assumed to be portable, it has been difficult to realize pattern recognition using a large-scale pattern dictionary by itself.

【０００６】このような小型の端末で大規模なパターン
辞書を利用したパターン認識処理を行う方法としては、
特開平７−１５２７９７号公報で開示された明細書中に
示されているように、ネットワークを介して、サーバに
接続し、サーバにおいて大規模なパターン辞書を利用し
たパターン認識処理を行い、その結果をネットワークを
介して端末に送ると行った方法が提案されている。A method for performing pattern recognition processing using a large-scale pattern dictionary on such a small terminal is as follows.
As shown in the specification disclosed in JP-A-7-152797, a server is connected to a server via a network, and the server performs a pattern recognition process using a large-scale pattern dictionary. Is sent to a terminal via a network.

【０００７】すなわち、上記明細書中には、図４に示す
ように、情報アクセスシステムの一構成例が開示されて
いる。この情報アクセスシステムは、センターシステム
１０１およびそれと、例えばＰＳＴＮや統合サービスデ
ィジタル網（integrated services digital network;IS
DN）などの電話回線などで接続され、センターシステム
１０１と通信可能なターミナル１０２とから構成されて
いる。[0007] That is, the above specification discloses one configuration example of the information access system as shown in FIG. This information access system comprises a center system 101 and, for example, a PSTN or an integrated services digital network (IS).
The terminal 102 is connected by a telephone line such as DN) and includes a terminal 102 that can communicate with the center system 101.

【０００８】ターミナル１０２は、センターシステム１
０１の音声入力型データベース検索・送出システム１０
３をインターフェイスとして、例えば図書館情報、新聞
情報、雑誌情報、新刊情報などの刊行物の情報が記憶さ
れた図書データベース１０５にアクセスし、所望する情
報を得ることができるようになされている。なお、図書
データベース１０５には、刊行物の情報として、そのテ
キストデータなどの他、その音韻情報および韻律情報か
らなる発音記号情報が記憶されており、ターミナル１０
２では、これらの情報を受信し、刊行物のテキストデー
タの表示、印刷の他、発音記号情報から合成音を生成
し、これにより刊行物を朗読した音声を得ることができ
るようになされている。The terminal 102 is connected to the center system 1
01 voice input type database search / transmission system 10
3 is used as an interface to access a book database 105 in which information of publications such as library information, newspaper information, magazine information, and new book information is stored, and to obtain desired information. The book database 105 stores, as information on publications, phonetic symbol information including phonemic information and prosodic information in addition to text data and the like.
In No. 2, in addition to receiving such information, displaying and printing the text data of the publication, and generating a synthesized sound from the phonetic symbol information, it is possible to obtain a speech reading the publication. .

【０００９】センターシステム１０１は、音声入力型デ
ータベース検索・送出システム１０３、オーサリングシ
ステム１０４、図書データベース１０５、およびネット
ワーク制御部１０６から構成されている。オーサリング
システム１０４は、刊行物の情報を、図書データベース
５に記憶させる、刊行物の情報のデータベース化や、図
書データベース１０５に記憶された情報を編集すること
ができるようになされている。ネットワーク制御部１０
６は、センターシステム１０１と他のセンターシステム
との通信を制御し、これによりセンターシステム１０１
の図書データベース５と、他のセンターシステムにおけ
るデータベースとの共有化を図ることができるようにな
されている。The center system 101 comprises a voice input type database search / transmission system 103, an authoring system 104, a book database 105, and a network control unit 106. The authoring system 104 is configured to store publication information in the book database 5, create a database of publication information, and edit the information stored in the book database 105. Network control unit 10
6 controls communication between the center system 101 and other center systems, and
Of the book database 5 and databases in other center systems.

【００１０】[0010]

【発明が解決しようとする課題】しかし、大規模なパタ
ーン辞書を用いたパターン認識処理には、処理のレスポ
ンスという問題がある。However, pattern recognition processing using a large-scale pattern dictionary has a problem of processing response.

【００１１】すなわち、認識用のパターン辞書を大規模
化すると、入力パターンの照合に時間がかかり、認識結
果がでるまでの時間が長くかかり、レスポンスが悪くな
る。そのため機器の操作性が悪くなることが考えられ
る。That is, when the size of the pattern dictionary for recognition is increased, it takes time to collate the input pattern, it takes a long time until a recognition result is obtained, and the response becomes poor. Therefore, the operability of the device may be degraded.

【００１２】例えば、パターン認識処理の一例である音
声認識処理を用いた端末を考えてみる。端末をコントロ
ールするためのコマンドなど良く使われる語の認識のレ
スポンスが悪いと、端末の操作性が大きく損なわれる。For example, consider a terminal that uses voice recognition processing, which is an example of pattern recognition processing. Poor response in recognizing commonly used words, such as commands for controlling the terminal, greatly impairs the operability of the terminal.

【００１３】また、図４に示した情報アクセスシステム
のような大規模なパターン辞書を用いたパターン認識処
理を、端末とサーバを用いた実現するような構成のパタ
ーン認識処理装置の場合、次のような問題がある。Further, in the case of a pattern recognition processing apparatus configured to realize a pattern recognition process using a large-scale pattern dictionary using a terminal and a server as in the information access system shown in FIG. There is such a problem.

【００１４】すなわち、端末においてパターン認識を実
行しようとすると、端末はまずネットワークを介してサ
ーバに接続し、入力パターンのデータを転送し、サーバ
でパターン認識処理を行い、その結果をふたたびネット
ワーク経由でサーバから得るという処理を行う必要があ
る。そのため、パターン認識結果を得るのに時間が掛か
り、より一層のレスポンスの低下が起こる。That is, to execute pattern recognition in a terminal, the terminal first connects to a server via a network, transfers data of an input pattern, performs pattern recognition processing in the server, and returns the result via the network again. It is necessary to perform the process of obtaining from the server. Therefore, it takes time to obtain a pattern recognition result, and the response is further reduced.

【００１５】さらに、サーバにおいて常にパターン認識
を行う場合、パターン認識を行う度にネットワークを利
用した端末とサーバを接続しなければならず、ネットワ
ークでの負荷が増え、特に携帯端末などで常にネットワ
ークに接続されているわけではない環境では、ネットワ
ーク利用のコストが増大する。Further, in the case where the pattern recognition is always performed in the server, a terminal using the network must be connected to the server every time the pattern recognition is performed, so that the load on the network increases. In an environment that is not connected, the cost of using the network increases.

【００１６】本発明は、上述の実情に鑑みてなされるも
のであって、パターン認識処理のレスポンス短縮と認識
対象パターンの増大を実現するようなパターン認識装置
及び方法を提供することを目的とする。SUMMARY OF THE INVENTION The present invention has been made in view of the above circumstances, and has as its object to provide a pattern recognition apparatus and method which can shorten the response of pattern recognition processing and increase the number of patterns to be recognized. .

【００１７】[0017]

【課題を解決するための手段】上述の課題を解決するた
めに、本発明に係るパターン認識装置は、パターンを入
力する入力手段と、パターンの認識に用いる基準となる
第１の辞書と、パターンの認識に用いる基準となる辞書
であって上記第１の辞書より規模の大きい第２の辞書
と、上記入力手段に入力されたパターンについて上記第
１の辞書で認識を行い、その結果が不適切であると上記
第２の辞書にて認識を行う認識手段と、上記認識手段か
らのパターンの認識の結果を出力する出力手段とを有す
るものである。In order to solve the above-mentioned problems, a pattern recognition apparatus according to the present invention comprises: an input means for inputting a pattern; a first dictionary used as a reference for pattern recognition; A second dictionary, which is a reference dictionary used for recognition of the first dictionary and which is larger in scale than the first dictionary, and a pattern input to the input means is recognized by the first dictionary. In this case, there is provided a recognizing means for performing recognition using the second dictionary, and an output means for outputting a result of pattern recognition from the recognizing means.

【００１８】また、本発明に係るパターン認識方法は、
パターンを入力する入力工程と、パターンの認識に用い
る基準となる第１の辞書、及びパターンの認識に用いる
基準となる辞書であって上記第１の辞書より規模の大き
い第２の辞書を用い、上記入力工程に入力されたパター
ンについて上記第１の辞書で認識を行い、その結果が不
適切であると上記第２の辞書にて認識を行う認識工程
と、上記認識工程からのパターン認識の結果を出力する
出力工程とを有するものである。Further, the pattern recognition method according to the present invention comprises:
An input step of inputting a pattern, a first dictionary serving as a reference used for pattern recognition, and a second dictionary serving as a reference used for pattern recognition and being larger than the first dictionary, A recognition step of recognizing the pattern input to the input step by the first dictionary and recognizing that the result is inappropriate by the second dictionary; and a pattern recognition result from the recognition step. And an output step of outputting

【００１９】そして、本発明に係るパターン認識システ
ムは、パターンの認識に用いる基準となる第１の辞書を
備える端末と、パターンの認識に用いる基準となる辞書
であって上記第１の辞書より規模の大きい第２の辞書を
備えるサーバとがネットワークにて結ばれてなるパター
ン認識システムであって、上記端末は、上記端末に入力
したパターンについて、上記第１の辞書で認識を行い、
その結果が不適切であると上記サーバの備える第２の辞
書で認識を行うものである。The pattern recognition system according to the present invention includes a terminal having a first dictionary serving as a reference used for pattern recognition, and a terminal serving as a reference serving as a reference used for pattern recognition and having a scale larger than that of the first dictionary. Is a pattern recognition system in which a server having a second dictionary having a large size is connected via a network, wherein the terminal recognizes a pattern input to the terminal using the first dictionary,
If the result is inappropriate, recognition is performed using the second dictionary of the server.

【００２０】すなわち、本発明ではパターン認識におけ
る照合処理の基準となるパターン辞書を、小規模な第１
の辞書と大規模な第２の辞書の２つに分けて保持し、そ
れを必要に応じて使い分ける。That is, in the present invention, a pattern dictionary serving as a reference for a matching process in pattern recognition is stored in a small first
And a large-scale second dictionary, which are stored separately, and used as needed.

【００２１】すなわち、まず小規模な第１の辞書を用い
て、パターン認識処理を行う。その結果が、適切な認識
結果が得られれば、パターン認識処理を終了する。もし
適切な結果が得られなかった場合は、大規模な第２の辞
書を用いてふたたびパターン認識処理を行う。That is, pattern recognition processing is first performed using a small first dictionary. If an appropriate recognition result is obtained, the pattern recognition processing ends. If an appropriate result is not obtained, the pattern recognition processing is performed again using the large-scale second dictionary.

【００２２】[0022]

【発明の実施の形態】以下、本発明の実施の形態につい
て、図面を参照して詳細に説明する。Embodiments of the present invention will be described below in detail with reference to the drawings.

【００２３】まず、本発明の第１の実施の形態として、
パターン認識により音声を認識する音声認識装置につい
て説明する。First, as a first embodiment of the present invention,
A speech recognition device that recognizes speech by pattern recognition will be described.

【００２４】音声認識装置は、図１に示すように、ユー
ザによりデータの入出力の操作がなわれる端末１０と、
大容量のデータベースを備えてデータの処理を集中して
行うサーバ２０とから構成されている。これら端末１０
およびサーバ２０は、情報を伝送する路線であるネット
ワーク３０により接続が可能である。As shown in FIG. 1, the voice recognition device includes a terminal 10 on which data input / output operation is performed by a user;
And a server 20 having a large-capacity database and centrally performing data processing. These terminals 10
The server 20 and the server 20 can be connected by a network 30 which is a line for transmitting information.

【００２５】まず、端末１０について説明する。端末１
０は、ユーザによるデータの入出力を受け付け、ネット
ワーク３０を介してサーバ２０と接続するものである。First, the terminal 10 will be described. Terminal 1
0 is for accepting data input / output by the user and connecting to the server 20 via the network 30.

【００２６】端末１０は、音声の入力を受け付ける入力
部１１と、パターン辞書に相当する認識辞書部１２と、
音声パターンを認識辞書部１２により照合する演算部１
３と、演算部１３から送られた結果を出力する出力部１
４とを有している。The terminal 10 includes an input unit 11 for receiving a voice input, a recognition dictionary unit 12 corresponding to a pattern dictionary,
The operation unit 1 for collating the voice pattern by the recognition dictionary unit 12
3 and an output unit 1 for outputting a result sent from the arithmetic unit 13
And 4.

【００２７】入力部１１は認識対象となる音声パターン
の入力を受け付ける部分で、マイク、アンプ、ＡＤ変換
器などで構成される。入力部１１では、入力された音声
パターンをデジタル化して演算部１３に送る。The input section 11 is a section for receiving an input of a voice pattern to be recognized, and includes a microphone, an amplifier, an AD converter, and the like. The input unit 11 digitizes the input voice pattern and sends it to the arithmetic unit 13.

【００２８】認識辞書部１２は、パターン辞書に相当す
るものである。認識辞書部１２は、半導体メモリやハー
ドディスクなどで構成される。認識辞書部１２は、音声
認識の場合は、一般に認識の基本単位となる音響パター
ンと、音響パターンがどのように連鎖し得るかを規定す
る単語辞書、文法から構成される。The recognition dictionary unit 12 corresponds to a pattern dictionary. The recognition dictionary unit 12 includes a semiconductor memory, a hard disk, and the like. In the case of speech recognition, the recognition dictionary unit 12 generally includes an acoustic pattern that is a basic unit of recognition, a word dictionary that defines how acoustic patterns can be linked, and a grammar.

【００２９】本実施の形態においては、認識辞書部１２
は、小規模な認識辞書である小規模辞書１５に実装され
ている。小規模辞書１５は、音響パターン１６および単
語辞書・文法１７から構成されている。In the present embodiment, the recognition dictionary unit 12
Are implemented in a small dictionary 15 which is a small recognition dictionary. The small dictionary 15 includes an acoustic pattern 16 and a word dictionary / grammar 17.

【００３０】なお、本実施の形態には、小規模辞書より
規模の大きい認識辞書である大規模辞書も備えられてい
る。この大規模辞書は、小規模辞書１５より精細な音響
パターン、大語彙の単語辞書、複雑な文法を備えてい
る。後述するように、この大規模辞書はサーバ２０に備
えられるので、端末１０からはネットワーク３０により
接続することができる。The present embodiment also includes a large-scale dictionary, which is a recognition dictionary having a larger scale than a small-scale dictionary. This large-scale dictionary has a sound pattern, a large vocabulary word dictionary, and a complicated grammar that are more detailed than the small-scale dictionary 15. As will be described later, this large-scale dictionary is provided in the server 20, so that the terminal 10 can be connected to the network 30 via the network 30.

【００３１】演算部１３はＣＰＵ、半導体メモリなどで
構成される。演算部１３は、入力部１１より送られた音
声パターンと、認識辞書部１２に格納されている音響パ
ターンを、単語辞書、文法に規定されたパターンの接続
条件を満たす形で照合を行う。演算部１３は、このよう
な照合による音声認識により得られた音声認識結果を出
力部１４に送る。The operation unit 13 is composed of a CPU, a semiconductor memory and the like. The arithmetic unit 13 collates the voice pattern sent from the input unit 11 with the acoustic pattern stored in the recognition dictionary unit 12 in a form that satisfies the connection conditions of the pattern specified in the word dictionary and the grammar. The calculation unit 13 sends the speech recognition result obtained by the speech recognition based on the collation to the output unit 14.

【００３２】出力部１４は、ディスプレイや音声合成装
置などから構成される。出力部は、演算部１３より送ら
れた音声認識結果をユーザに提示する。The output unit 14 is composed of a display, a voice synthesizer, and the like. The output unit presents the user with the speech recognition result sent from the calculation unit 13.

【００３３】また、端末１０は、ネットワーク３０に接
続するインターフェース部であるサーバ接続部１８を有
している。サーバ接続部１８はモデムなどで構成され、
必要に応じてネットワークを経由してサーバと接続す
る。The terminal 10 has a server connection unit 18 which is an interface unit for connecting to the network 30. The server connection unit 18 is configured by a modem or the like,
Connect to the server via the network as needed.

【００３４】続いて、サーバ２０について説明する。サ
ーバ２０は、大容量のデータベースを備え、ネットワー
ク３０を介して端末から接続されることにより、データ
処理を集中して行うものである。Next, the server 20 will be described. The server 20 has a large-capacity database, and performs data processing in a concentrated manner by being connected from a terminal via the network 30.

【００３５】サーバ２０は、パターン辞書に相当する認
識辞書部２１と、音声パターンを認識辞書部２１を用い
て認識する演算部２２と、ネットワーク３０に接続する
インターフェースである端末接続部２３とを有してい
る。The server 20 has a recognition dictionary unit 21 corresponding to a pattern dictionary, an operation unit 22 for recognizing a voice pattern using the recognition dictionary unit 21, and a terminal connection unit 23 which is an interface for connecting to a network 30. are doing.

【００３６】認識辞書部２１は、端末１０に備えられる
認識辞書部１２と略々同じ構成である。すなわち、認識
辞書部２１は、パターン辞書に相当するものであって、
半導体メモリやディスクなどで構成される。ただし、端
末１０に備えられる小規模辞書１５より規模の大きい大
規模辞書２４が備えられている。大規模辞書２４には、
小規模辞書１５より精細な音響パターン２５、大語彙の
単語辞書、複雑な文法２６が備えられている。The recognition dictionary unit 21 has substantially the same configuration as the recognition dictionary unit 12 provided in the terminal 10. That is, the recognition dictionary unit 21 corresponds to a pattern dictionary,
It is composed of a semiconductor memory and a disk. However, a large-scale dictionary 24 which is larger than the small-scale dictionary 15 provided in the terminal 10 is provided. In the large-scale dictionary 24,
An acoustic pattern 25, a word dictionary of a large vocabulary, and a complicated grammar 26 are provided.

【００３７】演算部２２は、端末１０に備えられる演算
部１３と略々同じ構成である。すなわち、演算部２２
は、ＣＰＵ、半導体メモリなどで構成され、端末１０か
らネットワークを経由して送られた音声パターンと、認
識辞書部２１に格納されている音響パターンを、単語辞
書、文法に規定されたパターンの接続条件を満たす形で
照合を行う。ただし、サーバ２０に備えられる演算部２
２には、端末１０に備えられる演算部１３より高性能な
ものが用いられている。演算部２２は、認識辞書部２１
を用いた照合による音声認識により得られた音声認識結
果を端末１０に送る。The operation unit 22 has substantially the same configuration as the operation unit 13 provided in the terminal 10. That is, the operation unit 22
Is a connection between a voice pattern sent from the terminal 10 via the network and an acoustic pattern stored in the recognition dictionary unit 21 by a CPU, a semiconductor memory, and the like. Perform matching in a form that satisfies the conditions. However, the operation unit 2 provided in the server 20
For 2, a unit having a higher performance than the operation unit 13 provided in the terminal 10 is used. The operation unit 22 includes a recognition dictionary unit 21
The speech recognition result obtained by the speech recognition based on the collation using is transmitted to the terminal 10.

【００３８】端末接続部２３はモデムなどで構成されて
いる。端末接続部２３は、端末１０からのサーバ接続命
令を受け、このサーバ接続命令に基づいて、端末１０と
の間の接続を制御する。The terminal connection unit 23 is composed of a modem or the like. The terminal connection unit 23 receives a server connection command from the terminal 10 and controls connection with the terminal 10 based on the server connection command.

【００３９】頻繁に利用されるパターンに関しては、小
規模な認識辞書である小規模辞書１５に登録しておくこ
とによって、頻繁に利用される語に関しては、小規模辞
書１５でレスポンスが早い認識処理を行うことができ
る。The frequently used patterns are registered in the small dictionary 15, which is a small recognition dictionary, so that the frequently used words can be quickly recognized by the small dictionary 15. It can be performed.

【００４０】さらに、大規模な認識辞書である大規模２
４辞書をサーバ２０上に備え、ネットワーク３０を介し
て利用することによって、より多くのパターンとより精
細に照合した認識を行うことも可能である。しかも、こ
れは必要となった場合にだけ実行されるので、常時ネッ
トワーク３０で接続してサーバ２０で認識処理を行う場
合に比べて、ネットワーク３０の負荷や通信コストの低
減が可能となる。Further, a large-scale recognition dictionary 2
By providing four dictionaries on the server 20 and using them via the network 30, it is also possible to perform recognition with more patterns and more detailed matching. In addition, since this is executed only when necessary, the load on the network 30 and the communication cost can be reduced as compared with the case where the connection processing is always performed by the network 30 and the server 20 performs the recognition processing.

【００４１】続いて、音声認識装置における音声認識処
理の流れについて、図２に示すフローチャートを参照し
て説明する。Next, the flow of the voice recognition process in the voice recognition device will be described with reference to the flowchart shown in FIG.

【００４２】最初のステップＳ１１においては、端末１
０の入力部１１に対して、音声が入力される。入力部１
１では、入力音声をデジタル化し、演算部１３に送る。In the first step S11, the terminal 1
A voice is input to the input unit 11 of “0”. Input unit 1
In step 1, the input voice is digitized and sent to the operation unit 13.

【００４３】ステップＳ１２においては、演算部１３
は、入力部から送られたデジタル化された音声を、端末
１０の認識辞書部１２の小規模辞書１５を用いて認識処
理を行う。すなわち、小規模辞書１２に備えられる音響
パターン１６および単語辞書・文法１７を用いて認識処
理を行う。In step S12, the operation unit 13
Performs a recognition process on the digitized voice sent from the input unit using the small dictionary 15 of the recognition dictionary unit 12 of the terminal 10. That is, recognition processing is performed using the acoustic pattern 16 and the word dictionary / grammar 17 provided in the small-scale dictionary 12.

【００４４】ステップＳ１３においては、ステップＳ１
２にて行われた音声の認識処理の結果を判断する。認識
処理の結果の判断を行うには、音声認識装置が自動で行
う方法と、ユーザによる確認の方法との２つの方法が考
えられる。In step S13, step S1
The result of the speech recognition process performed in step 2 is determined. There are two methods for determining the result of the recognition process, a method automatically performed by the voice recognition device and a method of confirmation by the user.

【００４５】音声認識の結果を音声認識装置が自動で行
う方法としては、一般に信頼性尺度（confidence measu
re;C）と呼ばれる認識結果の信頼性に関する尺度を基準
として判断する方法がある。As a method of automatically performing the result of speech recognition by the speech recognition apparatus, generally, a confidence measure (confidence measu) is used.
There is a method of making a judgment based on a measure relating to the reliability of the recognition result called re; C).

【００４６】ここで、信頼性尺度Ｃには、認識スコアが
第１位の候補の認識スコアＰ（Ｗfirst）、認識スコア
が第２位の候補の認識スコアＰ（Ｗsecond）について、
次のように計算される値が用いられる。Ｃ＝Ｐ（Ｗfirst）−Ｐ（Ｗsecond）また、信頼性尺度Ｃは、さらに第３位の候補の認識スコ
アＰ（Ｗthird）以下を用いて、Ｃ＝Ｗ（Ｐfirst）／（Ｐ（Ｗsecond）−Ｐ（Ｗthir
d））により与えることもできる。Here, the reliability scale C includes the recognition score P (Wfirst) of the candidate having the first recognition score and the recognition score P (Wsecond) of the candidate having the second recognition score.
The value calculated as follows is used. C = P (Wfirst) −P (Wsecond) Further, the reliability scale C is obtained by using the recognition score P (Wthird) of the third candidate or less, and C = W (Pfirst) / (P (Wsecond) − P (Wthir
d)) can also be given by

【００４７】上述の信頼性尺度は、一般に、パターン認
識処理の結果、認識スコアが第１位の候補と、第２位以
下の候補の認識スコアの差が大きいほど、認識結果の信
頼性が高いと想定されることに基づいて定義したもので
ある。In general, the reliability scale of the above-described reliability measure is such that the larger the difference between the recognition scores of the first and second or lower candidates as a result of the pattern recognition processing, the higher the reliability of the recognition result. It is defined based on what is assumed.

【００４８】このような信頼性尺度を基準として判断す
る方法によると、音声認識装置は、音声の認識処理の過
程で信頼性尺度を計算し、信頼性尺度の値が所定の閾値
を超えた場合には認識結果がＯＫであると判断し、信頼
性尺度の値が上記閾値より小さい場合には認識結果がＮ
Ｇであると判断する。According to such a method of making a determination based on the reliability measure, the speech recognition apparatus calculates the reliability measure in the course of the speech recognition processing, and determines whether the value of the reliability measure exceeds a predetermined threshold value. Determines that the recognition result is OK, and if the value of the reliability scale is smaller than the threshold value, the recognition result is N
It is determined to be G.

【００４９】一方、ユーザによる確認による方法では、
認識処理の結果を演算部１３から出力部１４に送る。そ
して、出力部１４において、画面表示または音声による
提示によってユーザに伝達し、その結果をユーザが判断
して、認識結果の良し悪しを判断する。On the other hand, in the method based on confirmation by the user,
The result of the recognition processing is sent from the calculation unit 13 to the output unit 14. Then, in the output unit 14, the result is transmitted to the user by screen display or audio presentation, and the result is determined by the user, and the quality of the recognition result is determined.

【００５０】ステップＳ１４においては、ステップＳ１
３における認識結果の判断により分岐する。すなわち、
認識結果がＯＫと判断された場合には、ステップＳ２０
において最終結果出力として、その認識結果を最終結果
として、出力部１４に送り、ユーザに出力して、処理は
終了する。認識結果がＮＧであると判断された場合に
は、次のステップＳ１５に進む。In step S14, step S1
The process branches depending on the judgment of the recognition result in 3. That is,
If the recognition result is determined to be OK, step S20
Then, the recognition result is sent to the output unit 14 as a final result output as a final result, and is output to the user, and the process ends. If it is determined that the recognition result is NG, the process proceeds to the next step S15.

【００５１】ステップＳ１５においては、端末１０はサ
ーバ接続部１８からネットワーク２０を介してサーバに
接続する。そして、端末１０は、音声認識に必要なデー
タをサーバ２０に転送する。ここで、音声認識に必要な
データとしては、入力音声データそのものでもいいが、
音響分析をした結果や、さらにベクトル量子化した結果
などを転送するもできる。音声分析した結果やベクトル
量子化した結果を送ることにより、転送する情報量を減
らすことができる。サーバ２０では、転送されたデータ
を端末接続部２３で受け取り、そのデータを演算部２２
に送る。In step S15, the terminal 10 connects to the server from the server connection unit 18 via the network 20. Then, the terminal 10 transfers data necessary for voice recognition to the server 20. Here, the data necessary for speech recognition may be the input speech data itself,
The results of the acoustic analysis and the results of the vector quantization can also be transferred. By transmitting the result of voice analysis and the result of vector quantization, the amount of information to be transferred can be reduced. In the server 20, the transferred data is received by the terminal connection unit 23, and the data is
Send to

【００５２】ステップ１６においては、サーバ２０に演
算部は、端末１０から送られたデータを認識辞書部２１
の大規模辞書２４に保持されているデータと照合して、
認識を行う。そして、演算部２２は、大規模辞書２４を
用いた音声認識の結果を端末接続部２３からネットワー
ク３０を介して端末１０に送る。最後のステップＳ２０
においては、端末１０は、サーバより送られてきた認識
結果を最終結果として出力する。In step 16, the arithmetic unit in the server 20 transmits the data sent from the terminal 10 to the recognition dictionary unit 21.
Is compared with the data held in the large-scale dictionary 24 of
Perform recognition. Then, the arithmetic unit 22 sends the result of the speech recognition using the large-scale dictionary 24 from the terminal connection unit 23 to the terminal 10 via the network 30. Last step S20
In, the terminal 10 outputs the recognition result sent from the server as a final result.

【００５３】なお、サーバ２０による大規模辞書２４を
用いた認識時においては、以下の処理も同時に行うこと
もできる。At the time of recognition by the server 20 using the large-scale dictionary 24, the following processing can be performed at the same time.

【００５４】すなわち、Ｓ１７において、大規模辞書２
４中の各パターン、特に単語認識に関しては、大規模辞
書２４を参照して認識結果を得る際に、過去に何回認識
結果として利用されているかを保持しておくようにす
る。そして、大規模辞書を参照した認識が行われる度
に、認識結果として得られたパターンの利用回数に１を
加える。That is, in S17, the large dictionary 2
Regarding each pattern in No. 4, especially word recognition, when obtaining a recognition result by referring to the large-scale dictionary 24, how many times the pattern has been used as a recognition result in the past is stored. Each time recognition is performed with reference to the large-scale dictionary, 1 is added to the number of times the pattern obtained as a result of recognition is used.

【００５５】そして、ステップＳ１８において、過去の
利用回数がある閾値を超えた場合、またはある期間にお
ける利用回数の頻度がある閾値を越えた場合には、ステ
ップＳ１９において該当するパターンをサーバより端末
に転送し、端末中の小規模辞書に登録する。If, in step S18, the number of past uses exceeds a certain threshold, or if the frequency of uses in a certain period exceeds a certain threshold, the corresponding pattern is sent from the server to the terminal in step S19. Transfer and register in the small dictionary in the terminal.

【００５６】このように、頻繁にアクセスされるパター
ンを端末１０の小規模辞書１５に転送することによっ
て、端末１０がサーバ２０にアクセスする回数を減らす
ことができる。さらに、認識処理のレスポンスの改善や
ネットワーク３０の負荷の低減に効果がある。As described above, by transferring frequently accessed patterns to the small dictionary 15 of the terminal 10, the number of times the terminal 10 accesses the server 20 can be reduced. Further, it is effective in improving the response of the recognition process and reducing the load on the network 30.

【００５７】なお、端末１０の小規模辞書１５へのパタ
ーンの登録は、上記のように音声認識装置により自動的
に行うことも可能であるし、登録前にユーザの確認を取
るように構成構成することも可能である。The registration of the pattern in the small-scale dictionary 15 of the terminal 10 can be automatically performed by the voice recognition device as described above. It is also possible.

【００５８】小規模辞書１５には登録されていないが、
大規模辞書２４に登録されている単語で、ユーザがよく
利用する単語に関しては、小規模辞書１５にも登録する
ことによって、大規模辞書２４へのアクセス回数を減ら
すことができ、さらなるレスポンスとネットワーク３０
負荷の改善が可能となる。Although not registered in the small dictionary 15,
Of the words registered in the large-scale dictionary 24, words frequently used by the user are also registered in the small-scale dictionary 15, so that the number of accesses to the large-scale dictionary 24 can be reduced. 30
The load can be improved.

【００５９】続いて、音声認識装置の実際の処理の例に
ついて説明する。音声認識装置の実際の処理の例として
は、端末１０によるデータベース検索に上記のような音
声認識装置を組み込むことが考えられる。例えば図書情
報を検索する端末１０と、図書情報を待つサーバ２０を
考える。図面のクリアやスクロールなど端末１０を制御
するコマンドなどを小規模辞書１５に保持し、図書の名
称や著者名、出版社名など図書情報に関連し、またその
数も多いものを大規模辞書２４に登録する。Next, an example of actual processing of the speech recognition apparatus will be described. As an example of actual processing of the voice recognition device, it is conceivable to incorporate the above-described voice recognition device into a database search by the terminal 10. For example, consider a terminal 10 that searches for book information and a server 20 that waits for book information. Commands for controlling the terminal 10, such as clearing and scrolling of drawings, are stored in the small dictionary 15, and those related to the book information such as book names, author names, and publisher names, and those having a large number are stored in the large-scale dictionary 24. Register with.

【００６０】ユーザは画面のクリアなど良く用いられる
コマンドに関しては、端末１０に内蔵の小規模辞書１５
による認識で、ネットワーク３０を利用することなくす
ばやいレスポンスで端末を操作することが可能である。
そして図書の名前など小規模辞書１５に登録されていな
いものを入力した場合は、端末１０は自動的にネットワ
ーク３０を経由してサーバ２０に接続し、認識処理を実
行するので、ユーザは認識用辞書が分散配置されている
ことやネットワークと接続されていることなどを意識す
ることなく端末を利用することができる。The user uses the small dictionary 15 built in the terminal 10 for frequently used commands such as clearing the screen.
, The terminal can be operated with a quick response without using the network 30.
If the user inputs a name that is not registered in the small dictionary 15 such as a book name, the terminal 10 automatically connects to the server 20 via the network 30 and executes a recognition process. The terminal can be used without being aware that the dictionaries are distributed and connected to the network.

【００６１】またこの時、端末１０とサーバ２０の間の
通信のプロトコルを決めることによって、個々の機器の
制御コマンドの名称などは各機器毎の独自に決めること
が可能となる。At this time, by determining the communication protocol between the terminal 10 and the server 20, the names of the control commands of the individual devices can be determined independently for each device.

【００６２】次に、本発明の第２の実施の形態として、
パターン認識により手書き文字の認識を行う手書き文字
認識装置について説明する。Next, as a second embodiment of the present invention,
A handwritten character recognition device that recognizes handwritten characters by pattern recognition will be described.

【００６３】まず、手書き文字認識装置の構成につい
て、図３を参照して説明する。なお、この図３において
は、図１に示した音声認識装置に対応する部分について
は同一の符号を附することにする。First, the configuration of the handwritten character recognition device will be described with reference to FIG. In FIG. 3, parts corresponding to those of the speech recognition apparatus shown in FIG. 1 are denoted by the same reference numerals.

【００６４】音声認識装置は、ユーザによりデータの入
出力の操作がなわれる端末１０と、大容量のデータベー
スを備えてデータの処理を集中して行うサーバ２０とか
ら構成されている。これら端末１０およびサーバ２０
は、情報を伝送する路線であるネットワーク３０により
接続が可能である。The speech recognition device is composed of a terminal 10 on which data input / output operation is performed by a user, and a server 20 having a large-capacity database and performing centralized data processing. These terminal 10 and server 20
Can be connected by a network 30 which is a line for transmitting information.

【００６５】端末１０は、ユーザによるデータの入出力
を受け付け、ネットワーク３０を介してサーバ２０と接
続するものである。端末１０は、音声の入力を受け付け
る入力部１１と、パターン辞書に相当する認識辞書部１
２と、音声パターンを認識辞書部１２により照合する演
算部１３と、演算部１３から送られた結果を出力する出
力部１４と、ネットワーク３０を介してサーバ２０と接
続するサーバ接続部１８とを有している。The terminal 10 receives data input and output by the user and connects to the server 20 via the network 30. The terminal 10 includes an input unit 11 for receiving a voice input and a recognition dictionary unit 1 corresponding to a pattern dictionary.
2, an operation unit 13 for collating the voice pattern by the recognition dictionary unit 12, an output unit 14 for outputting the result sent from the operation unit 13, and a server connection unit 18 for connecting to the server 20 via the network 30. Have.

【００６６】認識対象となる手書き文字パターンの入力
を受け付ける入力部１１は、タッチパッドなどで構成さ
れる。入力部１１では、入力された手書き文字パターン
をデジタル化して演算部１３に送る。The input unit 11 for receiving an input of a handwritten character pattern to be recognized is constituted by a touch pad or the like. The input unit 11 digitizes the input handwritten character pattern and sends it to the calculation unit 13.

【００６７】パターン辞書に相当する小規模辞書１５を
備える認識辞書部１２は、認識の基本単位となる文字パ
ターン１９と、文字パターン１９がどのように構成され
るか規定する単語辞書・文法１７を有している。The recognition dictionary unit 12 provided with the small-scale dictionary 15 corresponding to the pattern dictionary stores a character pattern 19 as a basic unit of recognition and a word dictionary / grammar 17 defining how the character pattern 19 is formed. Have.

【００６８】なお、端末１０の他の部分の構成、および
この端末１０における認識の処理については、上述した
音声認識装置と同様であるので説明を省略する。Note that the configuration of the other parts of the terminal 10 and the processing of recognition in the terminal 10 are the same as those of the above-described speech recognition apparatus, and a description thereof will be omitted.

【００６９】サーバ２０は、パターン辞書に相当する認
識辞書部２１と、音声パターンを認識辞書部２１を用い
て認識する演算部２２と、ネットワーク３０に接続する
インターフェースである端末接続部２３とを有してい
る。The server 20 has a recognition dictionary unit 21 corresponding to a pattern dictionary, an operation unit 22 for recognizing a voice pattern using the recognition dictionary unit 21, and a terminal connection unit 23 which is an interface for connecting to the network 30. are doing.

【００７０】認識辞書部２１に備えられる、小規模辞書
１４より規模の大きい大規模辞書２４は、小規模辞書１
５より精細な文字パターン１９、大語彙の単語辞書、複
雑な文法２６を備えている。The large dictionary 24 provided in the recognition dictionary unit 21 and having a larger size than the small dictionary 14 is
5 has a finer character pattern 19, a large vocabulary word dictionary, and a complicated grammar 26.

【００７１】なお、サーバ２０の他の部分の構成、およ
びこのサーバ２０における認識の処理については、上述
した音声認識装置と同様であるので説明を省略する。The configuration of the other parts of the server 20 and the processing of recognition in the server 20 are the same as those of the above-described speech recognition apparatus, and therefore the description is omitted.

【００７２】このように、本実施の形態の手書き文字認
識装置においては、小規模辞書１５および大規模辞書２
４に音響パターンにかわって文字パターンが認識されて
いること、および入力部１１がタッチパッド構成され、
入力された手書き文字をデジタル化することが、上述し
た第１の実施の形態の音声認識装置と相違している。As described above, in the handwritten character recognition device of the present embodiment, the small-scale dictionary 15 and the large-scale dictionary 2
4 that the character pattern is recognized in place of the acoustic pattern, and the input unit 11 is configured with a touch pad,
The digitization of the input handwritten character is different from the voice recognition device of the first embodiment described above.

【００７３】この手書き文字認識装置における処理手順
は、音声パターンに代わって文字パターンを処理するこ
とをのぞくと、図２で示した音声認識装置における処理
手順と原則として同一である。The processing procedure in the handwritten character recognition apparatus is basically the same as the processing procedure in the voice recognition apparatus shown in FIG. 2, except that a character pattern is processed instead of a voice pattern.

【００７４】すなわち、手書き文字認識装置の処理工程
を、図２によって説明すると、最初のステップＳ１１で
は、端末１０において、音声入力に代わって、入力部１
１のタッチパッドなどに手書き文字を入力する。次のス
テップＳ１２においては、」演算部１３は、入力部１１
入力された手書き文字を認識辞書部１２の小規模辞書１
５で文字パターン１９および単語辞書・文法１７を用い
て認識する。That is, the processing steps of the handwritten character recognition device will be described with reference to FIG. 2. In the first step S11, the terminal 10 replaces the voice input with the input unit 1
1. Input handwritten characters on the touch pad or the like. In the next step S12, the "operation unit 13
The small dictionary 1 of the recognition dictionary unit 12 recognizes the input handwritten characters.
In step 5, the character pattern 19 and the word dictionary / grammar 17 are used for recognition.

【００７５】ステップＳ１３においては、ステップＳ１
２における認識の結果を判断する。この認識結果の判断
は、上述したように、例えば信頼性尺度（confidence m
easure）を基準として自動で行う方法と、ユーザに提示
して判断する方法とがある。認識結果の判断がＯＫとさ
れると、ステップＳ２０に進んで最終結果を出力部１４
に表示して終了する。認識結果の判断がＮＧと判断され
ると、ステップＳ１５に進む。In step S13, step S1
The result of the recognition in 2 is determined. As described above, the determination of the recognition result is performed, for example, using a confidence measure (confidence m).
There is a method of automatically performing the determination based on the easure) and a method of making a determination by presenting to the user. If the determination of the recognition result is OK, the process proceeds to step S20, and the final result is output to the output unit 14.
And exit. If the determination of the recognition result is NG, the process proceeds to step S15.

【００７６】ステップＳ１５においては、端末１０は、
サーバ接続部１８によりネットワーク３０を介してサー
バ３０に接続し、入力された手書き文字をサーバ２０に
送る。ここで、サーバ２０に送られる手書き文字のデー
タは、入力された手書き文字そのものである必要はな
く、分析をした結果や、適当な量子化をした結果を送る
ことにより、転送に要する情報量を提言することができ
るのは、上述の音声認識装置と同様である。In step S15, the terminal 10
The server 30 connects to the server 30 via the network 30 via the server connection unit 18 and sends the input handwritten characters to the server 20. Here, the handwritten character data sent to the server 20 does not need to be the input handwritten character itself, and the amount of information required for transfer can be reduced by sending the result of analysis or the result of appropriate quantization. What can be recommended is the same as the above-described speech recognition device.

【００７７】ステップＳ１６においては、サーバ２０に
おいて、演算部２２は、認識処理部２１の備える大規模
辞書２４の文字パターン２７および単語辞書・文法２６
を用いて、手書き文字を認識する。そして、サーバ２０
は、認識処理の結果を端末接続部２３によりネットワー
ク３０を介して端末１０に送る。そして、ステップＳ２
０においては、端末は、サーバにて認識された結果を出
力部１４に表示して、一連の工程を終了する。In step S 16, in the server 20, the calculation unit 22 determines the character pattern 27 and the word dictionary / grammar 26 of the large-scale dictionary 24 included in the recognition processing unit 21.
Is used to recognize handwritten characters. And the server 20
Sends the result of the recognition process to the terminal 10 via the network 30 by the terminal connection unit 23. Then, step S2
In the case of 0, the terminal displays the result recognized by the server on the output unit 14 and ends the series of steps.

【００７８】なお、認識結果の利用回数を計測し、アク
セスに頻度により大規模辞書２４に登録された単語を小
規模辞書に登録することにより、端末１０のサーバ２０
へのアクセス回数を提言することができるのも、上述し
た音声認識装置と同様である。このような大規模辞書２
４から小規模辞書１５への単語の登録については、ステ
ップＳ１７からステップＳ１９に示されているが、上述
の第１の実施の形態と同様であるので、説明を省略する
ことにする。The number of times the recognition result is used is measured, and words registered in the large-scale dictionary 24 are registered in the small-scale dictionary according to the frequency of access, whereby the server 20 of the terminal 10
It is also possible to propose the number of accesses to the same as in the above-described speech recognition device. Such a large-scale dictionary 2
The registration of the word from No. 4 to the small dictionary 15 is shown in steps S17 to S19, but is the same as in the first embodiment described above, and will not be described.

【００７９】以上のように、手書き文字認識装置におけ
る処理手順は、音声認識装置における処理手順と、パタ
ーン認識用のアルゴリズムの処理が、音響パターン認識
により行う音声認識用であるか、手書き文字認識による
手書き文字認識用であるかが相違している。As described above, the processing procedure in the handwritten character recognition device is either the processing procedure in the voice recognition device and the processing of the pattern recognition algorithm for voice recognition performed by acoustic pattern recognition, or the handwriting character recognition. It is different for handwritten character recognition.

【００８０】続いて、この手書き文字認識装置におけ
る、実施の処理について説明する。Next, a description will be given of a process performed by the handwritten character recognition apparatus.

【００８１】手書き文字認識装置においては、音声認識
装置ではユーザから音声によって情報を入手していたの
だが、手書き文字による入力に変更される。これによっ
て人名や地名などにある難しい漢字など、通常端末１０
の小規模辞書１５には登録されていないような文字も、
ネットワーク３０を介して大規模辞書２４に接続するこ
とによって認識することが可能となる。In the handwritten character recognition device, information is obtained by voice from the user in the voice recognition device, but the information is changed to input by handwritten characters. This makes it difficult for ordinary terminals 10 to use difficult kanji in personal names and place names.
Characters that are not registered in the small dictionary 15 of
By connecting to the large-scale dictionary 24 via the network 30, recognition becomes possible.

【００８２】なお、図３に示した手書き文字認識装置の
構成においては、小規模辞書１５と大規模辞書２４を端
末１０とサーバ２０に分散して配置した構成を示した。
しかし、小規模辞書１５と大規模辞書２４の両方を同一
の装置内に内蔵することも可能である。その場合、まず
小規模辞書１５による認識を行うことによって、頻度が
高く入力されるものはすばやいレスポンスで認識処理を
行うことができる。In the configuration of the handwritten character recognition device shown in FIG. 3, a configuration in which the small-scale dictionary 15 and the large-scale dictionary 24 are distributed and arranged in the terminal 10 and the server 20 is shown.
However, both the small dictionary 15 and the large dictionary 24 can be built in the same device. In that case, by performing recognition using the small-scale dictionary 15 first, those frequently input can be recognized with a quick response.

【００８３】また、第１の実施形態の音声認識装置にお
いても、小規模辞書１５と大規模辞書２４を同一の装置
に備えることにより、頻度が高く入力されるものはすば
やいレスポンスで認識処理を行うことができる。Also, in the speech recognition apparatus of the first embodiment, by providing the small dictionary 15 and the large dictionary 24 in the same apparatus, those which are frequently input are recognized with a quick response. be able to.

【００８４】以上説明したように、本実施の形態は、パ
ターン認識の際に用いる照合用の辞書を小規模なものと
大機語なものに分けて保持し、それを適宜使い分けるこ
とによって認識処理のレスポンス短縮と認識対象パター
ンの増大を実現する。さらに小規模辞書を端末に、大規
模辞書をネットワークで接続可能なサーバに備えること
で、小型端末における認識対象パターンの増大とネット
ワーク負荷の低減を実現するものである。As described above, according to the present embodiment, the recognition dictionary used for pattern recognition is divided into small ones and large ones, and the dictionary is used for recognition processing by appropriately using them. To shorten the response and increase the number of patterns to be recognized. Furthermore, by providing a small dictionary in a terminal and a large dictionary in a server connectable to a network, it is possible to increase the number of recognition target patterns and reduce the network load in a small terminal.

【００８５】[0085]

【発明の効果】本発明は、小規模なパターン辞書と大規
模なパターン辞書を併用することによって、頻繁に利用
されるパターンを小規模辞書に登録して、パターン認識
のレスポンスを早めると同時に、大規模辞書へのアクセ
スの回数を減らすことが可能となる。これによって、大
規模辞書を利用したパターン認識がネットワークを介し
て接続可能なサーバによって提供されている場合は、常
時サーバを利用してパターン認識を行う場合に比べて、
ネットワーク負荷の低減が可能になる。According to the present invention, by using a small-sized pattern dictionary and a large-sized pattern dictionary together, frequently used patterns are registered in the small-sized dictionary, and the response of the pattern recognition is accelerated. It is possible to reduce the number of accesses to a large-scale dictionary. Thus, when pattern recognition using a large-scale dictionary is provided by a server that can be connected via a network, compared to the case where pattern recognition is always performed using a server,
Network load can be reduced.

[Brief description of the drawings]

【図１】音声認識装置の概略的な構成を示すブロック図
である。FIG. 1 is a block diagram illustrating a schematic configuration of a speech recognition device.

【図２】音声認識装置における処理手順を示すフローチ
ャートである。FIG. 2 is a flowchart illustrating a processing procedure in the voice recognition device.

【図３】手書き文字認識装置の概略的な構成を示すブロ
ック図である。FIG. 3 is a block diagram illustrating a schematic configuration of a handwritten character recognition device.

【図４】情報アクセスシステムの一例の構成を示すブロ
ック図である。FIG. 4 is a block diagram illustrating a configuration of an example of an information access system.

[Explanation of symbols]

１０端末、１５小規模辞書、２０サーバ、２４
大規模辞書、３０ネットワーク10 terminals, 15 small dictionaries, 20 servers, 24
Large dictionary, 30 networks

───────────────────────────────────────────────────── フロントページの続き (72)発明者本田等東京都品川区北品川６丁目７番35号ソニー株式会社内 (72)発明者浜田修東京都品川区北品川６丁目７番35号ソニー株式会社内Ｆターム(参考） 5B064 DA15 DA27 DA29 5D015 HH13 LL10 9A001 HH17 HH21 HH22 ──────────────────────────────────────────────────続き Continuing on the front page (72) Honda, etc. 6-7-35 Kita Shinagawa, Shinagawa-ku, Tokyo Inside Sony Corporation (72) Osamu Hamada Osamu Hamada 6-35, Kita-Shinagawa, Shinagawa-ku, Tokyo Sony Corporation F term (reference) 5B064 DA15 DA27 DA29 5D015 HH13 LL10 9A001 HH17 HH21 HH22

Claims

[Claims]

An input unit for inputting a pattern; a first dictionary serving as a reference used for pattern recognition; and a second dictionary serving as a reference used for pattern recognition, the second dictionary being larger in size than the first dictionary. Dictionary, recognition processing means for recognizing patterns, determination processing means for determining whether or not the result of pattern recognition is appropriate, output means for outputting pattern recognition results, and input to the input means The recognized pattern is recognized by the recognition processing means using the first dictionary, and the result of recognition by the recognition processing means is determined by the determination processing means;
If the determination processing means determines that the recognition result is appropriate, the output means outputs the recognition result. If the determination processing means does not determine that the recognition result is appropriate, the pattern is recognized. A pattern recognition device, comprising: control means for making a determination using the second dictionary in the processing means and controlling the recognition result to be output in the output means.

2. The method according to claim 1, wherein said determination processing means automatically determines whether or not the recognition result obtained by said recognition processing means is appropriate, using a scale obtained in a process of pattern recognition. Pattern recognition device.

3. The pattern recognition apparatus according to claim 2, wherein said recognition processing means employs a reliability measure as said measure.

4. The pattern recognition device according to claim 1, wherein a pattern used for recognition in said second dictionary is registered in said first dictionary.

5. The method according to claim 1, wherein the registration of the pattern recognized by the second dictionary in the first dictionary is automatically performed in accordance with the frequency of reference to the second dictionary. The pattern recognition device according to claim 4, wherein

6. The pattern recognition apparatus according to claim 4, wherein no pattern is registered in the first dictionary in an initial state.

7. The pattern recognition apparatus according to claim 1, wherein said recognition processing means recognizes at least one of a voice, an image, and a character.

8. An input means for inputting a pattern, a first dictionary serving as a reference used for pattern recognition, and a second dictionary serving as a reference used for pattern recognition, the second dictionary being larger in size than the first dictionary. A recognition processing means for recognizing a pattern; an output means for outputting data; a confirmation means for receiving confirmation of a result of pattern recognition; and a first processing means for recognizing a pattern inputted to the input means in the first recognition processing means. And the recognition result in the recognition processing means is judged by the judgment processing means.
The result of the judgment by the judgment processing means is presented to the confirmation means. When the confirmation result is confirmed by the confirmation means, the recognition result is outputted by the output means, and the confirmation result is confirmed by the confirmation means. A recognition unit that uses the second dictionary to recognize that the recognition result has not been received, and a control unit that controls the output unit to output the recognition result.

9. The system according to claim 8, wherein said control means presents to said presenting means a scale obtained in the course of pattern recognition by said recognition processing means together with a result of the recognition processing by said recognition processing means. Pattern recognition device.

10. The pattern recognition apparatus according to claim 9, wherein said recognition processing means adopts a reliability scale as said scale.

11. The pattern recognition apparatus according to claim 8, wherein a pattern used for recognition in said second dictionary is registered in said first dictionary.

12. The registration of a pattern recognized by the second dictionary in the first dictionary is automatically performed according to the frequency of reference to the second dictionary. The pattern recognition device according to claim 11, wherein

13. The pattern recognition apparatus according to claim 11, wherein a pattern is not registered in the first dictionary in an initial state.

14. A first reference used for pattern recognition.
And a dictionary for pattern recognition,
An input step of inputting a pattern, a recognition processing step of recognizing the pattern, and a judgment of judging whether or not the result of the pattern recognition is appropriate, using a second dictionary that is larger than the first dictionary. A processing step, an output step of outputting a recognition result of the pattern, and a pattern input to the input step is recognized using the first dictionary in the recognition processing step, and the recognition result in the recognition processing step is determined. Judge in the processing process,
If the recognition result is determined to be appropriate in the determination processing step, the recognition result is output in the output step. If the recognition result is not determined to be appropriate in the determination processing step, the pattern is recognized. A control step of making a determination in the processing step using the second dictionary and controlling the recognition result to be output in the output step.

15. A first reference used for pattern recognition.
An input step of inputting a pattern using a dictionary which is a reference dictionary used for pattern recognition and a second dictionary which is larger than the first dictionary, a recognition processing step of recognizing the pattern, An output step of outputting data; a confirmation step of receiving confirmation of a pattern recognition result; a pattern input to the input step is recognized using the first dictionary in the recognition processing step; The recognition result is determined in the above determination processing step,
The judgment result in the judgment processing step is presented to the confirmation step. When the recognition result is confirmed in the confirmation step, the recognition result is output in the output step, and the recognition result is confirmed in the confirmation step. A recognition step using the second dictionary in the recognition processing step if the recognition is not received, and a control step of controlling the output step to output the recognition result.

16. An input means for inputting a pattern, a first dictionary used as a reference for pattern recognition, first recognition processing means for recognizing a pattern using the first dictionary, and pattern recognition. A terminal having judgment processing means for judging whether or not the result of the pattern is appropriate, output means for outputting a pattern recognition result, and control means for controlling pattern recognition; and a reference used for pattern recognition. A second dictionary, which is a dictionary larger than the first dictionary, and a server having second recognition processing means for performing a pattern recognition process using the second dictionary are connected by a network; A pattern recognition system, wherein the control means recognizes the pattern input to the input means using the first dictionary in the first recognition processing means, The recognition result in the recognition processing means is judged by the judgment processing means, and when the judgment processing means judges that the recognition result is appropriate, the recognition result is outputted by the output means, and the judgment result is outputted by the judgment processing means. If the recognition result is not determined to be appropriate, the pattern is determined by the second recognition processing means using the second dictionary, and control is performed so that the recognition result is output by the output means. And a pattern recognition system.