JPH03110675A

JPH03110675A - Word dictionary retrieving device

Info

Publication number: JPH03110675A
Application number: JP1250039A
Authority: JP
Inventors: Shunichi Fukushima; 俊一福島
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1989-09-25
Filing date: 1989-09-25
Publication date: 1991-05-10
Anticipated expiration: 2011-08-21
Also published as: JP2526678B2

Abstract

PURPOSE:To execute the collation with a word dictionary at a high speed by providing a first - an m-th candidate shift registers of an (n) character portion, respectively for executing a forward feed of one character each synchronizing with a shift clock, and a first character - an n-th character comparater. CONSTITUTION:The device is provided with an input device 5 for inputting a character-string in which (m) kinds of candidates exist, (s) kinds of work dictionary memories 1 having data width of an (n) character portion, a controller 7 for generating a clock, whenever (m) kinds of candidates to one character are inputted by the input device, and an address counter 4 being common to the word dictionary memory. Also, the device is provided with a first, a second,... an (m)-th candidate shift registers 2 of an (n) character portion, respectively, a switch 8 for selecting the word dictionary memory, a first character, a second character,... an (n)-th character comparator 3, and a deciding circuit 6 for deciding that a word in the word dictionary memory appears in an inputted character-string in the case a coincidence signal is detected from all the compar ing circuits. In such a way, even if each character has plural candiates, the collation with a word dictionary can be executed at a high speed.

Description

【発明の詳細な説明】（産業上の利用分野）本願発明は、入力文字列と単語辞書とを照合して、入力
文字列において、単語辞書内に存在する単語が出現した
部分を検索する単語辞書検索装置に関するものである。Detailed Description of the Invention (Industrial Field of Application) The present invention is a word dictionary that searches for a portion of the input string in which a word that exists in the word dictionary appears by comparing an input string with a word dictionary. The present invention relates to a dictionary search device.

ただし、入力文字列は、各文字が複数の候補をもつ場合
と、１個しかもたない場合がある。また、単語辞書検索
に置ける検索キーは、単語の表記（漢字かな混じり文字
列）の場合と、読みがな（かな文字列）の場合が多い。However, in the input character string, each character may have multiple candidates or may have only one candidate. In addition, search keys that can be used in word dictionary searches are often the word notation (character strings containing kanji and kana) and the readings (kana character strings).

入力文字列の各文字に複数の候補がなく一意な場合の単
語辞書検索装置は、かな漢字変換装置においてキーボー
ドから入力されなかな文字列に対する単語辞書検索を行
なう部分や、機械翻訳装置・文音声変換装置・文章校正
装置などにおいてワードプロセッサなどで作成された漢
字かな混じり文字列に対する単語辞書検索を行なう部分
などに用いられている。前者は検索キーが読みがなで、
後者は検索キーが表記である。When each character in the input character string is unique without multiple candidates, the word dictionary search device is the part of the kana-kanji converter that performs a word dictionary search for the kana character string entered from the keyboard, machine translation equipment, and sentence-to-speech converters. It is used in the parts of devices and grammar proofing devices that perform word dictionary searches for character strings containing kanji and kana created using word processors. In the former case, the search key is read as Na.
In the latter case, the search key is a notation.

入力文字列の各文字に複数の候補がある場合の単語辞書
検索装置は、音声認識装置や文字認識装置など番こおい
て、認識結果の複数の候補文字から最も確からしい文字
を選択するための単語辞書検索部などに用いられている
。音声認識の場合は検索キーが読みがな（音節）で、文
字認識の場合は検索キーが表記である。When there are multiple candidates for each character in the input character string, the word dictionary search device uses a voice recognition device, character recognition device, etc. to select the most likely character from the multiple candidate characters in the recognition result. It is used in word dictionary search parts, etc. In the case of voice recognition, the search key is the pronunciation (syllable), and in the case of character recognition, the search key is the notation.

（従来の技術）従来、かな漢字変換装置・文音声変換装置などにおいて
用いられている単語辞書検索方式は、基本的に、文献：
［計算機によるカナ漢字変換」（和訳・江原、Ｎ　ＨＫ
技術研究、第２５巻第５号、２３頁〜６０頁、１９７３
年）に示されている方式を引き継いでいる。すなわち、
入力された文字列から部分文字列を切り出して、その部
分文字列に表記（かな漢字変換では「かな表記」ずなわ
ち「読みがな」）の一致する単語を単語辞書内から検索
するという処理を行なう（これを、以下では、第１の従
来技術と呼ぶことにする）。(Prior art) The word dictionary search methods conventionally used in kana-kanji conversion devices, sentence-to-speech conversion devices, etc. are basically based on the documents:
[Kana-Kanji Conversion by Computer] (Japanese translation, Ebara, NHK
Technical Research, Vol. 25, No. 5, pp. 23-60, 1973
It follows the method shown in 2010). That is,
The process of cutting out a substring from the input string and searching the word dictionary for a word that matches the notation (in kana-kanji conversion, "kana notation" or "reading") in the substring. (hereinafter, this will be referred to as the first prior art).

第１の従来技術では、例えば、「文章を解析する」とい
う漢字かな混じり文字列に対する単語辞書検索を行なう
場合は、１文字目・２文字目など各文字位置から始まる
単語を検索するために、「文章を解析する」　「章を解
析する」　「を解析する」　「解析する」　「析する」
　「する」　「る」といった部分文字列を別々に切り出
す、そして、各部分文字列について、さらに、末尾を削
った部分文字列も生成して、その各々を単語辞書から検
索する。すなわち、！文字目から始まる単語を検索する
際には、「文章を解析する」　「文章を解析す」「文章
を解析」　「文章を解」　「文章をＪ’久皇Ｊ「文」と
いった部分文字列に関して、検索を次々に繰り返し、２
文字目から始まる単語を検索する際には、「章を解析す
る」　「章を解析す」　「章を解析」　「章を解」　「
章を」「阜」といった部分文字列に関して、検索を次々
に繰り返すわけである。In the first conventional technique, for example, when performing a word dictionary search for a character string containing kanji and kana, such as "analyzing a sentence," in order to search for words starting from each character position such as the first or second character, "Analyze a text""Analyze a chapter""Analyze""Analyze""Analyze"
Partial strings such as ``suru'' and ``ru'' are extracted separately, and for each partial string, a partial string with the end removed is also generated, and each of them is searched from the word dictionary. In other words! When searching for words that start with the first letter, search for substrings such as ``parse the sentence'', ``analyze the sentence'', ``analyze the sentence'', ``solve the sentence'', ``sent wo J'kyuouJ``bun''. , repeat the search one after another, 2
When searching for words starting with the letter ``parse chapter'' ``parse chapter'' ``parse chapter'' ``solve chapter''
Searches are repeated one after another for substrings such as ``Chapter'' and ``阜''.

その結果、下線を引いた部分文字列などについて、表記
の一致する単語が見つかることになる。As a result, a word that matches the notation of the underlined substring is found.

この第１の従来技術では、上述のように、複数の部分文
字列に対して単語辞書の検索を繰り返すわけであるが、
単語辞書の構成を工夫しておくことによって、１つの部
分文字列に対する検索の所定時間の短縮を図っている１
例えば、単語辞書内の単語をあらかじめソートしておい
て二分検索を行なったり、１文字目・２文字目など文字
ごとに分割して共通な部分をまとめて木構造として１文
字ずつの照合を行なうなどである。このような辞書の構
造と検索方式の工夫については、文献：“Ｔｈｅ　Ａｒ
ｔ　ｏｆ　Ｃｏｍｐｕｔｅｒ　Ｐｒｏｇｒａｍｍｉｎｇ
　３：　Ｓｏｒｔｉｎｇａｎｄ　Ｓｅａｒｃｈｉｎｇ”
　（口、Ｆ、にｎ１Ｊｔｈ、ＡｄｄｉＳＯｎ−１４ｅｓ
ｌｅＶ。In this first conventional technique, as mentioned above, word dictionary searches are repeated for multiple substrings.
By devising the structure of the word dictionary, the predetermined search time for a single substring can be shortened.1
For example, you can sort the words in a word dictionary in advance and perform a binary search, or you can divide the words into characters such as the first and second characters, group the common parts together, and perform character-by-character matching using a tree structure. etc. Regarding the structure of such a dictionary and improvements to the search method, please refer to the literature: “The Ar
of Computer Programming
3: Sorting and Searching”
(mouth, F, n1Jth, AddiSOn-14es
leV.

１９７３）に多く示されている。(1973).

ただし、この第１の従来技術は、コンピュータ上の逐次
型プログラムとしての実現を想定しており、所要時間短
縮の工夫は逐次処理の枠内での工夫である。したがって
、入力文字列のある部分文字列と単語辞書内のある単語
の表記の文字列とを比較する処理などは、１文字ずつ逐
次性なわれる。However, this first conventional technique is intended to be implemented as a sequential program on a computer, and the efforts to reduce the required time are within the framework of sequential processing. Therefore, the process of comparing a certain partial character string of an input character string with a character string representing a certain word in a word dictionary is performed character by character sequentially.

例えば、「文章を」という部分文字列と単語辞書内の「
文章」という単語の比較は、「文」と１文」とを照合し
、一致したところで「章」と１章」とを照合し、次に、
「を」に対して不一致となり失敗するというように、１
文字ずつ照合処理の繰り返しである。For example, the substring ``sentence'' and ``sentence'' in the word dictionary are
To compare the word ``sentence'', ``sentence'' and 1st sentence'' are matched, and when they match, ``chapter'' and 1st chapter'' are compared, and then,
1.
The matching process is repeated character by character.

また、第１の従来技術では、入力文字列の各文字にｍ個
ずつ候補があるような場合は、あらかじめ、長さしの部
分文字列に対して、候補を組み合わせたｍ’通りの文字
列を生成する必要がある。In addition, in the first conventional technique, when there are m candidates for each character of the input character string, m' combinations of candidates are prepared in advance for a partial character string of length. need to be generated.

そして、それらの各々について、さらに、上記のように
末尾を削った部分文字列を生成して、単語辞書内を検索
することになる。Then, for each of them, a partial character string with the end truncated as described above is generated and searched in the word dictionary.

一方、ある長さしの文字列と入力文字列との比較の際に
、１文字ずつの照合を繰り返すのではなく、対応する文
字位置のＬ組の照合を同時に行なうことが考えられてい
る（以下、第２の従来技術と呼ぶ、）その際、入力文字
列をシフトレジスタに格納しておけば、照合の位置をず
らすことが、入力文字列から再度部分文字列を切り出す
のではなく、シフトレジスタ内の全文字を一度に１文字
ずらすことで実現できるようになる。この第２の従来技
術は、特開昭６３−２６１４２１号公報ｒ文字列処理装
置」および特開昭６３−２６１４２２号公報ｒ文字列照
合装置」に述べられている。なお、この第２の従来技術
では、入力文字列に複数の候補は存在せず、検索される
文字列も１通りに限られるもので単語辞書のように多数
ではない、特開昭６２−６７６３６号公報「照合方式」
および文献：［音声日本語入力システムにおける高速な
言語処理のための辞書照合アルゴリズム」　（浜口・鈴
木、電子情報通信学会論文誌、第Ｊ７０−Ｄ巻第８号、
１５８９頁〜１５９６頁、１９８７年）には、第３の従
来技術が示されている。On the other hand, when comparing a character string of a certain length with an input character string, it is considered that instead of repeatedly matching each character, it is possible to simultaneously match L sets of corresponding character positions ( (hereinafter referred to as the second conventional technique) At that time, if the input string is stored in a shift register, the position of the collation can be shifted instead of cutting out the partial string from the input string again. This can be achieved by shifting all the characters in the register one character at a time. This second prior art is described in JP-A No. 63-261421 "Character string processing device" and JP-A No. 63-261422 "Character string matching device". In addition, in this second conventional technique, there are no multiple candidates for the input character string, and the number of character strings to be searched is limited to one, and not as many as in a word dictionary. Publication “Verification method”
and literature: [Dictionary matching algorithm for high-speed language processing in spoken Japanese input systems] (Hamaguchi and Suzuki, Transactions of the Institute of Electronics, Information and Communication Engineers, Vol. J70-D, No. 8,
1589-1596, 1987), a third prior art is shown.

この第３の従来技術では、入力文字列に複数の文字候補
があることを想定している。まず、文字の種類をＭ（例
えば、ＪＩＳの文字コード表では、ひらがなについてＭ
＝８３）とすると、１文字目・２文字目など各文字位置
に、Ｍビットのメモリを１個ずつ用意し、複数の候補文
字に対応する各ビットを１にしておく、そして、照合時
は、単語辞書内の各単語について、単語の表記の１文字
目・２文字目など文字位置ごとに、対応するＭビットメ
モリから、対応する文字のビット内容（１まなは０）を
同時に読み出す、すべてのＭビットメモリから１が読み
出されたら、入力文字列中に単語辞書内にある単語が出
現したことになる。This third conventional technique assumes that an input character string has a plurality of character candidates. First, set the type of character to M (for example, in the JIS character code table, hiragana is M
= 83), one M-bit memory is prepared for each character position such as the first character and second character, and each bit corresponding to multiple candidate characters is set to 1. , for each word in the word dictionary, read the bit contents (1 or 0) of the corresponding character from the corresponding M-bit memory at the same time for each character position such as the first or second character of the word notation. If 1 is read from the M-bit memory of , it means that a word in the word dictionary has appeared in the input character string.

以上の第１〜第３の従来技術では、単語辞書を検索する
際の検索キーを１種類しか設けていない。In the first to third conventional techniques described above, only one type of search key is provided when searching a word dictionary.

しかし、文書処理システムでは、単語辞書検索の際に、
複数の検索キー′を設定する必要が生ずることがある０
例えば、文献：［日本語文章作成支援システムＣＯＭ　
Ｅ’ｒ−文章解析応用の統合化方式を中心に−」　（福
島・犬山、情報処理学会・文章処理とヒユーマンインタ
フェース研究会、２０−２．１９８８年）では、かな漢
字変換、文音声変換、文章校正などのアプリケーション
を統合化したシステムが述べられている。上記の文献で
は、かな漢字変換用の単語の読みがなを検索キーとして
単語の表記を得る辞書と、文音声変換・文章校正用の単
語の表記を検索キーとして単語の読みがなを得る辞書と
いう、２種類の独立した単語辞書を用いている（以下で
は、これを、第４の従来技術と呼ぶことにする）、そし
て、その各々につぃては、第１の従来技術の手法を適用
している。したがって、単語の読みがなと表記とは、２
つの辞書の両方に重複して登録されていることになる。However, in a document processing system, when searching a word dictionary,
It may be necessary to set multiple search keys'0
For example, literature: [Japanese writing support system COM
"E'r - Focusing on an integrated method for text analysis applications" (Fukushima/Inuyama, Information Processing Society of Japan/Text Processing and Human Interface Study Group, 20-2, 1988), Kana-Kanji conversion, sentence-to-speech conversion, text A system is described that integrates applications such as calibration. In the above literature, there is a dictionary that uses the pronunciation of the word as a search key for kana-kanji conversion to obtain the notation of the word, and a dictionary that uses the pronunciation of the word as the search key for sentence-to-speech conversion and grammar proofing to obtain the pronunciation of the word. , two types of independent word dictionaries are used (hereinafter, this will be referred to as the fourth conventional technique), and the method of the first conventional technique is applied to each of them. are doing. Therefore, the pronunciation and notation of a word are 2.
This means that it is registered twice in both dictionaries.

これに対して、１つの単語辞書を、単語の読みがなでも
表記でも検索できるようにしたのが、以下に述べる第５
・第６の従来技術である。In contrast, we have made it possible to search a single word dictionary regardless of the pronunciation or spelling of the word, as described in the fifth section below.
- This is the sixth conventional technology.

第５の従来技術では、単語の表記を検索キーとした第１
の従来技術にもとづく単語辞書と、漢字読み表とを利用
する。漢字読み表は漢字１文字の読み方を登録した表で
あり、単語辞書の単語の表記を漢字読み表で展開して、
単語の読みがなを生成しながら検索する。特開昭６２−
２１２８７６号公報「かな漢字相互変換装置」や特開昭
６２−２２４８５９　ｒ日本語処理方式」などに、その
手法が述べられている。In the fifth prior art, the first method uses word notation as a search key.
A word dictionary based on conventional technology and a kanji reading table are used. The kanji reading table is a table that registers the reading of a single kanji character, and the notation of the word in the word dictionary is expanded into the kanji reading table.
Search while generating readings of words. Unexamined Japanese Patent Publication 1986-
This method is described in Japanese Patent Publication No. 212876 ``Kana-Kanji Mutual Conversion Device'' and Japanese Patent Application Laid-Open No. 62-224859 ``Japanese Processing System''.

第６の従来技術では、ポインタを利用して、２通りの検
索キーについて第１の従来技術の手法が適用できるよう
な単語辞書検索をとる。すなわち、第４の従来技術の重
複した内容の部分を、ポインタで置き換えたものである
。特許公告昭６３−４２２５　ｒ電子辞書」では、第４
の従来技術における２つの単語辞書の一方のすべてをポ
インタの集合と置き換えている。特開昭６１−３０７４
１２「単語辞書構成方式」では、第４の従来技術におけ
る２つの単語辞書の検索キーが、相互にポインタで参照
し合う形態となっている。In the sixth prior art, a pointer is used to perform a word dictionary search for two types of search keys to which the method of the first prior art can be applied. In other words, the duplicate content of the fourth prior art is replaced with a pointer. Patent Publication 1986-4225 r Electronic Dictionary”, No. 4
In the prior art, one of the two word dictionaries is entirely replaced with a set of pointers. Japanese Patent Publication No. 61-3074
In No. 12 "Word Dictionary Configuration System", the search keys of two word dictionaries in the fourth prior art are configured to refer to each other using pointers.

（発明が解決しようとする課題）まず、ｌ［Ｉ′Ｍの検索キーに間する従来技術の課題を
述べる。(Problems to be Solved by the Invention) First, the problems of the prior art related to the search key of l[I'M will be described.

第１の従来技術には、第１に、前述のように、多数の部
分文字列に対する単語辞書検索を繰り返さねばならない
という欠点がある。この欠点は、入力文字列の各文字に
複数の候補がある場合に、特に顕著になる。長さＫの入
力文字列の各文字にｍ個ずつ候補がある場合、最長の部
分文字列の長さしとすると（通常、単語辞書内の最長の
単語の長さをＬとすル、）、最大（ｍ’　ｘＬｘＫ）個
の部分文字列に対する検索を繰り返す必要が生ずる。The first prior art has the drawback that, as described above, word dictionary searches for a large number of partial character strings must be repeated. This drawback is particularly noticeable when there are multiple candidates for each character in the input string. If there are m candidates for each character in an input string of length K, let the length of the longest substring be the length of the longest substring (usually, L is the length of the longest word in the word dictionary). , it becomes necessary to repeat the search for a maximum of (m' x L x K) substrings.

通常、あらかじめ条件を定めて、不要な部分文字列の検
索を避けるようにするので、（ｍ’ＸＬＸＫ）個という
のは最悪の場合であるが、それでも検索の回数はかなり
多くなる。Normally, conditions are determined in advance to avoid searching for unnecessary substrings, so (m'XLXK) is the worst case, but the number of searches is still quite large.

第１の従来技術の第２の欠点は、ある部分文字列と単語
辞書内のある単語の表記の文字列と比較を、１文字ずつ
順に行なうため、比較に時間がかかるということである
。A second drawback of the first conventional technique is that a certain partial character string is compared with a character string representing a certain word in a word dictionary in order, character by character, so the comparison takes time.

第２の従来技術では、第１の従来技術の第２の欠点に対
処している。しかし、第２の従来技術は、入力文字列に
複数の候補がある場合に適用できない、また、検索され
る文字列も１通りに限られ、単語辞書のように多数の文
字列と比較を行なうことを考えると、単語辞書内の各単
語の表記の長さの異なりが問題となって適用できない。The second prior art addresses the second drawback of the first prior art. However, the second conventional technique cannot be applied when there are multiple candidates in the input character string, and the number of character strings to be searched is limited to one, and comparisons are made with many character strings like in a word dictionary. Considering this, the difference in the length of each word in the word dictionary becomes a problem and cannot be applied.

第３の従来技術は、第１の従来技術の２つの欠点に対処
している。しかし、入力文字列を記憶するために、文字
の種類だけのビット数をもったメモリを、文字列長だけ
用意する必要があるので、文字の種類が多数の場合はメ
モリサイズがかなり大きくなってしまう問題がある。か
な漢字変換や音声認識の場合は、入力文字列がひらがな
（あるいは音素文字）で１００種類程度に限定されるの
で、あまり問題にならないが、漢字かな混じり文字列を
対象とした文音声変換や機械翻訳など場合は、漢字を含
み文字の種類が３０００〜４０００種類にもおよぶので
、メモリが大きくなって問題になる。The third prior art addresses two drawbacks of the first prior art. However, in order to store the input string, it is necessary to prepare a memory with the number of bits for each type of character and the length of the string, so if there are many types of characters, the memory size becomes quite large. There is a problem with it. In the case of kana-kanji conversion and voice recognition, the input character string is limited to about 100 types of hiragana (or phonetic characters), so this is not much of a problem, but text-to-speech conversion and machine translation that target character strings containing kanji and kana In such cases, there are 3,000 to 4,000 types of characters, including kanji, and the memory becomes large, which becomes a problem.

また、第３の従来技術では、入力された文字列の先頭か
ら始る単語の検索しか行なわないため、２文字目から始
まる単語や、３文字目から始まる単語などを検索する場
合には、入力文字列のメモリへの登録を、やり直す必要
が生じるという問題もある。In addition, in the third conventional technology, only the word starting from the beginning of the input character string is searched, so when searching for a word starting from the second character, a word starting from the third character, etc., the input There is also the problem that it is necessary to register the character string in memory again.

次に、検索キーを複数設けた場合の従来技術の課題を述
べる。Next, problems with the prior art when a plurality of search keys are provided will be described.

第１〜第３の従来技術では、検索キーはただ１種類と考
えられている。したがって、複数の検索キーを設けよう
とすると、第４の従来技術と同様に、検索キーごとの単
語辞書を用意することになる。その場合、複数の単語辞
書に情報が重複して登録されることになり、無駄が多い
ｆｌｙ４遺となる。In the first to third conventional techniques, only one type of search key is considered. Therefore, if a plurality of search keys are to be provided, a word dictionary must be prepared for each search key, as in the fourth prior art. In that case, information will be registered redundantly in a plurality of word dictionaries, resulting in a wasteful fly4 system.

第５・第６の従来技術では、この重複の無駄をなくして
いるが、次のような問題点がある。まず、第５の従来技
術では、単語の読みがなを検索キーとした検索の場合、
漢字読み表で展開して見出しを生成しながら照合を行な
うといった複雑な処理を行なう上に、本来の単語の読み
がな以外の見出しも生成されてしまうので、処理量が多
く、結果として処理時間が増大する０次に第６の従来技
術では、ポインタで関係を管理しているため、項目（単
語）の追加・削除の際に、その関係が壊れないかをチエ
ツクする必要が一発生し、単語辞書のメンテナンス作業
の労力が大きくなる。また、ポインタに置き換えること
により、見かけ上の情報の重複はなくなっているが、代
わりにポインタ領域がとられることになり、実質的な辞
書内容の面では、第４の従来技術と同様に無駄が多い。Although the fifth and sixth conventional techniques eliminate this wasteful duplication, they have the following problems. First, in the fifth conventional technology, in the case of a search using the pronunciation of a word as a search key,
In addition to performing complex processing such as collating while expanding the kanji reading table and generating headings, headings other than the original pronunciation of the word are also generated, resulting in a large amount of processing and, as a result, processing time. In the 6th prior art, which increases to zero order, relationships are managed using pointers, so when adding or deleting items (words), it is necessary to check whether the relationships are broken or not. The effort required to maintain the word dictionary increases. Furthermore, although the apparent duplication of information is eliminated by replacing it with a pointer, the pointer area is taken up instead, which is wasteful in terms of actual dictionary contents, as in the fourth prior art. many.

本発明の目的は、上記のような従来技術の欠点を除去し
、入力文字列の各文字に複数の候補があっても、高速に
単語辞書との照合が可能な単語辞書検索装置であって、
しかも、複数の検索キーを設けて゛も、容量の面で無駄
のある構造をとることなく、高速性を保てる単語辞書検
索装置を提供することである。SUMMARY OF THE INVENTION An object of the present invention is to provide a word dictionary search device that eliminates the drawbacks of the prior art as described above and is capable of high-speed matching with a word dictionary even if each character in an input character string has multiple candidates. ,
Moreover, it is an object of the present invention to provide a word dictionary search device that can maintain high speed even when a plurality of search keys are provided without adopting a structure that is wasteful in terms of capacity.

（課題を解決するための手段）本願の第１の発明の単語辞書検索装置は、各文字に対し
て第１から第ｍ候補（ｍはｍ≧１なる整数）までのｍ種
類の候補が存在する文字列を入力する入力装置と、ｎ文
字骨（ｎはｎ≧１なる整数）のデータ巾をもつ各アドレ
スに見出しを１個ずつ格納してｎ文字に満たない部分に
あらかじめ定めた残余記号を詰めて構成され見出しの種
類に応じて区分されたＳ種類（ｓはＳ≧１なる整数）の
単語辞書メモリと、前記入力装置によって１文字に対す
るｍ種類の候補が入力されるたびに１回のシフトクロッ
ク並びに前記単語辞書メモリ内の全単語数に応じた回数
の判定クロック及びカウンタクロックを発生するコント
ローラと、前記シフトクロックに同期したリセ・ットと
前記カウンタクロックに同期した力〜ラントアップとを
行なう前記Ｓ種類の単語辞書メモリに共通のアドレスカ
ウンタと、前記入力装置によって入力された文字列の第
１・第２・・・・・第ｍ候補に対応し前記シフトクロ・
ツクに同期した１文字ずつの順送りを行なう各々ｎ文字
骨の第１・第２・・・・・第ｍ候補シフトレジスタと、
前記Ｓ種類の単語辞書メモリの１つを選択するスイッチ
と、前記スイッチで選択された単語辞書メモリから読み
出されるｎ文字骨のデータの１文字目・２文字目・・・
・・ｎ文字目に対応し該当位置の文字が前記第１・第２
・・・・・第ｍ候補シフトレジスタのいずれかの同位置
の文字または前記残余記号に一致したとき一致信号を出
力する１文字目・２文字目・・・・・ｎ文字目比較回路
と、前記判定クロックに同期して前記１文字目・２文字
目・・・・・ｎ文字目比較回路のすべてから一致信号が
検出された場合に前記入力装置によって入力された文字
列中に前記単語辞書メモリ内に存在する単語が出現した
と判定する判定回路とを備える。(Means for Solving the Problems) In the word dictionary search device of the first invention of the present application, there are m types of candidates from the first to the mth candidate (m is an integer where m≧1) for each character. An input device for inputting a character string to be displayed, and a header stored in each address with a data width of n characters (n is an integer with n≧1), and a predetermined residual symbol for the part less than n characters. A word dictionary memory of S types (s is an integer such that S≧1) that is divided according to the type of heading, and once every time m types of candidates for one character are input by the input device. a controller that generates a shift clock, a judgment clock and a counter clock a number of times according to the total number of words in the word dictionary memory; a reset/set synchronized with the shift clock; and a power run-up synchronized with the counter clock. an address counter common to the S types of word dictionary memories, and a shift counter corresponding to the first, second, . . . mth candidates of the character string inputted by the input device;
1st, 2nd, . . . m-th candidate shift registers for each n-character bone, which sequentially advance characters one by one in synchronization with the tsuku;
A switch for selecting one of the S types of word dictionary memories, and the first and second characters of the n-character bone data read from the word dictionary memory selected by the switch.
...The character at the corresponding position corresponding to the nth character is the first and second character.
. . . 1st character, 2nd character, . When a match signal is detected from all of the first character, second character,... nth character comparison circuits in synchronization with the determination clock, the word dictionary is detected in the character string input by the input device. and a determination circuit that determines that a word existing in the memory has appeared.

本願の第２の発明の単語辞書検索装置は、各文字に対し
て第１から第ｍ候補（ｍはｍ≧１なる整数）までのｍ種
類の候補が存在する文字列を入力する入力装置と、ｎ文
字分（ｎはｎ≧１なる１１数のデータ巾をもつ各アドレ
スに見出しを１個ずつ格納してｎ文字に満たない部分に
あらかじめ定めた残余記号を詰めて構成され見出しの種
類に応じて区分されたＳ種類（ｓはＳ≧１なる整数）の
単語辞書メモリと、各アドレスに単語に関する前記８８
Ｍの見出し以外の付加情報を格納した付加情報メモリと
、前記入力装置によって１文字に対するｍ種類の候補が
入力されるたびに１回のシフトクロック並びに前記単語
辞書メモリ内の全単語数に応じた回数の判定クロック及
びカウンタクロックを発生するコントローラと、前記シ
フトクロックに同期したリセットと前記カウンタクロッ
クに同期したカウントアラ１とを行なう前記Ｓ種類の単
語辞書メモリと前記付加情報メモリとに共通のアドレス
カウンタと、前記入力装置によって入力された文字列の
第１・第２・・・・・第ｍ候補に対応し前記シフトクロ
ックに同期した１文字ずつの原送りを行なう各々ｎ文字
分の第１・第２・・・・・第ｍ候補シフトレジスタと、
前記Ｓ種類の単語辞書メモリの１つを選択するスイッチ
と、前記スイッチで選択された単語辞書メモリから読み
出されるｎ文字分のデータの１文字目・２文字目・・・
・・ｎ文字目に対応し該当位置の文字が前記第１・第２
・・・・・第ｍ候補シフトレジスタのいずれかの同位置
の文字または前記残余記号に一致ｊ７たとき一致信号を
出力する１文字目・２文字目・・・・・ｎ文字目比軟回
路と、前記判定クロックに同期して前記１文字目・２文
字目・・・・・ｎ文字目比較回路のすべてから一致信号
が検出された場合に前記入力装置によって入力された文
字列中に前記単語辞書メモリ内に存在する単語が出現し
たと判定する判定回路と前記判定回路により出現が検出
された単語について前記スイッチで選択された単語辞書
メモリ以外の単語辞書メモリ内と前記付加情報メモリ内
とを選別出力するセレクタとを備える。The word dictionary search device of the second invention of the present application comprises an input device for inputting a character string in which there are m types of candidates from the first to the mth candidate (m is an integer where m≧1) for each character. , for n characters (n is n≧1, and one heading is stored in each address with a data width of 11 numbers, and the part less than n characters is filled with predetermined residual symbols. A word dictionary memory of S types (s is an integer such that S≧1) divided according to
an additional information memory that stores additional information other than M headings; a shift clock that is activated once every time m types of candidates for one character are input by the input device; and a shift clock corresponding to the total number of words in the word dictionary memory. A common address for a controller that generates a count determination clock and a counter clock, the S type word dictionary memory that performs reset synchronized with the shift clock, and count alarm 1 synchronized with the counter clock, and the additional information memory. a counter, and first, second, . . . m-th candidates of the character string input by the input device, and performs original forwarding of each n character in synchronization with the shift clock.・Second... m-th candidate shift register,
A switch for selecting one of the S types of word dictionary memories, and the first and second characters of the data for n characters read from the word dictionary memory selected by the switch.
...The character at the corresponding position corresponding to the nth character is the first and second character.
. . . 1st character, 2nd character . . . nth character ratio soft circuit that outputs a match signal when a match j7 occurs with any character at the same position in the m-th candidate shift register or with the above-mentioned residual symbol. When a match signal is detected from all of the first character, second character,... nth character comparison circuit in synchronization with the determination clock, the character string input by the input device is detected. A determination circuit that determines that a word existing in the word dictionary memory has appeared; and a determination circuit that determines that a word existing in the word dictionary memory has appeared, and a word whose appearance is detected by the determination circuit is stored in the word dictionary memory other than the word dictionary memory selected by the switch and in the additional information memory. and a selector that selects and outputs.

（実施例）図面を用いて、本願発明の構成・作用を説明する。(Example) The structure and operation of the present invention will be explained using the drawings.

第１図は本願の第１の発明の単語辞書検索装置の第１の
実施例の構成を示すブロック図である。FIG. 1 is a block diagram showing the configuration of a first embodiment of a word dictionary search device according to the first invention of the present application.

以下、まず、各構成要素を説明する。Below, each component will be explained first.

入力装置５は、各文字に対して第１の候補がち第ｍ候補
（ｍはｍ≧１なる整数）までのｍ種類の候補が存在する
文字列を入力する装置であり、例えば、音声認識装置や
文字認識装置などである。The input device 5 is a device for inputting a character string in which there are m types of candidates for each character, from the first candidate to the m-th candidate (m is an integer where m≧1), and is, for example, a voice recognition device. and character recognition devices.

入力装置５は、各文字に対するｍ個の候補を同時に出力
し、その出方のタイミングを入力クロック５０によって
コントローラ７へ伝える。The input device 5 simultaneously outputs m candidates for each character, and transmits the timing of their appearance to the controller 7 using an input clock 50.

単語辞書メモリ１は、ｎ文字分（ｎはｎ≧１なる整数）
のデータ巾をもつ各アドレスに見出しを１個ずつ格納し
、ｎ文字に満たない部分にはあらかじめ定めた残余記号
を詰めて構成されたものである。この単語辞書メモリ１
を、見出しの種類の異なるＳ種類（ｓはＳ≧１なる整数
）だけ設ける。Word dictionary memory 1 is for n characters (n is an integer where n≧1)
One heading is stored in each address having a data width of , and the portion less than n characters is filled with predetermined residual symbols. This word dictionary memory 1
There are S types of different headings (s is an integer such that S≧1).

以下、Ｓ種類の単語辞書メモリ１を、第１の単語辞書メ
モリ、第２の単語辞書メモリ、１引・・、第Ｓの単語辞
書メモリとする。ちなみに、第１図ではｓ＝２としてい
る。第１の単語辞書メモリは単語の読みがなを見出しと
したものであり、第２の単語辞書メモリは単語の表記を
見出しとしたものである。他にも単語の英表記を見出し
としたものなども考えられる。Ｓ種類の単語辞書メモリ
について、同一のアドレスには同一の単語に対する見出
しを格納するものとする。た辷し、この第１の実施例で
は、単語の出現のみを判定するので１．必ずしも、同一
アドレスが同一単語に対応してなくとも動作する。さら
に単語辞書メモリ内の単語の並びは、ソートされている
必要はない。Hereinafter, the S types of word dictionary memories 1 will be referred to as a first word dictionary memory, a second word dictionary memory, a 1st word dictionary memory, . . . , an S-th word dictionary memory. Incidentally, in FIG. 1, s=2. The first word dictionary memory uses the readings of words as headings, and the second word dictionary memory uses the spellings of words as headings. Another possibility is to use the English notation of the word as a heading. Assume that for S types of word dictionary memories, headings for the same word are stored at the same address. However, in this first embodiment, only the appearance of words is determined, so 1. It works even if the same address does not necessarily correspond to the same word. Furthermore, the arrangement of words in the word dictionary memory does not need to be sorted.

第５図は、単語辞書メモリ１の内容の例を示す図である
。第５図（ａ＞は第１の単語辞書メモリの内容の例、第
５図（ｂ）は第２の単語辞書メモリの内容の例である。FIG. 5 is a diagram showing an example of the contents of the word dictionary memory 1. FIG. 5(a) is an example of the contents of the first word dictionary memory, and FIG. 5(b) is an example of the contents of the second word dictionary memory.

いずれもｎ＝６としている。In both cases, n=6.

なお、Δが残余記号を表している。第５図（ａ）の第１
の単語辞書メモリのアドレスａとすると、「にほんΔΔ
Δ」というｎ（＝６）文字が同時に読み出される。同じ
アドレスに対して、第５図（ｂ）の第２の単語辞書メモ
リからは、「日本ΔΔΔΔ」というｎ（＝６）文字が同
時に読み出される。単語辞書メモリ１は、ＩＣメモリな
どを用いて実現−できる０通常、日本語の文字コードは
１６ビツトで表現されるので、第５図のデータ中は１６
Ｘ６＝９６ビツトである。現在のＩＣメモリでは、同時
に読み出されるデータ中が８ビット程度であるから、第
５図の各単語辞書メモリは、そのようなＩＣメモリを１
２個並列に並べて実現できる。Note that Δ represents the residual symbol. 1 in Figure 5(a)
If the address a of the word dictionary memory is ``Nihon ΔΔ
n (=6) characters "Δ" are read out at the same time. For the same address, n (=6) characters "Japan ΔΔΔΔ" are simultaneously read out from the second word dictionary memory shown in FIG. 5(b). The word dictionary memory 1 can be realized using an IC memory or the like.Normally, Japanese character codes are expressed in 16 bits, so the data in Figure 5 contains 16 bits.
X6=96 bits. In current IC memories, the data read out at the same time is about 8 bits, so each word dictionary memory in FIG.
It can be realized by arranging two pieces in parallel.

コントローラ７は、入力装置５によって１文字に対する
ｍ種類の候補が入力されるたびに、１回のシフトクロッ
ク並びに、単語辞書メモリ１内の全単語数に応じた回数
の判定クロックおよびカウンタクロックを発生する。第
６図は、コントローラ７の入出力信号のタイムチャート
の例である。The controller 7 generates one shift clock and a number of determination clocks and counter clocks corresponding to the total number of words in the word dictionary memory 1 each time m types of candidates for one character are inputted by the input device 5. do. FIG. 6 is an example of a time chart of input/output signals of the controller 7.

第６図のタイムチャートでは、入力装置５から入力タロ
ツク５０が入力されるたびに、まず、シフトクロック７
０を１回出力する０本実施例では入力クロック５０をそ
のままシフトクロック７０として出力している。続いて
、判定クロック２とカウンタクロック７１とを交互にＮ
回出力している。In the time chart of FIG. 6, each time the input tarlock 50 is input from the input device 5, the shift clock 7
0 is outputted once. In this embodiment, the input clock 50 is output as is as the shift clock 70. Next, the judgment clock 2 and the counter clock 71 are alternately set to N.
It is outputting times.

ただし、カウンタクロック７１は（Ｎ−１）回でもよい
、ここで、Ｎは単語辞書メモリ１内の全単語数である。However, the counter clock 71 may be counted (N-1) times, where N is the total number of words in the word dictionary memory 1.

このようなタイムチャート通りに動作するコントローラ
７は、当業者において容易に実現可能である。A person skilled in the art can easily realize a controller 7 that operates according to such a time chart.

アドレスカウンタ４は、シフトクロック７０に同期した
リセットと、カウンタクロック７１に同期したカウント
アツプとを行なうカウンタで、カウンタ値をＳ種類の単
語辞書メモリ１の共通のアドレス値として出力する。ア
ドレスカウンタ４は従来のカウンタＩＣで実現できる。The address counter 4 is a counter that performs reset in synchronization with the shift clock 70 and count-up in synchronization with the counter clock 71, and outputs the counter value as a common address value for the S types of word dictionary memories 1. Address counter 4 can be implemented using a conventional counter IC.

第１候補シフトレジスタ２は、入力装置５によって入力
された文字列の第１候補（Ｉは１≦ｌ≦ｍなる整数）を
、シフトクロック７０に同期して１文字ずつ順送りしな
がら、ｎ文字分格納するシフトレジスタである。このシ
フトレジスタ２は、入力文字列の各文字に対するｍ個の
候補の各々に対して設けられており、第１の候補シフト
レジスタ、第２候補シフトレジスタ、・・・・・・・・
・、第ｍ候補シフトレジスタのｍ個がある。第７図は、
各シフトレジスタ２の構成例を示した図である。１６字
をｄビットで表現している場合、第１候補シフトレジス
タ２は、第７図のように、シフトクロック７０に同期し
た（ｄＸｎ）個のＤフリップ７０ツブで実現できる。す
なわち、ｄ個並列に並べたものを、ｎ個直列に接続する
。並列に接続したｄ個が１文字に対応し、その出力は各
文字目毎にまとめて比較回路３へ送られる。The first candidate shift register 2 sequentially shifts the first candidate (I is an integer satisfying 1≦l≦m) of the character string inputted by the input device 5 one character at a time in synchronization with the shift clock 70, and transfers n characters. This is a shift register that stores minutes. This shift register 2 is provided for each of the m candidates for each character of the input character string, and includes a first candidate shift register, a second candidate shift register,...
, there are m number of m-th candidate shift registers. Figure 7 shows
2 is a diagram showing a configuration example of each shift register 2. FIG. When 16 characters are expressed by d bits, the first candidate shift register 2 can be realized by (dXn) D flips 70 blocks synchronized with the shift clock 70, as shown in FIG. That is, d pieces arranged in parallel are connected in series. The d pieces connected in parallel correspond to one character, and the output is sent to the comparator circuit 3 for each character.

スイッチ８は、Ｓ種類の単語辞書メモリ１の１つを選択
する手段である０選択された単語辞書メモリに対応する
値を選択信号８０として出力する。The switch 8 is a means for selecting one of the S types of word dictionary memories 1, and outputs a value corresponding to the word dictionary memory 0 selected as a selection signal 80.

例えば、ｓ＝２の場合は、選択信号線を１ビツト用意し
、その値がＯのとき第１の単語辞書メモリ、値が１のと
き第２の単語辞書メモリに対応させればよい、一般に、
選択信号線はｌｏｇｓビット必要になる。このスイッチ
８は、トグルスイッチ、デイツプスイッチなどで実現で
きる。なお、入力装置５から入力されるテキストの文字
コードに応じて、切り換えるようなスイッチ８の構成も
可能である。For example, in the case of s=2, it is sufficient to prepare one bit of the selection signal line, and when the value is O, it corresponds to the first word dictionary memory, and when the value is 1, it corresponds to the second word dictionary memory.Generally, ,
The selection signal line requires logs bits. This switch 8 can be realized by a toggle switch, dip switch, or the like. Note that it is also possible to configure the switch 8 such that it changes over in accordance with the character code of the text input from the input device 5.

ｊ文字目比較回路３は、アドレスカウンタ４の示す１づ
のアドレスに対して、スイッチ８で選択された単語辞書
メモリ１から読み出され４ｎ文字分のデータ中の１番目
（ｊは１≦ｊ≦ｎなる整数）の文字が、ｍ個のシフトレ
ジスタ２の１番目の文字のいずれか、または、残余記号
に一致したときに、一致信号を出力する回路である。こ
の比較回路３は、単語辞書メモリ１のデータ巾ｎ文字の
各々に対応じた数だけ設けられており、１文字目比較回
路、２文字目比較回路、・・・・・・・・・、ｎ文字目
比較回路のｎ個がある。第８図はｊ文字目比較回路３の
構成例を示した図である。１も字をｄビットで表現して
、シフトレジスタ２がｍ個設けられている場合、ｊ文字
目比較回路３は、第８図のように、（ｍ＋１）個のｄビ
ット比較器と、１個のＯＲゲートと、１個のｄビットセ
レクタとで構成できる。ｄビットセレクタは、選択信号
８０にしたがって、５個の単語辞書メモリから読み出さ
れたＳ種類のｊ文字目データのうちの１つを選択する（
第８図ではｓ＝２としである）０ｍ個のｄビット比較器
はｄピッチセレクタで選択された文字と各シフトレジス
タ２のｊ文字目とを比較する。The j-th character comparison circuit 3 reads out the word dictionary memory 1 selected by the switch 8 and compares the first address (j is 1≦j This circuit outputs a match signal when a character (integer ≦n) matches one of the first characters of m shift registers 2 or a residual symbol. The comparison circuits 3 are provided in a number corresponding to each of the data width n characters of the word dictionary memory 1, and include a first character comparison circuit, a second character comparison circuit, etc. There are n number of n-th character comparison circuits. FIG. 8 is a diagram showing an example of the configuration of the j-th character comparison circuit 3. When the character 1 is represented by d bits and m shift registers 2 are provided, the j-th character comparison circuit 3 has (m+1) d-bit comparators and 1 It can be configured with 2 OR gates and 1 d-bit selector. The d-bit selector selects one of the j-th character data of the S type read from the five word dictionary memories according to the selection signal 80 (
(In FIG. 8, s=2) 0m d-bit comparators compare the character selected by the d pitch selector with the j-th character of each shift register 2.

一方、残りの１個のｄビット比較器はｄビットセレクタ
で選択された文字が残余記号であるか否かを調べる。最
終的なＯＲゲートの出力では、この（ｍ＋１＞個の比較
器のいずれかで一致が検出されたときに、一致信号３０
が出力される。さら仁、第９図はｊ文字目比較回路３の
別の構成例を示した図である。第９図の構成では、各単
語辞書メモリから読み出されたＪ文字目ごとに、（ｍ＋
１）個のｄビット比較器でｍ個のシフトレジスタの」文
字目および残余記号との比較を行ない、その結果を１ビ
ツトセレクタで選択して出力する。On the other hand, the remaining d-bit comparator checks whether the character selected by the d-bit selector is a residual symbol. At the output of the final OR gate, when a match is detected in any of these (m+1> comparators), the match signal 30
is output. FIG. 9 is a diagram showing another example of the configuration of the j-th character comparison circuit 3. In the configuration shown in FIG. 9, for each J character read from each word dictionary memory, (m+
1) The d-bit comparators compare the ``th'' character of the m shift registers with the remaining symbols, and the results are selected and output using the 1-bit selector.

判定回路６は、判定クロック７２に同期して、ｎ個の比
較回路３のすべてから一致信号が検出された場合に、入
力装置５によって入力された文字列中に単語辞書メモリ
１内に存在する単語が出現したと判定する回路である。In synchronization with the determination clock 72, the determination circuit 6 determines whether a word exists in the word dictionary memory 1 in the character string input by the input device 5 when a matching signal is detected from all of the n comparison circuits 3. This is a circuit that determines that a word has appeared.

第１０図は、判定回路６の構成例を示す図である０判定
回路６は、第１０図のように、１個のＡＮＤゲートと１
個のＤフリップ７０ツブで実現できる。FIG. 10 is a diagram showing a configuration example of the determination circuit 6. The 0 determination circuit 6 includes one AND gate and one
This can be achieved with 70 pieces of D-flips.

次にこの実施例の動作を例を用いて説明する。Next, the operation of this embodiment will be explained using an example.

第１１図・第１２図の（ａ）〜（口は、単語辞書メモリ
のデータ中ｎ＝６、候補数ｍ＝３で、入力文字列の長さ
が６のとき、第１の候補シフトレジスタ・第２候補シフ
トレジスタ・第３候補レジスタの内容の変化の例を示し
た図である。第１１図は、入力装置５として文字認識装
置が用髪）られ、漢字かな混じり文字列が入力された場
合の一例である。また、第１２図は、入力装置５として
音声認識装置が用いられ、かな文字列が入力された場合
の一例である。第１１図・第１２図ともに、３×６のマ
トリックスは、１行が各シフトレジスタ２の内容に対応
しており、行方向が文字位置（１〜６）、列方向が候補
レベル（１〜３）を表わしている０例えば、第１１図の
例でシフトレジスタ２に入力されている文字列は、１文
字目の候補は第１候補から順に１日」「白」　［目」、
２文字目の候補は順に「本ｊｒ本」［大」、３文字目は
順にＥ語」　「話」　「詰」、４文字目は順に「の」　
［め」　［つ」、５文字目は順に「丈」　「又」「文」
、６文字目は順に１章」「草」　「早」である、斜線部
分は文字が格納されていないことを表わす。11 and 12 (a) to () are the first candidate shift registers when n=6 in the data in the word dictionary memory, number of candidates m=3, and the length of the input character string is 6.・This is a diagram showing an example of changes in the contents of the second candidate shift register and the third candidate register. FIG. This is an example of a case where Further, FIG. 12 shows an example in which a voice recognition device is used as the input device 5 and a kana character string is input. In both FIG. 11 and FIG. 12, one row of the 3×6 matrix corresponds to the contents of each shift register 2, and the row direction corresponds to the character position (1 to 6), and the column direction corresponds to the candidate level (1 to 6). 3) 0 represents 0. For example, in the example shown in Figure 11, the character string input to the shift register 2 is as follows: ``1 day'', ``white'', ``eye'', ``white'', ``eye'',
The second character candidates are ``hon jr book'' [dai], the third character is ``E'', ``story'', ``tsume'', and the fourth character is ``no''.
[me] [tsu], the fifth character is [jo] [mata] [bun]
, the sixth character is ``Chapter 1'', ``kusa'', and ``early'', and the shaded area indicates that no characters are stored.

第１１図のような入力文字列（入力装置５が文字認識装
置の場合）に対しては、第５図（ｂ）のような単語の表
記を見出しとして単語辞書検索処理を行なうように、ス
イッチ８を設定する。すなわち、第２の単語辞書メモリ
（表記を見出しとしたとき単語辞書メモリ）を選択する
。For an input character string as shown in FIG. 11 (when the input device 5 is a character recognition device), a switch is set so that word dictionary search processing is performed using the word notation as the heading as shown in FIG. 5(b). Set 8. That is, the second word dictionary memory (word dictionary memory when the notation is used as a heading) is selected.

そのとき、第１１図において、（ａ）→（ｂ）→（ｃ）
→（ｄ）−？（ｅ）→（ｆ）→（ｇ）→（ｈ）→（ｉ）
→（Ｊ）→（ｋ）→（１）という変化は、シフトクロッ
ク７０が１回発生されるたびに起こる変化を示している
。そして、（ａ）〜（ｋ）の各々の状態で、カウンタク
ロック７１と判定クロック７２が全単語数Ｎと同じ８回
発生される。アドレスカウンタ４は、シフトクロック７
０によってリセットされ、カウンタクロック７１によっ
てＮ回カウントアツプされるため、各状態で、単語辞書
メモリ１からは、先頭の単語から末尾の単語まで順に、
全単語（Ｎ個）の表記が読み出される。そして、ｎ個（
この例では６ｍ）の比較回路３において、スイッチ８に
よって選択され単語辞書メモリ１の全見出しと、第１１
図のシフトレジスタの内容との照合が行なわれることに
なる。At that time, in Fig. 11, (a) → (b) → (c)
→(d)-? (e) → (f) → (g) → (h) → (i)
The change →(J) →(k) →(1) indicates a change that occurs each time the shift clock 70 is generated. In each of the states (a) to (k), the counter clock 71 and the determination clock 72 are generated eight times, which is the same as the total number of words N. The address counter 4 has a shift clock 7
Since it is reset by 0 and counted up N times by the counter clock 71, in each state, the word dictionary memory 1 sequentially reads from the first word to the last word.
The notation of all words (N pieces) is read out. And n pieces (
In this example, in the comparator circuit 3 (6m), all the headings in the word dictionary memory 1 selected by the switch 8 and the 11th
A comparison will be made with the contents of the shift register shown in the figure.

その結果、各状態で、第２の単語辞書メモリ１内の例え
ば次のような単語（見出し）、に対して、判定回１ｉ１
１６が出現を検出する。As a result, in each state, for example, the following words (headings) in the second word dictionary memory 1, the judgment time 1i1
16 detects the occurrence.

（ａ）　　該当なしくｂ）　　該当なしくｃ）　　該当なしくｄ）　　該当なしくｅ）　　該当なしくｆ）　　１日」　「白」「目」「日本」　「０大」「
白木」　ｒ日本語ｊ（ｇ）　　「木」　「本」　ｒ大」　「大詰め」（ｈ）
・　１語」　［話」　「詰め」（１）　「の」「め」「つ」（Ｊ）　「丈」［又」「文」「文章」（ｋ）１章」「草」「早」このうち、状態（ｆ）における第７図のシフトレジスタ
２の内容と、第２図の単語辞書メモリ１内の見出し「日
本」（＝「日本ΔΔΔΔ」）とを照合する際の、各比較
回路３の動作を説明する。(a) Not applicable b) Not applicable c) Not applicable d) Not applicable e) Not applicable f) 1 day” “White” “Eyes” “Japan” “0 large” “
"Shiraki" rJapanese j (g) "tree""book" r large""final" (h)
・1 word” [story] “tsume” (1) “no” “me” “tsu” (J) “length” [mata] “bun” “bun” (k) “chapter 1” “kusa” “haya” this Among these, each comparison circuit 3 is used when comparing the contents of the shift register 2 in FIG. 7 in state (f) with the heading "Japan" (= "Japan ΔΔΔΔ") in the word dictionary memory 1 in FIG. Explain the operation.

１文字目比較回路は、「日本ΔΔΔΔ」の１日」と第１
候補シフトレジスタの１文字目の１日」との一致により
一致信号３０を発生する。２文字目比較回路は、「日本
ΔΔΔΔ」の「本」と第２候補シフトレジスタの２文字
目の１本」との一致により一致信号３０を発生する。３
文字目比較回路から６文字目比較回路までは、「日本Δ
ΔΔΔ」の「Δ」　（残余記号）を検出して一致信号３
０を発生する。その結果、判定回路６は、すべての比較
回路３の一致信号３０を受けて、単語の出現を検出する
ことになる。The first character comparison circuit compares "1 day of Japan ΔΔΔΔ" and the first character.
A match signal 30 is generated by a match with "1st day of the first character of the candidate shift register." The second character comparison circuit generates a match signal 30 based on a match between "hon" of "Japan ΔΔΔΔ" and "one of the second characters of the second candidate shift register". 3
From the character comparison circuit to the 6th character comparison circuit, "Japan Δ
Detect “Δ” (residual symbol) of “ΔΔΔ” and generate match signal 3
Generates 0. As a result, the determination circuit 6 receives the match signals 30 from all the comparison circuits 3 and detects the appearance of a word.

さて、第１２図のような文字列が入力される場合（入力
装置５が音声認識装置の場合）には、第５図（ａ）のよ
うな単語の読みがなを見出しとして単語辞書検索処理を
行なうように、スイッチ８を切り換える。すなわち、第
１の単語辞書メモリ（読みがなを見出しとした単語辞書
メモリ）を選択し直す、そして、同様に、１文字ずつ順
送りしながら、その度に、第１の単語辞書メモリ１の全
見出しとシフトレジスタ２の内容とを照合する。Now, when a character string as shown in FIG. 12 is input (in case the input device 5 is a voice recognition device), a word dictionary search process is performed using the pronunciation of the word as a heading as shown in FIG. 5(a). Switch 8 is changed so as to perform the following steps. That is, the first word dictionary memory (word dictionary memory with readings as headings) is reselected, and in the same way, all characters in the first word dictionary memory 1 are selected while sequentially advancing one character at a time. The heading is compared with the contents of shift register 2.

その結果、各状態で検出される第１の単語辞書メモリ内
の単語（見出し）は、例えば、次のようなものである。As a result, the words (headings) in the first word dictionary memory detected in each state are, for example, as follows.

（ａ（ｂ（Ｃ（ｄ（ｅ（ｆ該当なし該当なし該当なし該当なし該当なし「に」　「いＪ　「す」　「におＪ「にほん」　「いはん」　「いほう」「いおん」　「いおう」「は」　「お」　「はん」　「はん」「はぼ」　「はう」「おん」　「おう」　「はんぐ」「はんぶ」　「おんぶ」　「はんぐん」（ｇ）「はんぶん」　「はんぶん」「おうぼう」（ｈ）　　「ん」　「う」　「はぐ」　「はぼ」「はぶ
ん」（ｌ）　「ぐ」　「ぶ」「ぼ」「ぐん」「ぶん」「ぼん
」　「ぐんかＪ　「ぶんかＪ（Ｊ）　　ｒん」　「う」「ほか」（ｋ）　「かＪ　’＜Ｊ　’が」なお、入力文字列長がＫの場合、シフトレジスタ２内の
順送りは、少なくとも（Ｋ＋ｎ−１）回行なわれる必要
がある。したがって、入力装置５は、長さＫの入力文字
列を入力した後、さらに、（ｎ−１）回のダミーの文字
列入力を行なう必要がある。あるいは、コントローラ７
が、入力文字列の末尾を検出して、さらに、シフトクロ
ック７０を１回カウンタクロック７１と判定クロック７
２をＮ回ずつというサイクルを、（ｎ−１）回繰り返す
ようにしてもよい。(a (b (C) (d (e (f) Not applicable Not applicable Not applicable Not applicable Not applicable ``ni'' ``iJ ``su'' ``nij ``nihon''``ihan''``iho'' ``ion ” “Iou” “Ha” “O” “Han” “Han” “Habo” “Hau” “On” “Ou” “Hangu” “Hanbu” “Piggyback” “Hangun” (g) “Hanbun” “Hanbun” “Oubou” (h) “N” “U” “Hagu” “Habo” “Habun” (l) “Gu” “Bu” “Bo” “ ``gun''``bun''``bon'' ``gunkaJ ``bunkaJ (J) run''``u''``other'' (k) ``kaJ '<J'ga'' Note that if the input string length is K , the sequential feeding in the shift register 2 must be performed at least (K+n-1) times. Therefore, after inputting the input character string of length K, the input device 5 further performs the sequential feeding (n-1) times. It is necessary to input a dummy character string.Or, the controller 7
detects the end of the input character string, and then shifts the shift clock 70 once to the counter clock 71 and the judgment clock 7.
2 may be repeated N times each time (n-1) times.

また、最初の（ｎ−１）回の順送りの間（第１１図・第
１２図では（ａ）から（ｅ）まで）は、入力文字列がシ
フトレジスタ２の先頭まで到達していないので、単語辞
書メモリ１との照合を行なっても意味がない、そこで、
その間は、コントローラ７は、カウンタクロック７１や
判定タロツク７２を発生せず、シフトクロック７０のみ
を連続して発生するようにしてもよい。Also, during the first (n-1) sequential feeds (from (a) to (e) in Figures 11 and 12), the input character string has not reached the beginning of the shift register 2, so There is no point in checking against word dictionary memory 1, so
During this period, the controller 7 may continuously generate only the shift clock 70 without generating the counter clock 71 or the judgment clock 72.

以上に示したように、この第１の実施例では、入力装置
５として、文字認識装置を用い表記を見出しとした単語
辞書検索を行なう場合、音声認識装置を用いて読みがな
を見出しとした単語辞書検索を行なう場合、スイッチを
切り換えるだけで共通に適用できる。As described above, in this first embodiment, when a character recognition device is used as the input device 5 to perform a word dictionary search using a notation as a heading, a voice recognition device is used to search a word dictionary using a reading as a heading. When performing a word dictionary search, it can be applied universally by simply flipping a switch.

第２図は、第２の実施例の構成を示すブロック図である
。第１の実施例では、単語の出現を検出するのみであっ
たが、第２の実施例では、検出された単語に関する情報
を出力する。第２の実施例は、第１図にその構成を示し
た第１の実施例に、付加情報メモリ９と出力レジスタ１
０を追加したものである。FIG. 2 is a block diagram showing the configuration of the second embodiment. In the first embodiment, only the occurrence of a word is detected, but in the second embodiment, information regarding the detected word is output. The second embodiment has an additional information memory 9 and an output register 1 added to the first embodiment whose configuration is shown in FIG.
0 is added.

ここで、付加情報メモリ９は、Ｓ種類の単語辞書メモリ
１の見出し以外の付加情報を格納したメモリである０例
えば、単語の品詞、アクセント、意味分類、表記具、読
み長などを収める。付加情報メモリ９では、単語辞書メ
モリ１の単語と同一のアドレスに、それら付加情報を格
納する。第１３図は、単語辞書メモリ１と付加情報メモ
リ９の内容の対応例を示した図である。ただし、付加情
報としては品詞しか登録していない、アドレスａは、表
記が「日本」、読みがなが「にほん」、品詞が固有名詞
である単語に対応している。アドレスカウンタ４は、Ｓ
種類の単語辞書メモリ１だけでなく、付加情報メモリ９
にも、共通のアドレスを与えるようにする。Here, the additional information memory 9 is a memory that stores additional information other than the headings of the S types of word dictionary memory 1, such as part of speech, accent, meaning classification, notation, reading length, etc. of the word. The additional information memory 9 stores the additional information at the same address as the word in the word dictionary memory 1. FIG. 13 is a diagram showing an example of correspondence between the contents of the word dictionary memory 1 and the additional information memory 9. However, address a, for which only the part of speech is registered as additional information, corresponds to a word whose notation is "Japan", whose reading is "nihon", and whose part of speech is a proper noun. Address counter 4 is S
Not only types of word dictionary memory 1 but also additional information memory 9
Also, give a common address.

出力レジスタ１０は、Ｓ種類の単語辞書メモリ１と付加
情報メモリ９から読み出される１つの単語に対する全情
報（ｓ種類の見出しの内容と付加情報の内容）を、その
まま判定回路６から出力される検出クロック６０のタイ
ミングでラッチする。The output register 10 outputs all the information for one word read from the S types of word dictionary memory 1 and the additional information memory 9 (the contents of the S types of headings and the contents of the additional information) and outputs them as they are from the determination circuit 6. It is latched at the timing of clock 60.

出力レジスタ１０はフリップフロップで実現されるレジ
スタである。The output register 10 is a register implemented by a flip-flop.

ここで、漢字かな混じり文字列のテキストを文字認識し
、さらに、文章解析・音声合成する読書器を考えた場合
、表記を見出しとした単語辞書検索処理の結果として、
読みがな・品詞などは必要であるが、表記は出力される
必要はないなぜならば単語辞書検索処理の後で行なわれ
る音声合成処理では使われないからである。また、逆に
、音声入力で文章を作成する音声ワードプロセッサを考
えた場合、読みがなを見出しとした単語辞書検索処理の
結果として、表記・品詞などは必要であるが、読みがな
は出力される必要はない、そこで、第２の実施例のよう
に単語の情報をすべて出力するのではなく、検出の際に
用いられた見出しは出力しないようにしたものが、第３
の実施例である。If we consider a reading device that recognizes text containing kanji and kana characters, and also performs sentence analysis and speech synthesis, as a result of a word dictionary search process using the notation as a heading,
Although the pronunciation and part of speech are necessary, the notation does not need to be output because it is not used in the speech synthesis process that is performed after the word dictionary search process. Conversely, if we consider an audio word processor that creates sentences using voice input, as a result of a word dictionary search process using readings as headings, the spelling and parts of speech are necessary, but the readings are not output. Therefore, instead of outputting all word information as in the second embodiment, the third embodiment does not output the headings used during detection.
This is an example.

第３図は、この第３の実施例の構成を示ずブロック図で
ある。この第３の実施例は、本願の第２の発明にもとづ
くものである。FIG. 3 is a block diagram showing the configuration of this third embodiment. This third embodiment is based on the second invention of the present application.

第３の実施例では、第２図にその構成を示した第２の実
施例の対して、セレクタ１１が追加されている。なお、
第１の実施例に対しては、付加情報メモリ９、セレクタ
１１、出力レジスタ１０が追加されている。In the third embodiment, a selector 11 is added in contrast to the second embodiment whose configuration is shown in FIG. In addition,
An additional information memory 9, a selector 11, and an output register 10 are added to the first embodiment.

第３図において、セレクタ１１は、Ｓ種類の単語辞書メ
モリ１の内容（ｓ種類の見出し）のうちスイッチ８で選
択された単語辞書メモリ以外の（ｓ−１）種類の内容と
、付加情報メモリ９の内容とを選別出力する６例えば、
第２の単語辞書メモリ１がスイッチ８で選択されている
際、その第２の単語辞書メモリ１内の見出し「日本」に
対して検出が行なわれたときは、第１の単語辞書メモリ
１の内容「にほん」と付加情報メモリ９の内容「固有名
詞」が出力される。すなわち、「日本」は出力されない
。In FIG. 3, the selector 11 selects (s-1) types of contents other than the word dictionary memory selected by the switch 8 among the S types of contents of the word dictionary memory 1 (s types of headings) and the additional information memory. For example, select and output the contents of 9 and 6.
When the second word dictionary memory 1 is selected by the switch 8 and the heading "Japan" in the second word dictionary memory 1 is detected, the first word dictionary memory 1 is selected. The content "Nihon" and the content "proper noun" of the additional information memory 9 are output. In other words, "Japan" is not output.

以上の第１〜第３の実施例では、入力文字列の各文字に
ｍ個の候補が存在する一般的な場合の実施例を示したが
、かな漢字変換における単語辞書検索や、作成した文章
を音声出力するための漢字かな変換における単語辞書検
索などでは、入力文字列の各文字は１通りである（ｍ＝
　１　＞　、第３図にその構成を示したような本願の第
２の発明にっいて、ｍ＝１とした場合を、第４の実施例
として第４図に示す、第４図は第４の実施例の構成を示
すブロック図である。In the first to third embodiments above, examples were shown for the general case where there are m candidates for each character in the input character string. In word dictionary searches for kanji-kana conversion for audio output, each character in the input string is unique (m=
1>, in the second invention of the present application whose configuration is shown in FIG. 3, the case where m=1 is shown in FIG. 4 as a fourth embodiment. FIG. 2 is a block diagram showing the configuration of an embodiment of the present invention.

第４の実施例では、シフトレジスタ２が１個でよい、構
成要素・動作は、これまでに示した実施例について、ｍ
＝１とすればよいので、詳しい説明を省略する。かな漢
字変換の際には、入力されなかな文字列と、第１の単語
辞書メモリ（読みがなを見出しとする単語辞書メモリ）
の見出しとを照合して、検出された単語について、表記
・品詞などが出力される。漢字かな変換の際には、入力
された漢字かな混じり文字列と、第２の単語辞書メモリ
（表記を見出しとする単語辞書メモリ）の見出しとを照
合して、検出された単語について、読みがな・品詞など
が出力される。In the fourth embodiment, only one shift register 2 is required, and the components and operations are m
= 1, so a detailed explanation will be omitted. During kana-kanji conversion, the input kana character string and the first word dictionary memory (word dictionary memory with readings as headings)
The spelling, part of speech, etc. of the detected words are output. When converting Kanji to Kana, the input character string containing Kanji and Kana is compared with the headings in the second word dictionary memory (word dictionary memory with spellings as headings), and the readings of the detected words are calculated. , part of speech, etc. are output.

なお、見出しによって、１文字を表現するビット数が異
なることがある０例えば、漢字かな混じりの表記の１文
字の表現には、通常、１６ビ・シトが必要であるが、読
みがなの１文字は８ビツトで表現可能である。そのよう
な場合に対処する方法の１つは、多いビット数のものに
合わせることである０例えば、読みがなの１文字にも、
８ビット余分なビットを付加して、１６ビツト割り当て
ればよい、別な対処方法は、公約数のビット数を単位と
して、有効な区切りのみを利用するものである０例えば
、８ビツトを単位として順送りや比較を行ない、表記に
関する照合の場合には、偶数回目の順送りの回だけを有
効とすればよい、なお、読みがなは８ビツトなので、毎
回有効である。Note that the number of bits to represent one character may differ depending on the heading. For example, 16 bits are usually required to represent one character written in kanji and kana, but one character in reading is can be expressed in 8 bits. One way to deal with such cases is to match the number of bits with a large number of bits.
Another solution is to add 8 extra bits and allocate 16 bits.Another way to deal with this is to use only valid delimiters using the number of bits in the common divisor as a unit.For example, use 8 bits as a unit. When performing sequential forwarding and comparison, and in the case of notation-related verification, only the even-numbered forwarding times need to be valid. Note that since the pronunciation is 8 bits, it is valid every time.

（発明の効果）以上に説明したように、本願発明によれば、漢字のよう
に種類の多い文字から成る文字列に対して、各文字が複
数の候補をもっていても、高速に単語辞書との照合が可
能な単語辞書検索装置が得られる。特に、入力文字列の
各文字の候補数にも、単語辞書内の単語の表記長にもか
かわらず、入力文字列と単語辞書内の１つの単語との照
合が２クロック程度の間に行えることの効果が大きい。(Effects of the Invention) As explained above, according to the present invention, even if each character has multiple candidates for a character string consisting of many types of characters such as kanji, it can be quickly compared to a word dictionary. A word dictionary search device capable of comparison is obtained. In particular, regardless of the number of candidates for each character in the input string and the length of the word in the word dictionary, matching the input string with one word in the word dictionary can be performed within about two clocks. The effect is large.

また、検索キーが１種類に限定されず、複数種類の検索
キーを切り換えて照合が可能である。しかも、複数種類
の検索キーを用いることによる単語辞書容量の無駄な増
加は発生しない。そして、いずれの種類の検索キーにつ
いても、検索の高速性は保たれている。Furthermore, the number of search keys is not limited to one type, and matching can be performed by switching between multiple types of search keys. Furthermore, the word dictionary capacity does not increase unnecessarily due to the use of multiple types of search keys. The speed of the search is maintained for all types of search keys.

また、本願発明の各構成要素は、少数のロジックＩＣを
組み合わせで実現できる。したがって、ＬＳＩ技術を利
用すれば、非常に小型の装置として実現できる点も長所
である。それに加えて、第１の従来技術のように、汎用
計算機上のソフトウェアとして実現されるのではなく、
専用ハードウェア・特殊ＬＳＩとして実現できるなめ、
クロックの周波数自体を、汎用計算機に比べてかなり高
く設定することが可能であり、この点でも高速性が優れ
ている。Further, each component of the present invention can be realized by combining a small number of logic ICs. Therefore, an advantage of using LSI technology is that it can be realized as a very compact device. In addition, unlike the first conventional technology, it is not implemented as software on a general-purpose computer;
Because it can be realized as dedicated hardware/special LSI,
The clock frequency itself can be set much higher than that of general-purpose computers, and in this respect, it is also superior in speed.

以上のような高速性から、本願発明では、前述の第１の
従来技術のように単語辞書内で照合する単語数を制限せ
ずにＩｉｉ語辞書内の全単語との照合を行なっても、従
来より十分高速な単語辞書検索装置が得られる。この結
果、単語辞書内が見出しのコード順にソートされている
必要がないという長所も生ずる。したがって、単語の追
加・削除が行なわれても、単語辞書を編成し直す必要が
なく、単語辞書のメンテナンスがきわめて容易である。Due to the above-mentioned high speed, the present invention does not limit the number of words to be matched in the word dictionary as in the first prior art described above, but even when matching is performed with all words in the III word dictionary, A word dictionary search device that is sufficiently faster than the conventional one can be obtained. As a result, there is an advantage that the word dictionary does not need to be sorted in the order of the heading code. Therefore, even if words are added or deleted, there is no need to reorganize the word dictionary, and maintenance of the word dictionary is extremely easy.

[Brief explanation of drawings]

第１図は本Ｍ発明の第１の実施例の構成を示すブロック
図、第２図は第２の実施例の構成を示すブロック図、第
３図は第３の実施例の構成を示すブロック図、第４図は
第４の実施例の構成を示すブロック図、第５図は単語辞
書メモリ１の内容の例を示す図、第６図はコントローラ
７の入力信号のタイムチャート、第７図はシフトレジス
タ２の構成例を示す図、第８図・第９図は比較回路３の
構成例を示す図、第１０図は判定回路６の構成例を示す
図、第１１図（ａ）〜（１）は漢字かな混じり文字列の
入力に対するシフトレジスタ２の内容の変化の例を示す
図、第１２図（ａ）〜（冒）はかな文字例の入力に対す
るシフトレジスタ２の内容の変化の例を示す図、第１３
図は単語辞書メモリ１と付加情報メモリ９の対応を示す
図である。１・・・単語辞書メモリ、２・・・シフトレジスタ（第
１候補シフトレジスタ）、３・・・比較回路（Ｊ文字目
比較回路）、４・・・アドレスカウンタ、５・・・入力
装置、６・・・判定回路、７・・・コントローラ、８・
・・スイッチ、９・・・付加情報メモリ、１ｏ・・・出
力レジスタ、３０・・・一致信号、５ｏ・・・入力タロ
ツク、６０・・・検出クロック、７ｏ・・・シフトクロ
ック、７１・・・カウンタクロック、７２・・・判定タ
ロツク、８０・・・選択信号。FIG. 1 is a block diagram showing the configuration of the first embodiment of the present M invention, FIG. 2 is a block diagram showing the configuration of the second embodiment, and FIG. 3 is a block diagram showing the configuration of the third embodiment. 4 is a block diagram showing the configuration of the fourth embodiment, FIG. 5 is a diagram showing an example of the contents of the word dictionary memory 1, FIG. 6 is a time chart of input signals of the controller 7, and FIG. is a diagram showing an example of the configuration of the shift register 2, FIGS. 8 and 9 are diagrams showing an example of the configuration of the comparison circuit 3, FIG. 10 is a diagram showing an example of the configuration of the determination circuit 6, and FIGS. (1) is a diagram showing an example of the change in the contents of shift register 2 in response to the input of a character string containing kanji and kana; Illustration showing an example, No. 13
The figure shows the correspondence between the word dictionary memory 1 and the additional information memory 9. DESCRIPTION OF SYMBOLS 1... Word dictionary memory, 2... Shift register (first candidate shift register), 3... Comparison circuit (J-th character comparison circuit), 4... Address counter, 5... Input device, 6... Judgment circuit, 7... Controller, 8...
... Switch, 9... Additional information memory, 1o... Output register, 30... Match signal, 5o... Input tarlock, 60... Detection clock, 7o... Shift clock, 71... - Counter clock, 72... Judgment clock, 80... Selection signal.

Claims

[Claims]

(1) An input device that inputs a character string in which there are m types of candidates from the first to the mth candidate (m is an integer where m≧1) for each character, and One heading is stored in each address with a data width of (an integer), and the part less than n characters is filled with predetermined residual symbols. a word dictionary memory (an integer where s≧1), one shift clock every time m types of candidates for one character are input by the input device, and determination of the number of times according to the total number of words in the word dictionary memory. a controller that generates a clock and a counter clock; an address counter common to the s types of word dictionary memories that performs reset in synchronization with the shift clock and count-up in synchronization with the counter clock; 1st, 2nd, . . . , m-th candidates corresponding to the first, second, . A shift register, a switch for selecting one of the s types of word dictionary memories, and the first and second characters of data for n characters read from the word dictionary memory selected by the switch.
Character: When the character at the corresponding position corresponding to the n-th character matches the character at the same position in the first, second, m-th candidate shift register, or the residual symbol. 1st character, 2nd character...nth character comparison circuit which outputs a match signal, and 1st character, 2nd character...nth character comparison circuit synchronized with the judgment clock. and a determination circuit that determines that a word existing in the word dictionary memory has appeared in the character string input by the input device when matching signals are detected from all of the characters. Device.

(2) An input device for inputting a character string in which there are m types of candidates from the first to the mth candidate (m is an integer where m≧1) for each character; One heading is stored in each address with a data width of (an integer), and the part less than n characters is filled with predetermined residual symbols. an additional information memory storing additional information other than the s types of headings regarding a word at each address; and each time m types of candidates for one character are inputted by the input device. a controller that generates a shift clock once per cycle, a determination clock and a counter clock a number of times according to the total number of words in the word dictionary memory; a reset synchronized with the shift clock; and a count-up synchronized with the counter clock. an address counter common to the s types of word dictionary memory and the additional information memory; and the shift clock corresponding to the first, second, . a first, second, . . . , m-th candidate shift register for each n character, which sequentially shifts characters one by one in synchronization with the above; a switch for selecting one of the s types of word dictionary memories; and a switch for selecting one of the s types of word dictionary memories; The first and second characters of the data for n characters read out from the word dictionary memory selected in...The characters corresponding to the nth character are the first and second characters. The first and second characters that output a match signal when they match the character at the same position in any of the m-th candidate shift registers or the remaining symbol...
・If a match signal is detected from the n-th character comparison circuit, and all of the first character, second character, etc. n-th character comparison circuit in synchronization with the determination clock, the signal is input by the input device. a determination circuit that determines that a word existing in the word dictionary memory has appeared in a character string; and a determination circuit that determines that a word existing in the word dictionary memory has appeared in the character string; A word dictionary search device comprising: a selector for selectively outputting information in the additional information memory.