JPS6274153A

JPS6274153A - Retrieving method for electronic dictionary

Info

Publication number: JPS6274153A
Application number: JP60215691A
Authority: JP
Inventors: Yoshizo Saito; 齋藤　佳三
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 1985-09-27
Filing date: 1985-09-27
Publication date: 1987-04-04
Also published as: JPH0378667B2

Abstract

PURPOSE:To shorten the time required for checking a spelling, by adjusting the number of words per one group by increasing or decreasing the number of digits of the index of a hash value, when grouping a word group and registering it to a dictionary. CONSTITUTION:Character and word information inputted and edited by an input device 1 is stored in a memory device 2. An electronic dictionary 4 for a spelling check which is connected to the memory device 2 is provided with an operation processing part for a spelling check processing, and offers information related to whether the spelling of an inputted word is correct or not, in accordance with an inquiry from the memory device 2. This operation processing part which is used exclusively for the spelling check processing is constituted so that the number of words per one group can be adjusted by increasing or decreasing the number of digits of the index of the hash value, when grouping the words and registering them in the dictionary 4.

Description

【発明の詳細な説明】（産業上の利用分！Ｊ！Ｐ）本発明はワードプロセッザ、タイプライタ等を含む種々
の言語処理装置に付随する電子辞書の検索方法に関する
。DETAILED DESCRIPTION OF THE INVENTION (Industrial Application! J!P) The present invention relates to a search method for electronic dictionaries attached to various language processing devices including word processors, typewriters, and the like.

（技術背景）欧文ワードプロセッザ、欧文タイプライタ等に付属され
ろ電子式単語辞書において、スペルヂエック機能は重要
であり、かっこのようなスペルチェック処理はできる限
り迅速に行いたいという要請がある。そこでスペルチェ
ックを行うに当り、例えば第１表に例示するように、辞
書に登録する弔語群を頭文字及び文字数７、−よって２
火星的１．丁グループ分（−ｊ（〜、該当オろｒｊ８語
が属４′ろグル　プ内でのみ検索を行うことか考えられ
る４、なお、第１表に示す数値ｉＪ各ダグループ属オろ
単語数で・ある３、（発明が解決すべき課題）ところがに記検索方法に、１．１Ｌば、電子計−）に登
録されるＱｊ語群をグループ分番士しているにも拘り二
）ず、なお個々のグループ７こ属する９１語の促１数が
がなり多数にのほろので（例えば頭文字が“Ｃ”で８文
字の単語（７目１５４．１７ｉ）、該当する単語の検索
に要−４゛ろｌ！ｊ間が長くなるという問題がある９、
そのため、タイプライタ等にお１．Ｌるリアルクィノ、
処理（、二は供し難い。(Technical background) In electronic word dictionaries attached to Roman word processors, Roman typewriters, etc., a spell check function is important, and there is a demand for spell check processing such as parentheses to be performed as quickly as possible. Therefore, when performing a spell check, for example, as shown in Table 1, the group of eulogy words to be registered in the dictionary has an initial letter and a number of characters of 7, - therefore 2.
Mars-like 1. It is conceivable that the corresponding ororj8 words are searched only within the group 4, and the number of words in each group iJ shown in Table 1. 3. (Problem to be solved by the invention) However, in the search method described in 1.1L, although the Qj word group registered in electronic meter -) is used as a group number, 2) , because the number of 91 words that belong to 7 individual groups is large, and there are many words (for example, a word with the initial letter "C" and 8 letters (7th letter 154.17i), it is necessary to search for the corresponding word. -4゛rol!j There is a problem that the interval becomes long 9,
Therefore, 1. L real quino,
Processing (, 2 is difficult to provide.

父上記検索力法においては、各単語が１文字中１！ソで
コ−１・化されて電子計−１１こ登録さＪｌろ、ｊ：う
７．＝なっているので、１語当りの記憶に要４゛る容積
が文字数によ−・て変動し、特に文字数の多い１１１語
の場合、１語当りの記憶容量が大きくなるという問題が
ある。しかも文字数の多い１１語では、１語当りの検索
時間もかなり長時間となろ３、史に又、上記検索方法に
よれば、万　誤、たスペルの単語が人ｊＪさイ１へ−場
合、該当グル　プ内の全ての単語と照合１−た後でなＩ
ｊｔｌ、ばミススペルと判定４ろことかで八ないので、
判定時間がｒ＝くなるとい・）問題らある３、（問題点を解決４゛ろため（ハ丁段）本発明は１．述Ｌ　ｉｃ種々の不μ合を解消することを
目的と１．ている。そのたｙ）、本発明に係る電子辞書
の検索方法は、メモリ装置からなる電１′・辞Ｉ）に複
数の単語を格納して検索を行・）ζ、−＝　’１０、ｒ
め各アルファベット文字に対し文字ウ−ｒ、　、、１’
　ｌ−を定めるととｔ）にＱｉ語内の各位置ｒ）　、、
　（＋−を定め、各Ｑｉ語にお１する語頭の文字に）い
で文字（、’）　ｊ１′トと位置ウェイトを乗算した値
を予め定めた所定素数で除算して剰余を求め、引続き後
続するｈ文字に−）いて文字ウェ、イトと位置ウゴ、イ
トの乗′ｃ＞値１１、曲回の剰余を加算したり（を上記
所定素数で除痒し７：′剰余を求める操作を語肥の文字
まで１文字ｆｉｉに繰り返して行い、語尾の文字に対す
る最終剰余としで得られたハソン、１値を下位一定柘数
のインテ・ツクスと下位一定桁数のデータに分割して各
単語のデータを１−配電子辞書にインデックスが共通な
グループｈｊに登録しておき、検索すべき単語について
上記ハンソコ値を）Ｉ出し、該検索単語のデ　タとｆ＝
記電子辞書内の検索単語のインデックスに対応するグル
ープのデータ群との一致又は不一致を照合するようにし
たことを特徴とする。Father In the above search power method, each word has 1 out of 1 letters! It has been converted into a code 1 and the electronic meter 11 has been registered in Japan. =, the storage capacity required to store one word varies depending on the number of characters, and especially in the case of 111 words, which have a large number of characters, there is a problem that the storage capacity per word becomes large. Moreover, for 11 words with a large number of characters, the search time per word would be quite long3.In addition, according to the above search method, in the case of a wrongly spelled word, After matching all words in the group,
jtl, it's a misspelled word and the judgment is 4 or so, so
The purpose of the present invention is to solve the various inconveniences mentioned in 1. Therefore, the electronic dictionary search method according to the present invention stores a plurality of words in the electronic dictionary I) consisting of a memory device and performs a search. , r
For each alphabetic character, write the letters U-r, ,,1'
When l- is defined, each position r) in the Qi word is set to t), ,
(Determine +- and add 1 to the first letter of each Qi word.) Then, multiply the character (,') j1' by the position weight, divide the value by a predetermined prime number, find the remainder, and continue to Add the remainder of the curvature of the character wa, it and the position ugo, the power of it'c>value 11, and (divide it by the predetermined prime number 7:') to find the remainder. The process is repeated for each character fii up to the letter ``hi'', and the final remainder for the final character is divided into the lower constant number of inte tx and the lower constant number of digits to calculate the value of each word. Register the data in a group hj with a common index in the electronic dictionary, obtain the above-mentioned value for the word to be searched, and combine the data of the search word and f=
The present invention is characterized in that the index of the search word in the electronic dictionary is checked for match or mismatch with the data group of the group corresponding to the index.

その場合、各グルーブノ、二属するデータ群を数値の小
さい順に登録することが好適である。In this case, it is preferable to register the data groups belonging to each groove number in ascending order of numerical value.

（実施例）以下、添付図面及び添付図表を参照１．なから本発明を
実施例に基づいて説明する。(Example) Please refer to the attached drawings and charts below.1. The present invention will be explained based on examples.

図面には本発明法を適用しうる言δＰ、処理装置の一般
構成が示されている。同図において、■は本言語処理装
置に文字、単語情報を入力するための入力装置であって
、具体的には例えば鍵盤装置、タブレット装置、ＯＣＩ
ｚ　（光学的文字読取装置）、磁気テ　プ装置等が使用
される。The drawings show the general structure of a processing device and a word δP to which the method of the present invention can be applied. In the same figure, ■ is an input device for inputting characters and word information to the language processing device, and specifically, for example, a keyboard device, a tablet device, an OCI device, etc.
z (optical character reader), magnetic tape device, etc. are used.

２は人力装置ｌに接続され、人力装置１によって人力さ
れて編集さＡ１だ文字、単語情報を保存ずる記憶装置で
あって、例えばコアメモリ、Ｉ　Ｃメモリ、磁気ディス
ク装置等が使用される９、３は記憶装置２に接続され、
記憶装置２で保存された情報を出勾オる出力装置であ−
、て、例えば各種プリンタ、ディスプレイ装置、磁気テ
　ブ装置、磁気ディスク装置等が使用されろ４．４は記
憶装置２に接続されろスペル舌上ツタ用電子辞書であ−
）で、例えば：ｌアメモリ、ＩＣメモリ、ＲＯＭ（ラン
グ１、アクセスメモリ）、磁気ディスク装置等により構
成されろ。後述−セるよ５に該辞書４はスペルヂエック
処理専用の演算処理部を備え、記憶装置２からの間合１
１に応じて、人勾された単語のスペルが正しいか否かの
情報を提供１゜うるようになっている。Reference numeral 2 denotes a storage device which is connected to the human-powered device 1 and stores the character and word information edited by the human-powered device 1, such as a core memory, an IC memory, a magnetic disk device, etc. 9 , 3 are connected to the storage device 2,
It is an output device for outputting information stored in the storage device 2.
For example, various printers, display devices, magnetic tape devices, magnetic disk devices, etc. are used. 4.4 is an electronic dictionary for spelling tongue ivy connected to the storage device 2.
), for example, it is composed of: l memory, IC memory, ROM (rung 1, access memory), magnetic disk device, etc. As will be described later, the dictionary 4 is equipped with an arithmetic processing section dedicated to spell check processing,
1, it is possible to provide information on whether the spelling of the word that has been deduced is correct or not.

又５は各装置１〜４に接続されろ制御装置で、例えばコ
ンビコータによって構成され、各装置１〜４間？こおけ
る信号の授受の制御を行う。Reference numeral 5 denotes a control device connected to each device 1 to 4, which is configured by a combi coater, for example, and is connected to each device 1 to 4. Controls the transmission and reception of signals at this station.

次に、本発明におけるハッシコ法による欧文単語のコー
ド化について述べる。Next, encoding of European words using the hashco method in the present invention will be described.

このコード化に際しては、まず第３表に人文字のアルフ
ァベットの一部を例示するように、各文字にそれぞれ２
進数からなる固有の文字ウェイ）・（便宜」二１０進表
示で表す）を定める。なお第３表では省略しているが、
小文字のアルファベット、数字等に対しても同様に文字
ウェイトが定められる。When encoding this, first, as shown in Table 3, which shows part of the human alphabet, each character has two characters.
A unique character way consisting of a base number) (conveniently expressed in decimal notation) is determined. Although omitted in Table 3,
Character weights are similarly determined for lowercase alphabets, numbers, and the like.

それとともに第４表に示すように、単語内の各文字位置
に対し、それぞれ２進数からなる固有の位置ウェイト（
便宜」−１６進表示で表す）を定めろ。At the same time, as shown in Table 4, for each character position within a word, a unique position weight (
(expressed in hexadecimal).

なお第４表には１文字目〜７文字目の位置ウェイトが例
示されているが、８文字目以降についても同様の位置ウ
ェイトが定められる。Table 4 shows position weights for the first to seventh characters, but similar position weights are determined for the eighth and subsequent characters.

次に、上記文字ウェイト及び位置ウェイトに基づいて、
下記の手順で各弔語のハツシュ値を算出する。Next, based on the above character weight and position weight,
Calculate the hash value of each eulogy using the following procedure.

（ｉ）すなわち、まず各単語の１文字目（語頭）の文字
の文字ウェイトと位置ウェイトを乗算する。(i) That is, first, the character weight and position weight of the first character (initial character) of each word are multiplied.

例えば“ＡＩＲ”という単語の場合、“Ａ”の文字ウェ
イト“６０”と１文字目の位置ウェイト“ｏｏｏｇｏｏ
ｏｏ“を乗算する。その場合、文字ウェイトを３ビット
単位に分割して位置ウェイトに乗算することが好適であ
る。そして、その乗算値を２２７に最も近い素数で除算
して剰余を求め、該剰余を記憶する。For example, in the case of the word "AIR", the character weight of "A" is "60" and the positional weight of the first character is "ooogoo".
oo". In that case, it is preferable to divide the character weight into 3-bit units and multiply by the position weight. Then, divide the multiplied value by the prime number closest to 227 to obtain the remainder. Remember the remainder.

（１１）引続き、２文字目の文字の文字ウェイトと位置
ウェイトを乗算した値に１文字目について求めた剰余を
加算してその値を」１記素数で除算し、新たな剰余を算
出する。以下、最後（語尾）の文字まで１文字毎に同様
の演算を繰り返し、最終的に求めた剰余をその単語のハ
ツシュ値とする。ここでは、各回の除算にお客」る除数
として２２７に最も近い素数を選定しているので、−ト
記ハッンユ値は全て２７桁以内の２進数で表現される。(11) Next, add the remainder obtained for the first character to the value obtained by multiplying the character weight and position weight of the second character, and divide that value by a prime number to calculate a new remainder. Thereafter, the same operation is repeated for each character up to the last character (the end of the word), and the finally obtained remainder is used as the hash value of the word. Here, since the prime number closest to 227 is selected as the divisor for each division, all the values are expressed as binary numbers within 27 digits.

なお、除数を変えることによって、ハツシュ値の桁数を
任意に変更できる。Note that by changing the divisor, the number of digits of the hash value can be changed arbitrarily.

第５表にアルファベットの冒頭部分について上記手順で
ハツシュ値を算出した結果を例示する。Table 5 shows the results of calculating the hash value using the above procedure for the beginning of the alphabet.

このようにして求めたハツシュ値を昇り順（数値の小さ
い順）に並べ換えたものの先頭部分を第６表に示す。こ
れら第５．６表においては、便宜上ハツシュ値を８進表
示で表している。Table 6 shows the first part of the hash values obtained in this way, rearranged in ascending order (in ascending order). In Table 5.6, hash values are expressed in octal for convenience.

上記のようにしてハツシュ法によるコード化が終了すれ
ば、次に各単語のハツシュ値を上位１１桁（２進表示の
場合）のインデックス部分（以下単にインデックスとい
う）と下位１６桁のデータ部分（以下単にデータという
）に分割し、インデックスの共通な単語毎にグループ分
けを行う。例えば、第６表に示す単語群のうち、ハツシ
ュ値が２＋６（８進表示にお１３る２０００００）未満
の４３個の単語群ｎｅｖｕｓ〜ａｃｃｏｍｍｏｄａｔｏ
ｒはインデックス“０″として第１番目のグループに分
類される。又、ハラノコ−値が２１′１以上でかつ２１
７未満の単語群はインデックス“ビとして第２番目のグ
ループに分類される。このようにして全ての単語がイン
デックスの桁数に対応する２０４８（−２’り通りのグ
ループに分類される。なお、インデックスの桁数を変え
ることによりグループ数を任意に増減することができる
。Once the encoding using the hash method is completed as described above, the hash value of each word is divided into the upper 11 digits (in the case of binary representation) of the index part (hereinafter simply referred to as index) and the lower 16 digits of the data part ( (hereinafter simply referred to as data), and grouped by words with common indexes. For example, among the word groups shown in Table 6, the 43 word groups with hash values less than 2+6 (13 in octal notation, 200,000)
r is classified into the first group with index "0". In addition, the Haranoko value is 21'1 or more and 21
Words with an index of less than 7 are classified into the second group with the index "bi". In this way, all words are classified into 2048 (-2') groups corresponding to the number of digits in the index. , the number of groups can be increased or decreased arbitrarily by changing the number of digits of the index.

第２表にグループ数を２０４８とした場合の各グループ
に属する単語の個数（便宜上１６進表示で表す）を示す
。第２表の欄外の縦軸には、グループ番号の」１位３桁
（各桁を１６進表示で表す）が、欄外の横軸にはグルー
プ番号の最下位の１桁（８進表示で表す）が示されてい
る。同表から明らかなように、本性ではグループ数を増
加させることにより、個々のグループに属する単語の個
数が減少１．ている。Table 2 shows the number of words belonging to each group (expressed in hexadecimal for convenience) when the number of groups is 2048. The vertical axis outside the margin of Table 2 shows the first three digits of the group number (each digit is expressed in hexadecimal), and the horizontal axis outside the margin shows the lowest digit of the group number (in octal notation). ) is shown. As is clear from the table, by increasing the number of groups, the number of words belonging to each group decreases.1. ing.

ちなみに、第２表中にアンダーラインで示すように、本
性ではＩグループにおける最大の単語数が８８（＋　６
表示における５８）であり、従って最大限８８回の検索
で全てのスペルチェックが行えることになる。By the way, as shown underlined in Table 2, the maximum number of words in the I group in nature is 88 (+ 6
58) in the display, therefore, all spell checks can be performed with a maximum of 88 searches.

以上のようにグループ化された単語のデータは、各グル
ープ毎にそれぞれ昇り順に辞書４に格納される。又、第
２表に示される各グループの単語数に基づいて各グルー
プの先頭アドレスが求められて記憶される（第７表参照
）。これらのアドレスはスペルチェック時における該当
グループの選択に利用される。The word data grouped as described above is stored in the dictionary 4 in ascending order for each group. Furthermore, the start address of each group is determined and stored based on the number of words in each group shown in Table 2 (see Table 7). These addresses are used to select the relevant group during spell checking.

以下、上記辞書４によるスペルチェック処理について述
べる。The spell check process using the dictionary 4 will be described below.

記憶装置２から辞書４にスペルチェックを行うべき単語
（以下検索単語という）が送られると、辞書４内の図示
しない演算処理部により上述と同様のルｉｆＷ方法で検
索１１”ｉ語の・＼ソンコ値が騨出される４、引続き、
検索中５ｔ１のｆン子・Ｉリスに、ｌ−り該当りループ
がｆす別みねた後、上記検索１１１語のう−り鼾該当グ
ル　ブに属−計るデ　タ１！′Ｔとの一致又は不致が順
次照合さ石ろ３．照合の結果、検索１１’ｉ語のブタが
該当グループのい「イ１かのデータと　致４゛わは、記
憶装置ニジ（盲目２いスペルＣあ／）旨を示３）信号が
送信さＡ１ろ。一方、検索ｌｊｊ語のデータか該当グル
　ブのいずれのデータとも一致しなｌｊｊ　４１ば、記
憶装置２にミススペルである旨を・云・＋−（１；　’
、３が送信さイ１ろ３、不法で（Ｊ各！？ル　ブの一？
゛−タか−７１）ｌｌ［Ｑに配列されているの−（＝、
特にミススペルの場合、検索単語のデータが該当１〕゛
ループ（、ｉ）−＋’　　りｉｊｉ　（Ｊ）’、）らの
い４″れかのデ　タより小さくなり、かつそれ−丁での
い一４゛れのデータとも　致しな（［れは、その時点で
ミススペルの判定を十すごとができろ３、とこ／）で第
８表（、−例７■り電ろように、不法においてはンノニ
ノ−３（同Ｍ詔）が発生上ろ３，５−ごてノノー二。When a word to be spell-checked (hereinafter referred to as a search word) is sent from the storage device 2 to the dictionary 4, an arithmetic processing unit (not shown) in the dictionary 4 performs a search 11" for the i word using the same IFW method as described above. 4. Continuing, the Sonko value is calculated.
During the search, 5t1's fonko/I squirrel found the corresponding loop, and then the above searched word 111 belonged to the corresponding group. Data 1! Match or mismatch with 'T is checked sequentially.3. As a result of the matching, the pig in the search 11'i word matches the data in the corresponding group's 1. A1. On the other hand, if the data of the search word ljj does not match any of the data of the corresponding group, write a message to the storage device 2 that it is a misspelling.
, 3 is sent, 1, 3, illegally (J each!? Lube one?
-(=,
In particular, in the case of a misspelling, the data of the search word is smaller than any of the data in the corresponding 1゛loop (, i) - +' riji (J)',), and I don't agree with the previous data. In this case, Nnonino-3 (the same M-edict) occurred, and 3,5-goteno-no-2.

ノー、とは、２語以］−の甲語のハソン：＋、　（ｉＩ
′ｉが同　になろことをい゛）９．シかしながら、こ（
ハよ・“）なジノ。二ノー・は、辞書・１１、−格納４
−ろ甲、１ハ（ｊ）総数７２０００語中３２　、；７ｒ
（））♂）　−Ｑ　Ｉ’ｓ　ｉ）、誤認識は２２５０語
に対１−．　Ｉ　；＋！−８とＦトめ−（−稀に１−５
か／ｌ　ｌ−：ないから、：ｌｊｊ　Ｉｔｌ　、、ｉ−
’、’に障はなしら（ｒ）Ｊ。``No'' means 2 or more words] ``Hason'' in the first word of -: +, (iI
9. While I was thinking,
Hayo ") na Jino. Nino is dictionary 11, - storage 4
- Roko, 1ha (j) 32 out of 72,000 words, ;7r
())♂) -Q I's i), erroneous recognition is 1 to 2250 words. I ;+! -8 and F tome-(-rarely 1-5
ka/l l-: Because there is no, :ljj Itl,,i-
There is no obstacle to ','(r)J.

思わｌ１１ろ５、（発明の効果）以１説明しｊ−、ことから明らかな３１、′）に、本発
明によＡ１ば、ｊｌｉ語群を）、自Ｉ−ブ分（１（７て
辞、１１（σ録Ｗろ（Ｊ当り、上記Ｊ＼ツノーノ値のｆ
ンデリ′ノスの（ｔ」数を増減−計ることに３１−１て
り゛ノ１．　７ｉ′数、換、−１−４−れば゛グＪレー
ーーー７°当ζ）の！）１ｉ菖のＩｌ、’＝１数を調整
−４−ろ、二とかご、＼ろ４、その場り、ｒンデ・２層
ｌスの桁数を充分大１−＼く設疋４゛ろことにより、前
記頭文字及び文字数に３Ｌろ９ル　プ分（）の場合よｉ
）らツノループ数を増して、そイ１だｌＬ’７”ル　ブ
当ｌ）（ハ中、ｉａ数を減少さ１東ろごとかでΔろ。ｊ
Ｌ−ｉ　、スペルチ」！ケの所要時間を短縮−４゛ろこ
とかで、）ろ５、な７１１、萌詞頭文字伎び文字数に括
づ＜′ノ゛ル　ブ化にｔ；いで（Ｊ、辞書にσ緑さイ］
るｌｊｊ語の総数が　定である限り、本発明のよ・′）
なりループ数、−９゛１す′ノ゛ルーーーノ°゛Ｉ戸）
のＩ）Ｉ語数の調整は不可能である。(Effect of the invention) As explained below, it is clear from 31, ') that according to the present invention, A1, jli word group), and own I-b (1 (7)). 11 (σ record Wro (per J, f of the above J\tsunono value
Increasing or decreasing the number (t) of the number 31-1 to calculate the number 1.7i', converting it to -1-4, then the number 7 is 7°. ) 1i irises Il, '=1 Adjust the number -4-ro, 2 and basket, \ro4, spot, rnde, 2nd layer Il, set the number of digits large enough 1-\ku 4 By the way, if the initial letter and number of letters are 3L and 9 loops (), then i
), increase the number of horn loops, then 1 L'7" l) (during the middle, decrease the number of ia, 1 east loop and Δro.j
L-i, Spellch”! Shorten the time required for ``-4゛゛゛゛ word or something,゛゛゛゛゛゛゛゛゚゛゛゛゛゛゛゛゛゛゛゛゛゛゛゛゛゛゛゛゛゛゛゛゛゛゛゛゛゛゛゛゛゛゛゛)゛゛゛゛゛stomach]
As long as the total number of ljj words in the present invention is constant,
Number of loops, -9゛1su'no゛runo゛I door)
I) It is not possible to adjust the number of words.

叉萌述１−ノニよ・−）に、従来は各１１１語を１文字
中位で′−１−ド化して辞書に登録していたのし対１８
、本発明で（Ｊバッジ、１法の採用に３１−〇中語中位
で二１１・化して登録する、）−うにしへので、１語当
り、ン）記憶容Ｍを一定に４ろとともに該１語当り０）
記憶容量を充分小さく十ろことができろ。、従−・て仝
辞書容晴も低減セる。。Previously, each of the 111 words was registered in the dictionary by changing it to '-1-' with one letter in the middle.
, In the present invention (J badge, 1 method is adopted, 31-0 is registered as 211 in the middle of Chinese) (with 0 per word)
Make the storage capacity sufficiently small. , it also reduces the dictionary appearance. .

更に又、古りル　ブに属するデ　タ＋ｔｙをそＡ′Ｉぞ
れＷり順に配列１−１てお（ｊば、万−誤一、へスペル
の単語が入力へれた場合、人力弔語のデ　タを辞書の該
当グ゛ル　ゾの全デ　タと照合・１′ろまてらなく、人
力Ａ１語のデータが該当グル　ブのいす、ｉ］かのデー
タより小さくな−、ノ二時点でミススペルの判定をＩ；
せろので、処理時間の短縮化を図るごとが７′恣ろ。。Furthermore, the data + ty belonging to the old rubbish are arranged 1-1 in the order of A'I respectively (j, if by mistake, if a spelled word is entered, it will be manually written). Compare the data of the word with all the data of the corresponding group in the dictionary. At this point, judge the misspelling by I;
Because of this, every effort to shorten the processing time is 7' arbitrary. .

（以１・゛余白）第１表頭文字筑　９　夷ンコ　乙　次 □−一−−〇− 第３表第４表第７表第５表　　　　　　　第６表第８表(1・゛margin) Table 1 initials Chiku 9 Ii Nko Otsuji □−1−−〇− Table 3 Table 4 Table 7 Table 5 Table 6 Table 8

[Brief explanation of drawings]

図面は本発明法を適用しうる言語処理装置の一般構成を
示すブロック図である。４・・・辞書。The drawing is a block diagram showing the general configuration of a language processing device to which the method of the present invention can be applied. 4...Dictionary.

Claims

[Claims]

(1) When storing multiple words in an electronic dictionary consisting of a memory device and performing a search, a character weight is determined in advance for each alphabetic character, a position weight is determined for each position within the word, and the initial character of each word is determined. Divide the value obtained by multiplying the character weight and position weight by a predetermined prime number to find the remainder,
For each subsequent character, the value obtained by adding the previous remainder to the multiplication value of the character weight and position weight is divided by the above predetermined prime number to obtain the remainder, and the operation is repeated for each character up to the last character of the word. The hash value obtained as the final remainder is divided into an index with a certain number of upper digits and data with a certain number of lower digits, and the data for each word is registered in the electronic dictionary for each group with a common index, and then searched. The electronic device is characterized in that the hash value is calculated for the word to be searched, and the match or mismatch between the data of the search word and the data group of the group corresponding to the index of the search word in the electronic dictionary is checked. How to search a dictionary.

(2) The method according to claim 1, in which data groups belonging to each group are registered in order of decreasing numerical value.