JPS599749A - "kana" (japanese syllabary) and chinese character converting device - Google Patents

"kana" (japanese syllabary) and chinese character converting device

Info

Publication number
JPS599749A
JPS599749A JP57120223A JP12022382A JPS599749A JP S599749 A JPS599749 A JP S599749A JP 57120223 A JP57120223 A JP 57120223A JP 12022382 A JP12022382 A JP 12022382A JP S599749 A JPS599749 A JP S599749A
Authority
JP
Japan
Prior art keywords
search
word dictionary
word
general
provisional
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP57120223A
Other languages
Japanese (ja)
Inventor
Mitsuyuki Okada
岡田 潤之
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Mitsubishi Electric Corp
Original Assignee
Mitsubishi Electric Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Mitsubishi Electric Corp filed Critical Mitsubishi Electric Corp
Priority to JP57120223A priority Critical patent/JPS599749A/en
Publication of JPS599749A publication Critical patent/JPS599749A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/40Processing or translation of natural language
    • G06F40/53Processing of non-Latin text

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Machine Translation (AREA)
  • Document Processing Apparatus (AREA)

Abstract

PURPOSE:To realize an efficient retrieval, by using not only a general word dictionary but also a provisional word dictionary for storing a word designated by a user, in a ''Kana'' (Japanese syllabary) and Chinese character converting device. CONSTITUTION:A character-string inputted from an input device 1 is stored in an input register 2, and is set to a retrieving device 16. The retrieving device 16 retrieves a provisional word dirtionary 17 having a word designated by a user by a rule which is different from the longest coincidence method, for instance, from a word registered in the end. In case when it cannot be retrieved in said retrieving device, retrieval of a general word dictionary 7 is executed by a retrieving device 3. In this way, there is an effect for raising the retrieval efficiency since a provisional word dictionary in which a word used frequently by a user is stored is used prior to a general word dictionary be the longest coincidence method which takes time for the retrieval, and also the retrieval is executed from a work which is used recently and registered in the end.

Description

【発明の詳細な説明】 この発明は、かな文字列を漢字まじり文字列に変換する
かな漢字変換装置に関し、特にかな漢字変換に際して使
用する単語辞書の構成に関するものである。
DETAILED DESCRIPTION OF THE INVENTION The present invention relates to a kana-kanji conversion device for converting a kana character string into a character string mixed with kanji, and particularly to the structure of a word dictionary used in kana-kanji conversion.

従来、この種の装置として第1図に示すものがあった。Conventionally, there has been a device of this type as shown in FIG.

図において、(1)はかな文字列入力器、(2)はこの
かな文字列入力器(1)よりの入力を保存する入力レジ
スタ、(3)は入力レジスタ(2)の内容を用いてかな
見出し部の最長一致法等のある一定の規則に基づいて辞
書を検索する検索器、(7)は検索器(3)が使用する
単語辞書、(4)は検索器(3)の検索内容の適否を判
定する判定器、(5)は判定器(4)よりの出力を格納
する出力レジスタ、(6)は出力レジスタ(5)の内容
を出力する表示器である。
In the figure, (1) is a kana character string input device, (2) is an input register that stores the input from this kana character string input device (1), and (3) is a kana character string input device that uses the contents of input register (2). A searcher that searches a dictionary based on a certain rule such as the longest match method for the heading section, (7) is a word dictionary used by the searcher (3), and (4) is a word dictionary that searches for the search content of the searcher (3). A determiner (5) is an output register that stores the output from the determiner (4), and (6) is an indicator that outputs the contents of the output register (5).

第2図は、第1図の検索器(3)およびその前後の部分
を詳細に示したものである。図において、(8)は入力
レジスタ(2)の入力データ(かな文字列a1〜a9)
を分割して前半部を検索レジスタ(9)に後半部を剰余
レジスタU■に格納する分割器、Giは検索レジスタ(
9)のデータを単語辞書(7)から検索する検索器、α
2は検索判定器、(13)は検索結果格納レジスタ、(
14)はレジスタlとレジスタ00)の内容を合成して
出力レジスタq9に格納する合成器である。
FIG. 2 shows in detail the search device (3) in FIG. 1 and the parts before and after it. In the figure, (8) is the input data of input register (2) (kana character string a1 to a9)
Gi is a divider that divides and stores the first half in the search register (9) and the second half in the remainder register U. Gi is the search register (9).
A search device, α, that searches the data of 9) from the word dictionary (7)
2 is a search determiner, (13) is a search result storage register, (
14) is a synthesizer that synthesizes the contents of register 1 and register 00) and stores it in output register q9.

次に動作について説明する。Next, the operation will be explained.

入力器(1)により入力された入力かな文字列は入力レ
ジスタ(2)に格納され、検索器(3)にかけられる。
The input kana character string input by the input device (1) is stored in the input register (2) and is applied to the search device (3).

検索器(3)は入力レジスタ(2)のデータの全部又は
一部に一致する見出しをもつ単語を辞書(7)から検索
するが、通常の日本語変換システムでは辞書収録単語数
が大きい為実用上検索に一定の規則を設ける必要があり
、長い見出しの単語から検索する最長一致法が一般的に
採用されている。
The search device (3) searches the dictionary (7) for words with headings that match all or part of the data in the input register (2), but this is not practical in normal Japanese conversion systems because the number of words recorded in the dictionary is large. It is necessary to set certain rules for the above search, and the longest match method, which searches from words in long headings, is generally adopted.

検索器(3)1こより単語が検索された結果、入力レジ
スタ(2)のデータは漢字とかなの混在した文字列に変
換される。判定器(4)は検索器(3)からの出力結果
の妥当性をシステム固有の判断基準に基づいて判定し、
合格したら出力レジスタ(5)に結果を格納する。
As a result of searching for a word using the search device (3), the data in the input register (2) is converted into a character string containing a mixture of kanji and kana. A determiner (4) determines the validity of the output result from the searcher (3) based on system-specific determination criteria,
If the test passes, the result is stored in the output register (5).

続いて、第2図にもとづいて、単語の検索の動作を説明
する。
Next, the word search operation will be explained based on FIG.

レジスタ分割器(8)は入力レジスタ(2)のかな文字
列を検索部と剰余部とに分割し、各々検索レジスタ(9
)、剰余レジスタ(1αに格納する。この分割の仕方は
、最初は入力レジスタ(2)の内容をすべて検索レジス
タ(9)に移し、剰余レジスタ0eは空となる。
The register divider (8) divides the kana character string in the input register (2) into a search part and a remainder part, and divides the kana character string in the input register (2) into a search part and a remainder part.
), and stored in the remainder register (1α).In this division, all the contents of the input register (2) are initially transferred to the search register (9), and the remainder register 0e becomes empty.

その後ループにより検索判定器(121からの判定結果
が1回来る度に分割器(8)で入力レジスタ(2)の内
容を末尾から1文字ずつ剰余レジスタ(!0)の方に分
割してゆき、検索レジスタ(9)に格納されているデー
タもそれにつれ末尾から1文字ずつ短くなる。検索器(
11)は、検索レジスタ(9)に格納されたかな文字列
と完全一致するかな見出しをもつ単語を辞書(7)から
検索し、その成否を検索判定器O2で判定する。
After that, each time the judgment result from the search judger (121) comes through a loop, the divider (8) divides the contents of the input register (2) character by character from the end into the remainder register (!0). , the data stored in the search register (9) also becomes shorter by one character from the end.
11) searches the dictionary (7) for a word having a kana heading that completely matches the kana character string stored in the search register (9), and determines the success or failure of the search using the search determiner O2.

検索できなかった場合、ループにより分割器(8)まで
戻り検索レジスタ(9)の末尾1文字を削り、再度、検
索を繰り返す。そして、検索か成功したら、レジスタ1
13)に横架単語の漢字部(B1−Bs )を格納し、
レジスタ合成器口Jによりレジスタ03)とレジスタ(
101の内容を合成してレジスタ05)に格納する。
If the search is not successful, the process loops back to the divider (8), deletes one character from the end of the search register (9), and repeats the search. Then, if the search is successful, register 1
13) Store the kanji part (B1-Bs) of the horizontal word,
Register 03) and register (
The contents of 101 are combined and stored in register 05).

従来のかな漢字変換装置は以−Lのよう(こ構成されて
いるので、あるかな文字入力に対して、見出し長lで検
索できる単語Aが辞書にあった場合、見出し長が! (
<7 )なる単語Bはつねにそれより後に検索されると
いう結果となり、単語Bの検索か頬繁な場合、かな漢字
賀換システムとしての能率を落とすという欠点があった
Conventional kana-kanji conversion devices are structured as shown below, so that when a certain kana character is input, if there is a word A in the dictionary that can be searched with a heading length l, the heading length is ! (
The result is that word B (<7) is always searched after it, and when word B is frequently searched, the efficiency of the kana-kanji-ka-kanji exchange system is reduced.

この発明は上記のような従来のものの欠点を除去するた
めになされたもので、ユーザにより指定された単語を暫
定的に格納する辞書を一般単語辞書とは別個にシステム
上に構成し、一般単語辞書の検索を行う前に無条件にそ
の暫定単語辞書の検索をラストイン・ファストアウト等
の検索規則で行うことにより、見出し長の長短により無
駄な変換を行う可能性を少なくできる効率的なかな漢字
変換装置を提供することを目的としている。
This invention was made in order to eliminate the drawbacks of the conventional ones as described above, and a dictionary for temporarily storing words specified by the user is configured on the system separately from the general word dictionary. By unconditionally searching the provisional word dictionary using search rules such as last in and fast out before searching the dictionary, you can reduce the possibility of unnecessary conversion due to the length of the heading. The purpose is to provide a conversion device.

以ド、この発明の一実施例を図について説明する。Hereinafter, one embodiment of the present invention will be described with reference to the drawings.

第3図において、(1)はかな文字列入力器、(2)は
このかな文字列入力器(1)よりの入力を格納する入力
レジスタ、(161は入力レジスタ(2)の内容を用い
て一般単語辞書(7)の検索規則とは異なるラストイン
・ファストアウト方式の検索規則により暫定単語辞書0
ηより単語を検索する暫定単語辞書検索器、(3)は暫
定単語辞書検索器(16)で検索ができなかった場合一
般単語辞書(7)を用いて単語を検索する一般単語辞書
検索器、(4)は検索器Oe又は検索器(3)の検索結
果の適否を判定する判定器、(5)は判定器(4)より
の出力を格納する出力レジスタ、(6)は出力レジスタ
(5)の内容を出力する表示器である。また田は1%判
定a 路(4)の判定結果が妥当な場合、出力レジスタ
(5)で保持されている一般単語辞書検索器(3)の検
索結果を暫定用語辞書(1ηに登録すべきか否かを判定
する登録判定回路、(31)は登録判定回路(30)の
判定結果に応じて出力レジスタ(5)の内容を登録する
登録回路である。
In Figure 3, (1) is an ephemeral character string input device, (2) is an input register that stores the input from this kana character string input device (1), and (161 is an input register that stores input from the kana character string input device (1)). Provisional word dictionary 0 with last-in, fast-out search rules that are different from the search rules of the general word dictionary (7).
(3) is a provisional word dictionary searcher that searches for words from η; (3) is a general word dictionary searcher that searches for a word using a general word dictionary (7) if the provisional word dictionary searcher (16) fails; (4) is a determiner that determines whether the search result of searcher Oe or searcher (3) is appropriate, (5) is an output register that stores the output from determiner (4), and (6) is an output register (5). ) is a display device that outputs the contents of If the judgment result of path (4) is valid, should the search results of the general word dictionary search device (3) held in the output register (5) be registered in the provisional term dictionary (1η? A registration determination circuit (31) is a registration circuit that registers the contents of the output register (5) according to the determination result of the registration determination circuit (30).

第4図は第3図の検索器(16)の部分を詳細に示した
ものである。図において、(2)は入力レジスタでa□
〜a9は入力かな文字列、(2Ilは暫定単語辞書(1
71から登録単語を抽出する単語抽出器、(2)は抽出
器(21)からの見出しレジスタ、(柵は入力レジスタ
(2)と抽出単語レジスタ■の内容を比較する比較器、
0g)は比較器081の比較結果を判定する比較判定回
路、齢)は判定成功の際の出力レジスタである。
FIG. 4 shows the search device (16) in FIG. 3 in detail. In the figure, (2) is the input register a□
~a9 is the input kana character string, (2Il is the provisional word dictionary (1
71 is a word extractor that extracts registered words from the extractor (21), (2) is a heading register from the extractor (21), (the fence is a comparator that compares the contents of the input register (2) and the extracted word register ■),
0g) is a comparison judgment circuit that judges the comparison result of the comparator 081, and 0g) is an output register when the judgment is successful.

次に動作について説明する。Next, the operation will be explained.

入力器(1)より入力された入力かな文字列は入力レジ
スタ(2)に格納され、検索器06)にかけられる。
The input kana character string inputted from the input device (1) is stored in the input register (2) and is applied to the search device 06).

検索器(161は暫定単語辞書α′71の格納単語を最
長一致法とは異なる規則で検索を行なう。例えば°ラス
トイン・ファストアウト”の規則によれば最後に登録さ
れたものから1個づつ順番に抽出し、入力レジスタ(2
)に見出し部が含まれるかどうかで検索を行う。検索が
失敗すれば検索器(3)で一般単語辞書(7)の検索を
行い成功すれば検索器(3)をとばす。判定器(4)で
検索結果の妥当性をシステム固有の判定基準で判定し、
成功すれば出力レジスタ(5)に結果を出力し、表示器
(6)に表示する。一般単語辞書検索器(3)の動作は
第2図の従来のものと同じである。
The searcher (161) searches the words stored in the provisional word dictionary α'71 using a rule different from the longest match method. For example, according to the "last in, fast out" rule, words are searched one by one from the last registered word. Extract in order and input register (2
) is searched based on whether the heading part is included. If the search fails, the search device (3) searches the general word dictionary (7), and if it succeeds, the search device (3) is skipped. A judger (4) judges the validity of the search results using system-specific judgment criteria,
If successful, the result is output to the output register (5) and displayed on the display (6). The operation of the general word dictionary search device (3) is the same as the conventional one shown in FIG.

暫定単語辞書検索器αeの動作を第4図について説明す
る。
The operation of the provisional word dictionary search device αe will be explained with reference to FIG.

抽出器21+は要求がある都度、辞書αηを一方向に従
って順番に1単語ずつ抽出し、見出しをレジスタ■に格
納する。比較器Q8)は入力レジスタ(2)と見出しレ
ジスタ@の内容を比較し、レジスタ@の内容がレジスタ
(2)の内容と先頭をあわせてレジスタ(2)の内容に
含まれるなら検索されたとし、て、その単語を出力し、
レジスタ(22)に格納する。ここで、レジスタ器の内
容の方が長い、或いは途中に不一致文字があった場合そ
の単語は不適として抽出器(21)に次の単語の抽出を
指示する。
The extractor 21+ sequentially extracts one word from the dictionary αη in one direction each time there is a request, and stores the heading in the register ■. Comparator Q8) compares the contents of input register (2) and header register @, and if the contents of register @ are included in the contents of register (2) including the contents of register (2) and the beginning, it is considered that the search has been performed. , print that word,
Store in register (22). Here, if the content of the register is longer or if there are unmatched characters in the middle, that word is deemed inappropriate and the extractor (21) is instructed to extract the next word.

なお暫定単語辞書07)への単語の登録は以下のように
行われる。即ちかな文字列入力器(1)には次候補選択
のためのキーが備わっており、ユーザは変換結果が所望
のものでない場合、該次候補キーを押してシステムに次
候補を検索させる。この次候補キーの押下により登録判
定回路(至)は検索後のdカレジスタ(5)の内容を登
録すべきものと判定し、登録回路01)は登録判定回路
■の指示に従い出力レジスタ(5)の内容を暫定単語辞
書07)に登録するとともに表示器(6)に送出する。
Note that the registration of words in the provisional word dictionary 07) is performed as follows. That is, the kana character string input device (1) is equipped with a key for selecting the next candidate, and if the user does not find the desired conversion result, the user presses the next candidate key to cause the system to search for the next candidate. By pressing this next candidate key, the registration determination circuit (to) determines that the contents of the d register (5) after the search should be registered, and the registration circuit 01) registers the output register (5) according to the instructions from the registration determination circuit ■. The contents are registered in the temporary word dictionary 07) and sent to the display device (6).

以上のように、この発明によればユーザにより指定され
た単語データを暫定的に格納する辞書を一般単語辞書と
は別個にシステム上に構築し、一般単語辞書の検索を行
う前に無条件に検索させるように構成し、しかもその検
索規則として一般単語辞書とは異なる検索の早い検索規
則を用いるようにしたので、見出し長の長短によっであ
る変換結果が頻度高く使用されるのに、つねに使用しな
い別の変換結果の次に出力されるという非効率を避ける
ことができる。
As described above, according to the present invention, a dictionary that temporarily stores word data specified by the user is built on the system separately from the general word dictionary, and before searching the general word dictionary, the dictionary is We configured the system to perform a search, and also used a quick search rule that differs from that of a general word dictionary, so even though a certain conversion result is frequently used depending on the length of the heading, it is always This avoids the inefficiency of outputting the result after another unused conversion result.

【図面の簡単な説明】[Brief explanation of drawings]

第1図は従来のかな漢字変換装置の構成図、第2図はそ
の単語検索回路の詳細回路図、第3図はこの発明の一実
施例によるかな漢字変換装置の構成図、第4図はその暫
定辞書部の検索回路図であ(1)・・・かな文字列入力
器、(3)・・・一般単語辞書検索器、(4)・・・判
定器、(7)・・・一般単語辞書、Q6+・・・暫定単
語辞書検索器、0η・・・暫定単語辞書、■・・・登録
判定回路、011・・・登録回路。 なお図中同一符号は同−又は相当部分を示す。 代理人   葛  野  信  − 第1図 第2図 第3図 第4図
Fig. 1 is a block diagram of a conventional kana-kanji conversion device, Fig. 2 is a detailed circuit diagram of its word search circuit, Fig. 3 is a block diagram of a kana-kanji conversion device according to an embodiment of the present invention, and Fig. 4 is a provisional diagram. The search circuit diagram of the dictionary section shows (1)...kana character string input device, (3)...general word dictionary search device, (4)...determiner, (7)...general word dictionary. , Q6+... Provisional word dictionary search device, 0η... Provisional word dictionary, ■... Registration determination circuit, 011... Registration circuit. Note that the same reference numerals in the figures indicate the same or equivalent parts. Agent Shin Kuzuno - Figure 1 Figure 2 Figure 3 Figure 4

Claims (1)

【特許請求の範囲】[Claims] (1)かな文字列を入力するためのかな文字列入力器と
、一般単語を登録している一般単語辞書と、該一般単語
辞書の検索規則により常に他の登録単語より検索順位が
後位となる単語を登録するための暫定単語辞書と、上記
かな文字列の見出しをもつ登録単語を一般単語用検索規
則とは異なる暫定単語用検索規則に基き上記暫定単語辞
書から検索する暫定単語辞書検索器と、上記暫定単語辞
書による検索失敗時に上記かな文字列の見出しをもつ登
録単語を一般単語用検索規則に基き上記一般単語辞書か
ら検索する一般単語辞書検索器と、上記暫定単語辞書検
索器または一般単語辞書検索器の検索結果が妥当か否か
を判定する判定器と、該判定器の判定結果が妥当である
場合上記一般単語辞書検索器の検索結果を上記暫定単語
辞書に登録すべきか否かを判定する登録判定回路と、該
判定結果に応じて上記一般単語辞書検索器の検索結果を
上記暫定単語辞書に登録する登録回路とを備えたことを
特徴とするかな漢字変換装置。
(1) A kana character string input device for inputting kana character strings, a general word dictionary in which general words are registered, and a search rule for the general word dictionary that will always rank lower in the search ranking than other registered words. a provisional word dictionary for registering words, and a provisional word dictionary search device for searching registered words having the heading of the kana character string from the provisional word dictionary based on provisional word search rules different from general word search rules. and a general word dictionary search device that searches the general word dictionary for registered words with the heading of the kana character strings based on search rules for general words when the search using the provisional word dictionary fails, and the temporary word dictionary search device or the general word dictionary. A determiner that determines whether the search results of the word dictionary search device are valid or not, and if the determination results of the determiner are valid, whether or not the search results of the general word dictionary search device should be registered in the provisional word dictionary. 1. A kana-kanji conversion device comprising: a registration determination circuit that determines whether or not a word is written; and a registration circuit that registers a search result of the general word dictionary search device in the provisional word dictionary according to the determination result.
JP57120223A 1982-07-09 1982-07-09 "kana" (japanese syllabary) and chinese character converting device Pending JPS599749A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP57120223A JPS599749A (en) 1982-07-09 1982-07-09 "kana" (japanese syllabary) and chinese character converting device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP57120223A JPS599749A (en) 1982-07-09 1982-07-09 "kana" (japanese syllabary) and chinese character converting device

Publications (1)

Publication Number Publication Date
JPS599749A true JPS599749A (en) 1984-01-19

Family

ID=14780924

Family Applications (1)

Application Number Title Priority Date Filing Date
JP57120223A Pending JPS599749A (en) 1982-07-09 1982-07-09 "kana" (japanese syllabary) and chinese character converting device

Country Status (1)

Country Link
JP (1) JPS599749A (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS554663A (en) * 1978-06-27 1980-01-14 Fujitsu Ltd Character row conversion processor
JPS5544670A (en) * 1978-09-26 1980-03-29 Fujitsu Ltd Kana-chinese character conversion process system

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS554663A (en) * 1978-06-27 1980-01-14 Fujitsu Ltd Character row conversion processor
JPS5544670A (en) * 1978-09-26 1980-03-29 Fujitsu Ltd Kana-chinese character conversion process system

Similar Documents

Publication Publication Date Title
JP3360693B2 (en) Customer information search method
JPS599749A (en) &#34;kana&#34; (japanese syllabary) and chinese character converting device
JPH07182333A (en) Japanese processor
JPH06325091A (en) Similarity evaluation type data base retrieval device
JPH1011431A (en) Kanji retrieval device and method
JP3187671B2 (en) Electronic dictionary display
JPH1185765A (en) Retrieval system for document with tag
JPH0746353B2 (en) Japanese text input device
JPS6172361A (en) Kana-to-kanji converter
JPS63229523A (en) Information processor
JPH03127254A (en) Word retrieving device
JPS595335A (en) Japanese language input device
JPH09114854A (en) Document retrieving system
JPH03118661A (en) Word retrieving device
JPS59106029A (en) Kana (japanese syllabary) kanji (chinese character) converter
JPS6180449A (en) Kana-to-kanji converter
JPS58144251A (en) Input device for chinese compound word
JPS60129880A (en) Kana/kanji conversion system of sentence processor
JPH0695330B2 (en) Document creation device
JPS6243769A (en) Kana-to-kanji converting device
JPS6073732A (en) External character retrieving system
JPS59221731A (en) Kana/kanji conversion processor
JPH09282316A (en) Kanji-to-kana conversion device
JPH01237877A (en) Kanji conversion system
JPS6015730A (en) Japanese word input device