JPS5887657A

JPS5887657A - Kana(japanese syllabary) kanji(chinese character) conversion processing system

Info

Publication number: JPS5887657A
Application number: JP56186627A
Authority: JP
Inventors: Hirokawa Hayashi; 林　大川; Yoshitoshi Yamauchi; 佐敏山内
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 1981-11-20
Filing date: 1981-11-20
Publication date: 1983-05-25

Abstract

PURPOSE:To decrease the time for the 1st conversion candidate display, by performing conversion operation through the replacement of the retrieval with a word dictionary into the retreval with the word dictionary of the 2nd ranking, in depressing the next candidate key. CONSTITUTION:A Kana-Kanji conversion processing is done with an input preprocessing section 1, a word pickup section 2, synonym discriminting section 3, an output control section 4, and a conversion control section 5, and when a Japanese sentence is inputted in Kana, the sentence is outputted as a sentence mixed with Kanji and Kana. In this system, the section 2 has a word dictionary which is stepwise sectioned into a plurality of areas depending on the frequency of usage. In the Kana-Kanji conversion processing, when the next candidate key instructing the next candidate is depressed by the number of the conversion candidates or more, that is, the candidate wanted by the operator is not obtained at the retrieval with the 1st word dictionary, further retrieval is done by using the 2nd word dictionary. In this system, the 2nd and 3rd candidates can be displayed in succesion to the candidate by the word dictionary having the word group of the highest frequency of usage by only depressing the next candidate key one after another.

Description

【発明の詳細な説明】本発明は邦文ワードプリセッサ等におけるカナ“漢字変
換処理方式に関し、特に文節区切り情報を与えるカナ漢
字変換処理方式における文書作成作業の処理速度を向上
さＪ−カナ漢字変換処理方式カナ漢字変換（以下、単に
「変換」ともいう）処理方式に関しては従来から種々の
方式が提案されている◎従来の変換方式においては、文
節指定。DETAILED DESCRIPTION OF THE INVENTION The present invention relates to a kana to kanji conversion processing method in a Japanese word preprocessor, etc., and in particular improves the processing speed of document creation work in a kana to kanji conversion processing method that provides bunsetsu separation information. Processing methods Various methods have been proposed for the processing method of kana-kanji conversion (hereinafter simply referred to as "conversion"). ◎In the conventional conversion method, phrase specification is used.

単語指定等の差はあっても、一般には、入力仮名文字列
に対する可能な変換文字列を全て探索して変換結果を出
力表示するものであった。そして、同音異字語（以下「
同音語」という）がある場合には最有力候補から変換文
字列を表示してオペレータに選択、確認を求め、オペレ
ータの意図しな一変換文字列のときは次候補キーを押す
ことにより、別の変換文字列が替って表示される。この
課程でオペレータの意図したものがあればオペレータが
選択キーを押すことにより変換文字列が確定するという
ものであった。Although there are differences in word specifications, etc., in general, all possible conversion character strings for an input kana character string are searched and the conversion results are output and displayed. and homophones (hereinafter “
If there is a conversion string (called a homonym), the most likely conversion string is displayed and the operator is asked to select and confirm. If the operator does not intend to use one conversion string, press the next candidate key to select another conversion string. The converted string will be displayed instead. During this process, if the operator finds something that he or she intended, the operator presses a selection key to confirm the converted character string.

しかしながら、上述の如き方式は、常に可能な変換文字
列を全て探索するため、処理時間が長くな〉、最初に変
換候補が表示されるまでに長時間を要するという重大な
欠点があった。However, the above-mentioned method always searches for all possible conversion character strings, so it has the serious disadvantage that the processing time is long, and it takes a long time until conversion candidates are displayed for the first time.

本発明は上記事情に鑑みてなされえもので、その目的と
するところは、従来の変換方式における上述の如き欠点
を除去して、最初の変換表示までの時間を短縮し、オペ
レータの使い勝手を良くし、文書作成作業の処理速度向
上を可能とするカナ漢字変換処理方式を提供することに
ある。The present invention was made in view of the above circumstances, and its purpose is to eliminate the above-mentioned drawbacks of conventional conversion methods, shorten the time until the first conversion display, and improve usability for the operator. The object of the present invention is to provide a kana-kanji conversion processing method that can improve the processing speed of document creation work.

本発明の上記目的祉、入力文を、自立語を中心とする分
かち書き単位に分解して仮名文字で入力し、これに対応
する漢字カナ混じシ文を逐次得るカナ漢字変換処理方式
において、使用頻度によって区分される２つ以上の単語
辞書を設けて、第１の単語辞書による探索におりる最有
力変換候補として出力表示された変換文字列がオペレー
タの意図しない文字列である場合に、次点となる候替を
次候補キーにより次々に表示させるとともに、上記次点
となる候補の中にオペレータの意図する文字列がない場
合に、更に前記次候補キーを押すことにより、次位の単
語辞書による探索に切換えて再び変換操作全行うことを
特徴とするカナ漢字変換処理方式により達成される。The purpose of the present invention is to provide a kana-kanji conversion processing method in which an input sentence is broken down into separated units centered on independent words and input as kana characters, and the corresponding kanji-kana-mixed sentences are sequentially obtained. By providing two or more word dictionaries divided by The following candidates are displayed one after another using the Next Candidate Key, and if the character string that the operator intends is not found among the runner-up candidates, by further pressing the Next Candidate Key, the next candidate word dictionary is displayed. This is achieved by using a kana-kanji conversion processing method that is characterized by switching to a search using , and then performing all conversion operations again.

以下、本発明の実施例を図面に基づいて詳細に説明する
。Embodiments of the present invention will be described in detail below with reference to the drawings.

第１図は本発明の一実施例であるカナ漢字変換処理のブ
ロック図である。　図において、１は入力前処理部、２
ｔｉ単語抽出部、３は同音語判別部、４社出力制御部モ
して６は変換制御部である。FIG. 1 is a block diagram of kana-kanji conversion processing according to an embodiment of the present invention. In the figure, 1 is an input preprocessing unit, 2
ti word extraction section, 3 a homophone discrimination section, 4 company output control section, and 6 a conversion control section.

日本語文が仮名文で入力されると、以下の如き処理を経
て漢字カナ混じり文として出力される。When a Japanese sentence is input as a kana sentence, it is output as a mixed kanji/kana sentence through the following processing.

入力前処理部１は、入力仮名文中の英数字、文節区切〉
情報等を認識して変換対象となる仮名文字列を抽出し、
変換制御部５の制御の下に単語抽出！２に、変換単位と
なる仮名文字列を渡す。単語抽出部２社前記仮名文字列
から、カナを見出しとする単語辞書および該単語辞書に
付加されている語の品詞情報、品詞側の接続情報を納め
た辞書等を参照し、前記仮名文字列と辞書見出しとの一
致を試み、文法的に許容される単語列の候補を抽出する
０　同音語判別部３は、上記単語列の候補が複数個存在
する場合に、単語の持つ頻＊１＊報等を用いて最有力単
語列を決定する。　出力制御部４は、上述の如く決定さ
れた文節の対応文字を出力表示装置に表示する。その後
、入力前処理部１が制御キーの次候補を指示する制御信
号を受ゆると、初めに表示した最有力候補の代シに次点
となった候補を表示する。The input preprocessing unit 1 processes alphanumeric characters and phrase breaks in the input kana sentence.
Recognizes information etc. and extracts the kana character string to be converted,
Extract words under the control of the conversion control unit 5! Pass the kana character string that will be the conversion unit to 2. Two word extraction units The homophone discrimination unit 3 attempts to match the word string with the dictionary entry and extracts grammatically permissible word string candidates. Determine the most likely word string using information such as information. The output control unit 4 displays the corresponding characters of the clause determined as described above on the output display device. Thereafter, when the input preprocessing section 1 receives a control signal instructing the next candidate for the control key, the second candidate is displayed in place of the most likely candidate that was initially displayed.

以下、本発明の要点である単語抽出部２について説明す
る。The word extraction section 2, which is the main point of the present invention, will be explained below.

単語抽出部は使用頻度によって次に示す如く段階的に検
数に区分された単語辞書を有し、入力される仮名文字列
との照合により単語列の抽出を行っている。The word extraction unit has a word dictionary that is divided into stages according to the frequency of use as shown below, and extracts word strings by comparing them with input kana character strings.

（１）■ユーザ登録の自立語の暫定語 ■出現頻度の高い自立語の重要語（１）その他の自立語ここで、（■）の■、■は高頻度で使用されると考えら
れる語に相当し、（勇は必要ではあるが余り頻繁には使
用されない語に相当する。(1) ■ Provisional words for independent words registered by users ■ Important words for independent words that appear frequently (1) Other independent words Here, ■ and ■ in (■) are words that are considered to be used frequently. (Brave is a word that is necessary but not used very often.

上記各群を有する単語辞書はそれぞれ単独に使用するこ
とも可能であり、また、適宜組合わせて使用することも
可能である。The word dictionaries having each of the above groups can be used individually, or in combination as appropriate.

本実施例のカナ漢字変換処理においては１第１図に示し
た入力前処理部に入力される制御キーによ抄１前記次候
補を指示する次候補キーが、前記（υの■および■の各
群の語を有する単語辞書によるいわば第１次の変換候補
探索範囲内の探索によって得られた変換候補の数似上に
押されたとき）すなわち、前記（わの■および■の各群
の語を有する単語辞書による探索でオペレータの欲する
候補がなかった場合に、変換候補探索範囲を前記（幻に
属する語を有する単語辞書に切換えて更に探索を行わ艙
るようにして−る。In the kana-kanji conversion process of this embodiment, the next candidate key for instructing the next candidate for the excerpt (1) by the control key input to the input preprocessing unit shown in FIG. When the number of conversion candidates obtained by searching within the so-called first conversion candidate search range using the word dictionary having words of each group is If the operator does not find the desired candidate in the search using the word dictionary containing the word, the conversion candidate search range is switched to the word dictionary containing the word belonging to the phantom category, and further search is carried out.

また、前記（Ｉ）の■および■の各群の語を有する単語
辞書を段階的に用いて、まず、前記（１）の■に属する
語を有する単語辞書のみを用いて探索操作を行い、これ
が不成功であった場合には前記（Ｄの■に属する語を有
する単語辞書による探索を行い１これも不成功に終った
場合に杜、更に探索範囲を前記（船に属する語を有する
単語辞書にまで拡大するようにすることも可能である・更に、上記方式に、いわゆる最長一致法による探索操作
と、総当り方による探索操作等による照合方式の切換を
組合オ〕せても良い。　最長一致法による抽出の場合は
、一旦、ある長さの自立語の照合によシその文節から抽
出特定された変換候補群の中にはその自立語より短−自
立語を構成要素とする変換文字列は与えられな−が、総
当り法による抽出ではこれらも漏れることがないからで
ある。In addition, the word dictionaries having words in groups ``■'' and ``■'' in (I) are used step by step, and a search operation is first performed using only the word dictionary having words belonging to group ``■'' in (1). If this is unsuccessful, a search is performed using the word dictionary containing words belonging to (D).1 If this is also unsuccessful, the search range is changed to It is also possible to extend the search to a dictionary.Furthermore, the above method may be combined with a search operation using a so-called longest match method and a matching method switching using a search operation using a brute force method. In the case of extraction using the longest match method, once an independent word of a certain length is matched, among the conversion candidates extracted from that clause, there are conversions whose constituent elements are shorter independent words than the independent word. Although character strings are not given, extraction by brute force method will not omit them.

上記実施例の変換処理方式においては、次候補キーを次
々に押すたけで、使用頻度第１位の単語群を有する単語
辞書によって得られた変換候補に引続いて、第２位、第
３位の単語辞書内の探索によって得られた変換候補が自
動的に表示されるため、オペレータに負担を感じさせる
ことがな一０上記実施例に示した単語辞書の段階的区分
は一例であり、本発明けこれに限られる亀のでな−こと
は言うまでもない。In the conversion processing method of the above embodiment, by simply pressing the next candidate key one after another, the second and third most frequently used words are Since the conversion candidates obtained by searching the word dictionary are automatically displayed, the operator does not feel burdened. It goes without saying that turtles are limited to inventions.

以上述べた如く、本発明によれば、入力文を、自立語を
中心とする分かち書き単位に分解して仮名文字で入力し
、これに対応する漢字カナ混じり文を途次得るカナ漢字
変換処理方式において、使用頻度によって区分される２
つ以上の単語辞書を設けて、第１の単語辞書による探索
における最有力変換候補として出力表示された変換文字
列がオペレータの意図しない文字列である場合に、次点
となる候補を次候補キーにより次々に表示させるととも
に、上記次点となる候補の中にオペレータの意図する文
字列がない場合に、更に前記次候補キーを押すことによ
り、次位の単語辞書による探索に切換えて再び変換操作
を行わせるようにしたので、最初の変羨候袴表示までの
時間を短縮し、オペレータの使−勝手を改善し、文書作
成作業の処理速度を向上させることの可能なカナ漢字変
換処理方式を実現すると−う顕着な効果を秦するもので
ある。As described above, according to the present invention, an input sentence is broken down into division units centered on independent words, input as kana characters, and corresponding sentences containing kanji and kana are successively obtained. 2, categorized by frequency of use.
If more than one word dictionary is provided, and the converted character string output and displayed as the most likely conversion candidate in the search using the first word dictionary is a character string that is not intended by the operator, the next candidate is selected as the next candidate key. If the character string intended by the operator is not found among the runner-up candidates, by pressing the next candidate key, the search is switched to the next word dictionary and the conversion operation is performed again. As a result, we have developed a kana-kanji conversion processing method that can shorten the time until the first henenryo hakama is displayed, improve usability for the operator, and increase the processing speed of document creation work. When realized, it will have a significant effect.

[Brief explanation of the drawing]

第１図社本発明の一実施例であるカナ漢字変換処理のプ
ルツク図である。１：入力前処理部、２：単語抽出部、３＝同音語判別部
、４＝出力制御部、５：変換制御部。セ　　運FIG. 1 is a pull diagram of kana-kanji conversion processing which is an embodiment of the present invention. 1: Input pre-processing unit, 2: Word extraction unit, 3 = Homophone discrimination unit, 4 = Output control unit, 5: Conversion control unit. se luck

Claims

[Claims]

In the kana-kanji conversion processing method, an input sentence is broken down into division units centered on self-contained candy and input as kana characters, and the corresponding sentences containing kanji and kana are sequentially obtained. A word dictionary is provided, and when the converted character string output and displayed as the most likely conversion candidate in the search using the first word dictionary is a character string not intended by the operator, the runner-up candidate is selected as the second candidate. In addition to outputting and displaying in the next layer, if the character string intended by the operator is not among the runner-up candidates, by pressing the next candidate key again, the search is switched to the next word dictionary and the search is performed again. A kana-kanji conversion processing method characterized by performing conversion operations.