JP3466669B2

JP3466669B2 - Character processing method

Info

Publication number: JP3466669B2
Application number: JP21403093A
Authority: JP
Inventors: 治樹中越
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 1993-08-30
Filing date: 1993-08-30
Publication date: 2003-11-17
Anticipated expiration: 2018-11-17
Also published as: JPH0764984A

Description

Detailed Description of the Invention

[0001]

[Industrial field of application] The present invention relates to a character processing method.

[0002]

[Prior art]

(1) As shown in FIG. 2, in the conventional phrase candidate registration process of kana-kanji conversion, independent part information is first extracted in step 21 from the processing target independent part information group. In step 23, the extracted independent part information is registered in the independent part information storage area of the phrase table.

[0003] In step 22, once all the independent part information has been extracted from the processing target independent part information group, the process proceeds to the next step. In step 24, information such as the independent part information registered in the independent part information storage area of the phrase table is set as the phrase candidate creation information used to create a phrase candidate. In step 26, a phrase candidate is created based on the phrase candidate creation information set in step 24.

[0004] In step 27, the result of phrase candidate creation is checked. If a phrase candidate was created, the process proceeds to the next step; if not, the process moves to step 24 so that the next independent part information is processed. In step 28, the created phrase candidate is registered in the phrase candidate storage area of the phrase table. In step 25, the process ends once phrase candidate creation has been performed for all the independent part information registered in the independent part information storage area of the phrase table.
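The conventional flow of FIG. 2 can be sketched roughly as follows. This is a minimal illustration only: the function names, the use of Python lists for the two storage areas, and the toy `toy_create` rule are assumptions that do not appear in the source.

```python
def prior_art_register(independent_part_group, create_candidate):
    """Hypothetical sketch of the conventional flow of FIG. 2 (steps 21 to 28)."""
    # Steps 21-23: every independent part is registered up front,
    # whether or not a phrase candidate can later be built from it.
    independent_part_area = list(independent_part_group)
    phrase_candidate_area = []
    # Steps 24-25: walk over the registered independent parts.
    for part in independent_part_area:
        candidate = create_candidate(part)           # step 26
        if candidate is not None:                    # step 27
            phrase_candidate_area.append(candidate)  # step 28
    return independent_part_area, phrase_candidate_area


def toy_create(part):
    # Hypothetical stand-in: only parts "A" and "C" yield a phrase candidate.
    return part + "-phrase" if part in {"A", "C"} else None


parts, candidates = prior_art_register(["A", "B", "C"], toy_create)
```

In this sketch the part "B" stays in the independent part area even though no phrase candidate was created from it.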

[0005] (2) As shown in FIG. 7, in another conventional phrase candidate registration process of kana-kanji conversion, a part-of-speech table is first created from the extracted independent part information, a phrase candidate is created based on that table, and the candidate is registered in the phrase candidate registration area of the phrase table. At the same time, the independent part information is registered in the independent part information registration area of the phrase table.

[0006] (3) Conventionally, in the phrase candidate registration process of kana-kanji conversion, when a phrase candidate table such as that of FIG. 12 exists and the phrase candidate stored in the phrase candidate buffer is registered, the shortest phrase candidate is deleted and the candidate is registered as shown in FIG. 13.

[0007]

[Problems to be solved by the invention] In conventional character processing method (1) described above, all the independent part information in the processing target independent part information group is registered in the independent part information storage area of the phrase table. Independent part information for which no phrase candidate could be created is therefore registered as well. This useless registration slows down the phrase candidate registration process of kana-kanji conversion.

[0008] In conventional character processing method (2) described above, phrase candidate creation is performed for every part-of-speech table that is created. Identical part-of-speech tables are sometimes created, and phrase candidate creation is performed for them as well, so the same processing is repeated. This also slows down the phrase candidate registration process of kana-kanji conversion.

[0009] Furthermore, in conventional character processing method (3) described above, when a punctuation mark, a conversion start code, or the like is attached to a phrase, that phrase is converted with priority and the other phrase candidates are excluded from conversion. Nevertheless, those excluded phrase candidates were also registered in the phrase candidate table.

[0010] An object of the present invention is therefore to provide a character processing method that solves the problems described above.

[0011]

[Means for solving the problems] To achieve the above object, the present invention is a character processing method executed by an information processing system having a memory that stores a processing target independent part information group and a phrase table, the method comprising: a step of extracting independent part information from the processing target independent part information group in the memory; a step of creating a phrase candidate based on the extracted independent part information; and a step of registering the created phrase candidate, and the independent part information for which that phrase candidate could be created, in the phrase candidate storage area and the independent part information storage area, respectively, of the phrase table in the memory.

[0012] The present invention is further a character processing method executed by an information processing system that has a memory storing a phrase candidate table and conversion priority phrase length data and that has a phrase candidate buffer, the method comprising: a target phrase registrability determination step of determining, based on the conversion priority phrase length data in the memory, whether the phrase candidate in the phrase candidate buffer can be registered in the phrase candidate table in the memory; and a registration step of registering, in the phrase candidate table in the memory, the phrase candidate determined to be registrable in the target phrase registrability determination step.

[0013]

[0014]

[Embodiments] Embodiments of the present invention will now be described in detail with reference to the drawings.

[0015] <Embodiment 1> Embodiment 1 of the present invention will be described in detail with reference to the drawings.

[0016] FIG. 3 is a block diagram showing the configuration of an information processing system that executes the character processing method common to the embodiments of the present invention. In FIG. 3, reference numeral 1 denotes a central processing unit that executes character processing including kana-kanji conversion; it executes the character processing procedures shown in FIGS. 1, 6, and 11, which are stored in the ROM 2. The ROM 2 stores the information used when executing the character processing procedures shown in FIGS. 1, 6, and 11. The display device 3 outputs, among other things, the results of kana-kanji conversion. The input device 4, such as a keyboard, is used to input reading strings and the like. The RAM 5 provides the work area of the central processing unit 1, that is, an area for temporarily storing information such as phrase data.

[0017] First, the basic processing of Embodiment 1 will be described with reference to FIG. 4.

[0018] In the phrase candidate registration process of kana-kanji conversion in this embodiment, the processing target independent part information group, created for example by the independent part information extraction process of kana-kanji conversion, is first stored in the RAM 5. Independent part information is extracted from that group, and a phrase candidate is created based on information such as that independent part information. When a phrase candidate has been created, the phrase candidate and the independent part information for which it could be created are registered in the phrase candidate storage area and the independent part information storage area of the phrase table.

[0019] FIG. 1 is a flowchart showing the processing of this embodiment.

[0020] First, in step 11, independent part information is extracted from the processing target independent part information group. In step 12, once all the independent part information has been extracted from the processing target independent part information group, the process proceeds to the next step.

[0021] In step 13, information such as the extracted independent part information is set as the phrase candidate creation information used to create a phrase candidate.

[0022] In step 14, a phrase candidate is created based on the phrase candidate creation information set in step 13.

[0023] In step 15, the result of phrase candidate creation is checked. If a phrase candidate was created, the process proceeds to the next step; if not, the process returns to step 11 to process the next independent part information.

[0024] Then, in step 16, the independent part information for which a phrase candidate could be created is registered in the independent part information storage area of the phrase table, and in step 17, the created phrase candidate is registered in the phrase candidate storage area of the phrase table.
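Steps 11 to 17 can be sketched roughly as follows. This is a minimal illustration under stated assumptions: the `PhraseTable` class, the dictionary used as phrase candidate creation information, and the `toy_create` rule are hypothetical and not taken from the source, and step 12 is read here as the loop's termination check.

```python
from dataclasses import dataclass, field


@dataclass
class PhraseTable:
    """Hypothetical model of the phrase table of FIG. 5."""
    independent_parts: list = field(default_factory=list)  # independent part information storage area
    phrase_candidates: list = field(default_factory=list)  # phrase candidate storage area


def register_phrase_candidates(independent_part_group, create_candidate):
    table = PhraseTable()
    for part in independent_part_group:               # steps 11-12: extract until exhausted
        creation_info = {"independent_part": part}    # step 13: set creation information
        candidate = create_candidate(creation_info)   # step 14: try to create a candidate
        if candidate is None:                         # step 15: creation failed,
            continue                                  #          go back to step 11
        table.independent_parts.append(part)          # step 16: register the independent part
        table.phrase_candidates.append(candidate)     # step 17: register the candidate
    return table


def toy_create(creation_info):
    # Hypothetical stand-in: only parts "A" and "C" yield a phrase candidate.
    part = creation_info["independent_part"]
    return part + "-phrase" if part in {"A", "C"} else None


table = register_phrase_candidates(["A", "B", "C"], toy_create)
```

Because registration happens only after step 15 succeeds, the part "B" never reaches the independent part information storage area in this sketch.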

[0025] FIG. 5 is a diagram showing the structure of the phrase table.

[0026] The phrase table consists of a phrase candidate storage area, an independent part information storage area, and other information.

[0027] As described above, according to Embodiment 1, only the independent part information for which a phrase candidate could be created is registered in the independent part information storage area of the phrase data. Registration of unnecessary independent part information, from which no phrase candidate can be created, is therefore omitted, which speeds up the phrase candidate registration process of kana-kanji conversion. Because fewer items are registered, the required storage area is also reduced.

[0028] <Embodiment 2> First, the basic processing of Embodiment 2 will be described with reference to FIG. 8.

[0029] In the phrase candidate registration process of kana-kanji conversion in this embodiment, a part-of-speech table is first created from the independent part information extracted from the processing target, for example by the independent part information extraction process. The process then checks whether an identical table is already registered in the valid part-of-speech table or the invalid part-of-speech table of the part-of-speech table storage area. If an identical one is registered, the part-of-speech table is a duplicate and the process ends.

[0030] Phrase candidate creation is then performed based on the part-of-speech table. The part-of-speech table is registered in the valid part-of-speech table storage area if a phrase candidate could be created, and in the invalid part-of-speech table storage area if not. If no phrase candidate could be created, the process ends; if one was created, the process moves on to the next stage.

[0031] The independent part information of the created phrase candidate is registered in the independent part information storage area of the phrase table, and the phrase candidate is registered in the phrase candidate storage area of the phrase table.

[0032] FIG. 6 is a flowchart showing the processing of this embodiment.

[0033] In step 31, the independent part information extracted from the processing target, for example by the independent part information extraction process, is retrieved. In step 32, it is determined whether the independent part information could not be retrieved in step 31; if it could not, the process ends.

[0034] In step 33, a part-of-speech table is created from the retrieved independent part information. (A part-of-speech table stores detailed information describing the independent part information.) In step 34, the part-of-speech table created in step 33 is compared with the part-of-speech tables stored in the valid part-of-speech table and the invalid part-of-speech table of the part-of-speech table storage area to determine whether an identical one exists.

[0035] In step 35, it is determined whether an identical part-of-speech table was already registered; if so, the process ends.

[0036] In step 36, phrase candidate creation is performed based on the created part-of-speech table.

[0037] In step 37, the result of phrase candidate creation is checked. If a phrase candidate was created, the process moves to step 38; if not, it moves to step 41.

[0038] In step 38, the part-of-speech table of the created phrase candidate is registered in the valid part-of-speech table storage area of the part-of-speech table storage area.

[0039] Then, in step 39, the independent part information of the created phrase candidate is registered in the independent part information storage area of the phrase table, and in step 40, the created phrase candidate is registered in the phrase candidate storage area of the phrase table.

[0040] In step 41, the part-of-speech table for which no phrase candidate could be created is registered in the invalid part-of-speech table storage area of the part-of-speech table storage area.
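Steps 31 to 41 can be sketched roughly as follows. This is a minimal illustration under stated assumptions: the class and function names are hypothetical, the part-of-speech table is modeled as a hashable tuple, and the valid and invalid storage areas are modeled as Python sets so that the comparison of step 34 becomes a membership test. The source does not say how the comparison is implemented.

```python
from dataclasses import dataclass, field


@dataclass
class PosTableStore:
    """Hypothetical model of the part-of-speech table storage area of FIG. 9."""
    valid: set = field(default_factory=set)    # tables that produced a phrase candidate
    invalid: set = field(default_factory=set)  # tables that produced none


@dataclass
class PhraseTable:
    """Hypothetical model of the phrase table of FIG. 10."""
    independent_parts: list = field(default_factory=list)
    phrase_candidates: list = field(default_factory=list)


def register_one(part, store, table, make_pos_table, create_candidate):
    if part is None:                                   # steps 31-32: nothing retrieved
        return "no_input"
    pos_table = make_pos_table(part)                   # step 33
    if pos_table in store.valid or pos_table in store.invalid:  # steps 34-35
        return "duplicate"                             # identical table seen before
    candidate = create_candidate(pos_table)            # step 36
    if candidate is None:                              # step 37
        store.invalid.add(pos_table)                   # step 41
        return "invalid"
    store.valid.add(pos_table)                         # step 38
    table.independent_parts.append(part)               # step 39
    table.phrase_candidates.append(candidate)          # step 40
    return "registered"


creation_calls = []


def toy_pos_table(part):
    # Hypothetical: parts differing only in case map to the same table.
    return (part.lower(),)


def toy_create(pos_table):
    # Hypothetical: records each call; the table ("bad",) yields no candidate.
    creation_calls.append(pos_table)
    return None if pos_table == ("bad",) else "phrase:" + pos_table[0]


store, table = PosTableStore(), PhraseTable()
results = [register_one(p, store, table, toy_pos_table, toy_create)
           for p in ["Noun", "noun", "bad", "BAD"]]
```

In this usage, four inputs trigger only two candidate creation attempts, because the second and fourth inputs produce part-of-speech tables that are already stored in the valid or invalid area.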

[0041] FIG. 9 is a diagram showing the structure of the part-of-speech table storage area. The part-of-speech table storage area consists of information such as a valid part-of-speech table storage area and an invalid part-of-speech table storage area.

[0042] FIG. 10 is a diagram showing the structure of the phrase table. The phrase table consists of information such as a phrase candidate storage area and an independent part information storage area.

[0043] As described above, according to Embodiment 2, duplicate registration of part-of-speech tables is avoided, so the number of phrase candidate creation operations is reduced and the phrase candidate registration process of kana-kanji conversion is sped up.

[0044] <Embodiment 3> First, the basic processing in the phrase candidate registration process of kana-kanji conversion in Embodiment 3 will be described with reference to FIGS. 14 to 22.

[0045] FIGS. 14 to 17 illustrate the case where a conversion priority phrase is registered for the first time.

[0046] In FIG. 14, a phrase candidate created, for example, by the phrase candidate creation process of kana-kanji conversion is stored in the phrase candidate buffer. There is also a conversion priority phrase length, in which an invalid value is stored at the start of processing, and a phrase candidate table that is still being built.

[0047] The description takes as an example the case where, in this state, a conversion priority phrase is stored in the phrase candidate buffer. (A conversion priority phrase is a phrase with a punctuation mark, a conversion start code, or the like following at its end.) In FIG. 15, an invalid value is stored in the conversion priority phrase length and the phrase candidate in the phrase candidate buffer is a conversion priority phrase, so the phrase length of that candidate is obtained and registered as the conversion priority phrase length.

[0048] In FIG. 16, once the conversion priority phrase length has been registered, the phrase candidate table is searched and all phrases shorter than the conversion priority phrase length are deleted, removing the invalid phrase candidates from the phrase candidate table.

[0049] FIG. 17 shows the state after the phrase candidate registration process has registered the phrase candidate stored in the phrase candidate buffer in the phrase candidate table.

[0050] FIGS. 18 and 19 illustrate the case where a conversion priority phrase has already been registered in the phrase candidate table and a non-priority phrase is to be registered.

[0051] In FIG. 18, a phrase candidate created, for example, by the phrase candidate creation process of kana-kanji conversion is stored in the phrase candidate buffer. The conversion priority phrase length holds the phrase length of the conversion priority phrase registered in the phrase candidate table. There is also a phrase candidate table in which the conversion priority phrase is registered.

[0052] The description takes as an example the case where, in this state, a non-priority phrase is stored in the phrase candidate buffer. (A non-priority phrase is an ordinary phrase that has no punctuation mark, conversion start code, or the like following at its end.) In FIG. 19, the conversion priority phrase length holds the phrase length of the conversion priority phrase, and the phrase candidate in the phrase candidate buffer is shorter than the conversion priority phrase. The candidate is therefore determined to be unregistrable, and the process ends without performing phrase registration.

[0053] FIGS. 20 to 22 illustrate the case where a non-priority phrase is registered while no conversion priority phrase is registered.

[0054] In FIG. 20, a phrase candidate created, for example, by the phrase candidate creation process of kana-kanji conversion is stored in the phrase candidate buffer. Because no conversion priority phrase is registered in the phrase candidate table, the conversion priority phrase length holds an invalid value. There is also a phrase candidate table in which phrase candidates are registered.

[0055] The description takes as an example the case where, in this state, a non-priority phrase is stored in the phrase candidate buffer.

[0056] In FIG. 21, an invalid value is registered in the conversion priority phrase length, so it is determined that the non-priority phrase in the phrase candidate buffer can be registered in the phrase candidate table.

[0057] FIG. 22 shows the state after the phrase candidate registration process has registered the phrase candidate stored in the phrase candidate buffer in the phrase candidate table.

[0058] FIG. 11 is a flowchart showing the processing of this embodiment.

[0059] First, in step 51, whether registration in the phrase candidate table is possible is determined from, among other things, the state of the phrase candidate in the phrase candidate buffer and the registration state of the conversion priority phrase length.

[0060] In step 52, if the determination shows that the phrase candidate in the phrase candidate buffer can be registered, the process proceeds to the next step; if it cannot be registered, the process ends.

[0061] In step 53, if the phrase candidate in the phrase candidate buffer is a conversion priority phrase, the process proceeds to the next step; otherwise, it moves to step 60.

[0062] In step 54, whether a conversion priority phrase is registered in the phrase candidate table is determined by checking whether the conversion priority phrase length holds the phrase length of a conversion priority phrase or an invalid value.

[0063] In step 55, if it has been determined that no conversion priority phrase is registered in the phrase candidate table, the process proceeds to the next step; otherwise, it moves to step 60.

[0064] In step 56, the phrase length of the conversion priority phrase is obtained and registered as the conversion priority phrase length.

[0065] In step 57, it is determined whether any phrase candidate whose length is equal to or less than the conversion priority phrase length is registered in the phrase candidate table.

[0066] In step 58, if a phrase candidate whose length is equal to or less than the conversion priority phrase length is registered in the phrase candidate table, the process proceeds to the next step; otherwise, it moves to step 60.

[0067] In step 59, all phrase candidates registered in the phrase candidate table whose length is equal to or less than the conversion priority phrase length are deleted.

[0068] Then, in step 60, the phrase candidate stored in the phrase candidate buffer is registered in the phrase candidate table.
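Steps 51 to 60 can be sketched roughly as follows. This is a minimal illustration under stated assumptions: all names are hypothetical, the invalid value of the conversion priority phrase length is modeled as `None`, and the exact rule of step 51 is not spelled out in the source. Here a conversion priority phrase is always treated as registrable, and a non-priority phrase is treated as registrable only if it is longer than the conversion priority phrase length. The deletion follows steps 57 to 59 ("equal to or less than"), whereas the description of FIG. 16 says "shorter than"; which boundary is intended is not certain.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass(frozen=True)
class Candidate:
    """Hypothetical phrase candidate held in the phrase candidate buffer."""
    text: str
    length: int
    is_priority: bool  # True if followed by punctuation, a conversion start code, etc.


@dataclass
class State:
    priority_len: Optional[int] = None         # None models the invalid value
    table: list = field(default_factory=list)  # phrase candidate table


def can_register(candidate, state):
    """Step 51, under the assumptions stated in the lead-in."""
    if state.priority_len is None:   # no conversion priority phrase registered yet
        return True
    if candidate.is_priority:        # assumption: priority phrases always pass
        return True
    return candidate.length > state.priority_len  # assumption: boundary not in source


def register(candidate, state):
    if not can_register(candidate, state):                    # steps 51-52
        return False
    if candidate.is_priority and state.priority_len is None:  # steps 53-55
        state.priority_len = candidate.length                 # step 56
        # Steps 57-59: drop candidates whose length is <= the priority length.
        state.table = [c for c in state.table if c.length > state.priority_len]
    state.table.append(candidate)                             # step 60
    return True


state = State()
short = Candidate("short", 2, False)
long_ = Candidate("long", 5, False)
prio = Candidate("prio", 3, True)
outcomes = [register(short, state), register(long_, state),
            register(prio, state), register(Candidate("late", 2, False), state)]
```

In this usage, the first two candidates are registered while the priority length is still invalid, registering the priority phrase removes the length-2 candidate, and the final length-2 non-priority candidate is rejected at step 51.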

[0069] As described above, according to Embodiment 3, when a conversion priority phrase exists, only that phrase is registered in the phrase candidate table, so registration of non-priority phrases can be omitted and the phrase candidate registration process of kana-kanji conversion is sped up. In addition, because no useless phrase candidates are registered in the resulting phrase candidate table, processing that refers to the phrase candidate table is also sped up.

[Brief description of drawings]

[FIG. 1] A flowchart showing the processing of Embodiment 1.

[FIG. 2] A flowchart showing the processing of the conventional technique.

[FIG. 3] A block diagram showing the configuration of an information processing system in which the character processing of the present invention is performed.

[FIG. 4] A diagram showing the basic processing of Embodiment 1.

[FIG. 5] A diagram showing the phrase table storage area.

[FIG. 6] A flowchart showing the processing of Embodiment 2.

[FIG. 7] A diagram showing the processing of the conventional technique.

[FIG. 8] A diagram showing the basic processing of Embodiment 2.

[FIG. 9] A diagram showing the part-of-speech table storage area.

[FIG. 10] A diagram showing the phrase table storage area.

[FIG. 11] A flowchart showing the processing of Embodiment 3.

[FIG. 12] A flowchart showing the processing of the conventional technique.

[FIG. 13] Another flowchart showing the processing of the conventional technique.

[FIG. 14] A diagram showing part of the basic processing of Embodiment 3.

[FIG. 15] A diagram showing another part of the basic processing of Embodiment 3.

[FIG. 16] A diagram showing still another part of the basic processing of Embodiment 3.

[FIG. 17] A diagram showing still another part of the basic processing of Embodiment 3.

[FIG. 18] A diagram showing still another part of the basic processing of Embodiment 3.

[FIG. 19] A diagram showing still another part of the basic processing of Embodiment 3.

[FIG. 20] A diagram showing still another part of the basic processing of Embodiment 3.

[FIG. 21] A diagram showing still another part of the basic processing of Embodiment 3.

[FIG. 22] A diagram showing still another part of the basic processing of Embodiment 3.

[Explanation of symbols]

1 Central processing unit
2 ROM
3 Display device
4 Input device
5 RAM

Continuation of the front page

(58) Fields surveyed (Int. Cl.7, DB name): G06F 17/22 514; JICST file (JOIS)

Claims

(57) [Claims]

1. A character processing method executed by an information processing system having a memory that stores a processing target independent part information group and a phrase table, the method comprising: a step of extracting independent part information from the processing target independent part information group in the memory; a step of creating a phrase candidate based on the extracted independent part information; and a step of registering the created phrase candidate, and the independent part information for which that phrase candidate could be created, in the phrase candidate storage area and the independent part information storage area, respectively, of the phrase table in the memory.

2. A character processing method executed by an information processing system that has a memory storing a phrase candidate table and conversion priority phrase length data and that has a phrase candidate buffer, the method comprising: a target phrase registrability determination step of determining, based on the conversion priority phrase length data in the memory, whether the phrase candidate in the phrase candidate buffer can be registered in the phrase candidate table in the memory; and a registration step of registering, in the phrase candidate table in the memory, the phrase candidate determined to be registrable in the target phrase registrability determination step.

3. The character processing method according to claim 2, further comprising: a conversion priority phrase determination step of determining, when registration is determined to be possible in the target phrase registrability determination step, whether the phrase candidate determined to be registrable is a conversion priority phrase; a conversion priority phrase registered determination step of determining whether a conversion priority phrase is registered in the phrase candidate table; a conversion priority phrase length setting step of setting in the memory, as the conversion priority phrase length data, the length of the phrase candidate determined to be a conversion priority phrase, when the candidate is determined to be a conversion priority phrase in the conversion priority phrase determination step and a conversion priority phrase is determined to be unregistered in the conversion priority phrase registered determination step; and a deletion step of deleting from the phrase candidate table, when phrases of that length or shorter are registered in the phrase candidate table, the registered phrase candidates of that length or shorter.