JPH06203014A

JPH06203014A - Kana to kanji converter

Info

Publication number: JPH06203014A
Application number: JP5187563A
Authority: JP
Inventors: Yasuo Koyama; 泰男小山
Original assignee: Seiko Epson Corp
Current assignee: Seiko Epson Corp
Priority date: 1993-06-29
Filing date: 1993-06-29
Publication date: 1994-07-22
Anticipated expiration: 2014-01-13
Also published as: JP2845096B2

Abstract

PURPOSE:To constitute a converter so that when other character, etc., than a KANA (Japanese syllabary) are inputted, its character, etc., are set as delimiters for a grammatical analysis, the grammatical analysis is executed, and thereafter, KANA to KANJI (Chinese character) conversion is executed. CONSTITUTION:When a character is inputted (701), the kind of the character is discriminated (702). In the case the inputted character is a KANA, 4a write processing is executed from a phrase portion by a prescribed grammatical analysis together with a KANA character-string inputted alreaby (708 to 711), and it is outputted as a KANA and KANJI mixed phrase candidate (715). In the case the inputted character is other character, etc., than a KANA, the connection with the KANA character-string inputted already is not executed, and its character, etc., become delimiters for a grammatical analysis (704). In this case, the inputted character, etc., form a phrase by one character (705). Thus, the delimiter for the grammatical analysis is defined, and the conversion efficiency can be improved.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、日本語文章を入力する
為に入力された仮名文字列を、仮名漢字混じり文章に変
換する仮名漢字変換装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a kana-kanji conversion device for converting a kana character string input for inputting a Japanese sentence into a sentence mixed with kana-kanji.

【０００２】[0002]

【従来の技術】従来の仮名漢字変換装置では、仮名漢字
変換を行なう場合、総ての仮名文字列を入力した後、変
換キー押下又はそれに同等の変換指示を使用者が行なっ
ている。また、入力された長い仮名文字列を仮名漢字混
じり文に変換するのに、２文節最長一致法などの解析処
理を、入力された仮名文字列全体に適用していた。2. Description of the Related Art In a conventional kana-kanji conversion device, when performing kana-kanji conversion, the user presses the conversion key or equivalent conversion instruction after inputting all kana character strings. In addition, in order to convert an input long kana character string into a kana-kanji mixed sentence, an analysis process such as the two-segment longest matching method is applied to the entire input kana character string.

【０００３】[0003]

【発明が解決しようとする課題】しかしながら、こうし
た従来技術によれば、仮名漢字変換装置において、仮名
漢字変換を行いたい総ての仮名文字列を入力した後、変
換キー押下又はそれに同等の変換指示により、初めて変
換行為が行われる為、いちいち変換指示操作を行なわね
ばならないという問題があった。しかも、仮名漢字変換
のための文法解析等の処理は、それまでに入力された仮
名文字列全体に対して適用されるから、変換効率を高め
るには、変換指示を、文法解析しやすい位置で行なう必
要がある。このため、使用者は、仮名漢字変化の指示を
行なうタイミングを考えなければならず、その度に思考
の流れを中断されてしまう。一方、適当なところで変換
指示を行なったのでは、正しい変換結果を得るのは困難
であった。However, according to such a conventional technique, in the kana-kanji conversion device, after inputting all the kana character strings to be converted into kana-kanji, the conversion key is pressed or a conversion instruction equivalent thereto is input. Therefore, there is a problem that the conversion operation is performed for the first time, and therefore the conversion instruction operation must be performed one by one. Moreover, since the processing such as grammar analysis for Kana-Kanji conversion is applied to the entire Kana character string input so far, in order to improve the conversion efficiency, the conversion instruction should be placed at a position where it is easy to grammar analyze. I need to do it. Therefore, the user has to consider the timing of instructing the kana-kanji change, and the flow of thought is interrupted each time. On the other hand, it was difficult to obtain a correct conversion result if the conversion instruction was given at an appropriate place.

【０００４】本発明は、上記欠点を解決するもので、変
換操作を行なわなくても逐次自動的に変換処理を行な
い、かつ変換効率に優れた仮名漢字変換を実現する事を
目的としてなされた。The present invention has been made in order to solve the above-mentioned drawbacks, and has an object to realize a kana-kanji conversion which is performed automatically and sequentially without any conversion operation and which is excellent in conversion efficiency.

【０００５】[0005]

【課題を解決するための手段】本発明の仮名漢字変換装
置は、平仮名，片仮名，英字，数字，記号等の複数の種
類の文字要素のうち少なくとも２種類以上を入力可能な
入力手段を備え、該入力手段から入力された文字要素列
を仮名漢字混じり文に変換する仮名漢字変換装置であっ
て、前記入力手段から順次入力される前記文字要素を配
列し、未変換の文字要素列として記憶する未変換文字要
素列記憶手段と、前記入力手段により、直前に入力され
た種類の文字要素と異なる種類の文字要素が入力された
とき、該種類の異なる文字要素の位置を文法解析上の区
切りとして、未変換の文字要素列に対する文法解析，文
節分かち書き処理などの解析処理を行なう解析処理手段
と、該解析されて得られた文節について仮名漢字変換を
行なう変換手段とを備えたことを要旨とする。A kana-kanji conversion device according to the present invention comprises an input means capable of inputting at least two or more of a plurality of types of character elements such as hiragana, katakana, letters, numbers and symbols. A kana-kanji conversion device for converting a character element string input from the input means into a kana-kanji mixed sentence, in which the character elements sequentially input from the input means are arranged and stored as an unconverted character element string. When a character element of a different type from the character element of the type input immediately before is input by the unconverted character element string storage means and the input means, the position of the character element of the different type is used as a delimiter in grammar analysis. , Analysis processing means for performing analysis processing such as grammar analysis and phrase segmentation processing for the unconverted character element string, and conversion means for performing kana-kanji conversion for the clauses obtained by the analysis. And summarized in that with.

【０００６】[0006]

【作用】以上のように構成された本発明の仮名漢字変換
装置は、入力手段から入力された文字要素は、未変換文
字要素列記憶手段により、未変換の文字要素列として記
憶される。一方、既に入力された文字要素の種類と異な
る種類の文字要素が入力されたとき、解析処理手段は、
種類の異なる文字要素の位置を文法解析の区切りとし
て、未変換の文字要素列に対する文法解析，文節分かち
書き処理などの解析処理を行なう。この解析により得ら
れた文節について、変換手段が、仮名漢字変換を行な
う。In the kana-kanji conversion device of the present invention configured as described above, the character element input from the input means is stored as an unconverted character element string by the unconverted character element string storage means. On the other hand, when a character element of a type different from the type of the already input character element is input, the analysis processing means,
Using the positions of character elements of different types as grammar analysis delimiters, analysis processing such as grammar analysis and phrase segmentation processing is performed on unconverted character element strings. The conversion means performs kana-kanji conversion for the bunsetsu obtained by this analysis.

【０００７】[0007]

【実施例】以上、説明した本発明の構成・作用を一層明
らかにするために、以下本発明の好適な実施例について
説明する。The preferred embodiments of the present invention will be described below in order to further clarify the constitution and operation of the present invention described above.

【０００８】図１は、実施例として、文書作成や編集等
を行なう装置のハード的な周知の構成例を示すブロック
図である。図示するように、この装置は、ＣＰＵ２１を
中心にバス３１により相互に接続された次の各部を備え
る。ＣＰＵ２１とバス３１により相互に接続された各部
について、簡単に説明する。FIG. 1 is a block diagram showing an example of a well-known hardware structure of an apparatus for creating and editing a document as an embodiment. As shown in the figure, this device is provided with the following respective parts centered on the CPU 21 and interconnected by a bus 31. Each part mutually connected by the CPU 21 and the bus 31 will be briefly described.

【０００９】ＲＯＭ２２：仮名漢字変換プログラム等を
記憶するマスクメモリ、ＲＡＭ２３：主記憶を構成する読み出し・書き込み可能
メモリ、キーボードインタフェース２５：キーボード２４からの
キー入力を司るインタフェース、ＣＲＴＣ２７：カラーで表示可能なＣＲＴ２６への信号
出力を制御するＣＲＴコントローラ、プリンタインタフェース２９：プリンタ２８へのデータ
の出力を制御するインタフェース、また、これらの各部の他、バス３１には将来の拡張に備
えて、拡張用スロット３０が接続されている。ROM 22: Mask memory for storing kana-kanji conversion program, RAM 23: Readable / writable memory constituting main memory, Keyboard interface 25: Interface for controlling key input from keyboard 24, CRTC 27: Displayable in color A CRT controller for controlling signal output to the CRT 26, a printer interface 29: an interface for controlling output of data to the printer 28. In addition to these units, the bus 31 is provided with an expansion slot 30 for future expansion. Are connected.

【００１０】こうして構成されたハードウエアにより、
文章が入力，仮名漢字変換，編集，表示，印刷などがな
される。すなわち、キーボード２４から入力された文字
列は、ＣＰＵ２１により所定の処理がなされ、ＲＡＭ２
３の所定領域に格納され、ＣＲＴＣ２７を介してＣＲＴ
２６の画面上に表示される。With the hardware thus configured,
Text is input, Kana-Kanji conversion, editing, display, printing, etc. That is, the character string input from the keyboard 24 is subjected to predetermined processing by the CPU 21, and the RAM 2
3 is stored in a predetermined area, and a CRT is sent via the CRTC 27.
It is displayed on the screen of 26.

【００１１】次に、こうして構成されたハードウエアに
より実行される機能を図２を用いて説明する。図２は、
本発明の仮名漢字変換装置の機能をブロックにより示す
ブロック図である。図中符号１は入力部２より入力され
たデータに基づきシステム全体の制御を行う中央処理装
置（ＣＰＵ２１）である。このＣＰＵにより装置の総て
の制御がおこなわれる。Next, the function executed by the hardware thus constructed will be described with reference to FIG. Figure 2
It is a block diagram which shows the function of the kana-kanji conversion device of this invention by a block. Reference numeral 1 in the figure denotes a central processing unit (CPU 21) that controls the entire system based on the data input from the input unit 2. The CPU controls all of the apparatus.

【００１２】入力部２で入力された仮名文字は、仮名漢
字変換装置７に送られ仮名漢字変換が行われ、文節候補
を解析結果表示部３により表示しながら、文節候補選択
部４により文節単位に文節候補を表示選択し、文節候補
確定部５により所望の文節候補を確定するか、文節候補
内部確定部１９により自動的に確定され、仮名漢字文字
列は文章として、文書格納部６に格納される。The kana characters input by the input unit 2 are sent to the kana-kanji conversion device 7 where kana-kanji conversion is performed, and while the bunsetsu candidates are displayed on the analysis result display unit 3, the bunsetsu candidate selecting unit 4 selects the bunsetsu unit. The bunsetsu candidate is selected and displayed, and the bunsetsu candidate deciding unit 5 decides a desired bunsetsu candidate or is automatically decided by the bunsetsu candidate internal confirming unit 19, and the kana-kanji character string is stored as a sentence in the document storing unit 6. To be done.

【００１３】仮名漢字変換装置７内では、入力文字獲得
部８により、仮名漢字、英数字、記号あるいは制御文字
等が受け取られ、これに基づき入力文字格納修正部９に
より入力文字が格納・修正され、格納される文字は、１
文字単位に文法解析部１５におくられ文法解析がなされ
る。また、入力文字が修正されたならば、その旨も文法
解析部１５に送られ必要に応じて、文法再解析がなされ
る。In the kana-kanji conversion device 7, the kana kanji, alphanumeric characters, symbols, control characters, etc. are received by the input character acquisition part 8, and the input characters are stored / corrected by the input character storage / correction part 9 based on these. , The stored character is 1
The characters are sent to the grammar analysis unit 15 on a character-by-character basis for grammar analysis. Further, when the input character is corrected, the fact is also sent to the grammar analysis unit 15 and the grammar is reanalyzed if necessary.

【００１４】この文法解析部１５における解析結果に基
づき単語辞書検索指示部１０が、必要に応じて自立語辞
書検索部１１により自立語辞書１２を検索し、該情報を
文法解析部１５におくる。文法解析部１５は、この自立
語内容に基づき付属語辞書検索部１６により付属語辞書
１７を検索し、自立語と付属語の結合検定ならびに文節
分かち書き処理を行い、文節候補格納部１４に解析結果
を格納し、解析結果出力部２０により文節分かち書き状
態で出力される。The word dictionary search instructing section 10 searches the independent word dictionary 12 by the independent word dictionary searching section 11 as needed based on the analysis result of the grammar analyzing section 15, and sends the information to the grammar analyzing section 15. The grammar analysis unit 15 searches the adjunct word dictionary 17 by the adjunct word dictionary search unit 16 based on the content of the independence word, performs the join test of the independence word and the adjunct word, and the phrase segmentation and writing process, and stores the analysis result in the phrase candidate storage unit 14. Is stored, and the analysis result output unit 20 outputs it in a phrase segmented state.

【００１５】また、ここで文法解析部１５の解析結果に
基づき文節候補内部確定部１９により、文節候補選択部
４よりの前文節・次文節候補の出力要求あるいは入力部
２よりの入力仮名文字列の修正または文節候補確定部５
よりの強制確定が無い限り、必要により文節候補が内部
確定される。内部確定された文節候補は、文節候補格納
部１４より削除されると共に、それに対応する仮名文字
列は、入力文字格納修正部９より削除される。また内部
確定された文節情報は単語学習部２２により、自立語使
用頻度等が自立語辞書に学習される。Further, based on the analysis result of the grammar analysis unit 15, the phrase candidate internal confirmation unit 19 requests the phrase candidate selection unit 4 to output the previous phrase / next phrase candidate or the input kana character string from the input unit 2. Correction or phrase candidate confirmation part 5
Unless there is a forced confirmation, the clause candidates will be internally confirmed if necessary. The internally fixed phrase candidate is deleted from the phrase candidate storage unit 14, and the corresponding kana character string is deleted from the input character storage correction unit 9. The word learning unit 22 learns the independent word usage frequency of the internally determined phrase information in the independent word dictionary.

【００１６】また、解析結果表示部３に文節分かち書き
状態で表示された文節候補に対しては、文節候補選択部
４により、前候補・次候補を呼び出し、所望の文節候補
を得る為、文節候補出力指示部１３により文節候補格納
部１４より、必要な文節候補を解析結果出力部２０が、
読み出し解析結果表示部３に出力する。この時文節候補
格納部１４に所望の文節が無く、さらに文節候補が要求
された場合は、文節再解析部１８により、自立語辞書１
２と付属語辞書１７を検索し、文法解析部１５とは違う
方法で、文法解析がなされ文節候補格納部１４に、新た
な文節候補が格納されるとともに、解析結果出力部２０
により出力される。For the phrase candidates displayed in the analysis result display unit 3 in the phrase segmented state, the phrase candidate selection unit 4 calls the previous candidate and the next candidate to obtain a desired phrase candidate. The analysis result output unit 20 outputs necessary phrase candidates from the phrase candidate storage unit 14 by the output instruction unit 13.
The read analysis result is output to the display unit 3. At this time, if there is no desired phrase in the phrase candidate storage unit 14 and a phrase candidate is requested, the phrase reanalysis unit 18 causes the independent word dictionary 1
2 and the adjunct word dictionary 17 are searched, and a grammatical analysis is performed by a method different from that of the grammar analysis unit 15 to store a new phrase candidate in the phrase candidate storage unit 14 and an analysis result output unit 20.
Is output by.

【００１７】また、解析結果表示部３に表示された文節
候補群に対して、文節候補確定部５により、強制的に確
定がなされた場合は、確定文節情報受取部２１により、
確定された文節の情報が受け取られ、単語学習部２２に
より、自立語使用頻度等が自立語辞書１２に学習され
る。When the phrase candidate confirming unit 5 forcibly confirms the phrase candidate group displayed on the analysis result display unit 3, the confirmed phrase information receiving unit 21
The information of the decided phrase is received, and the word learning unit 22 learns the independent word usage frequency and the like in the independent word dictionary 12.

【００１８】図３は、本発明の仮名漢字変換装置の動作
を説明するフローチャートである。前記した通り、この
仮名漢字変換装置は、７０１でキーボード２４の操作に
従って文字要素を受け取り、７０２によりこれが、いか
なる種類の文字であるか検定する。ここで、入力文字
が、変換を指示する種類のものであれば、７０３により
入力済みの全仮名文字列に対して、変換を行い、総てを
未確定文節として処理を終了する。本実施例の仮名漢字
変換装置は、後述するように文字種類の変化を契機とし
て自動的に変換処理に入る機能を有するが、同一種類の
文字要素、例えば平仮名ばかりの入力が続く場合、使用
者の意思で変換処理を行なわせたい場合も存在するの
で、変換を指示する入力文字（例えば、スペースキー
等）により、全入力文字の変換に入ることができるもの
としている。FIG. 3 is a flow chart for explaining the operation of the kana-kanji conversion device of the present invention. As described above, this kana-kanji conversion device receives a character element according to the operation of the keyboard 24 at 701, and verifies at 702 what kind of character this is. Here, if the input character is of a type instructing conversion, conversion is performed on all the kana character strings that have been input by 703, and the processing ends with all undetermined phrases. The kana-kanji conversion device according to the present embodiment has a function of automatically entering a conversion process when a character type changes, as will be described later. However, when inputting only the same type of character elements, for example, hiragana characters, the user There is also a case in which the user wants to perform the conversion process with the intention of, so that the conversion of all the input characters can be started by the input character (for example, the space key) instructing the conversion.

【００１９】また、入力文字が、英字・数字・記号・片
仮名の場合は、それの前の文字が平仮名であれば、７０
４により未確定平仮名文字列全部を変換し、７０５で、
入力された平仮名以外の一文字で文節を形成する。既に
入力されている文字が平仮名以外の場合は、今回入力さ
れた文字が直前の文字と同一文字種であれば、該文字を
前文字の文節に加えた文節を形成し（図示せず）、そう
でなければ７０５により１文字で新たな文節を形成し、
７０５で作成した文節については未確定文節とし、７０
４での変換結果と併せて７１５により解析結果を出力す
る。その後、７０１へ戻り処理を繰り返す。If the input character is an alphabetic character, numeral, symbol, or katakana, if the character before it is hiragana, then 70
Convert all undetermined hiragana character strings by 4, and at 705,
A phrase is formed by one character other than the entered hiragana. If the already entered character is something other than hiragana, if the character entered this time is of the same character type as the immediately preceding character, then that character is added to the preceding character's phrase to form a phrase (not shown). Otherwise, 705 forms a new phrase with one character,
The clause created in 705 is an undetermined clause,
The analysis result is output by 715 together with the conversion result in 4. After that, the process returns to 701 and the processing is repeated.

【００２０】また、入力文字が入力文字修正用の編集制
御文字であった場合は、７０６により入力文字を修正
し、７０７により、対象入力文字を再度文法解析する。If the input character is an edit control character for correcting the input character, the input character is corrected by 706 and the target input character is grammatically analyzed again by 707.

【００２１】また、入力文字が平仮名文字であれば、こ
れに対して７０８により付属語連接処理がなされ、７０
９により自立語の検索が必要であれば７１０により自立
語辞書が検索され、さらに７１１により文法解析がなさ
れる。If the input character is a hiragana character, an adjunct word concatenation process is performed on the input character by 708.
If it is necessary to search for an independent word by 9, the independent word dictionary is searched by 710, and further grammar analysis is performed by 711.

【００２２】７０７，７１１により解析された結果は、
７１２により確定できる文節があるか検定され、あれば
７１３により文節確定がなされ、７１４により確定文節
が出力され、残りの解析結果は７１５により出力される
が、なければ解析結果は７１５で出力され、７０１へ戻
り処理を繰り返す。The results analyzed by 707 and 711 are
If there is a clause that can be confirmed by 712, if there is a clause that can be confirmed, the clause is confirmed by 713, the confirmed clause is output by 714, and the remaining analysis result is output by 715, but if not, the analysis result is output by 715. The process returns to step 701 and the processing is repeated.

【００２３】図４ないし図８は、本発明の仮名漢字変換
装置の動作を例を似て説明する図である。ここでは、
「くるまではこをはこぶおとうさん。」という仮名文字
列および句点が連続して入力された場合を例として、説
明する。4 to 8 are diagrams for explaining the operation of the kana-kanji conversion device of the present invention similarly. here,
An example will be described in which a kana character string and punctuations such as "Okusan ha kobu otou san." Are input in succession.

【００２４】図４（ａ）は、第１の文字が入力されると
共に、自立語辞書が第１の文字を始めとする位置を見出
しの先頭として読み（図中記号「ｒ」は自立語辞書を検
索する為の見出しの先頭位置を示す。以下略）、第２〜
第５の文字に対しても、先の「ｒ」を基準として自立語
と照合すると共に付属語を連接した状態で図示の様な文
節候補が入力桁単位に得られたものである。ここで、候
補は常に文節単位で作成される為、「ｅ」の記号は文節
末を示す。また、この状態において、候補の見出し最終
桁と入力最終桁の差が３文字以上の候補は、次の連接が
無いものとして切り捨てられる（図中記号「※」以下
略）。In FIG. 4 (a), the first character is input, and the independent word dictionary reads the position beginning with the first character as the head of the heading (the symbol "r" in the figure indicates the independent word dictionary). Indicates the start position of the heading for searching. (Omitted below), 2nd
Also for the fifth character, the bunsetsu candidate as shown in the figure is obtained for each input digit in a state in which it is collated with the independent word on the basis of "r" and the adjunct is concatenated. Here, since the candidates are always created in phrase units, the symbol "e" indicates the end of the phrase. Further, in this state, a candidate in which the difference between the last digit of the heading and the last digit of the input of the candidate is 3 characters or more is rounded down as no next concatenation (the symbol "*" in the figure is omitted).

【００２５】図４（ｂ）の状態は、第６の文字が入力さ
れた状態であり、ここでは付属語の連接により第１の文
節候補群が第６文字までの長さの物を切り出せない。そ
こで、有効な文節に対して最小の桁の文節の最終桁の次
の桁を次の自立語辞書を読む見出し先頭桁とし、自立語
を読む（以降自立語解析位置算定読み出しという）。こ
こでは、「出」、「出は」が切り出されるが、文節数最
小法の原則により両者とも文節末位置が同一の第１の文
節がある為切り捨てられる（図中記号「ｃ」以下略）。
また、「車」は次の自立語と連接する可能性が無い為、
切り捨てられる（図中記号「ｎ」以下略）。The state of FIG. 4 (b) is a state in which the sixth character has been input, and here the first phrase candidate group cannot cut out objects up to the sixth character due to the concatenation of adjunct words. . Therefore, the next digit after the last digit of the phrase having the smallest digit with respect to the valid phrase is set as the heading first digit for reading the next independent word dictionary, and the independent word is read (hereinafter referred to as independent word analysis position calculation reading). Here, “out” and “out” are cut out, but due to the principle of the minimum number of bunsetsu, there is a first bunsetsu having the same bunsetsu end position, so it is cut off (symbol “c” in the figure is omitted). .
Also, "car" has no possibility of connecting with the next independent word,
It is rounded down (abbreviated below "n" in the figure).

【００２６】図５（ａ）の状態は第７の文字が入力され
た状態であり、自立語解析位置算定読み出しにより第５
の文字より自立語が検索される。ここで、第１の文節群
のうち第２の文節群と連接したものは、暫定文節候補と
し、漢字文字列として出力され、そうでないものは仮名
文字列として出力される（図中記号「ｋ」以下略）。以
降、暫定文節候補は、自立語を読む為の位置算定には使
用されない。The state of FIG. 5 (a) is a state in which the seventh character has been input, and the fifth character is obtained by reading the independent word analysis position calculation.
Independent words are searched for from the letters. Here, of the first bunsetsu group, those that are connected to the second bunsetsu group are output as Kanji character strings as temporary bunsetsu candidates, and those that are not are output as kana character strings (symbol “k” in the figure). "Hereafter omitted). After that, the provisional phrase candidates are not used for position calculation for reading independent words.

【００２７】図５（ｂ）の状態は、自立語解析位置算定
読み出しを行った状態であり、図６（ａ）の状態は自立
語解析位置算定読み出しを行った状態であり、ここで第
３文節まで切り出された為、３文節の連接を行う組み合
わせに対して、文節のコスト計算等優先順位を計算し、
第１の文節群を暫定確定する。暫定確定された文節を暫
定確定文節（図中記号「○」以下略）とし、以降暫定確
定文節として他と区別して表示し、これに対する前候補
・次候補処理を可能とする。この処理により優先順位の
低いものは切り捨てられる（図中記号「▽」以下略）。The state of FIG. 5 (b) is the state in which the independent word analysis position calculation read is performed, and the state of FIG. 6 (a) is the state in which the independent word analysis position calculation read is performed, which is the third here. Since the bunsetsu has been cut out, the priority of bunsetsu cost calculation etc. is calculated for the combination that connects 3 bunsetsu,
The first clause group is provisionally confirmed. The provisionally determined phrase is designated as a provisionally determined phrase (symbols in the figure below "○" are omitted) and displayed as a provisionally determined phrase thereafter separately from the others, and the previous candidate / next candidate process for this is enabled. By this processing, the one with the lower priority is discarded (the symbol "▽" in the figure is omitted).

【００２８】図６（ｂ）の状態は、暫定確定後に自立語
解析位置算定読み出しを行った状態である。図７の状態
は、第２の文節群に対して、文節の暫定確定がなされた
状態であり、第２の文節群が暫定確定された状態で、第
１の文節群は完全に確定されたものとみなし、文節候補
の現在表示候補を自動的に確定文節として（図中記号
「◎」以下略）出力し、以降仮名漢字変換処理の対象外
とする。また、この処理により、文節群の末表示候補は
切り捨てられる（図中記号「★」以下略）。The state of FIG. 6B is a state in which the independent word analysis position calculation read-out is performed after the provisional decision. The state of FIG. 7 is a state in which the bunsetsu is provisionally determined for the second bunsetsu group, the second bunsetsu group is provisionally determined, and the first bunsetsu group is completely determined. The current display candidate of the bunsetsu candidate is automatically output as a fixed bunsetsu (abbreviated below the symbol “⊚” in the figure), and is thereafter excluded from the kana-kanji conversion processing. In addition, by this processing, the end display candidate of the bunsetsu group is truncated (the symbol "★" in the figure is omitted).

【００２９】図８の状態は、入力文字列の種類が変わっ
た場合を示し、ここでは、平仮名の入力の後に句
点「。」が入力された場合を示している。同様の処理
は、変換キーが押下されるか、もしくは平仮名文字列に
続いて英数字などが入力された場合にも行なわれる。こ
の場合は、異なる種類の文字が入力された位置を文節の
区切り位置とみなして上述した文法解析などの処理が行
なわれる。尚、この場合でも、文節候補は確定出力され
ず、文節候補選択状態（前文節・次文節選択可能状態、
図中記号「☆」）であり、これらは確定処理されると確
定文節として出力される。The state of FIG. 8 shows the case where the type of the input character string has changed, and here the case where the punctuation mark "." Is input after the input of the hiragana. Similar processing is performed when the conversion key is pressed or when an alphanumeric character or the like is input following the hiragana character string. In this case, the position where different types of characters are input is regarded as the segment delimiter position, and the above-described grammar analysis and the like are performed. Even in this case, the phrase candidates are not definitely output, and the phrase candidate selection state (previous phrase / next phrase selectable state,
The symbol "☆" in the figure) is output as a definite clause when the definite processing is performed.

【００３０】以上説明した本実施例の仮名漢字変換装置
では、英字，数字，記号（句読点などを含む），片仮名
など、平仮名と異なる文字が入力されると、これらの文
字を区切りとして文節分かち書き処理がなされるので、
文法解析を効率よく行なうことができる。しかも、漢字
変換のための操作を必要としないので、日本語文章入力
時の思考を中断することがなく作業効率を高くすること
ができる。また、所定の仮名文字入力後には、文節を順
次確定していくので、文法解析および仮名漢字変換の対
象を少なくすることができ、処理を迅速に行なうことが
できる。さらに、操作者の意思による変換操作も可能と
したので、変換困難な仮名文字の漢字変換にも対応でき
る。In the kana-kanji conversion device of the present embodiment described above, when characters different from hiragana, such as letters, numbers, symbols (including punctuation marks), katakana, etc., are input, these characters are used as delimiters for segmentation and segmentation processing. Is done,
Grammar analysis can be performed efficiently. Moreover, since the operation for converting the kanji is not required, the work efficiency can be improved without interrupting the thinking when inputting the Japanese sentence. In addition, since the clauses are sequentially determined after the input of a predetermined kana character, the objects of grammar analysis and kana-kanji conversion can be reduced, and the processing can be performed quickly. Further, since the conversion operation can be performed by the operator's intention, it is possible to correspond to the Kanji conversion of a difficult kana character.

【００３１】以上本発明の実施例について説明したが、
本発明はこうした実施例に何等限定されるものではな
く、本発明の要旨を逸脱しない範囲内において、種々な
る態様で実施し得ることは勿論である。The embodiment of the present invention has been described above.
The present invention is not limited to these examples, and it goes without saying that the present invention can be implemented in various modes without departing from the scope of the present invention.

【００３２】[0032]

【発明の効果】本発明の仮名漢字変換装置では、既に入
力された文字要素列と種類の異なる文字要素が入力され
ると、この種類の異なる文字要素を文法解析の区切りと
して、文法解析や文節分かち書き処理などの仮名漢字変
換に必要な処理を行なうので、文法解析の効率を高め、
変換処理を正確かつ迅速に行なうことができる。しか
も、特別な変換操作を必要としないので、日本語文章の
入力時に、操作者の思考を中断することがなく、作業効
率を高くすることができる。In the kana-kanji conversion device of the present invention, when a character element of a different type from the already input character element string is input, the different type of character element is used as a delimiter for grammatical analysis and grammar analysis or clause. The process required for Kana-to-Kanji conversion, such as the segmentation process, is performed, improving the efficiency of grammar analysis.
The conversion process can be performed accurately and quickly. Moreover, since no special conversion operation is required, it is possible to improve work efficiency without interrupting the thinking of the operator when inputting a Japanese sentence.

[Brief description of drawings]

【図１】本発明の一実施例である仮名漢字変換装置のハ
ードウエア構成の概略を示すブロック図である。FIG. 1 is a block diagram showing an outline of a hardware configuration of a kana-kanji conversion device according to an embodiment of the present invention.

【図２】実施例の仮名漢字変換装置の機能構成を例示す
るブロック図である。FIG. 2 is a block diagram illustrating a functional configuration of a kana-kanji conversion device according to an embodiment.

【図３】実施例の仮名漢字変換装置の動作を例示するフ
ローチャートである。FIG. 3 is a flowchart illustrating an operation of the kana-kanji conversion device according to the embodiment.

【図４】実施例の仮名漢字変換装置の処理過程を例示す
る説明図である。FIG. 4 is an explanatory diagram illustrating the processing steps of the kana-kanji conversion device according to the embodiment.

【図５】同じく、実施例の仮名漢字変換装置の処理過程
を例示する説明図である。FIG. 5 is an explanatory diagram illustrating a process of the kana-kanji conversion device according to the embodiment.

【図６】同じく、実施例の仮名漢字変換装置の処理過程
を例示する説明図である。FIG. 6 is an explanatory diagram illustrating the process of the kana-kanji conversion device according to the embodiment.

【図７】同じく、実施例の仮名漢字変換装置の処理過程
を例示する説明図である。FIG. 7 is also an explanatory diagram illustrating a process of the kana-kanji conversion device according to the embodiment.

【図８】同じく、実施例の仮名漢字変換装置の処理過程
を例示する説明図である。FIG. 8 is an explanatory diagram illustrating a process of the kana-kanji conversion device according to the embodiment.

【符号の説明】２１…ＣＰＵ２２…ＲＯＭ２３…ＲＡＭ２４…キーボード２５…キーボードインタフェース２６…ＣＲＴ２７…ＣＲＴＣ２８…プロンタ２９…プリンタインタフェース３０…拡張用スロット[Explanation of Codes] 21 ... CPU 22 ... ROM 23 ... RAM 24 ... Keyboard 25 ... Keyboard Interface 26 ... CRT 27 ... CRTC 28 ... Pronter 29 ... Printer Interface 30 ... Expansion Slot

Claims

[Claims]

1. An input means capable of inputting at least two or more kinds of a plurality of kinds of character elements such as hiragana, katakana, letters, numbers and symbols, and a character element string inputted from the input means is kana kanji. A kana-kanji conversion device for converting a mixed sentence, wherein the character elements sequentially input from the input means are arranged and stored as unconverted character element string storage means, and the input means , When a character element of a different type from the character element of the last input is input, the position of the character element of a different type is used as a delimiter in the grammatical analysis, and the grammar analysis for the unconverted character element string and the phrase segmentation are performed. A kana-kanji conversion device comprising analysis processing means for performing analysis processing such as processing, and conversion means for performing kana-kanji conversion for the phrase obtained by the analysis.