JPH06314274A

JPH06314274A - Document preparing device and document information input method

Info

Publication number: JPH06314274A
Application number: JP5102032A
Authority: JP
Inventors: Takeshi Inoue; 健井上; Yukihiro Fukunaga; 幸弘福永
Original assignee: Toshiba Corp; Toshiba AVE Co Ltd
Current assignee: Toshiba Corp; Toshiba AVE Co Ltd
Priority date: 1993-04-28
Filing date: 1993-04-28
Publication date: 1994-11-08

Abstract

PURPOSE: To improve the conversion rate when a character string newly entered into a document read from a storage device undergoes kanji-mixed kana-kanji conversion. CONSTITUTION: When an input control unit 3 reads document data from the storage device 2, a kanji-mixed kana-kanji conversion unit 6 temporarily stores the read kanji-mixed kana character string in a conversion buffer 7 and then executes kanji-mixed kana-kanji conversion. A conversion candidate control unit 10 searches the conversion candidates obtained by the conversion for the candidate whose notation is the same as the character string held in the conversion buffer 7, stores it in a document buffer 12, and at the same time transfers the word attribute information of that candidate to a word attribute buffer 13. Because the word attribute buffer 13 thus obtains word attribute information when document data is read from the storage device 2, that information can be referred to when a newly entered character string is subsequently converted, which improves the conversion efficiency.

Description

Detailed Description of the Invention

[0001]

[Industrial Field of Application] The present invention relates to a document creation apparatus in which a kanji-mixed kana character string is entered using pen input means to create text containing kanji. In particular, it relates to processing that applies kanji-mixed kana-kanji conversion to a document already saved in an external storage device or the like when that document is read in, obtains its word attribute information, and thereby improves the efficiency of kanji-mixed kana-kanji conversion during proofreading and editing of the document.

[0002]

[Prior Art] Conventionally, when character input and the accompanying conversion processing are performed continuously on a document creation apparatus, an input word can be converted accurately by using word attribute information such as the part of speech of the immediately preceding word and word boundaries. In addition, an apparatus that keeps in memory the information on the words the operator selected during document creation can convert accurately by using the attribute information of the immediately preceding or following word, even when a word is inserted at an arbitrary position in a document already created. However, when a document created continuously in this way is saved to a file or the like and then read out again for further writing or editing, the document saved in the file is limited to character information in JIS, Shift JIS, or ASCII, even on the same model of apparatus, except in special cases where full compatibility is maintained. Consequently, the word attribute information collected when the document was originally created, such as parts of speech and word boundaries, cannot be carried over when the document is read from the file, and the conversion rate deteriorates when new words are entered into such a document.

[0003]

[Problem to Be Solved by the Invention] In a conventional document creation apparatus, when documents are exchanged via files between machines of the same model or of different models, only standardized character codes such as JIS can be saved in the file, so the word attribute information acquired separately during kanji-mixed kana-kanji conversion cannot be exchanged via the file. Therefore, when a document is read from a file saved as character codes only, on the same or a different model, and editing is resumed, the word attribute information collected before the file was saved, such as word boundaries and the part of speech of each word, has been lost. Thus, for example, when a character string is inserted into the read document and converted to kanji, no attribute information on the immediately preceding or following word is available. The conversion result is then the same as for input at the beginning of a sentence, and the conversion rate is worse than when words are entered and converted continuously on screen.

[0004]

Accordingly, an object of the present invention is to eliminate the above drawback and to provide a document creation apparatus and a document information input method which, when a document read from a file is edited, acquire word attribute information at the time the document is read and use that information to improve the conversion rate when a character string newly entered into the read document undergoes kanji-mixed kana-kanji conversion.

[0005]

[Means for Solving the Problem] The present invention is a document creation apparatus that has online handwritten character recognition means, takes the recognized kanji-mixed kana character information as input character information, and subjects it to kanji-mixed kana-kanji conversion with reference to dictionary information and word attribute information. The apparatus comprises: document reading means for reading existing document information from an external storage device or a communication device; conversion means for applying kanji-mixed kana-kanji conversion to the document information read by the document reading means; conversion candidate selection means for selecting, from the conversion candidates obtained from the conversion means, the conversion candidate whose notation is identical to the existing document information read by the document reading means, and storing it in a document buffer; and acquisition means for acquiring the word attribute information of the conversion candidate selected by the conversion candidate selection means.

[0006]

[Operation] In the document creation apparatus of the present invention, the input means reads existing document data from an external storage device or a communication device. The conversion means applies kanji-mixed kana-kanji conversion to the document data read by the input means. The conversion candidate selection means selects, from the conversion candidates obtained from the conversion means, the conversion candidate whose notation is identical to the document data read by the input means, and stores it in the document buffer. The acquisition means acquires the word attribute information of the conversion candidate selected by the conversion candidate selection means. Consequently, when new text is entered into the read document data, kana-kanji conversion can be performed with reference to the word attribute information acquired by the acquisition means, which improves the conversion rate.

[0007]

[Embodiment] An embodiment of the present invention is described below with reference to the drawings. FIG. 1 is a block diagram showing an embodiment of the document creation apparatus of the present invention. Reference numeral 1 denotes an input device, such as a transparent handwriting tablet, that performs online handwritten character recognition when the user writes a character string with a pen and outputs the corresponding character codes; it is laid over the display device 16 described later so that the displayed information remains visible. Reference numeral 2 denotes a storage device, such as a floppy disk, on which document data can be saved and recalled; a communication device capable of receiving document data from outside may be used instead. Reference numeral 3 denotes an input control unit that routes input character strings, setting data, and the like to the parts that need them; 4 is an input buffer that temporarily holds input character strings and the like; 5 is a format control unit that sets the format of the input document; 6 is a kanji-mixed kana-kanji conversion unit that converts an input kanji-mixed kana character string; 7 is a conversion buffer that temporarily holds the kanji-mixed kana character string to be converted; 8 is a word attribute acquisition unit that acquires word attribute information such as the immediately preceding or following word and the inter-word connection relations of parts of speech; 9 is a dictionary for kanji-mixed kana-kanji conversion; 10 is a conversion candidate control unit that adjusts, among other things, the output priority of the conversion candidates obtained by the conversion; 11 is a conversion candidate buffer that temporarily holds the conversion candidates obtained by the conversion; 12 is a document buffer that holds the character codes confirmed after conversion; 13 is a word attribute buffer that holds the word attribute information obtained during conversion; 14 is a display control unit that displays on the display device 16 the display data expanded in the display buffer 15; 15 is a display buffer in which the display data to be shown on the display device 16 is expanded; and 16 is a display device, such as a CRT or LCD, that displays documents, figures, and the like. The display device 16 is integrated with the input device 1, with one laid over the other.

[0008]

Next, the operation of this embodiment is described. Normally, the operator writes a kanji-mixed kana character string with a pen or the like on the input device 1, such as a handwriting tablet; an online handwritten character recognition unit (not shown) recognizes the characters, generates a recognized character string, and inputs it to the apparatus as a document. Kanji-mixed kana input and conversion is an important input and conversion means in a document creation apparatus having pen input means: kanji with few strokes can be entered as kanji, while complex kanji with many strokes are entered by their readings and then converted by kana-kanji conversion. In terms of input and conversion speed, its character recognition and conversion efficiency are higher than those of document creation on an apparatus with conventional pen input means (kana character input and conversion only). During this work the operator proceeds interactively, referring to the various messages shown on the display device 16.
In some cases, however, a document is read from the storage device 2, such as a floppy disk, and additional input is made to that document. Document data composed of character codes input from the storage device 2 passes through the input control unit 3, is temporarily held in the input buffer 4 while, for example, waiting for conversion processing, and is then sent from the input control unit 3 to the kanji-mixed kana-kanji conversion unit 6.
The operation up to this point is the same for character codes entered from the input device 1. The kanji-mixed kana-kanji conversion unit 6 stores the input character string to be converted in the conversion buffer 7 until it reaches a suitable length forming a word, phrase, or the like.
When the character string stored in the conversion buffer 7 reaches the length delimited by detection of a punctuation character code, that is, the length of a conversion target, the kanji-mixed kana-kanji conversion unit 6 starts kanji-mixed kana-kanji conversion of the target character string while referring to the kanji-mixed kana-kanji conversion dictionary 9 and the word attribute information of the word attribute acquisition unit 8.
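
As an illustration only, the buffering described above (characters accumulate in the conversion buffer 7 until a punctuation code marks the end of a conversion target) might be sketched in Python as follows. The function name `conversion_targets` and the delimiter set `PUNCTUATION` are assumptions made for this sketch; the specification does not state which punctuation codes are used.

```python
# Assumed delimiter set; the specification only says "punctuation character code".
PUNCTUATION = {"、", "。"}


def conversion_targets(chars):
    """Yield conversion targets: runs of characters that end at punctuation."""
    buffer = []  # plays the role of the conversion buffer 7
    for ch in chars:
        buffer.append(ch)
        if ch in PUNCTUATION:
            # A delimiter was detected, so the buffered run becomes a target.
            yield "".join(buffer)
            buffer.clear()
    if buffer:
        # Flush whatever remains when the input ends without punctuation.
        yield "".join(buffer)


if __name__ == "__main__":
    for target in conversion_targets("試合がはじまる。きょうは、はれ"):
        print(target)
```

The same generator would serve both input paths, since the text states that characters from the storage device 2 and from the input device 1 are handled identically up to this point.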

[0009]

Here, if there are non-blank characters immediately before or after the insertion position of the characters to be converted, the word attribute acquisition unit 8 acquires from the word attribute buffer 13 the part of speech and inter-word connection relation of the immediately preceding or following word. How information is entered into the word attribute buffer 13 is described later. The kanji-mixed kana-kanji conversion unit 6 uses the word attribute information acquired by the word attribute acquisition unit 8 as follows. For example, if the character string to be converted is "は" (ha) and the immediately preceding part of speech is a noun, it gives priority in conversion to the particle "は" over nouns such as "歯" (tooth) or "葉" (leaf); conversely, if the preceding position is blank or the text is broken by punctuation, it gives priority to the noun candidates. When conversion finishes, the kanji-mixed kana-kanji conversion unit 6 sends the conversion candidates, together with word attribute information such as output priority and part of speech, to the conversion candidate control unit 10. The conversion candidate control unit 10 temporarily stores the received conversion candidates and other information in the conversion candidate buffer 11. If the input came from the normal input device 1, the conversion candidate control unit 10 displays the conversion candidates on the display device 16 via the display control unit 14, according to the output priority of the candidates stored in the conversion candidate buffer 11, and prompts the operator to select a conversion candidate.
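
The "は" example can be made concrete with a small Python sketch. It is only an approximation under stated assumptions: the `Candidate` class, the part-of-speech labels, and the two-level sort key are hypothetical, and the actual apparatus would consult the connection relations held in the dictionary 9 rather than a fixed rule.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    surface: str   # notation of the candidate
    pos: str       # part of speech (labels are assumptions of this sketch)
    priority: int  # dictionary priority, lower is shown first


def rank_candidates(candidates, prev_pos):
    """Reorder candidates using the part of speech of the preceding word.

    prev_pos is None when the preceding position is blank or punctuation.
    """
    def key(c):
        if prev_pos == "noun":
            boost = 0 if c.pos == "particle" else 1  # prefer particles after a noun
        elif prev_pos is None:
            boost = 0 if c.pos == "noun" else 1      # prefer nouns at a sentence start
        else:
            boost = 0                                # no contextual preference
        return (boost, c.priority)

    return sorted(candidates, key=key)


if __name__ == "__main__":
    ha = [
        Candidate("歯", "noun", 1),
        Candidate("葉", "noun", 2),
        Candidate("は", "particle", 3),
    ]
    print([c.surface for c in rank_candidates(ha, "noun")])
    print([c.surface for c in rank_candidates(ha, None)])
```

When the attribute of the preceding word is unavailable, as for a document reloaded without word attribute information, the second call reflects the sentence-initial behaviour that the invention aims to avoid.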

[0010]

If, as a result, the operator operates the input device 1 to make a selection, the conversion candidate control unit 10 stores the character codes of the selected conversion candidate in the document buffer 12. If the operator starts entering the next characters without selecting a candidate, the first conversion candidate is regarded as selected, and its character codes are stored in the document buffer 12. The conversion candidate control unit 10 also reads the word attribute information of that candidate from the conversion candidate buffer 11 and transfers it, together with the candidate's storage position in the document buffer 12, to the word attribute buffer 13. When document data is read from the storage device 2, on the other hand, the conversion candidate control unit 10 selects, from the candidates in the conversion candidate buffer 11, the candidate that has the same notation as the input character string held in the conversion buffer 7 and, among those, the highest output priority. It stores the character codes of this candidate in the document buffer 12 and stores the word attribute information of the selected candidate, such as its part of speech, in the word attribute buffer 13. The display control unit 14 expands the character codes in the document buffer 12, laid out according to the format information from the format control unit 5, into the display buffer 15 as a bitmap corresponding to the display screen of the display device 16. Similarly, while a conversion candidate is being selected, the display control unit 14 expands a candidate selection window into the display buffer 15 in the same way. The display control unit 14 always displays the display data expanded in the display buffer 15 on the screen of the display device 16.

[0011]

FIG. 2 is a flowchart showing the operation of reading a document from the storage device 2 into the document buffer 12 in the document creation apparatus shown in FIG. 1. When one character of the document file is read from the storage device 2 in step 201, the character data is sent to the input control unit 3 in step 202 and stored from the input control unit 3, via the kanji-mixed kana-kanji conversion unit 6, in the conversion buffer 7. In step 203 the kanji-mixed kana-kanji conversion unit 6 monitors the input character string in the conversion buffer 7 and starts conversion once it reaches a suitable length such as a word or phrase.
In step 204 the word attribute acquisition unit 8 acquires from the word attribute buffer 13 the word attribute information, such as the part of speech and inter-word connection information, of the word immediately preceding the character string currently being converted. In step 205 the kanji-mixed kana-kanji conversion unit 6 converts the character string in the conversion buffer 7 based on the dictionary data in the kanji-mixed kana-kanji conversion dictionary 9; in doing so, it uses the attributes of the immediately preceding word obtained from the word attribute acquisition unit 8 to decide whether candidate kanji can connect and to determine their priority. In step 206 the kanji-mixed kana-kanji conversion unit 6 stores the conversion candidates obtained, together with their notation strings, word attributes, and priority information, in the conversion candidate buffer 11.

[0012]

In step 207 the conversion candidate control unit 10 searches the conversion candidate buffer 11 for a conversion candidate identical to the data stored in the conversion buffer 7, that is, a candidate with the same notation as the input character string. If the candidates include one, other than the unconverted candidate, that is identical to the input notation, the conversion candidate control unit 10 proceeds to step 208. There it selects the candidate with the highest output priority among those identical candidates and stores that candidate's word attributes, held in the conversion candidate buffer 11, in the word attribute buffer 13 together with the position at which the character string is to be stored in the document buffer 12. At the same time, in step 210, the conversion candidate control unit 10 stores the notation string (the selected conversion candidate) in the document buffer 12. Meanwhile, if the character string to be converted is a word not registered in the dictionary, or if all connections to the surrounding words are rejected by grammatical connection checks or the like, the kanji-mixed kana-kanji conversion unit 6 places the character string in the conversion candidate buffer 11 unchanged, in its input notation, as an unconverted candidate. In this case, if the character string to be converted matches only the unconverted candidate, the conversion candidate control unit 10 selects it and, in step 209, stores in the word attribute buffer 13 an indication that analysis failed, together with the position at which the character string is to be stored in the document buffer. At the same time, the conversion candidate control unit 10 stores the notation string in the document buffer 12 in step 210.
In step 211 the display control unit 14 expands the character string stored in the document buffer 12 into the display buffer 15 as a bitmap, according to the format set by the format control unit 5, and then displays it on the screen of the display device 16. Once the screen has been updated, the kanji-mixed kana-kanji conversion unit 6 initializes the conversion buffer 7 in step 212 in preparation for the next conversion, reads the next character from the storage device 2 via the input control unit 3, and repeats the processing from step 201. If, however, the input control unit 3 determines in step 213 that the end of the file in the storage device 2 has been reached, reading of character strings from the storage device 2 ends.
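
The selection rule of steps 207 to 210 might be sketched in Python as below. This is a simplified illustration, not the apparatus itself: it assumes each conversion target is a single word, represents the unconverted candidate by `pos=None`, counts positions in characters, and uses the hypothetical names `Candidate`, `select_on_load`, and `ANALYSIS_FAILED`.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Candidate:
    surface: str         # notation of the candidate
    pos: Optional[str]   # part of speech; None marks the unconverted candidate
    priority: int        # output priority, lower is higher priority


ANALYSIS_FAILED = "analysis_failed"  # marker stored when only the unconverted candidate matches


def select_on_load(source_text, candidates, position, document_buffer, word_attr_buffer):
    """Keep the loaded notation and record the attribute of the matching candidate."""
    # Step 207: candidates with the same notation, excluding the unconverted one.
    matches = [c for c in candidates if c.surface == source_text and c.pos is not None]
    if matches:
        # Step 208: the highest-priority match supplies the word attribute.
        best = min(matches, key=lambda c: c.priority)
        word_attr_buffer[position] = best.pos
    else:
        # Step 209: record that analysis failed for this position.
        word_attr_buffer[position] = ANALYSIS_FAILED
    # Step 210: the notation read from storage goes into the document buffer unchanged.
    document_buffer.append(source_text)
    return position + len(source_text)


if __name__ == "__main__":
    doc, attrs = [], {}
    pos = select_on_load(
        "試合",
        [Candidate("試合", "noun", 1), Candidate("仕合", "noun", 2), Candidate("しあい", None, 3)],
        0, doc, attrs,
    )
    pos = select_on_load("ぬるぽ", [Candidate("ぬるぽ", None, 1)], pos, doc, attrs)
    print(doc, attrs, pos)
```

Because the stored text is always the notation read from the file, the loaded document is never altered; only the word attribute buffer gains information.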

[0013]

FIG. 3 is a flowchart showing document creation by normal character input in the document creation apparatus shown in FIG. 1. A character string entered from the input device 1 in step 301 is stored by the input control unit 3, via the kanji-mixed kana-kanji conversion unit 6, in the conversion buffer 7 in step 302. In step 303 it is determined whether the data stored in the conversion buffer 7 has become a conversion target, either through a conversion instruction from the operator or through recognition by the kanji-mixed kana-kanji conversion unit 6. If it has not, the process returns to step 301; if it has, the process proceeds to step 304. In step 304, as conversion starts, the word attribute acquisition unit 8 detects the insertion position of the character string being converted from the cursor position or the like. In step 305 it determines the corresponding position in the document buffer 12 (see FIG. 5) from the detected position and the current format, and acquires from the corresponding word attribute buffer 13 (see FIG. 6) the attribute information of the word immediately before the insertion position. In step 306 it acquires the attribute information of the word immediately after.

[0014]

In step 307 the kanji-mixed kana-kanji conversion unit 6 converts the character string in the conversion buffer 7 with reference to the kanji-mixed kana-kanji conversion dictionary 9 and the attribute information of the preceding and following words held in the word attribute acquisition unit 8. In step 308 it stores the resulting conversion candidates, together with their notation strings, word attributes, and output priorities, in the conversion candidate buffer 11 as shown in FIG. 4. Next, the conversion candidate control unit 10 performs candidate selection, which differs from that used when reading from the storage device 2: in step 309 it displays on the screen of the display device 16 the conversion candidate with the highest output priority as the provisionally selected candidate. Unless the operator gives a candidate selection instruction, the displayed conversion candidate is treated as selected. If, however, the operator issues a next-candidate instruction, a list-display instruction, or the like, the conversion candidate control unit 10 opens a selection window or the like in step 310 and waits for the operator to choose a candidate. Once a conversion candidate has been selected, in step 311 the conversion candidate control unit 10 stores the notation string of that candidate, held in the conversion candidate buffer 11, in the document buffer 12 as shown in FIG. 5, and in step 312 stores the word attributes, together with the storage position in the document buffer 12, in the word attribute buffer 13 as shown in FIG. 6. With the selected candidate's character string stored in the document buffer 12, the display control unit 14 expands it into the display buffer 15, so that in step 313 the selected character string is displayed within the document shown on the screen of the display device 16. When candidate selection is complete, the kanji-mixed kana-kanji conversion unit 6 initializes the conversion buffer 7 in step 314 in preparation for the next input.

[0015]

FIG. 4 shows an example of data stored in the conversion candidate buffer 11 shown in FIG. 1. The example shows the data when "しあいがはじまる。" (shiai ga hajimaru, "The match begins.") is entered. The value 0xff in the notation string is a separator marking the boundary between groups of homophones. If the dictionary contains no word identical to the input notation, an unconverted candidate whose part of speech is "無" (none) is added as a homophone. The operator makes selections one homophone group at a time. When "試合がはじまる。" is input by reading from the storage device 2, the kanji-mixed kana-kanji conversion unit 6 selects "試合", "が", "はじまる", and "。" regardless of their priority.

[0016] FIG. 5 shows an example of the data stored in the document buffer 12 shown in FIG. 1. The document buffer 12 stores the character codes of the document being created in order, and these character codes are expanded according to the set format when the document is printed or displayed on the screen. FIG. 6 shows an example of the data stored in the word attribute buffer 13 shown in FIG. 1. The word attribute buffer 13 stores the word attribute data of the selected candidates; the conversion candidate control unit 10 transfers this data from the conversion candidate buffer 11. The storage position of a character code in the document buffer 12 shown in FIG. 5 is given by the address in the document buffer at which the corresponding character string is stored. This address corresponds to the storage position recorded in the word attribute buffer 13 shown in FIG. 6. The word attribute acquisition unit 8 can therefore retrieve, from the word attribute buffer 13 shown in FIG. 6, the attribute information of the word containing a designated character, using the storage position in the document buffer 12 as the key. The parts of speech and connection relations stored in the word attribute buffer 13 shown in FIG. 6 are also referred to by the kanji-mixed kana-kanji conversion unit 6 during conversion, when it judges whether conversion candidates can connect and determines their priority order.
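The lookup keyed by document buffer position can be sketched as follows in Python. The record layout (`start`, `length`, `part_of_speech`), the use of a sorted list with binary search, and the sample data are assumptions made for illustration; the description only states that the storage position in the document buffer 12 serves as the key into the word attribute buffer 13.

```python
from bisect import bisect_right
from dataclasses import dataclass


@dataclass(frozen=True)
class WordAttribute:
    start: int           # address of the word's first character in the document buffer
    length: int          # number of characters in the word
    part_of_speech: str  # hypothetical label


def find_word_attribute(attributes, position):
    """Return the record of the word that covers `position`, or None.

    `attributes` must be sorted by `start`.
    """
    starts = [a.start for a in attributes]
    index = bisect_right(starts, position) - 1
    if index < 0:
        return None
    record = attributes[index]
    if position < record.start + record.length:
        return record
    return None


# Hypothetical buffer contents for the sentence used in the description.
document_buffer = "試合がはじまる。"
word_attributes = [
    WordAttribute(0, 2, "noun"),      # 試合
    WordAttribute(2, 1, "particle"),  # が
    WordAttribute(3, 4, "verb"),      # はじまる
    WordAttribute(7, 1, "symbol"),    # 。
]

# Designating the character at address 5 ("ま") finds the word はじまる.
hit = find_word_attribute(word_attributes, 5)
```

Keeping the records sorted by start address is one possible design; it lets the word containing any designated character be found without scanning the whole buffer.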

[0017] According to this embodiment, when document data is read from the storage device 2, the kanji-mixed kana-kanji conversion unit 6 performs kanji-mixed kana-kanji conversion on the entire kanji-mixed kana character string that was read. From the resulting conversion candidates, the candidates identical to the character string of the document data input from the storage device 2 are selected and stored in the document buffer 12. Word attribute information for the document data, such as word boundaries and parts of speech, can thus be secured in the word attribute buffer 13 while this conversion is performed. Consequently, when a document read from the storage device 2 is edited, and in particular when a character string is inserted, the inserted string can be converted using the word attribute information before and after the insertion position. The conversion efficiency in that case can therefore be raised to the same level as when the document data is input and edited continuously.
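The read-time selection step can be sketched as follows in Python, assuming the converter's output is already available as one list of candidates per word. The function name `select_matching`, the tuple layout, and the sample candidate groups are hypothetical; the sketch shows only the rule stated in the description, namely that the candidate written exactly like the stored text is chosen regardless of priority.

```python
def select_matching(text, groups):
    """Pick, from each homophone group, the candidate written exactly as in `text`.

    `groups` holds one list of (notation, part_of_speech) candidates per word,
    in priority order. Priority is ignored; only an exact match with the stored
    text counts. Returns (start, length, part_of_speech) records that could be
    placed in a word attribute buffer.
    """
    attributes = []
    position = 0
    for group in groups:
        for notation, part_of_speech in group:
            if text.startswith(notation, position):
                attributes.append((position, len(notation), part_of_speech))
                position += len(notation)
                break
        else:
            raise ValueError(f"no candidate matches the text at position {position}")
    if position != len(text):
        raise ValueError("candidates do not cover the whole text")
    return attributes


# Hypothetical converter output for a document read from storage.
stored_text = "試合がはじまる。"
candidate_groups = [
    [("仕合", "noun"), ("試合", "noun"), ("しあい", "none")],
    [("が", "particle")],
    [("始まる", "verb"), ("はじまる", "verb")],
    [("。", "symbol")],
]

word_attributes = select_matching(stored_text, candidate_groups)
```

Here 「試合」 and 「はじまる」 are selected even though they are not the first candidates in their groups, and the resulting records carry the word boundaries and parts of speech that a later insertion could consult.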

[0018]

[Effects of the Invention] As described above, according to the document creation device and the document information input method of the present invention, when a document read from a file is edited, word attribute information is acquired at the time the document is read. When a character string newly input into the read document is subjected to kanji-mixed kana-kanji conversion, this word attribute information can be used to improve the conversion rate.

[Brief description of drawings]

FIG. 1 is a block diagram showing an embodiment of the document creation device of the present invention.

FIG. 2 is a flowchart showing the operation of reading a document from the storage device into the document buffer in the document creation device shown in FIG. 1.

FIG. 3 is a flowchart showing the document creation process for normal character input in the document creation device shown in FIG. 1.

FIG. 4 is a diagram showing an example of data storage in the conversion candidate buffer shown in FIG. 1.

FIG. 5 is a diagram showing an example of data storage in the document buffer shown in FIG. 1.

FIG. 6 is a diagram showing an example of data storage in the word attribute buffer shown in FIG. 1.

[Explanation of symbols]

1 ... Input device
2 ... Storage device / communication device
3 ... Input control unit
4 ... Input buffer
5 ... Format control unit
6 ... Kanji-mixed kana-kanji conversion unit
7 ... Conversion buffer
8 ... Word attribute acquisition unit
9 ... Kanji-mixed kana-kanji conversion dictionary
10 ... Conversion candidate control unit
11 ... Conversion candidate buffer
12 ... Document buffer
13 ... Word attribute buffer
14 ... Display control unit
15 ... Display buffer
16 ... Display device

Claims

[Claims]

1. A document creation device that includes online handwritten character recognition means and that, taking recognized kanji-mixed kana character information as input character information, performs kanji-mixed kana-kanji conversion with reference to dictionary information and word attribute information, the device comprising: document reading means for reading existing document information from an external storage device or a communication device; conversion means for performing kanji-mixed kana-kanji conversion on the document information read by the document reading means; conversion candidate selecting means for selecting, from the conversion candidates obtained from the conversion means, the conversion candidate whose notation is identical to the existing document information read by the document reading means, and storing it in a document buffer; and acquisition means for acquiring the word attribute information of the conversion candidate selected by the conversion candidate selecting means.

2. A document information input method for a document creation method that includes online handwritten character recognition means and that, taking kanji-mixed kana character information as input information, performs kanji-mixed kana-kanji conversion with reference to dictionary information and word attribute information, the method comprising: when existing document information is read from an external storage device or a communication device, performing kanji-mixed kana-kanji conversion on the read document information; selecting, from the obtained conversion candidates, the conversion candidate whose notation is identical to the read existing document information, thereby creating a document; and acquiring the word attribute information of the selected conversion candidate.