JPS60164864A

JPS60164864A - Device for processing data

Info

Publication number: JPS60164864A
Application number: JP59019969A
Authority: JP
Inventors: Yoshinari Hiraoka; 平岡　良成; Yasunori Katayama; 片山　恭紀; Yutaka Yoshiura; 裕吉浦; Kunio Nakanishi; 邦夫中西
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1984-02-08
Filing date: 1984-02-08
Publication date: 1985-08-27
Anticipated expiration: 2010-03-15
Also published as: JPH0724054B2

Abstract

PURPOSE:To correct errors of Japanese sentences with KANA (Japanese syllabary) KANJI (Chinese character), and homonyms which are used as faulse substitute characters by providing an inner-sentence from element analysis means and inner-sentence form element analysis means by the Roman letters in a processor. CONSTITUTION:For correcting misspellings in Japanese sentences with KANA and KANJI and homonyms which are used as faulse substitute characters, a character string to be examined is analysed in its every sentence through a sentence feeding out means 21 by an inner sentence form element analysis means 23. This is conducted with a vocabulary dictionary 30 and inter-vocabulary connection string 34 in a KANJI and KANA description. When the results fail, the device decides that the sentence contains errors of the faulse substitute character, by homonyms and a form element analysis is made by an inner-sentence form element analysis means 26 by the Roman letters. The resuslts of this are converted by a Roman letters - KANJI.KANA conversion means 27 into a KANJI.KANA combined sentence and are presented to a user. In this way, the provision and analysis of the means 23 and 26 enable correction of errors of the substitute characters in homonyms as well as errors of misspellings.

Description

【発明の詳細な説明】〔発明の利用分野〕本発明は、仮名漢字混すの日本語文を処理するテキスト
処理システムに係り、特にこのよう、なテキスト処理シ
ステムにおいて、綴りの誤りを検出あるいは訂正する装
置に係る。[Detailed Description of the Invention] [Field of Application of the Invention] The present invention relates to a text processing system for processing Japanese sentences mixed with kana and kanji, and particularly for detecting or correcting spelling errors in such a text processing system. It pertains to equipment that performs

[Background of the invention]

ＯＡの進展に伴って計算機によるテキスト処理が注目を
集めてφる。特に欧米では、アルファベットの字数が少
ないこともおって、テキストの計算機処理が相当普及し
てきており、単にテキストを人、力・記憶・出力するば
かりでなく、入力されたテキストの練りの誤りを検出・
訂正するシステムが製品化されつつある。例えば、ＵＮ
ＩＸのユーティリティプログラムであるＴＹＰＯは英単
語の綴り誤りを検出する機能を有しており、ＤＥＣｌｏ
の５ＰＢＬＬというプログラムは英単語の綴り誤りの一
部を訂正する機能がある。例えばＪ、　Ｌ、　Ｐｅｔｅ
ｒｓｏｎ　−Ｃｏｍｐｕｔｅｒ　ｐｒＱｇｒａｍｆｉ　
ｆｏｒＤｅｔｅｃｔｌｎｇ　ａｎｄ　Ｃｏｒｒｅｃｔｉ
ｎｇ　ＳｐｅｌｌｌｎｇＥｒｒｏｒｓ”、　ＣＡＣＭ、
　Ｖｏ　１２３　、４９２には、このような英文綴り検
証システムが詳しく紹介されている。With the advancement of OA, text processing by computers has been attracting attention. Particularly in Europe and America, where the number of letters in the alphabet is small, computer processing of text has become quite widespread, and it is not only possible to simply process, memorize, and output text, but also to detect errors in the processing of input text.・
Correcting systems are being commercialized. For example, UN
TYPO, a utility program of IX, has a function to detect misspellings of English words, and DEClo
A program called 5PBLL has the ability to correct some spelling errors in English words. For example, J.L.Pete
rson-Computer prQgramfi
for Detection and Correcti
ng SpellngErrors”, CACM,
Vo 123, 492 introduces such an English spelling verification system in detail.

しかしながら、日本語は（１）漢字、平仮名１斥仮名と３つの字種が混在してい
る。However, Japanese has three types of characters: (1) kanji, hiragana, and kanji.

（２）漢字は字数が極めて多い。(2) Kanji has an extremely large number of characters.

（３）単語の切れ目に空白が挿入されないため単語の切
り出しが難しい。(3) It is difficult to cut out words because spaces are not inserted between words.

等の特徴があるため、日本では、テキストの入力方式及
び綴りの検出・訂正システムは開発が困難である。Because of these characteristics, it is difficult to develop text input methods and spelling detection/correction systems in Japan.

さらに、日本語は綴り誤りのパターン自体が欧米語と異
なるため、欧米で開発された処理方式は適用することが
できず、日本語独自の方式が必要である。Furthermore, since the spelling error patterns in Japanese are different from those in Western languages, processing methods developed in Europe and the United States cannot be applied, and a method unique to Japanese is required.

日本語の綴シ誤りのパターンは入力方式にも大きく依存
している。現時点では日本語入力方式として′、（１）　タッチ−タイプ方式（２）ペン・タッチ方式（３）仮名漢字変換方式（４）手書文字認識方式が知られている（特開昭５８−１０６６６３、特開昭５
５−１４３６８１）。The pattern of spelling errors in Japanese is also highly dependent on the input method. At present, the following Japanese input methods are known: (1) touch-type method, (2) pen-touch method, (3) kana-kanji conversion method, and (4) handwritten character recognition method (Japanese Patent Laid-Open No. 58-106663). , Japanese Patent Publication No. 5
5-143681).

一方綴り誤りのパターンとしては、（ａ）　ユーザの記憶誤り（ｂ）　タイプミス（Ｃ）　装置の認識誤りが考えられる。このうち（ａ）は、日本語では同音異語
をあて字として使った綴シ誤り、送り仮名の誤り等が多
く、（１）、　（２）、　（４）の入力方式では多く見
受けられるが、（３）の方式のうち文節入力可能な方式
ではユーザが表音文字である仮名で文節を入力するとシ
ステムがこれを仮名漢字混シの文字列に変換するため、
この種の誤りは他の方式に比較し少ない。（ｂ）は、主
として（１）と（２）に特有の誤りである。On the other hand, possible spelling error patterns include (a) user's memory error (b) typographical error (C) device recognition error. Of these, in (a), there are many spelling errors using homophones as address characters, errors in okurikana, etc., which are often seen in the input methods of (1), (2), and (4). Among methods (3), in the method that allows phrase input, when the user inputs a phrase in kana, which is a phonetic character, the system converts it into a character string that is a mixture of kana, kanji, and kanji.
This type of error is less common than other methods. (b) is mainly an error specific to (1) and (2).

（Ｃ）は（４）の方式のみで問題となるエラーである。(C) is an error that is a problem only in method (4).

以上の考察で明らかなように、現在日本語ワードプロセ
ッサで多用されている（３）の仮名漢字変換方式は比較
的誤りの発生しにくい入力方式と考えられるが、初心者
にとって使いやすいこと、文字認識技術の応用から、（
４）の手書文字認識方式の日本語入力方式に適用される
、（ａ）及び（Ｃ）タイプの誤りを検出・訂正するのに
好都合な装置を提供するものである。As is clear from the above considerations, the kana-kanji conversion method (3), which is currently widely used in Japanese word processors, is considered to be an input method that is relatively unlikely to cause errors, but it is easy to use for beginners, and the character recognition technology From the application of (
The present invention provides a device suitable for detecting and correcting errors of types (a) and (C), which is applied to the Japanese input method of the handwritten character recognition method in 4).

[Purpose of the invention]

本発明は、上記の考察に基づくものでおって、その第１
の目的は、漢字仮名混９日本語文を処理するテキスト処
理システムにおいて、日本語文に含まれる綴り誤りを文
節単位に漏出する手段を提供することにある。その第２
の目的は、上記検出された綴り誤りのうち、同音異字を
あて字として使った誤抄を訂正する手段を提供すること
にある。The present invention is based on the above considerations, and the first
The purpose of this system is to provide a means for leaking misspellings contained in Japanese sentences in units of phrases in a text processing system that processes nine Japanese sentences containing kanji and kana. The second
The purpose of this is to provide a means for correcting misspellings that use homophones as spellings among the above-mentioned detected misspellings.

[Summary of the invention]

本発明は、綴シ誤りを含んだ漢字仮名混）の日本語文を
、漢字仮名混９表記の単語辞書を参照して単語分割する
手段と、単語分割が失敗した文節をユーザに提示する手
段と、単語分割が失敗した文節を仮名漢字変換方式から
表音文字（例えばローマ字）表記に変換する変換テーブ
ルと、得られた表音文字表記の文節を、表音文字表記の
単語辞書を参照して暎語に分割する手段と、分割が成功
した時、その表音文字表記の文節を仮名漢字変換方式に
変換し、訂正された文節としてユーザに提示する手段を
有することを特徴とするものである。The present invention provides a means for dividing a Japanese sentence containing spelling errors (a combination of kanji and kana) into words by referring to a word dictionary containing 9 combinations of kanji and kana, and a means for presenting to a user the phrases for which word division has failed. , by referring to a conversion table that converts the phrases for which word division has failed from the kana-kanji conversion method to phonograms (for example, Romaji), and a word dictionary with phonograms to convert the resulting phrases in phonograms. The present invention is characterized by having means for dividing the phrase into ``Ero'' words, and, when the division is successful, means for converting the phonetic phrase into a kana-kanji conversion method and presenting it to the user as a corrected phrase. .

[Embodiments of the invention]

以下、本発明の一実施例を図面に基づいて説明する。第
１図は、本実施例として取り上げた文書処理ワークステ
ーションの外観を示す。ワークステーション１は、ディ
スプレイ２．キーボード３゜手書文字入力用タブレット
４、及びフロッピー記憶装置６を有する。ユーザは、日
本語文を、キーボード３からキー人力するか、又はタブ
レット４にスタイラスペン５で手書入力する。入力され
た日本語文はディスプレイ２に表示されるか、フロツピ
ー記憶装置６に格納される。又、一旦、記憶装置６格納
された日本語文をディスプレイ２に表示させることがで
きる。Hereinafter, one embodiment of the present invention will be described based on the drawings. FIG. 1 shows the external appearance of the document processing workstation used as this embodiment. The workstation 1 has a display 2. It has a keyboard 3, a tablet 4 for inputting handwritten characters, and a floppy storage device 6. The user inputs the Japanese text by hand from the keyboard 3 or by hand using the stylus pen 5 on the tablet 4. The input Japanese sentence is displayed on the display 2 or stored in the floppy storage device 6. Furthermore, the Japanese text once stored in the storage device 6 can be displayed on the display 2.

第２図は第１図のワークステーションの詳細を示すブロ
ック図である。ワークステーションは、ディスフレイ２
、表示用のバッファメモリ７、キーボード３、タブレッ
ト４、フロッピー記憶装置６、プロセッサ８、手書文字
ＬＩＡ＆装置９、及びプリンタｌＯを含む。FIG. 2 is a block diagram showing details of the workstation of FIG. 1. Workstation is Disfray 2
, a display buffer memory 7, a keyboard 3, a tablet 4, a floppy storage device 6, a processor 8, a handwritten character LIA & device 9, and a printer IO.

プロセッサ８は、入力されたテキストの処理等各種のデ
ータ処理を行う装置であって、マイクロプロセッサ又は
専用論理回路で実現される場合もあるし、ホストコンピ
ュータと端末装置で実現される場合もある。本発明の綴
シ誤り訂正処理はこのプロセッサで実行される。The processor 8 is a device that performs various data processing such as processing input text, and may be realized by a microprocessor or a dedicated logic circuit, or may be realized by a host computer and a terminal device. The spelling error correction process of the present invention is executed by this processor.

手書文字認識装置９は、タブレットから入力された漢字
や仮名文字を認識しこれをコード化して、プロセッサ８
に送り込む装置であって、９５〜９９チ程度の認識率が
得られている。The handwritten character recognition device 9 recognizes kanji and kana characters input from the tablet, codes them, and sends them to the processor 8.
It is a device that sends images to the computer, and a recognition rate of about 95 to 99 inches has been obtained.

プリンタ１０は、プロセッサ８で処理され九日本語文を
印刷する装置である。The printer 10 is a device that is processed by the processor 8 and prints nine Japanese sentences.

フロッピー記憶装置６には、入力された日本語文が格納
される他、日本語の解析に用いられる辞書やテーブル類
も格納されている。なおフロッピー記憶装置の代りにデ
ィスク記憶装置等信の外部記憶装置を用いてもよい。In addition to storing input Japanese sentences, the floppy storage device 6 also stores dictionaries and tables used for analyzing Japanese. Note that an external storage device such as a disk storage device may be used instead of the floppy storage device.

第３図は、本発明で綴り誤りの検出・訂正を行った場合
のディスプレイスクリーン表示の一例を示す図である。FIG. 3 is a diagram showing an example of a display screen display when spelling errors are detected and corrected according to the present invention.

キーボード２、タブレット４、又はフロッピー記憶装置
６から入力された日本語文はスクリーンの上半分に表示
されている。ここで綴シ検出用のコマンドを、キーボー
ド３から入力すると、システムは、日本語文の先頭から
順番に一文節ずつ解析し、誤った文節が見付かると、ア
ンダーラインが施される。この例では「専問家の」とい
う誤った文節が検出されている。これに対してシステム
は、スクリーンの下半分に３種類のメニューを表示する
。これによってユーザは、システムの検出した誤りがシ
ステムの辞書にない特殊な用語であれば、１を選択し、
システムが訂正した綴シが正しければ２を選択し、自分
で新たに正しい綴シを入力したければ３を選択すること
ができる。ユーザの選択を終了するとシステムは次の文
節から再び文の解析を再開する。Japanese sentences input from the keyboard 2, tablet 4, or floppy storage device 6 are displayed in the upper half of the screen. When a spelling line detection command is input from the keyboard 3, the system analyzes the Japanese sentence phrase by phrase from the beginning, and if an incorrect phrase is found, it underlines it. In this example, the incorrect phrase "expert" is detected. In response, the system displays three menus in the bottom half of the screen. This allows the user to select 1 if the error detected by the system is a special term that is not in the system dictionary;
If the spelling corrected by the system is correct, you can select 2, and if you want to enter a new correct spelling yourself, you can select 3. Once the user has made his selection, the system resumes parsing the sentence with the next clause.

第４図は、上記のような綴り誤り検出・訂正処理をプロ
セッサ８及びフロッピー記憶装置６でどのような手順で
実行するかを示したものである。FIG. 4 shows the procedure by which the processor 8 and the floppy storage device 6 execute the spelling error detection and correction process as described above.

検査すべき文字列はまず文字列バッファ２ｏに格納され
た後、文節切り出し手段２１によって文節に分割される
。文節は、自立語（一般に漢字又は片仮名で始まる。）
に任意の個数の付属語（平仮名）が続く構成になってい
るので、平仮名から漢字又は片仮名に字種が変化する点
を検出すれば、容易に文節分けができる。次に切り出さ
れた文節ごとに文節内形態素解析２１が行なわれる。こ
とで形態素解析とは文節を単語に切り分ける処理のこと
で、日本語ワードプロセッサの仮名漢字変換処理におい
て扱われる概念である。ここでは、テキストが仮名漢字
混り文であるから、漢字仮名表記による単語辞書３０及
び単語間接続行列３４を（９）用いて解析を進める。もし、形態素解析が成功すれば、
その文節は肥しいと判断し、次の文節を切出すため文節
切出し手段２１に制御が移る。もし、形態素解析が失敗
した場合は、その文節に同音異字による当て学的な誤字
が含まれていると判断し、当該文節を、漢字仮名→ロー
マ字変換手段２４、及び漢字仮名→ローマ字対応表３３
を用いて、ローマ字表記に変換し、ローマ字文節バッフ
ァ２５に格調する。次に、このローマ字文節を、ローマ
字による形態素解析手段２６を用い、形態素解析を行な
う。今度は表音文字をベースとした形態素解析であるた
め、音は正しくても表記が誤っているような綴シ誤りを
含む文節は正しく解析される。The character string to be tested is first stored in the character string buffer 2o, and then divided into phrases by the phrase cutting means 21. A clause is an independent word (generally begins with a kanji or katakana).
is followed by an arbitrary number of auxiliary words (hiragana), so if the point where the character type changes from hiragana to kanji or katakana is detected, it can be easily divided into phrases. Next, intra-clause morphological analysis 21 is performed for each extracted phrase. Morphological analysis is the process of dividing phrases into words, and is a concept handled in the kana-kanji conversion process of Japanese word processors. Here, since the text is a mixed sentence of kana and kanji, the analysis proceeds using the word dictionary 30 written in kanji and kana notation and the word connection matrix 34 (9). If morphological analysis is successful,
The phrase is judged to be fat, and control is transferred to phrase cutting means 21 to cut out the next phrase. If the morphological analysis fails, it is determined that the phrase contains a homonym-based typographical error, and the phrase is transferred to the Kanji-kana → Roman character conversion means 24 and the Kanji-kana → Roman character correspondence table 33.
is used to convert it into Roman characters and store it in the Roman characters buffer 25. Next, this Roman-ji phrase is subjected to morphological analysis using the Roman-ji morphological analysis means 26. This time, the morphological analysis is based on phonetic characters, so phrases that contain spelling errors, such as spelling errors but with the correct sound, are correctly analyzed.

この場合の形態素解析は、ローマ字表記による単語辞書
３２及び単語間接続行列３４を用いる。ここでもし形態
素解析が失敗した場合は、もとの文字列にある字以外の
誤りが含まれていたことになり、システムは、ユーザに
当該文節に誤りが６ったことだけを通知する。もし形態
素解析が成功した場合には、正しく解析された文節を、
ローマ字（１０） →漢字仮名変換手段２７によって、漢字仮名混り文に変
換し訂正文字列バッファ２８に格納した後ユーザに提示
する。変換は、ローマ字表記による単語辞書３２中の単
語から、漢字仮名表記による単語辞書３０中の単語への
ポインタ３１を参照して実行される。なお、同一のロー
マ字表記の文節に対して、複数の漢字仮名混り文節が対
応する場合には、複数の候補をユーザに提示する。Morphological analysis in this case uses a word dictionary 32 written in Roman letters and a word connection matrix 34. If the morphological analysis fails here, it means that the original character string contains an error other than a certain character, and the system notifies the user that there are 6 errors in the phrase. If the morphological analysis is successful, the correctly analyzed phrase is
Romaji (10) -> The kanji/kana conversion means 27 converts it into a sentence containing kanji/kana, stores it in the corrected character string buffer 28, and then presents it to the user. Conversion is performed by referring to a pointer 31 from a word in the word dictionary 32 written in Roman letters to a word in the word dictionary 30 written in Kanji/kana. Note that if a plurality of phrases containing kanji and kana characters correspond to the same phrase written in Roman characters, a plurality of candidates are presented to the user.

第５図は、「専門家の」という誤った文節が切り出され
た時の処理の流れを示した図である。文節バッファ２０
に格納された「専門家の」という文節を形態素解析する
と、「専門家」が辞書にないため、解析は失敗する。そ
こで、漢字仮名→ローマ字変換表３３を用いてローマ字
表記ｒ８ＥＮＭＯＮＫＡＮＯＪに変換すると、今度はｒ
８ＥＮＭＯＮＫＡＪ　という名詞が辞書中にあるため形
態素解析が成功し、名詞１’−Ｓ　ＥＮＭＯＮＫＡＪ　
と、名詞に続く格助詞「ＮＯ」に分割される。最後にこ
れらの単語は、２つの辞書の間のポインタ３１をたどっ
て、「専門家の」という訂正文字列に変（１１）換されユーザーに提示される。FIG. 5 is a diagram showing the flow of processing when the incorrect phrase "expert" is extracted. Clause buffer 20
When we morphologically analyze the phrase "expert" stored in , the analysis fails because "expert" is not in the dictionary. Therefore, when converting from kanji kana to romaji notation r8ENMONKANOJ using the Romaji conversion table 33, this time r
Since the noun 8ENMONKAJ is in the dictionary, the morphological analysis is successful, and the noun 1'-S ENMONKAJ
and the case particle ``NO'' that follows the noun. Finally, following the pointer 31 between the two dictionaries, these words are converted (11) into the corrected character string ``expert'' and presented to the user.

第６図は、「恥ぢない」という誤った文節が切り出され
た時の処理の流れを示した図である。この場合も「恥ぢ
る」という学語が辞書にないため漢字仮名混り表記によ
る形態素解析は失敗する。FIG. 6 is a diagram showing the flow of processing when the incorrect phrase ``shamejinai'' is extracted. In this case as well, morphological analysis based on the kanji/kana mixed notation fails because the academic term ``shajiru'' is not in the dictionary.

しかし、ローマ字表記に変換すると形態素解析は成功し
、［ｆ（ＡＪＩＲＵ、ｌという動詞の未然形と、「ＮＡ
ＩＪという助動詞に分割される。最後にこれを、漢字仮
名混り表記の文節「恥じる」に変換し、訂正された文節
としてユーザに提示する。However, when converted to the Roman alphabet, the morphological analysis was successful, and the unformed form of the verb [f(AJIRU, l) and the ``NA''
It is divided into auxiliary verbs called IJ. Finally, this is converted into the phrase ``shame'' written in a mixture of kanji and kana, and the corrected phrase is presented to the user.

表音文字としてローマ字の例を示したが、カナ文字であ
っても差しつかえない。Although we have shown an example of Romaji as phonetic characters, Kana characters may also be used.

〔Effect of the invention〕

上述のように本発明によれば、第１に仮名漢字混り日本
語文の綴り誤りを検出できる効果がある。As described above, according to the present invention, first, spelling errors in Japanese sentences containing kana and kanji can be detected.

誤りとしてはタイプミス、手書文字認識の認識誤り、ユ
ーザの記憶誤り等を含む。第２に、上記の誤りのうちで
、同音異字を誤ってあて手酌に用いた綴り誤りを訂正す
る効果を有する。Errors include typographical errors, recognition errors in handwritten character recognition, and user memory errors. Second, among the above-mentioned errors, it has the effect of correcting spelling errors in which homonyms are mistakenly used as compensation.

[Brief explanation of the drawing]

（１２）図面は本発明の一実施例で、第１図はシステムの外観図
、第２図はブロック図、第３図はディスプレイ表示の一
例を示す図、第４図は、第２図のプロセッサと、フロッ
ピー記憶装置内のデータフローを示す機能図、第５図は
、「専門家の」という誤った文字列が入力された時の処
理の説明図、第６図は、「恥ぢる」という誤った文字列
が入った時の処理の説明図である。１・・・ワークステーション、２・・・ディスプレイス
クリーン、３・・・キーボード、４・・・タブレット、
５・・・スタイラスペン、６・・・フロッピー記憶装置
、７・・・表示用バッファメモリ、８−プロセッサ、９
・・・手書文字認識装置、ｌＯ・・・プリンタ、２０・
・・文字列バッファ、２１・・・文節切出し手段、２２
・・・漢字仮名混り文節バッファ、２３・・・漢字仮名
混り表記の形態素解析手段、２４・・・漢字仮名→ロー
マ字変換手段、２５・・・ローマ字バッファ、２６・・
・ローマ字表記の形態素解析手段、２７・・・ローマ字
→漢字仮名変換手段、２８・・・訂正文字列バッファ、
２９・・・文書メモＩＪ、３０・・・漢字仮名表記によ
る単語辞書、（１３）３１・・・ポインタ、３２・・・ローマ字表記による単
語辞書、３３・・・漢字仮名→ローマ字対応表、３４・
・・単語間接続行列。代理人　弁理士　高橋明夫（１４）(12) The drawings show one embodiment of the present invention; FIG. 1 is an external view of the system, FIG. 2 is a block diagram, FIG. 3 is a diagram showing an example of a display display, and FIG. A functional diagram showing the processor and the data flow within the floppy storage device. Figure 5 is an explanatory diagram of the processing when the incorrect character string ``expert'' is input. Figure 6 is ``embarrassing''. It is an explanatory diagram of processing when an incorrect character string is entered. 1... Workstation, 2... Display screen, 3... Keyboard, 4... Tablet,
5... Stylus pen, 6... Floppy storage device, 7... Display buffer memory, 8-Processor, 9
...Handwritten character recognition device, lO...Printer, 20.
... Character string buffer, 21 ... Bunsetsu extraction means, 22
... Kanji/kana mixed phrase buffer, 23... Morphological analysis means for kanji/kana mixed notation, 24... Kanji/kana → Roman character conversion means, 25... Roman character buffer, 26...
・Morphological analysis means for Romaji notation, 27... Romaji → Kanji-kana conversion means, 28... Corrected character string buffer,
29... Document memo IJ, 30... Word dictionary in kanji-kana notation, (13) 31... Pointer, 32... Word dictionary in romanization, 33... Kanji-kana → romanization correspondence table, 34・
...Connection matrix between words. Agent Patent attorney Akio Takahashi (14)

Claims

[Scope of Claims] 1. In a data processing device that inputs a series of sentences containing kanji and kana, divides them into clauses while referring to an interrogation dictionary, and outputs the results, comprising: or a 6 first means for converting the entire phonetic alphabet to the phonetic alphabet, a phonetic alphabet zero word dictionary, and a second means for converting the phonetic alphabet to the kanji kakuho mixed transcription; When the normal division processing of a series of sentences containing kanji and kana fails, part or all of the kanji and kana sentences related to the failure are expressed in letters in the table by the first means, and the words in the phonogram are written. A data processing device characterized by performing a second division process of dividing the phrase into phrases while referring to a dictionary, converting the phrase into a sentence containing kanji and kana by the second means, and outputting the phrase. 2. Claims No. In the data processing according to item 1, a printer or a display is provided as a means for outputting the division results, and in the second division process in item 1ff, an output instructing the selection of homophones is sent to the printer or the display. A data processing device characterized by displaying.1