JPS61235978A - Character string correction system - Google Patents

Character string correction system

Info

Publication number
JPS61235978A
JPS61235978A JP60077816A JP7781685A JPS61235978A JP S61235978 A JPS61235978 A JP S61235978A JP 60077816 A JP60077816 A JP 60077816A JP 7781685 A JP7781685 A JP 7781685A JP S61235978 A JPS61235978 A JP S61235978A
Authority
JP
Japan
Prior art keywords
kanji
character string
kana
input
reading
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP60077816A
Other languages
Japanese (ja)
Other versions
JPH0682366B2 (en
Inventor
Yutaka Ooyama
裕 大山
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Corp
Original Assignee
NEC Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NEC Corp filed Critical NEC Corp
Priority to JP60077816A priority Critical patent/JPH0682366B2/en
Publication of JPS61235978A publication Critical patent/JPS61235978A/en
Publication of JPH0682366B2 publication Critical patent/JPH0682366B2/en
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Landscapes

  • Machine Translation (AREA)
  • Document Processing Apparatus (AREA)

Abstract

PURPOSE:To output sure Kanji(Chinese character)-Kana(Japanese syllabary) character string candidate by providing a character string output means deciding a required character string and outputting it when plural Kanji-Kana character strings are obtained as the result of Kana-Kanji conversion. CONSTITUTION:A Kana-Kanji converting means 6 applies Kana-Kanji conversion while repeating the retrieval of a word dictionary 5 and a grammar detection based on a reading character string under the control of a control section 1. The means 6 extracts a corresponding word and applies grammar detection, leaves proper phrases only and excludes inadequate phrases. A word output means 7 decides a desired character string and outputs it with the method that it is selected by the user while all candidates given from the means 6 are displayed.

Description

【発明の詳細な説明】 (産業上の利用分野) 本発明は日本語入力方式に関するものであり、さらに詳
細には入力された漢字かな文字列をもとに正しい漢字か
な文字列を得るための字文列訂正方式に関するものであ
る。
[Detailed Description of the Invention] (Industrial Application Field) The present invention relates to a Japanese input method, and more specifically, a method for obtaining a correct Kanji-Kana character string based on an input Kanji-Kana character string. This relates to a character string correction method.

(従来技術とその問題点) 近年の、ワードプロセッサをはじめとした日本語入力機
器の普及につれ、日本語入力の経験の少ない未熟練者や
、漢字の用法にあまり熟知していない一般利用者が、こ
れらの機器を使用する機会が多くなってきた。
(Prior art and its problems) In recent years, with the spread of Japanese input devices such as word processors, inexperienced users with little experience in Japanese input and general users who are not familiar with the usage of kanji Opportunities to use these devices are increasing.

日本語入力手段の中で、盤面上の漢字をタッチペンで拾
いながら漢字かな文字列を入力するベンタッチ入力法を
はじめ、各漢字に対応したコードを入力するコード入力
法、実際に手で漢字を書くことで入力するオンライン手
書き文字認識法などは、利用者が所望の漢字を直接入力
できる有効なものではあるが、入力単位が文字であるこ
とから、入力した漢字文字列が日本語として正しいもの
であることは利用者自身が保証しなければならない。
Among the Japanese input methods, there is the Bentouch input method, which inputs a kanji-kana character string while picking up kanji on the board with a touch pen, the code input method, which inputs the code corresponding to each kanji, and the actual writing of kanji by hand. Online handwritten character recognition methods, which allow users to directly input the desired kanji, are effective, but because the input unit is characters, the input kanji character strings may not be correct as Japanese. The user must guarantee that this is the case.

このため、例えば「徐行する」という文字列を利用者が
誤って「除行する」と入力してしまってもこの書き方が
誤りであることを知ることはできなかった。
For this reason, for example, even if a user mistakenly inputs the character string "go slowly" as "go slowly," he or she would not be able to tell that this writing is incorrect.

(発明の目的) 本発明の目的は、利用者が入力したい漢字かな文字列の
綴りゃ用法を誤った場合に、これに変えて確からしい漢
字かな文字列候補を出力するための文字列訂正方式を提
供することにある。
(Object of the Invention) The object of the present invention is to provide a character string correction method for outputting a probable Kanji-Kana character string candidate in place of the incorrect spelling or usage of a Kanji-Kana character string that the user wants to input. Our goal is to provide the following.

(発明の構成) 本発明によれば、入力された漢字かな文字列内のかな文
字および、各々の漢字をもとに漢字の読みが収められた
漢字辞書を検索することで得られる該漢字の読みを組み
合わせて、1個または複数個の読み文字列を作成するた
めの読み文字列作成手段と、該読み文字列をカナ漢字変
換するカナ漢字変換手段と、変換の結果複数個の漢字か
な文字列が得られた場合に必要とする文字列を決定して
出力する文字列出力手段を備えたことを特徴とする文字
列訂正方式を得ることができる。
(Structure of the Invention) According to the present invention, the kana characters in the input kanji-kana character string and the kanji characters obtained by searching a kanji dictionary containing the readings of the kanji characters based on each kanji character string. A reading character string creation means for creating one or more reading character strings by combining readings, a kana-kanji conversion means for converting the reading character string into kana-kanji, and a plurality of kanji-kana characters as a result of the conversion. It is possible to obtain a character string correction method characterized by comprising character string output means for determining and outputting a required character string when a string is obtained.

(実施例) 以下に本発明の実施例について、図面を参照しながら詳
細に説明する。
(Example) Examples of the present invention will be described in detail below with reference to the drawings.

第1図は本発明の一実施例である。1は全体の制御を管
理する制御部であり、2は漢字かな文字列を入力するた
めの入力手段であり、3は漢字の読みが納められている
漢字辞書であり、4は漢字辞書3を検索することで得ら
れる各漢字の読みと入力文字列中のかな文字を組み合わ
せて、1個または複数個の読み文字列を作成するための
読み文字列作成手段であり、5は少なくとも単語の読み
と表記と品詞情報が収められた単語辞書であり、6は単
語辞書5の検索と文法検定を繰り返しながら読み文字列
をカナ漢字変換するカナ漢字変換手段であり、7はカナ
漢字変換の結果複数個の漢字かな文字列候補が得られた
場合に、単語候補の中から必要とする文字列を指示決定
して出力する文字列出力手段である。また、第1図の実
施例を用いて、入力された漢字かな文字列から正しい漢
字かな文字列を得るまでの流れの概略を第2図に示す。
FIG. 1 shows an embodiment of the present invention. 1 is a control unit that manages the overall control, 2 is an input means for inputting a kanji-kana character string, 3 is a kanji dictionary that stores the readings of kanji, and 4 is a kanji dictionary that stores kanji dictionary 3. 5 is a reading character string creation means for creating one or more reading character strings by combining the reading of each kanji obtained by searching with the kana characters in the input character string, 6 is a word dictionary containing notation and part-of-speech information, 6 is a kana-kanji conversion means that converts reading character strings into kana-kanji while repeating searches of the word dictionary 5 and grammar tests, and 7 is a kana-kanji conversion means that converts multiple kana-kanji characters as a result of kana-kanji conversion. When Kanji/Kana character string candidates are obtained, this is a character string output means that instructs and outputs the required character string from among the word candidates. Further, using the embodiment shown in FIG. 1, FIG. 2 shows an outline of the flow of obtaining a correct kanji-kana character string from an input kanji-kana character string.

ここでは、前述の「除行する」を訂正する手順を例にと
って説明する。但し、説明の便宜のためカナ漢字変換手
段6は文節分かち書きされたカナ文字列を入力とするも
のように構成されているとする。
Here, we will explain the procedure for correcting the above-mentioned "to go" as an example. However, for convenience of explanation, it is assumed that the kana-kanji converting means 6 is configured to receive a kana character string separated into clauses.

はじめに、漢字かな文字列「除行する」は、例えば文字
単位による直接入力や既に入力された漢字かな文字列か
らの文節抽出などにより、入力手段2から入力される(
第2図の101)。
First, the kanji-kana character string ``gokusuru'' is input from the input means 2, for example, by direct input character by character or by extracting phrases from the kanji-kana character string that has already been input (
101 in Figure 2).

文字単位の入力は、前述のベンタッチ入力などのほか、
漢字の読みを与えた上で表示された漢字群の中から選択
する表示選択や、既に入力されているテキストデータの
利用、外部からの文字列入力などによっても行うことが
できる。また文節の抽出は、例えば、情報処理学会計算
言語学研究会資料15−2”日本語の文節の認定”(1
978)に記述されている手法などによる自動処理や、
利用者の範囲指定指示などで行うことができる。また、
抽出された文節列が日本語として正しいか否かを予め辞
書引き0文法検定などでチェックし、文節とならないも
ののみを処理対象とすることも可能である。
In addition to the above-mentioned Bentouch input, you can input characters by character.
This can also be done by display selection by giving the reading of a kanji and selecting from a group of kanji displayed, by using text data that has already been input, or by inputting a character string from an external source. In addition, the extraction of phrases can be performed, for example, in Information Processing Society of Japan Computational Linguistics Study Group Material 15-2 "Certification of Japanese phrases" (1
Automatic processing using methods such as those described in 978),
This can be done by the user's instructions to specify a range. Also,
It is also possible to check in advance whether the extracted phrase strings are correct as Japanese or not using a dictionary lookup grammar test or the like, and to process only those that are not phrases.

入力手段2から入力された文字列「除行する」は、制御
部1の制御により読み文字列作成手段4に送られる。読
み文字列作成手段4は、得られた文字列「除行する」を
構成する漢字「除」と「行」をもとに漢字辞書3を検索
して各漢字の読みを抽出しく102)、これらと残りの
かな文字「す」と「る」を組み合わせて、可能な読み文
字列を作成する(103)。ここで漢字辞書3内に「除
」の読みとしてfじよ」と「じ」、「行」の読みとして
「こう」と「きよう」が存在していたとすると、読み文
字列作成手段4は、これらの読みから、読み文字列「じ
よこうする」、「じよきようする」、「しこうする」、
「じぎようする」を作成して、カナ漢字変換手段6に渡
す。
The character string "Yuyokusuru" inputted from the input means 2 is sent to the reading character string creation means 4 under the control of the control section 1. The reading character string creation means 4 searches the kanji dictionary 3 based on the kanji ``jo'' and ``gyo'' that make up the obtained character string ``yugyo suru'' and extracts the reading of each kanji (102); These and the remaining kana characters "su" and "ru" are combined to create a possible reading character string (103). Here, if the kanji dictionary 3 contains "fjiyo" and "ji" as the pronunciations of "extra" and "kou" and "kiyo" as the pronunciations of "line", the pronunciation character string creation means 4 , From these pronunciations, the pronunciation strings ``Jiyokosuru'', ``Jiyokiyousuru'', ``Shikosuru'',
``Jigiyosuru'' is created and passed to the kana-kanji conversion means 6.

カナ漢字変換手段6は制御部lの制御により、単語辞書
5の検索と文法検定を繰り返しながら読み文字列をもと
にカナ漢字変換を行う(104)。
Under the control of the control unit 1, the kana-kanji conversion means 6 performs kana-kanji conversion based on the reading character string while repeatedly searching the word dictionary 5 and checking the grammar (104).

ここで、単語辞書5内に「じよこう」に対して「女工」
 (名詞)、「徐行」 (す変名詞)が、「しこう」に
対して「事項」 (名詞)、「時候」 (名詞)、「時
効」 (名詞)が、「じぎよう」に対して「事業」 (
名詞)、「地形」 (名詞)が収められていたとする。
Here, in the word dictionary 5, "woman worker" is written for "jiyoko".
(noun), ``slow progress'' (su-modular noun) is ``shikou,'' while ``matters'' (noun), ``season'' (noun), and ``statute of limitations'' (noun) are ``jigiyo.''"business" (
Suppose that it contains ``terrain'' (noun) and ``topography'' (noun).

カナ漢字変換手段6は、これらの単語を抽出するととも
に文法検定を行い、文節として正しいもののみを残して
不適当なものを排除する。本例では、語尾に「する」が
存在するため1結果的にす変名詞である「徐行」に「す
る」が接続した「徐行する」だけが残り、文字列決定手
段7へ渡される。
The kana-kanji conversion means 6 extracts these words and performs a grammar test, leaving only those that are correct as clauses and eliminating inappropriate ones. In this example, since ``suru'' is present at the end of the word, only ``suru'', which is the mundane noun ``suru'' connected to ``suru'', remains and is passed to the character string determining means 7.

単語出力手段7は、例えばカナ漢字変換手段6から渡さ
れた全候補を表示した上で利用者に選択させるなどの方
法で所望の文字列を決定しく105)、これを出力する
(106)。本例では、「徐行する」1つだけが存在す
るため、これを出力することができる。
The word output means 7 determines a desired character string by, for example, displaying all the candidates passed from the kana-kanji conversion means 6 and having the user select one (105), and outputs it (106). In this example, there is only one "go slowly", so this can be output.

これら一連の手順により、「除行する」を「徐行する」
に訂正することができる。
Through these series of steps, we can change "to go slowly" to "to go slowly"
can be corrected.

(発明の効果) 本発明を用いることにより、利用者が漢字かな文字列の
綴りゃ用法の誤りを犯した際にも、自動的に正しい文字
列候補を抽出、提示することにより、利用者の文字入力
、誤字訂正の作業を径減することができる。
(Effect of the invention) By using the present invention, even when a user makes a spelling or usage error in a Kanji/Kana character string, the correct character string candidate is automatically extracted and presented. It is possible to reduce the work involved in inputting characters and correcting typographical errors.

【図面の簡単な説明】[Brief explanation of the drawing]

第1図は本発明の一実施例を説明するためのブロック図
であり、第2図は入力された漢字かな文字列から正しい
文字列を得るまでの流れの概略を示した流れ図である。 図において、1・・・制御部、2・・・入力手段、3・
・・漢字辞書、4・・・読み文字列作成手段、5・・・
単語辞書、6・・・カナ漢字変換手段、7・・・文字列
出力手段。 第 2 図
FIG. 1 is a block diagram for explaining an embodiment of the present invention, and FIG. 2 is a flowchart showing an outline of the process of obtaining a correct character string from an input Kanji/Kana character string. In the figure, 1... control unit, 2... input means, 3...
...Kanji dictionary, 4... Reading string creation means, 5...
Word dictionary, 6... Kana-Kanji conversion means, 7... Character string output means. Figure 2

Claims (1)

【特許請求の範囲】[Claims] 1、入力された漢字かな文字列内のかな文字および、各
々の漢字をもとに漢字の読みが収められた漢字辞書を検
索することで得られる該漢字の読みを組み合わせて、1
個または複数個の読み文字列を作成するための読み文字
列作成手段と、該読み文字列をカナ漢字変換するカナ漢
字変換手段と、変換の結果複数個の漢字かな文字列が得
られた場合に必要とする文字列を決定して出力する文字
列出力手段を備えたことを特徴とする文字列訂正方式。
1. Combining the kana characters in the input kanji-kana character string and the readings of the kanji obtained by searching a kanji dictionary containing the readings of kanji based on each kanji, 1.
A reading character string creation means for creating one or more reading character strings, a kana-kanji conversion means for converting the reading character string into kana-kanji, and a case where a plurality of kanji-kana character strings are obtained as a result of the conversion. A character string correction method characterized by comprising a character string output means for determining and outputting a character string required for a character string.
JP60077816A 1985-04-12 1985-04-12 Character string correction method Expired - Lifetime JPH0682366B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP60077816A JPH0682366B2 (en) 1985-04-12 1985-04-12 Character string correction method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP60077816A JPH0682366B2 (en) 1985-04-12 1985-04-12 Character string correction method

Publications (2)

Publication Number Publication Date
JPS61235978A true JPS61235978A (en) 1986-10-21
JPH0682366B2 JPH0682366B2 (en) 1994-10-19

Family

ID=13644549

Family Applications (1)

Application Number Title Priority Date Filing Date
JP60077816A Expired - Lifetime JPH0682366B2 (en) 1985-04-12 1985-04-12 Character string correction method

Country Status (1)

Country Link
JP (1) JPH0682366B2 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS61269768A (en) * 1985-05-24 1986-11-29 Oki Electric Ind Co Ltd Kana and kanji input device
JPH04174055A (en) * 1990-11-02 1992-06-22 Chubu Nippon Denki Software Kk Erroneously converted word detection and correction system of japanese word processor

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS60164864A (en) * 1984-02-08 1985-08-27 Hitachi Ltd Device for processing data

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS60164864A (en) * 1984-02-08 1985-08-27 Hitachi Ltd Device for processing data

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS61269768A (en) * 1985-05-24 1986-11-29 Oki Electric Ind Co Ltd Kana and kanji input device
JPH04174055A (en) * 1990-11-02 1992-06-22 Chubu Nippon Denki Software Kk Erroneously converted word detection and correction system of japanese word processor

Also Published As

Publication number Publication date
JPH0682366B2 (en) 1994-10-19

Similar Documents

Publication Publication Date Title
JPS61235978A (en) Character string correction system
JPH11238051A (en) Chinese input conversion processing device, Chinese input conversion processing method, recording medium recording Chinese input conversion processing program
JP2002207728A (en) Phonetic character generation device and recording medium storing program for realizing the same
JPS634206B2 (en)
JPS61234462A (en) Character string correcting system
JP3847801B2 (en) Character processing apparatus and processing method thereof
JPS61234461A (en) Character string correcting system
JPS61234459A (en) Word correction system
JPH08272780A (en) Chinese input processing apparatus, Chinese input processing method, language processing apparatus and language processing method
JPS58103022A (en) Sentence input device
JPH0441398Y2 (en)
JP3710157B2 (en) Kanji phrase processing method and apparatus
JPS62209667A (en) Sentence producing device
JPH028956A (en) Document processor
WO2006051647A1 (en) Text data structure and text data processing method
JPH08335217A (en) Reading conversion method and document creation device
JPS61234460A (en) Word correction system
JPH06236399A (en) Word processor with translation function
JPH05151194A (en) Document creation support device
JPH02136959A (en) Extracting device for correction candidate of japanese sentence
JPS61260354A (en) Kana and written kanji converting system
JPH0785026A (en) Dictionary rehabilitation method and device
JPS60207948A (en) Kana-kanji conversion processing device
JPH01316863A (en) Automatic qualifying and correcting device for error in japanese language text
JPH0346055A (en) Conversion system from roman to sentence mixed with kanji(chinese character) and kana(japanese syllabary)

Legal Events

Date Code Title Description
EXPY Cancellation because of completion of term