JPS58222330A - Japanese language converting system - Google Patents

Japanese language converting system

Info

Publication number
JPS58222330A
JPS58222330A JP57106571A JP10657182A JPS58222330A JP S58222330 A JPS58222330 A JP S58222330A JP 57106571 A JP57106571 A JP 57106571A JP 10657182 A JP10657182 A JP 10657182A JP S58222330 A JPS58222330 A JP S58222330A
Authority
JP
Japan
Prior art keywords
kanji
character
kana
word
section
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP57106571A
Other languages
Japanese (ja)
Inventor
Ryohei Nakatsu
良平 中津
Ikuo Namiki
並木 育夫
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nippon Telegraph and Telephone Corp
Original Assignee
Nippon Telegraph and Telephone Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nippon Telegraph and Telephone Corp filed Critical Nippon Telegraph and Telephone Corp
Priority to JP57106571A priority Critical patent/JPS58222330A/en
Publication of JPS58222330A publication Critical patent/JPS58222330A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/40Processing or translation of natural language
    • G06F40/53Processing of non-Latin text

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Document Processing Apparatus (AREA)

Abstract

PURPOSE:To select quickly and efficiently Kanji (Chinese characters), by adopting a two-stage selecting method, in a Japanese language converting system converting an inputted word into a Kanji. CONSTITUTION:The result of recognition transmitted from an input terminal 10 is stored in a character matrix buffer 1 and the word part to be converted into a Kanji is stubbed. The character matrix corresponding to the stubbed word is given to a Kanji converting section 2, where it is converted into a Kanji code. Further, a part other than the word is given to a Kana (Japanese syllabary) character converting section 3, where the part is converted into a Hiragana (round form of Japanese syllabary) or Katakana (square form of Japanese syllabary). Finally, a Kana/Kanji mixed sentence forming section 4 forms a Kana/Kanji mixed sentence by combining outputs of the Kanji converting section 2 and the Kana character converting section 3.

Description

【発明の詳細な説明】 この発明は、音声認識もしくは文字認識技術を用いて日
本語文を入力する際、認識結果と単語辞書の照合の結果
得られる多数の漢字中から目的とする漢字を高速に選択
する機能を有する日本語変換方式に関するものでおる。
[Detailed Description of the Invention] This invention provides a method for quickly selecting a desired kanji from a large number of kanji obtained as a result of comparing the recognition result with a word dictionary when inputting a Japanese sentence using voice recognition or character recognition technology. This article concerns a Japanese conversion method that has a selection function.

く背景〉 日本語情報処理が進むにつれ、簡便な日本語入力法とし
て音声1手書き文字等による一本語人カが有望な技術と
目されている。音声入力の場合、入力は単音節単位等に
行われ、まず音声認識部で各音節の認識が行われた後、
単語辞書と照合して漢字に変換する処理が行われる。こ
の場合一般・Kは次のような処理が考えられる。即ち音
声認識の現状の技術では各音節の認識が必ずしも完全に
は行えないため、音節毎に篭数個の候補を出力するのが
通常である。そのような認識結果の一例を第1図に示す
。これは「温泉」と入力した場合の音声認識結果であり
、第it#節a1は「コ」、「オ」「ホ」の何れかと認
識され、第2音節alは「ン」と認識され、第3音節a
sは「セ」「ゼ」の何れかと認識され、第4音節a4は
「ン」と認識された場合である。
Background> As Japanese information processing progresses, monolingual input using voice and handwritten characters is seen as a promising technology as a simple Japanese input method. In the case of voice input, input is performed in monosyllable units, and each syllable is first recognized by the voice recognition unit, and then
A process of comparing the words with a word dictionary and converting them into kanji is performed. In this case, the following processing can be considered for general K. That is, since current speech recognition technology cannot always completely recognize each syllable, it is common to output several candidates for each syllable. An example of such a recognition result is shown in FIG. This is the speech recognition result when "hot spring" is input, and the it# clause a1 is recognized as either "ko", "o" or "ho", and the second syllable al is recognized as "n", 3rd syllable a
This is a case where s is recognized as either "se" or "ze", and the fourth syllable a4 is recognized as "n".

このような形式は必ずしも音声入力ばかシでなく、手書
きのかな文字入力の場合にも用いられるものであるから
、これを以後は文字行列と呼ぶことにする。この文字行
列を漢字に変換するには各文字の複数個の候補から1個
ずつ文字を選び出しこれをつないでできる文字系列と単
語辞書と照合し、合致する見出しが単語辞書中に存在す
ると、その見出しに対応する漢字に変換する。日本語に
は同音異義語が存在するので一つの見出しに対応する漢
字は必ずしも一種類に限らなho例えば「オンド」とい
う見出しに対応する漢字は「温度」「音頭」等が存在す
る。ところが入力が文字行列の場合は候補の組み合わせ
Kよって生じる異なった各文字系列に対し、辞書照合の
結果それぞれ複数の漢字が得られる。例えば第1図の例
では文字系列「オンセン」に対応する漢字「温泉」の#
1かニ文字系列「コンセン」に対して漢字「混線」。
Since this format is not necessarily used for voice input, but is also used for handwritten kana character input, it will be referred to as a character matrix from now on. To convert this character matrix into kanji, select one character from the multiple candidates for each character, match the resulting string of characters with the word dictionary, and if a matching heading exists in the word dictionary, Convert to kanji corresponding to the heading. Since there are homonyms in Japanese, there is always only one type of kanji that corresponds to one heading.For example, the kanji that corresponds to the heading ``ondo'' include ``temperature'' and ``ondo.'' However, when the input is a character matrix, a plurality of kanji characters are obtained as a result of dictionary matching for each different character sequence generated by the combination K of candidates. For example, in the example in Figure 1, the # of the kanji ``hot spring'' corresponding to the character series ``onsen'' is
The kanji character ``cross line'' for the one or two character series ``konsen''.

「混戦」等が、文字系列「コンセン」に対して漢字「婚
前」、「混然」等・・・・・・が得られる。したがって
1つの文字行列に対応して得られる漢字は極や、多種類
にヶ、。   ・□・1: 従来のかなキーボード入力によるかな漢字変換方式では
入力は文字系列であるため同音異義語があるにしてもそ
の漢字の数は少数に限定されていた。したがって使用者
のキーボード操作にょル同音異義語を次々と漢字を画面
上に表示しその中から正しい漢字を選択させる方法が通
常とられていた。これに対し、入力が文字行列の場合は
、上述のように極めて多数の漢字候補が存在するため、
これを使用者のキーボード操作にょシ次々と画面上に表
示し、使用者に選択させる方式は操作が極めて繁雑とな
9時間を要するばかシでなく使用者の負担も大きくなシ
、ひりては効率良い日本語入力装置を実現できないこと
になる。
For the character series ``konsen'', the kanji ``before marriage'', ``chaosen'', etc. are obtained. Therefore, the number of kanji that can be obtained from one character matrix is extremely large.・□・1: In the conventional kana-kanji conversion method using kana keyboard input, the input is a series of characters, so even if there are homophones, the number of kanji is limited to a small number. Therefore, the usual method was to display kanji characters one after another on the screen as homonyms were used by the user on the keyboard, and to have the user select the correct kanji character from among the kanji characters. On the other hand, when the input is a character matrix, there are a large number of kanji candidates as mentioned above, so
The method of displaying these on the screen one after another as the user operates the keyboard and letting the user make a selection is not only extremely complicated and takes 9 hours to operate, but also places a heavy burden on the user. This means that an efficient Japanese input device cannot be realized.

〈発明の概要〉 この発明の目的は入力された文字行列に対する漢字の選
択を迅速に行うことができ、操作も簡単な日本語変換方
式を提供することにある。
<Summary of the Invention> An object of the present invention is to provide a Japanese conversion method that can quickly select Kanji characters from an input character matrix and is easy to operate.

この発明においては使用者に対し、まず入力単語がいず
れの読みに属するかを選択させ、次に選択された読みに
′:″属する複数個の漢字中から正しい漢字を選択させ
るという2段がまえの選択法をとることによシ複数個の
漢字中から正しい漢字を効率良く選択するものである。
In this invention, the user is first asked to select which reading the input word belongs to, and then the user is made to select the correct kanji from a plurality of kanji that belong to the selected reading ``:''. By using this selection method, the correct kanji can be efficiently selected from among a plurality of kanji.

〈実施例〉 第2図はこの発明の実施例を示し、文字行列バッファ1
は本図には示してない音声認識部(もしくは手書き文字
認識部)より入力端子1ot−通じて送られてくる認識
結果を保持し、かつ漢字に変換すべき単語部分を切シ出
す機能を備えている。文字行列バッファ1で切)出され
た単語に相当する文字行列はかな漢字変換部2に送られ
漢字コードに変換される。また単語以外の部分について
はかな文字変換部3に送られる。かな文字変換部3では
ひらかな文字についてはひらがなのコードにカタカナ文
字についてはカタカナコードに変換する等の処理を行う
。最後にかな漢字混じυ文作成部4では漢字変換部2.
かな文字変換部3の出力を組み合わせて、かな漢字混じ
シ文を作成する。
<Embodiment> FIG. 2 shows an embodiment of this invention, in which character matrix buffer 1
has the function of holding the recognition results sent from the speech recognition unit (or handwritten character recognition unit), which is not shown in this figure, through the input terminal 1ot-, and cutting out the part of the word to be converted into kanji. ing. The character matrix corresponding to the word cut out by the character matrix buffer 1 is sent to the kana-kanji converter 2 and converted into a kanji code. Also, parts other than words are sent to the kana character conversion section 3. The kana character conversion unit 3 performs processing such as converting hirakana characters into hiragana codes and katakana characters into katakana codes. Finally, in the kana-kanji mixed υ sentence creation section 4, the kanji conversion section 2.
The outputs of the kana character conversion section 3 are combined to create a kana-kanji mixed shi sentence.

くかな漢字変換部〉 第3図は第2図中のかな漢字変換部2の構成を詳しく示
したものである。漢字に変換すべき文字行列は照合部2
1へ送られる。単語辞書22には文字系列で表現される
かな見出しとそれに対応する漢字コードが蓄えである。
Kana-Kanji Conversion Section> FIG. 3 shows in detail the configuration of the Kana-Kanji conversion section 2 in FIG. 2. The character matrix to be converted into kanji is sent to the matching unit 2.
Sent to 1. The word dictionary 22 stores kana headings expressed as character sequences and their corresponding kanji codes.

照合部21では文字行列と単語辞書22の照合を行う。The matching unit 21 matches the character matrix with the word dictionary 22.

すなわち文字行列の各文字の候補を1個ずつ選び、これ
らをつないでできる文字系列で単語辞書22を照合し、
一致するかな見出しが存在すると、対応する漢字コード
を漢字候補バッファ23に出力する。照合部21はこの
処理を文字行列から生じうるすべての文字系列について
行う。その結果漢字バッファ23中には複数個の読みに
対応した漢字が葺見られ、しかも各々の読みに対して複
数個の漢字、いわゆる同音異義語が蓄えられる。
That is, select one candidate for each character in the character matrix, and check the word dictionary 22 with the character sequence formed by connecting these candidates.
If a matching kana heading exists, the corresponding kanji code is output to the kanji candidate buffer 23. The matching unit 21 performs this process for all character sequences that can arise from the character matrix. As a result, kanji corresponding to a plurality of readings are found in the kanji buffer 23, and a plurality of kanji, so-called homonyms, are stored for each reading.

制御部24はこれらの多数の漢字の中から表示部25と
キーボード部26を使用して正しい漢字を以下の処理に
よって使用者に選択させる。まず第1段階として正しい
読みを選択させる。この選択の手−法はいくつか考えら
れる。例えば表示部25に異なった読みをすべて表示し
、使用者にキーボード部26から正しい読みに対応する
番号等を打鍵させてもよい。また異なった読みの表示を
指示するキーをもうけておき、使用者が該当するキーを
打鍵する度に異なった読み、もしくはそれに対応する漢
字を次々と表示してゆく方式でも良い。要は第1段階で
正しい読みを選択させることがこの発明の趣旨であシ、
その細かな手法については規定しない。
The control section 24 allows the user to select the correct Kanji from among these many Kanji characters using the display section 25 and the keyboard section 26 through the following process. The first step is to have the child choose the correct reading. There are several possible methods for this selection. For example, all the different pronunciations may be displayed on the display section 25, and the user may be asked to type a number or the like corresponding to the correct pronunciation from the keyboard section 26. It is also possible to provide a key for instructing the display of different readings, and each time the user presses the corresponding key, a different reading or the corresponding kanji are displayed one after another. The point is that the purpose of this invention is to have the correct reading selected in the first step.
The detailed method is not stipulated.

上述の処理によって正しい読みが選択されると、次に正
しい漢字を選択させる。この場合も選択の手法はいくつ
か考えられる。例えば表示部25に該当する読みに対応
する漢字をすべて表示し、使用者にキーボード部26か
ら正しい漢字に対応する番号等を打鍵させても良い。も
しくは同音異義語の表示を指示するキーをもうけておき
、使用者が該当するキーを打鍵する度に同音異義語を次
々と表示してゆく方式でも良い。要は第2段階で同音異
義語中から正しい漢字を選択することがこの発明の趣旨
であシ、その細かな手法については規定しない。
Once the correct reading is selected through the above process, the next correct kanji is selected. In this case as well, several selection methods can be considered. For example, all the kanji characters corresponding to the corresponding pronunciation may be displayed on the display section 25, and the user may be asked to type a number or the like corresponding to the correct kanji character from the keyboard section 26. Alternatively, a key for instructing the display of homophones may be provided, and the homonyms may be displayed one after another each time the user presses the corresponding key. The point is that the purpose of this invention is to select the correct kanji from the homophones in the second step, and the detailed method is not specified.

以上の処理によって正しい漢字が選択されると該当する
漢字のコードはかな漢字混じシ文作成部4へ送られる。
When the correct kanji is selected through the above processing, the code of the corresponding kanji is sent to the kana-kanji mixed script creation section 4.

以上のような2段がまえの選択手法をとることにより多
数の漢字候補中から正しい漢字を効率良く選択すること
が可能になる。例えば文字行列に対応した異なった読み
がM種類、各読みに対応する漢字がN個存在するとした
場合、キーボードの打鍵によシ次々と漢字候補を画面に
表示する方式によれば最大MXN回のキー操作が必要で
あるが、との発明の方式では最大M+N回のキー操作で
正しい漢字が選択できるた検高速の日本語変換が可能で
ある。
By adopting the above-mentioned two-stage selection method, it becomes possible to efficiently select the correct kanji from among a large number of kanji candidates. For example, if there are M types of different pronunciations corresponding to a character matrix and N kanji corresponding to each pronunciation, a method of displaying kanji candidates one after another on the screen by keystrokes on the keyboard can be used up to MXN times. Although key operations are required, the method of the invention allows the correct kanji to be selected with a maximum of M+N key operations, making it possible to perform high-speed Japanese conversion.

なお先に述べた例においては入力された文字行列より、
そのと9得る文字系列で単語辞書を照合しているから、
その各文字系列はそれぞれ文字行列の異った読みと対応
しているため、同じ読み罠属する漢字の組が自動的に得
られる。単語辞書の照合前に文字行列よシ得られる各文
字系列を表示して使用者に選択達:′:::讐、その選
択した文字系列によシ単語辞書を照合して同音異義語中
から正しい漢字を選択させてもよい。
In addition, in the example mentioned earlier, from the input character matrix,
Then, we are checking the word dictionary with the character sequence obtained by 9, so
Since each character series corresponds to a different reading of the character matrix, sets of kanji that belong to the same reading trap are automatically obtained. Before checking the word dictionary, each character sequence obtained is displayed in a character matrix and the user selects it.The selected character sequence is checked against the word dictionary and selected from among the homophones. You may also choose the correct kanji.

〈効果〉 以上説明したようにこの発明は音声認識技術。<effect> As explained above, this invention is a voice recognition technology.

文字認識技術を利用した日本語入力装置における日本語
変換方式を提供するもので、その要旨は認識結果の文字
行列と単語辞書との照合によシ得られる多数の漢字候補
から正しい漢字を選択する際、多数の漢字候補を同じ読
みととKtとめた複数個の組に分け、まず正しい読みを
使用者に選択させ、或は漢字候補を複数組に分けること
なく、正しい読みを選択させ、次に同じ読みに属する同
音異義語中から正しい漢字を選択させるという2段力五
まえの選択手段を用いることにある。その結果、多数の
漢字候補から正しい漢字を短時間に、力1つ使用者に対
する負担を少なくして選択することを可能にし、ひいて
は操作性に優れた日本語入力装置を実現できる利点があ
る。
It provides a Japanese conversion method for Japanese input devices using character recognition technology, and its gist is to select the correct kanji from a large number of kanji candidates obtained by comparing the character matrix of the recognition result with a word dictionary. In this case, a large number of kanji candidates are divided into multiple groups with the same reading and Kt, and the user first selects the correct reading, or the user selects the correct reading without dividing the kanji candidates into multiple groups, and then The purpose of this method is to use a second-level selection method in which the correct kanji is selected from homophones belonging to the same reading. As a result, it is possible to select the correct kanji from a large number of kanji candidates in a short time with less burden on the user, and this has the advantage of realizing a Japanese input device with excellent operability.

【図面の簡単な説明】[Brief explanation of drawings]

第1図は文字行列の例を示す図、WJ2図はこの発明の
一実施例を示す構成図、第3図は第2図中2のかな漢字
変換部2についての詳細例を示す構成図である。 l・・・文字行列バッファ、2・・・かな漢字変換部、
3・・・かな文字変換部、4・・・かな漢字混μシ文作
成部、21・・・照合部、22・・・単語辞書、23・
・・漢字候補バッファ、24・・・制御部、25・・・
表示部、26・・・キーボード部。 特許出願人  日本電信電話公社 代理人 単針  卓
Fig. 1 is a diagram showing an example of a character matrix, Fig. WJ2 is a block diagram showing an embodiment of the present invention, and Fig. 3 is a block diagram showing a detailed example of the kana-kanji conversion unit 2 shown in Fig. 2. . l...Character matrix buffer, 2...Kana-Kanji conversion unit,
3... Kana character conversion section, 4... Kana-kanji mixed μshi sentence creation section, 21... Collation section, 22... Word dictionary, 23.
...Kanji candidate buffer, 24...Control unit, 25...
Display section, 26...keyboard section. Patent Applicant Nippon Telegraph and Telephone Public Corporation Agent Takashi Single Needle

Claims (1)

【特許請求の範囲】[Claims] 文字毎に複数個の候補を持った文字系列(以後文字行列
と呼ぶ)として表現された入力単語と単語辞書とを照合
して前記入力単語を漢字に変換する日本語変換方式にお
いて、操作者に対し前記入力単語がいずれの読みに属す
るかを選択させる操作手段と、その選択された読みに属
する複数個の漢字中から正しい漢字を選択させる操作手
段とによシ多数の漢字中から正しい漢字を選択すること
を特徴とする日本語変換方式。
In the Japanese conversion method, an input word expressed as a character sequence (hereinafter referred to as a character matrix) with multiple candidates for each character is compared with a word dictionary to convert the input word into kanji. On the other hand, an operating means for selecting which reading the input word belongs to, and an operating means for selecting the correct kanji from a plurality of kanji belonging to the selected reading, are used to select the correct kanji from a large number of kanji. Japanese conversion method characterized by selection.
JP57106571A 1982-06-21 1982-06-21 Japanese language converting system Pending JPS58222330A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP57106571A JPS58222330A (en) 1982-06-21 1982-06-21 Japanese language converting system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP57106571A JPS58222330A (en) 1982-06-21 1982-06-21 Japanese language converting system

Publications (1)

Publication Number Publication Date
JPS58222330A true JPS58222330A (en) 1983-12-24

Family

ID=14436931

Family Applications (1)

Application Number Title Priority Date Filing Date
JP57106571A Pending JPS58222330A (en) 1982-06-21 1982-06-21 Japanese language converting system

Country Status (1)

Country Link
JP (1) JPS58222330A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH01147658A (en) * 1987-12-03 1989-06-09 Fujitsu Ltd Handwritten kana/kanji conversion control system
US5136504A (en) * 1989-03-28 1992-08-04 Canon Kabushiki Kaisha Machine translation system for output of kana/kanji characters corresponding to input character keys

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS5635275A (en) * 1979-08-31 1981-04-07 Fujitsu Ltd Monitor correcting method for optical character reading
JPS5680770A (en) * 1979-12-04 1981-07-02 Dainippon Printing Co Ltd "kanji" (chinese character) input device for print
JPS5729177A (en) * 1980-07-30 1982-02-17 Nippon Telegr & Teleph Corp <Ntt> Japanese voice input system

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS5635275A (en) * 1979-08-31 1981-04-07 Fujitsu Ltd Monitor correcting method for optical character reading
JPS5680770A (en) * 1979-12-04 1981-07-02 Dainippon Printing Co Ltd "kanji" (chinese character) input device for print
JPS5729177A (en) * 1980-07-30 1982-02-17 Nippon Telegr & Teleph Corp <Ntt> Japanese voice input system

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH01147658A (en) * 1987-12-03 1989-06-09 Fujitsu Ltd Handwritten kana/kanji conversion control system
US5136504A (en) * 1989-03-28 1992-08-04 Canon Kabushiki Kaisha Machine translation system for output of kana/kanji characters corresponding to input character keys

Similar Documents

Publication Publication Date Title
JPS62274360A (en) Word processor
JP4263627B2 (en) Communication terminal device, receiving device, and program
JPS58222330A (en) Japanese language converting system
JP3664383B2 (en) Japanese input system, Japanese input method, and recording medium
KR100702483B1 (en) Chinese character input system and its method
KR20020021182A (en) Method and apparatus for inputting Chinese characters using information of tone
JPS6362018B2 (en)
JPS5818730A (en) Character processing device
JPS59121425A (en) Chinese phonetic alphabet of kanji converter
JP2002333942A (en) Keyboard for &#39;roman character input&#39; exclusive terminal equipment for computer
JPS63316162A (en) Document preparing device
JPS62209667A (en) Sentence producing device
JPH09153044A (en) Character processor and device for kana and english letter conversion
JPS5998236A (en) Input device of japanese sentence
JP2004199356A (en) Alphabet input system with character prediction using keyboard and electronic equipment having same system
JPS59153232A (en) Character converter
JPS6246030B2 (en)
JPS6298456A (en) Japanese language input device
JPS5887618A (en) Kana (japanese syllabary)-kanji(chinese character) conversion processing system
JPS61128364A (en) Retrieving device of dictionary
JPH0217568A (en) Kana-kanji conversion system for clause including katakana word
JPH03225462A (en) Roman character/kanji converter
JPH0384658A (en) Document processor
JPS5952338A (en) Kana (japanese syllabary)-kanji (chinese character) converter
JPH02255936A (en) Document preparing and editing device