JP2806452B2

JP2806452B2 - Kana-kanji conversion device and method, and recording medium

Info

Publication number: JP2806452B2
Application number: JP9108420A
Authority: JP
Inventors: 秀行岸塲
Original assignee: Omron Corp
Current assignee: Omron Corp
Priority date: 1996-12-19
Filing date: 1997-04-25
Publication date: 1998-09-30
Anticipated expiration: 2017-04-25
Also published as: JPH10232863A

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、かな漢字変換装置
および方法、並びに記録媒体に関し、特に、日本語と外
国語とが混在する文章を、迅速かつ正確に、作成するこ
とができるようにした、かな漢字変換装置および方法、
並びに記録媒体に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a kana-kanji conversion apparatus and method, and a recording medium, and more particularly to a method for quickly and accurately creating a sentence in which Japanese and foreign languages are mixed. Kana-kanji conversion apparatus and method,
And a recording medium.

【０００２】[0002]

【従来の技術】最近、インターネットを代表とするネッ
トワークの普及にともない、電子メール、ネットニュー
スなどが活発に利用されるようになってきた。このよう
な電子メール、ネットニュースなどを活用するには、コ
ンピュータ（ワードプロセッサ）を利用して、文章を作
成する必要がある。そして、その文章には、日本語だけ
でなく、英単語のアルファベット文字やカタカナ文字を
混在させる必要がある場合が増加している。2. Description of the Related Art Recently, with the spread of networks represented by the Internet, electronic mail, net news, and the like have been actively used. In order to utilize such e-mail, net news, etc., it is necessary to create sentences using a computer (word processor). The sentence is increasingly required to include not only Japanese characters but also alphabetic characters and katakana characters of English words.

【０００３】このようなコンピュータ上における従来の
かな漢字変換システムの連文節解析アルゴリズムでは、
英単語のアルファベット文字列を含む日本語の文章を作
成するには、アルファベット文字を入力する時点で、入
力モードを、日本語（ローマ字）入力モードから英数固
定入力モードに切り替え、アルファベット文字を入力し
た後、入力モードを再び元に戻すという処理を行うか、
あるいはまた、アルファベット文字だけを後変換という
別の変換により入力するようにしていた。[0003] In such a continuous kana-kanji conversion algorithm of a conventional kana-kanji conversion system on a computer,
To create a Japanese sentence that includes an alphabetic string of English words, switch the input mode from Japanese (Roman) input mode to fixed alphanumeric input mode and enter alphabetic characters at the time of entering alphabetic characters After that, perform the process of returning the input mode again,
Alternatively, only alphabetic characters are input by another conversion called post-conversion.

【０００４】[0004]

【発明が解決しようとする課題】しかしながら、このよ
うな従来のかな漢字変換装置においては、入力モードを
切り替えるか、または後変換を行う必要があるため、自
分がこれからアルファベット文字を入力するのか、日本
語を入力するのかを常に意識する必要があるばかりでな
く、現在の入力モードが、いま入力している文字に対応
するモードとなっているか否かを常に意識しておく必要
があった。さらにまた、入力モードの切換や後変換によ
り、本来の文章作成のために必要な操作以外のオーバヘ
ッドの操作が必要であり、入力ミスや、それにともなう
修正、文章作成中の思考の中断などが発生し易く、結
局、文章を作成するのに時間がかかる課題があった。However, in such a conventional kana-kanji conversion device, it is necessary to switch the input mode or to perform post-conversion, so that it is necessary to input the alphabet characters in the future. It is necessary not only to always be conscious of whether or not to input a character, but also to always be conscious whether or not the current input mode is a mode corresponding to the character that is being input. Furthermore, input mode switching and post-conversion require overhead operations other than those required for original sentence creation, resulting in erroneous input, correction due to it, interruption of thinking during sentence creation, etc. There is a problem that it is easy to do, and it takes time to create a sentence after all.

【０００５】本発明はこのような状況に鑑みてなされた
ものであり、日本語と英語が混在する文章を、迅速かつ
正確に、作成することができるようにするものである。The present invention has been made in view of such a situation, and aims to quickly and accurately create a sentence in which Japanese and English are mixed.

【０００６】[0006]

【課題を解決するための手段】請求項１に記載のかな漢
字変換装置は、変換する文字列を受け取る受け取り手段
と、受け取り手段により受け取られた文字列を、日本語
として連文節解析する第１の解析手段と、受け取り手段
により受け取られた文字列から外国語に対応する文字列
を抽出する抽出手段と、受け取り手段により受け取られ
た文字列のうち、抽出手段により外国語として抽出され
なかった部分を、日本語として連文節解析する第２の解
析手段と、第１の解析手段の解析結果と第２の解析手段
の解析結果を、端部に向かって解析を進め、その文節の
区切りが最初に一致する位置を区切り位置として検出す
る検出手段と、第１の解析手段の解析結果の、検出手段
により検出された区切り位置までの未決定の文字列と、
第２の解析手段の解析結果の、検出手段により検出され
た区切り位置までの未決定の文字列を、所定の選択条件
に基づいて比較し、その比較結果に対応して、検出手段
により検出された区切り位置までの未決定の文字列の解
析結果として、第１の解析手段の解析結果と第２の解析
手段の解析結果の一方を選択する選択手段とを備えるこ
とを特徴とする。 [MEANS FOR SOLVING THE PROBLEMS] The kana kan according to claim 1
The character conversion device receives a character string to be converted.
And the character string received by the receiving means
First analyzing means for analyzing a continuous phrase as a sentence, and receiving means
A string corresponding to a foreign language from the string received by
Extracted by the extracting means and the receiving means
Extracted as a foreign language by the extraction means
The second solution which analyzes the missing part in Japanese
Analysis means, analysis results of the first analysis means, and second analysis means
The analysis result of
Detect the first position where a break matches as a break position
Detecting means, and detecting means for detecting the analysis result of the first analyzing means.
Character string up to the break position detected by
The detection result of the analysis result of the second analysis means is detected by the detection means.
The undetermined character string up to the delimiter position is
And based on the comparison result, the detecting means
Of the undetermined string up to the break position detected by
As an analysis result, the analysis result of the first analysis means and the second analysis
Selection means for selecting one of the analysis results of the means.
And features.

【０００７】請求項５に記載のかな漢字変換方法は、変
換する文字列を受け取る受け取りステップと、受け取り
ステップで受け取られた文字列を、日本語として連文節
解析する第１の解析ステップと、受け取りステップで受
け取られた文字列から外国語に対応する文字列を抽出す
る抽出ステップと、受け取りステップで受け取られた文
字列のうち、抽出ステップで外国語として抽出されなか
った部分を、日本語として連文節解析する第２の解析ス
テップと、第１の解析ステップの解析結果と第２の解析
ステップの解析結果を、端部に向かって解析を進め、そ
の文節の区切りが最初に一致する位置を区切り位置とし
て検出する検出ステップと、第１の解析ステップでの解
析結果の、検出ステップで検出された区切り位置までの
未決定の文字列と、第２の解析ステップでの解析結果
の、検出ステップで検出された区切り位置までの未決定
の文字列を、所定の選択条件に基づいて比較し、その比
較結果に対応して、検出ステップで検出された区切り位
置までの未決定の文字列の解析結果として、第１の解析
ステップでの解析結果と第２の解析ステップでの解析結
果の一方を選択する選択ステップとを含むことを特徴と
する。[0007] Kana-Kanji conversion method according to claim 5, varying
Receiving a character string to be replaced, and receiving
The character string received in the step is a continuous phrase as Japanese
A first analyzing step for analyzing and a receiving step
Extract foreign language strings from the extracted strings
Extraction step and the statement received in the receiving step
Whether the string is not extracted as a foreign language in the extraction step
The second analysis method that analyzes the phrase
Step, analysis result of first analysis step and second analysis
Analyze the analysis result of the step toward the end, and
The position where the segment break of the first
And the solution in the first analysis step
Analysis results up to the break position detected in the detection step.
Undetermined character string and analysis result in the second analysis step
Not determined up to the break position detected in the detection step
Are compared based on predetermined selection conditions.
The break position detected in the detection step according to the comparison result
As the result of analyzing the undetermined character string up to
Analysis result in the step and the analysis result in the second analysis step
Selecting one of the fruits .

【０００８】請求項９に記載のコンピュータ読み取り可
能な記録媒体は、入力された文字列を変換するかな漢字
変換装置に、変換する文字列を受け取る受け取りステッ
プと、受け取りステップで受け取られた文字列を、日本
語として連文節解析する第１の解析ステップと、受け取
りステップで受け取られた文字列から外国語に対応する
文字列を抽出する抽出ステップと、受け取りステップで
受け取られた文字列のうち、抽出ステップで外国語とし
て抽出されなかった部分を、日本語として連文節解析す
る第２の解析ステップと、第１の解析ステップの解析結
果と第２の解析ステップの解析結果を、端部に向かって
解析を進め、その文節の区切りが最初に一致する位置を
区切り位置として検出する検出ステップと、第１の解析
ステップでの解析結果の、検出ステップで検出された区
切り位置までの未決定の文字列と、第２の解析ステップ
での解析結果の、検出ステップで検出された区切り位置
までの未決定の文字列を、所定の選択条件に基づいて比
較し、その比較結果に対応して、検出ステップで検出さ
れた区切り位置までの未決定の文字列の解析結果とし
て、第１の解析ステップでの解析結果と第２の解析ステ
ップでの解析結果の一方を選択する選択ステップを実行
させるためのプログラムを記録したことを特徴とする。[0008] The computer readable of claim 9
Kana-Kanji that can convert the input character string
A receiving step for receiving the character string to be converted is provided to the converting device.
And the character string received in the receiving step
First parsing step for parsing clauses as words, receiving
Corresponding to a foreign language from the string received in the step
In the extraction step to extract the character string and the receiving step
Of the received strings, in the extraction step as a foreign language
Segment not analyzed by Japanese
A second analysis step and an analysis result of the first analysis step.
The results and the results of the second analysis step towards the end
Proceed with parsing and find the position where the segment break first matches.
A detecting step of detecting as a break position, and a first analysis
Of the analysis result in the step detected in the detection step
Undetermined character string up to the cutting position and second analysis step
Position detected in the detection step of the analysis result in
Undetermined character strings up to
In the detection step according to the comparison result.
Of the undetermined character string up to the
The analysis result of the first analysis step and the second analysis step
Performs a selection step to select one of the analysis results in the step
A program for causing the program to be recorded .

【０００９】選択は、文節数の少ない方、直前に使用し
た候補が含まれている方、または日本語である方を、そ
の優先順位で、選択条件として、行うことができる。 The selection should be made in the direction with the fewest phrases,
If you have a candidate that includes
, And can be performed as a selection condition.

【００１０】さらに、２つの解析結果の一部を次候補と
したり、外国語に対応する文字列の大文字と小文字、全
角文字と半角文字などの差異を、候補毎、または変換環
境毎に学習するようにすることができる。[0010] Further, a part of the two analysis results is regarded as a next candidate.
Or the uppercase and lowercase letters in the string corresponding to the foreign language,
Differences between square characters and half-width characters can be learned for each candidate or each conversion environment.

【００１１】請求項１に記載のかな漢字変換装置、請求
項５に記載のかな漢字変換方法、および請求項９に記載
のコンピュータ読み取り可能な記録媒体においては、受
け取られた文字列が日本語として連文節解析されるだけ
でなく、受け取られた文字列から外国語に対応する文字
列が抽出され、外国語に対応する文字列として抽出され
なかった部分がさらに日本語に連文節解析される。そし
て、日本語としての解析結果と、外国語に対応する文字
列を含む解析結果の一方が、選択条件に基づいて選択さ
れる。従って、迅速かつ正確に、日本語と外国語が混在
する文章を作成することが可能となる。[0011] In the kana-kanji conversion apparatus according to the first aspect, the kana-kanji conversion method according to the fifth aspect, and the computer-readable recording medium according to the ninth aspect, the received character string is subjected to continuous phrase analysis as Japanese. In addition, the character string corresponding to the foreign language is extracted from the received character string, and the portion not extracted as the character string corresponding to the foreign language is further subjected to continuous phrase analysis in Japanese. Then, one of the analysis result in Japanese and the analysis result including the character string corresponding to the foreign language is selected based on the selection condition. Therefore, it is possible to quickly and accurately create a sentence in which Japanese and foreign languages are mixed.

【００１２】[0012]

【発明の実施の形態】図１は、本発明のかな漢字変換装
置の構成例を示すブロック図である。ＣＰＵ１は、ＲＯ
Ｍ２またはハードディスクドライブ（ＨＤＤ）５に記憶
されているプログラムに従って、かな漢字変換処理を実
行する。ＲＯＭ２には、プログラムのほか、アルファベ
ット文字、数字、漢字、ひらがな、カタカナなどの各種
の文字のフォントも記憶されている。ＲＡＭ３には、か
な漢字変換処理をする上において必要な各種のデータ
が、適宜記憶される。FIG. 1 is a block diagram showing a configuration example of a kana-kanji conversion device of the present invention. CPU1 is RO
According to the program stored in the M2 or the hard disk drive (HDD) 5, the kana-kanji conversion process is executed. The ROM 2 stores fonts for various characters such as alphabetic characters, numbers, kanji, hiragana, and katakana, in addition to programs. The RAM 3 appropriately stores various data required for performing the kana-kanji conversion process.

【００１３】インタフェース（Ｉ／Ｆ）４には、ＬＣＤ
６が接続されており、このＬＣＤ６には、作成した文章
が表示されるようになされている。キーボード７は、文
字を入力するとき操作される。The interface (I / F) 4 has an LCD
6 is connected, and the LCD 6 displays the created text. The keyboard 7 is operated when inputting characters.

【００１４】図２は、かな漢字変換処理の基本的なプロ
グラムの構成を表している。キーボード７を操作して所
定の文字列を入力すると、この文字列は、インタフェー
ス４を介してＣＰＵ１に入力される。ＣＰＵ１は、この
文字列をユーザ入出力部２１に、変換前文字列情報とし
て供給する。ユーザ入出力部２１は、この変換前文字列
情報を、連文節解析処理部２２と次候補解析処理部２３
に供給する。FIG. 2 shows the configuration of a basic program for kana-kanji conversion processing. When a predetermined character string is input by operating the keyboard 7, the character string is input to the CPU 1 via the interface 4. The CPU 1 supplies this character string to the user input / output unit 21 as character string information before conversion. The user input / output unit 21 converts the pre-conversion character string information into a continuous phrase analysis processing unit 22 and a next candidate analysis processing unit 23.
To supply.

【００１５】連文節解析処理部２２は、入力された文字
列を解析し、辞書検索すべき文字列を抽出し、これを辞
書検索処理部２４に送る。辞書検索処理部２４は、供給
された文字列が、辞書２５に記憶されているか否かを検
索する。そして、検索結果を連文節解析処理部２２に出
力する。連文節解析処理部２２は、例えば、辞書検索処
理部２４から、辞書検索文字列に対応する漢字が供給さ
れてきたとき、それに対応する辞書検索文字列を、その
漢字に置換する処理を行う。The continuous phrase analysis processing section 22 analyzes the input character string, extracts a character string to be searched in the dictionary, and sends the extracted character string to the dictionary search processing section 24. The dictionary search processing unit 24 searches whether or not the supplied character string is stored in the dictionary 25. Then, the search result is output to the continuous phrase analysis processing unit 22. For example, when a kanji corresponding to a dictionary search character string is supplied from the dictionary search processing unit 24, the continuous phrase analysis processing unit 22 performs a process of replacing the corresponding dictionary search character string with the kanji.

【００１６】連文節解析処理部２２は、以上のようにし
て、変換前文字列情報を解析した結果得られた解析結果
情報を、ユーザ入出力部２１に出力する。ユーザ入出力
部２１は、連文節解析処理部２２より供給された解析結
果情報に対応する変換後文字列を出力する。ＣＰＵ１
は、この文字列をインタフェース４を介してＬＣＤ６に
表示させる。The continuous phrase analysis processing section 22 outputs the analysis result information obtained as a result of analyzing the pre-conversion character string information to the user input / output section 21 as described above. The user input / output unit 21 outputs a converted character string corresponding to the analysis result information supplied from the continuous phrase analysis processing unit 22. CPU1
Causes the LCD 6 to display this character string via the interface 4.

【００１７】同様にして、次候補解析処理部２３は、ユ
ーザ入出力部２１より入力された変換前文字列情報から
辞書検索すべき文字列を抽出し、これを辞書検索処理部
２４に出力する。辞書検索処理部２４は、入力された辞
書検索すべき文字列に対応する文字が、辞書２５に登録
されているか否かを検索し、その検索結果を、次候補解
析処理部２３に出力する。次候補解析処理部２３は、辞
書検索処理部２４から次候補として送られてくる複数の
候補をまとめ、これをユーザ入出力部２１に出力する。
ユーザ入出力部２１は、この複数の次候補の文字列を、
変換後文字列としてＣＰＵ１に出力する。ＣＰＵ１は、
この次候補の文字列も、インタフェース４を介してＬＣ
Ｄ６に出力し、表示させる。Similarly, the next candidate analysis processing unit 23 extracts a character string to be searched for a dictionary from the pre-conversion character string information input from the user input / output unit 21 and outputs this to the dictionary search processing unit 24. . The dictionary search processing unit 24 searches whether a character corresponding to the input character string to be searched in the dictionary is registered in the dictionary 25, and outputs the search result to the next candidate analysis processing unit 23. The next candidate analysis processing unit 23 puts together a plurality of candidates sent from the dictionary search processing unit 24 as next candidates, and outputs this to the user input / output unit 21.
The user input / output unit 21 converts the character strings of the plurality of next candidates into
The converted character string is output to the CPU 1. CPU1
The character string of the next candidate is also transmitted to the LC via the interface 4.
Output to D6 for display.

【００１８】以上のようにして、ＬＣＤ６には、キーボ
ード７より入力したかな（またはローマ字）が、適宜に
漢字に変換された文字列が、次候補の文字列とともに表
示される。使用者は、次候補の中から、所定の候補の文
字列を選択し、意図する文章を作成する。As described above, on the LCD 6, a character string obtained by appropriately converting kana (or Roman characters) inputted from the keyboard 7 into kanji is displayed together with the character string of the next candidate. The user selects a character string of a predetermined candidate from the next candidates and creates an intended sentence.

【００１９】本発明のかな漢字変換装置は、日本語中に
英語が混在する場合にも、入力モードを変更することな
く、日本語を連続して入力する場合と同様の操作で、文
章を作成することができるようになされている。このた
め、本発明のかな漢字変換装置においても、プログラム
は基本的に、図２に示すように構成されるのであるが、
より細かく区分すると、プログラムは、図３に示すよう
に構成されている。According to the kana-kanji conversion device of the present invention, even when Japanese is mixed with English, a sentence is created by the same operation as in the case of continuously inputting Japanese without changing the input mode. It has been made possible. For this reason, even in the kana-kanji conversion device of the present invention, the program is basically configured as shown in FIG.
More specifically, the program is configured as shown in FIG.

【００２０】入力部４１と出力部４２は、図２における
ユーザ入出力部２１に対応する部分であり、入力部４１
は、キーボード７より入力された文字列を受け取り、各
部に出力する。出力部４２は、各部から供給されてきた
文字列を受け取り、ＬＣＤ６に出力する。The input unit 41 and the output unit 42 correspond to the user input / output unit 21 in FIG.
Receives a character string input from the keyboard 7 and outputs it to each unit. The output unit 42 receives the character string supplied from each unit and outputs the character string to the LCD 6.

【００２１】辞書検索処理部４３と辞書４４は、図２に
おける辞書検索処理部２４と辞書２５に対応しており、
辞書検索処理部４３は、各部から供給されてきた辞書検
索文字列を辞書４４を参照して検索し、検索結果を各部
に返す処理を実行する。The dictionary search processing unit 43 and the dictionary 44 correspond to the dictionary search processing unit 24 and the dictionary 25 in FIG.
The dictionary search processing unit 43 executes a process of searching for a dictionary search character string supplied from each unit with reference to the dictionary 44 and returning a search result to each unit.

【００２２】日英混在連文節解析部４５は、図２におけ
る連文節解析処理部２２に対応している。この日英混在
連文節解析部４５は、日本語連文節解析部４６と英語連
文節解析部４７により構成されている。日本語連文節解
析部４６は、従来のかな漢字変換装置の場合と同様に、
変換前文字列を日本語として連文節解析する処理を行
い、英語連文節解析部４７は、変換前文字列を英語とし
ての（英語読みの）連文節解析を行う。The Japanese-English mixed phrase analysis unit 45 corresponds to the continuous phrase analysis processing unit 22 in FIG. The Japanese-English mixed phrase analysis unit 45 includes a Japanese phrase analysis unit 46 and an English phrase analysis unit 47. As in the case of the conventional kana-kanji conversion device,
The pre-conversion character string is subjected to a continuous phrase analysis process as Japanese, and the English consecutive phrase analysis unit 47 performs a continuous phrase analysis (of English reading) using the pre-conversion character string as English.

【００２３】日英混在全候補解析部４８は、図２におけ
る次候補解析処理部２３に対応しており、この日英混在
全候補解析部４８は、変換前文字列を日本語（かな）に
変換するローマ字かな変換部４９を有している。日本語
次候補処理部５０（次候補解析手段）は、日本語の同音
異義語などを次候補として解析処理する。英単語読み抽
出部５１（抽出手段）は、変換前文字列を英語として解
析するために、英単語を抽出する処理を行う。The mixed Japanese-English all-candidate analysis unit 48 corresponds to the next candidate analysis processing unit 23 in FIG. 2. The mixed Japanese-English all-candidate analysis unit 48 converts the character string before conversion into Japanese (kana). It has a Roman character kana conversion unit 49 for conversion. The next Japanese candidate processing unit 50 (next candidate analysis means) analyzes Japanese homonyms and the like as the next candidate. The English word reading extraction unit 51 (extraction means) performs a process of extracting English words in order to analyze the character string before conversion as English.

【００２４】英単語次候補処理部５２（次候補解析手
段）は、英単語の次候補を解析処理する。この英単語次
候補処理部５２は、日本語を処理する場合における日本
語次候補処理部５０と同様の処理を、英語において行う
ことになる。The next English word candidate processing section 52 (next candidate analysis means) analyzes the next English word candidate. The next English word candidate processing unit 52 performs the same processing as the next Japanese candidate processing unit 50 in processing English in English.

【００２５】同音異義語評価ソート部５３は、日本語次
候補処理部５０と英単語次候補処理部５２で解析処理さ
れた結果得られた次候補を、所定の基準で評価して、ソ
ート処理を行う。The homonym synonym evaluation sort unit 53 evaluates the next candidate obtained as a result of the analysis processing by the Japanese next candidate processing unit 50 and the English word next candidate processing unit 52 according to a predetermined criterion. I do.

【００２６】日本語候補学習結果反映部５４（学習手
段）と英単語候補学習結果反映部５５（学習手段）は、
それぞれ、日本語または英単語の次候補の中から所定の
単語が選択された場合、その結果を学習し、後の次候補
の解析に反映させる処理を行う。例えば、英単語候補学
習結果反映部５５は、表記の異なる候補（全角／半角、
大文字／小文字、アルファベット／カタカナ）を、直前
の学習情報（候補毎の学習情報、変換環境毎の学習情
報）に応じて切り替える処理を行う。The Japanese candidate learning result reflecting unit 54 (learning means) and the English word candidate learning result reflecting unit 55 (learning means)
When a predetermined word is selected from the next candidates of Japanese or English words, a process of learning the result and reflecting the result in the analysis of the next candidate is performed. For example, the English word candidate learning result reflecting unit 55 outputs the candidates having different notations (full-width / half-width,
A process of switching between uppercase / lowercase and alphabet / Katakana according to the immediately preceding learning information (learning information for each candidate, learning information for each conversion environment) is performed.

【００２７】次に、図４のフローチャートを参照して、
所定の文字列がアルファベット文字で入力された場合
に、これをかな漢字変換する、具体的な処理例について
説明する。Next, referring to the flowchart of FIG.
A description will be given of a specific processing example in which, when a predetermined character string is input in alphabetical characters, the character string is converted into kana-kanji characters.

【００２８】使用者が、キーボード７を操作して、例え
ば、「systemikansitekudasai」の文字列を入力したと
する。この文字列には、日本語（ローマ字）と英語が混
在している。ステップＳ１では、この変換前情報を受け
取る処理が実行される。この処理は、入力部４１（受け
取り手段）で行われる。入力部４１で受け取られた変換
前文字列（キー入力情報）は、ステップＳ２において、
日本語読み変換処理が行われる。すなわち、入力部４１
で受け取られた変換前文字列は、日本語連文節解析部４
６（第１の解析手段）に送られ、日本語として連文節解
析される。これにより、例えば、「sysてみかんしてく
ださい」の文字列が得られる。そして、さらにステップ
Ｓ３において、日本語連文節解析部４６は、ひらがな部
分を漢字に変換し、「sys手未完してください」の文字
列を得る。It is assumed that the user operates the keyboard 7 to input, for example, a character string "systemikansitekudasai". This character string is a mixture of Japanese (Roman) and English. In step S1, a process of receiving the pre-conversion information is executed. This processing is performed by the input unit 41 (receiving means). The pre-conversion character string (key input information) received by the input unit 41 is
Japanese reading conversion processing is performed. That is, the input unit 41
The pre-conversion character string received by
6 (first analysis means) and is subjected to continuous phrase analysis as Japanese. As a result, for example, a character string of “Please try sys” is obtained. Then, further in step S3, the Japanese consecutive phrase analysis unit 46 converts the hiragana portion into kanji, and obtains a character string "sys hand not complete".

【００２９】一方、入力部４１は、受け取った変換前文
字列「systemikansitekudasai」を、英単語読み抽出部
５１（第２の解析手段）にも出力する。英単語読み抽出
部５１は、ステップＳ４において、入力された文字列か
ら英単語を抽出する処理を行う。この英単語読み抽出処
理の詳細は、図５のフローチャートに示されている。On the other hand, the input unit 41 also outputs the received pre-conversion character string "systemikansitekudasai" to the English word reading and extracting unit 51 (second analysis means). In step S4, the English word reading and extracting unit 51 performs a process of extracting an English word from the input character string. Details of the English word reading extraction processing are shown in the flowchart of FIG.

【００３０】最初にステップＳ２１において、入力され
た変換前文字列を英語辞書で検索する処理が行われる。
すなわち、英単語読み抽出部５１は、入力された変換前
文字列から辞書検索すべき文字列を抽出し、辞書検索処
理部４３に送り、辞書４４のうち、英語の辞書を参照し
て、英単語としての検索を実行させる。辞書検索処理部
４３は、検索結果を英単語読み抽出部５１に出力する。First, in step S21, a process of searching the input pre-conversion character string in the English dictionary is performed.
That is, the English word reading extraction unit 51 extracts a character string to be dictionary-searched from the input pre-conversion character string, sends the character string to the dictionary search processing unit 43, and refers to the English dictionary among the dictionaries 44, and Perform a search as a word. The dictionary search processing unit 43 outputs the search result to the English word reading and extracting unit 51.

【００３１】英単語読み抽出部５１は、ステップＳ２２
において、検索された英単語があるか否かを判定する。
いまの場合、例えば、「system」と「site」が検索され
る。このときステップＳ２３に進み、英単語読み抽出部
５１は、入力された変換前文字列のうち、検索された文
字列に対応する部分を英単語として置き換える処理を実
行する。いまの場合、「system」と「site」が英単語と
して処理される。The English word reading and extracting unit 51 determines in step S22
In, it is determined whether or not there is a searched English word.
In this case, for example, “system” and “site” are searched. At this time, the process proceeds to step S23, and the English word reading extraction unit 51 executes a process of replacing a portion corresponding to the searched character string in the input pre-conversion character string as an English word. In this case, "system" and "site" are processed as English words.

【００３２】次に、ステップＳ２４に進み、英単語とし
て検索されたなかった単語をかなに変換する処理が行わ
れる。すなわち、英単語読み抽出部５１は、ステップＳ
２３で英単語として検索されなかった文字列「ikan」と
「kudasai」をひらがなに変換させる。これにより、結
局、変換前文字列は、「systemいかんsiteください」に
変換されることになる。Next, the process proceeds to step S24, in which a word which has not been searched for as an English word is converted into kana. That is, the English word reading extraction unit 51 determines in step S
The character strings “ikan” and “kudasai” that were not searched for as English words in 23 are converted to hiragana. As a result, the pre-conversion character string is eventually converted into “system ikan site please”.

【００３３】ステップＳ２２において検索された英単語
が存在しないと判定された場合には、ステップＳ２３と
Ｓ２４の処理はスキップされる。If it is determined in step S22 that the searched English word does not exist, steps S23 and S24 are skipped.

【００３４】図４に戻って、以上のようにして、英単語
読み抽出処理の結果、「systemいかんsiteください」の
文字列が得られたとすると、この文字列が、英単語読み
抽出部５１から、英語連文節解析部４７（第２の解析手
段）に送られる。英語連文節解析部４７においては、入
力された文字列のうち、英語の単語の部分以外のひらが
なの部分を、辞書検索処理部４３に送り、漢字検索し
て、適宜漢字に変換する処理を行う。これにより、例え
ば、「system移管siteください」が得られる。Returning to FIG. 4, assuming that a character string “system ikan site please” is obtained as a result of the English word reading extraction processing as described above, this character string is Is sent to the English consecutive phrase analysis unit 47 (second analysis means). The English continuous phrase analysis unit 47 sends a hiragana portion other than the English word portion of the input character string to the dictionary search processing unit 43, performs a kanji search, and performs a process of appropriately converting it to kanji. Thereby, for example, "Please transfer system site" is obtained.

【００３５】以上のようにして、ステップＳ３またはス
テップＳ５において、日本語読みとしての連文節解析
と、英語読みとしての連文節解析が得られたら、ステッ
プＳ６以降に進み、２つの連文節解析の結果のうち、い
ずれか一方を選択する処理が行われる。As described above, in step S3 or step S5, if the continuous phrase analysis as a Japanese reading and the continuous phrase analysis as an English reading are obtained, the process proceeds to step S6 and the subsequent steps. , Is performed.

【００３６】ステップＳ６においては、日英混在連文節
解析部４５で、読みの先頭までの評価が終了したか否か
が判定される。いまの場合、まだ、読みの先頭までの評
価が終了していないので、ＮＯの判定が行われ、ステッ
プＳ７（検出手段）に進む。ステップＳ７においては、
日英混在連文節解析部４５は、ステップＳ３で得られた
日本語読みの連文節解析結果と、ステップＳ５で得られ
た英語読みの連文節解析結果の文節の区切りが一致する
ところまで解析結果を進める。この例の場合、「siteku
dasai」までが、いずれの文節においても一致するの
で、この文節が、最初に抽出される。In step S6, the Japanese / English mixed phrase analyzer 45 determines whether the evaluation up to the beginning of the reading has been completed. In this case, since the evaluation up to the beginning of the reading has not been completed, a NO determination is made, and the process proceeds to step S7 (detection means) . In step S7,
The Japanese-English mixed-phrase analysis unit 45 advances the analysis result until the result of the Japanese-language consecutive- clause analysis result obtained in step S3 matches the phrase segmentation of the English-read continuous- phrase analysis result obtained in step S5. In this example, "siteku
Since "dasai" matches in any of the clauses, this clause is extracted first.

【００３７】次に、ステップＳ８（選択手段）に進み、
２つの連文節解析結果のうち、いずれを優先して選択す
るのかが判定される。この判定は、次の優先順位に従っ
て行われる。（１）キー入力情報が一致する文節区切り位置での文
節数の少ない方を優先する。（２）文節数が同じ場合には、直前に使用した（確定
した）候補が含まれている方を優先する。（３）上記した（１）と（２）が同じ場合には、「日
本語読み」の方を優先する。Next, the process proceeds to step S8 (selection means).
It is determined which of the two consecutive phrase analysis results is to be selected with priority. This determination is made according to the following priorities. (1) Priority is given to the phrase having the smaller number of phrases at the phrase break position where the key input information matches. (2) If the number of clauses is the same, the one that contains the candidate used (fixed) immediately before is prioritized. (3) When the above (1) and (2) are the same, priority is given to "Japanese reading".

【００３８】この基準に従って、優先順位を判定する
と、いまの場合、上記した（３）の条件が適用され、
「日本語読み」の文節が選択される。その結果、「site
ください」ではなく、「してください」が選択される。When the priority order is determined according to this criterion, in this case, the condition (3) is applied,
The phrase "Japanese reading" is selected. As a result, "site
"Please" is selected instead of "Please."

【００３９】次に、ステップＳ９に進み、日英混在連文
節解析部４５は、ステップＳ８で選択した解析結果を記
憶する処理を行う。いまの場合、「してください」が記
憶される。Next, proceeding to step S9, the Japanese-English mixed-phrase analysis unit 45 performs a process of storing the analysis result selected in step S8. In this case, "please" is stored.

【００４０】次に、ステップＳ１０に進み、日本語読み
の連文節解析結果と英語読みの連文節解析結果の両方
を、それぞれ、次の解析処理に進める。そして、ステッ
プＳ６に戻り、読みの先頭まで評価が終了したか否かが
判定される。いまの場合、まだ、読みの先頭まで評価が
終了していないので、再び、ＮＯの判定が行われ、ステ
ップＳ７に進む。ステップＳ７においては、「systemik
an」の文節の解析が行われることになる。いまの場合、
日本語読みの連文節解析の結果は、「sys手未完」とな
り、英語読みの連文節解析の結果は、「system移管」と
なる。Next, the process proceeds to step S10, where both the results of the continuous phrase analysis of the Japanese reading and the results of the continuous phrase analysis of the English reading are respectively advanced to the next analysis processing. Then, the process returns to step S6, and it is determined whether the evaluation has been completed up to the beginning of the reading. In this case, since the evaluation has not yet been completed up to the beginning of the reading, a NO determination is made again, and the process proceeds to step S7. In step S7, "systemik
The phrase "an" will be analyzed. In this case,
The result of the continuous phrase analysis of Japanese reading is "sys hand incomplete", and the result of continuous phrase analysis of English reading is "system transfer".

【００４１】ステップＳ８においては、この２つの解析
結果のうち、いずれを優先するかが判定される。いまの
場合、上記した（１）の文節数の少ない方を優先すると
いう基準に従って、「system移管」が選択される。In step S8, it is determined which of the two analysis results has priority. In this case, “system transfer” is selected in accordance with the criterion (1) that gives priority to the one with a smaller number of clauses.

【００４２】次に、ステップＳ９に進み、ステップＳ８
で選択された解析結果が、前回の処理で解析された結果
と合わせて記憶される。これにより、「system移管して
ください」の解析結果が記憶されたことになる。Next, the process proceeds to step S9, and the process proceeds to step S8.
The analysis result selected in is stored together with the result analyzed in the previous processing. As a result, the analysis result of "Please transfer system" is stored.

【００４３】次に、ステップＳ１０からステップＳ６に
戻り、ステップＳ６において、読みの先頭まで評価が終
了したと判定されるので、ここでＹＥＳの判定が行わ
れ、ステップＳ１１に進む。ステップＳ１１において
は、ステップＳ９で記憶された解析結果を出力部４２に
返す処理が行われる。いまの場合、「system移管してく
ださい」の変換後文字列が、出力部４２に出力されるこ
とになる。出力部４２は、この文字列をＬＣＤ６に出力
し、表示させる。Next, the process returns from step S10 to step S6, where it is determined in step S6 that the evaluation has been completed up to the beginning of the reading, so a YES determination is made here and the process proceeds to step S11. In step S11, a process of returning the analysis result stored in step S9 to the output unit 42 is performed. In this case, the converted character string of "Please transfer system" is output to the output unit 42. The output unit 42 outputs this character string to the LCD 6 and displays it.

【００４４】なお、本発明のかな漢字変換装置において
は、変換前文字列を強制的に日本語に変換（ローマ字か
な変換）した読み（日本語読み）での候補と、英単語と
して有効な部分を抽出し、残りを日本語に変換した読み
（英語読み）での候補が、同時に、同音異義語として処
理される。この処理について、図６のフローチャートを
参照して説明する。In the kana-kanji conversion apparatus according to the present invention, a candidate for reading (Japanese reading) in which a character string before conversion is forcibly converted to Japanese (Roman-Kana conversion) and a portion effective as an English word are included. Candidates in the extracted (English reading) extracted and converted into Japanese are processed as homonyms at the same time. This processing will be described with reference to the flowchart in FIG.

【００４５】例えば、ステップＳ４１において、変換前
情報（キー入力情報）として、「datega」が入力部４１
で受け取られたものとする。このとき、日本語連文節解
析部４６において、ステップＳ４２で、文字列「dateg
a」が、強制的に日本語に変換される。その結果、「だ
てが」が得られる。For example, in step S 41, “datega” is input to the input unit 41 as the pre-conversion information (key input information).
Shall be received at At this time, in the Japanese consecutive phrase analyzing unit 46, in step S42, the character string "dateg
"a" is forcibly converted to Japanese. As a result, a "date" is obtained.

【００４６】次に、ステップＳ４３に進み、日本語次候
補処理部５０において、次候補解析処理が行われる。こ
れにより、例えば、「伊達が」、「立てが」、または
「建てが」が次候補として得られる。Next, the process proceeds to step S43, where the next Japanese candidate processing unit 50 performs a next candidate analysis process. Thereby, for example, “Date”, “Tatega”, or “Building” is obtained as the next candidate.

【００４７】一方、ステップＳ４１で得られた変換前情
報「datega」は、ステップＳ４４において、英単語読み
抽出部５１に送られ、英単語読み抽出処理が行われる。
これにより、「date」の部分が英単語として認識され
る。その結果、ステップＳ４５において、英単語以外の
部分を英語連文節解析部４７で日本語に変換する処理が
行われて、「dateが」が得られる。On the other hand, the pre-conversion information "datega" obtained in step S41 is sent to the English word reading extraction unit 51 in step S44, and the English word reading extraction processing is performed.
As a result, the "date" portion is recognized as an English word. As a result, in step S45, a process other than the English word is converted into Japanese by the English consecutive clause analysis unit 47, and "date" is obtained.

【００４８】さらに、ステップＳ４６において、英単語
次候補処理部５２が、ステップＳ４５で得られた同音異
義語として、「dateが」、「ｄａｔｅが」、または「デ
ートが」を得る。[0048] Further, in step S46, the English word next candidate processing unit 52, a homonym obtained in step S45, "date is", "dat e is" or get "dating".

【００４９】この発明の実施の形態においては、図７に
示すように、小文字半角アルファベット文字、先頭だけ
が大文字の半角のアルファベット文字、すべてが大文字
の半角のアルファベット文字、すべてが小文字の全角の
アルファベット文字、先頭だけが大文字の全角アルファ
ベット文字、すべてが大文字の全角アルファベット文
字、全角カタカナ文字、または半角カタカナ文字のいず
れもが、同音異義語として、次候補として処理される。In the embodiment of the present invention, as shown in FIG. 7, lowercase half-width alphabet letters, only the uppercase half-width alphabet letters, all uppercase half-width alphabet letters, and all lowercase half-width alphabet letters, Any of the characters, full-width alphabetic characters in which only the first letter is capitalized, and all-uppercase full-width alphabet characters, full-width katakana characters, or half-width katakana characters are processed as homonyms as the next candidate.

【００５０】ステップＳ４３または、ステップＳ４６に
おいて、日本語読みまたは英語読みの次候補解析結果が
得られた場合には、これらの次候補は、同音異義語評価
ソート部５３に送られ、ステップＳ４７において、所定
の基準により（後述する日本語候補学習結果反映部５４
と英単語候補学習結果反映部５５の学習結果に基づき）
評価され、その評価結果に対応して、ソート処理が行わ
れる。ステップＳ４８においては、以上のようにして得
られたソート結果が、出力部４２に出力され、そこから
ＬＣＤ６に出力、表示される。If the next candidate analysis result of Japanese reading or English reading is obtained in step S43 or S46, these next candidates are sent to the homonymous word evaluation sort unit 53, and in step S47. , According to a predetermined criterion (the Japanese candidate learning result reflecting unit 54 described later)
And the learning result of the English word candidate learning result reflecting unit 55)
The evaluation is performed, and a sorting process is performed according to the evaluation result. In step S48, the sorting result obtained as described above is output to the output unit 42, from which it is output and displayed on the LCD 6.

【００５１】以上のようにして、次候補が複数得られた
場合には、その中から使用者が、所望のものを選択する
ことになる。この選択処理を行った結果は、日本語候補
学習結果反映部５４と、英単語候補学習結果反映部５５
で、それぞれ反映される。例えば、表記の異なる英単語
候補（全角／半角、大文字／小文字、アルファベット／
カタカナ）は、直前の学習情報に応じて、最適な表記が
選択される。As described above, when a plurality of next candidates are obtained, the user selects a desired one from among them. The result of performing this selection processing is a Japanese candidate learning result reflecting unit 54 and an English word candidate learning result reflecting unit 55
In each, it is reflected. For example, English word candidates with different notations (full-width / half-width, uppercase / lowercase, alphabet /
For katakana, an optimal notation is selected according to the immediately preceding learning information.

【００５２】この処理を実現するために、辞書４４内の
情報には、図８に示すような定義が行われている。In order to realize this processing, information in the dictionary 44 is defined as shown in FIG.

【００５３】例えば、systemという語に対しては、読み
としては英語（アルファベット文字(system)）と日本語
（ひらがな（しすてむ））の２種類が登録されており、
変換後候補は、図９に示すようなフォーマットで規定さ
れる。For example, for the word “system”, two kinds of readings, English (alphabetic characters (system)) and Japanese (hiragana (shisutemu)), are registered.
The converted candidates are defined in a format as shown in FIG.

【００５４】￥ｅは、特殊候補であることを示すフラグ
であり、その次には、英語（アルファベット）の候補が
配置され、その次には、￥で示される候補区切りフラグ
が配置され、その次に、日本語（カタカナ）候補が配置
される。$ E is a flag indicating that the candidate is a special candidate. Next, English (alphabet) candidates are arranged, and then a candidate delimiter flag indicated by @ is arranged. Next, Japanese (Katakana) candidates are arranged.

【００５５】さらに、品詞としては、「英単語」という
特殊品詞（文法上は名詞と同等）が採用されている。Further, a special part of speech "English word" (grammatically equivalent to a noun) is adopted as the part of speech.

【００５６】図８に示すように、「system」と入力した
後、変換操作を行うと、「しすてむ」と入力した後、変
換操作を行った場合と、同一の変換後候補が得られる。
英単語次候補処理部５２と日本語次候補処理部５０は、
変換後の候補の英語の候補または日本語の候補に対し
て、全角と半角をコンバートしたり、小文字と大文字を
コンバートすることで、表記の異なる次候補を自動的に
作成する。As shown in FIG. 8, when a conversion operation is performed after "system" is input, the same post-conversion candidate is obtained as when "shisutemu" is input and the conversion operation is performed. Can be
The next English word candidate processing unit 52 and the next Japanese candidate processing unit 50
By converting full-width and half-width characters, or converting lowercase and uppercase characters to English or Japanese candidates after conversion, the next candidates with different notations are automatically created.

【００５７】英単語候補学習結果反映部５５と日本語候
補学習結果反映部５４は、候補確定時に学習を行う。例
えば、最近の使用、使用頻度などが学習される他、どの
表記で確定されたかを候補毎に学習し、読みの種類（例
えば通常アルファベット、先頭大文字アルファベット、
カタカナなど）に応じて、変換環境全体での学習が行わ
れる。The English word candidate learning result reflecting unit 55 and the Japanese candidate learning result reflecting unit 54 perform learning when a candidate is determined. For example, in addition to learning the recent use, the frequency of use, etc., it also learns which notation has been confirmed for each candidate, and determines the type of reading (for example, normal alphabet, initial capital letter,
Learning in the entire conversion environment is performed according to katakana.

【００５８】そして、学習が行われると、それ以降の変
換時に、次の優先順位により、どの表記を終了するかが
決定される。（１）その候補での学習情報（２）その読みの種類に対応した環境全体での学習情
報（３）その読みの種類に対応したデフォルト設定情報Then, when learning is performed, at the time of subsequent conversion, which notation is to be ended is determined based on the next priority. (1) Learning information for the candidate (2) Learning information for the entire environment corresponding to the type of reading (3) Default setting information corresponding to the type of reading

【００５９】従って、例えば、「system」と入力し、
「system」が前回選択されたときは、次回も、「syste
m」が第１の候補とされる。「System」と入力し、「Sys
tem」が前回選択されたときは、同様に、「System」が
第１の候補とされる。「system」と入力し、「システ
ム」が前回選択されたときも、次回は「システム」が第
１の候補とされる。Therefore, for example, if "system" is input,
When “system” was selected last time, next time “syste
m ”is the first candidate. Type "System" and enter "Sys
When “tem” was previously selected, “System” is similarly the first candidate. When “system” is input and “system” is selected last time, “system” is the first candidate next time.

【００６０】そして、学習は、環境全体で行われるの
で、例えば、ｅ−ｍａｉｌ作成時における学習と、ワー
ドプロセッサ機能を利用した文章作成時の学習は、個別
に行われ、それぞれにおける学習が優先され、次に、他
の機能実行時の学習が参照される。Since the learning is performed in the entire environment, for example, the learning at the time of creating an e-mail and the learning at the time of writing a sentence using a word processor function are performed individually, and the learning in each of them is given priority. Next, learning at the time of executing another function is referred to.

【００６１】なお、上記実施の形態においては、外国語
として英語を例として説明したが、この他、フランス
語、イタリア語、その他の外国が、日本語と混在する場
合にも、本発明は適用することが可能である。In the above embodiment, English has been described as an example of a foreign language. However, the present invention is also applicable to a case where French, Italian, and other foreign languages are mixed with Japanese. It is possible.

【００６２】また、上記した実施の形態においては、上
記した処理を実行するプログラムをＲＯＭ２またはハー
ドディスクドライブ５に予め記憶させるようにしたが、
フロッピーディスク、ＣＤ−ＲＯＭディスクなどの記録
媒体にプログラムを記録しておき、これらの記録媒体を
ユーザに配布し、ハードディスクやＲＡＭにインストー
ルさせるようにしてもよい。さらにインターネット、そ
の他のネットワークを介してプログラムを伝送して、ハ
ードディスクやＲＡＭにインストールさせるようにして
もよい。In the above-described embodiment, the program for executing the above-described processing is stored in the ROM 2 or the hard disk drive 5 in advance.
The program may be recorded on a recording medium such as a floppy disk or a CD-ROM disk, and the recording medium may be distributed to users and installed on a hard disk or a RAM. Further, the program may be transmitted via the Internet or another network and installed on a hard disk or a RAM.

【００６３】[0063]

【発明の効果】以上の如く、請求項１に記載のかな漢字
変換装置、請求項５に記載のかな漢字変換方法、および
請求項９に記載のコンピュータ読み取り可能な記録媒体
によれば、日本語としての解析結果だけでなく、外国語
としての文字列を含む解析結果も得るようにし、両者の
区切り位置までの文字列を、所定の選択条件に基づいて
比較し、その比較結果に対応して、両者のうちのいずれ
か一方を選択するようにしたので、日本語と外国語が混
在する文章を、迅速かつ正確に、作成することが可能と
なる。As described above, according to the kana-kanji conversion device of claim 1, the kana-kanji conversion method of claim 5, and the computer-readable recording medium of claim 9, In addition to analysis results in Japanese, analysis results including character strings as foreign languages are also obtained, and character strings up to the delimitation position of both are compared based on predetermined selection conditions, and corresponding to the comparison results Then, one of the two is selected, so that a sentence in which Japanese and foreign languages are mixed can be quickly and accurately created.

[Brief description of the drawings]

【図１】本発明のかな漢字変換装置の構成例を示すブロ
ック図である。FIG. 1 is a block diagram illustrating a configuration example of a kana-kanji conversion device of the present invention.

【図２】図１の実施の形態におけるプログラムの構成を
説明する図である。FIG. 2 is a diagram illustrating a configuration of a program according to the embodiment of FIG. 1;

【図３】図１の実施の形態におけるより詳細なプログラ
ムの構成を説明する図である。FIG. 3 is a diagram for explaining a more detailed configuration of a program in the embodiment of FIG. 1;

【図４】図３のプログラムの構成の処理を説明するフロ
ーチャートである。FIG. 4 is a flowchart illustrating processing of the configuration of the program in FIG. 3;

【図５】図４のステップＳ４の英単語読み抽出処理のよ
り詳細な処理を説明するフローチャートである。FIG. 5 is a flowchart illustrating a more detailed process of the English word reading extraction process in step S4 of FIG. 4;

【図６】図３の日英混在全候補解析部の処理を説明する
フローチャートである。FIG. 6 is a flowchart illustrating a process of a Japanese-English mixed all candidate analysis unit in FIG. 3;

【図７】同音異義語の構成を説明する図である。FIG. 7 is a diagram illustrating the configuration of a homonym.

【図８】辞書内の情報の定義を説明する図である。FIG. 8 is a diagram illustrating the definition of information in a dictionary.

【図９】変換後候補の特殊フォーマットを説明する図で
ある。FIG. 9 is a diagram illustrating a special format of a candidate after conversion.

[Explanation of symbols]

２１ユーザ入出力部２２連文節解析処理部２３次候補解析処理部２４辞書検索処理部２５辞書４１入力部４２出力部４３辞書検索処理部４４辞書４５日英混在連文節解析部４６日本語連文節解析部４７英語連文節解析部４８日英混在全候補解析部４９ローマ字かな変換部５０日本語次候補処理部５１英単語読み抽出部５２英単語次候補処理部５３同音異義語評価ソート部５４日本語候補学習結果反映部５５英単語候補学習結果反映部 Reference Signs List 21 User input / output unit 22 Continuous phrase analysis processing unit 23 Next candidate analysis processing unit 24 Dictionary search processing unit 25 Dictionary 41 Input unit 42 Output unit 43 Dictionary search processing unit 44 Dictionary 45 Japanese-English mixed continuous phrase analysis unit 46 Japanese consecutive phrase analysis unit 47 English phrase and phrase analysis unit 48 Japanese-English mixed candidate analysis unit 49 Romaji kana conversion unit 50 Japanese next candidate processing unit 51 English word reading and extraction unit 52 English word next candidate processing unit 53 Homonymic synonym evaluation sort unit 54 Japanese candidate learning result Reflection unit 55 English word candidate learning result reflection unit

フロントページの続き (56)参考文献特開平５−282360（ＪＰ，Ａ) 特開昭63−316162（ＪＰ，Ａ) 特開昭64−53265（ＪＰ，Ａ) 特開平４−75162（ＪＰ，Ａ) 特開平８−106461（ＪＰ，Ａ) 特開昭64−18872（ＪＰ，Ａ) 特開平８−212216（ＪＰ，Ａ) 特開平５−342192（ＪＰ，Ａ) 特開昭60−20234（ＪＰ，Ａ) (58)調査した分野(Int.Cl.⁶，ＤＢ名) G06F 17/21 G06F 17/22Continuation of front page (56) References JP-A-5-282360 (JP, A) JP-A-63-316162 (JP, A) JP-A-64-53265 (JP, A) JP-A-4-75162 (JP, A) JP-A 8-106461 (JP, A) JP-A 64-18872 (JP, A) JP-A 8-212216 (JP, A) JP-A 5-342192 (JP, A) 60-20234 (JP, A) (58) Fields investigated (Int. Cl. ⁶ , DB name) G06F 17/21 G06F 17/22

Claims

(57) [Claims]

A receiving means for receiving a character string to be converted; a first analyzing means for performing continuous phrase analysis on the character string received by the receiving means as Japanese; and a foreign language from the character string received by the receiving means. Extracting means for extracting a character string corresponding to a word; and second analyzing means for analyzing a portion of the character string received by the receiving means, which is not extracted as a foreign language by the extracting means, as a continuous phrase analysis in Japanese Detecting means for analyzing the analysis result of the first analysis means and the analysis result of the second analysis means toward an end, and detecting a position where a segment break of the phrase first matches as a break position; An undetermined character string of the analysis result of the first analysis means up to the break position detected by the detection means, and an analysis result of the second analysis means. The undetermined character string up to the break position detected by the detection means is compared based on a predetermined selection condition, and the undetermined character string up to the break position detected by the detection means is compared with the comparison result. A kana-kanji conversion device comprising: a selection unit that selects one of an analysis result of the first analysis unit and an analysis result of the second analysis unit as an analysis result of a character string.

2. The method according to claim 1, wherein the selecting means selects the one having the smaller number of phrases.
As the first selection condition, the candidate used immediately before is included
Is used as the next selection condition, and the first analysis means
The analysis result is used as the next selection condition.
2. The kana-kanji conversion device according to claim 1, wherein the selection is performed .

3. The foreign language extracted by the extracting means.
At least uppercase and lowercase letters, full-width characters of the corresponding string
Letters and single-byte characters, or only capital letters and small letters thereafter
A string that is a character and a string that starts and ends with a lowercase letter
3. The kana-kanji conversion device according to claim 1 or 2, further comprising a processing unit for processing as a next candidate .

4. Learning the notation finally selected by the user
The kana-kanji conversion device according to claim 1, 2, or 3, further comprising a learning means for reflecting the result in a next process .

5. A kana-kanji conversion for converting an input character string.
In the kana-kanji conversion method in the conversion device, a receiving step of receiving a character string to be converted , and converting the character string received in the receiving step into Japanese
A first parsing step of parsing a phrase as a sequence, and a foreign language from the character string received in the receiving step.
Extracting a character string corresponding to the character string, and among the character strings received in the receiving step,
Part not extracted as a foreign language in the extraction step
Second parsing step for analyzing multiple phrases in Japanese
And the analysis result of the first analysis step and the second analysis step.
Analyze the analysis results of the step toward the end,
The position where the break of the clause first matches as the break position
A detecting step of detecting, and the detecting step of the analysis result in the first analyzing step.
Undetermined character string up to the break position detected in the
Detecting the analysis result in the second analysis step;
The undetermined character string up to the break position detected in the
Comparing based on predetermined selection conditions and corresponding to the comparison result
To the break position detected in the detection step.
As the analysis result of the undetermined character string, the first analysis step
Analysis results in the second analysis step and the analysis results in the second analysis step.
Kana-Kanji conversion method characterized by comprising a selection step of selecting one of the fruit.

6. In the selecting step, the number of phrases is small.
Is the first selection condition, and the candidate used immediately before is included.
The first analysis is defined as the next selection condition.
The analysis result in the step is further
The kana-kanji conversion method according to claim 5 , wherein the selection is performed as a matter .

7. The foreign language extracted in the extracting step
At least uppercase and lowercase letters, full-width characters of the corresponding string
Letters and single-byte characters, or only capital letters and small letters thereafter
A string that is a character and a string that starts and ends with a lowercase letter
7. The kana-kanji conversion method according to claim 5 , further comprising a processing step of processing as a next candidate .

8. Learning the notation finally selected by the user
The kana-kanji conversion method according to claim 5, 6 or 7, further comprising a learning step for reflecting the result in a next process.

9. A kana-kanji conversion for converting an input character string.
A receiving step of receiving a character string to be converted , and converting the character string received in the receiving step into Japanese
A first parsing step of parsing a phrase as a sequence, and a foreign language from the character string received in the receiving step.
Extracting a character string corresponding to the character string, and among the character strings received in the receiving step,
Part not extracted as a foreign language in the extraction step
Second parsing step for analyzing multiple phrases in Japanese
And the analysis result of the first analysis step and the second analysis step.
Analyze the analysis results of the step toward the end,
The position where the break of the clause first matches as the break position
A detecting step of detecting, and the detecting step of the analysis result in the first analyzing step.
Undetermined character string up to the break position detected in the
Detecting the analysis result in the second analysis step;
The undetermined character string up to the break position detected in the
Comparing based on predetermined selection conditions and corresponding to the comparison result
To the break position detected in the detection step.
As the analysis result of the undetermined character string, the first analysis step
Analysis results in the second analysis step and the analysis results in the second analysis step.
To perform a selection step to select one of the results.
Computer reading characterized by recording the program
A removable recording medium .

10. In the selecting step, the number of phrases is small.
Is the first selection condition, and the candidate used immediately before
The included condition is used as the next selection condition, and the first solution
Analysis result in the analysis step is further selected
10. The method according to claim 9 , wherein the selection is performed as a condition.
A recording medium according to claim 1.

11. The foreign language extracted in the extraction step
, At least uppercase and lowercase, all
Square and half-width characters, or only the first letter is uppercase and
Strings that are lowercase and characters that start and end with lowercase
Perform further processing steps to process the column as the next candidate
The recording medium according to claim 9, wherein a program for causing the recording medium to be recorded is recorded.

12. Learning the notation finally selected by the user
And further execute the learning step to be reflected in the next processing
12. The recording medium according to claim 9, wherein a program for causing the recording medium to be recorded is recorded.