JP2001109740A

JP2001109740A - Device and method for preparing chinese document

Info

Publication number: JP2001109740A
Application number: JP28505099A
Authority: JP
Inventors: Yasushi Ishizuka; 靖石塚; Kaito Kaminari; 雷海涛
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 1999-10-06
Filing date: 1999-10-06
Publication date: 2001-04-20

Abstract

PROBLEM TO BE SOLVED: To improve input efficiency by inputting a pinyin character string from a keyboard as in the conventional way and also inputting voice information by using voice. SOLUTION: A pinyin character string inputting means inputs the pinyin character string and also, a voice information inputting means inputs voice information of Kanji (Chinese character). A voice information analyzing means analyzes the inputted voice information and acquires tone information, and a pinyin converting means performs pinyin conversion on the basis of the tone information acquired by the voice information analyzing means and the pinyin character string inputted from the pinyin character string inputting means and acquires a conversion candidate.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、中国語文書の入力
に際し、キーボードから入力したピンイン文字列と音声
で入力した声調情報を使ってピンイン変換を行なうこと
により、中国語の入力操作性の向上を図った中国語文書
作成装置及び中国語文書作成方法に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention improves Chinese input operability by performing a Pinyin conversion using a Pinyin character string input from a keyboard and tone information input by voice when a Chinese document is input. And a Chinese document creation method.

【０００２】[0002]

【従来の技術】中国語の漢字には、日本語の漢字の読み
のような発音表記をアルファベットで表したピンイン文
字列が対応している。図１のように、漢字１文字に対応
するピンイン文字列は、声母、韻母の対で表現される。
中国語におけるピンイン変換は、図２のようにピンイン
文字列「ｇａｉ」をキーボードから入力し、入力された
ピンイン文字列を変換し、対応する変換候補「該、垓、
改、蓋、概」を得るものである。しかしながら、日本語
のかな漢字変換における同音異義語の問題と同様に、中
国語のピンイン変換においても変換候補として複数の同
音異義語が出力されるという問題があり、この同音異義
語の問題が変換率の低下を招いている。図３に、ピンイ
ン文字列「ｇａｉ」及び「ｑｉ」に対応する複数の同音
異義語の例を示している。2. Description of the Related Art Chinese kanji correspond to Pinyin character strings in which phonetic notations, such as Japanese kanji readings, are represented in the alphabet. As shown in FIG. 1, a Pinyin character string corresponding to one kanji character is represented by a pair of an initial and a final.
Pinyin conversion in Chinese is performed by inputting a pinyin character string “gai” from a keyboard as shown in FIG. 2, converting the input pinyin character string, and converting the corresponding conversion candidates “the, phantom,
Break, lid, outline. However, similar to the problem of homonyms in Japanese Kana-Kanji conversion, there is also the problem that multiple homonyms are output as conversion candidates in Chinese Pinyin conversion. Has been reduced. FIG. 3 shows an example of a plurality of homonyms corresponding to the Pinyin character strings “gai” and “qi”.

【０００３】この問題を解決するために、ピンイン文字
列の他に日本語のアクセント情報のような声調情報を入
力してピンイン変換を行なうという方法が提案されてい
る。声調情報は、第１声から第５声までの５種類があ
り、全ての漢字はいずれかの種類の声調情報を持ってい
る。ピンイン文字列の他に声調情報の種類を入力してピ
ンイン変換することにより、図４のように、変換結果と
して出力する変換候補の数を減らすことができる。In order to solve this problem, there has been proposed a method of inputting tone information such as Japanese accent information in addition to a Pinyin character string and performing Pinyin conversion. There are five types of tone information from the first voice to the fifth voice, and all kanji have any type of tone information. By inputting the type of tone information in addition to the Pinyin character string and performing Pinyin conversion, the number of conversion candidates output as conversion results can be reduced as shown in FIG.

【０００４】このような声調情報の種類を入力してピン
イン変換を行なう方法としては、特開平８−１８５３９
６号が提案されていた。しかしながら、この従来技術で
は、キーボードから、声調情報の種類を示す数字（以
下、声調番号という）を入力する（即ち、声調情報の種
類である第１声〜第５声に対応した１〜５の数字を入力
する）ものであるため、ピンイン文字列の入力に加え
て、声調番号の入力もキーボードから行なう必要があ
り、ユーザのキー操作が煩雑となるという課題があっ
た。更に、大多数のユーザは、漢字がどのような声調情
報の種類を持っているのか正しく認識しておらず、ピン
イン文字列と一緒に声調番号もキーボードから入力する
方法は、入力速度の低下、正確性の低下を招いているの
が現状である。As a method of inputting the type of tone information and performing pinyin conversion, see Japanese Patent Application Laid-Open No. Hei 8-18539.
No. 6 had been proposed. However, in this conventional technique, a number indicating the type of tone information (hereinafter referred to as a tone number) is input from a keyboard (that is, 1 to 5 corresponding to the first to fifth voices, which are the types of tone information). (For inputting numbers), it is necessary to input the tone number in addition to the input of the Pinyin character string from the keyboard, and there has been a problem that the key operation of the user becomes complicated. Furthermore, most users do not correctly recognize what kind of tone information the kanji has, and the method of inputting the tone number along with the pinyin character string from the keyboard reduces input speed, The current situation is that the accuracy is reduced.

【０００５】[0005]

【発明が解決しようとする課題】上述したように、従来
技術では、ピンイン変換を行なう際にはピンイン文字列
と一緒に声調番号をキーボードから入力する方法が採用
されているが、このように声調番号を余分に入力しなけ
ればならず、入力操作が煩雑である。また、声調番号を
誤って入力してしまうと、希望していない変換候補のみ
出力されるため、再度ピンイン文字列と声調番号を入力
する必要があるため、声調情報の種類を正しく認識でき
ていないユーザにとっては、反って正確性の低下を招い
てしまうという課題があった。As described above, in the prior art, when performing pinyin conversion, a method of inputting a tone number along with a pinyin character string from a keyboard is employed. An extra number must be entered, and the input operation is complicated. In addition, if the tone number is erroneously input, only the conversion candidates that are not desired are output, so it is necessary to input the pinyin character string and the tone number again, so that the type of the tone information cannot be correctly recognized. For the user, there is a problem that the accuracy is lowered.

【０００６】本発明は、上述した事情を考慮してなされ
たもので、ピンイン文字列は従来通りキーボードから入
力すると共に、声調情報については音声を使って入力す
ることにより入力効率の向上を可能とした中国語文書作
成装置及び中国語文書作成方法を提供することを目的と
する。The present invention has been made in consideration of the above-mentioned circumstances, and it is possible to improve the input efficiency by inputting a Pinyin character string from a keyboard as in the past and inputting tone information using voice. It is an object to provide a Chinese document creation device and a Chinese document creation method.

【０００７】[0007]

【課題を解決するための手段】ピンイン文字列入力手段
からピンイン文字列を入力させると共に、音声情報入力
手段から音声情報を入力させる。音声情報解析手段によ
り音声情報を解析し、前記ピンイン文字列入力手段から
入力されたピンイン文字列に対応する声調情報を取得す
る。ピンイン変換手段が、ピンイン文字列入力手段から
入力されたピンイン文字列と音声情報解析手段により取
得された声調情報に基づいてピンイン変換用辞書を参照
し対応する漢字を取得する。表示手段が、ピンイン変換
手段によって取得された漢字を変換候補として表示す
る。A pinyin character string is input from the pinyin character string input means, and voice information is input from the voice information input means. Voice information is analyzed by voice information analysis means, and tone information corresponding to the Pinyin character string input from the Pinyin character string input means is obtained. The Pinyin conversion means refers to the Pinyin conversion dictionary based on the Pinyin character string input from the Pinyin character string input means and the tone information obtained by the voice information analysis means, and acquires a corresponding Chinese character. The display means displays the kanji obtained by the pinyin conversion means as conversion candidates.

【０００８】このような中国語文書作成装置及び中国語
文書作成方法によれば、音声情報を入力することで声調
情報の入力が可能であるため、キー入力はピンイン文字
列を入力する場合のみで良いため、声調情報を用いたピ
ンイン変換において、入力操作を容易に且つ迅速に行な
うことが可能である。According to such a Chinese document creating apparatus and a Chinese document creating method, since tone information can be input by inputting voice information, key input is performed only when a Pinyin character string is input. Therefore, in pinyin conversion using tone information, an input operation can be easily and quickly performed.

【０００９】[0009]

【発明の実施の形態】以下、図面を参照して本発明の実
施の形態について説明する。（第1 の実施形態）第１の実施形態は、ピンイン文字列
をキーにより入力し、声調情報を音声入力により入力
し、この２つの情報からピンイン変換を行なうようにし
たものである。Embodiments of the present invention will be described below with reference to the drawings. (First Embodiment) In the first embodiment, a pinyin character string is input by a key, tone information is input by voice input, and pinyin conversion is performed from these two pieces of information.

【００１０】図５は第１の実施形態に係わる中国語文書
作成装置の構成を示したブロック図である。図５におい
て、キーボード入力部１は、キーボードなどで構成され
ており、ピンイン文字列の入力等、中国語文書作成のた
めの各種入力、編集操作を行うものである。音声入力部
２は、マイクなどで構成されており、音声データを入力
するためのものである。ポインティングデバイス部３
は、ペン入力装置やマウスなどで構成されており、ピン
イン変換によって得られた変換候補の選択などを行なう
ものである。制御部４は、ＣＰＵなどで構成されてお
り、キーボード入力部１、音声入力部２、ポインティン
グデバイス部３から渡される夫々のキーデータ、音声デ
ータ、ポインティングデータを使って、後述するピンイ
ン変換部５、音声データ処理部７、出力部１０などの各
処理部を制御する。ピンイン変換部５は、キーボード入
力部１から入力されたピンイン文字列と音声入力部２か
ら入力された音声データにより得られる声調情報から、
ピンイン変換用辞書６を参照して漢字に変換し、その変
換候補を制御部４に渡す。ピンイン変換用辞書６は、ピ
ンイン文字列を対応する漢字に変換するために必要な情
報を収容しているものである。また、ユーザが登録する
ユーザ単語データもここに記憶される。音声データ処理
部７は、音声入力部２から入力された音声データを音声
データ処理用辞書８を参照して解析処理することによ
り、音声データから漢字の声母、韻母、声調情報などの
解析結果を取得し、この解析結果を制御部４に渡す。音
声データ処理用辞書８は、入力された音声データを解析
処理するために必要な情報を収容している。データ記憶
部９は、キーボード入力部１から入力されるピンイン文
字列、入力されたピンイン文字列に対するピンイン変換
に必要なデータや、ピンイン変換を行なった結果得られ
た変換候補、この変換候補を一覧表示するための候補一
覧データ、音声入力部２から入力される音声データ、入
力される音声データの解析処理に必要なデータ、ユーザ
から設定された各種設定情報などを記憶する。出力部１
０は、ＣＲＴやＬＣＤなどからなり、ピンイン変換によ
り得られた変換結果、候補一覧などを表示する。FIG. 5 is a block diagram showing the configuration of the Chinese document creating apparatus according to the first embodiment. In FIG. 5, a keyboard input unit 1 is composed of a keyboard or the like, and performs various input and editing operations for creating a Chinese document, such as input of a Pinyin character string. The voice input unit 2 is configured by a microphone or the like, and is for inputting voice data. Pointing device section 3
Is composed of a pen input device, a mouse, and the like, and selects a conversion candidate obtained by pinyin conversion. The control unit 4 includes a CPU or the like, and uses key data, voice data, and pointing data passed from the keyboard input unit 1, the voice input unit 2, and the pointing device unit 3 to generate a pin-in conversion unit 5 described later. , An audio data processing unit 7, an output unit 10, and the like. The Pinyin conversion unit 5 converts the Pinyin character string input from the keyboard input unit 1 and the tone information obtained from the voice data input from the voice input unit 2 into
Referring to the Pinyin conversion dictionary 6, it is converted into Kanji, and the conversion candidate is passed to the control unit 4. The Pinyin conversion dictionary 6 contains information necessary for converting a Pinyin character string into a corresponding Chinese character. The user word data registered by the user is also stored here. The voice data processing unit 7 analyzes the voice data input from the voice input unit 2 with reference to the voice data processing dictionary 8, thereby analyzing the analysis results such as the initials, finals, and tone information of the kanji from the voice data. Acquired and the analysis result is passed to the control unit 4. The audio data processing dictionary 8 stores information necessary for analyzing input audio data. The data storage unit 9 lists a Pinyin character string input from the keyboard input unit 1, data necessary for Pinyin conversion for the input Pinyin character string, conversion candidates obtained as a result of the Pinyin conversion, and a list of the conversion candidates. It stores candidate list data to be displayed, audio data input from the audio input unit 2, data necessary for analysis processing of the input audio data, various setting information set by the user, and the like. Output unit 1
Numeral 0 is composed of a CRT, LCD, or the like, and displays a conversion result obtained by pinyin conversion, a candidate list, and the like.

【００１１】次に、第１の実施形態のピンイン変換処理
の動作について、図６のフローチャートを参照して説明
する。最初に、制御部４は、ユーザからキーボード入力
部１を介してピンイン文字列が入力されるのを待ち、ピ
ンイン文字列が入力されるとそのピンイン文字列をデー
タ記憶部９に記憶する（ステップＳ１０１）。引き続
き、制御部４は、ピンイン文字列が入力されてから一定
時間内に、ユーザから音声入力部２を介して所望する漢
字の発音表記が音声データとして入力されるのを待ち、
音声データが入力されるとその音声データをデータ記憶
部９に記憶する（ステップＳ１０２）。制御部４は、ユ
ーザからの音声データの入力がピンイン文字列が入力さ
れてから一定時間内にあったか否かを判断し（ステップ
Ｓ１０３）、音声の入力があった場合（ステップＳ１０
３のＹＥＳ）、入力された音声データを音声データ処理
部７へ出力し、音声データから声調情報を取り出す処理
を行なわせる（ステップＳ１０４）。Next, the operation of the pinyin conversion process according to the first embodiment will be described with reference to the flowchart of FIG. First, the control unit 4 waits for a pinyin character string to be input from the user via the keyboard input unit 1, and when the pinyin character string is input, stores the pinyin character string in the data storage unit 9 (step). S101). Subsequently, the control unit 4 waits for the user to input a desired kanji phonetic notation as voice data through the voice input unit 2 within a predetermined time after the pinyin character string is input,
When voice data is input, the voice data is stored in the data storage unit 9 (step S102). The control unit 4 determines whether or not the input of the voice data from the user has been within a predetermined time after the input of the Pinyin character string (step S103), and when the voice has been input (step S10).
3), the input voice data is output to the voice data processing unit 7, and a process of extracting tone information from the voice data is performed (step S104).

【００１２】音声データ処理部７は、制御部４から渡さ
れた音声データを、音声データ処理用辞書８を参照して
解析し、この音声データに対応する漢字の声母、韻母、
声調情報などの解析結果を取得する。音声データ処理部
７は、得られた解析結果を制御部４に出力する。The voice data processing unit 7 analyzes the voice data passed from the control unit 4 with reference to the voice data processing dictionary 8, and outputs a kanji initial, a final base, and a kanji corresponding to the voice data.
Obtain analysis results such as tone information. The audio data processing unit 7 outputs the obtained analysis result to the control unit 4.

【００１３】制御部４は、音声データ処理部７から出力
された解析結果をチェックし、声調情報が正しく取り出
せたか否かを判断する（ステップＳ１０５）。すなわ
ち、制御部４は、音声データの解析結果のうち、漢字の
声母、韻母の文字列が、ユーザから入力されたピンイン
文字列と一致するか否かを判断し、両者が一致する場合
に声調情報が正しく取り出せたと判断する。The control unit 4 checks the analysis result output from the audio data processing unit 7 and determines whether tone information has been correctly extracted (step S105). That is, the control unit 4 determines whether or not the character strings of the kanji initial and the final in the analysis result of the audio data match the Pinyin character string input by the user. Judge that the information was correctly extracted.

【００１４】この判断により声調情報が正しく取り出せ
たと判断した場合（ステップＳ１０５のＹＥＳ）、制御
部４は、音声データの解析結果から得られる声調情報と
データ記憶部９に記憶されているピンイン文字列とをピ
ンイン変換部５に出力し、ピンイン変換を行なわせる
（ステップＳ１０６）。ピンイン変換部５は、制御部４
から出力されたピンイン文字列と声調情報とからピンイ
ン変換用辞書６を参照することによりピンイン変換を行
ない、ピンイン文字列に対応する変換候補を取得する。
このときピンイン変換部５は、声調情報を変換候補の候
補数の絞込みに用い、声調情報と一致しない漢字は変換
候補として出力しないようにする。ピンイン変換部５
は、ピンイン変換を行なって得られた変換候補を制御部
４に出力する。If it is determined that the tone information has been correctly extracted (YES in step S105), the control unit 4 determines the tone information obtained from the analysis result of the voice data and the Pinyin character string stored in the data storage unit 9. Is output to the Pinyin conversion unit 5 to perform the Pinyin conversion (step S106). The Pinyin conversion unit 5 includes a control unit 4
The Pinyin conversion is performed by referring to the Pinyin conversion dictionary 6 from the Pinyin character string and tone information output from, and a conversion candidate corresponding to the Pinyin character string is obtained.
At this time, the Pinyin conversion unit 5 uses the tone information to narrow down the number of conversion candidates, and prevents kanji characters that do not match the tone information from being output as conversion candidates. Pinyin converter 5
Outputs the conversion candidate obtained by performing the Pinyin conversion to the control unit 4.

【００１５】制御部４は、ピンイン変換部５から出力さ
れた変換候補をデータ記憶部９に記憶させると共に、当
該変換候補を出力部１０に出力し、変換候補を表示させ
る（ステップＳ１０８）。ユーザは、出力部１０に表示
される変換候補の中から所望する変換候補をポインティ
ングデバイス３で指定することにより、変換候補を選択
することができる。The control unit 4 stores the conversion candidates output from the Pinyin conversion unit 5 in the data storage unit 9, outputs the conversion candidates to the output unit 10, and displays the conversion candidates (step S108). The user can select a conversion candidate by designating a desired conversion candidate from the conversion candidates displayed on the output unit 10 with the pointing device 3.

【００１６】一方、ステップＳ１０３の判断においてピ
ンイン文字列が入力されてから一定時間内に音声入力が
ない場合（ステップＳ１０３のＮＯ）、及び、ステップ
Ｓ１０５の判断において音声データの解析処理の結果、
声調情報が正しく取り出せなかったと判断した場合（ス
テップＳ１０５のＮＯ）、制御部４は、ピンイン文字列
のみピンイン変換部５に出力する。ピンイン変換部５
は、制御部４から出力されたピンイン文字列のみの情報
に基づいて、ピンイン変換用辞書６を参照してピンイン
変換を行う（ステップＳ１０７）。On the other hand, if there is no voice input within a predetermined time after the pinyin character string is input in the determination of step S103 (NO in step S103), and as a result of voice data analysis processing in the determination of step S105,
If it is determined that the tone information could not be correctly extracted (NO in step S105), the control unit 4 outputs only the Pinyin character string to the Pinyin conversion unit 5. Pinyin converter 5
Performs Pinyin conversion with reference to the Pinyin conversion dictionary 6 based on the information of only the Pinyin character string output from the control section 4 (step S107).

【００１７】以上説明した動作によりピンイン変換を行
なった際の具体例を図７に示している。図７に示すよう
に、ユーザは、ピンイン文字列「ｇａｉ」と、「ｇａ
ｉ」の第３声の声調情報をもつ音声を入力することで、
その変換候補として「改」を得ることができる。FIG. 7 shows a specific example when pin-in conversion is performed by the operation described above. As shown in FIG. 7, the user inputs the pinyin character string “gai” and “ga”
By inputting the voice having the tone information of the third voice of "i",
"Kai" can be obtained as the conversion candidate.

【００１８】このように、ピンイン変換に必要な情報の
うち声調情報を音声で入力するようにしたので、ピンイ
ン文字列のみキーボード入力部１から入力すれば良く、
入力操作の煩雑化を抑制することができ、入力速度の低
下を招くことなく、簡単に声調情報を用いた変換率の高
いピンイン変換を行なうことが可能となる。As described above, the tone information among the information necessary for the Pinyin conversion is input by voice, so that only the Pinyin character string needs to be input from the keyboard input unit 1.
Complexity of input operation can be suppressed, and pinyin conversion with a high conversion rate using tone information can be easily performed without lowering the input speed.

【００１９】なお、上記説明では、単一の漢字をピンイ
ン変換する例を示しているが、複数の漢字からなる単
語、或いは、文を一括してピンイン変換することも可能
である。即ち、ユーザから、先ず、単語或いは文のピン
イン文字列を一括して入力させ、次に、同様の単語或い
は文を音声により一括して入力させる。入力された単語
或いは文の音声からそれらを構成している各漢字の声調
情報を一括して取り出し、この各漢字の声調情報を用い
て、先に入力されているピンイン文字列を各漢字毎に変
換する。In the above description, an example in which a single kanji is converted into a pinyin is shown. However, a word or a sentence composed of a plurality of kanji can be collectively converted into a pinyin. That is, first, the user is caused to collectively input a pinyin character string of a word or sentence, and then to input the same word or sentence collectively by voice. From the voice of the input word or sentence, the tone information of each kanji that composes them is extracted at a time, and the pinyin character string that has been input earlier is extracted for each kanji using the tone information of each kanji. Convert.

【００２０】また、ユーザからの音声入力を受け付ける
タイミングをピンイン文字列入力後の一定時間内として
いるが、これに限らず、例えば次のピンイン文字列が入
力されるまで、音声入力を有効とするとしても良い。Further, the timing at which the voice input from the user is received is set within a fixed time after the input of the Pinyin character string, but the present invention is not limited to this. For example, the voice input is valid until the next Pinyin character string is input. It is good.

【００２１】また、音声で入力する声調情報は、ピンイ
ン文字列として入力される漢字の１文字毎に全て入力す
る必要はなく、変換候補が多い漢字や、ピンイン変換を
行なった際に変換間違いが予想される特定の漢字を入力
する場合のみ入力すれば良い。It is not necessary to input all tone information input by voice for each kanji character input as a pinyin character string, and kanji with many conversion candidates or conversion error when pinyin conversion is performed is not required. You only need to enter it if you want to enter a specific expected kanji.

【００２２】また、一般的に中国語の音声認識処理にお
いては、音声データの入力開始時の声母の認識が、その
後に続く韻母に比べて難しいという問題がある。従っ
て、本実施形態の音声データの解析処理で、声母の認識
結果に誤り（声母の認識結果がピンイン文字列と異な
る）が比較的多く生じる可能性がある。しかし、声母の
認識結果に誤りがあっても、その声母に続く韻母の認識
結果が正しい（韻母の認識結果がピンイン文字列と一致
する）場合は、これらの声母と韻母を持つ認識結果を正
しいものとして判断し、この認識結果により得られた声
調情報をピンイン変換に用いるようにしても良い。Further, in general, in the speech recognition processing in Chinese, there is a problem that the recognition of the initial at the start of inputting the audio data is more difficult than the subsequent final. Therefore, in the voice data analysis process of the present embodiment, there is a possibility that relatively many errors occur in the recognition result of the initial (the recognition result of the initial differs from the Pinyin character string). However, even if the recognition result of the initial is incorrect, if the recognition result of the final following the initial is correct (the recognition result of the final matches the Pinyin character string), the recognition result having these initials and the final is correct. It is also possible to use the tone information obtained from the recognition result for pinyin conversion.

【００２３】（第２の実施形態）次に、第２の実施形態
について説明する。上述した第１の実施形態では、ピン
イン変換を行なう際にピンイン文字列と共に音声入力に
て声調情報を入力するようにしたが、これに対して第２
の実施形態では、ピンイン変換の結果得られた変換候補
に対して音声入力で声調情報を入力することで、その声
調情報を有する漢字の変換候補を優先的に表示するよう
にしたものである。即ち、ピンイン文字列のみを利用し
てピンイン変換を行なった場合、その変換結果として複
数の声調情報を有する漢字が変換候補として得られるた
め、変換候補の表示ではユーザの所望する声調情報を有
する漢字の変換候補が優先的に表示されるとは限らな
い。このため、変換候補に対して所望する漢字の声調情
報を入力できるようにし、入力した声調情報を有する漢
字の変換候補を優先的に表示させるようにした。(Second Embodiment) Next, a second embodiment will be described. In the above-described first embodiment, tone information is input by voice input together with a pinyin character string when performing pinyin conversion.
In the embodiment, the tone information is input by voice to the conversion candidate obtained as a result of the Pinyin conversion, so that the conversion candidate of the kanji having the tone information is preferentially displayed. That is, when the Pinyin conversion is performed using only the Pinyin character string, a kanji having a plurality of tone information is obtained as a conversion candidate as a conversion result. Therefore, in the conversion candidate display, the kanji having the tone information desired by the user is displayed. Is not always displayed preferentially. For this reason, the desired kanji tone information can be input to the conversion candidate, and the kanji conversion candidate having the input tone information is preferentially displayed.

【００２４】第２の実施形態に係わる中国語文書作成装
置の構成は、図５に示される第１の実施形態の構成と同
じである。第２の実施形態の声調情報の入力による変換
候補の表示順位変更処理の動作について、図８を参照し
て説明する。The configuration of the Chinese document creating apparatus according to the second embodiment is the same as the configuration of the first embodiment shown in FIG. The operation of the process for changing the display order of conversion candidates according to the input of tone information according to the second embodiment will be described with reference to FIG.

【００２５】ここで、ピンイン変換部５においてピンイ
ン文字列のみを用いたピンイン変換が行なわれ、出力部
１０の表示画面上にピンイン変換の変換候補が既に表示
された状態にあるものとする。この変換候補の表示は、
図９のように、複数の声調情報を有する変換候補のうち
表示優先順位の高いもの（ここでは、「該」）が優先的
に表示されているものとする。Here, it is assumed that Pinyin conversion using only the Pinyin character string is performed in the Pinyin conversion section 5 and conversion candidates for Pinyin conversion have already been displayed on the display screen of the output section 10. The display of this conversion candidate is
As shown in FIG. 9, it is assumed that among the conversion candidates having a plurality of tone information, the conversion candidate having a higher display priority (here, “the”) is displayed with priority.

【００２６】先ず、ポインティングデバイス３を介して
ユーザから出力部１０の表示画面上の任意の位置が指定
されると、指定された出力部１０の表示画面上の指定位
置情報を受け付け（ステップＳ２０１）、この指定位置
情報に基づいて、ユーザから変換候補が指定されたか否
かを判断する（ステップＳ２０２）。ユーザから変換候
補が指定されたと判断した場合（ステップＳ２０２のＹ
ＥＳ）、次に、制御部４は、指定された変換候補につい
てその他の変換候補があるか否かを判断する（ステップ
Ｓ２０３）。即ち、制御部４は、指定された変換候補
（表示画面上に表示されている変換候補）に代えて表示
画面上に表示すべき変換候補がデータ記憶部９に記憶さ
れているか否かを判断する。First, when an arbitrary position on the display screen of the output unit 10 is designated by the user via the pointing device 3, information on the designated position on the display screen of the designated output unit 10 is received (step S201). It is determined whether or not a conversion candidate has been designated by the user based on the designated position information (step S202). When it is determined that a conversion candidate has been designated by the user (Y in step S202)
ES) Next, the control unit 4 determines whether or not there is another conversion candidate for the specified conversion candidate (step S203). That is, the control unit 4 determines whether a conversion candidate to be displayed on the display screen is stored in the data storage unit 9 in place of the designated conversion candidate (conversion candidate displayed on the display screen). I do.

【００２７】ポインティングデバイス３により指定され
た変換候補ついてその他の変換候補があると判断した場
合（ステップＳ２０３のＹＥＳ）、次いで、制御部４
は、ポインティングデバイス３による変換候補の指定
が、ユーザにより取り消されたか否かを判断する（ステ
ップＳ２０４）。If it is determined that there is another conversion candidate for the conversion candidate specified by the pointing device 3 (YES in step S203), then the control unit 4
Determines whether or not the designation of the conversion candidate by the pointing device 3 has been canceled by the user (step S204).

【００２８】ユーザによる変換候補の指定が取り消され
ていない場合（ステップＳ２０４のＮＯ）、制御部４
は、ユーザから音声入力部２を介して音声データが入力
されるのを待ち、音声データが入力されるとそれを受け
付け、音声データ処理部７へ出力する（ステップＳ２０
５）。If the designation of the conversion candidate by the user has not been canceled (NO in step S204), the control unit 4
Waits for audio data to be input from the user via the audio input unit 2, accepts the input audio data, and outputs it to the audio data processing unit 7 (step S20).
5).

【００２９】音声データ処理部７は、制御部４から渡さ
れた音声データを、音声データ処理用辞書を参照して解
析し、解析結果（漢字の声母、韻母、声調情報など）を
取得し、その解析結果を制御部４に出力する（ステップ
Ｓ２０６）。制御部４は、音声データ処理部７から出力
された解析結果をチェックし、声調情報が正しく取り出
せたか否かを判断する（ステップＳ２０７）。なお、解
析結果のチェックの方法は、第１の実施形態で説明した
方法と同様である。The voice data processing unit 7 analyzes the voice data passed from the control unit 4 with reference to a voice data processing dictionary, and obtains analysis results (such as the initials of kanji, finals, and tone information). The analysis result is output to the control unit 4 (step S206). The control unit 4 checks the analysis result output from the audio data processing unit 7, and determines whether or not the tone information has been correctly extracted (step S207). The method of checking the analysis result is the same as the method described in the first embodiment.

【００３０】この解析結果のチェックの結果、声調情報
が正しく取り出せたと判断した場合（ステップＳ２０７
のＹＥＳ）、制御部４は、データ記憶部９に記憶された
変換候補の中から、取り出した声調情報を持つものを抽
出し、出力部１０で現在表示している変換候補に代えて
抽出した変換候補を入れ替えて表示するよう制御する
（ステップＳ２０８）。As a result of checking the analysis result, when it is determined that tone information has been correctly extracted (step S207)
YES), the control unit 4 extracts a conversion candidate having the extracted tone information from the conversion candidates stored in the data storage unit 9 and extracts the conversion candidate instead of the conversion candidate currently displayed on the output unit 10. Control is performed so that the conversion candidates are switched and displayed (step S208).

【００３１】一方、ステップＳ２０７の判断において、
声調情報が正しく取り出せなかったと判断した場合（ス
テップＳ２０７のＮＯ）、制御部４は、ステップＳ２０
４に戻り、再び、ポインティングデバイス３による変換
候補の指定が、ユーザにより取り消されたか否かを判断
する。On the other hand, in the judgment of step S207,
When it is determined that the tone information has not been correctly extracted (NO in step S207), the control unit 4 proceeds to step S20.
4 again, it is determined whether or not the designation of the conversion candidate by the pointing device 3 has been canceled by the user.

【００３２】また、ステップＳ２０２の判断においてユ
ーザから変換候補が指定されていない判断した場合（ス
テップＳ２０２のＮＯ）、ステップＳ２０３の判断にお
いてポインティングデバイス３により指定された変換候
補ついてその他の変換候補がないと判断した場合（ステ
ップＳ２０３のＮＯ）、ステップＳ２０４の判断におい
てユーザによる変換候補の指定が取り消されたと判断し
た場合（ステップＳ２０４のＹＥＳ）、制御部４は、声
調情報の入力による変換候補の表示順位変更処理を終了
する。If it is determined in step S202 that no conversion candidate has been specified by the user (NO in step S202), there is no other conversion candidate for the conversion candidate specified by the pointing device 3 in step S203. When it is determined (NO in step S203), in the determination in step S204, it is determined that the designation of the conversion candidate by the user has been canceled (YES in step S204), the control unit 4 displays the conversion candidate by inputting tone information. The order change processing ends.

【００３３】以上説明した動作によりピンイン変換を行
なった際の具体例を図９、図１０に示している。図９
は、変換候補「該」に代えて表示すべき変換候補が１つ
場合の例を示しており、変換候補「該」を指定して第３
声の声調情報を入力することで、「改」が入れ替えて表
示される様子を示している。また、図１０は、変換候補
「該」に代えて表示すべき変換候補が２つ場合の例を示
しており、変換候補「該」を指定して第４声の声調情報
を入力することで、「蓋」と「概」が一覧で表示される
様子を示している。変換候補が複数ある場合はこのよう
に一覧表示し、ユーザに、ポインティングデバイス３を
用いて所望の変換候補を選択させるようにしている。FIGS. 9 and 10 show specific examples when the pinyin conversion is performed by the operation described above. FIG.
Shows an example in which one conversion candidate to be displayed is displayed in place of the conversion candidate “the”.
The figure shows a state in which “kai” is replaced and displayed by inputting the tone information of the voice. FIG. 10 shows an example in which there are two conversion candidates to be displayed in place of the conversion candidate “the”, and by specifying the conversion candidate “the” and inputting the tone information of the fourth voice. , “Lid” and “Overview” are displayed in a list. When there are a plurality of conversion candidates, a list is displayed in this way, and the user is caused to select a desired conversion candidate using the pointing device 3.

【００３４】以上のようにピンイン文字列を用いたピン
イン変換を行なって得られた変換候補に対して、音声入
力により声調情報を入力し、所望する声調情報を持つ変
換候補を優先的に表示させることができる。従って、声
調情報を考慮せずに得た変換候補を単純に一覧表示し、
その中から所望する変換候補を探す場合に比べ、ユーザ
における変換候補の選択操作を効率良く行なわせること
が可能となる。For the conversion candidates obtained by performing the Pinyin conversion using the Pinyin character string as described above, tone information is input by voice input, and conversion candidates having desired tone information are displayed preferentially. be able to. Therefore, a list of conversion candidates obtained without considering tone information is simply displayed as a list,
As compared with a case where a desired conversion candidate is searched from among them, it is possible to allow the user to efficiently perform a conversion candidate selection operation.

【００３５】（第３の実施形態）次に、第３の実施形態
について説明する。第３の実施形態に係わる中国語文書
作成装置の構成は、図５に示される第１の実施形態の構
成と同じである。(Third Embodiment) Next, a third embodiment will be described. The configuration of the Chinese document creating apparatus according to the third embodiment is the same as the configuration of the first embodiment shown in FIG.

【００３６】第３の実施形態は、ピンイン変換用辞書６
に登録されていない単語を新たなユーザ単語として辞書
登録するユーザ単語登録に関するものである。ユーザ単
語を登録する場合、声調情報が分からない等の理由か
ら、登録するユーザ単語に声調情報を指定しない場合が
ある。この場合、その後のピンイン変換時にユーザ単語
に関する声調情報が入力された際に、この声調情報をユ
ーザ単語の声調情報として自動的に登録するようにした
ものである。なお、ここでは、ユーザ単語の登録は、ピ
ンイン変換用辞書６に対して行なわれるものとして説明
するが、ピンイン変換用辞書６とは別に、ユーザ単語を
登録するためのユーザ辞書を設けても良い。In the third embodiment, the dictionary 6 for pinyin conversion is used.
In user dictionary for registering a word that is not registered in the dictionary as a new user word. When registering a user word, there is a case where tone information is not specified for a user word to be registered because tone information is not known. In this case, when tone information relating to the user word is input during the subsequent pinyin conversion, this tone information is automatically registered as tone information of the user word. Here, the description will be given assuming that the registration of the user words is performed on the Pinyin conversion dictionary 6, but a user dictionary for registering the user words may be provided separately from the Pinyin conversion dictionary 6. .

【００３７】次に、第３の実施形態のユーザ単語への声
調情報付加処理の動作について、図１１のフローチャー
トを参照して説明する。ここで、変換用辞書６に通常に
登録されている単語は、図１２のようにピンイン文字列
の声母、韻母、及び、声調情報（この例では、声調番号
を記憶している）と、これに対応する漢字が対応付けら
れた状態で記憶されているが、これに対して、図１３の
ようにユーザが登録したユーザ単語には、声調情報が記
憶されていないとする。このように、声調情報の記憶さ
れていないユーザ単語に関して、ピンイン変換時に声調
情報が入力された際の処理について以下に説明する。Next, the operation of adding tone information to a user word according to the third embodiment will be described with reference to the flowchart of FIG. Here, the words normally registered in the conversion dictionary 6 are the initial, final, and tone information (in this example, the tone number is stored) of the Pinyin character string as shown in FIG. It is assumed that the kanji corresponding to is stored in a state in which the tone information is not stored in the user word registered by the user as shown in FIG. A process performed when tone information is input during pinyin conversion for a user word in which tone information is not stored will be described below.

【００３８】ステップＳ３０１からＳ３０７までの処理
は、第１の実施形態におけるピンイン変換処理の動作と
同様である。即ち、制御部４は、ユーザからキーボード
入力部１を介して入力されるピンイン文字列を受け付
け、そのピンイン文字列をデータ記憶部９に記憶し（ス
テップＳ３０１）、次いで、ユーザから音声入力部２を
介して所望する漢字の発音表記が音声データとして入力
されるのを待ち、ピンイン文字列が入力されてから一定
時間内に音声データが入力されるとその音声データをデ
ータ記憶部９に記憶する（ステップＳ３０２、Ｓ３０
３）。音声データ処理部７が、入力された音声データ
を、音声データ処理用辞書８を参照して解析し、この音
声データに対応する漢字の声母、韻母、声調情報などの
解析結果を取得し、得られた解析結果を制御部４に出力
する（ステップＳ３０４）。制御部４は、音声データ処
理部７から出力された解析結果をチェックし声調情報が
正しく取り出せたか否かを判断する（ステップＳ３０
５）。制御部４が、声調情報が正しく取り出せたと判断
した場合、ピンイン変換部５が、制御部４から出力され
たピンイン文字列と声調情報とからピンイン変換用辞書
６を参照することによりピンイン変換を行ない、ピンイ
ン文字列に対応する変換候補を取得する（ステップＳ３
０６）。一方、ステップＳ３０７の判断においてピンイ
ン文字列が入力されてから一定時間内に音声入力がない
場合、及び、ステップＳ３０５の判断において音声デー
タの解析処理の結果、声調情報が正しく取り出せなかっ
たと判断した場合のいずれかの場合は、ピンイン変換部
５は、ピンイン文字列のみの情報に基づいて、ピンイン
変換用辞書６を参照してピンイン変換を行う（ステップ
Ｓ３０７）。The processing in steps S301 to S307 is the same as the operation of the pinyin conversion processing in the first embodiment. That is, the control unit 4 receives a Pinyin character string input from the user via the keyboard input unit 1, stores the Pinyin character string in the data storage unit 9 (step S301), and then inputs the voice input unit 2 from the user. Waits for the desired kanji phonetic notation to be input as voice data via the PC, and if voice data is input within a predetermined time after the pinyin character string is input, the voice data is stored in the data storage unit 9. (Steps S302, S30
3). The voice data processing unit 7 analyzes the input voice data with reference to the voice data processing dictionary 8 and obtains the analysis results of the kanji's initial, final, tone information and the like corresponding to the audio data. The obtained analysis result is output to the control unit 4 (step S304). The control unit 4 checks the analysis result output from the audio data processing unit 7 and determines whether tone information has been correctly extracted (step S30).
5). When the control unit 4 determines that tone information has been correctly extracted, the pinyin conversion unit 5 performs pinyin conversion by referring to the pinyin conversion dictionary 6 from the pinyin character string and tone information output from the control unit 4. , The conversion candidate corresponding to the Pinyin character string is obtained (step S3).
06). On the other hand, when there is no voice input within a certain period of time after the pinyin character string is input in the determination of step S307, and when it is determined in the determination of step S305 that tone information cannot be correctly extracted as a result of the voice data analysis process In either case, the Pinyin conversion unit 5 performs the Pinyin conversion with reference to the Pinyin conversion dictionary 6 based on the information of only the Pinyin character string (step S307).

【００３９】次に、制御部４は、ステップＳ３０６にお
いてピンイン変換して得られた変換候補と同じ単語で且
つ声調情報が記憶されていないユーザ単語が存在するか
否かを判断する（ステップＳ３０８）。即ち、制御部４
は、ピンイン変換用辞書６を検索し、変換候補のピンイ
ン文字列（声母、韻母）と漢字の組合せと同じ組合せを
持つユーザ単語が登録されているか否かを判断し、変換
候補の組合せと同じユーザ単語が登録されている場合、
更にそのユーザ単語に声調情報が登録されているか否か
を判断する。Next, the control unit 4 determines whether or not there is a user word that is the same word as the conversion candidate obtained by pinyin conversion in step S306 and for which tone information is not stored (step S308). . That is, the control unit 4
Searches the Pinyin conversion dictionary 6 to determine whether or not a user word having the same combination as the combination of the Pinyin character string (initial, final) of the conversion candidate and the Chinese character is registered, and If the user word is registered,
Further, it is determined whether tone information is registered for the user word.

【００４０】この結果、変換候補のピンイン文字列と漢
字の組合せと同一で、且つ、声調情報の登録されていな
いユーザ単語が存在すると判断した場合（ステップＳ３
０８のＹＥＳ）、制御部４は、ピンイン変換用辞書６
に、ユーザ単語の声調情報として、ユーザ単語と同一の
組合せの変換候補の声調情報を登録する（ステップＳ３
０９）。ユーザ単語の声調情報の登録後のピンイン変換
用辞書６の例を、図１４に示している。As a result, when it is determined that there is a user word which is the same as the combination of the Pinyin character string and the kanji of the conversion candidate and for which tone information is not registered (step S3).
08), the control unit 4 sets the pinyin conversion dictionary 6
The tone information of the conversion candidate having the same combination as the user word is registered as the tone information of the user word (step S3).
09). FIG. 14 shows an example of the Pinyin conversion dictionary 6 after registering the tone information of the user word.

【００４１】次いで、制御部４は、ステップＳ３０６若
しくはステップＳ３０７において得られた変換候補を、
出力部１０にて表示させる（ステップＳ３１０）。以上
説明した動作の具体例を図１５を参照して説明する。Next, the control unit 4 converts the conversion candidates obtained in step S306 or step S307 into
It is displayed on the output unit 10 (step S310). A specific example of the operation described above will be described with reference to FIG.

【００４２】ピンイン変換用辞書６には、図１３のよう
に、ユーザ単語「互聯網」が声調情報の指定のない状態
で既に登録されているとする。このとき、ユーザからピ
ンイン文字列「ｈｕｌｉａｎｗａｎｇ」が入力さ
れ、次いで「ｈｕ（第４声）ｌｉａｎ（第２声）ｗ
ａｎｇ（第３声）」と音声データが入力されると、この
ピンイン文字列と、音声データから得た声調情報から、
ピンイン変換を行ない変換候補「互聯網」を得る。ピン
イン変換結果後、ピンイン文字列「ｈｕｌｉａｎｗ
ａｎｇ」と漢字「互聯網」の組合せについてピンイン変
換用辞書６を検索し、同一の組合せを持ち且つ声調情報
の指定のないユーザ単語を探し出す。図１３のようにピ
ンイン変換用辞書６には、上記した組合せに対応し、且
つ、声調情報が指定されていないユーザ単語が記憶され
ているので、このユーザ単語に声調情報を登録する。ユ
ーザ単語に対して声調情報が登録されたピンイン変換用
辞書６の様子が図１４である。ユーザ単語「互聯網」に
対して声調情報「第４声、第２声、第３声」がピンイン
変換用辞書６に登録される。It is assumed that the user word “reciprocal network” has already been registered in the Pinyin conversion dictionary 6 without specifying tone information as shown in FIG. At this time, the user inputs a pinyin character string “hu lian wang”, and then “hu (fourth voice) lian (second voice) w
ang (third voice) ”, and the pinyin character string and the tone information obtained from the voice data,
Pinyin conversion is performed to obtain a conversion candidate “interlink”. After the pinyin conversion result, the pinyin character string “hu lian w
The pinyin conversion dictionary 6 is searched for the combination of “ang” and the kanji “reciprocal network”, and a user word having the same combination and without specifying tone information is searched. As shown in FIG. 13, the pinyin conversion dictionary 6 stores a user word corresponding to the above-described combination and for which tone information is not specified, so that tone information is registered in this user word. FIG. 14 shows the state of the pinyin conversion dictionary 6 in which tone information is registered for a user word. Tone information “fourth voice, second voice, third voice” for the user word “reciprocal network” is registered in the Pinyin conversion dictionary 6.

【００４３】以上説明した処理によって、ユーザ単語の
登録時に声調情報を登録していなくても、その後のピン
イン変換の際に同一の単語に対して音声入力により声調
情報を入力すると、入力した声調情報がユーザ単語の登
録情報として自動的に登録されるので、ユーザによるユ
ーザ単語登録操作が簡単となる。According to the above-described processing, even when tone information is not registered at the time of registering a user word, if tone information is input by voice input for the same word during the subsequent pinyin conversion, the input tone information is input. Is automatically registered as user word registration information, so that the user can easily perform a user word registration operation.

【００４４】なお、上述した各実施形態において記載し
た手法は、コンピュータに実行させることのできるプロ
グラムとして、例えば磁気ディスク（ハードディスク
等）、光ディスク（ＣＤ−ＲＯＭ、ＤＶＤ等）、半導体
メモリ等の記憶媒体に書きこんで各種装置に適用した
り、通信媒体により伝送して各種装置に適用することも
可能である。本装置を実現するコンピュータは、記憶媒
体に記憶されたプログラムを読込み、このプログラムに
よって動作が制御されることにより、上述した処理を実
行する。The method described in each of the above-described embodiments is a program that can be executed by a computer, for example, a storage medium such as a magnetic disk (hard disk or the like), an optical disk (CD-ROM, DVD or the like), or a semiconductor memory. And can be applied to various devices, or transmitted by a communication medium and applied to various devices. The computer that realizes the present apparatus reads the program stored in the storage medium, and executes the above-described processing by controlling the operation of the program.

【００４５】[0045]

【発明の効果】以上詳述したように本発明によれば、声
調情報を用いてピンイン変換を行なうに際し、声調情報
の入力を所望する漢字の音声情報を入力することで行な
うことが可能であるため、ピンイン文字列のみキー入力
を行なえば良く、ユーザによる入力操作を容易且つ迅速
に行なうことが可能である。As described in detail above, according to the present invention, when performing pinyin conversion using tone information, it is possible to input tone information by inputting desired kanji voice information. Therefore, the key input only needs to be performed for the Pinyin character string, and the input operation by the user can be performed easily and quickly.

【００４６】また、ピンイン文字列のみでピンイン変換
を行なった後、所定の変換候補に対して、音声情報を入
力することにより声調情報を入力し、所望する漢字の変
換候補を優先的に表示することが可能であるため、ユー
ザによる変換候補の選択操作を容易且つ迅速に行なうこ
とが可能である。After the Pinyin conversion is performed using only the Pinyin character string, tone information is input by inputting voice information for a predetermined conversion candidate, and the desired Kanji conversion candidate is displayed preferentially. Therefore, the user can easily and quickly select a conversion candidate.

【００４７】また、ピンイン変換の際に音声情報により
入力された声調情報を、ユーザ単語辞書に登録されたユ
ーザ単語の声調情報として登録するようにしたので、ユ
ーザは声調情報が分からない場合はユーザ単語辞書に登
録しなくても良くなり、ユーザ単語登録の際の利便性が
向上する。Further, the tone information input by the speech information at the time of the pinyin conversion is registered as the tone information of the user word registered in the user word dictionary. It is not necessary to register in the word dictionary, and the convenience in registering user words is improved.

[Brief description of the drawings]

【図１】漢字とピンイン文字列との対応例を示した図。FIG. 1 is a diagram showing an example of correspondence between kanji and pinyin character strings.

【図２】ピンイン文字列のみでピンイン変換を行なう際
の動作を示した図。FIG. 2 is a diagram showing an operation when pinyin conversion is performed using only a pinyin character string;

【図３】同じピンイン文字列を持つ漢字例示した図。FIG. 3 is a diagram illustrating a kanji having the same Pinyin character string.

【図４】同じピンイン文字列を持つ漢字の声調情報を示
した図。FIG. 4 is a diagram showing tone information of Chinese characters having the same Pinyin character string.

【図５】本発明に係わる中国語文書作成装置の構成を示
した図。FIG. 5 is a diagram showing a configuration of a Chinese document creating apparatus according to the present invention.

【図６】第１の実施形態のピンイン変換処理の動作を示
すフローチャート。FIG. 6 is a flowchart illustrating an operation of a pinyin conversion process according to the first embodiment;

【図７】第１の実施形態のピンイン変換処理の具体例を
示した図。FIG. 7 is a diagram showing a specific example of a pinyin conversion process according to the first embodiment;

【図８】第２の実施形態の声調情報の入力による変換候
補の表示順位変換処理の動作を示すフローチャート。FIG. 8 is a flowchart showing an operation of a display order conversion process of conversion candidates by input of tone information according to the second embodiment;

【図９】第２の実施形態の声調情報の入力による変換候
補の表示順位変換処理の第１の具体例を示した図。FIG. 9 is a diagram showing a first specific example of a display order conversion process of conversion candidates by input of tone information according to the second embodiment;

【図１０】第２の実施形態の声調情報の入力による変換
候補の表示順位変換処理の第２の具体例を示した図。FIG. 10 is a diagram showing a second specific example of the display order conversion process of conversion candidates by inputting tone information according to the second embodiment;

【図１１】本発明の第３の実施形態のユーザ単語への声
調情報付加処理の動作を示すフローチャート。FIG. 11 is a flowchart showing the operation of processing for adding tone information to a user word according to the third embodiment of the present invention.

【図１２】変換用辞書６に格納された通常の単語の格納
例を示した図。FIG. 12 is a diagram showing a storage example of normal words stored in the conversion dictionary 6.

【図１３】変換用辞書６に格納された声調情報の付加さ
れていないユーザ単語の格納例を示した図。FIG. 13 is a diagram showing a storage example of user words to which tone information is not added, which is stored in the conversion dictionary 6.

【図１４】変換用辞書６に格納された声調情報の付加さ
れたユーザ単語の格納例を示した図。FIG. 14 is a diagram showing a storage example of a user word to which tone information stored in a conversion dictionary 6 is added.

【図１５】本発明の第３の実施形態のユーザ単語への声
調情報付加処理の具体例を示した図。FIG. 15 is a diagram showing a specific example of a process of adding tone information to a user word according to the third embodiment of the present invention.

[Explanation of symbols]

１…キーボード入力部２…音声入力部３…ポインティングデバイス４…制御部５…ピンイン変換部６…ピンイン変換用辞書７…音声データ処理部８…音声データ処理用辞書９…データ記憶部１０…出力部 REFERENCE SIGNS LIST 1 keyboard input unit 2 voice input unit 3 pointing device 4 control unit 5 pinyin conversion unit 6 pinyin conversion dictionary 7 voice data processing unit 8 voice data processing dictionary 9 data storage unit 10 output Department

Claims

[Claims]

1. A pinyin character string input means for inputting a pinyin character string, a pinyin conversion dictionary for storing a combination of a pinyin character string and tone information and a kanji corresponding to the combination in association with each other; Voice information input means for inputting the kanji voice information corresponding to the pinyin character string input from, and the voice information input from the voice information input means is analyzed, and the pinyin character input from the pinyin character string input means is analyzed. Voice information analysis means for obtaining tone information corresponding to a column; and the pinyin conversion dictionary based on the pinyin character string input from the pinyin character string input means and the tone information obtained by the voice information analysis means. And converting the kanji obtained by the pinyin conversion means to obtain the corresponding kanji Chinese document creating apparatus characterized by comprising a display means for displaying as a complement.

2. A Pinyin character string input means for inputting a Pinyin character string, a Pinyin conversion dictionary for storing a combination of a Pinyin character string and tone information and a Chinese character corresponding to the combination in association with each other; Pinyin conversion means for referring to the Pinyin conversion dictionary based on the Pinyin character string input from the column input means to obtain corresponding kanji and tone information, and displaying the kanji obtained by the Pinyin conversion means as conversion candidates A voice information inputting means for inputting voice information of a desired kanji, and analyzing voice information inputted from the voice information inputting means, and corresponding to a pinyin character string inputted from the pinyin character string inputting means. Voice information analyzing means for obtaining tone information; and a conversion candidate obtained by the pinyin conversion means, Control means for controlling display of conversion candidates by the display means so as to preferentially display, as conversion candidates, kanji having tone information obtained by the voice information analysis means, .

3. A Pinyin character string input means for inputting a Pinyin character string, a Pinyin conversion dictionary for storing a combination of a Pinyin character string and tone information and a Chinese character corresponding to the combination in association with each other; Voice information input means for inputting the kanji voice information corresponding to the pinyin character string input from, and the voice information input from the voice information input means is analyzed, and the pinyin character input from the pinyin character string input means is analyzed. Voice information analysis means for obtaining tone information corresponding to a sequence, and the pinyin conversion dictionary based on the pinyin character string input from the pinyin character string input means and the tone information obtained by the voice information analysis means. Pinyin conversion means for referring to and acquiring the corresponding kanji, and a Pinyin character string and voice for a newly registered user word Searching the user word dictionary, a user word with the same kanji acquired Chinese character by the pinyin conversion means from the user word dictionary for storing in association with Chinese character corresponding to the combination and the combination with the information, respectively,
And a user word registration unit for registering tone information acquired by the speech information analysis unit as tone information of the searched user word.

4. A Chinese character having a Pinyin character string input means for inputting a Pinyin character string, a voice information input means for inputting Chinese character voice information, and a display means for displaying a Chinese character corresponding to the Pinyin character string as a conversion candidate. In the Chinese document creation method in the word document creation device, a Pinyin character string input from the Pinyin character string input means is received, voice information input from the voice information input means is analyzed, and the input Pinyin character string is input. The tone information corresponding to the obtained Pinyin character string and the acquired tone information are referred to a Pinyin conversion dictionary that stores the combination of the Pinyin character string and the tone information and the kanji corresponding to the combination respectively. A kanji corresponding to the received pinyin character string is acquired based on tone information, and the acquired kanji is used as a conversion candidate. Chinese document creating apparatus characterized by controlling so as to be displayed on said display means.

5. A Chinese character having a Pinyin character string input means for inputting a Pinyin character string, voice information input means for inputting Chinese character voice information, and a display means for displaying a Kanji character corresponding to the Pinyin character string as a conversion candidate. In the Chinese document creation method in the word document creation device, a Pinyin character string input from the Pinyin character string input unit is received, and a combination of the Pinyin character string and tone information based on the received Pinyin character string and the combination A kanji and tone information corresponding to the received pinyin character string are obtained by referring to a pinyin conversion dictionary that stores kanjis corresponding to the corresponding kanjis, and the obtained kanjis are displayed as conversion candidates on the display means. Controlling, and analyzing the voice information input from the voice information input means after displaying the conversion candidate, and The tone information corresponding to the obtained Pinyin character string, and a conversion candidate by the display means for preferentially displaying a kanji having the same tone information as the acquired tone information among the conversion candidates obtained by the Pinyin conversion as a conversion candidate. Controlling the display of a Chinese document.

6. A Pinyin character string input means for inputting a Pinyin character string, voice information input means for inputting Chinese character voice information, a display means for displaying a Chinese character corresponding to the Pinyin character string as a conversion candidate, and a user word In a Chinese document creation method in a Chinese document creation device having a user word dictionary capable of registering, a Pinyin character string input from the Pinyin character string input means is received, and voice information input from the voice information input means is received. To obtain the tone information corresponding to the input Pinyin character string, and refer to a Pinyin conversion dictionary that stores the combination of the Pinyin character string and the tone information and the kanji corresponding to the combination in association with each other. A Chinese character corresponding to the received pinyin character string based on the received pinyin character string and the acquired tone information. A character is obtained, a user word having the same kanji as the obtained kanji is searched from the user word dictionary, and tone information obtained by voice information analysis is registered in the user word dictionary as tone information of the searched user word. An apparatus for creating a Chinese document.