JP3102636B2

JP3102636B2 - Notation string conversion device and notation string conversion method

Info

Publication number: JP3102636B2
Application number: JP10175643A
Authority: JP
Inventors: 和宣浮川; 英一四宮; 和徳弓削
Original assignee: 株式会社ジャストシステム
Priority date: 1998-06-23
Filing date: 1998-06-23
Publication date: 2000-10-23
Anticipated expiration: 2018-06-23
Also published as: JP2000010970A

Abstract

PROBLEM TO BE SOLVED: To more precisely recognize voice. SOLUTION: A notation character string outputted from a voice data conversion means 5 is stored in a non-decided character string storage means 12 in a non-decided state and a read character string is stored in a read character string storage means 11. A language rule storage means 17 stores a language rule. An automatic conversion means 15 judges the relation of the specified notation character string in a plurality of notation character strings stored in the non-decided character string storage means 12 based on the language rule of the language rule storage means 17. When the relation of the notation character strings is judged to match the language rule, the notation character string in the non-decided state is changed to match the language rule. A display means 19 displays the character string of the non-decided character string storage means. Thus, precise recognition can be executed by changing recognized voice data based on the language rule.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】この発明は、音声認識技術に
関し、特に表記文字列の変換精度向上に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a speech recognition technology, and more particularly, to an improvement in the accuracy of conversion of a written character string.

【０００２】[0002]

【従来技術およびその課題】今日、コンピュータに文字
列を入力するのに、図９に示すような音声入力システム
が着目されている。音声変換辞書には音声データの特徴
量と、この特徴量に対応する表記文字列がテーブルとし
てあらかじめ登録されている。マイク６１から音声が入
力されると、音声変換部６３は、入力された音声から所
定の特徴量を抽出し、音声変換辞書６５に登録されてい
る各表記文字列の特徴量とのマッチングを演算し、一致
度のもっとも高いものを表記文字列として出力する。2. Description of the Related Art To input a character string to a computer, a voice input system as shown in FIG. In the speech conversion dictionary, a feature amount of speech data and a notation character string corresponding to the feature amount are registered in advance as a table. When a voice is input from the microphone 61, the voice conversion unit 63 extracts a predetermined feature amount from the input voice and calculates a matching with a feature amount of each written character string registered in the voice conversion dictionary 65. Then, the one with the highest matching degree is output as a notation character string.

【０００３】しかし、上記音声入力システムにおいて
は、以下のような問題があった。音声データの特徴量で
一致度を判断しているので、登録している表記文字列以
外の音声データが与えられた場合は、操作者の望む表記
文字列に変換できない。However, the above-mentioned voice input system has the following problems. Since the matching degree is determined based on the feature amount of the voice data, if voice data other than the registered text string is given, the voice data cannot be converted into a text string desired by the operator.

【０００４】このため、多くの表記文字列を登録してお
くことも考えられる。しかし、登録対象の表記文字列が
多ければ多いほど、マッチング演算に時間がかかるとと
もに、発音の仕方が少しずれただけで、他の変換候補と
認識してしまう。このため、変換精度も低下するおそれ
がある。[0004] For this reason, it is conceivable to register many notation character strings. However, as the number of written character strings to be registered increases, the matching operation takes more time, and even if the pronunciation method is slightly shifted, it is recognized as another conversion candidate. For this reason, the conversion accuracy may be reduced.

【０００５】特に、音声入力システムの音声変換辞書は
統計的によく使用される表記文字列について、その特徴
量を記憶しておくので、一般的でない専門分野の文章に
おいては、誤変換される可能性が高い。[0005] In particular, the speech conversion dictionary of the speech input system stores the characteristic amount of a notation character string that is statistically frequently used, so that a sentence in an unusual specialized field may be erroneously converted. High in nature.

【０００６】この発明は上記問題を解決し、音声変換辞
書がそれほど大きくせずとも、変換効率を向上でき、ま
た未登録表記文字列に変換可能な表記文字列変換装置ま
たはその方法を提供する。The present invention solves the above-mentioned problem, and provides a typographical character string conversion apparatus and method capable of improving conversion efficiency and converting to an unregistered typographical character string without making the speech conversion dictionary so large.

【０００７】[0007]

【課題を解決するための手段および発明の効果】１）本
発明にかかる表記文字列変換装置は、音声データの特徴
量に対応する表記文字列と前記表記文字列に対応する読
み文字列とを記憶した音声変換辞書を用いることによ
り、入力された音声データを前記音声変換辞書に記録さ
れた特徴量に最も一致する表記文字列に変換し対応する
読み文字列とともに出力する音声データ変換手段から与
えられた表記文字列を、言語規則に合致するように変換
する表記文字列変換装置であって、 1)特定の表記文字列間の対応関係を言語規則に基づいて
記憶するとともに前記表記文字列に対応する対応読み文
字列を併せて記憶する言語規則記憶手段、 2)前記音声データ変換手段から与えられた表記文字列を
未確定状態で記憶する未確定文字列記憶手段、 3)前記音声データ変換手段から与えられた読み文字列を
記憶する読み文字列記憶手段、前記未確定文字列記憶手
段に記憶された複数の表記文字列のうち特定の表記文字
列間の関係が前記対応関係に合致しているか否かを判断
するとともに、合致していない場合には前記読み文字列
記憶手段に記憶された読み文字列のうち前記特定の表記
文字列に対応する読み文字列と前記対応読み文字列とに
基づいて前記特定の表記文字列を前記対応関係に合致す
る表記文字列に変換する自動変換手段を備えている。Means for Solving the Problems and Effects of the Invention 1) The notation character string conversion device according to the present invention is characterized by the characteristics of voice data.
The notation character string corresponding to the quantity and the reading
By using a speech conversion dictionary that stores only character strings
The input voice data is recorded in the voice conversion dictionary.
To the notation character string that best matches the
The notation string given from the voice data conversion means for output together with the read character string, a writing character string converter for converting to match the language rules, 1) language correspondence between specific writing character string According to the rules
A corresponding reading sentence that is stored and corresponds to the notation character string
Language rule storage means for storing character strings together ; 2) undetermined character string storage means for storing the notation character string provided from the voice data conversion means in an undetermined state; 3) provided from the voice data conversion means. Reading character string storage means for storing the read character string, and whether or not the relationship between specific notation character strings among the plurality of notation character strings stored in the undetermined character string storage means matches the correspondence relationship Judge
And if not, read the character string
The specific notation of the reading character string stored in the storage means;
The reading character string corresponding to the character string and the corresponding reading character string
Match the specific notation string with the correspondence based on
Automatic conversion means for converting into a notation character string .

【０００８】このように、音声変換辞書を用いて最も一
致度の高い表記文字列および読み文字列が音声データ変
換手段から与えられると、この表記文字列を未確定状態
で記憶するとともに、その読み文字列を記憶しておき、
前記複数の表記文字列のうち特定の表記文字列の関係を
あらかじめ記憶した言語規則に基づき判断して、表記文
字列間の関係が前記言語規則に合致していないと判断し
た場合には、前記未確定状態の表記文字列が前記言語規
則に合致するように変更する。したがって、多くの表記
文字列についての音声変換規則がなくとも、前記言語規
則に合致した表記文字列を得ることができる。また、前
記読み文字列を考慮して、前記特定の表記文字列の関係
についての判断および表記文字列の変更を行う。このよ
うに、読み文字列を考慮することにより、より正確に前
記判断及び変更ができる。 [0008] As described above, using the speech conversion dictionary,
Character strings and reading character strings with a high degree of
If given by the conversion means, this notation character string
And memorize the reading character string,
The relationship of a specific notation string among the plurality of notation strings
Judgment based on language rules stored in advance
Judge that the relationship between strings does not conform to the language rules
If not, the written character string in the undetermined state is
Change to match the rule. Therefore, many notations
Even if there are no speech conversion rules for character strings,
A notation character string that matches the rule can be obtained. Also before
Considering the written string, the relationship between the specific written string
And change the notation character string. This
As described above, by considering the reading string,
You can make judgments and make changes.

【０００９】２）本発明にかかるコンピュータを用いた
表記文字列変換方法は、音声データ変換手段から与えら
れた表記文字列を言語規則に合致するように変換する表
記文字列変換方法であって、前記音声データ変換手段
は、音声データの特徴量に対応する表記文字列と前記表
記文字列に対応する読み文字列とを記憶した音声変換辞
書を用いることにより、入力された音声データを前記音
声変換辞書に記録された特徴量に最も一致する表記文字
列に変換し対応する読み文字列とともに出力する手段で
あって、前記コンピュータは以下の処理を実行するこ
と、特定の表記文字列間の対応関係を言語規則に基づい
て記憶するとともに前記表記文字列に対応する対応読み
文字列を併せて記憶しておき、前記音声データ変換手段
から与えられた表記文字列を未確定状態で記憶するとと
もに前記音声データ変換手段から与えられた読み文字列
を記憶し、前記未確定状態で記憶された複数の表記文字
列のうち特定の表記文字列の関係が前記対応関係に合致
しているか否かを判断するとともに、合致していない場
合には前記音声データ変換手段から与えられた読み文字
列と前記対応読み文字列とに基づいて前記特定の表記文
字列を前記対応関係に合致する表記文字列に変換する。
したがって、多くの表記文字列についての音声変換規則
がなくとも、前記言語規則に合致した表記文字列を得る
ことができる。また、前記特定の表記文字列の関係につ
いての判断および表記文字列の変更は前記読み文字列が
考慮されて行われる。このように、読み文字列を考慮す
ることにより、より正確に前記判断及び変更ができる。2) A notation character string conversion method using a computer according to the present invention is a notation character string conversion method for converting a notation character string provided from voice data conversion means so as to conform to language rules. The audio data conversion means includes a notation character string corresponding to a feature amount of the audio data and the table.
A speech conversion dictionary that stores the reading character string corresponding to the written character string
The input voice data can be
Notation characters that best match the features recorded in the voice conversion dictionary
By converting to a column and outputting it with the corresponding reading character string
The computer executes the following processing, and determines the correspondence between specific notation character strings based on language rules.
And the corresponding reading corresponding to the notation character string.
A character string is also stored, and the notation character string provided from the voice data conversion means is stored in an undetermined state, and a read character string provided from the voice data conversion means is stored in the undetermined state. The relationship of a specific notation character string among the plurality of notation character strings stored in the step matches the correspondence.
Judge whether or not the character is read, and if not , read the character given by the voice data conversion means.
The specific notation sentence based on a column and the corresponding reading character string
The character string is converted into a written character string that matches the correspondence .
Therefore, it is possible to obtain a written character string that conforms to the language rules even if there are no speech conversion rules for many written character strings. In addition, the determination regarding the relationship of the specific written character string and the change of the written character string are performed in consideration of the read character string. As described above, the determination and the change can be performed more accurately by considering the read character string.

【００１０】４，５）本発明にかかる表記文字列変換装
置または方法においては、前記言語規則記憶手段は、あ
る読み文字列に対して類似の読みを有する表記文字列を
記憶しており、前記自動変換手段は、前記未確定文字列
記憶手段に記憶された複数の表記文字列のうち特定の表
記文字列間の関係が前記対応関係に合致していないと判
断した場合には、自動変換対象の表記文字列を前記類似
の読みを有する表記文字列に変換する。したがって、少
しずれた読みが与えられても、適切な表記文字列を得る
ことができる。[0010] 4,5) in notation string conversion device or method according to the present invention, the language rules storing means, the writing character string having a reading similar <br/> stored for a read character string The automatic conversion means outputs the undetermined character string.
A specific table among a plurality of notation character strings stored in the storage means
If it is determined that the relationship between the character strings does not match the correspondence , the character string to be automatically converted is converted to the character string having the similar pronunciation. Therefore, even if a slightly deviated reading is given, an appropriate written character string can be obtained.

【００１１】なお、自動変換手段は、実施形態において
は、ＣＰＵ２３のステップＳＴ１１の処理に該当する。
また、実施形態においては、読みバッファ２７ｂの読み
文字列を参照せずに、かな漢字変換規則に合致するか否
かの判断を行ったが、読み文字列を参照するようにして
もよい。The automatic conversion means corresponds to the processing of step ST11 of the CPU 23 in the embodiment.
Further, in the embodiment, it is determined whether or not the character string matches the Kana-Kanji conversion rule without referring to the reading character string in the reading buffer 27b. However, the reading character string may be referred to.

【００１２】手動変換手段は、実施形態においては、操
作者のキー操作の種類に応じてＣＰＵ２３が行うステッ
プＳＴ１７、ステップＳＴ２１、ステップＳＴ２５、ス
テップＳＴ２７、ステップＳＴ２９、ステップＳＴ３
１、ステップＳＴ３３の処理に該当する。In the embodiment, the manual conversion means performs step ST17, step ST21, step ST25, step ST27, step ST29, step ST3 performed by the CPU 23 according to the type of key operation by the operator.
1. This corresponds to the process of step ST33.

【００１３】[0013]

【発明の実施の形態】１．機能ブロック図の説明本発明の一実施形態を図面に基づいて説明する。図１
に、表記文字列変換装置１０の機能ブロック図を示す。
表記文字列変換装置１０は、操作者が入力した音声デー
タを対応する表記文字列および読み文字列に変換して出
力する音声データ変換手段５から与えられた表記文字列
を言語規則に合致するように変換する装置であって、言
語規則記憶手段１７、未確定文字列記憶手段１２、自動
変換手段１５、読み文字列記憶手段１１、手動変更手段
１３、および出力手段１４を備えている。BEST MODE FOR CARRYING OUT THE INVENTION Description of Functional Block Diagram One embodiment of the present invention will be described with reference to the drawings. FIG.
2 shows a functional block diagram of the notation character string conversion device 10.
The notation character string conversion device 10 converts the notation character string given from the sound data conversion means 5 that converts the voice data input by the operator into the corresponding notation character string and reading character string and outputs the converted character string so as to match the language rule. The apparatus includes a language rule storage unit 17, an undetermined character string storage unit 12, an automatic conversion unit 15, a read character string storage unit 11, a manual change unit 13, and an output unit 14.

【００１４】まず、表記文字列変換装置１０に表記文字
列を与える音声データ変換手段５について説明する。音
声入力手段３に操作者が音声を入力すると、音声変換手
段６は入力された音声データから所定の特徴量を抽出す
る。音声変換辞書７にはあらかじめ複数の変換候補が記
憶されている。各変換候補は、特徴量、その表記文字列
およびその読み文字列で構成されている。音声変換手段
６は、抽出した特徴量と、前記各変換候補との特徴量と
の一致度を判断し、もっとも一致した候補の表記文字列
および読み文字列を出力する。First, a description will be given of the voice data conversion means 5 for providing a written character string to the written character string conversion device 10. When the operator inputs a voice to the voice input unit 3, the voice conversion unit 6 extracts a predetermined feature amount from the input voice data. A plurality of conversion candidates are stored in the voice conversion dictionary 7 in advance. Each conversion candidate is composed of a feature amount, its notation character string, and its reading character string. The voice conversion means 6 determines the degree of coincidence between the extracted feature quantity and the feature quantity with each of the conversion candidates, and outputs a notation character string and a reading character string of the best matching candidate.

【００１５】音声データ変換手段５から出力された表記
文字列は、未確定文字列記憶手段１２に未確定状態で記
憶され、読み文字列は読み文字列記憶手段１１に記憶さ
れる。言語規則記憶手段１７は、言語規則を記憶す
る。本実施形態においては、言語規則として、かな漢字
変換の変換辞書を採用した。より具体的にいうと、共起
関係にある表記文字列およびその読み文字列から構成さ
れた共起情報を採用した。The written character string output from the voice data conversion means 5 is stored in the undetermined character string storage means 12 in an undetermined state, and the read character string is stored in the read character string storage means 11. The language rule storage means 17 stores a language rule. In the present embodiment, a conversion dictionary for kana-kanji conversion is adopted as a language rule. More specifically, co-occurrence information composed of a notation character string having a co-occurrence relation and a reading character string thereof is employed.

【００１６】自動変換手段１５は、未確定文字列記憶手
段１２に記憶された複数の表記文字列のうち特定の表記
文字列の関係を、言語規則記憶手段１７の言語規則に基
づき判断して、表記文字列間の関係が前記言語規則に合
致していないと判断した場合には、前記未確定状態の表
記文字列が前記言語規則に合致するように変更する。The automatic conversion means 15 determines the relation of a specific notation character string among a plurality of notation character strings stored in the unconfirmed character string storage means 12 based on the language rules of the language rule storage means 17, If it is determined that the relationship between the notation character strings does not match the language rule, the notation state of the notation character string is changed to match the language rule.

【００１７】本実施形態においては、自動変換手段１５
は、読み文字列記憶手段１１に記憶された読み文字列を
考慮して、前記判断及び変更を行う。In this embodiment, the automatic conversion means 15
Performs the above determination and change in consideration of the read character string stored in the read character string storage unit 11.

【００１８】手動変更手段１３は、操作者から変換命令
が与えられると、言語規則記憶手段１７に記憶された言
語規則に基づいて、未確定文字列記憶手段１２の表記文
字列を変更する。例えば、変換命令として未確定文字列
を別の候補に変換する命令が与えられると、その未確定
文字列を別の候補に変換する。また、手動変更手段１３
は、操作者から命令が与えられると、読み文字列記憶手
段１１の読み文字列を参照して、未確定文字列記憶手段
１２の表記文字列を変更する。例えば、文節区切り位置
変更命令が操作命令として与えられると、新たな文節区
切り処理を行う。When a conversion command is given by the operator, the manual change unit 13 changes the character string written in the undetermined character string storage unit 12 based on the language rules stored in the language rule storage unit 17. For example, when an instruction to convert an undetermined character string to another candidate is given as a conversion instruction, the undetermined character string is converted to another candidate. Also, the manual changing means 13
When a command is given by the operator, the notation character string in the undetermined character string storage means 12 is changed with reference to the reading character string in the reading character string storage means 11. For example, when a clause break position change command is given as an operation command, new clause break processing is performed.

【００１９】出力手段１４は、未確定文字列記憶手段１
２または読み文字列記憶手段１１に記憶された文字列を
出力する。The output means 14 is an undetermined character string storage means 1
2 or the character string stored in the read character string storage means 11.

【００２０】２．ハードウェア構成 (2.1)概略図２に、音声変換装置および表示文字列変換装置を組み
込んだかな漢字変換装置４０を示す。かな漢字変換装置
４０は、入力装置４１、制御装置４３、表示装置４５お
よび記憶装置４７を備えている。入力装置４１は、文字
列、変換命令または選択命令等を、音声入力またはキー
入力可能である。記憶装置４７には、音声変換用の辞
書、およびかな漢字変換用の辞書が格納されている。2. 2. Hardware Configuration (2.1) Outline FIG. 2 shows a kana-kanji conversion device 40 incorporating a voice conversion device and a display character string conversion device. The kana-kanji conversion device 40 includes an input device 41, a control device 43, a display device 45, and a storage device 47. The input device 41 is capable of voice input or key input of a character string, a conversion command, a selection command, and the like. The storage device 47 stores a dictionary for voice conversion and a dictionary for kana-kanji conversion.

【００２１】操作者は、入力装置４１から音声データ入
力する。制御装置４３は、前記入力された音声データを
表記文字列に前記音声変換用辞書を用いて変換するとと
もに、変換された表記文字列をかな漢字変換辞書を用い
て、かな漢字変換規則に合致した変換が行われているか
を検査し、かな漢字変換規則に合致した変換が行われて
いない場合には、かな漢字変換規則に合致した表記文字
列に変換する。表示装置４５には、変換された表記文字
列が表示される。The operator inputs voice data from the input device 41. The control device 43 converts the input speech data into a notation character string using the speech conversion dictionary, and converts the converted notation character string using the kana-kanji conversion dictionary to match the kana-kanji conversion rule. It is checked whether the conversion has been performed, and if the conversion that conforms to the Kana-Kanji conversion rule has not been performed, it is converted to a written character string that conforms to the Kana-Kanji conversion rule. The display device 45 displays the converted notation character string.

【００２２】(2.2)詳細図３に、図２に示すかな漢字変換装置４０を、ＣＰＵを
用いて実現したハードウェア構成の一例を示す。(2.2) Details FIG. 3 shows an example of a hardware configuration in which the kana-kanji conversion device 40 shown in FIG. 2 is realized using a CPU.

【００２３】かな漢字変換装置４０は、ＣＰＵ２３、メ
モリ２７、ハードディスク２６、ＣＲＴ３０、ＦＤＤ２
５、キーボード２８、マウス３１、インターフェイス３
３、マイク３４およびバスライン２９を備えている。The kana-kanji conversion device 40 includes a CPU 23, a memory 27, a hard disk 26, a CRT 30, a FDD2
5, keyboard 28, mouse 31, interface 3
3, a microphone 34 and a bus line 29.

【００２４】マイク３４には音声データが入力される。
入力された音声データはインターフェイス３３でデジタ
ル変換され、メモリ２７の作業バッファ（図示せず）に
一時記憶される。Audio data is input to the microphone 34.
The input audio data is digitally converted by the interface 33 and temporarily stored in a work buffer (not shown) of the memory 27.

【００２５】ハードディスク２６には、音声変換プログ
ラム２６ａ，音声変換辞書２６ｂ、かな漢字変換プログ
ラム２６ｃ，かな漢字変換辞書２６ｄおよびアプリケー
ションプログラム２６ｐが記憶されている。音声変換辞
書２６ｂのデータ構造を図４に示す。このように、音声
データ毎に表記文字列およびその読み文字列が記憶され
ている。なお、音声変換辞書２６ｂには単語がどのよう
に連接するかを示す単語間連接データも記憶されている
（図示せず）。かな漢字変換辞書２６ｄに記憶されてい
る共起辞書の一例を図５に示す。共起関係にある表記文
字列が対応づけられて記憶されている。音声変換プログ
ラム２６ａおよびかな漢字変換プログラム２６ｃについ
て、後述する。アプリケーションプログラム２６ｐは、
かな漢字変換プログラム２６ｃから表記文字列を受け取
って、ＣＲＴ３０へ出力する。The hard disk 26 stores a voice conversion program 26a, a voice conversion dictionary 26b, a kana-kanji conversion program 26c, a kana-kanji conversion dictionary 26d, and an application program 26p. FIG. 4 shows the data structure of the voice conversion dictionary 26b. As described above, the written character string and its read character string are stored for each voice data. The speech conversion dictionary 26b also stores inter-word connection data indicating how words are connected (not shown). FIG. 5 shows an example of the co-occurrence dictionary stored in the kana-kanji conversion dictionary 26d. Notation character strings having a co-occurrence relationship are stored in association with each other. The voice conversion program 26a and the kana-kanji conversion program 26c will be described later. The application program 26p
The notation character string is received from the kana-kanji conversion program 26c and output to the CRT 30.

【００２６】図３に示すＣＰＵ２３は、ハードディスク
２６に記憶された前記２つのプログラムにしたがいバス
ライン２９を介して、各部を制御する。これらプログラ
ムは、ＦＤＤ２５を介して、プログラムが記憶されたフ
レキシブルディスク２５ａから読み出されてハードディ
スク２６にインストールされたものである。なお、フレ
キシブルディスク以外に、ＣＤ−ＲＯＭ、ＩＣカード等
のプログラムを実体的に一体化したコンピュータ可読の
記憶媒体から、ハードディスクにインストールさせるよ
うにしてもよい。さらに、通信回線を用いてダウンロー
ドするようにしてもよい。The CPU 23 shown in FIG. 3 controls each unit via the bus line 29 according to the two programs stored in the hard disk 26. These programs are read from the flexible disk 25a in which the programs are stored via the FDD 25 and installed on the hard disk 26. In addition to the flexible disk, a hard disk may be installed from a computer-readable storage medium in which a program such as a CD-ROM or an IC card is substantially integrated. Furthermore, you may make it download using a communication line.

【００２７】本実施形態においては、プログラムをフレ
キシブルディスクからハードディスク２６にインストー
ルさせることにより、フレキシブルディスクに記憶させ
たプログラムを間接的にコンピュータに実行させるよう
にしている。しかし、これに限定されることなく、フレ
キシブルディスクに記憶させたプログラムをＦＤＤ２５
から直接的に実行するようにしてもよい。なお、コンピ
ュータによって、実行可能なプログラムとしては、その
ままのインストールするだけで直接実行可能なものはも
ちろん、一旦他の形態等に変換が必要なもの（例えば、
データ圧縮されているものを、解凍する等）、さらに
は、他のモジュール部分と組合して実行可能なものも含
む。In the present embodiment, the program is installed on the hard disk 26 from the flexible disk, so that the computer indirectly executes the program stored on the flexible disk. However, without being limited to this, the program stored in the flexible disk is stored in the FDD25.
Alternatively, it may be executed directly from. Note that, as a program executable by a computer, not only a program that can be directly executed by simply installing it as it is, but also a program that needs to be temporarily converted into another form (for example,
Decompression of data that has been compressed, etc.), and also includes those that can be executed in combination with other module parts.

【００２８】メモリ２７には、音声データバッファ２７
ａ、特徴量バッファ２７ｆ、変換後バッファ２７ｅ、読
みバッファ２７ｂ，半確定バッファ２７ｃ、確定バッフ
ァ２７ｄを有する。The memory 27 has an audio data buffer 27
a, a feature buffer 27f, a post-conversion buffer 27e, a reading buffer 27b, a semi-determined buffer 27c, and a defined buffer 27d.

【００２９】音声データバッファ２７ａ、特徴量バッフ
ァ２７ｆおよび変換後バッファ２７ｅは、音声変換プロ
グラム用のバッファである。音声データバッファ２７ａ
には入力された音声データをデジタル変換されたデジタ
ル音声データが記憶される。特徴量バッファ２７ｆに
は、音声変換プログラム２６ａによって抽出された音声
特徴量が記憶される。変換後バッファ２７ｅには、音声
変換プログラム２６ａによって特定された表記文字列お
よびその読み文字列が記憶される。The audio data buffer 27a, the feature buffer 27f and the post-conversion buffer 27e are buffers for an audio conversion program. Audio data buffer 27a
Stores digital audio data obtained by digitally converting input audio data. The voice feature extracted by the voice conversion program 26a is stored in the feature buffer 27f. The post-conversion buffer 27e stores the written character string specified by the voice conversion program 26a and its read character string.

【００３０】読みバッファ２７ｂ，半確定バッファ２７
ｃ、確定バッファ２７ｄは、かな漢字変換プログラム用
のバッファである。読みバッファ２７ｂには、変換後バ
ッファ２７ｅに記憶された読み文字列が順次記憶され
る。半確定バッファ２７ｃには、変換後バッファ２７ｅ
に記憶された表記文字列が順次記憶される。確定バッフ
ァ２７ｄには、キーボード２８から与えられる確定命令
によって確定された表記文字列が記憶される。なお、メ
モリ２７にはその他、各種の演算結果等が記憶される。Read buffer 27b, semi-determined buffer 27
c, the determination buffer 27d is a buffer for a kana-kanji conversion program. The reading character strings stored in the converted buffer 27e are sequentially stored in the reading buffer 27b. The semi-determined buffer 27c includes a post-conversion buffer 27e.
Are sequentially stored. The notation character string determined by the determination command given from the keyboard 28 is stored in the determination buffer 27d. The memory 27 also stores various calculation results and the like.

【００３１】ＣＲＴ３０には、未確定状態の表記文字列
または確定状態の表記文字列が表示される。On the CRT 30, a notation character string in an undetermined state or a notation character string in a confirmed state is displayed.

【００３２】３．フローチャートつぎに、ハードディスク２６に記憶されているプログラ
ムについて、図６〜図８を用いて説明する。この実施形
態においては、アプリケーションプログラム２６ｐは、
文字列入力可能状態で音声変換プログラム２６ａおよび
かな漢字変換プログラム２６ｂを呼び出し、かな漢字文
字列が入力される。音声変換プログラム２６ａは表記文
字列を特定し（図６ステップＳＴ０〜ステップＳＴ
３）、この表記文字列をかな漢字変換プログラム２６ｂ
は未確定状態でアプリケーションプログラム２６ｐに渡
す。アプリケーションプログラム２６ｐは表記文字列を
未確定状態で表示する。3. Next, a program stored in the hard disk 26 will be described with reference to FIGS. In this embodiment, the application program 26p is:
The voice conversion program 26a and the kana-kanji conversion program 26b are called in a state where a character string can be input, and a kana-kanji character string is input. The voice conversion program 26a specifies the notation character string (FIG. 6, step ST0 to step ST0).
3) The kana-kanji conversion program 26b
Is passed to the application program 26p in an undetermined state. The application program 26p displays the notation character string in an undetermined state.

【００３３】かな漢字変換プログラム２６ｂは、表記文
字列を検査し、所定のかな漢字変換規則に合致していな
ければ、かな漢字変換規則に基づいて表記文字列を変換
する（図７ステップＳＴ４〜図７ステップＳＴ３３）。
かな漢字変換規則に基づいて変換された表記文字列は、
アプリケーションプログラム２６ｐに渡される。本実施
形態においては、かな漢字変換プログラム２６ｂとし
て、ＩＭＥ(input method editor)の１つである株式
会社ジャストシステム製の「ＡＴＯＫ１１」（商標）を
用い、アプリケーションプログラム２６ｐとして、同社
製の「一太郎８」（商標）を採用した。The kana-kanji conversion program 26b examines the notation character string, and if it does not match the predetermined kana-kanji conversion rule, converts the notation character string based on the kana-kanji conversion rule (steps ST4 to ST33 in FIG. 7). ).
Notation character strings converted based on the Kana-Kanji conversion rules are:
It is passed to the application program 26p. In the present embodiment, “ATOK11” (trademark) manufactured by Just System Co., Ltd., which is one of IME (input method editor), is used as the kana-kanji conversion program 26b, and “Ichitaro 8” (made by the company) is used as the application program 26p. Trademark).

【００３４】まず、アプリケーションプログラム２６ｐ
について説明する。アプリケーションプログラム２６ｐ
は、日本語文書作成プログラムであり、起動することに
より、文字列入力可能画面がＣＲＴ３０に表示される
（図示せず）。First, the application program 26p
Will be described. Application program 26p
Is a Japanese document creation program, and when activated, a character string input enabled screen is displayed on the CRT 30 (not shown).

【００３５】この状態で、音声変換プログラム２６ａが
呼び出される。音声変換プログラム２６ａによって以下
の処理が実行される。操作者は、図３に示すマイク３４
から音声データを入力する。ＣＰＵ２３は、マイク３４
より音声入力があるか否かを判断しており（図６ステッ
プＳＴ０）、音声入力があれば音声データバッファ２７
ａにインターフェイス３３にてデジタル変換されたデジ
タル音声データを記憶する（ステップＳＴ１）。たとえ
ば、「さんせいとあるかりせいのすいようえき」と発声
されると、デジタル変換されたデータが音声データバッ
ファ２７ａに記憶される。In this state, the voice conversion program 26a is called. The following processing is executed by the voice conversion program 26a. The operator operates the microphone 34 shown in FIG.
Input audio data from. The CPU 23 includes a microphone 34
It is determined whether or not there is a voice input (step ST0 in FIG. 6).
The digital audio data digitally converted by the interface 33 is stored in a (step ST1). For example, when "Sansei-Arikari-no-Sui-Eki" is uttered, the digitally converted data is stored in the audio data buffer 27a.

【００３６】ＣＰＵ２３は、音声データバッファ２７ａ
に記憶されたデジタル音声データから特徴量を抽出する
（ステップＳＴ２）。つぎに、図３に示す音声変換辞書
２６ｂに登録された音声データ、単語間連接データと照
合し、最も一致度の高い文字列を得て、変換後バッファ
２７ｅに記憶する（ステップＳＴ３）。例えば、この場
合、「さんせい／と／あるかりせい／の／すいようえき
／」と単語単位に区切られ、各単語毎に変換候補が特定
され、変換後バッファ２７ｅには、表記文字列「賛成／
と／アルカリ性／の／水溶液」および読み文字列「さん
せい／と／あるかりせい／の／すいようえき」が記憶さ
れる。The CPU 23 has an audio data buffer 27a
A feature amount is extracted from the digital audio data stored in (step ST2). Next, the voice data and the inter-word concatenation data registered in the voice conversion dictionary 26b shown in FIG. 3 are collated to obtain a character string having the highest matching degree, and stored in the converted buffer 27e (step ST3). For example, in this case, a conversion candidate is specified for each word, such as "sansei / to / arikasei / no / suoiyoueki /", and a conversion candidate is specified for each word. Agree/
And / alkaline // aqueous solution ”and the reading character string“ sansei / to / akarisei / no / suoiyoueki ”are stored.

【００３７】なお、図６ステップＳＴ０にて、音声入力
がなければ音声変換プログラムは終了する。In step ST0 in FIG. 6, if there is no voice input, the voice conversion program ends.

【００３８】つぎに、かな漢字変換プログラム２６ｂが
呼び出される。かな漢字変換プログラム２６ｂによって
以下の処理が実行される。ＣＰＵ２３は、変換後バッフ
ァ２７ｅに新たに文字列が記憶されたか否かを判断して
おり（図７ステップＳＴ４）、新たに文字列が記憶され
た場合には、かな漢字変換プログラムの変換単位（文
節）に区切りを修正し、その表記文字列を未確定バッフ
ァ２７ｃに、その読み文字列を読みバッファ２７ｂに記
憶する（図７ステップＳＴ５）。上記の場合であれば、
たとえば、表記文字列「賛成と／アルカリ性の／水溶
液」が半確定バッファ２７ｃに、読み「さんせいと／あ
るかりせいの／すいようえき」が読みバッファ２７ｂに
記憶される。Next, the kana-kanji conversion program 26b is called. The following processing is executed by the kana-kanji conversion program 26b. The CPU 23 determines whether or not a new character string is stored in the post-conversion buffer 27e (step ST4 in FIG. 7). If a new character string is stored, the conversion unit (phrase) of the kana-kanji conversion program is used. ) Is corrected, and the written character string is stored in the undetermined buffer 27c and the read character string is stored in the read buffer 27b (step ST5 in FIG. 7). In the above case,
For example, the notation character string “agreement / alkaline / aqueous solution” is stored in the semi-fixed buffer 27c, and the reading “sansanto / arukasei / suoiyoeki” is stored in the reading buffer 27b.

【００３９】つぎに、ＣＰＵ２３は、半確定バッファ２
７ｃに記憶された表記文字列「賛成とアルカリ性の水溶
液」を、アプリケーションプログラム２６ｐに半確定状
態であるとの条件をつけて渡す（図７ステップＳＴ
７）。アプリケーションプログラム２６ｐが実行される
と、例えば、ＣＲＴ３０に半確定状態で表示される。Next, the CPU 23 executes the semi-determined buffer 2
The notation character string "agreement and alkaline aqueous solution" stored in 7c is passed to the application program 26p with the condition that it is in a semi-determined state (step ST in FIG. 7).
7). When the application program 26p is executed, it is displayed on the CRT 30, for example, in a semi-determined state.

【００４０】つぎに、ＣＰＵ２３は、半確定バッファ２
７ｃに記憶された文字列が、かな漢字変換規則に合致し
ているか否かを判断する（図７ステップＳＴ９）。この
例では、図５に示す共起辞書に基づいて、前記判断が行
われる。この場合、表記文字列「アルカリ性」と共起関
係にあるのは表記文字列「賛成」ではなく表記文字列
「酸性」であることがわかる。Next, the CPU 23 executes the semi-determined buffer 2
It is determined whether the character string stored in 7c matches the Kana-Kanji conversion rule (step ST9 in FIG. 7). In this example, the determination is made based on the co-occurrence dictionary shown in FIG. In this case, it is found that the co-occurrence relationship with the notation character string “alkaline” is not the notation character string “agree” but the notation character string “acidity”.

【００４１】したがって、ＣＰＵ２３は、半確定バッフ
ァ２７ｃに記憶された文字列がかな漢字変換規則に合致
していないと判断し、半確定バッファ２７ｃに記憶され
た表記文字列をかな漢字変換規則に基づき、変換する
（図７ステップＳＴ１１）。具体的には、「賛成」が
「酸性」に置換され、半確定バッファ２７ｃに「賛成と
アルカリ性の水溶液」と記憶される。Therefore, the CPU 23 determines that the character string stored in the semi-fixed buffer 27c does not conform to the Kana-Kanji conversion rule, and converts the written character string stored in the semi-fixed buffer 27c based on the Kana-Kanji conversion rule. (Step ST11 in FIG. 7). Specifically, “agree” is replaced by “acid”, and “agree and alkaline aqueous solution” is stored in the semi-fixed buffer 27c.

【００４２】つぎに、ＣＰＵ２３は、半確定バッファ２
７ｃに記憶された表記文字列「酸性とアルカリ性の水溶
液」を、アプリケーションプログラム２６ｐに半確定状
態で渡す（図７ステップＳＴ１３）。これにより、ＣＲ
Ｔ３０に半確定状態で表示される。Next, the CPU 23 executes the semi-determined buffer 2
The notation character string “acidic and alkaline aqueous solution” stored in 7c is transferred to the application program 26p in a semi-determined state (step ST13 in FIG. 7). Thereby, CR
It is displayed in a semi-determined state at T30.

【００４３】なお、ステップＳＴ９にて合致していると
判断した場合には、ステップＳＴ１１、１３の処理は行
わない。If it is determined in step ST9 that the values match, the processes in steps ST11 and ST13 are not performed.

【００４４】つぎに、ＣＰＵ２３は、キー操作があるか
否かが判断される（ステップＳＴ１４）。キー操作がな
ければ、かな漢字変換プログラムを終了し、図６に示す
音声変換プログラムを実行する。Next, the CPU 23 determines whether or not there is a key operation (step ST14). If there is no key operation, the kana-kanji conversion program ends, and the voice conversion program shown in FIG. 6 is executed.

【００４５】一方、ステップＳＴ１４にて、キー操作が
あれば、ステップＳＴ１５に進み、キーの種類を判断す
る。On the other hand, if there is a key operation in step ST14, the flow advances to step ST15 to determine the type of key.

【００４６】キーの種類が確定キーであれば、図８ステ
ップＳＴ１７に進み、確定バッファ２７ｄに半確定バッ
ファ２７ｃのデータを記憶する。そして、確定状態の条
件付きでアプリケーションプログラム２６ｐに渡す。ア
プリケーションプログラム２６ｐは、例えば、確定バッ
ファ２７ｄに記憶された表記文字列「酸性とアルカリ性
の水溶液」を、ＣＲＴ３０に確定状態で表示させる。If the type of the key is the decision key, the process proceeds to step ST17 in FIG. 8, and the data of the semi-decision buffer 27c is stored in the decision buffer 27d. Then, it is passed to the application program 26p with the condition of the fixed state. The application program 26p causes the CRT 30 to display, for example, the notation character string “acidic and alkaline aqueous solution” stored in the determination buffer 27d.

【００４７】キーの種類が区切り位置変更キーであれ
ば、読みバッファ２７ｂの区切り位置変更を行い（図８
ステップＳＴ２１）、区切り位置変更された読み文字列
をアプリケーションプログラム２６ｐに渡す（ステップ
ＳＴ２３）。そして、ステップＳＴ１４に戻り、キー操
作があったかを判断する。If the key type is a break position change key, the break position of the reading buffer 27b is changed (FIG. 8).
In step ST21, the read character string whose delimiter position has been changed is passed to the application program 26p (step ST23). Then, returning to step ST14, it is determined whether or not a key operation has been performed.

【００４８】キーの種類が取消キーであれば、半確定バ
ッファ２７ｃ、読みバッファ２７ｂのデータをクリアす
る（図８ステップＳＴ２５）。そして、処理を終了す
る。If the key type is the cancel key, the data in the semi-determined buffer 27c and the read buffer 27b are cleared (step ST25 in FIG. 8). Then, the process ends.

【００４９】キーの種類がカタカナキーであれば、半確
定バッファ２７ｃの注目文節をカタカナ変換する（図８
ステップＳＴ２７）。そして、ステップＳＴ１４に戻
り、キー操作がされたかを判断する。If the type of the key is a katakana key, the notable phrase in the semi-fixed buffer 27c is subjected to katakana conversion (FIG. 8).
Step ST27). Then, returning to step ST14, it is determined whether a key operation has been performed.

【００５０】キーの種類がひらがなキーであれば、半確
定バッファ２７ｃの注目文節をひらがな変換する（図８
ステップＳＴ２９）。そして、ステップＳＴ１４に戻
り、キー操作がされたかを判断する。If the key type is a hiragana key, the target phrase in the semi-fixed buffer 27c is converted into hiragana (FIG. 8).
Step ST29). Then, returning to step ST14, it is determined whether a key operation has been performed.

【００５１】キーの種類が注目文節変更キーであれば、
半確定バッファ２７ｃの注目文節を変更変換する（図８
ステップＳＴ３１）。そして、ステップＳＴ１４に戻
り、キー操作がされたかを判断する。If the key type is the noticeable phrase change key,
The target phrase in the semi-determined buffer 27c is changed and converted (FIG. 8).
Step ST31). Then, returning to step ST14, it is determined whether a key operation has been performed.

【００５２】キーの種類が変換キーであれば、半確定バ
ッファ２７ｃの注目文節の表記を次候補に変換する（図
８ステップＳＴ３３）。そして、ステップＳＴ１４に戻
り、キー操作がされたかを判断する。If the type of the key is a conversion key, the notation of the target phrase in the semi-fixed buffer 27c is converted to the next candidate (step ST33 in FIG. 8). Then, returning to step ST14, it is determined whether a key operation has been performed.

【００５３】キーの種類が半角変換キーであれば、半確
定バッファ２７ｃの注目文節を半角変換する（図８ステ
ップＳＴ３５）。そして、ステップＳＴ１４に戻り、キ
ー操作がされたかを判断する。If the type of key is a half-width conversion key, the target phrase in the half-fixed buffer 27c is half-width converted (step ST35 in FIG. 8). Then, returning to step ST14, it is determined whether a key operation has been performed.

【００５４】なお、図７ステップＳＴ４にて、変換後バ
ッファ２７ｅに文字列が追加記憶されなかった場合に
は、ステップＳＴ１４に進み、キー操作がされたかを判
断する。If no character string is additionally stored in the post-conversion buffer 27e in step ST4 in FIG. 7, the process proceeds to step ST14 to determine whether a key operation has been performed.

【００５５】かな漢字変換プログラムによる処理と音声
変換プログラムとによる処理が順次繰り返される。The processing by the kana-kanji conversion program and the processing by the voice conversion program are sequentially repeated.

【００５６】このように本実施形態においては、音声変
換プログラムによって変換された表記文字列を未確定状
態でＩＭＥであるかな漢字変換プログラムに渡して検査
し、適切な表記文字列でない場合には、かな漢字変換プ
ログラムの変換規則に基づいて再変換するようにしてい
る。これにより、学習させていない音声変換プログラム
であっても、すでにその操作者が蓄積しているかな漢字
変換規則を活用して適切なかな漢字変換を行うことがで
きる。As described above, in the present embodiment, the notation character string converted by the voice conversion program is passed in an undetermined state to the Kana-Kanji conversion program, which is an IME, and inspected. The conversion is performed again based on the conversion rules of the conversion program. As a result, even if the speech conversion program has not been trained, appropriate Kana-Kanji conversion can be performed by utilizing the Kana-Kanji conversion rules already accumulated by the operator.

【００５７】４．その他の実施形態なお、本実施形態においては、かな漢字変換規則とし
て、共起情報を用いた場合について説明したが、これ以
外の規則、例えば学習情報等を採用してもよい。また、
日本語へのかな漢字変換だけでなく、言語規則であれば
どのようなものであってもよい。例えば、他の言語であ
る英語、中国語等の音声入力についても同様に適用する
ことかできる。4. Other Embodiments In the present embodiment, the case where co-occurrence information is used as the kana-kanji conversion rule has been described, but other rules, such as learning information, may be used. Also,
Not only kana-kanji conversion to Japanese but any language rules may be used. For example, the same can be applied to voice input in other languages such as English and Chinese.

【００５８】また、同音異議語がある場合、本実施形態
のように共起情報を用いることにより、操作者の操作な
しで音声入力されたデータから適切な表記文字列を得る
ことができる。これに対して、音声変換プログラムの音
声変換辞書に登録されていない読みの音声データ（専門
用語等）については、正しく変換することができない。
なぜなら、音声変換プログラムによって、その読みによ
く似た読みを有する表記文字列に変換されてしまうから
である。例えば、特許という用語がない場合、よく似た
東京などに変換され、その読みも「とうきょう」と変換
されるわけである。一般に、かな漢字変換プログラム
は、音声変換プログラムから与えられた表記文字列また
は読みを参照して、変換候補を検索する。したがって、
このような場合でも、正確に変換できるように、かな漢
字変換プログラムにかな漢字変換規則として、類似の読
みを有する表記文字列を記憶するようにしてもよい。こ
のように、類似語読みデータを記憶しておき、正しい読
みまで予想することにより、少しずれた読みが、音声変
換プログラムから与えられても、適切な表記文字列を得
ることができる。When there is a homonymous object word, by using the co-occurrence information as in the present embodiment, it is possible to obtain an appropriate written character string from data input by voice without an operation by an operator. On the other hand, reading voice data (technical terms and the like) not registered in the voice conversion dictionary of the voice conversion program cannot be correctly converted.
This is because the voice conversion program converts the character string into a written character string having a reading very similar to the reading. For example, if there is no term for a patent, it is converted to a similar Tokyo, etc., and its reading is also converted to “Tokyo”. In general, a kana-kanji conversion program searches for a conversion candidate by referring to a written character string or a reading given from a voice conversion program. Therefore,
Even in such a case, a written character string having a similar pronunciation may be stored in the kana-kanji conversion program as a kana-kanji conversion rule so that conversion can be performed accurately. In this way, by storing the similar word reading data and predicting the correct reading, an appropriate written character string can be obtained even if a slightly shifted reading is given from the speech conversion program.

【００５９】また、和製英語となっているような単語、
例えば、「システム」等については、表記文字列とし
て、そのスペル「ｓｙｓｔｅｍ」を記憶しておき、これ
に変換するようにしてもよい。Also, words that are in Japanese English,
For example, as for “system” and the like, the spelling “system” may be stored as a notation character string and converted to this.

【００６０】なお、本実施形態においては、音声変換プ
ログラムとして、日本アイ・ビー・エム株式会社製のＶ
ｉａＶｏｉｃｅ（商標）を採用したが、これに限定され
ず、他の音声認識プログラムについても同様に適用でき
る。このように、市販の音声変換プログラムを音声入力
エンジンとして用いた場合に、変換対象の分野等によ
り、その変換率がそれほどよくない場合でも、いままで
操作者が蓄えたかな漢字変換の変換規則を用いて所望の
表記文字列を得ることができる。In the present embodiment, as the voice conversion program, a program manufactured by IBM Japan, Ltd.
Although iaVoice (trademark) was adopted, the present invention is not limited to this, and other voice recognition programs can be similarly applied. In this way, when a commercially available voice conversion program is used as a voice input engine, even if the conversion rate is not so good due to the conversion target field, the conversion rules of the kana-kanji conversion stored by the operator are used. Thus, a desired written character string can be obtained.

【００６１】また、本実施形態においては、入力された
音声データから単語毎区切られた変換候補を特定する場
合について説明したが、これに限定されず、文節単位で
候補を特定する場合でもよい。Further, in the present embodiment, a case has been described in which the conversion candidates separated for each word are specified from the input voice data. However, the present invention is not limited to this, and the candidates may be specified for each phrase.

【００６２】また、音声変換プログラムについてはソフ
トウェアで実現した場合について説明したが、ハードウ
ェアで実現させてもよい。すなわち、音声データ変換手
段とは、音声データ変換装置を含む概念である。Further, the case where the voice conversion program is realized by software has been described, but it may be realized by hardware. That is, the audio data conversion means is a concept including the audio data conversion device.

【００６３】また、本実施形態においては、アプリケー
ションプログラム２６ｐに半確定状態でＣＲＴ３０に表
示させるようにした。すなわち、かな漢字変換プログラ
ムは、未確定バッファに記憶された表記文字列が未確定
状態で表示されるように、表示制御命令を出力してい
る。しかし、未確定状態での表示については、このよう
にアプリケーションプログラム２６ｐが実行するだけで
なく、アプリケーションプログラム２６ｐから表示しな
かったという返答指令を、かな漢字変換プログラムが受
けて、これを表示するようにしてもよい。また、かかる
表示については、オペレーティングシステム（ＯＳ）と
分担して、実現するようにしてもよい。In this embodiment, the application program 26p is displayed on the CRT 30 in a semi-determined state. That is, the kana-kanji conversion program outputs the display control command so that the notation character string stored in the undetermined buffer is displayed in an undetermined state. However, as for the display in the undetermined state, not only is the application program 26p executed in this way, but the Kana-Kanji conversion program receives a reply command indicating that no display was made from the application program 26p, and displays it. You may. In addition, such display may be realized by sharing with the operating system (OS).

【００６４】なお、本実施形態においては、図１に示す
機能を実現する為に、ＣＰＵ２３を用い、ソフトウェア
によってこれを実現している。しかし、その一部もしく
は全てを、ロジック回路等のハードウェアによって実現
してもよい。In the present embodiment, the functions shown in FIG. 1 are realized by using the CPU 23 and software. However, some or all of them may be realized by hardware such as a logic circuit.

[Brief description of the drawings]

【図１】本発明にかかる表記文字列変換装置１０の機能
ブロック図である。FIG. 1 is a functional block diagram of a notation character string conversion device 10 according to the present invention.

【図２】図１に示す表記文字列変換装置１０のハードウ
エア構成の一例を示す図である。FIG. 2 is a diagram illustrating an example of a hardware configuration of a notation character string conversion device 10 illustrated in FIG.

【図３】図２に示す表記文字列変換装置１０を、ＣＰＵ
２３を用いて実現したハードウエア構成の一例を示す図
である。FIG. 3 is a block diagram of a notation character string conversion device 10 shown in FIG.
FIG. 3 is a diagram illustrating an example of a hardware configuration realized using the H.23.

【図４】音声変換辞書のデータ構造を示す。FIG. 4 shows a data structure of a speech conversion dictionary.

【図５】かな漢字変換辞書のデータ構造を示す。FIG. 5 shows a data structure of a kana-kanji conversion dictionary.

【図６】音声変換処理のフローチャートである。FIG. 6 is a flowchart of a voice conversion process.

【図７】かな漢字変換処理のフローチャートである。FIG. 7 is a flowchart of a kana-kanji conversion process.

【図８】かな漢字変換処理のフローチャートである。FIG. 8 is a flowchart of a kana-kanji conversion process.

【図９】従来の音声変換システムの機能ブロック図であ
る。FIG. 9 is a functional block diagram of a conventional voice conversion system.

[Explanation of symbols]

３・・・・・音声入力手段５・・・・・音声データ変換手段６・・・・・音声変換手段７・・・・・変換変換辞書１０・・・・表記文字列変換装置１１・・・・読み文字列記憶手段１２・・・・未確定文字列記憶手段１３・・・・手動変換手段１４・・・・出力手段１５・・・・自動変換手段１７・・・・言語規則記憶手段１９・・・・表示手段２３・・・ＣＰＵ２７・・・メモリ 3 ... Voice input means 5 ... Voice data conversion means 6 ... Voice conversion means 7 ... Conversion conversion dictionary 10 ... Notation character string conversion device 11 ... ..Reading character string storage means 12... Undetermined character string storage means 13... Manual conversion means 14... Output means 15... Automatic conversion means 17. 19 ... display means 23 ... CPU 27 ... memory

───────────────────────────────────────────────────── フロントページの続き (56)参考文献特開平９−160750（ＪＰ，Ａ) 特開昭64−70865（ＪＰ，Ａ) 実開平５−79657（ＪＰ，Ｕ) (58)調査した分野(Int.Cl.⁷，ＤＢ名) G06F 17/22 ──────────────────────────────────────────────────続き Continuation of the front page (56) References JP-A-9-160750 (JP, A) JP-A-64-70865 (JP, A) JP-A-5-79657 (JP, U) (58) Field (Int.Cl. ⁷ , DB name) G06F 17/22

Claims

(57) [Claims]

1. A notation character string corresponding to a feature amount of audio data.
And a reading character string corresponding to the reading character string corresponding to the notation character string.
By using the voice conversion dictionary, the input voice data
Best matches the feature value recorded in the voice conversion dictionary
A notation character string conversion device for converting a notation character string given from a voice data conversion means for converting into a notation character string and outputting it together with a corresponding reading character string so as to conform to a language rule, comprising a specific notation character string. The correspondence between the two is described based on language rules.
Corresponding reading characters corresponding to the notation character string
A language rule storage means for storing the strings together; an unconfirmed character string storage means for storing the notation character string provided from the audio data conversion means in an unconfirmed state; and a read character string provided from the audio data conversion means. A reading character string storing means for storing, a relationship between specific notation character strings among a plurality of notation character strings stored in the unconfirmed character string storage means matches the correspondence relationship.
Judge whether or not they match, and do not match
In the case, the reading character stored in the reading character string storage means
A reading character string corresponding to the specific notation character string in the column;
The specific notation character string based on the corresponding reading character string
Automatic conversion means for converting the notation character string into a notation character string that matches the correspondence relationship .

With 2. A computer, a writing character string conversion method for converting to match the writing character string supplied from the sound data conversion unit into language rules, the audio data converting means, the audio data Notation character string corresponding to the feature amount and the notation sentence
A speech conversion dictionary that stores the reading character string corresponding to the character string
By using the input voice data,
Character string that most closely matches the feature value recorded in the translation dictionary
It is a means to convert and output with the corresponding reading character string.
The computer executes the following processing, and records the correspondence between specific notation character strings based on language rules.
Corresponding reading characters corresponding to the notation character string
A string is also stored, and the notation character string given from the voice data conversion means is stored in an undetermined state, and the voice data conversion means
Storing the read character string given al, or relationship of a particular writing character string among the plurality of writing character string stored in the undetermined state meets the relationship
Judge whether it is not the same as above.
The character string read from the voice data conversion means and the pair
The specific notation character string based on the unread character string
A notation character string conversion method using a computer , which performs conversion to a notation character string that matches the correspondence .

3. A storage medium storing a program for causing a computer provided with an input device, a control device, an output device, and a storage device to function as a notation character string conversion device, wherein the program executes the following means: Function based on language rules.
Corresponding reading characters corresponding to the notation character string
Language rule storage means for storing strings together, a notation character string corresponding to a feature amount of voice data, and the notation sentence
A speech conversion dictionary that stores the reading character string corresponding to the character string
By using the input voice data,
Character string that most closely matches the feature value recorded in the translation dictionary
An undetermined character string storage unit that stores a notation character string provided from the audio data conversion unit that converts and outputs the character string together with the corresponding reading character string in an undetermined state; and stores a read character string that is supplied from the audio data conversion unit. read character string storage means, if the correspondence relationship between certain writing character string among the plurality of writing character string stored in said unfixed character string storage means
Judge whether or not they match, and do not match
In the case, the reading character stored in the reading character string storage means
A reading character string corresponding to the specific notation character string in the column;
Writing character string of the specific based on said corresponding read character string
Automatic conversion means for converting into a notation character string that matches the correspondence relationship, a storage medium storing a program.

4. The notation character string conversion device according to claim 1, wherein the language rule storage means includes a notation character string having a similar reading to a certain reading character string.
The automatic conversion means stores a plurality of written characters stored in the unconfirmed character string storage means.
If the relationship between certain writing character string of the string is determined not to conform to the correspondence, to convert the representation string automatically converted into writing character string having a reading of the similarity, the Character string conversion device.

5. The notation character string conversion method according to claim 2, wherein the computer further includes a notation character string having a similar reading to a certain reading character string.
Stores the, especially among the plurality of writing character string that the stored in the undetermined state
If the relationship between the constant notation string is determined not to conform to the correspondence, to convert the representation string automatically converted into writing character string having a reading of the similarity, and wherein Notation string conversion method.

6. A storage medium storing a program according to claim 3, wherein said language rule storage means stores a notation character string having a similar reading to a certain reading character string, and said automatic conversion means , a plurality of writing character stored in said unfixed character string storage means
If it is determined that the relationship between specific notation character strings among the columns does not match the correspondence , the undetermined character string
Converting a plurality of notation character strings stored in the storage means into notation character strings having similar readings.