JPH0540494A - Composite voice tester - Google Patents

Composite voice tester

Info

Publication number
JPH0540494A
Authority
JP
Japan
Prior art keywords
voice
pattern
input
matching
speech
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP3196591A
Other languages
Japanese (ja)
Inventor
Jun Kametani
潤 亀谷
Hisae Hashimoto
久恵 橋本
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Corp
NEC Engineering Ltd
Original Assignee
NEC Corp
NEC Engineering Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NEC Corp, NEC Engineering Ltd

Abstract

PURPOSE: To decide whether synthesized speech is acceptable, based on the result of matching the pattern sequence of the input synthesized speech against a pattern sequence of standard synthesized speech registered in advance. CONSTITUTION: The tester consists of a voice input part 1 for digitizing the input synthesized speech, a voice analyzing part 2 for extracting a feature pattern sequence from the input speech, a pattern memory part 3 for storing a standard feature pattern sequence in advance, a pattern matching part 4 for executing DP matching between the input and standard feature pattern sequences, a result deciding part 5 for comparing the similarity obtained by matching with a prescribed value and deciding pass or fail, and a whole control part 6 for controlling each constituent unit and communicating with a host.

Description

Detailed Description of the Invention

[0001]

[Field of Industrial Application] The present invention relates to an automatic test apparatus for speech synthesizers.

[0002]

[Prior Art] Conventionally, in the inspection and testing of a speech synthesizer, the final test has been for an inspector to actually listen to the synthesized speech output for predetermined words and phrases and confirm that the synthesized speech contains no errors.

[0003]

[Problems to Be Solved by the Invention] However, this conventional inspection method requires the inspector to attend the device under test for the entire inspection, which increases inspection man-hours.

[0004] Moreover, although automatic diagnostic software has been introduced for testing the data memory that stores the speech synthesizer's segment data, the signal processor that performs synthesis, and similar components, only the final check of the synthesized speech output still requires manual work. This has prevented the throughput of the inspection process as a whole from improving.

[0005] The present invention has been made in view of the above circumstances. Accordingly, an object of the present invention is to provide a novel synthetic speech tester capable of solving the above problems inherent in the prior art.

[0006]

[Means for Solving the Problems] To achieve the above object, the synthetic speech tester according to the present invention comprises: a voice input unit that digitizes the input synthesized speech signal and determines its start and end points; a voice analysis unit that extracts an acoustic feature pattern sequence from the digitized synthesized speech; a pattern memory unit that stores and registers feature pattern sequences; a pattern matching unit that performs pattern matching between a feature pattern sequence registered in advance and the feature pattern sequence extracted from the input synthesized speech; a result determination unit that determines the validity of the input synthesized speech from the similarity between the pattern sequences obtained as a result of matching; and an overall control unit that controls each constituent unit of the invention.

[0007]

[Embodiment] Next, a preferred embodiment of the present invention will be described in detail with reference to the drawings.

[0008] FIG. 1 is a block diagram showing an embodiment of the present invention.

[0009] Referring to FIG. 1, the synthesized speech output by the speech synthesizer under test (not shown) is input to the voice input unit 1 through the microphone 7, where it is digitized and its start and end points are detected. The synthesized speech digitized by the voice input unit 1 is sent to the voice analysis unit 2 and converted into a feature pattern sequence by acoustic analysis such as mel-cepstrum analysis. The pattern memory unit 3 is a memory that stores feature pattern sequences extracted in advance by the voice analysis unit 2. The pattern matching unit 4 performs DP matching between the feature pattern sequence of the input synthesized speech obtained by the voice analysis unit 2 and the feature pattern sequence registered in the pattern memory unit 3. The similarity between the pattern sequences obtained by this DP matching is compared with a prescribed similarity in the result determination unit 5. Input synthesized speech whose similarity is at or above the prescribed value is judged to pass, and the result is reported to the overall control unit 6.
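The description states that the voice input unit 1 detects the start and end points of the digitized speech but does not say how. The following is a minimal sketch of one common approach, short-time energy thresholding. The function name `detect_endpoints`, the frame length, and the threshold are all illustrative assumptions and are not taken from the patent.

```python
def detect_endpoints(samples, frame_len=160, threshold=0.01):
    """Return (start, end) sample indices of the active region, or None.

    Assumption: a frame is "speech" when its mean squared amplitude
    exceeds `threshold`. The patent does not specify the detection rule.
    """
    active_starts = []
    # Step through the signal in non-overlapping frames.
    for i in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[i:i + frame_len]
        energy = sum(x * x for x in frame) / frame_len
        if energy > threshold:
            active_starts.append(i)
    if not active_starts:
        return None  # no frame exceeded the threshold
    # Start of the first active frame, end of the last active frame.
    return active_starts[0], active_starts[-1] + frame_len
```

Only the samples between the returned indices would then be passed on for acoustic analysis.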

[0010] Based on instructions from the host 8, the overall control unit 6 registers feature pattern sequences in the pattern memory unit 3, designates the feature pattern sequence that the pattern matching unit 4 uses as its matching template, controls the operation sequence of each constituent unit, and so on.

[0011] The operation of this embodiment is briefly described below.

[0012] To inspect the output synthesized speech of a speech synthesizer with this embodiment, the feature pattern sequences of the standard synthesized speech must be registered in advance. To do so, the standard synthesized speech is first input, word by word or phrase by phrase, to the voice input unit 1 through the microphone 7. After digitization and determination of the start and end points, it is converted into a feature pattern sequence by the voice analysis unit 2 and stored in the pattern memory unit 3. At this time, the overall control unit 6 receives from the host 8 the phrase number corresponding to this standard synthesized speech and registers it in the pattern memory unit 3 together with the feature pattern sequence.
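The registration step can be pictured as a table keyed by phrase number. The sketch below is only an illustration under that assumption: the class name `PatternMemory` and its methods are hypothetical, and the patent does not describe the actual storage layout of the pattern memory unit 3.

```python
from dataclasses import dataclass, field


@dataclass
class PatternMemory:
    """Hypothetical stand-in for the pattern memory unit 3."""

    # Maps a phrase number to its registered feature pattern sequence.
    templates: dict = field(default_factory=dict)

    def register(self, phrase_number, feature_sequence):
        # Store a copy so later changes by the caller do not alter the template.
        self.templates[phrase_number] = [list(frame) for frame in feature_sequence]

    def lookup(self, phrase_number):
        # Raises KeyError if the phrase number was never registered.
        return self.templates[phrase_number]
```

During inspection, the phrase number received from the host would select the template through `lookup`.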

[0013] When inspecting the output synthesized speech of the speech synthesizer, the overall control unit 6 first receives from the host 8 the phrase number of the synthesized speech to be input and designates it to the pattern matching unit 4. Next, the synthesized speech input to the voice input unit 1 through the microphone 7 is digitized and its start and end points are determined. The voice analysis unit 2 then converts it into a feature pattern sequence and transfers it to the pattern matching unit 4. The pattern matching unit 4 performs DP matching between the feature pattern sequence corresponding to the phrase number designated by the overall control unit 6 and the feature pattern sequence sent from the voice analysis unit 2, calculates the similarity between the pattern sequences, and sends the result to the result determination unit 5.
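DP matching is commonly realized as dynamic time warping (DTW). The sketch below shows a textbook DTW with a Euclidean frame distance and a symmetric step pattern. The path normalization and the mapping from distance to similarity, 1 / (1 + distance), are assumptions made for illustration, since the patent does not define how the similarity is computed.

```python
import math


def frame_distance(a, b):
    """Euclidean distance between two feature frames of equal length."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def dp_match(template, test):
    """Return the DTW distance between two sequences, normalized by n + m."""
    n, m = len(template), len(test)
    if n == 0 or m == 0:
        raise ValueError("both sequences must be non-empty")
    inf = float("inf")
    # cost[i][j] is the best accumulated distance aligning the first i
    # template frames with the first j test frames.
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = frame_distance(template[i - 1], test[j - 1])
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m] / (n + m)


def similarity(template, test):
    """Assumed mapping: identical (or purely time-warped) sequences score 1.0."""
    return 1.0 / (1.0 + dp_match(template, test))
```

Because the alignment absorbs differences in timing, a test utterance that merely lingers on a frame still scores as highly as an exact copy, while a spectrally different utterance scores lower.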

[0014] The result determination unit 5 compares the received similarity with the similarity prescribed in advance by the overall control unit 6. If it is larger than the prescribed value, a pass judgment is sent to the overall control unit 6; if it is smaller, a fail judgment is sent. The overall control unit 6 reports the received pass/fail judgment to the host 8 together with the corresponding phrase number, and waits for the next instruction from the host 8.
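The decision and reporting steps amount to a threshold comparison followed by a message to the host. In the sketch below, the function names and the dictionary used as the report are hypothetical. Treating a similarity exactly equal to the prescribed value as a pass follows the "at or above" wording of paragraph [0009]; this paragraph itself leaves the equal case open.

```python
def judge(similarity, prescribed):
    """Return True (pass) when the similarity reaches the prescribed value."""
    # Assumption: equality counts as a pass, per paragraph [0009].
    return similarity >= prescribed


def build_report(phrase_number, similarity, prescribed):
    """Pair the pass/fail judgment with the phrase number for the host."""
    return {
        "phrase_number": phrase_number,
        "passed": judge(similarity, prescribed),
    }
```

For example, `build_report(7, 0.95, 0.9)` marks phrase 7 as passed, while a similarity of 0.5 against the same prescribed value marks it as failed.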

[0015]

[Effects of the Invention] As described above, according to the present invention, the acoustic feature pattern sequence of standard synthesized speech is registered in memory in advance, pattern matching is performed against the feature pattern sequence obtained by analyzing the corresponding input synthesized speech, and the validity of the synthesized speech is verified from the magnitude of the resulting similarity. The speech synthesizer can therefore be inspected automatically without the inspector's manual involvement, which reduces inspection man-hours.

[0016] Furthermore, given the high reproducibility and stability characteristic of speech synthesis, correct synthesized speech yields a very high similarity. The validity of the synthesized speech output, that is, whether the device under test is good or defective, can therefore be determined with high accuracy.

[Brief Description of the Drawings]

FIG. 1 is a block diagram showing an embodiment of the present invention.

[Explanation of Symbols]

1 ... Voice input unit
2 ... Voice analysis unit
3 ... Pattern memory unit
4 ... Pattern matching unit
5 ... Result determination unit
6 ... Overall control unit
7 ... Microphone
8 ... Host

Claims (2)

[Claims]

1. A synthetic speech tester comprising: means for digitizing synthesized speech input from a microphone; means for extracting an acoustic feature pattern from the digitized speech signal; storage means for storing the acoustic feature pattern; means for performing pattern matching between a feature pattern stored in advance and a feature pattern obtained from the input speech; means for determining the validity of the input synthesized speech from the similarity of the pattern matching result; and control means for controlling each of said units constituting the invention, wherein the feature pattern of correct synthesized speech is registered in advance, and the device under test is inspected automatically by pattern matching against the synthesized speech output from the speech synthesizer under test.

2. The synthetic speech tester according to claim 1, further characterized in that the control means receives from a host a phrase number corresponding to standard synthesized speech, and registers the phrase number in the storage means together with the feature pattern sequence.
JP3196591A 1991-08-06 1991-08-06 Composite voice tester Pending JPH0540494A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP3196591A JPH0540494A (en) 1991-08-06 1991-08-06 Composite voice tester

Publications (1)

Publication Number Publication Date
JPH0540494A true JPH0540494A (en) 1993-02-19

Family

ID=16360288

Family Applications (1)

Application Number Title Priority Date Filing Date
JP3196591A Pending JPH0540494A (en) 1991-08-06 1991-08-06 Composite voice tester

Country Status (1)

Country Link
JP (1) JPH0540494A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2013072903A (en) * 2011-09-26 2013-04-22 Toshiba Corp Synthesis dictionary creation device and synthesis dictionary creation method
US9129596B2 (en) 2011-09-26 2015-09-08 Kabushiki Kaisha Toshiba Apparatus and method for creating dictionary for speech synthesis utilizing a display to aid in assessing synthesis quality

Similar Documents

Publication Publication Date Title
US5625747A (en) Speaker verification, speech recognition and channel normalization through dynamic time/frequency warping
US5146539A (en) Method for utilizing formant frequencies in speech recognition
US5339385A (en) Speaker verifier using nearest-neighbor distance measure
US4032711A (en) Speaker recognition arrangement
JPH0736475A (en) Standard-pattern forming method in speech analysis
US4665548A (en) Speech analysis syllabic segmenter
US4519094A (en) LPC Word recognizer utilizing energy features
GB2196460A (en) Voice recognition
JPH0540494A (en) Composite voice tester
JPH03167600A (en) Voice recognizing device
US4783808A (en) Connected word recognition enrollment method
JP2975772B2 (en) Voice recognition device
EP1939861B1 (en) Registration for speaker verification
JP3227623U (en) Article image identification system
CN117393002B (en) Read-aloud quality assessment method based on artificial intelligence and related device
Rouhe et al. Reading Validation for Pronunciation Evaluation in the Digitala Project.
JPH0236960B2 (en)
CN115440248A (en) Sample preparation system of voice identity identification device and identification capability evaluation method
JPH10171488A (en) Method for speech recognition and device therefor and storage medium
JPS58130394A (en) Voice recognition equipment
JPH0331274B2 (en)
JPH05323992A (en) Inspection learning system for speech recognition device
US20030163312A1 (en) Speech processing apparatus and method
JPH01267718A (en) Automatic data checking system
JPH03223799A (en) Method and apparatus for recognizing word separated, especially very large vocabu- lary