JPH07153188A

JPH07153188A - Voice reproducing device

Info

Publication number: JPH07153188A
Application number: JP5298163A
Authority: JP
Inventors: Masayuki Misaki; 正之三▲崎▼; Ryoji Suzuki; 良二鈴木
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1993-11-29
Filing date: 1993-11-29
Publication date: 1995-06-16

Abstract

PURPOSE: To reproduce voice efficiently and in an easy-to-hear form by analyzing pre-read voice data with a plurality of analysis signal processing means and applying prescribed voice processing. CONSTITUTION: A read-ahead reproduction control means 12 reads voice data, as necessary, before it is reproduced by a reproduction control means 18, and an utterance speed calculation means 15 estimates the utterance speed of that voice data. A storage means 16 stores the addresses of the voice sections and non-voice sections determined by a voice/non-voice discrimination means 14 and the utterance speed information estimated by the means 15. Based on the given reproduction conditions and the analysis results for the pre-read voice data stored in the means 16, a voice output control means 17 determines the length of data to be deleted from or added to non-voice sections, and also determines the parameters of the speed conversion processing that changes the utterance speed of voice sections. The reproduced voice is thereby made easy to hear.

Description

Detailed Description of the Invention

[0001]

BACKGROUND OF THE INVENTION 1. Field of the Invention
The present invention relates to a voice reproducing device for reproducing voice signals efficiently and in an easy-to-hear form.

[0002]

2. Description of the Related Art
Voice reproducing devices capable of special playback such as fast listening, slow listening, skip, and repeat playback have conventionally been used for listening efficiently or repeatedly to voice signals recorded on a tape recorder or the like.

[0003] A conventional voice reproducing device of the kind described above is explained below with reference to the drawings. FIG. 4 shows the configuration of a conventional voice reproducing device. In FIG. 4, 41 is a magnetic tape reproducing device, 42 is a buffer memory, 43 is a memory control circuit, and 44 is a reproduction control circuit. The operation of the voice reproducing device configured in this way is as follows. First, the reproduction speed of the magnetic tape reproducing device 41 is controlled by the reproduction control circuit 44, and the voice signal that is read out is periodically written into the buffer memory 42 under the control of the memory control circuit 43. The reproduction control circuit 44 then controls the memory control circuit 43 and the magnetic tape reproducing device 41 so that the prescribed data is written into and read out of the buffer memory 42 according to the set reproduction speed.

[0004] Under the control of the reproduction control circuit 44, the memory control circuit 43 controls the buffer memory 42 so that part of the signal input to the buffer memory 42 is output periodically. With this method of reproducing data through the buffer memory 42, the voice signal can be reproduced without any change in pitch at a variety of reproduction speeds. FIG. 5 schematically shows how voice data is read out when the reproduction speed is changed. In FIG. 5, each numbered box represents a data block of a specific time length. This time length depends on the capacity of the memory used, and is taken here to be 3 seconds. During normal playback, all of the recorded voice data can be output as is. During double-speed playback, the voice signals of blocks 1, 3, 5, 7 and 9 are output, while those of blocks 2, 4, 6 and 8 are discarded and not reproduced. Similarly, during five-times-speed playback, blocks 1, 5 and 10 are output and all blocks in between are discarded. To raise the reproduction speed further, the proportion of discarded data is simply increased.
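The block-discarding scheme of the conventional device can be sketched as follows. This is a minimal illustration, not the circuit of FIG. 4: the function names and the uniform stride (keep every n-th block) are assumptions. The double-speed case matches the blocks listed in the text; the five-times example in the text (blocks 1, 5 and 10) is not reproduced exactly by a uniform stride.

```python
from typing import List


def select_blocks(num_blocks: int, speed: int) -> List[int]:
    """Return the 1-based indices of the blocks that are output.

    Assumption: every `speed`-th block is kept, starting from block 1,
    and all blocks in between are discarded.
    """
    if speed < 1:
        raise ValueError("speed must be a positive integer")
    return list(range(1, num_blocks + 1, speed))


def playback_seconds(num_blocks: int, speed: int, block_seconds: float = 3.0) -> float:
    """Playback time of the kept blocks, each played at its original pitch."""
    return len(select_blocks(num_blocks, speed)) * block_seconds


print(select_blocks(9, 1))  # normal playback keeps all nine blocks
print(select_blocks(9, 2))  # double speed -> [1, 3, 5, 7, 9]
```

Because whole blocks are dropped regardless of their content, a discarded block may contain a consonant or an entire word, which is the weakness the invention addresses.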

[0005]

Problems to be Solved by the Invention
With the above configuration, however, data is thinned out when the time axis is compressed to increase the speed, so consonants and the like are lost and intelligibility falls. In addition, the junctions between blocks are discontinuous. Although the junctions are muted to reduce this effect, amplitude and phase remain discontinuous, and only unnatural-sounding voice is obtained. Furthermore, signals that need not be reproduced, such as silent sections, undergo the same time-axis compression and expansion as voice sections, which is wasteful when the aim is to understand the voice information efficiently. In view of these problems, an object of the present invention is to provide a voice reproducing device that obtains easy-to-hear variable-speed reproduced voice. To reach the desired reproduction speed, the device expands or compresses the time axis by speed conversion processing that takes non-voice sections and voice sections into account, based on analysis results obtained by reading ahead, and at the same time applies signal processing such as formant emphasis and consonant emphasis to the portions where it is effective.

[0006]

Means for Solving the Problems
To achieve the above object, the voice reproducing device of the present invention analyzes pre-read voice data with a plurality of analysis signal processing means, has the voice output control means make an overall judgment on the results, and processes the voice using the reproduction control means for the voice data and the signal processing means. In one configuration, the pre-read voice data is analyzed using a voice/non-voice discrimination means and an utterance speed calculation means, and the voice output control means uses the results to determine the ratio at which non-voice sections are deleted or added and the ratio by which the utterance speed in voice sections is changed. In another configuration, the pre-read voice data is analyzed using a voice/non-voice discrimination means, and the voice output control means uses the results to determine the ratio at which non-voice sections are deleted or added and the sections in which formant emphasis is performed. In a further configuration, the pre-read voice data is analyzed using a phoneme analysis means, and the voice output control means uses the results to determine the ratio at which non-voice sections are deleted or added and the sections in which consonant emphasis is performed.

[0007]

Operation
With these configurations, non-voice sections are deleted or added, and signal processing such as speed conversion, consonant emphasis, and formant emphasis is applied only to the portions where each is effective. Processing the voice in this way allows voice information to be reproduced more efficiently and in an easy-to-hear form.

[0008]

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
A first embodiment of the present invention is described below with reference to the drawings. FIG. 1 is a configuration diagram of the voice reproducing device in the first embodiment of the present invention. In FIG. 1, 11 is a voice storage medium, 12 is a first read-ahead reproduction control means, 13 is a second read-ahead reproduction control means, 14 is a voice/non-voice discrimination means, 15 is an utterance speed calculation means, 16 is a storage means, 17 is a voice output control means, 18 is a reproduction control means, and 19 is a speed conversion processing means.

[0009] The operation is described below. It is assumed that a voice signal has already been recorded on the voice storage medium 11. Before this recorded voice signal is output, it is analyzed in advance, and the subsequent operation of the reproduction control means 18 and the speed conversion processing means 19 is changed according to the result. The first read-ahead reproduction control means 12 reads voice data, as necessary, before it is reproduced by the reproduction control means 18, and the voice/non-voice discrimination means 14 classifies that voice data into voice sections and non-voice sections. The second read-ahead reproduction control means 13 likewise reads voice data, as necessary, before it is reproduced by the reproduction control means 18, and the utterance speed calculation means 15 estimates the utterance speed of that voice data. The storage means 16 stores the addresses of the voice sections and non-voice sections determined by the voice/non-voice discrimination means 14, together with the utterance speed information estimated by the utterance speed calculation means 15. Based on the given reproduction condition and the analysis results for the pre-read voice data held in the storage means 16, the voice output control means 17 determines the length of data to be deleted from or added to non-voice sections, and also determines the parameters of the speed conversion processing that changes the utterance speed of voice sections. Because the utterance speed and the non-voice section information of the voice data about to be reproduced have been analyzed in advance, these values can be determined effectively for the given reproduction condition. That is, when the speed conversion processing expands or compresses non-voice sections and voice sections independently along the time axis, each parameter can be determined taking into account the proportion of time occupied by voice data. This makes possible a natural utterance speed conversion similar to the way a person changes speaking speed. Further, even when a reproduction condition such as "go back two sentences and repeat playback" is given, the address of the voice data to be reproduced on the voice storage medium is easily found by referring to the storage means. Likewise, when a reproduction condition such as "play the sentence following the one currently being played" is given, the address of the voice data to be reproduced is easily found by referring to the storage means in the same way, provided that a sufficiently large amount of data has been read ahead.
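One way the voice output control means might split a requested overall speed between non-voice and voice sections is sketched below. The patent does not specify the rule; the policy shown here (adjust pauses first, within assumed limits `min_pause_scale` and `max_pause_scale`, then absorb the remainder by changing the voice rate) and all names are assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Section:
    start: float    # start address, expressed here in seconds
    end: float      # end address, expressed here in seconds
    is_voice: bool  # result stored by the voice/non-voice discrimination

    @property
    def duration(self) -> float:
        return self.end - self.start


def decide_parameters(sections: List[Section], target_speed: float,
                      min_pause_scale: float = 0.2,
                      max_pause_scale: float = 3.0) -> Tuple[float, float]:
    """Return (pause_scale, voice_rate).

    pause_scale multiplies the length of every non-voice section
    (<1 deletes data, >1 adds data). voice_rate is the speed factor
    the speed conversion stage applies to voice sections.
    """
    if target_speed <= 0:
        raise ValueError("target_speed must be positive")
    voice = sum(s.duration for s in sections if s.is_voice)
    pause = sum(s.duration for s in sections if not s.is_voice)
    target_total = (voice + pause) / target_speed

    if voice == 0:
        # Only pauses were found: all of the change falls on them.
        return (target_total / pause if pause else 1.0), 1.0

    if pause > 0:
        # Try to reach the target by changing pauses alone, within limits.
        pause_scale = (target_total - voice) / pause
        pause_scale = min(max(pause_scale, min_pause_scale), max_pause_scale)
    else:
        pause_scale = 1.0

    # The time left after the pauses must be filled by the voice sections.
    remaining = target_total - pause * pause_scale
    if remaining <= 0:
        raise ValueError("target speed cannot be reached with this pause limit")
    return pause_scale, voice / remaining


sections = [Section(0.0, 6.0, True), Section(6.0, 10.0, False)]
pause_scale, voice_rate = decide_parameters(sections, target_speed=2.0)
print(pause_scale, voice_rate)
```

In this example the overall duration is halved, yet the voice sections are sped up by less than a factor of two because most of the saving comes from shortening the pause. This is the sense in which the proportion of time occupied by voice data enters the parameter decision.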

[0010] As described above, according to this embodiment, the results of analyzing the pre-read voice data with the voice/non-voice discrimination means and the utterance speed calculation means are stored. This allows the parameters of the speed conversion processing, including the deletion and addition of non-voice sections, to be set more accurately, and at the same time makes skip playback and repeat playback in units of voice sections easy to perform.
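Skip and repeat playback in units of voice sections reduces to a lookup in the stored section addresses. The sketch below assumes that the storage means holds a sorted list of voice-section start addresses and that one voice section approximates one sentence; both the data layout and the function name are hypothetical.

```python
from bisect import bisect_right
from typing import List, Optional


def section_start(voice_starts: List[int], current_address: int,
                  offset: int) -> Optional[int]:
    """Find the start address of a voice section relative to the current one.

    voice_starts: sorted start addresses of the analyzed voice sections.
    offset: -2 means "two sections back", +1 means "the next section".
    Returns None when the requested section lies outside the analyzed range,
    for example when not enough data has been read ahead.
    """
    # Index of the section that contains (or most recently precedes) the
    # current playback address.
    current = bisect_right(voice_starts, current_address) - 1
    target = current + offset
    if 0 <= target < len(voice_starts):
        return voice_starts[target]
    return None


starts = [0, 4800, 12000, 20000]
print(section_start(starts, 13000, -2))  # two sections back -> 0
print(section_start(starts, 13000, +1))  # next section -> 20000
```

A `None` result for a forward request corresponds to the condition in the text that the read-ahead amount must be large enough for the next sentence to have been analyzed already.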

[0011] Next, a second embodiment of the present invention is described with reference to the drawings. FIG. 2 is a configuration diagram of the voice reproducing device in the second embodiment of the present invention. In FIG. 2, 11 is a voice storage medium, 12 is a first read-ahead reproduction control means, 14 is a voice/non-voice discrimination means, 16 is a storage means, 17 is a voice output control means, 18 is a reproduction control means, and 20 is a formant emphasis means.

[0012] The operation is described below. The basic operation is the same as in the first embodiment: before the recorded voice signal is output, it is analyzed in advance, and the subsequent operation of the reproduction control means 18 and the formant emphasis means 20 is changed according to the result. The first read-ahead reproduction control means 12 reads voice data, as necessary, before it is reproduced by the reproduction control means 18, and the voice/non-voice discrimination means 14 classifies that voice data into voice sections and non-voice sections. The storage means 16 stores the addresses of the voice sections and non-voice sections determined by the voice/non-voice discrimination means 14. Based on the given reproduction condition and the analysis results for the pre-read voice data held in the storage means 16, the voice output control means 17 determines the length of data to be deleted from or added to non-voice sections, and also determines the parameters of the formant emphasis applied to voice sections. To make the voice signal easier to hear, formant emphasis is applied only to voice sections; non-voice sections are left unprocessed so that the characteristics of the signal there do not change. Because the non-voice section information of the voice data about to be reproduced has been analyzed in advance, these values can be determined effectively for the given reproduction condition. As described above, according to this embodiment, the results of analyzing the pre-read voice data with the voice/non-voice discrimination means are stored. This allows deletion and addition of non-voice sections and formant emphasis to be performed effectively, and at the same time makes skip playback and repeat playback in units of voice sections easy to perform.
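Restricting a process to voice sections, as described for formant emphasis, can be sketched as a gating step driven by the stored section addresses. The function below is illustrative only: `enhance` stands in for whatever formant emphasis filter is used (the example passes a plain gain as a placeholder, which is not formant emphasis), and the section layout as `(start, end)` sample indices is an assumption.

```python
from typing import Callable, List, Sequence, Tuple


def process_voice_sections(samples: Sequence[float],
                           voice_sections: Sequence[Tuple[int, int]],
                           enhance: Callable[[List[float]], List[float]]) -> List[float]:
    """Apply `enhance` to voice sections only; copy everything else unchanged.

    voice_sections: (start, end) sample indices, end exclusive, as they
    might be recorded after voice/non-voice discrimination.
    """
    out = list(samples)
    for start, end in voice_sections:
        segment = enhance(out[start:end])
        if len(segment) != end - start:
            raise ValueError("enhance must preserve the segment length")
        out[start:end] = segment
    return out


# Placeholder enhancement: a fixed gain, used only to make the gating visible.
def placeholder_gain(segment: List[float]) -> List[float]:
    return [2.0 * x for x in segment]


signal = [0.1] * 10
print(process_voice_sections(signal, [(2, 5)], placeholder_gain))
```

Samples outside the listed sections are returned untouched, which mirrors the requirement that non-voice sections keep their original signal characteristics.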

[0013] Next, a third embodiment of the present invention is described with reference to the drawings. FIG. 3 is a configuration diagram of the voice reproducing device in the third embodiment of the present invention. In FIG. 3, 11 is a voice storage medium, 12 is a first read-ahead reproduction control means, 16 is a storage means, 17 is a voice output control means, 18 is a reproduction control means, 30 is a phoneme analysis means, and 31 is a consonant emphasis means.

[0014] The operation is described below. The basic operation is the same as in the first embodiment: before the recorded voice signal is output, it is analyzed in advance, and the subsequent operation of the reproduction control means 18 and the consonant emphasis means 31 is changed according to the result. The first read-ahead reproduction control means 12 reads voice data, as necessary, before it is reproduced by the reproduction control means 18. That voice data is analyzed using the phoneme analysis means 30 to obtain the information needed for deleting and adding non-voice sections and, at the same time, for utterance speed conversion, consonant emphasis, formant emphasis and the like in voice sections. The storage means 16 stores the information obtained by the phoneme analysis means 30. Based on the given reproduction condition and the analysis results for the pre-read voice data held in the storage means 16, the voice output control means 17 determines the length of data to be deleted from or added to non-voice sections, and also determines the consonant emphasis parameters according to the types of consonants contained in the voice sections. Because the non-voice section information of the voice data about to be reproduced has been analyzed in advance, these values can be determined effectively for the given reproduction condition. As described above, according to this embodiment, the results of analyzing the pre-read voice data with the phoneme analysis means are stored. This allows deletion and addition of non-voice sections and consonant emphasis to be performed effectively, and at the same time makes skip playback and repeat playback in units of voice sections easy to perform.
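Choosing consonant emphasis parameters by consonant type can be pictured as a table lookup over the phoneme analysis results. The patent gives no consonant classes or gain values, so the class names, the decibel figures, and the segment format below are all hypothetical and serve only to show the shape of such a decision.

```python
from typing import Iterable, List, Tuple

# Illustrative values only: the source does not state classes or gains.
GAIN_DB_BY_CLASS = {
    "plosive": 6.0,
    "fricative": 4.0,
    "nasal": 2.0,
}


def consonant_gains(segments: Iterable[Tuple[int, int, str]]) -> List[Tuple[int, int, float]]:
    """Build an emphasis plan from phoneme-labelled segments.

    segments: (start, end, phoneme_class) tuples as a phoneme analysis
    stage might store them. Returns (start, end, linear_gain) for each
    segment whose class has an entry in the table; vowels, silence and
    unknown classes are left out and therefore stay unprocessed.
    """
    plan = []
    for start, end, phoneme_class in segments:
        gain_db = GAIN_DB_BY_CLASS.get(phoneme_class)
        if gain_db is not None:
            # Convert decibels to a linear amplitude factor.
            plan.append((start, end, 10.0 ** (gain_db / 20.0)))
    return plan


analysis = [(0, 100, "silence"), (100, 160, "plosive"), (160, 400, "vowel")]
print(consonant_gains(analysis))
```

The plan can then be handed to the consonant emphasis stage, which only has to touch the listed address ranges.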

[0015]

Effects of the Invention
In the present invention, the voice signal pre-read by the read-ahead reproduction control means is analyzed in advance by the analysis signal processing means, and the voice data to be reproduced and the way the signal processing means is applied are determined from this analysis result and the given reproduction condition, so that an excellent processing effect is obtained. Using a voice/non-voice discrimination means as the analysis signal processing means makes it possible to delete and add non-voice sections and, at the same time, to obtain information useful for utterance speed conversion and formant emphasis in voice sections. Using a phoneme analysis means as the analysis signal processing means makes it possible to delete and add non-voice sections and, at the same time, to obtain information useful for utterance speed conversion, consonant emphasis, and formant emphasis in voice sections. With these configurations, an excellent voice reproducing device can be realized that not only reproduces the voice signal on the voice storage medium naturally at variable speed, but also makes the consonant or vowel portions of voice sections easier to hear.

[Brief description of drawings]

FIG. 1 is a configuration diagram of the voice reproducing device in the first embodiment of the present invention.

FIG. 2 is a configuration diagram of the voice reproducing device in the second embodiment of the present invention.

FIG. 3 is a configuration diagram of the voice reproducing device in the third embodiment of the present invention.

FIG. 4 is a configuration diagram of a conventional voice reproducing device.

FIG. 5 is a schematic diagram of the output signal of a conventional voice reproducing device.

[Explanation of symbols]

11: voice storage medium, 12: first read-ahead reproduction control means, 13: second read-ahead reproduction control means, 14: voice/non-voice discrimination means, 15: utterance speed calculation means, 16: storage means, 17: voice output control means, 18: reproduction control means, 19: speed conversion processing means, 20: formant emphasis means, 30: phoneme analysis means, 31: consonant emphasis means, 41: magnetic tape reproducing device, 42: buffer memory, 43: memory control circuit, 44: reproduction control circuit.

Claims

[Claims]

1. A voice reproducing device comprising: a voice storage medium on which a voice signal is recorded; read-ahead reproduction control means for reading ahead and reproducing the voice data stored on the voice storage medium; analysis signal processing means for analyzing or signal-processing the output of the read-ahead reproduction control means; storage means for storing the analysis and calculation results of the analysis signal processing means; voice output control means for determining how the voice data is to be reproduced with reference to a given reproduction condition and the values in the storage means; reproduction control means for receiving, from the voice output control means, information on the data to be read and reading the voice data; and signal processing means for applying predetermined processing to the output signal of the reproduction control means under the control of the voice output control means, wherein the time-series voice signal to be reproduced is determined from the result of analyzing the pre-read data, subjected to the predetermined processing, and reproduced.

2. The voice reproducing device according to claim 1, wherein a voice/non-voice discrimination means and an utterance speed calculation means are provided as the analysis signal processing means; the voice output control means determines, based on a given reproduction condition, the ratio at which non-voice sections are deleted or added and the ratio by which the utterance speed in voice sections is changed; the reproduction control means deletes and adds non-voice sections; and the speed conversion processing means performs signal processing that changes the utterance speed, thereby varying the voice reproduction speed.

3. The voice reproducing device according to claim 1, wherein a voice/non-voice discrimination means is provided as the analysis signal processing means; the voice output control means determines, based on a given reproduction condition, the ratio at which non-voice sections are deleted or added and the sections in which formant emphasis is performed; the reproduction control means deletes and adds non-voice sections; and the formant emphasis means performs signal processing that emphasizes predetermined formants, thereby emphasizing the formants of voice portions while deleting and adding non-voice sections.

4. The voice reproducing device according to claim 1, wherein a phoneme analysis means is provided as the analysis signal processing means; the voice output control means determines, based on a given reproduction condition, the ratio at which non-voice sections are deleted or added and the ratio by which the utterance speed of voice sections is changed; the reproduction control means deletes and adds non-voice sections; and the speed conversion processing means performs signal processing that changes the utterance speed, thereby varying the voice reproduction speed.

5. The voice reproducing device according to claim 1, wherein a phoneme analysis means is provided as the analysis signal processing means; the voice output control means determines, based on a given reproduction condition, the ratio at which non-voice sections are deleted or added and the sections in which consonant emphasis is performed; the reproduction control means deletes and adds non-voice sections; and the consonant emphasis means performs signal processing that emphasizes predetermined consonants, thereby emphasizing consonant portions while expanding or contracting non-voice sections.

6. The voice reproducing device according to claim 1, wherein a phoneme analysis means is provided as the analysis signal processing means; the voice output control means determines, based on a given reproduction condition, the ratio at which non-voice sections are deleted or added and the sections in which formant emphasis is performed; the reproduction control means deletes and adds non-voice sections; and the formant emphasis means performs formant emphasis signal processing on predetermined vowels, thereby emphasizing vowel formants while expanding or contracting non-voice sections.

7. The voice reproducing device according to claim 1, wherein a phoneme analysis means is provided as the analysis signal processing means; the voice output control means determines, based on a given reproduction condition, the ratio at which non-voice sections are deleted or added, the ratio by which the utterance speed of voice sections is changed, and the sections in which consonant emphasis is performed; the reproduction control means deletes and adds non-voice sections; the speed conversion processing means performs signal processing that changes the utterance speed; and the consonant emphasis means performs signal processing that emphasizes predetermined consonants, thereby varying the voice reproduction speed and emphasizing consonant portions.

8. The voice reproducing device according to claim 1, wherein a phoneme analysis means is provided as the analysis signal processing means; the voice output control means determines, based on a given reproduction condition, the ratio at which non-voice sections are deleted or added, the ratio by which the utterance speed of voice sections is changed, and the sections in which formant emphasis is performed; the reproduction control means deletes and adds non-voice sections; the speed conversion processing means performs signal processing that changes the utterance speed; and the formant emphasis means performs formant emphasis signal processing on predetermined vowels, thereby varying the voice reproduction speed and emphasizing vowel formants.

9. The voice reproducing device according to claim 1, wherein a phoneme analysis means is provided as the analysis signal processing means; the voice output control means determines, based on a given reproduction condition, the ratio at which non-voice sections are deleted or added, the ratio by which the utterance speed of voice sections is changed, the sections in which consonant emphasis is performed, and the sections in which formant emphasis is performed; the reproduction control means deletes and adds non-voice sections; the speed conversion processing means performs signal processing that changes the utterance speed; the consonant emphasis means performs signal processing that emphasizes predetermined consonants; and the formant emphasis means performs formant emphasis signal processing on predetermined vowels, thereby varying the voice reproduction speed while emphasizing consonants and vowel formants.