JP2001318700A

JP2001318700A - Speech speed converter

Info

Publication number: JP2001318700A
Application number: JP2001014067A
Authority: JP
Inventors: Tatsuo Inoue; 健生井上
Original assignee: Sanyo Electric Co Ltd
Current assignee: Sanyo Electric Co Ltd
Priority date: 2000-02-28
Filing date: 2001-01-23
Publication date: 2001-11-16

Abstract

PROBLEM TO BE SOLVED: To provide a speech speed converter in which the amount of unread voice data accumulated in a voice data accumulation memory does not exceed the capacity of the memory, without greatly increasing the speech speed of the output voice, even when the amount of unread voice data accumulated in the memory increases. SOLUTION: The speech speed converter is provided with a speech speed conversion processing means which performs speech speed conversion processing on input voice signals input from a voice reproducing device, a voice data accumulation memory into which the output of the speech speed conversion processing means is written, a voice data reading means which reads voice data from the memory, a computing means which computes the accumulation rate of unread voice data in the memory, and a control means which controls the reproducing speed of the voice reproducing device in accordance with the accumulation rate of the unread voice data in the memory.

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

[Technical Field of the Invention] The present invention relates to a speech speed conversion device.

[0002]

[Description of the Related Art] A speech speed conversion device is known that, during high-speed playback on a VTR, deletes the silent-section audio signals from the audio signals read from the video tape, compresses the voice-section audio signals on the time axis with a time axis compression/expansion unit, and outputs the voice-section audio at a speed lower than the VTR playback speed set by the user (the set playback speed) (see Japanese Patent Application Laid-Open No. 7-192392).

[0003] Such a speech speed conversion device is provided with a ring memory (voice data storage memory) that absorbs the time delay arising between the input voice and the output voice when the speech speed of the input voice is reduced for output. If the amount of unread voice data accumulated in the ring memory exceeds the capacity of the ring memory, the voice-section audio that is output is interrupted.

[0004] Therefore, to keep the amount of unread voice data accumulated in the ring memory from exceeding the capacity of the ring memory, the compression rate of the time axis compression/expansion unit is changed when the accumulated amount exceeds a predetermined amount. However, doing so raises the problem that the speech speed of the output voice increases.

[0005] Speech speed conversion devices that slow down the speech speed of voice output from an audio playback device such as a tape recorder have also been put to practical use, as hearing aids for elderly people and others or for language learning. The same problem arises in that case as well.

[0006]

[Problems to Be Solved by the Invention] An object of the present invention is to provide a speech speed conversion device that, even when the amount of unread voice data accumulated in the voice data storage memory increases, can keep that amount from exceeding the capacity of the voice data storage memory without greatly increasing the speech speed of the output voice.

[0007]

[Means for Solving the Problems] A first speech speed conversion device according to the present invention comprises speech speed conversion processing means for performing speech speed conversion processing on an input audio signal input from an audio playback device, a voice data storage memory into which the output of the speech speed conversion processing means is written, and means for reading voice data from the voice data storage memory, and is characterized by further comprising calculating means for calculating the accumulation rate of unread voice data in the voice data storage memory, and control means for controlling the playback speed of the audio playback device in accordance with that accumulation rate.

[0008] As the speech speed conversion processing means, for example, one is used that comprises section determining means for determining whether the input audio signal is a voice section or a silent section, deletion processing means for deleting the input audio signal determined to be a silent section, and time axis compression/expansion processing means for performing time axis compression/expansion processing on the input audio signal determined to be a voiced section, at a compression rate corresponding to the accumulation rate of unread voice data in the memory.

[0009] As the audio playback device, for example, a VTR or a hard disk recorder is used.

[0010] A second speech speed conversion device according to the present invention comprises A/D conversion means for sampling an analog audio signal input from an audio playback device at a sampling frequency corresponding to a set playback speed magnification, a frame memory to which the voice data output from the A/D conversion means are input, speech speed conversion processing means for performing speech speed conversion processing on the voice data each time a required number of voice data have been input to the frame memory, a voice data storage memory into which the output of the speech speed conversion processing means is written, and means for reading voice data from the voice data storage memory, and is characterized by further comprising calculating means for calculating the accumulation rate of unread voice data in the voice data storage memory, and control means for controlling the playback speed of the audio playback device in accordance with that accumulation rate.

[0011] A third speech speed conversion device according to the present invention comprises a frame memory into which a digital audio signal input from an audio playback device is written at a speed corresponding to a set playback speed magnification, speech speed conversion processing means for performing speech speed conversion processing on the voice data each time a required number of voice data have been input to the frame memory, a voice data storage memory into which the output of the speech speed conversion processing means is written, and means for reading voice data from the voice data storage memory, and is characterized by further comprising calculating means for calculating the accumulation rate of unread voice data in the voice data storage memory, and control means for controlling the playback speed of the audio playback device in accordance with that accumulation rate.

[0012] As the speech speed conversion processing means in the second or third speech speed conversion device, for example, one is used that comprises section determining means for determining whether the input voice corresponding to the required number of voice data input to the frame memory is a voice section or a silent section, deletion processing means for deleting the voice data determined to be a silent section, and time axis compression/expansion processing means for performing time axis compression/expansion processing on the voice data determined to be a voiced section, at a compression rate corresponding to the accumulation rate of unread voice data in the voice data storage memory.

[0013]

[Embodiments of the Invention] Embodiments of the present invention will be described below with reference to the drawings.

[0014] [1] Description of the First Embodiment

[0015] FIG. 1 shows the configuration of a speech speed conversion device that, during high-speed playback on a VTR, outputs voice at a speed lower than the playback speed of the VTR 20 set by the user (the set playback speed). Although not shown in FIG. 1, the video signal output from the VTR is displayed on a monitor (not shown).

[0016] The audio signal output from the VTR 20 is sent to the A/D conversion unit 1 and converted into, for example, a 12-bit digital signal.

[0017] The output of the A/D conversion unit 1 is temporarily stored in the frame memory 2. The section determination unit 3, the silent section deletion unit 4, and the time axis compression/expansion unit 5 process the voice data stored in the frame memory 2 one frame at a time.

[0018] The section determination unit 3 determines whether the input voice is a voice section or a silent section based on, for example, the average power, cumulative power, average amplitude, or cumulative amplitude of one frame of voice data. The silent section deletion unit 4 deletes the voice data determined by the section determination unit 3 to be a silent section. The voice data remaining after the silent section deletion unit 4 has deleted the silent-section voice data (that is, the voice-section voice data) are sent to the time axis compression/expansion unit 5, where time axis compression/expansion processing is performed.
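As an illustration only, the frame-by-frame section determination and silent section deletion described above might be sketched as follows. The sketch uses the average power of a frame as the criterion. The threshold value and the function names are assumptions made for this example and do not appear in the specification.

```python
from typing import List, Sequence


def mean_power(frame: Sequence[int]) -> float:
    """Average of the squared sample values over one frame."""
    if not frame:
        return 0.0
    return sum(s * s for s in frame) / len(frame)


def is_voice_frame(frame: Sequence[int], power_threshold: float = 10_000.0) -> bool:
    """Treat a frame as a voice section when its average power reaches the threshold.

    The threshold is a hypothetical value; the specification gives none.
    """
    return mean_power(frame) >= power_threshold


def delete_silent_frames(
    frames: Sequence[Sequence[int]], power_threshold: float = 10_000.0
) -> List[Sequence[int]]:
    """Keep only the frames judged to be voice sections."""
    return [f for f in frames if is_voice_frame(f, power_threshold)]
```

A real device could equally use the cumulative power or an amplitude measure, as the paragraph above mentions. Only the decision rule inside `is_voice_frame` would change.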

[0019] The voice data that have undergone time axis compression/expansion processing in the time axis compression/expansion unit 5 are temporarily accumulated in the ring memory (voice data storage memory) 6. The voice data accumulated in the ring memory 6 are read out, sent to the D/A conversion unit 9, converted into an analog signal, and output at a constant speed.

[0020] The accumulation rate of unread voice data in the ring memory 6 is calculated by the accumulation rate calculation unit 7. Here, the accumulation rate of unread voice data in the ring memory 6 means the ratio [%] of the amount of unread voice data accumulated to the total amount of voice data that the ring memory 6 can store. The accumulation rate calculated by the accumulation rate calculation unit 7 is sent to the adaptive speech speed control unit 8 and also to the playback speed control unit 21, which controls the playback speed of the VTR 20.
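The accumulation rate defined above can be illustrated with a minimal ring memory sketch. The class name, its methods, and the choice to raise an error on overflow are assumptions made for this example. The specification only states that the output voice is interrupted when the unread data exceed the capacity.

```python
from typing import Iterable, List


class RingMemory:
    """Fixed-capacity ring buffer that tracks the amount of unread data."""

    def __init__(self, capacity: int) -> None:
        if capacity <= 0:
            raise ValueError("capacity must be positive")
        self._buf: List[int] = [0] * capacity
        self._capacity = capacity
        self._read_pos = 0   # index of the oldest unread sample
        self._unread = 0     # number of samples written but not yet read

    def write(self, samples: Iterable[int]) -> None:
        for s in samples:
            if self._unread == self._capacity:
                # Hypothetical policy: refuse to overwrite unread data.
                raise OverflowError("unread data would exceed the ring memory capacity")
            self._buf[(self._read_pos + self._unread) % self._capacity] = s
            self._unread += 1

    def read(self, n: int) -> List[int]:
        n = min(n, self._unread)
        out = [self._buf[(self._read_pos + i) % self._capacity] for i in range(n)]
        self._read_pos = (self._read_pos + n) % self._capacity
        self._unread -= n
        return out

    def accumulation_rate(self) -> float:
        """Unread amount as a percentage of the total capacity."""
        return 100.0 * self._unread / self._capacity
```

With a capacity of 10 samples, writing 4 samples gives an accumulation rate of 40%, and reading 2 of them lowers it to 20%.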

[0021] In the following description, the compression rate is defined as P/Q, where P is the time length (number of data) of the input signal to the time axis compression/expansion unit 5 and Q is the time length (number of data) of the output signal that the time axis compression/expansion unit 5 outputs for that input signal. The accumulation rate of unread voice data in the ring memory 6 is referred to simply as the accumulation rate.
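To make the definition P/Q concrete, the following small sketch computes the output length Q for a given input length P and compression rate. The function name and the rounding to a whole number of samples are assumptions made for this example.

```python
def output_length(input_length: int, compression_rate: float) -> int:
    """Return Q, the number of output samples, given P and the rate P/Q."""
    if compression_rate <= 0:
        raise ValueError("compression rate must be positive")
    # Q = P / (P/Q); rounded because sample counts are integers.
    return round(input_length / compression_rate)
```

A rate above 1 shortens the signal (compression), a rate below 1 lengthens it (expansion), and a rate of 1 leaves the length unchanged. For instance, 1200 input samples at a rate of 1.2 yield 1000 output samples.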

[0022] The adaptive speech speed control unit 8 controls the compression rate used in the time axis compression/expansion unit 5 based on the accumulation rate. The playback speed control unit 21 controls the actual playback speed of the VTR 20 (the actual playback speed magnification) based on the playback speed magnification of the VTR 20 set by the user (hereinafter, the set playback speed magnification) and the accumulation rate.

[0023] The standard sampling frequency of the A/D conversion unit 1 and that of the D/A conversion unit 9 are 8 kHz in this example. When the playback speed magnification of the VTR 20 is M, the sampling frequency f_AD of the A/D conversion unit 1 is set to M times the sampling frequency f_DA of the D/A conversion unit 9, so that the sampling data obtained by the A/D conversion unit 1 during M-times-speed playback match the sampling data obtained by the A/D conversion unit 1 during playback at the standard playback speed. Accordingly, when M = 2 (double-speed playback), f_AD = 16 kHz and f_DA = 8 kHz. The sampling frequency f_DA of the D/A conversion unit 9 is always kept at the standard sampling frequency (8 kHz) regardless of the playback speed magnification.
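The relationship f_AD = M × f_DA can be written as a one-line helper. The constant and function names are assumptions made for this example.

```python
STANDARD_SAMPLING_HZ = 8_000  # standard sampling frequency used in this example


def ad_sampling_frequency(magnification: float, f_da: float = STANDARD_SAMPLING_HZ) -> float:
    """Sampling frequency of the A/D conversion for a playback speed magnification M."""
    if magnification <= 0:
        raise ValueError("playback speed magnification must be positive")
    return magnification * f_da
```

At M = 2 this gives 16 kHz, while the D/A side stays at 8 kHz whatever the value of M.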

[0024] The operation of the adaptive speech speed control unit 8 and the playback speed control unit 21 will now be described for the case where, during double-speed playback, voice is output at a speed lower than the set playback speed.

[0025] Table 1 shows the relationship between the accumulation rate and the compression rate, and between the accumulation rate and the playback speed magnification, when the set playback speed magnification is 2. In Table 1, the remaining memory rate is the value obtained by subtracting the accumulation rate [%] from 100.

[0026]

[Table 1]

[0027] The adaptive speech speed control unit 8 has an accumulation rate/compression rate table that stores the relationship between the accumulation rate and the compression rate shown in Table 1. The playback speed control unit 21 has an accumulation rate/playback speed magnification table that stores the relationship between the accumulation rate and the playback speed magnification shown in Table 1.

[0028] When the adaptive speech speed control unit 8 receives an accumulation rate from the accumulation rate calculation unit 7, it reads the compression rate corresponding to that accumulation rate from the accumulation rate/compression rate table and sets it in the time axis compression/expansion unit 5. When the playback speed control unit 21 receives an accumulation rate from the accumulation rate calculation unit 7, it reads the playback speed magnification corresponding to that accumulation rate from the accumulation rate/playback speed magnification table and controls the playback speed of the VTR 20 to the speed corresponding to the magnification read out.

[0029] (1) When the accumulation rate is 0 to 20% (0% or more and less than 20%)

When the accumulation rate is 0 to 20%, the compression rate is set to 1 and the playback speed magnification is set to 2, which is the set playback speed magnification. In this case, the audio signal output from the VTR 20 at the playback speed corresponding to the set playback speed magnification of 2 is sampled by the A/D conversion unit 1 at twice the standard sampling frequency of the D/A conversion unit 9 (16 kHz) and stored in the frame memory 2.

[0030] After the silent section deletion unit 4 deletes the silent-section data from the voice data stored in the frame memory 2, the data are accumulated in the ring memory 6 without time axis compression/expansion processing being performed by the time axis compression/expansion unit 5. The voice data accumulated in the ring memory 6 are converted by the D/A conversion unit 9 at the standard sampling frequency (8 kHz) and output. The speech speed of the output voice is therefore equal to that of the output voice when played back at the standard playback speed (the playback speed at 1x playback).

[0031] Because data are written to the ring memory 6 faster than they are read from it, the amount of unread voice data accumulated in the ring memory 6 increases. The fewer silent-section data the input voice data contain, the faster that amount increases.
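The balance between writing and reading can be estimated with a simple model. This is a sketch under stated assumptions, not a formula taken from the specification: it assumes that the A/D side produces M times as many samples per second as the D/A side consumes, that silent section deletion keeps a fraction `voiced_fraction` of them, and that time axis compression divides the remainder by the compression rate. The function and parameter names are hypothetical.

```python
def write_read_ratio(
    magnification: float, voiced_fraction: float, compression_rate: float
) -> float:
    """Estimated ratio of data written to the ring memory to data read from it.

    A value above 1 means the unread data grow; a value of 1 or less means
    they stay level or shrink.
    """
    if not 0.0 <= voiced_fraction <= 1.0:
        raise ValueError("voiced_fraction must be between 0 and 1")
    if magnification <= 0 or compression_rate <= 0:
        raise ValueError("magnification and compression_rate must be positive")
    return magnification * voiced_fraction / compression_rate
```

Under this model, double-speed playback of input with no silence and a compression rate of 1 writes twice as fast as it reads. Raising the compression rate or lowering the playback speed magnification brings the ratio down toward 1.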

[0032] (2) When the accumulation rate is 20 to 40%

When the accumulation rate is 20 to 40%, the compression rate is set to 1.2. The playback speed magnification, however, remains 2. In this case, the time axis compression/expansion unit 5 performs time axis compression processing on the input data so that the ratio of the time length P of the input signal to the time length Q of the output signal is 1.2:1. As a result, the speech speed of the output voice becomes slightly higher than that of the output voice when played back at the standard playback speed (the playback speed at 1x playback). On the other hand, because the amount of voice-section voice data input to the ring memory 6 is reduced, the ratio of the amount of data written to the ring memory 6 to the amount of data read from the ring memory 6 can be made smaller than in case (1) above.

[0033] (3) When the accumulation rate is 40 to 60%

When the accumulation rate is 40 to 60%, the compression rate is set to 1.4. The playback speed magnification, however, remains 2. In this case, the time axis compression/expansion unit 5 performs time axis compression processing on the input data so that the ratio of the time length P of the input signal to the time length Q of the output signal is 1.4:1. As a result, the speech speed of the output voice becomes higher still than in case (2) above. On the other hand, because the amount of voice-section voice data input to the ring memory 6 is reduced further than in case (2), the ratio of the amount of data written to the ring memory 6 to the amount of data read from the ring memory 6 can be made smaller than in case (2).

[0034] (4) When the accumulation rate is 60 to 80%

When the accumulation rate is 60 to 80%, the compression rate is set to 1.4 and the playback speed magnification is set to 1.8. In this case, the sampling frequency f_AD of the A/D conversion unit 1 is set to 1.8 times the standard sampling frequency f_DA of the D/A conversion unit 9. The time axis compression/expansion unit 5 performs time axis compression processing on the input data so that the ratio of the time length P of the input signal to the time length Q of the output signal is 1.4:1. The playback speed control unit 21 controls the playback speed of the VTR 20 to the speed corresponding to a playback speed magnification of 1.8.

[0035] Because the playback speed magnification is set to 1.8, data are written to the ring memory 6 more slowly than in case (3) above, so the ratio of the amount of data written to the ring memory 6 to the amount of data read from the ring memory 6 can be made smaller than in case (3).

[0036] (5) When the accumulation rate is 80 to 95%

When the accumulation rate is 80 to 95%, the compression rate is set to 1.4 and the playback speed magnification is set to 1.6. In this case, the sampling frequency f_AD of the A/D conversion unit 1 is set to 1.6 times the standard sampling frequency f_DA of the D/A conversion unit 9. The time axis compression/expansion unit 5 performs time axis compression processing on the input data so that the ratio of the time length P of the input signal to the time length Q of the output signal is 1.6:1. The playback speed control unit 21 controls the playback speed of the VTR 20 to the speed corresponding to a playback speed magnification of 1.6.

[0037] Because the playback speed magnification is set to 1.6, data are written to the ring memory 6 more slowly than in case (4) above, so the ratio of the amount of data written to the ring memory 6 to the amount of data read from the ring memory 6 can be made smaller than in case (4).

[0038] (6) When the accumulation rate is 95 to 100%

When the accumulation rate is 95 to 100%, the compression rate is set to 1.4 and the playback speed magnification is set to 1.4. In this case, the sampling frequency f_AD of the A/D conversion unit 1 is set to 1.4 times the standard sampling frequency f_DA of the D/A conversion unit 9. The time axis compression/expansion unit 5 performs time axis compression processing on the input data so that the ratio of the time length P of the input signal to the time length Q of the output signal is 1.4:1. The playback speed control unit 21 controls the playback speed of the VTR 20 to the speed corresponding to a playback speed magnification of 1.4.

[0039] Because the playback speed magnification is set to 1.4, data are written to the ring memory 6 more slowly than in case (5) above, so the ratio of the amount of data written to the ring memory 6 to the amount of data read from the ring memory 6 can be made smaller than in case (5).

[0040] Note that when the accumulation rate of unread voice data is small, for example less than 20%, the deletion operation of the silent section deletion unit 4 may be stopped.

[0041] If a ring memory 6 of smaller capacity is desired, then, as shown in FIG. 2, a voice encoding unit 11 that encodes the voice data output from the time axis compression/expansion unit 5 may be provided before the ring memory 6, and a voice decoding unit 12 that decodes the encoded data read from the ring memory 6 may be provided after the ring memory 6.

[0042] [2] Description of the Second Embodiment

[0043] FIG. 3 shows the configuration of a speech speed conversion device that outputs voice from an audio playback device such as a tape recorder at a speed lower than the standard playback speed. In FIG. 3, components identical to those in FIG. 1 are given the same reference numerals, and their description is omitted.

[0044] In FIG. 3, reference numeral 30 denotes an audio playback device, and reference numeral 31 denotes a playback speed control unit of the audio playback device 30.

[0045] When the playback speed magnification of the audio playback device 30 is M, the sampling frequency f_AD of the A/D conversion unit 1 is set to M times the sampling frequency f_DA of the D/A conversion unit 9, so that the sampling data obtained by the A/D conversion unit 1 during M-times-speed playback match the sampling data obtained by the A/D conversion unit 1 during playback at the standard playback speed. The sampling frequency f_DA of the D/A conversion unit 9 is always kept at the standard sampling frequency regardless of the playback speed magnification.

[0046] The operation of the adaptive speech speed control unit 8 and the playback speed control unit 31 will now be described for the case where playback is at the standard playback speed (the playback speed at 1x playback) and voice is output at a speed lower than the standard playback speed.

[0047] Table 2 shows the relationship between the accumulation rate and the compression rate, and between the accumulation rate and the playback speed magnification, when the set playback speed magnification is 1.

[0048]

[Table 2]

[0049] The adaptive speech speed control unit 8 has an accumulation rate/compression rate table that stores the relationship between the accumulation rate and the compression rate shown in Table 2. The playback speed control unit 31 has an accumulation rate/playback speed magnification table that stores the relationship between the accumulation rate and the playback speed magnification shown in Table 2.

[0050] When the adaptive speech speed control unit 8 receives the accumulation rate of unread voice data from the accumulation rate calculation unit 7, it reads the compression rate corresponding to that accumulation rate from the accumulation rate/compression rate table and sets it in the time axis compression/expansion unit 5. When the playback speed control unit 31 receives the accumulation rate of unread voice data from the accumulation rate calculation unit 7, it reads the playback speed magnification corresponding to that accumulation rate from the accumulation rate/playback speed magnification table and controls the playback speed of the audio playback device 30 to the speed corresponding to the magnification read out.

[0051] (1) When the accumulation rate is 0 to 25%

When the accumulation rate is 0 to 25%, the compression rate is set to 0.7 and the playback speed magnification is set to 1, which is the set playback speed magnification. In this case, the audio signal output from the audio playback device 30 at a playback speed magnification of 1 is sampled by the A/D conversion unit 1 at the same sampling frequency as the standard sampling frequency of the D/A conversion unit 9 and stored in the frame memory 2.

[0052] The audio data stored in the frame memory 2 is sent to the time axis compression/expansion unit 5 after the silent section deletion unit 4 has deleted the silent-section data. The time axis compression/expansion unit 5 performs time axis expansion on the input data (the audio data of the voice sections) so that the ratio of the input signal time length P to the output signal time length Q is 0.7:1.

[0053] The audio data that has undergone time axis expansion in the time axis compression/expansion unit 5 is accumulated in the ring memory 6. The audio data accumulated in the ring memory 6 is converted by the D/A converter 9 at the standard sampling frequency and output.

[0054] Because the audio data of the voice sections is written to the ring memory 6 after being expanded on the time axis, the speech speed of the output audio is slower than that of audio reproduced at the standard reproduction speed. However, the less silent-section data the input contains, the more the amount of unread audio data accumulated in the ring memory 6 increases.
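The balance described here can be put into numbers. As a rough sketch, assume the reproduction device runs at magnification m, a fraction v of the input is voice (the rest is deleted as silence), and the time axis unit maps input length P to output length Q with P:Q = c:1. The ring memory then receives about m·v/c units of data for every unit read out. The function name and the `voice_fraction` parameter are illustrative assumptions, not terms from the specification.

```python
def write_read_ratio(magnification: float, compression_rate: float,
                     voice_fraction: float) -> float:
    """Data written to the ring memory per unit of data read from it."""
    if not 0.0 <= voice_fraction <= 1.0:
        raise ValueError("voice_fraction must be between 0 and 1")
    if compression_rate <= 0:
        raise ValueError("compression_rate must be positive")
    return magnification * voice_fraction / compression_rate


# Case (1): magnification 1, compression rate 0.7.
print(write_read_ratio(1.0, 0.7, 1.0))  # no silence: above 1, the memory fills
print(write_read_ratio(1.0, 0.7, 0.5))  # half silence: below 1, the memory drains
```

A ratio above 1 means the accumulation rate rises, which is why input with little silence pushes the device into the higher bands.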

[0055] (2) When the accumulation rate is 25 to 50%

When the accumulation rate is 25 to 50%, the compression rate is set to 0.8. The reproduction speed magnification remains 1. In this case, the time axis compression/expansion unit 5 performs time axis expansion on the input data so that the ratio of the input signal time length P to the output signal time length Q is 0.8:1. As a result, the speech speed of the output audio is slower than that of audio reproduced at the standard reproduction speed, but slightly faster than in case (1). Because the amount of voice-section audio data input to the ring memory 6 is smaller than in case (1), the ratio of the amount of data written to the ring memory 6 to the amount of data read from it can be made smaller than in case (1).

[0056] (3) When the accumulation rate is 50 to 75%

When the accumulation rate is 50 to 75%, the compression rate is set to 0.9 and the reproduction speed magnification is set to 0.9. In this case, the sampling frequency f_AD of the A/D converter 1 is set to 0.9 times the standard sampling frequency f_DA of the D/A converter 9.
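The relation between the two sampling frequencies is simply f_AD = magnification × f_DA. The short sketch below evaluates it; the 48 kHz value for f_DA is an assumed example, since the specification gives no figure, and the function name is illustrative.

```python
def ad_sampling_frequency(f_da_hz: float, magnification: float) -> float:
    """Sampling frequency of the A/D converter for a given reproduction speed magnification."""
    if f_da_hz <= 0 or magnification <= 0:
        raise ValueError("frequency and magnification must be positive")
    return f_da_hz * magnification


F_DA_HZ = 48_000  # assumed standard sampling frequency, not from the source
for m in (1.0, 0.9, 0.8):
    print(m, ad_sampling_frequency(F_DA_HZ, m))
```

Scaling the sampling frequency together with the reproduction speed presumably keeps one stored sample per original sample period, so that reading the data out at f_DA restores the normal pitch.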

[0057] The time axis compression/expansion unit 5 performs time axis expansion on the input data so that the ratio of the number of data items input per unit time, P, to the number of data items output per unit time, Q, is 0.9:1. The reproduction speed control unit 31 controls the audio reproduction device 30 so that its reproduction speed corresponds to a reproduction speed magnification of 0.9.

[0058] Compared with case (2), the compression rate on the time axis is larger and the reproduction speed magnification is smaller, so the ratio of the amount of data written to the ring memory 6 to the amount of data read from it can be made smaller than in case (2). Moreover, because the reproduction speed magnification is smaller than in case (2), the speech speed of the output audio does not rise as it would if only the compression rate were increased.

[0059] (4) When the accumulation rate is 75 to 100%

When the accumulation rate is 75 to 100%, the compression rate is set to 1.0 and the reproduction speed magnification is set to 0.8. In this case, the sampling frequency f_AD of the A/D converter 1 is set to 0.8 times the standard sampling frequency f_DA of the D/A converter 9.

[0060] The time axis compression/expansion unit 5 does not perform time axis expansion. The reproduction speed control unit 31 controls the audio reproduction device 30 so that its reproduction speed corresponds to a reproduction speed magnification of 0.8.

[0061] Compared with case (3), the compression rate on the time axis is larger and the reproduction speed magnification is smaller, so the ratio of the amount of data written to the ring memory 6 to the amount of data read from it can be made smaller than in case (3). Moreover, because the reproduction speed magnification is smaller than in case (3), the speech speed of the output audio does not rise as it would if only the compression rate were increased.
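Cases (1) to (4) amount to a lookup from accumulation rate to a pair of settings. A minimal sketch of such a lookup follows, using the values stated in those cases. How the band boundaries themselves are assigned is not spelled out for this embodiment, so the sketch assumes each band includes its lower edge; `BAND_EDGES`, `SETTINGS` and `lookup` are illustrative names.

```python
from bisect import bisect_right

# Band boundaries in percent; each band is assumed to include its lower edge.
BAND_EDGES = [25, 50, 75]
# (compression rate, reproduction speed magnification) for cases (1) to (4).
SETTINGS = [(0.7, 1.0), (0.8, 1.0), (0.9, 0.9), (1.0, 0.8)]


def lookup(accumulation_pct):
    """Return (compression rate, magnification) for an accumulation rate in percent."""
    if not 0 <= accumulation_pct <= 100:
        raise ValueError("accumulation rate must be between 0 and 100")
    return SETTINGS[bisect_right(BAND_EDGES, accumulation_pct)]


print(lookup(10))  # (0.7, 1.0)
print(lookup(60))  # (0.9, 0.9)
```

In the device the two halves of each pair live in separate tables, one in the adaptive speech speed control unit 8 and one in the reproduction speed control unit 31; they are merged here only for brevity.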

[0062] When the accumulation rate of unread audio data is small, for example less than 20%, the deletion operation of the silent section deletion unit 4 may be stopped.

[0063] If a ring memory 6 of smaller capacity is desired, then, as in FIG. 3, an audio encoding unit that encodes the audio data output from the time axis compression/expansion unit 5 may be provided upstream of the ring memory 6, and an audio decoding unit that decodes the encoded data read from the ring memory 6 may be provided downstream of it.

[0064] The first and second embodiments describe the case where an analog audio signal is sent from the VTR 20 or the audio reproduction device 30, but the invention can also be applied when digital audio data is sent from the VTR 20 or the audio reproduction device 30. In that case, the digital audio data sent from the VTR 20 or the audio reproduction device 30 is written to the frame memory 2 at a write speed corresponding to the reproduction speed magnification, and data is read from the ring memory at the same speed as that at which audio data is written to the frame memory 2 during 1x-speed reproduction.

[0065] [3] Description of the Third Embodiment

[0066] FIG. 4 shows an example in which the speech speed conversion device is applied to the reproduction circuit of a hard disk recorder. In FIG. 4, components identical to those in FIG. 1 are given the same reference numerals, and their description is omitted.

[0067] In FIG. 4, 40 is a hard disk (HD) provided in the hard disk recorder, on which audio data is stored. 41 is a buffer that temporarily stores audio data read from the hard disk 40 during reproduction. 42 is a reproduction speed control unit that controls the speed at which audio data is output from the buffer 41.

[0068] The audio recording circuit for storing audio data on the hard disk 40 is omitted from FIG. 4. This hard disk recorder has two reproduction modes: a fast-listening mode, which reproduces in a short time while preventing the output audio from becoming rushed and audio information from being lost, and a slow-listening mode, which reproduces at a reduced speech speed. The operation in each of these reproduction modes is described below.

[0069] [3-1] Description of Operation in the Fast-Listening Mode

Table 3 shows, for the fast-listening mode, the relationship between the accumulation rate and the compression rate, and the relationship between the accumulation rate and the magnification of the speed at which audio data is output from the buffer (the reproduction speed magnification).

[0070]

[Table 3]

[0071] The adaptive speech speed control unit 8 has an accumulation rate/compression rate table for the fast-listening mode, which stores the relationship between the accumulation rate and the compression rate in Table 3. The reproduction speed control unit 42 has an accumulation rate/reproduction speed magnification table for the fast-listening mode, which stores the relationship in Table 3 between the accumulation rate and the magnification of the speed at which audio data is output from the buffer.

[0072] When the accumulation rate is sent from the accumulation rate calculation unit 7, the adaptive speech speed control unit 8 reads the compression rate corresponding to that accumulation rate from the accumulation rate/compression rate table for the fast-listening mode and sets it in the time axis compression/expansion unit 5.

[0073] When the accumulation rate is sent from the accumulation rate calculation unit 7, the reproduction speed control unit 42 reads the reproduction speed magnification corresponding to that accumulation rate from the accumulation rate/reproduction speed magnification table for the fast-listening mode and controls the speed at which audio data is output from the buffer 41 so that it corresponds to the magnification read. The speed at which audio data is read from the hard disk 40 is far higher than the speed at which audio data is output from the buffer 41, so the buffer 41 never becomes empty.

[0074] (1) When the accumulation rate is 0 to 20% (0% or more and less than 20%)

When the accumulation rate is 0 to 20%, the compression rate is set to 1 and the reproduction speed magnification is set to 2. In this case, the reproduction speed control unit 42 causes audio data to be output from the buffer 41 at a speed corresponding to twice the standard reproduction speed (the reproduction speed at 1x-speed reproduction).

[0075] The audio data output from the buffer 41 has its silent-section data deleted by the silent section deletion unit 4 and is then accumulated in the ring memory 6 without time axis compression/expansion being performed in the time axis compression/expansion unit 5. The audio data accumulated in the ring memory 6 is read out and output at a speed corresponding to the standard reproduction speed. The speech speed of the output audio is therefore equal to that of audio reproduced at the standard reproduction speed (the reproduction speed at 1x-speed reproduction).

[0076] Because data is written to the ring memory 6 faster than it is read from it, the amount of unread audio data accumulated in the ring memory 6 increases. The less silent-section data the input audio data contains, the faster this amount increases.
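To get a feel for how quickly the memory fills in this band, the following sketch estimates the time needed for the accumulation rate to climb to a target value. It assumes the capacity of the ring memory is expressed in seconds of output audio and that a fixed fraction of the input is voice; both are modelling assumptions, and all names are illustrative.

```python
def seconds_to_reach(target_pct, capacity_s, magnification,
                     compression_rate, voice_fraction, start_pct=0.0):
    """Real-time seconds for the accumulation rate to rise from start_pct to target_pct."""
    # Seconds of audio gained by the ring memory per second of real time.
    net_gain = magnification * voice_fraction / compression_rate - 1.0
    if net_gain <= 0:
        return float("inf")  # the memory never fills under these conditions
    return (target_pct - start_pct) / 100.0 * capacity_s / net_gain


# Case (1): magnification 2, no time axis compression (rate 1), 10 s memory.
print(seconds_to_reach(20, 10.0, 2.0, 1.0, voice_fraction=1.0))
print(seconds_to_reach(20, 10.0, 2.0, 1.0, voice_fraction=0.6))
```

With no silence at all the 20% boundary is reached several times sooner than with 40% silence, which matches the qualitative statement above.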

[0077] (2) When the accumulation rate is 20 to 40%

When the accumulation rate is 20 to 40%, the compression rate is set to 1.2. The reproduction speed magnification remains 2. In this case, the time axis compression/expansion unit 5 performs time axis compression on the input data so that the ratio of the input signal time length P to the output signal time length Q is 1.2:1. As a result, the speech speed of the output audio is slightly faster than that of audio reproduced at the standard reproduction speed (the reproduction speed at 1x-speed reproduction). On the other hand, because the amount of voice-section audio data input to the ring memory 6 is reduced, the ratio of the amount of data written to the ring memory 6 to the amount of data read from it can be made smaller than in case (1).

[0078] (3) When the accumulation rate is 40 to 60%

When the accumulation rate is 40 to 60%, the compression rate is set to 1.4. The reproduction speed magnification remains 2. In this case, the time axis compression/expansion unit 5 performs time axis compression on the input data so that the ratio of the input signal time length P to the output signal time length Q is 1.4:1. As a result, the speech speed of the output audio is faster still than in case (2). On the other hand, because the amount of voice-section audio data input to the ring memory 6 is reduced further than in case (2), the ratio of the amount of data written to the ring memory 6 to the amount of data read from it can be made smaller than in case (2).

[0079] (4) When the accumulation rate is 60 to 80%

When the accumulation rate is 60 to 80%, the compression rate is set to 1.4 and the reproduction speed magnification is set to 1.8. In this case, the reproduction speed control unit 42 causes audio data to be output from the buffer 41 at a speed corresponding to 1.8 times the standard reproduction speed. The time axis compression/expansion unit 5 performs time axis compression on the input data so that the ratio of the input signal time length P to the output signal time length Q is 1.4:1.

[0080] Because the reproduction speed magnification is set to 1.8, data is written to the ring memory 6 more slowly than in case (3), so the ratio of the amount of data written to the ring memory 6 to the amount of data read from it can be made smaller than in case (3). In addition, because the reproduction speed magnification is smaller than in case (3), the output speech does not become as rushed as it would if only the compression rate were increased. That is, the speech can be kept fast only to an extent that remains easy to follow.

[0081] (5) When the accumulation rate is 80 to 95%

When the accumulation rate is 80 to 95%, the compression rate is set to 1.4 and the reproduction speed magnification is set to 1.6. In this case, the reproduction speed control unit 42 causes audio data to be output from the buffer 41 at a speed corresponding to 1.6 times the standard reproduction speed. The time axis compression/expansion unit 5 performs time axis compression on the input data so that the ratio of the input signal time length P to the output signal time length Q is 1.6:1.

[0082] Because the reproduction speed magnification is set to 1.6, data is written to the ring memory 6 more slowly than in case (4), so the ratio of the amount of data written to the ring memory 6 to the amount of data read from it can be made smaller than in case (4). In addition, because the reproduction speed magnification is smaller than in case (4), the output speech does not become as rushed as it would if only the compression rate were increased. That is, the speech can be kept fast only to an extent that remains easy to follow.

[0083] (6) When the accumulation rate is 95 to 100%

When the accumulation rate is 95 to 100%, the compression rate is set to 1.4 and the reproduction speed magnification is set to 1.4. In this case, the reproduction speed control unit 42 causes audio data to be output from the buffer 41 at a speed corresponding to 1.4 times the standard reproduction speed. The time axis compression/expansion unit 5 performs time axis compression on the input data so that the ratio of the input signal time length P to the output signal time length Q is 1.4:1.

[0084] Because the reproduction speed magnification is set to 1.4, data is written to the ring memory 6 more slowly than in case (5), so the ratio of the amount of data written to the ring memory 6 to the amount of data read from it can be made smaller than in case (5). In addition, because the reproduction speed magnification is smaller than in case (5), the output speech does not become as rushed as it would if only the compression rate were increased. That is, the speech can be kept fast only to an extent that remains easy to follow.
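The six cases above form a feedback loop: a fuller memory selects settings that slow the rate at which it fills. The sketch below simulates that loop in simple time steps for the fast-listening mode, using the settings stated in cases (1) to (6). For the 80 to 95% band it uses the stated compression rate setting of 1.4, although the same paragraph also quotes a 1.6:1 ratio for P:Q. The capacity in seconds, the fixed voice fraction, the step size and all names are assumptions made for illustration.

```python
from bisect import bisect_right

EDGES = [20, 40, 60, 80, 95]                    # band boundaries, percent
COMPRESSION = [1.0, 1.2, 1.4, 1.4, 1.4, 1.4]    # cases (1) to (6)
MAGNIFICATION = [2.0, 2.0, 2.0, 1.8, 1.6, 1.4]  # cases (1) to (6)


def peak_accumulation(voice_fraction=1.0, capacity_s=10.0,
                      duration_s=60.0, dt=0.01):
    """Simulate the control loop and return the peak accumulation rate in percent."""
    stored_s = 0.0  # unread audio in the ring memory, in seconds of output
    peak_pct = 0.0
    for _ in range(int(duration_s / dt)):
        pct = 100.0 * stored_s / capacity_s
        band = bisect_right(EDGES, pct)
        ratio = MAGNIFICATION[band] * voice_fraction / COMPRESSION[band]
        # Audio written minus audio read during this step, floored at empty.
        stored_s = max(0.0, stored_s + (ratio - 1.0) * dt)
        peak_pct = max(peak_pct, 100.0 * stored_s / capacity_s)
    return peak_pct


print(round(peak_accumulation(voice_fraction=1.0), 1))
```

Under these assumptions, even input with no silence at all stops filling the memory once the top band is reached, because there the magnification and the compression rate are equal and the amount written matches the amount read.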

[0085] [3-2] Description of Operation in the Slow-Listening Mode

Table 4 shows, for the slow-listening mode, the relationship between the accumulation rate and the compression rate, and the relationship between the accumulation rate and the magnification of the speed at which audio data is output from the buffer.

[0086]

[Table 4]

[0087] The adaptive speech speed control unit 8 has an accumulation rate/compression rate table for the slow-listening mode, which stores the relationship between the accumulation rate and the compression rate in Table 4. The reproduction speed control unit 42 has an accumulation rate/reproduction speed magnification table for the slow-listening mode, which stores the relationship in Table 4 between the accumulation rate and the magnification of the speed at which audio data is output from the buffer.

[0088] When the accumulation rate is sent from the accumulation rate calculation unit 7, the adaptive speech speed control unit 8 reads the compression rate corresponding to that accumulation rate from the accumulation rate/compression rate table for the slow-listening mode and sets it in the time axis compression/expansion unit 5.

[0089] When the accumulation rate is sent from the accumulation rate calculation unit 7, the reproduction speed control unit 42 reads the reproduction speed magnification corresponding to that accumulation rate from the accumulation rate/reproduction speed magnification table for the slow-listening mode and controls the speed at which audio data is output from the buffer 41 so that it corresponds to the magnification read.

[0090] (1) When the accumulation rate is 0 to 25%

When the accumulation rate is 0 to 25%, the compression rate is set to 0.7 and the reproduction speed magnification is set to 1. In this case, the reproduction speed control unit 42 causes audio data to be output from the buffer 41 at a speed corresponding to the standard reproduction speed.

[0091] The audio data output from the buffer 41 is sent to the time axis compression/expansion unit 5 after the silent section deletion unit 4 has deleted the silent-section data. The time axis compression/expansion unit 5 performs time axis expansion on the input data (the audio data of the voice sections) so that the ratio of the input signal time length P to the output signal time length Q is 0.7:1.

[0092] The audio data that has undergone time axis expansion in the time axis compression/expansion unit 5 is accumulated in the ring memory 6. The audio data accumulated in the ring memory 6 is read out and output at a speed corresponding to the standard reproduction speed.

[0093] Because the audio data of the voice sections is written to the ring memory 6 after being expanded on the time axis, the speech speed of the output audio is slower than that of audio reproduced at the standard reproduction speed. However, the less silent-section data the input contains, the more the amount of unread audio data accumulated in the ring memory 6 increases.

[0094] (2) When the accumulation rate is 25 to 50%

When the accumulation rate is 25 to 50%, the compression rate is set to 0.8. The reproduction speed magnification remains 1. In this case, the time axis compression/expansion unit 5 performs time axis expansion on the input data so that the ratio of the input signal time length P to the output signal time length Q is 0.8:1. As a result, the speech speed of the output audio is slower than that of audio reproduced at the standard reproduction speed, but slightly faster than in case (1). Because the amount of voice-section audio data input to the ring memory 6 is smaller than in case (1), the ratio of the amount of data written to the ring memory 6 to the amount of data read from it can be made smaller than in case (1).

[0095] (3) When the accumulation rate is 50 to 75%

When the accumulation rate is 50 to 75%, the compression rate is set to 0.9 and the reproduction speed magnification is set to 0.9. In this case, the reproduction speed control unit 42 causes audio data to be output from the buffer 41 at a speed corresponding to 0.9 times the standard reproduction speed. The time axis compression/expansion unit 5 performs time axis expansion on the input data so that the ratio of the number of data items input per unit time, P, to the number of data items output per unit time, Q, is 0.9:1.

[0096] Compared with case (2), the compression rate on the time axis is larger and the reproduction speed magnification is smaller, so the ratio of the amount of data written to the ring memory 6 to the amount of data read from it can be made smaller than in case (2). Moreover, because the reproduction speed magnification is smaller than in case (2), the speech speed of the output audio does not rise as it would if only the compression rate were increased.

[0097] (4) When the accumulation rate is 75 to 100%

When the accumulation rate is 75 to 100%, the compression rate is set to 1.0 and the reproduction speed magnification is set to 0.8. The reproduction speed control unit 42 causes audio data to be output from the buffer 41 at a speed corresponding to 0.8 times the standard reproduction speed. The time axis compression/expansion unit 5 does not perform time axis expansion.

[0098] Compared with case (3), the compression rate on the time axis is larger and the reproduction speed magnification is smaller, so the ratio of the amount of data written to the ring memory 6 to the amount of data read from it can be made smaller than in case (3). Moreover, because the reproduction speed magnification is smaller than in case (3), the speech speed of the output audio does not rise as it would if only the compression rate were increased.
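The trade-off across the four slow-listening bands can be tabulated. The sketch below takes the settings stated in cases (1) to (4) and computes, for each band, the write/read ratio of the ring memory in the worst case where the input contains no silence. It rests on the reading that data is written at magnification divided by compression rate per unit read, and that the speech speed relative to normal equals the compression rate; both are interpretations of the text, and the names are illustrative.

```python
# (band, compression rate, reproduction speed magnification), cases (1) to (4).
SLOW_MODE = [
    ("0-25%", 0.7, 1.0),
    ("25-50%", 0.8, 1.0),
    ("50-75%", 0.9, 0.9),
    ("75-100%", 1.0, 0.8),
]


def worst_case_ratios(table=SLOW_MODE):
    """Write/read ratio of the ring memory per band when the input has no silence."""
    return [magnification / compression for _, compression, magnification in table]


for (band, compression, _), ratio in zip(SLOW_MODE, worst_case_ratios()):
    print(f"{band}: speech speed x{compression:.1f}, write/read {ratio:.2f}")
```

On this reading the ratio falls from band to band and drops below 1 in the top band, so the memory drains there even without silence, while the speech speed never rises above normal.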

[0099]

[Effect of the Invention] According to the present invention, even when the amount of unread audio data accumulated in the audio data storage memory increases, that amount can be kept from exceeding the capacity of the audio data storage memory without greatly increasing the speech speed of the output audio.

[Brief description of the drawings]

FIG. 1 is a block diagram showing the configuration of a speech speed conversion device according to the first embodiment.

FIG. 2 is a block diagram showing a modification of the first embodiment.

FIG. 3 is a block diagram showing the configuration of a speech speed conversion device according to the second embodiment.

FIG. 4 is a block diagram showing the configuration of a speech speed conversion device according to the third embodiment.

[Explanation of symbols]

1: A/D converter
2: frame memory
3: section determination unit
4: silent section deletion unit
5: time axis compression/expansion unit
6: ring memory
7: accumulation rate calculation unit
8: adaptive speech speed control unit
9: D/A converter
20: VTR
21: reproduction speed control unit
30: audio reproduction device
31: reproduction speed control unit
40: hard disk
41: buffer
42: reproduction speed control unit

Claims

1. A speech speed conversion device comprising speech speed conversion processing means for performing speech speed conversion processing on an input audio signal input from an audio reproduction device, an audio data storage memory into which the output of the speech speed conversion processing means is written, and means for reading audio data from the audio data storage memory, the device further comprising: calculating means for calculating an accumulation rate of unread audio data in the audio data storage memory; and control means for controlling the reproduction speed of the audio reproduction device according to the accumulation rate of unread audio data in the audio data storage memory.

2. The speech speed conversion device according to claim 1, wherein the speech speed conversion processing means comprises: section determining means for determining whether the input audio signal is a voice section or a silent section; deleting means for deleting the input audio signal determined to be a silent section; and time axis compression/expansion processing means for performing time axis compression/expansion processing on the input audio signal determined to be a voice section, at a compression rate corresponding to the accumulation rate of unread audio data in the audio data storage memory.

3. The speech speed conversion device according to claim 1, wherein the audio reproduction device is a VTR.

4. The speech speed conversion device according to claim 1, wherein the audio reproduction device is a hard disk recorder.

5. A speech speed conversion device comprising an A/D converter for sampling an analog audio signal input from an audio reproduction device at a sampling frequency corresponding to a set reproduction speed magnification, a frame memory to which voice data output from the A/D converter are input, speech speed conversion processing means for performing speech speed conversion processing on the voice data each time a required number of voice data are input to the frame memory, a voice data storage memory into which the output of the speech speed conversion processing means is written, and means for reading voice data from the voice data storage memory, the device being characterized by further comprising: calculating means for calculating an accumulation rate of unread voice data in the voice data storage memory; and control means for controlling the reproduction speed of the audio reproduction device according to the accumulation rate of unread voice data in the voice data storage memory.

6. A speech speed conversion device comprising a frame memory into which a digital audio signal input from an audio reproduction device is written at a speed corresponding to a set reproduction speed magnification, speech speed conversion processing means for performing speech speed conversion processing on the voice data each time a required number of voice data are input to the frame memory, a voice data storage memory into which the output of the speech speed conversion processing means is written, and means for reading voice data from the voice data storage memory, the device being characterized by further comprising: calculating means for calculating an accumulation rate of unread voice data in the voice data storage memory; and control means for controlling the reproduction speed of the audio reproduction device according to the accumulation rate of unread voice data in the voice data storage memory.

7. The speech speed conversion device according to claim 5 or 6, wherein the speech speed conversion processing means comprises: section determination means for determining whether the input voice corresponding to the required number of voice data input to the frame memory is a voiced section or a silent section; deletion processing means for deleting voice data determined to be a silent section; and time axis compression/expansion processing means for performing time axis compression/expansion processing on voice data determined to be a voiced section, at a compression rate corresponding to the accumulation rate of unread voice data in the voice data storage memory.
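The feedback loop recited in the claims above (accumulation rate calculation, reproduction speed control, silent section deletion, and accumulation-dependent compression) can be illustrated with a minimal sketch. It is not taken from the specification: the names `RingMemory`, `playback_speed`, `compression_rate` and `simulate`, the thresholds 0.5 and 0.9, the linear mappings, and the frame and memory sizes are all assumptions chosen only for illustration. The memory is modeled by its unread sample count alone, and no actual audio processing is performed.

```python
from dataclasses import dataclass


@dataclass
class RingMemory:
    """Voice data storage memory, modeled only by its unread sample count."""
    capacity: int      # total number of samples the memory can hold
    unread: int = 0    # samples written but not yet read

    def write(self, n: int) -> None:
        """Store n samples output by the speech speed conversion processing."""
        if self.unread + n > self.capacity:
            raise OverflowError("unread voice data would exceed the memory capacity")
        self.unread += n

    def read(self, n: int) -> int:
        """Read up to n samples for output; returns the number actually read."""
        taken = min(n, self.unread)
        self.unread -= taken
        return taken

    def accumulation_rate(self) -> float:
        """Fraction of the memory occupied by unread voice data (0.0 to 1.0)."""
        return self.unread / self.capacity


def playback_speed(set_speed: float, rate: float,
                   low: float = 0.5, high: float = 0.9) -> float:
    """Hypothetical control rule: keep the set speed while the accumulation
    rate is at or below `low`, fall linearly to 1x as it approaches `high`."""
    if rate <= low:
        return set_speed
    if rate >= high:
        return 1.0
    frac = (rate - low) / (high - low)
    return set_speed + frac * (1.0 - set_speed)


def compression_rate(set_speed: float, rate: float) -> float:
    """Hypothetical output/input length ratio for voiced frames:
    1.0 when the memory is empty, 1/set_speed when it is full."""
    rate = min(max(rate, 0.0), 1.0)
    return 1.0 - rate * (1.0 - 1.0 / set_speed)


def simulate(voiced_flags, set_speed=2.0, frame_len=160, capacity=1600):
    """Run one control step per flag; returns (speed, accumulation rate) pairs."""
    memory = RingMemory(capacity)
    history = []
    for voiced in voiced_flags:
        rate = memory.accumulation_rate()
        speed = playback_speed(set_speed, rate)
        n_in = int(speed * frame_len)   # samples delivered by the playback device
        if voiced:
            # Voiced section: compress on the time axis, then store.
            memory.write(int(n_in * compression_rate(set_speed, rate)))
        # Silent section: the frame is deleted, so nothing is written.
        memory.read(frame_len)          # the reading means drains at a fixed pace
        history.append((speed, memory.accumulation_rate()))
    return history
```

Under these assumed parameters, calling `simulate([True] * 200)` models a long run of voiced frames at a set speed of 2x: the reproduction speed starts at the set speed and is lowered as unread data accumulates, so that the unread data stays within the memory capacity.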