JPH08106299A

JPH08106299A - Voice signal time base conversion device

Info

Publication number: JPH08106299A
Application number: JP6264555A
Authority: JP
Inventors: Kimiharu Watanabe; 公治渡辺; Masayuki Misaki; 正之三崎; Takeshi Norimatsu; 武志則松
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1994-10-03
Filing date: 1994-10-03
Publication date: 1996-04-23

Abstract

PURPOSE: To realize a voice signal time base conversion device capable of changing arbitrarily the speaking speed of a voice signal reproduced at high speed and which is of a low cost. CONSTITUTION: The voice signal reproduced at high speed is inputted to a memory 22. A second clock signal having an N-fold frequency with respect to a first clock signal is applied to readout address counters 24, 25. Voice data of two adjacent sections are read out by address signals of these address counters to be respectively held in latches 27, 28. These signals are applied to a cross fade circuit 29 and two signals are added by simultaneously performing a fade-out and a fade-in. The added signal and the signal held in the latch 27 are changed over based on a reproducing speed setting signal. Then, the interval at the time of a recording is maintained and the voice signal whose speaking speed is arbitrarily conversed is obtained.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、ビデオテープデッキ
（以下、ＶＴＲと略す）やマルチレーザディスクプレー
ヤ等に記録された音声信号を再生する装置において、音
声信号の高速再生、又は低速再生を行う際に用いる音声
信号時間軸変換装置に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention performs high speed reproduction or low speed reproduction of an audio signal in an apparatus for reproducing an audio signal recorded in a video tape deck (hereinafter abbreviated as VTR), a multi-laser disc player or the like. The present invention relates to an audio signal time base conversion device used at that time.

【０００２】[0002]

【従来の技術】近年、ＶＴＲに音声信号時間軸変換装置
を搭載することにより、音声を録音時の２倍の再生速度
で再生しても、録音時の音程と音声の速度（話速）で再
生することが可能となっている。2. Description of the Related Art In recent years, by mounting a voice signal time base converter on a VTR, even if a voice is reproduced at a reproduction speed twice as fast as that at the time of recording, the pitch and the speed of voice (speech speed) at the time of recording can be improved. It is possible to play.

【０００３】ここでＶＴＲに搭載された第１の従来例に
おける音声信号時間軸変換装置（Ｃ＆Ｐともいう）を説
明する。図４は従来の音声信号時間軸変換装置１００の
構成例を示すブロック図である。本図において音声信号
時間軸変換装置１００は、アナログデジタル変換器（Ａ
／Ｄ変換器）１０１、メモリ１０２、デジタルアナログ
変換器（Ｄ／Ａ変換器）１０３、書込アドレスカウンタ
１０４、読出アドレスカウンタ１０５を含んで構成され
る。そして入力端子１は速度変換すべきアナログの音声
信号Ｓ１の入力端子であり、入力端子３はＡ／Ｄ変換器
１０１のクロック信号Ｓ１１の入力端子、入力端子４は
Ｄ／Ａ変換器１０３のクロック信号Ｓ１２の入力端子、
出力端子２は速度変換されたアナログの音声信号Ｓ２の
出力端子である。A voice signal time base conversion device (also referred to as C & P) in a first conventional example mounted on a VTR will be described below. FIG. 4 is a block diagram showing a configuration example of a conventional audio signal time base conversion device 100. In the figure, the audio signal time base converter 100 is an analog-digital converter (A
/ D converter) 101, memory 102, digital-analog converter (D / A converter) 103, write address counter 104, and read address counter 105. The input terminal 1 is an input terminal for the analog audio signal S1 to be subjected to speed conversion, the input terminal 3 is the input terminal for the clock signal S11 of the A / D converter 101, and the input terminal 4 is the clock for the D / A converter 103. Input terminal for signal S12,
The output terminal 2 is an output terminal of the speed-converted analog audio signal S2.

【０００４】このように構成された音声信号時間軸変換
装置１００の動作を図４、図５を用いて説明する。録音
時の２倍の再生速度で再生され音声信号Ｓ１は入力端子
１を介してＡ／Ｄ変換器１０１に入力される。この信号
はクロック信号Ｓ１１の周期でサンプリングされ、デジ
タルの音声信号Ｓ２１に変換される。この音声信号Ｓ２
１はメモリ１０２に入力される。また書込アドレスカウ
ンタ１０４はクロック信号Ｓ１１を入力し、書き込みの
アドレス信号Ｓ２２をメモリ１０２に与える。こうして
変換された音声信号Ｓ２１はメモリ１０２に書き込まれ
る。The operation of the audio signal time base converter 100 configured as described above will be described with reference to FIGS. 4 and 5. The audio signal S1 reproduced at a reproduction speed twice that at the time of recording is input to the A / D converter 101 via the input terminal 1. This signal is sampled at the cycle of the clock signal S11 and converted into a digital audio signal S21. This audio signal S2
1 is input to the memory 102. Further, the write address counter 104 inputs the clock signal S11 and gives the write address signal S22 to the memory 102. The audio signal S21 thus converted is written in the memory 102.

【０００５】メモリ１０２の音声データを読み出すに
は、読出アドレスカウンタ１０５がクロック信号Ｓ１２
を入力し、読出しのアドレス信号Ｓ２３をメモリ１０２
に与える。ここでクロック信号Ｓ１２の周期は、クロッ
ク信号Ｓ１１の周期の２倍である。そのため、メモリ１
０２へ書込まれた音声信号Ｓ２１は、メモリ１０２から
周期が２倍の音声信号（音声データ）Ｓ２４として読出
される。音声データＳ２４はＤ／Ａ変換器１０３に入力
され、アナログの音声信号Ｓ２に変換される。音声信号
Ｓ２の周波数は、音声信号Ｓ１の周波数の半分となる。
例えば、音声信号Ｓ１がＶＴＲ等の特殊再生により２倍
速再生された信号の場合には、その周波数が通常再生の
２倍となるため、音声信号Ｓ２の周波数は通常再生（記
録時）と同一となる。To read the audio data in the memory 102, the read address counter 105 uses the clock signal S12.
And the read address signal S23 is input to the memory 102.
Give to. Here, the cycle of the clock signal S12 is twice the cycle of the clock signal S11. Therefore, memory 1
The audio signal S21 written in 02 is read from the memory 102 as an audio signal (audio data) S24 having a double cycle. The audio data S24 is input to the D / A converter 103 and converted into an analog audio signal S2. The frequency of the audio signal S2 is half the frequency of the audio signal S1.
For example, when the audio signal S1 is a signal reproduced at a double speed by a special reproduction such as a VTR, the frequency thereof is twice that of the normal reproduction, and therefore the frequency of the audio signal S2 is the same as that of the normal reproduction (during recording). Become.

【０００６】図５は音声信号時間軸変換装置１００の動
作を説明する図であり、信号の時間構造を模式的に示し
ている。図５（ａ）は通常再生の場合の信号の時間構造
を示しており、Ａ，Ｂ・・Ｄというように音声信号は連
続している。図５（ｂ）は２倍速再生した場合を示して
おり、音声信号は、Ａはａへ、Ｂはｂへというように、
各々の音声信号の再生時間が半分に圧縮される。FIG. 5 is a diagram for explaining the operation of the audio signal time base converter 100, and schematically shows the time structure of the signal. FIG. 5A shows the time structure of the signal in the normal reproduction, and the audio signal is continuous like A, B, ... D. FIG. 5 (b) shows a case where the reproduction is performed at a double speed, and the audio signals are A to a, B to b, and so on.
The playback time of each audio signal is compressed in half.

【０００７】さて、倍速再生であれば音声の再生周波数
は元の２倍となる。これに対して図５（ｃ）は、図５
（ｂ）の音声信号を音声信号時間軸変換装置１００へ入
力した場合における出力音声信号の時間構造を示すもの
である。ここではａはＡへ、ｄはＤへ変換され、通常再
生の信号と同一周波数となる。しかし、ｂは一部のみ再
生されＢ’となり、ｃは全く再生されないという欠点が
ある。その理由は、メモリ１０２の容量が有限であり、
クロック信号Ｓ１１、Ｓ１２の周期と、メモリ１０２の
容量で決定される記録可能な時間とが限られているから
である。Now, in the case of double speed reproduction, the reproduction frequency of the sound becomes twice the original frequency. On the other hand, FIG.
FIG. 3 shows a time structure of an output audio signal when the audio signal of (b) is input to the audio signal time base converter 100. Here, a is converted to A and d is converted to D, and the frequency becomes the same as that of the signal for normal reproduction. However, there is a drawback that b is only partially reproduced and becomes B ′, and c is not reproduced at all. The reason is that the capacity of the memory 102 is limited,
This is because the period of the clock signals S11 and S12 and the recordable time determined by the capacity of the memory 102 are limited.

【０００８】ここで話速とは、図５（ａ）に示すよう
に、例えば意味のある音声Ａ，Ｂ，Ｃ，Ｄが、図５
（ｃ）に示すようにその内容が聴きとれる程度の音声
Ａ，Ｂ’，Ｄに変換されたとき、（ｃ）の音声信号の再
生時間に対する（ａ）の音声信号の再生時間の比を指す
ものとする。Here, the speech speed means that, for example, meaningful voices A, B, C and D are as shown in FIG.
As shown in (c), when the content is converted into audible sounds A, B ', D, it indicates the ratio of the reproduction time of the audio signal of (a) to the reproduction time of the audio signal of (c). I shall.

【０００９】以上は２倍速再生された音声信号を通常再
生の音声信号、すなわち通常再生時の話速と音程を有す
る信号へ変換する場合を説明した。The case where the audio signal reproduced at double speed is converted to the audio signal for normal reproduction, that is, the signal having the speech speed and the pitch during normal reproduction has been described above.

【００１０】次に第２の従来例として、２倍速再生され
た音声信号を通常再生時の音程を有しながら任意の話速
を持つ音声信号へ変換する音声信号時間軸変換装置につ
いて説明する。図６は第２の従来例の音声信号時間軸変
換装置２００の構成を示すブロック図である。本図に示
すように音声信号時間軸変換装置２００は、第１の従来
例と同一の音声信号時間軸変換装置１００ａと、音程変
換装置１００ｂにより構成される。ここで第１の従来例
と同一機能の回路ブロックには同一番号を付け、それら
の詳細な説明は省略する。Next, as a second conventional example, an audio signal time base conversion device for converting an audio signal reproduced at double speed into an audio signal having an arbitrary speech speed while having a pitch during normal reproduction will be described. FIG. 6 is a block diagram showing the configuration of a second conventional audio signal time base converter 200. As shown in this figure, the audio signal time base conversion device 200 is composed of the same audio signal time base conversion device 100a and pitch conversion device 100b as in the first conventional example. Here, circuit blocks having the same functions as those of the first conventional example are designated by the same reference numerals, and detailed description thereof will be omitted.

【００１１】図６において、音声信号時間軸変換装置１
００ａのＡ／Ｄ変換器に入力される第１のクロック信号
Ｓ１１の入力端子３、音声信号時間軸変換装置１００ａ
のＤ／Ａ変換器に入力される第１のクロック信号Ｓ１２
の入力端子３に加えて、音程変換装置１００ｂ内のＡ／
Ｄ変換器に入力される第３のクロック信号Ｓ１３の入力
端子５、Ｄ／Ａ変換器に入力される第４のクロック信号
Ｓ１４の入力端子６が新たに設けられている。In FIG. 6, a voice signal time base conversion device 1
Input terminal 3 for the first clock signal S11 input to the A / D converter 00a, audio signal time base conversion device 100a
First clock signal S12 input to the D / A converter of
In addition to the input terminal 3 of
An input terminal 5 for the third clock signal S13 input to the D converter and an input terminal 6 for the fourth clock signal S14 input to the D / A converter are newly provided.

【００１２】このように構成された音声信号時間軸変換
装置２００の動作について簡単に説明する。録音時の２
倍の再生速度で再生された音声信号Ｓ１は、入力端子１
を介して音声信号時間軸変換装置１００ａ内のＡ／Ｄ変
換器に入力され、クロック信号Ｓ１１に対してクロック
信号Ｓ１２の周波数を変えることにより、通常再生時と
異なる話速を有する音声信号Ｓ２に変換されることは既
に説明した。録音時の２倍の再生速度で音声信号を再生
する場合において、クロック信号Ｓ１１の周波数をＦＳ
１１、クロック信号Ｓ１２の周波数をＦＳ１２、通常再
生時に対する話速をＳＳ１とすれば、次の（１）式が成
立する。The operation of the audio signal time base converter 200 configured as described above will be briefly described. 2 when recording
The audio signal S1 reproduced at the double reproduction speed is input to the input terminal 1
Is input to the A / D converter in the audio signal time base conversion device 100a, and by changing the frequency of the clock signal S12 with respect to the clock signal S11, an audio signal S2 having a speech speed different from that during normal reproduction is obtained. It has already been explained that it is converted. When the audio signal is reproduced at a reproduction speed twice as high as that at the time of recording, the frequency of the clock signal S11 is FS.
11. If the frequency of the clock signal S12 is FS12 and the speech speed for normal reproduction is SS1, the following equation (1) is established.

【数１】ここで（１）式の定数２は再生速度が通常速度の２倍で
あることを示している。[Equation 1] Here, the constant 2 in the equation (1) indicates that the reproduction speed is twice the normal speed.

【００１３】さらに、通常再生時に対する音程をＫＥ１
とすれば、次の（２）式が成立する。Further, the pitch for normal reproduction is set to KE1.
Then, the following equation (2) is established.

【数２】例えば、第１の従来例の場合はＳＳ１が１となり、話速
は通常再生と同一となる。さらに（２）式のＫＥ１が１
となり、音程も通常再生と同一となる。しかし、（１）
式より通常再生と異なる任意の話速に設定すると、出力
音声信号は（２）式より通常再生と異なる音程に自動的
に設定されてしまうことになる。そこで、通常再生と同
一音程の音声信号Ｓ３を得るためには、音声信号時間軸
変換装置１００ａの出力する信号Ｓ２を音程変換装置１
００ｂに与える。そしてクロック信号Ｓ１３とクロック
信号Ｓ１４とを音程変換装置１００ｂに書き込みクロッ
ク、読み出しクロックとして与えることにより、音程変
換比を所望の値に変換するようにしている。[Equation 2] For example, in the case of the first conventional example, SS1 is 1, and the speech speed is the same as in normal reproduction. Furthermore, KE1 in the equation (2) is 1
And the pitch is the same as in normal reproduction. But (1)
If an arbitrary voice speed different from the normal reproduction is set according to the formula, the output audio signal is automatically set to a pitch different from the normal reproduction according to the formula (2). Therefore, in order to obtain the audio signal S3 having the same pitch as that in the normal reproduction, the signal S2 output from the audio signal time base conversion device 100a is converted into the pitch conversion device 1.
Give to 00b. The clock signal S13 and the clock signal S14 are supplied to the pitch conversion device 100b as a write clock and a read clock to convert the pitch conversion ratio to a desired value.

【００１４】音程変換装置１００ｂによる音程変換比を
ＫＥ２とすれば、次の（３）式が成立する。If the pitch conversion ratio by the pitch converter 100b is KE2, the following equation (3) is established.

【数３】なお、音程変換比ＫＥ２は書き込みのクロック信号Ｓ１
３の周期を読み出しのクロック信号Ｓ１４の周期で割っ
た値である。(Equation 3) The pitch conversion ratio KE2 is the writing clock signal S1.
It is a value obtained by dividing the period of 3 by the period of the read clock signal S14.

【００１５】音程変換装置１００ｂは、例えば特開昭６
１−２９０００号公報に示す装置で実現される。図７は
特開昭６１−２９０００号の音程変換装置１００ｂの構
成を示すブロック図である。本図において図４に示す音
声信号時間軸変換装置１００と同一機能のブロックは同
一の符号を付ける。さて音程変換装置１００ｂは、Ａ／
Ｄ変換器１０１、メモリ１０２、第１のＤ／Ａ変換器１
０３、第２のＤ／Ａ変換器１０６、書込アドレスカウン
タ１０４、読出アドレスカウンタ１０５、第１の電子ボ
リューム１０７、第２の電子ボリューム１０８、加算器
１０９を含んで構成される。そして音程変換装置１００
ｂには、図４の装置と同様に音声信号Ｓ２の入力端子
１、クロック信号Ｓ１１の入力端子３、クロック信号Ｓ
１２の入力端子４、音声信号Ｓ３の出力端子２が設けら
れている。ここでは音程変換装置１００ｂの動作説明は
省略する。The pitch converting device 100b is disclosed in, for example, Japanese Patent Laid-Open No.
It is realized by the device disclosed in Japanese Patent Laid-Open No. 1-29000. FIG. 7 is a block diagram showing the configuration of the pitch converting device 100b of JP-A-61-29000. In this figure, blocks having the same functions as those of the audio signal time base converter 100 shown in FIG. Now, the pitch conversion device 100b is A /
D converter 101, memory 102, first D / A converter 1
03, a second D / A converter 106, a write address counter 104, a read address counter 105, a first electronic volume 107, a second electronic volume 108, and an adder 109. And the pitch conversion device 100
b, as in the device of FIG. 4, the input terminal 1 for the audio signal S2, the input terminal 3 for the clock signal S11, and the clock signal S
Twelve input terminals 4 and an output terminal 2 for the audio signal S3 are provided. Here, the description of the operation of the pitch converting apparatus 100b is omitted.

【００１６】[0016]

【発明が解決しようとする課題】しかしながら上記第１
の従来例の構成では、再生音声信号を任意の話速に設定
できないという問題点があった。一方、上記の第２の従
来例の構成では、任意の話速に設定できるが、音声信号
を記憶するメモリが２個必要であり、価格上昇を伴うと
いう問題点があった。However, the above-mentioned first problem
In the configuration of the conventional example, there is a problem that the reproduced voice signal cannot be set to an arbitrary speech rate. On the other hand, in the configuration of the second conventional example described above, although an arbitrary speech rate can be set, there is a problem in that two memories for storing voice signals are required, which causes a price increase.

【００１７】本発明はこのような従来の問題点に鑑みて
なされたものであって、ＶＴＲ等を高速再生した場合に
再生音声を任意の話速に設定が可能であり、かつ、単一
の音声メモリで構成可能な低価格の音声信号時間軸変換
装置を実現することを目的とする。The present invention has been made in view of the above-mentioned problems of the related art, and when the VTR or the like is reproduced at a high speed, the reproduced voice can be set to an arbitrary speech speed and a single voice can be set. An object of the present invention is to realize a low-priced audio signal time base conversion device that can be configured with an audio memory.

【００１８】[0018]

【課題を解決するための手段】本願の請求項１の発明
は、入力されたアナログ音声信号を第１のクロック信号
でデジタル信号に変換するＡ／Ｄ変換手段と、Ａ／Ｄ変
換手段の出力するデジタル音声信号を入力し、時系列の
デジタル音声信号における先行データを保持する第１の
メモリ領域、及び後続データを保持する第２のメモリ領
域を有するメモリ手段と、第１のクロック信号を入力
し、Ａ／Ｄ変換手段の出力信号をメモリの第１及び第２
のメモリ領域に書込むためのアドレス信号を生成する書
込みアドレス発生手段と、第２のクロック信号を入力
し、メモリ手段の第１のメモリ領域に格納された音声デ
ータを読み出すためのアドレス信号を発生する第１の読
出しアドレス発生手段と、第２のクロック信号を入力
し、メモリ手段の第２のメモリ領域に格納された音声デ
ータを読み出すためのアドレス信号を発生する第２の読
出しアドレス発生手段と、第２のクロック信号を入力
し、第１の読出しアドレス発生手段により読み出された
第１の音声データを保持する第１のデータ保持手段と、
第２のクロック信号を入力し、第２の読出しアドレス発
生手段により読み出された第２の音声データを保持する
第２のデータ保持手段と、第１及び第２のデータ保持手
段に対して、データ保持信号とデータ出力信号とを与え
る保持信号発生手段と、第１のデータ保持手段の出力信
号及び第２のデータ保持手段の出力信号を入力し、互い
の振幅値を制御して加算するクロスフェード手段と、第
１のデータ保持手段の出力信号及びクロスフェード手段
の出力信号を入力し、再生速度設定信号に基づいて切り
換えるセレクタ手段と、セレクタ手段の出力信号を第２
のクロック信号によりアナログ信号に変換して音声信号
を出力するＤ／Ａ変換手段と、を具備し、通常再生速度
のＮ倍で再生されたアナログ音声信号を入力したとき、
第１のクロック信号の周波数を、第２のクロック信号の
周波数のＮ倍にすることにより、通常再生速度のＭ倍の
話速で音声信号を再生することを特徴とするものであ
る。According to the invention of claim 1 of the present application, an A / D conversion means for converting an input analog audio signal into a digital signal by a first clock signal, and an output of the A / D conversion means. Memory signal having a first memory area for holding preceding data in the time-series digital audio signal and a second memory area for holding subsequent data, and a first clock signal. Then, the output signal of the A / D conversion means is set to the first and second memory
Write address generating means for generating an address signal for writing to the memory area and a second clock signal for generating an address signal for reading the audio data stored in the first memory area of the memory means. First read address generating means, and second read address generating means for inputting the second clock signal and generating an address signal for reading the audio data stored in the second memory area of the memory means. First data holding means for inputting the second clock signal and holding the first audio data read by the first read address generating means,
With respect to the second data holding means for inputting the second clock signal and holding the second audio data read by the second read address generating means, and the first and second data holding means, A holding signal generating means for giving a data holding signal and a data output signal, and a cross for inputting the output signal of the first data holding means and the output signal of the second data holding means, controlling their amplitude values and adding them. The fader means, the selector means for inputting the output signal of the first data holding means and the output signal of the crossfade means, and switching based on the reproduction speed setting signal, and the output signal of the selector means for the second
A D / A conversion means for converting the clock signal into an analog signal and outputting a voice signal, and when an analog voice signal reproduced at N times the normal reproduction speed is input,
By making the frequency of the first clock signal N times the frequency of the second clock signal, the audio signal is reproduced at a speech speed M times the normal reproduction speed.

【００１９】本願の請求項２の発明は、デジタル音声信
号を入力し、時系列のデジタル音声信号における先行デ
ータを保持する第１のメモリ領域、及び後続データを保
持する第２のメモリ領域を有するメモリ手段と、第１の
クロック信号を入力し、メモリの第１及び第２のメモリ
領域に書込むためのアドレス信号を生成する書込みアド
レス発生手段と、第２のクロック信号を入力し、メモリ
手段の第１のメモリ領域に格納された音声データを読み
出すためのアドレス信号を発生する第１の読出しアドレ
ス発生手段と、第２のクロック信号を入力し、メモリ手
段の第２のメモリ領域に格納された音声データを読み出
すためのアドレス信号を発生する第２の読出しアドレス
発生手段と、第２のクロック信号を入力し、第１の読出
しアドレス発生手段により読み出された第１の音声デー
タを保持する第１のデータ保持手段と、第２のクロック
信号を入力し、第２の読出しアドレス発生手段により読
み出された第２の音声データを保持する第２のデータ保
持手段と、第１及び第２のデータ保持手段に対して、デ
ータ保持信号とデータ出力信号とを与える保持信号発生
手段と、第１のデータ保持手段の出力信号及び第２のデ
ータ保持手段の出力信号を入力し、互いの振幅値を制御
して加算するクロスフェード手段と、第１のデータ保持
手段の出力信号及びクロスフェード手段の出力信号を入
力し、再生速度設定信号に基づいて切り換えるセレクタ
手段と、を具備し、通常再生速度のＮ倍で再生されたデ
ジタル音声信号を入力したとき、第１のクロック信号の
周波数を、第２のクロック信号の周波数のＮ倍にするこ
とにより、通常再生速度のＭ倍の話速で音声信号を再生
することを特徴とするものである。The invention of claim 2 of the present application has a first memory area for inputting a digital audio signal and holding preceding data in a time-series digital audio signal, and a second memory area for holding subsequent data. Memory means, write address generating means for inputting the first clock signal and generating address signals for writing in the first and second memory areas of the memory, and second clock signal for inputting the memory means First read address generating means for generating an address signal for reading the audio data stored in the first memory area, and a second clock signal are inputted and stored in the second memory area of the memory means. Second read address generating means for generating an address signal for reading the audio data, and a second read address generating means for inputting the second clock signal. A first data holding means for holding the first voice data read by the second clock signal and a second clock signal, and holds the second voice data read by the second read address generating means. Second data holding means, holding signal generating means for giving a data holding signal and a data output signal to the first and second data holding means, an output signal of the first data holding means and a second data holding means. A crossfade means for inputting the output signal of the data holding means and controlling and adding the amplitude values of the data holding means, and an output signal of the first data holding means and an output signal of the crossfade means are inputted and used as a reproduction speed setting signal. Selector circuit for switching based on the frequency of the first clock signal when the digital audio signal reproduced at N times the normal reproduction speed is input. By N times the number, is characterized in that for reproducing audio signal by M times the speech speed of the normal playback speed.

【００２０】本願の請求項３の発明では、クロスフェー
ド手段は、第１のデータ保持手段の出力信号の振幅レベ
ルを最大から最小まで次第に小さくなるようにに制御し
た信号と、第２のデータ保持手段の出力信号の振幅レベ
ルを最小から最大まで次第に大きくなるように制御した
信号とを加算することを特徴とするものである。In the invention of claim 3 of the present application, the crossfade means controls the amplitude level of the output signal of the first data holding means so as to gradually decrease from the maximum to the minimum, and the second data holding means. It is characterized in that the amplitude level of the output signal of the means is added to the signal controlled so as to gradually increase from the minimum to the maximum.

【００２１】本願の請求項４の発明では、セレクタ手段
は、クロスフェード手段から出力される信号時間長をＡ
とし、第１のデータ保持手段から出力される信号時間長
をＢとすると、通常再生速度に対してＭ倍の再生速度が
設定されたとき、Ｍ＝（２Ａ＋Ｂ）／（Ａ＋Ｂ）を満た
すよう入力信号を切り換えることを特徴とするものであ
る。In the invention of claim 4 of the present application, the selector means sets the signal time length output from the crossfade means to A
And the signal time length output from the first data holding means is B, when the reproduction speed M times the normal reproduction speed is set, input so as to satisfy M = (2A + B) / (A + B) It is characterized by switching signals.

【００２２】本願の請求項５の発明では、第１の読出し
アドレス発生手段と第２の読出しアドレス発生手段の出
力信号差である読出しアドレス差をＤとし、読出しクロ
ック信号の周期をＴとする場合、ＤとＴの積を１２５ミ
リ秒以内に設定し、第１のデータ保持手段と第２のデー
タ保持手段の出力信号間の時間差が１２５ミリ秒以内で
あることを特徴とするものである。In the invention of claim 5 of the present application, the read address difference which is the output signal difference between the first read address generating means and the second read address generating means is D, and the cycle of the read clock signal is T. , D and T are set within 125 milliseconds, and the time difference between the output signals of the first data holding means and the second data holding means is within 125 milliseconds.

【００２３】[0023]

【作用】このような特徴を有する本願の請求項１の発明
によれば、Ｎ倍速再生されたアナログ音声信号を第１の
クロック信号でデジタル変換し、そのデジタル音声信号
を第１のクロック信号に同期してメモリに書込む。つぎ
にメモリから第１のクロック信号のＮ倍の周期である第
２のクロック信号に同期して、メモリの２つのメモリ領
域から２つの音声データを読出し、第１，第２のデータ
保持手段に保持する。これらの音声データをクロスフェ
ード手段に入力し、互いの音声データの振幅値を制御し
て加算する。第１のデータ保持手段の音声データとクロ
スフェード手段の音声データとをセレクタ手段に入力
し、再生速度設定信号に基づいて切り換える。この音声
データを第２のクロック信号でアナログ変換すると、任
意の話速で且つ通常再生の音程を有する音声信号が再生
される。According to the invention of claim 1 of the present application having such characteristics, the analog audio signal reproduced at N times speed is digitally converted by the first clock signal, and the digital audio signal is converted into the first clock signal. Write to memory synchronously. Next, two audio data are read from the two memory areas of the memory in synchronism with the second clock signal having a cycle N times as long as the first clock signal from the memory, and the two audio data are stored in the first and second data holding means. Hold. These audio data are input to the crossfade means, and the amplitude values of the audio data of each are controlled and added. The audio data of the first data holding means and the audio data of the crossfade means are input to the selector means and switched based on the reproduction speed setting signal. When this voice data is converted into an analog signal with the second clock signal, a voice signal having an arbitrary talk speed and a normal reproduction pitch is reproduced.

【００２４】また本願の請求項２の発明では、Ｎ倍速再
生されたデジタル音声信号を第１のクロック信号に同期
してメモリに書込む。つぎにメモリから第１のクロック
信号のＮ倍の周期である第２のクロック信号に同期し
て、メモリの２つのメモリ領域から２つの音声データを
読出し、第１，第２のデータ保持手段に保持する。これ
らの音声データをクロスフェード手段に入力し、互いの
音声データの振幅値を制御して加算する。第１のデータ
保持手段の音声データとクロスフェード手段の音声デー
タとをセレクタ手段に入力し、再生速度設定信号に基づ
いて切り換える。こうして任意の話速で且つ通常再生の
音程を有するデジタル音声信号が再生される。Further, in the invention of claim 2 of the present application, the N-speed reproduced digital audio signal is written in the memory in synchronization with the first clock signal. Next, two audio data are read from the two memory areas of the memory in synchronism with the second clock signal having a cycle N times as long as the first clock signal from the memory, and the two audio data are stored in the first and second data holding means. Hold. These audio data are input to the crossfade means, and the amplitude values of the audio data of each are controlled and added. The audio data of the first data holding means and the audio data of the crossfade means are input to the selector means and switched based on the reproduction speed setting signal. In this way, a digital audio signal having an arbitrary speech speed and a normal reproduction pitch is reproduced.

【００２５】また本願の請求項３の発明では、クロスフ
ェード手段は第１のデータ保持手段の音声データの振幅
レベルを最大から最小まで次第に小さくなるように制御
する（フェードアウト）。また第２のデータ保持手段の
音声データの振幅レベルを最小から最大まで次第に大き
くなるように制御（フェードイン）する。つぎにフェー
ドアウト信号とフェードイン信号を加算した音声データ
を出力し、セレクタ手段に与える。こうすると繰り返し
信号成分を多く有する音声信号は、クロスフェード手段
により時間圧縮されても、音声内容は損なわれずに処理
される。Further, in the invention of claim 3 of the present application, the crossfade means controls the amplitude level of the audio data of the first data holding means so as to gradually decrease from the maximum to the minimum (fade out). Further, the amplitude level of the audio data of the second data holding means is controlled (fade in) so as to gradually increase from the minimum to the maximum. Next, audio data obtained by adding the fade-out signal and the fade-in signal is output and given to the selector means. In this way, an audio signal having a large number of repetitive signal components can be processed without losing the audio content even if it is time-compressed by the crossfade means.

【００２６】[0026]

【実施例】本発明の一実施例における音声信号時間軸変
換装置について図１を参照しつつ説明する。図１は本実
施例の音声信号時間軸変換装置の構成を示すブロック図
である。本図においてＶＴＲ等から特殊再生された音声
信号Ｓ１は入力端子１１を介してＡ／Ｄ変換器２０に与
えられる。また第１のクロック信号Ｓ１１は入力端子１
３を介してＡ／Ｄ変換器２０と書込アドレスカウンタ２
１に与えられる。更に第２のクロック信号Ｓ１２は入力
端子１４を介してＤ／Ａ変換器２３、第１の読出アドレ
スカウンタ２４、第２の読出アドレスカウンタ２５、ラ
ッチ信号発生回路２６に与えられる。DESCRIPTION OF THE PREFERRED EMBODIMENTS An audio signal time base converter according to an embodiment of the present invention will be described with reference to FIG. FIG. 1 is a block diagram showing the configuration of the audio signal time base converter according to this embodiment. In the figure, the audio signal S1 specially reproduced from the VTR or the like is given to the A / D converter 20 via the input terminal 11. The first clock signal S11 is input terminal 1
A / D converter 20 and write address counter 2 via
Given to 1. Further, the second clock signal S12 is given to the D / A converter 23, the first read address counter 24, the second read address counter 25, and the latch signal generation circuit 26 via the input terminal 14.

【００２７】メモリ２２はＡ／Ｄ変換器２０でデジタル
変換された音声信号Ｓ２１を書込アドレスカウンタ２１
の出力する書込みアドレス信号Ｓ２２によって書き込む
音声メモリである。メモリ２２は時系列のデジタル音声
信号における先行データを保持する第１のメモリ領域、
及び後続データを保持する第２のメモリ領域を有してい
る。このメモリ２２に保持された音声データＳ２４は第
１の読出アドレスカウンタ２４の出力する第１のアドレ
ス信号Ｓ３１と、第２の読出アドレスカウンタ２５の出
力する第２のアドレス信号Ｓ３２とによって同時又は入
力（格納）順序で読み出される。読出アドレスカウンタ
２４はメモリ２２の第１のメモリ領域（第１のアドレ
ス）に格納された音声データを読み出し、読出アドレス
カウンタ２５はメモリ２２の第２のメモリ領域（第２の
アドレス）に格納された音声データを読み出すもので、
共にクロック信号Ｓ１２によって駆動される。The memory 22 writes the audio signal S21 digitally converted by the A / D converter 20 into a write address counter 21.
Is a voice memory to be written by the write address signal S22 output by the. The memory 22 is a first memory area for holding preceding data in a time-series digital audio signal,
And a second memory area for holding subsequent data. The voice data S24 held in the memory 22 is simultaneously or inputted by the first address signal S31 output by the first read address counter 24 and the second address signal S32 output by the second read address counter 25. It is read in the (storing) order. The read address counter 24 reads the audio data stored in the first memory area (first address) of the memory 22, and the read address counter 25 is stored in the second memory area (second address) of the memory 22. To read the audio data
Both are driven by the clock signal S12.

【００２８】メモリ２２の第１のメモリ領域に格納され
た音声データＳ２４が読み出されると、第１のデータ保
持手段である第１のラッチ２７に出力され、第２のアド
レスに格納された音声データＳ２４が読み出されると、
第２のデータ保持手段てある第２のラッチ２８に出力さ
れる。ラッチ２７，２８は共に入力データを一時保持す
る回路である。保持信号発生手段であるラッチ信号発生
回路２６はラッチ２７に対して第１の制御信号Ｓ３７を
与え、ラッチ２８に対して第２の制御信号Ｓ３８を与え
る。これらの制御信号とは各ラッチに入力信号を保持さ
せるためのデータ保持信号とラッチに保持された音声デ
ータを出力させるデータ出力信号を意味する。When the audio data S24 stored in the first memory area of the memory 22 is read, it is output to the first latch 27 which is the first data holding means, and the audio data stored in the second address. When S24 is read,
It is output to the second latch 28 which is the second data holding means. The latches 27 and 28 are circuits that temporarily hold input data. The latch signal generating circuit 26, which is a holding signal generating means, gives the latch 27 a first control signal S37 and the latch 28 a second control signal S38. These control signals mean a data holding signal for holding the input signal in each latch and a data output signal for outputting the audio data held in the latch.

【００２９】クロスフェード回路２９はラッチ２７に保
持された音声データＳ３５と、ラッチ２８に保持された
音声データＳ３３を入力し、それらの信号に対してクロ
スフェードを行う回路である。クロスフェードとは、一
方の入力信号に対してフェードアウトを行い、他方の入
力信号に対してフェードインを行うことである。セレク
タ回路３０はクロスフェード回路２９から出力される音
声データＳ３４と、ラッチ２７から出力される音声デー
タＳ３５のいずれか一方を入力端子１５の制御信号Ｓ１
５に基づき選択する回路で、選択された音声データＳ３
６はＤ／Ａ変換器２３に入力され、アナログの音声信号
Ｓ２に変換される。出力端子１２から出力された音声信
号は図示しない音声出力回路に与えられる。The crossfade circuit 29 is a circuit which inputs the audio data S35 held in the latch 27 and the audio data S33 held in the latch 28 and crossfades the signals. The crossfade means performing fade-out on one input signal and performing fade-in on the other input signal. The selector circuit 30 outputs either the audio data S34 output from the crossfade circuit 29 or the audio data S35 output from the latch 27 to the control signal S1 of the input terminal 15.
In the circuit that selects based on 5, the selected audio data S3
6 is input to the D / A converter 23 and converted into an analog audio signal S2. The audio signal output from the output terminal 12 is applied to an audio output circuit (not shown).

【００３０】このように構成された本実施例の音声信号
時間軸変換装置の動作について、図１〜図３を用いて説
明する。なお、本実施例の音声信号時間軸変換装置は、
通常再生速度を含めて任意の再生速度に対応可能である
が、ここでは２倍速再生の場合を例にとって説明する。
図２は記録媒体が２倍速再生されるとき、音声信号を
１．５倍の話速で再生する場合の動作説明図であり、図
３は記録媒体が２倍速再生されるとき、音声信号を１．
３３倍及び１．１７倍の話速で夫々再生する場合の動作
説明図である。The operation of the audio signal time base converter according to the present embodiment thus constructed will be described with reference to FIGS. The audio signal time base conversion device of the present embodiment is
Although it is possible to support any reproduction speed including the normal reproduction speed, the case of double speed reproduction will be described here as an example.
FIG. 2 is an operation explanatory diagram in the case of reproducing an audio signal at a speech speed of 1.5 times when the recording medium is reproduced at double speed, and FIG. 3 shows an audio signal when the recording medium is reproduced at double speed. 1.
It is operation | movement explanatory drawing at the time of reproducing at the voice speed of 33 times and 1.17 times, respectively.

【００３１】入力端子１１に入力される２倍速再生の音
声信号Ｓ１は、Ａ／Ｄ変換器２０によってクロック信号
Ｓ１１に同期したデジタルの音声信号Ｓ２１に変換され
る。音声信号Ｓ２１はメモリ２２に入力され、書込アド
レスカウンタ２１のアドレス信号Ｓ２２に従って順次メ
モリ２２に書込まれる。メモリ２２に書込まれた音声信
号Ｓ２１は、第１，第２の読出アドレスカウンタ２４，
２５のアドレス信号Ｓ３１，Ｓ３２に従って、メモリ２
２から読出される。The double speed reproduction audio signal S1 input to the input terminal 11 is converted by the A / D converter 20 into a digital audio signal S21 synchronized with the clock signal S11. The voice signal S21 is input to the memory 22 and sequentially written in the memory 22 in accordance with the address signal S22 of the write address counter 21. The audio signal S21 written in the memory 22 is the first and second read address counters 24,
In accordance with the 25 address signals S31 and S32, the memory 2
2 is read.

【００３２】２倍速再生された音声信号Ｓ１の音程は、
通常再生された信号の音程の２倍となっている。出力端
子１２に出力するべき音声信号Ｓ２の音程を通常再生と
同一にするため、クロック信号Ｓ１２の周期をクロック
信号Ｓ１１の周期の２倍に設定する。一般的には入力さ
れた音声信号Ｓ１が通常再生のＮ倍速再生信号の場合、
クロック信号Ｓ１２の周期をクロック信号Ｓ１１の周期
のＮ倍に設定すれば良い。The pitch of the audio signal S1 reproduced at double speed is
It is twice the pitch of the normally reproduced signal. In order to make the pitch of the audio signal S2 to be output to the output terminal 12 the same as in normal reproduction, the cycle of the clock signal S12 is set to be twice the cycle of the clock signal S11. Generally, when the input audio signal S1 is an N times speed reproduction signal of normal reproduction,
The cycle of the clock signal S12 may be set to N times the cycle of the clock signal S11.

【００３３】メモリ２２から読出アドレスカウンタ２
４，２５によって夫々異なる読出しアドレス値で時分割
に読出される。読出アドレスカウンタ２４のアドレス信
号Ｓ３１で読み出された音声データＳ２４の一方は第１
のラッチ２７に入力され、ラッチ信号発生回路２６の制
御信号Ｓ３７でラッチ（保持）される。ここで保持され
た音声データは次の制御信号Ｓ３７により音声データＳ
３５として出力される。この音声データＳ３５は、読出
アドレスカウンタ２４のアドレス信号Ｓ３１により読出
された信号と同一である。Read address counter 2 from memory 22
4 and 25 are time-divisionally read with different read address values. One of the audio data S24 read by the address signal S31 of the read address counter 24 is the first
Is input to the latch 27 and latched (held) by the control signal S37 of the latch signal generation circuit 26. The audio data held here is the audio data S by the next control signal S37.
Is output as 35. The voice data S35 is the same as the signal read by the address signal S31 of the read address counter 24.

【００３４】読出アドレスカウンタ２５のアドレス信号
Ｓ３２により読出された音声データＳ２４の他方は、第
２のラッチ２８へ入力され、ラッチ信号発生回路２６の
制御信号Ｓ３８でラッチされる。ここで保持された音声
データは、次の制御信号Ｓ３８によって音声データＳ３
３として出力される。この音声データＳ３３は、読出ア
ドレスカウンタ２５のアドレス信号Ｓ３２ので読み出さ
れた音声データと同一である。The other of the audio data S24 read by the address signal S32 of the read address counter 25 is input to the second latch 28 and latched by the control signal S38 of the latch signal generating circuit 26. The audio data held here is audio data S3 by the next control signal S38.
It is output as 3. The voice data S33 is the same as the voice data read by the address signal S32 of the read address counter 25.

【００３５】ラッチ２７に保持された音声データＳ３５
は、セレクタ回路３０の第１の入力端に与えられ、クロ
スフェード回路２９の第１の入力端にも与えられる。ま
たラッチ２８に保持された音声データＳ３３はクロスフ
ェード回路２９の第２の入力端に与えられる。クロスフ
ェード回路２９は２つの音声データＳ３５とＳ３３との
クロスフェードを行う。次にセレクタ回路３０は入力端
子１５に入力される話速設定の制御信号Ｓ１５に基づ
き、音声データＳ３５又はクロスフェードされた音声デ
ータＳ３４の何れかを選択する。ここで図２（ａ）〜
（ｄ）を用いてクロスフェード回路２９とセレクタ回路
３０の機能を詳しく説明する。Audio data S35 held in the latch 27
Is applied to the first input terminal of the selector circuit 30 and is also applied to the first input terminal of the crossfade circuit 29. The audio data S33 held in the latch 28 is given to the second input terminal of the crossfade circuit 29. The crossfade circuit 29 crossfades the two audio data S35 and S33. Next, the selector circuit 30 selects either the voice data S35 or the cross-faded voice data S34 based on the control signal S15 for setting the voice speed input to the input terminal 15. Here, FIG.
The functions of the crossfade circuit 29 and the selector circuit 30 will be described in detail with reference to FIG.

【００３６】図２において、通常再生時の音声信号に対
する時間単位をｔとすると、（ａ）は２倍速再生された
音声信号Ｓ１を、ｓ，ｔ，ｕ，ｖ，ｗ・・のように時間
単位ｔ／２毎に区切って模式的に示した図である。前述
のように通常再生と比較して音声信号Ｓ１の周波数は２
倍、周期は１／２となっている。図２（ｂ）は図４に示
す従来例のように、２倍速再生された音声信号Ｓ１をク
ロック信号Ｓ１１でメモリ２２で書込み、クロック信号
Ｓ１１の２倍の周期を有するクロック信号Ｓ１２で読出
した場合の音声データである。出力された信号はＳ，
Ｔ，Ｕ，Ｖ，Ｗ・・というように出力される音声信号の
周波数と周期とが通常再生の場合と同一となり、時間単
位は２倍速再生時の２倍、即ちｔとなる。また、話速は
通常再生と同一であり、このままでは話速を変化させる
ことはできない。In FIG. 2, assuming that the time unit for the audio signal at the time of normal reproduction is t, (a) shows the audio signal S1 reproduced at double speed as time as s, t, u, v, w ... It is the figure which divided | segmented for every unit t / 2 and was shown typically. As described above, the frequency of the audio signal S1 is 2 as compared with the normal reproduction.
Times the cycle, and the cycle is 1/2. 2B, as in the conventional example shown in FIG. 4, the audio signal S1 reproduced at double speed is written in the memory 22 by the clock signal S11 and read by the clock signal S12 having a cycle twice that of the clock signal S11. This is audio data in the case. The output signal is S,
The frequency and cycle of the output audio signal such as T, U, V, W ... Are the same as in the case of normal reproduction, and the time unit is double that in double speed reproduction, that is, t. Further, the speech speed is the same as that in the normal reproduction, and the speech speed cannot be changed as it is.

【００３７】さて話速を例えば、１．５倍に設定する場
合を考える。ここでは時間単位ｔを変化させずに、図２
（ｂ）に示す如く、Ｓ，Ｔ，Ｕを再生するに必要な時間
単位３ｔを、図２（ｃ）に示す如く時間単位２ｔに設定
すれば良い。そのために時間ｔだけの時間圧縮が必要と
なる。本実施例では音声信号Ｓとこれに続く音声信号Ｔ
の２信号をクロスフェードし、最初の時間単位ｔに２信
号を存在させることで実現する。図２（ｃ）は図１に示
した装置で話速を１．５倍に設定した場合に変換される
音声信号Ｓ２を示し、Ｓ／Ｔ，Ｕ，Ｖ／Ｗ，Ｘと時間単
位ｔで区切って出力される。図２（ｃ）に示すＳ／Ｔ，
Ｖ／Ｗとは、２信号をクロスフェードすることを意味し
ており、例えば図２（ｃ），（ｄ）に示す時刻ｔ１〜ｔ
２においてＳ／Ｔは、音声信号Ｓをフェードアウトし、
音声信号Ｔをフェードインして加算することを意味して
いる。またＵ、Ｘは、音声信号Ｕ及び音声信号Ｘをこの
順序でそのまま出力することを意味している。Now, consider the case where the speech speed is set to 1.5 times, for example. Here, without changing the time unit t,
As shown in (b), the time unit 3t required for reproducing S, T, and U may be set to the time unit 2t as shown in FIG. 2 (c). Therefore, time compression of time t is required. In this embodiment, the audio signal S and the audio signal T following the audio signal S
It is realized by cross-fading the two signals and the two signals are present in the first time unit t. FIG. 2 (c) shows a voice signal S2 converted when the speech speed is set to 1.5 times in the device shown in FIG. 1, in S / T, U, V / W, X and time unit t. Output is separated. The S / T shown in FIG.
V / W means cross-fading two signals, and for example, times t1 to t shown in FIGS. 2 (c) and 2 (d).
2, the S / T fades out the audio signal S,
This means that the audio signal T is faded in and added. U and X mean that the audio signals U and X are output as they are in this order.

【００３８】図２（ｄ）は制御信号Ｓ１５によるセレク
タ回路３０の動作を示している。時刻ｔ１〜ｔ２では、
セレクタ回路３０はクロスフェードした信号Ｓ／Ｔを選
択するため、音声データＳ３４を選択し、時刻ｔ２〜ｔ
３ではそのままの音声データを出力するため、ラッチ２
７の出力する音声データＳ３５を選択する。このように
図２で説明した内容は、話速のみを任意に設定可能と
し、音程を通常再生と同一にするという、本実施例の新
しい考え方である。FIG. 2D shows the operation of the selector circuit 30 according to the control signal S15. At times t1 to t2,
Since the selector circuit 30 selects the cross-faded signal S / T, it selects the audio data S34 at the times t2 to t.
3 outputs the audio data as it is, so latch 2
The audio data S35 to be output by 7 is selected. As described above, the content described with reference to FIG. 2 is a new concept of the present embodiment in which only the speech speed can be arbitrarily set and the pitch is the same as that in the normal reproduction.

【００３９】図２に示した時間単位ｔとして設定可能な
範囲における最小値は、メモリ２２により処理されて出
力される音声データＳ３５の最低周波数により決定され
る。またその最大値はクロスフェード時の２信号の時間
遅延による生じる違和感等の聴感実験により決定され
る。なお、ここでは時間単位ｔを３２ミリ秒付近に設定
した。ここで時間単位ｔは、アドレス信号Ｓ３１とＳ３
２とのアドレス差をＤとし、クロック信号Ｓ１２の周期
をＴとすると、Ｄ×Ｔに等しい。The minimum value in the range that can be set as the time unit t shown in FIG. 2 is determined by the minimum frequency of the audio data S35 processed and output by the memory 22. Further, the maximum value is determined by an audible experiment such as a feeling of strangeness caused by a time delay of two signals at the time of crossfading. Here, the time unit t is set to around 32 milliseconds. Here, the time unit t is the address signals S31 and S3.
If the address difference from 2 is D and the cycle of the clock signal S12 is T, then it is equal to D × T.

【００４０】図３は話速１．３３倍と１．１７倍におけ
る音声データＳ３６の模式図である。図３（ａ）は図２
（ｂ）に示した信号と同一である。図３（ｂ）は、話速
を１．３３倍に設定したときの出力端子１２から出力さ
れる音声信号Ｓ２を示しており、Ｓ／Ｔ，Ｕ，Ｖ，Ｗ／
Ｘ，Ｙ、Ｚと時間単位ｔで区切って出力される。この場
合、図３（ａ）の音声信号列（Ｓ，Ｔ，Ｕ，Ｖ）の再生
に必要な時間単位は４ｔであるのに対して、図３（ｂ）
の音声信号列（Ｓ／Ｔ，Ｕ，Ｖ）の再生に必要な時間単
位は３ｔとなっている。従って話速は４／３＝１．３３
倍となる。FIG. 3 is a schematic diagram of the voice data S36 at the voice speeds of 1.33 times and 1.17 times. FIG. 3 (a) is shown in FIG.
It is the same as the signal shown in (b). FIG. 3B shows the audio signal S2 output from the output terminal 12 when the speech speed is set to 1.33 times, and S / T, U, V, W /
It is output by being separated by X, Y, Z and time unit t. In this case, the time unit required to reproduce the audio signal sequence (S, T, U, V) in FIG. 3A is 4t, while that in FIG.
The time unit required for reproducing the audio signal sequence (S / T, U, V) is 3t. Therefore, the speech rate is 4/3 = 1.33.
Double.

【００４１】図３（ｃ）は話速が１．１７倍の場合を示
している。この場合、図３（ａ）の音声信号列（Ｓ，
Ｔ，Ｕ，Ｖ，Ｗ，Ｘ，Ｙ）の再生に必要な時間単位は７
ｔであるのに対して、図３（ｃ）の音声信号列（Ｓ／
Ｔ，Ｕ，Ｖ，Ｗ，Ｘ，Ｙ）の再生に必要な時間単位は６
ｔとなり、話速は７／６＝１．１７倍となる。FIG. 3 (c) shows the case where the speech speed is 1.17 times. In this case, the audio signal string (S,
(T, U, V, W, X, Y) time unit required for playback is 7
However, the audio signal sequence (S /
T, U, V, W, X, Y) time unit required for playback is 6
Thus, the speech speed is 7/6 = 1.17 times.

【００４２】このように、通常再生速度と異なるＭ倍の
任意の話速は、クロスフェード回路２９の音声データＳ
３４の時間長をＡ、ラッチ２７から出力される音声デー
タＳ３５の時間長をＢとするとき、Ｍ＝（２Ａ＋Ｂ）／
（Ａ＋Ｂ）を満たすことにより実現可能となる。As described above, the M-times arbitrary speech speed, which is different from the normal reproduction speed, is applied to the voice data S of the crossfade circuit 29.
When the time length of 34 is A and the time length of the audio data S35 output from the latch 27 is B, M = (2A + B) /
It can be realized by satisfying (A + B).

【００４３】第１，第２の読出アドレスカウンタ２４，
２５の出力するアドレス差をＤとすると、音声データＳ
３１と音声データＳ３３との間には時間遅延が発生す
る。その時間遅延値をＤｔとすると、アドレス差Ｄと入
力端子１４に入力された読出しクロックＳ１２の周期Ｔ
には、次の（４）式が成立する。The first and second read address counters 24,
If the address difference output by 25 is D, the audio data S
A time delay occurs between 31 and the audio data S33. When the time delay value is Dt, the address difference D and the cycle T of the read clock S12 input to the input terminal 14
, The following expression (4) is established.

【数４】この時間遅延値Ｄｔが大きい場合、聴感的にはエコーを
知覚することになり、明瞭性の低下や違和感の増加が生
じる。時間遅延値Ｄｔにおける検知限界は、例えばナカ
ニシヤ出版1984年11月発行の難波精一郎編”聴覚ハンド
ブック”２９０ページに「明確に分離した２音の感じを
得るためには50msec〜100msec 以上に間隔を拡げる必要
があること」が報告されている。またＴｈｏｍａｓ他の
報告「Temporal order in the perception of vowelsと
Journal Acoustic Society of America 48,1010-1013(1
970）」による音声素材の実験では、１２５ミリ秒であ
ることが報告されている。いずれにしても本実施例で
は、これらの報告内容に基き、時間遅延値Ｄｔを１２５
ミリ秒以内に設定する。[Equation 4] When this time delay value Dt is large, an echo is perceptually perceived, resulting in a decrease in clarity and an increase in discomfort. For the detection limit of the time delay value Dt, for example, in 290 Seikiro Namba, "Hearing Handbook," published by Nakanishiya Publishing in November 1984, "Extend the interval to 50 msec to 100 msec or more in order to obtain a feeling of two clearly separated sounds." Need to be done ". In addition, a report by Thomas et al. “Temporal order in the perception of vowels
Journal Acoustic Society of America 48,1010-1013 (1
970) ”, an audio material experiment reported that it was 125 milliseconds. In any case, in this embodiment, the time delay value Dt is set to 125 based on the contents of these reports.
Set within milliseconds.

【００４４】以上は入力される音声信号Ｓ１として通常
の２倍速再生の場合について説明したが、音声信号Ｓ１
として通常再生の場合も同様にして話速を変化させるこ
とができる。その場合は入力端子１３へ入力するクロッ
ク信号Ｓ１１と入力端子１４へ入力するクロック信号Ｓ
１２の周波数を同一に設定すれば良い。The case of the normal double speed reproduction as the input audio signal S1 has been described above.
In the case of normal reproduction, the speech speed can be changed in the same manner. In that case, the clock signal S11 input to the input terminal 13 and the clock signal S input to the input terminal 14
The 12 frequencies may be set to be the same.

【００４５】また、話速を外部から制御するばかりでな
く、入力される信号の内容によって自動的に話速を徐々
に変化させることが考えられる。例えば、信号に含まれ
る無音区間のみを圧縮することにより、話速を入力信号
より遅くすることが可能であり、このような音声信号時
間軸変換装置を低価格で実現することができる。さらに
この音声信号時間軸変換装置を、既成のＬＳＩであるメ
モリ２２と、それ以外の回路をＬＳＩ化したカスタムＬ
ＳＩで構成することもできる。In addition to controlling the speech speed from the outside, it is possible to gradually change the speech speed automatically according to the contents of the input signal. For example, by compressing only the silent section included in the signal, the speech speed can be made slower than that of the input signal, and such an audio signal time base conversion device can be realized at low cost. Furthermore, this audio signal time base conversion device is a custom L in which the memory 22 which is an existing LSI and the other circuits are integrated into an LSI.
It can also be configured by SI.

【００４６】本実施例の音声信号時間軸変換装置では、
アナログの音声信号を入力し、話速と音程を変換したア
ナログの音声信号を出力するとした。しかしデジタルの
ＡＶ機器に利用する場合は、デジタルの音声信号をメモ
リ２２に入力し、セレクタ回路３０のデジタル音声信号
を直接使用するものとする。In the audio signal time base converter of this embodiment,
It is assumed that an analog voice signal is input and an analog voice signal in which the voice speed and pitch are converted is output. However, when used in a digital AV device, a digital audio signal is input to the memory 22 and the digital audio signal of the selector circuit 30 is directly used.

【００４７】[0047]

【発明の効果】以上のように本発明の音声信号時間軸変
換装置をＶＴＲや他のＡＶ機器に応用すれば、例えば特
殊再生において２倍速で再生する場合、画像は２倍速で
再生されるが、音声信号はＡＶ機器の使用者が設定した
１倍から２倍までの任意の話速で、かつ通常の音程で再
生されることとなる。このため再生時間を半分にするこ
とで、時間を有効に活用することが可能となり、且つ音
声の内容を２倍速よりも遅い好みの話速で再生すること
ができる。従ってＶＴＲ等の音声情報記録再生装置の特
殊再生において、音声内容を容易に理解することができ
るという優れた硬化が得られる。As described above, if the audio signal time base conversion device of the present invention is applied to a VTR or other AV equipment, for example, in the case of double speed reproduction in special reproduction, an image is reproduced at double speed. The audio signal will be reproduced at an arbitrary speech rate set by the user of the AV device from 1 to 2 times and at a normal pitch. Therefore, by halving the reproduction time, it is possible to effectively utilize the time, and it is possible to reproduce the content of the voice at a desired speech speed slower than double speed. Therefore, in the special reproduction of the audio information recording / reproducing apparatus such as the VTR, the excellent curing that the audio contents can be easily understood can be obtained.

[Brief description of drawings]

【図１】本発明の１実施例における声信号時間軸変換装
置の構成を示すブロック図である。FIG. 1 is a block diagram showing a configuration of a voice signal time base conversion device according to an embodiment of the present invention.

【図２】本実施例の音声信号時間軸変換装置における、
話速を１．５倍としたときの動作説明図である。FIG. 2 is a diagram showing an audio signal time base conversion device according to the present embodiment,
It is operation | movement explanatory drawing when a speech speed is set to 1.5 times.

【図３】本実施例の音声信号時間軸変換装置における、
話速を１．３３倍と１．１７倍としたときの動作説明図
である。FIG. 3 is a diagram showing an audio signal time base conversion device according to the present embodiment.
It is operation | movement explanatory drawing when the speech speed is set to 1.33 times and 1.17 times.

【図４】第１の従来例における音声信号時間軸変換装置
の構成を示すブロック図である。FIG. 4 is a block diagram showing a configuration of an audio signal time base conversion device in a first conventional example.

【図５】従来例の音声信号時間軸変換装置における話速
変換の動作説明図である。FIG. 5 is an explanatory diagram of a speech speed conversion operation in a conventional audio signal time base conversion device.

【図６】第２の従来例における音声信号時間軸変換装置
の構成図である。FIG. 6 is a configuration diagram of an audio signal time base converter according to a second conventional example.

【図７】従来例の音声信号時間軸変換装置に用いられる
音程変換装置の構成を示すブロック図である。FIG. 7 is a block diagram showing a configuration of a pitch conversion device used in a conventional audio signal time base conversion device.

[Explanation of symbols]

１１，１３，１４，１５入力端子１２出力端子２０Ａ／Ｄ変換器２１書込アドレスカウンタ２２メモリ２３Ｄ／Ａ変換器２４第１の読出アドレスカウンタ２５第２の読出アドレスカウンタ２６ラッチ信号発生回路２７第１のラッチ２８第２のラッチ２９クロスフェード回路３０セレクタ回路 11, 13, 14, 15 Input terminal 12 Output terminal 20 A / D converter 21 Write address counter 22 Memory 23 D / A converter 24 First read address counter 25 Second read address counter 26 Latch signal generation circuit 27 First Latch 28 Second Latch 29 Crossfade Circuit 30 Selector Circuit

Claims

[Claims]

1. A time-series digital signal receiving an A / D conversion unit for converting an input analog audio signal into a digital signal with a first clock signal and a digital audio signal output from the A / D conversion unit. Memory means having a first memory area for holding preceding data in the audio signal, and a second memory area for holding subsequent data; and an output signal of the A / D converting means for inputting the first clock signal. Write address generating means for generating an address signal for writing into the first and second memory areas of the memory, and a second clock signal, which is stored in the first memory area of the memory means. First read address generating means for generating an address signal for reading the audio data, and the second memory of the memory means for inputting the second clock signal. Second read address generating means for generating an address signal for reading the audio data stored in the area, and a first read address generating means for inputting the second clock signal and read by the first read address generating means. First data holding means for holding the second audio data, and second data holding means for inputting the second clock signal and holding the second audio data read by the second read address generating means. Means, holding signal generating means for giving a data holding signal and a data output signal to the first and second data holding means, an output signal of the first data holding means and the second data holding means Means for inputting the output signal of the means, controlling and adding the amplitude values to each other, a crossfade means, an output signal of the first data holding means and an output signal of the crossfade means Type and outputs an audio signal into an analog signal and selector means for switching based on the reproduction speed setting signal, by the second clock signal the output signal of the selector means D /
A conversion means is provided, and when an analog audio signal reproduced at N times the normal reproduction speed is input, the frequency of the first clock signal is made N times the frequency of the second clock signal. By
An audio signal time base conversion device, which reproduces an audio signal at a speech speed M times as high as a normal reproduction speed.

2. A memory means having a first memory area for inputting a digital audio signal and holding preceding data in a time-series digital audio signal, and a second memory area for holding subsequent data, and a first memory area. Write address generating means for inputting a clock signal and generating an address signal for writing to the first and second memory areas of the memory; and a second clock signal for inputting a first clock signal of the memory means. A first read address generating means for generating an address signal for reading the audio data stored in the memory area; and an audio signal stored in the second memory area of the memory means by inputting the second clock signal. Second read address generating means for generating an address signal for reading data, and the first read address by inputting the second clock signal. A first data holding means for holding the first voice data read by the generating means, and a second voice read by the second read address generating means by inputting the second clock signal. A second data holding means for holding data; a holding signal generating means for giving a data holding signal and a data output signal to the first and second data holding means; and a first data holding means A crossfade means for inputting an output signal and an output signal of the second data holding means, controlling and adding amplitude values of each other, and an output signal of the first data holding means and an output signal of the crossfade means. And selector means for switching based on the reproduction speed setting signal, and when the digital audio signal reproduced at N times the normal reproduction speed is inputted, the first clock signal is input. The frequency of click signal, by the N times the frequency of the second clock signal,
An audio signal time base conversion device, which reproduces an audio signal at a speech speed M times as high as a normal reproduction speed.

3. The crossfading means controls the amplitude level of the output signal of the first data holding means so as to gradually decrease from the maximum to the minimum, and the output signal of the second data holding means. 3. The audio signal time base converter according to claim 1 or 2, wherein the signal is controlled so that the amplitude level thereof is gradually increased from the minimum to the maximum.

4. The selector means sets the signal time length output from the crossfade means to A
When the signal time length output from the first data holding means is B, M = (2A + B) / (A + B) is satisfied when a reproduction speed M times the normal reproduction speed is set. The audio signal time base conversion device according to any one of claims 1 to 3, wherein the input signal is switched.

5. When the read address difference which is the output signal difference between the first read address generating means and the second read address generating means is D and the cycle of the read clock signal is T, D and T 5. The product is set within 125 milliseconds and the time difference between the output signals of the first data holding means and the second data holding means is within 125 milliseconds. The audio signal time base conversion device according to item 1.