JPH0887298A

JPH0887298A - Method and device for processing audio signal

Info

Publication number: JPH0887298A
Application number: JP22220294A
Authority: JP
Inventors: Ryuzo Nagai; 龍三永井
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 1994-09-16
Filing date: 1994-09-16
Publication date: 1996-04-02
Anticipated expiration: 2019-04-12
Also published as: JP3517979B2

Abstract

PURPOSE: To correctly detect a pitch with an audio signal of a small bit length and to attain correct pitch correction processing by using the pitch. CONSTITUTION: An input digital audio signal of 24 bits is stored in a pitch operation data memory 26 while preserving respective codes as the quadripartite four kinds of sample data. The audio signal of 24 bits is stored as it is in an audio data memory 24. A maximum peak detection circuit 32 detects a maximum value related to the audio signal of a certain pitch detection retrieval section. Based on the maximum value, the relevant data of the quadripartite sample data of eight bits stored in the memory 26 are read out (it is called as bit shift processing), and self correlation is calculated in a pitch detection circuit 28, and the pitch is detected. When the pitch is detected, the audio signal stored in the memory 24 is read out based on the pitch. The pitch correction processing is performed by the read-out control.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明はオーディオ信号処理方法
とその装置に関するものであり、特に、ピッチコレクシ
ョン処理方法とその装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an audio signal processing method and apparatus, and more particularly to a pitch correction processing method and apparatus.

【０００２】[0002]

【従来の技術】ディジタル・オーディオ信号は通常、２
４ビットまたは１６ビットで量子化されているから、ピ
ッチコレクション処理においても、量子化ビットに合わ
せたビット長さで信号処理している。2. Description of the Prior Art Digital audio signals are typically 2
Since it is quantized by 4 bits or 16 bits, the signal processing is also performed in the bit length matching the quantized bit even in the pitch correction processing.

【０００３】[0003]

【発明が解決しようとする課題】２４ビット、あるい
は、１６ビットのビット長さで信号処理することは、信
号処理回路が複雑化し、演算時間もかかり、高価格にな
る。The signal processing with a bit length of 24 bits or 16 bits complicates the signal processing circuit, requires a long calculation time, and is expensive.

【０００４】本発明の目的は、オーディオ信号の量子化
ビット長さに依存されず、できるだけ短いビット長さで
正確にピッチ幅が検出でき、ひいては、正確にピッチコ
レクション処理が可能なオーディオ信号処理方法とその
装置を提供することにある。An object of the present invention is not to depend on the quantized bit length of an audio signal, but the pitch width can be accurately detected with the bit length as short as possible, and thus the pitch correction processing can be performed accurately. And to provide the device.

【０００５】また本発明の目的は、ステレオ信号につい
ても正確にピッチ幅が検出でき、正確にピッチコレクシ
ョン処理ができるオーディオ信号処理方法とその装置を
提供することにある。Another object of the present invention is to provide an audio signal processing method and apparatus capable of accurately detecting the pitch width of a stereo signal and accurately performing pitch correction processing.

【０００６】[0006]

【課題を解決するための手段】本発明によれば、ディジ
タル・オーディオ信号のピッチを検出し、このピッチを
用いてピッチコレクション処理を行うオーディオ信号処
理方法において、前記ディジタル・オーディオ信号のそ
れぞれを、符号を保存しつつ、複数のビットのサンプル
データに分割して保持し、あるピッチ検出検索区間にお
ける前記複数のディジタル・オーディオ信号について最
大値を検出し、該検出された最大値に基づいて、前記保
持されているサンプルデータの１種を選択し、該選択さ
れたサンプルデータを用いて、そのピッチ検出検索区間
におけるピッチ検出を行うことを特徴とするオーディオ
信号処理方法が提供される。According to the present invention, in the audio signal processing method for detecting the pitch of a digital audio signal and performing pitch correction processing using this pitch, each of the digital audio signals is While storing the code, it is divided into a plurality of bits of sample data and held, a maximum value is detected for the plurality of digital audio signals in a pitch detection search section, and based on the detected maximum value, the maximum value is detected. There is provided an audio signal processing method, characterized in that one type of held sample data is selected, and pitch detection in the pitch detection search section is performed using the selected sample data.

【０００７】特定的には、前記ピッチ検出は、前記選択
されたサンプルデータの自己相関を算出し、最大強度を
示す間隔をピッチ幅として検出する。Specifically, in the pitch detection, the autocorrelation of the selected sample data is calculated, and the interval showing the maximum intensity is detected as the pitch width.

【０００８】好適には、前記下位のデータが所定の値を
越えている場合、その所定の値以内に制限する。[0008] Preferably, when the lower data exceeds a predetermined value, it is limited to within the predetermined value.

【０００９】また好適には、前記ディジタル・オーディ
オ信号がステレオオーディオ信号の場合、ステレオ信号
として関連するチャネルのディジタル・オーディオ信号
を加算したのち、前記ビットシフト処理を行う。Further, preferably, when the digital audio signal is a stereo audio signal, the bit shift processing is performed after adding the digital audio signals of the related channels as a stereo signal.

【００１０】さらに好適には、ピッチ検出に使用するオ
ーディオ信号のピッチ成分を示すデータが消失されてい
るとき、前のピッチ検出検索区間において求めたピッチ
幅を用いて、ピッチコレクション処理を行う。More preferably, when the data indicating the pitch component of the audio signal used for pitch detection is lost, pitch correction processing is performed using the pitch width obtained in the previous pitch detection search section.

【００１１】また本発明によれば、上記オーディオ信号
処理方法を実施する装置が提供される。つまり、ピッチ
コレクション処理を行う本発明のオーディオ信号処理装
置は、入力オーディオ信号をそのまま記憶する第１のメ
モリ手段と、該入力オーディオ信号信号を、それぞれ符
号を保持しつつ、所定のビットに分割した複数のサンプ
ルデータとして保持する第２のメモリ手段と、あるピッ
チ検出検索区間内の前記オーディオ信号の最大値を検出
する最大値検出段と、該最大値検出手段で検出された最
大値に基づいて、前記１オーディオ信号について複数の
サンプルデータのうちの適切なサンプルデータを、前記
第２のメモリ手段から読み出すサンプルデータ読みだし
手段と、該読み出されたサンプルデータを用いて、ピッ
チ幅を検出するピッチ演算手段と、上記第１のメモリ手
段、第２のメモリ手段、最大値検出手段、サンプルデー
タ読みだし手段およびピッチ演算手段を制御する制御手
段とを有する。また、前記検出されたピッチ幅に基づい
て、前記第１のメモリ手段に記憶されているオーディオ
信号を読み出す、ピッチコレクション処理手段をさらに
有する。According to the present invention, there is also provided an apparatus for implementing the above audio signal processing method. That is, the audio signal processing device of the present invention which performs the pitch correction processing divides the input audio signal signal into a predetermined bit while holding the first memory means for directly storing the input audio signal and the input audio signal signal. Based on a second memory means for holding a plurality of sample data, a maximum value detection stage for detecting the maximum value of the audio signal in a certain pitch detection search section, and a maximum value detected by the maximum value detection means. , A sample data reading means for reading appropriate sample data of a plurality of sample data for the one audio signal from the second memory means, and a pitch width is detected by using the read sample data. Pitch calculation means, first memory means, second memory means, maximum value detection means, sample data Heading and a control means for controlling the means and pitch calculation means. Further, it further comprises pitch correction processing means for reading the audio signal stored in the first memory means based on the detected pitch width.

【００１２】[0012]

【作用】ビットシフト処理を行って、ピッチ検出に必要
なオーディオ信号の信号成分（サンプルデータ）を抽出
し、この短いビット長のサンプルデータを用いて、ピッ
チ幅を検出するので、回路が簡単になり、演算時間も短
縮できる。The circuit is simplified because the bit shift processing is performed to extract the signal component (sample data) of the audio signal necessary for pitch detection, and the pitch width is detected using this short bit length sample data. Therefore, the calculation time can be shortened.

【００１３】ビットシフトにおいて、サンプルデータが
所定の値を越えている場合、サンプルデータをその所定
の値以内に制限して、ピッチ検出に使用可能にする。In the bit shift, when the sample data exceeds a predetermined value, the sample data is limited within the predetermined value and can be used for pitch detection.

【００１４】ディジタル・オーディオ信号がステレオオ
ーディオ信号の場合、ステレオ信号として関連するチャ
ネルのディジタル・オーディオ信号を加算して、２チャ
ネルのオーディオ信号の両者に共通のピッチ検出が可能
なようにする。When the digital audio signal is a stereo audio signal, the digital audio signals of related channels are added as a stereo signal to enable pitch detection common to both of the two channels of audio signals.

【００１５】ピッチ検出に使用するオーディオ信号のピ
ッチ成分を示すデータが消失されているとき、前のピッ
チ検出検索区間において求めたピッチ幅を用いて、ピッ
チコレクション処理を継続させる。When the data indicating the pitch component of the audio signal used for pitch detection is lost, the pitch correction processing is continued using the pitch width obtained in the previous pitch detection search section.

【００１６】[0016]

【実施例】図１は本発明のオーディオ信号処理方法とそ
の装置が適用されるプログラムプレーシステムの構成図
である。プログラムプレーとは、映像信号（ビデオ信号
または画像信号）および音声信号（オーディオ信号）を
再生する際、再生スピードを変化させて再生時間の短縮
または延長を行う特殊再生処理を言う。図１において、
ブロックＡ（上段のブロック）が、可変速度で動作可能
な部分を示し、ブロックＢが通常の速度で動作可能な部
分を示す。プログラムプレーシステムにおいては、ビデ
オ記録再生装置（ＶＴＲ装置、図示せず）においてビデ
オ信号およびオーディオ信号（これらを総称してＡＶ信
号という）が記録されたビデオテープ２からビデオ信号
とオーディオ信号を再生する。この再生段階において、
ＶＴＲ装置内のエラー訂正回路（ＥＣＣ回路）４におい
てエラー訂正処理を行う。エラー訂正されたビデオ信号
がフレームシンクロナイザー６に印加され、エラー訂正
されたオーディオ信号がピッチコレクション処理回路８
に印加される。フレームシンクロナイザー６は、再生周
波数と外部周波数との合わせ込みを行い、画像のコマ落
としまたはコマ追加などを行い、処理された画像信号を
出力する。ピッチコレクション処理回路８およびメモリ
１０は、可変再生されたオーディオ信号を単位時間当た
りのデータ量を制御して、オーディオ信号の周波数を変
化させずに、再生速度を変化させる。1 is a block diagram of a program play system to which an audio signal processing method and apparatus of the present invention are applied. Program play refers to special reproduction processing in which, when reproducing a video signal (video signal or image signal) and an audio signal (audio signal), the reproduction speed is changed to shorten or extend the reproduction time. In FIG.
A block A (upper block) shows a part operable at a variable speed, and a block B shows a part operable at a normal speed. In the program play system, a video recording / reproducing apparatus (VTR apparatus, not shown) reproduces a video signal and an audio signal from a video tape 2 on which a video signal and an audio signal (collectively referred to as an AV signal) are recorded. . During this regeneration stage,
Error correction processing is performed in the error correction circuit (ECC circuit) 4 in the VTR device. The error-corrected video signal is applied to the frame synchronizer 6, and the error-corrected audio signal is output to the pitch correction processing circuit 8.
Is applied to The frame synchronizer 6 matches the reproduction frequency and the external frequency, performs frame dropping or addition of images, and outputs a processed image signal. The pitch correction processing circuit 8 and the memory 10 control the data amount of the variably reproduced audio signal per unit time, and change the reproduction speed without changing the frequency of the audio signal.

【００１７】図２を参照してピッチコレクション処理に
ついて詳細に述べる。図２（Ａ）は、原オーディオ信号
の波形図である。オーディオ信号はそれぞれ特有のピッ
チと呼ばれる基本波で構成されており、音声の周波数は
ピッチの集まった大きなうねりで決定される。つまり、
図２（Ａ）に示した波形の山の間隔で決定される。した
がって、基本波単位でのデータの削除または追加を行う
と、全体の周波数に影響を与えることなく、データの量
を制御できる。図２（Ｂ）にピッチコレクション処理し
たオーディオ信号の波形を示す。この例では、図２
（Ａ）に示した最後のピッチの音声信号を削除してい
る。ＥＣＣ回路４からのオーディオ信号は可変再生され
ているが、ピッチコレクション処理回路８は、そのオー
ディオ信号を一旦、メモリ１０に記録し、再び、メモリ
１０からオーディオ信号を読み出すときに、ピッチ単位
のデータを飛び越して読み出す。その結果、図２（Ｂ）
に示したピッチコレクション処理したオーディオ信号が
得られる。The pitch correction process will be described in detail with reference to FIG. FIG. 2A is a waveform diagram of the original audio signal. Each audio signal is composed of a fundamental wave called a peculiar pitch, and the frequency of voice is determined by a large swell of pitches. That is,
It is determined by the interval of the peaks of the waveform shown in FIG. Therefore, if data is deleted or added in fundamental wave units, the amount of data can be controlled without affecting the overall frequency. FIG. 2B shows the waveform of the audio signal subjected to the pitch correction processing. In this example, FIG.
The audio signal of the last pitch shown in (A) is deleted. Although the audio signal from the ECC circuit 4 is variably reproduced, the pitch correction processing circuit 8 temporarily records the audio signal in the memory 10 and then, when the audio signal is read from the memory 10 again, the pitch unit data is recorded. Jump over and read. As a result, FIG. 2 (B)
The pitch-corrected audio signal shown in is obtained.

【００１８】ピッチコレクション処理回路８におけるピ
ッチコレクション処理において、ピッチ（ピッチ幅）の
検出が重要である。このピッチ幅検出のため、ピッチコ
レクション処理回路８は、メモリ１０に記録させたオー
ディオ信号の自己相関を求める。そのピッチ検出方法
を、図３および図４を参照して述べる。ピッチコレクシ
ョン処理回路８は、メモリ１０に記録されている、ある
サーチ（検索）区間のオーディオ信号、この例では、１
０２４個のオーディオ信号について、２つのオーディオ
信号（サンプル）を同時に読みだし、その自己相関を算
出する。つまり、あるサーチ区間のスタートポイントが
決まると、ピッチコレクション処理回路８はメモリ１０
の、そのポイントから５１２サンプルを基準のデータ列
としてメモリ１０から読みだし、同時に、同じ基準のデ
ータ列のデータ、５１２個のサンプルを読み出して、対
応する順番にある２つのサンプルを乗算して、これら乗
算した結果を加算していく。５１２個のサンプルについ
てこれを演算した結果がラグ０、つまり、２系統のデー
タ列に遅れがない状態の自己相関結果となる。次いで、
基準のデータ列の５１２個のデータと、この基準のデー
タ列のスタートポインから１〜数ポイントずれた位置か
ら５１２個のデータを同時に読み出して、対応する順序
のデータ相互の乗算を行い、それらの結果を加算してい
く。５１２個のサンプルについてこれを演算した結果が
ラグ１、つまり、２系統のデータ列に、ある単位量だけ
遅れがある状態の自己相関結果となる。以下、同様に、
ラグ２〜ラグ１１４の自己相関を計算していく。以上の
結果をプロットすると、図４に図解したグラフとなる。
自己相関が強く出る部分、つまり、図４において、山と
なる部分の間隔がピッチ幅である。ピッチコレクション
処理回路８はそのような山を検出して、ピッチ幅を確定
する。In the pitch correction processing in the pitch correction processing circuit 8, it is important to detect the pitch (pitch width). To detect the pitch width, the pitch correction processing circuit 8 obtains the autocorrelation of the audio signal recorded in the memory 10. The pitch detecting method will be described with reference to FIGS. 3 and 4. The pitch correction processing circuit 8 is an audio signal recorded in the memory 10 in a certain search section, which is 1 in this example.
For 024 audio signals, two audio signals (samples) are read out at the same time and their autocorrelation is calculated. That is, when the start point of a certain search section is determined, the pitch correction processing circuit 8 causes the memory 10
From that point, 512 samples are read from the memory 10 as a reference data string, at the same time, data of the same reference data string, 512 samples are read, and two samples in a corresponding order are multiplied, The results of these multiplications are added. The result of calculating this for 512 samples is the lag 0, that is, the autocorrelation result in the state where there is no delay in the data strings of the two systems. Then
The 512 pieces of data of the reference data row and the 512 pieces of data at a position 1 to a few points away from the start point of the reference data row are read at the same time, and the data in the corresponding order are multiplied by each other. Add the results. The result of calculating this with respect to 512 samples is the lag 1, that is, the autocorrelation result in a state in which the data strings of the two systems are delayed by a certain unit amount. Hereafter, similarly,
The autocorrelation of lag 2 to lag 114 is calculated. When the above results are plotted, the graph illustrated in FIG. 4 is obtained.
The pitch width is the interval where the autocorrelation is strong, that is, the peak portion in FIG. The pitch correction processing circuit 8 detects such peaks and determines the pitch width.

【００１９】通常、ディジタル・オーディオ処理におい
ては、１６ビットまたは２４ビットの量子化データが一
般的である。ピッチコレクション処理回路８は、上記ピ
ッチ幅検出のために、たとえば、１６ビットのオーディ
オ信号の場合、１つラグの自己相関の計算のために、
（１６ビット×１６ビット）の乗算と、これら乗算結果
を加算していく加算処理を５１２回行う。その演算結果
は通常、１６ビットで保存する。また、２４ビットのオ
ーディオ信号の場合、ピッチコレクション処理回路８
は、１つラグの自己相関の計算のために、（２４ビット
×２４ビット）の乗算と加算を５１２回行う。その演算
結果は２４ビットである。通常、ピッチ幅検出のための
上記演算を、量子化のビット数に合わせて行っている。
つまり、２４ビットの量子化データの場合には、２４ビ
ットの上記演算を行い、１６ビットの量子化のデータの
場合には１６ビットの上記演算を行っている。しかしな
がら、このままでは、ビット数が多いので、演算回路が
大規模化している。演算回路をディジタル信号プロセッ
サ（ＤＳＰ）を用いて行うにせよ、ＩＣ回路を用いるに
せよ、ビット数の長い演算は回路構成が複雑になる。価
格も高くなる。さらに演算時間もかかる。In digital audio processing, 16-bit or 24-bit quantized data is generally used. The pitch correction processing circuit 8 detects the pitch width, for example, in the case of a 16-bit audio signal, calculates the autocorrelation of one lag,
The multiplication of (16 bits × 16 bits) and the addition process of adding the multiplication results are performed 512 times. The operation result is normally stored in 16 bits. In the case of a 24-bit audio signal, the pitch correction processing circuit 8
Performs (24 bits × 24 bits) multiplications and additions 512 times in order to calculate one-lag autocorrelation. The operation result is 24 bits. Usually, the above-mentioned calculation for pitch width detection is performed according to the number of bits of quantization.
That is, in the case of 24-bit quantized data, the above operation of 24 bits is performed, and in the case of quantized data of 16 bits, the above operation of 16 bits is performed. However, if this is left as it is, the number of bits is large, so that the arithmetic circuit becomes large-scaled. Whether the arithmetic circuit is a digital signal processor (DSP) or an IC circuit, an arithmetic operation with a long bit number complicates the circuit configuration. The price will also increase. In addition, it takes calculation time.

【００２０】そこでビット数を減少させることが考えら
れる。たとえば、図５に示したように、ダイナミックレ
ンジが１６ビットである場合、１６ビットのサンプルを
８ビットのサンプルとして、ピッチ幅検出を行うことも
できる。その理由は、ダイナミックレンジが１６ビット
に対して、その半分のビット数でも、充分に山の部分の
波形を表すことができるからである。同様に、図５に示
したディジタル・オーディオ信号の場合は、２４ビット
のダイナミックレンジを持つサンプルを、１２ビットの
サンプルとしてピッチ幅検出のデータとして使用するこ
ともできる。しかしながら、図６に示したように、１６
ビットのダイナミックレンジに対して、最大振幅が、１
６ビットの半分にも満たない場合には、８ビットのビッ
ト数にビット数を減少させて自己相関を演算しても意味
がなく、ピッチ幅を検出することができない。以上の事
情により、これまで、量子化ビット数に合わせたビット
数のサンプルを用いて、ピッチコレクション処理を行っ
ていた。その結果、演算処理回路の構成が複雑になり、
価格が高くなり、演算時間も長くなっていた。Therefore, it is possible to reduce the number of bits. For example, as shown in FIG. 5, when the dynamic range is 16 bits, pitch width detection can be performed by using 16-bit samples as 8-bit samples. The reason is that even if the dynamic range is 16 bits, even with half the number of bits, the waveform of the mountain portion can be sufficiently expressed. Similarly, in the case of the digital audio signal shown in FIG. 5, a sample having a dynamic range of 24 bits can be used as a 12-bit sample for pitch width detection data. However, as shown in FIG.
Maximum amplitude is 1 for the dynamic range of bits
If it is less than half of 6 bits, it is meaningless to calculate the autocorrelation by reducing the number of bits to 8 bits, and the pitch width cannot be detected. Due to the above circumstances, the pitch correction process has been performed so far by using the sample of the number of bits matching the number of quantization bits. As a result, the configuration of the arithmetic processing circuit becomes complicated,
The price was high and the calculation time was long.

【００２１】上記問題を解決する方法を述べる。図７は
オーディオ信号処理装置２０の構成図である。オーディ
オ信号処理装置２０は、データ制御回路２２、オーディ
オデータ・メモリ２４、ピッチ演算データ・メモリ２
６、ピッチ検出回路２８、メモリアドレスカウンター３
０、および、最大ピーク検出回路３２を有する。本実施
例では、２４ビットのダイナミックレンジを持つオーデ
ィオ信号を８ビットのサンプルとしてピッチコレクショ
ン処理、換言すれば、ピッチ幅検出に使用する場合を述
べる。A method for solving the above problem will be described. FIG. 7 is a block diagram of the audio signal processing device 20. The audio signal processing device 20 includes a data control circuit 22, an audio data memory 24, a pitch calculation data memory 2
6, pitch detection circuit 28, memory address counter 3
It has 0 and the maximum peak detection circuit 32. In the present embodiment, a case will be described in which an audio signal having a dynamic range of 24 bits is used as a sample of 8 bits for pitch correction processing, in other words, for pitch width detection.

【００２２】図１に示したビデオテープ２などの記録媒
体から再生されて、ＥＣＣ回路４においてエラー訂正が
行われた２４ビットの再生オーディオ信号がデータ制御
回路２２に印加される。データ制御回路２２は、２４ビ
ットの再生オーディオ信号Ｓ２２Ａをそのまま、オーデ
ィオデータ・メモリ２４に記憶させる。オーディオデー
タ・メモリ２４は再生オーディオ信号Ｓ２２Ａをそのま
ま記憶・保持しておく。オーディオデータ・メモリ２４
から再度、オーディオ信号が読み出されるとき、メモリ
アドレスカウンター３０から与えられるアドレスによっ
てその読みだし順序を制御することにより、その読み出
された信号Ｓ２４が、入力された再生オーディオ信号Ｓ
２２Ａに対して（たとえば、図２（Ａ）に対して）、ピ
ッチコレクション処理された信号となる（たとえば、図
２（Ｂ）に示した信号となる）。オーディオデータ・メ
モリ２４には、２４ビットの再生オーディオ信号を、符
号＋７ビットのデータ、合計８ビット単位で、４部分に
分け、１サンプル４個の部分データとして記憶されてい
る。この詳細については後述する。データ制御回路２２
は、ピッチ検出回路２８においてピッチ幅検出に用いる
オーディオ信号をピッチ演算データ・メモリ２６に記憶
させる。ただし、データ制御回路２２は、ピッチ演算デ
ータ・メモリ２６には、オーディオ信号１サンプルに対
して、上述したように、４種類、ビットシフトしたオー
ディオ信号を記憶させる。なお、ピッチ演算データ・メ
モリ２６に記憶させるデータは、２チャネル・ステレオ
信号を加算したものであり、さらに、２４ビットを８ビ
ットにしたデータであるから、ピッチ演算データ・メモ
リ２６に記憶するデータの量が増加することはない。A 24-bit reproduced audio signal reproduced from a recording medium such as the video tape 2 shown in FIG. 1 and subjected to error correction in the ECC circuit 4 is applied to the data control circuit 22. The data control circuit 22 stores the 24-bit reproduced audio signal S22A as it is in the audio data memory 24. The audio data memory 24 stores and holds the reproduced audio signal S22A as it is. Audio data memory 24
When the audio signal is read again from, the reading order is controlled by the address given from the memory address counter 30, so that the read signal S24 is the input reproduced audio signal S.
22A (for example, for FIG. 2A), the pitch-corrected signal is obtained (for example, the signal shown in FIG. 2B). In the audio data memory 24, a 24-bit reproduced audio signal is divided into 4 parts in a unit of code + 7 bits of data, totaling 8 bits, and stored as 4 pieces of partial data of 1 sample. The details will be described later. Data control circuit 22
Stores the audio signal used for pitch width detection in the pitch detection circuit 28 in the pitch calculation data memory 26. However, the data control circuit 22 causes the pitch calculation data memory 26 to store four types of bit-shifted audio signals for one audio signal sample, as described above. The data stored in the pitch calculation data memory 26 is the data obtained by adding the two-channel stereo signals, and is the data in which 24 bits are changed to 8 bits. Does not increase in quantity.

【００２３】データ制御回路２２におけるビットシフト
について述べる。２４ビットのダイナミックレンジをも
つオーディオ信号をいかにして８ビットのサンプルとし
てピッチ検出が可能かについて述べる。オーディオ信号
は強弱を含むから、図５に示したように、２４ビットの
ダイナミックレンジ一杯に最大振幅を有するオーディオ
信号もあれば、図６に示すように、８ビット未満の最大
振幅を有するオーディオ信号も存在する。このように最
大振幅が８ビット未満のオーディオ信号をそのまま用い
ても、ピッチ検出はできない。本発明においては、図８
（Ａ）に図解した振幅の小さなオーディオ信号に対して
は、図８（Ａ）に中央部分のオーディオ信号に対して、
図８（Ｂ）に図解したように、ビットシフト（振幅増
幅）を行う。つまり、図８（Ａ）に図解したオーディオ
信号の下位のビットを使用する。ただし、ビットシフト
を行う場合、ピッチ検索区間内で一律のシフト量に固定
する。ある検索区間内の複数のオーディオ信号について
ビットシフトしたサンプルを用いて、自己相関を計算す
るから、同じ区間で、同じダイナミックレンジになって
いればよい。そのとき、シフト量よりも大きなデータサ
ンプルが存在したときは、アンダーフローした部分、Ｕ
Ｆ１、ＵＦ２またはオーバーフローした部分ＯＦ１、Ｏ
Ｆ２に対して、リミッタをかけてクリップした状態にす
る。実際には、ピッチ演算データ・メモリ２６に記憶さ
れているオーディオ信号の、サイン（符号）を保存し
て、下位７ビットを抽出して８ビットのサンプルとして
抽出する。このとき、７ビットを越えるサンプルは７ビ
ット以内のサンプルとする。同じピッチ検索区間におけ
る複数のオーディオ信号については、通常、ほぼ同じ大
きさの振幅を有していると考えられる。しかしながら、
それらの振幅にはバラツキがある。そこで、上述したよ
うに、２４ビットの１つのオーディオ信号を４分割し
た、４種類のビットシフトしたサンプルデータをピッチ
演算データ・メモリ２６に記憶させておき、最大ピーク
検出回路３２で最大ピークを検出して、適切なビットシ
フトしたサンプルを用いる。この詳細は後述する。The bit shift in the data control circuit 22 will be described. It will be described how an audio signal having a dynamic range of 24 bits can be detected as an 8-bit sample for pitch detection. Since an audio signal includes strength and weakness, some audio signals have a maximum amplitude over a dynamic range of 24 bits as shown in FIG. 5, and some audio signals have a maximum amplitude of less than 8 bits as shown in FIG. Also exists. Thus, pitch detection cannot be performed even if an audio signal having a maximum amplitude of less than 8 bits is used as it is. In the present invention, FIG.
For an audio signal with a small amplitude illustrated in (A), for the audio signal in the central portion of FIG.
Bit shift (amplitude amplification) is performed as illustrated in FIG. That is, the lower bits of the audio signal illustrated in FIG. 8A are used. However, when performing the bit shift, it is fixed to a uniform shift amount within the pitch search section. Since the autocorrelation is calculated by using the bit-shifted samples of a plurality of audio signals in a certain search section, it is sufficient that the same section has the same dynamic range. At that time, if there is a data sample larger than the shift amount, the underflowed portion, U
F1, UF2 or overflowed parts OF1, O
A limiter is applied to F2 to clip it. In practice, the sign (sign) of the audio signal stored in the pitch calculation data memory 26 is stored, and the lower 7 bits are extracted and extracted as an 8-bit sample. At this time, samples exceeding 7 bits are samples within 7 bits. It is generally considered that a plurality of audio signals in the same pitch search section have almost the same amplitude. However,
There are variations in their amplitude. Therefore, as described above, four types of bit-shifted sample data obtained by dividing one 24-bit audio signal into four are stored in the pitch calculation data memory 26, and the maximum peak detection circuit 32 detects the maximum peak. And use the appropriate bit-shifted samples. The details will be described later.

【００２４】再生オーディオ信号Ｓ２２Ｂは、ステレオ
ペアの２チャネルのデータが加算さこれた後、最大ピー
ク検出回路３２に印加される。最大ピーク検出回路３２
は、ピッチ検出検索区間におけるオーディオ信号の最大
ピークを検出する。最大ピーク検出回路３２で検出した
最大ピークは、メモリアドレスカウンター３０へ印加さ
れて、ピッチ演算データ・メモリ２６における４種のサ
ンプルのうちの最適なサンプルを選択するためのメモリ
アドレス信号として使用される。ピッチ検出回路２８
は、メモリアドレスカウンター３０からのメモリアドレ
スに基づいてピッチ演算データ・メモリ２６から読み出
されたデータＳ２６を用いてピッチ検出のための演算を
行う。このピッチ検出のための演算としては、好適に
は、上述した自己相関演算を行う。ただし、オーディオ
信号の強度を検出する他の演算、たとえば、パワースペ
クトルを算出する方法など、他の演算方法も適用でき
る。ただし、このような演算は押し並べて、たとえば、
ピッチ検出検索区間として１０２４個程度のデータ区間
について、５１２個程度のサンプルデータを用いて、乗
算と加算とを繰り返していくので、自己相関に限らず、
演算は膨大な量となる。したがって、ピッチ検出に使用
するサンプルデータのビット長さが短いことは演算に非
常に有利になる。The reproduced audio signal S22B is applied to the maximum peak detection circuit 32 after the data of the two channels of the stereo pair have been added. Maximum peak detection circuit 32
Detects the maximum peak of the audio signal in the pitch detection search section. The maximum peak detected by the maximum peak detection circuit 32 is applied to the memory address counter 30 and used as a memory address signal for selecting the optimum sample of the four kinds of samples in the pitch operation data memory 26. . Pitch detection circuit 28
Performs a calculation for pitch detection using the data S26 read from the pitch calculation data memory 26 based on the memory address from the memory address counter 30. As the calculation for this pitch detection, the above-described autocorrelation calculation is preferably performed. However, other calculation methods for detecting the strength of the audio signal, such as a method for calculating the power spectrum, can also be applied. However, such operations can be pushed side by side, for example,
Since multiplication and addition are repeated using about 512 sample data for about 1024 data sections as the pitch detection search section, not limited to autocorrelation,
The calculation is enormous. Therefore, the short bit length of the sample data used for pitch detection is very advantageous for calculation.

【００２５】データ制御回路２２は、ピッチ幅が検出さ
れると、そのピッチ幅に基づいて、オーディオデータ・
メモリ２４から読み出すサンプルデータの読みだしアド
レスを制御して、図２で図解して述べたピッチコレクシ
ョン処理を行う。When the pitch width is detected, the data control circuit 22 detects the audio data, based on the pitch width.
The read address of the sample data read from the memory 24 is controlled to perform the pitch correction processing illustrated in FIG.

【００２６】以上のように、たとえば、２４ビットの量
子化長さのオーディオ信号でも、８ビットのビット長さ
のオーディオ信号を用いて、ピッチ幅を検出できる。同
じピッチ検索区間内のオーディオ信号についての自己相
関結果は、相対的な値が判ればよいから、ピッチ幅検出
においてオーディオ信号のビット長を短くしても、ピッ
チ幅の検出精度には影響がない。従って、ピッチコレク
ション処理も正確に行うことができる。ビット長の短い
オーディオ信号を用いてピッチ幅検出処理を行うので、
その回路構成が簡単になり、演算時間も短縮できる。さ
らに、ピッチコレクション処理を行う回路の価格が低下
する。As described above, even for an audio signal having a quantized length of 24 bits, the pitch width can be detected by using an audio signal having a bit length of 8 bits. Since the autocorrelation result for audio signals in the same pitch search section only needs to know the relative value, even if the bit length of the audio signal is shortened in pitch width detection, the accuracy of pitch width detection is not affected. . Therefore, the pitch correction process can be performed accurately. Since pitch width detection processing is performed using an audio signal with a short bit length,
The circuit configuration becomes simple and the calculation time can be shortened. Further, the cost of the circuit that performs the pitch correction process is reduced.

【００２７】次いで、ステレオ信号に対する処理につい
て述べる。図１０にその処理回路構成を示す。図１０に
示したステレオ信号ピッチ検出回路４０は、シリアル／
バラレル（Ｓ／Ｐ）変換回路４４と、加算器４６と、ビ
ットシフター４８を有するデータ制御回路４２と、ピッ
チ演算メモリ５０と、ピッチ演算器５２とを有する。図
１０に示したステレオ信号ピッチ検出回路４０は、図７
に示した、データ制御回路２２、ピッチ演算データ・メ
モリ２６、ピッチ検出回路２８の部分に関係しており、
図１０に示したステレオ信号ピッチ検出回路４０に、図
７に示した最大ピーク検出回路３２、メモリアドレスカ
ウンター３０およびオーディオデータ・メモリ２４が付
加されて、ピッチ幅を検出する回路となる。Next, the processing for a stereo signal will be described. FIG. 10 shows the processing circuit configuration. The stereo signal pitch detection circuit 40 shown in FIG.
It includes a parallel (S / P) conversion circuit 44, an adder 46, a data control circuit 42 having a bit shifter 48, a pitch calculation memory 50, and a pitch calculation unit 52. The stereo signal pitch detection circuit 40 shown in FIG.
Related to the data control circuit 22, the pitch calculation data memory 26, and the pitch detection circuit 28 shown in FIG.
The maximum peak detection circuit 32, the memory address counter 30, and the audio data memory 24 shown in FIG. 7 are added to the stereo signal pitch detection circuit 40 shown in FIG. 10 to form a circuit for detecting the pitch width.

【００２８】図７および図１０において入力される再生
オーディオ信号は、通常、２４ビットのデータが４チャ
ネル存在する。通常、チャネル１とチャネル２との組合
せ、および、チャネル３とチャネル４との組合せで、ス
テレオ信号になっているので、ピッチ検出もこれを考慮
して行う。ステレオ信号に対するピッチ検出は、各々の
信号に共通するピッチを検出することが重要である。そ
のため、ピッチ検出の前処理として、２つのチャネルの
信号の加算を行う。このピッチ検出の前処理として、ビ
ットシフトの前に、加算を行う理由を述べる。ビットシ
フトが行われているとすると、サンプルデータの最大値
がクリップされた結果を加算する場合がある。この場
合、加算することによって、データのピッチ特性が失わ
れる可能性がある。さらに、ステレオ信号を加算したと
き、２つのチャネルの信号が逆位相の場合には、加算し
た結果が全く振幅成分を失うことになる。よって、ビッ
トシフトの前に、２チャネルのオーディオ信号を加算す
ることにしている。The reproduced audio signal input in FIGS. 7 and 10 usually has four channels of 24-bit data. Usually, a combination of channel 1 and channel 2 and a combination of channel 3 and channel 4 form a stereo signal, so pitch detection is also performed in consideration of this. In pitch detection for a stereo signal, it is important to detect a pitch common to each signal. Therefore, the signals of the two channels are added as preprocessing for pitch detection. As the preprocessing for this pitch detection, the reason why addition is performed before bit shifting will be described. If bit shift is performed, the result of clipping the maximum value of the sample data may be added. In this case, there is a possibility that the pitch characteristics of the data may be lost by the addition. Furthermore, when the stereo signals are added, if the signals of the two channels have opposite phases, the addition result will lose the amplitude component at all. Therefore, the audio signals of two channels are added before the bit shift.

【００２９】Ｓ／Ｐ変換回路４４は、シリアル形式で入
力される２４ビットの再生オーディオ信号を２チャネル
のそれぞれ２４ビットのステレオ・オーディオ信号に変
換する。加算器４６は、ビットシフター４８におけるビ
ットシフトの前に、２チャネルのステレオ・オーディオ
信号を加算する。ビットシフター４８は、２チャネルの
オーディオ信号が加算されたステレオ・オーディオ信号
について、図８および図９で図解して述べたビットシフ
ト処理を行う。ピッチ演算器５２は、このように、必要
に応じて、ビットシフトしたオーディオ信号を用いて、
ピッチ検出演算を行う。以上述べたように、第２実施例
のステレオ信号ピッチ検出回路４０によれば、ビットシ
フター４８の処理の前に、加算器４６における２チャネ
ルの加算を行っているので、ビットシフトを行った後に
ステレオ加算処理を行う場合に比べて、ピッチ検出の精
度が向上し、ピッチコレクション処理の信頼性も向上す
る。The S / P conversion circuit 44 converts a 24-bit reproduced audio signal input in serial format into 24-channel stereo audio signals of two channels. The adder 46 adds the two-channel stereo audio signals before the bit shift in the bit shifter 48. The bit shifter 48 performs the bit shift processing illustrated in FIGS. 8 and 9 on the stereo audio signal in which the audio signals of 2 channels are added. In this way, the pitch calculator 52 uses the bit-shifted audio signal as necessary,
Performs pitch detection calculation. As described above, according to the stereo signal pitch detection circuit 40 of the second embodiment, since the two channels are added in the adder 46 before the processing of the bit shifter 48, after the bit shift is performed. The accuracy of pitch detection is improved and the reliability of pitch correction processing is also improved as compared with the case where stereo addition processing is performed.

【００３０】オーディオ信号についてビットシフトを数
種類設定したとき、その数種類のサンプルデータから、
ピッチ検出検索区間内のサンプルデータの強度（振幅）
に合わせて１つを選択する方法について述べる。ビット
シフトの値が最大に固定されていると、再生オーディオ
信号が最大振幅であるとき、図１１に示したように、最
大値のリミッターが常にかかる可能性がある。図１１に
示したようなリミッターがかかったデータからはピッチ
幅を検出することはできない。そこで、本発明において
は、以下に述べるように、ビットシフト量を入力される
オーディオ信号の大きさに合わせて適応的に行う。When several kinds of bit shifts are set for an audio signal, from several kinds of sample data,
Strength (amplitude) of sample data in the pitch detection search section
A method of selecting one according to the above will be described. If the value of the bit shift is fixed to the maximum, when the reproduced audio signal has the maximum amplitude, the limiter of the maximum value may be always applied as shown in FIG. The pitch width cannot be detected from the data subjected to the limiter as shown in FIG. Therefore, in the present invention, as described below, the bit shift amount is adaptively adjusted according to the size of the input audio signal.

【００３１】適応的なビットシフト処理の詳細を以下に
述べる。２４ビットのオーディオ信号から８ビットのデ
ータを切り出す場合、そのデータにリミッターがかから
ない範囲で最大の値にビットシフトすることが、原オー
ディオ信号を保存する意味でも望ましい。本実施例にお
いては、図１２に示した２４ビットのオーディオ信号
を、図１３に示したように、それぞれ８ビットの４種類
のデータに分割してピッチ演算メモリ５０に保存する。
８ビットのデータのそれぞれは元のオーディオ信号の符
号を保存している。したがって、有効なビット長は７ビ
ットとなる。４種類のデータの連続性を保ため、隣接す
るデータは重複したビットデータを含んでいる。図１２
のピッチ検出検索区間１またはピッチ検出検索区間２内
には、それぞれ、１つの山を持つ１０２４個程度のオー
ディオ信号が含まれている。これらの自己相関演算は相
当大きな量の演算であり、時間がかかる。ピッチ検出に
際しては、上記オーディオ信号を全て、図７に示したピ
ッチ演算データ・メモリ２６または図１０に示したピッ
チ演算メモリ５０に記憶した後に行う。したがって、ピ
ッチ演算データ・メモリ２６またはピッチ演算メモリ５
０にオーディオ信号を記憶させるときに、あるピッチ検
出検索区間におけるオーディオ信号の最大値は検出され
ていることになる。この最大値の検出は、図７に示した
オーディオ信号処理装置２０においては、最大ピーク検
出回路３２において行う。図１０に示したステレオ信号
ピッチ検出回路４０においては、加算器４６においてス
テレオ信号加算処理を行った後、この最大値の検出を行
う。あるピッチ検出検索区間において検出された最大値
を、図７のオーディオ信号処理装置２０においては最大
ピーク検出回路３２が保存し、この最大値を用いてメモ
リアドレスカウンター３０のアドレスを制御して、図１
３に示したように、ピッチ演算データ・メモリ２６に記
憶された、それぞれ２４ビットのオーディオ信号１つに
ついて４種類に８ビットのサンプルデータのうちの適切
なものを読み出していく。Details of the adaptive bit shift processing will be described below. When cutting out 8-bit data from a 24-bit audio signal, it is desirable to bit-shift to the maximum value in the range where the limiter is not applied to the data in order to save the original audio signal. In the present embodiment, the 24-bit audio signal shown in FIG. 12 is divided into four types of 8-bit data and stored in the pitch calculation memory 50, as shown in FIG.
Each of the 8-bit data stores the code of the original audio signal. Therefore, the effective bit length is 7 bits. In order to maintain the continuity of the four types of data, adjacent data includes overlapping bit data. 12
The pitch detection search section 1 and the pitch detection search section 2 each include about 1024 audio signals each having one peak. These autocorrelation operations are a fairly large amount of operations and are time consuming. The pitch detection is performed after all the audio signals are stored in the pitch calculation data memory 26 shown in FIG. 7 or the pitch calculation memory 50 shown in FIG. Therefore, the pitch calculation data memory 26 or the pitch calculation memory 5
When the audio signal is stored in 0, the maximum value of the audio signal in a certain pitch detection search section is detected. The maximum value is detected by the maximum peak detection circuit 32 in the audio signal processing device 20 shown in FIG. In the stereo signal pitch detection circuit 40 shown in FIG. 10, the maximum value is detected after the stereo signal addition processing is performed in the adder 46. In the audio signal processing device 20 of FIG. 7, the maximum peak detection circuit 32 stores the maximum value detected in a certain pitch detection search section, and the maximum value is used to control the address of the memory address counter 30, 1
As shown in FIG. 3, four kinds of 8-bit sample data, which are stored in the pitch calculation data memory 26, are read out for each 24-bit audio signal.

【００３２】図１３に示した４分割のサンプルデータ
は、オーディオ信号がステレオ信号の場合、表１に示し
た状態でピッチ演算メモリ５０に記憶されている。When the audio signal is a stereo signal, the 4-division sample data shown in FIG. 13 is stored in the pitch calculation memory 50 in the state shown in Table 1.

【００３３】[0033]

【表１】 [Table 1]

【００３４】表１において、Ｘ系はそれぞれ２４ビット
長のチャネル１のオーディオ信号とチャネル２のオーデ
ィオ信号とを加算したステレオ信号であり、加算したス
テレオ・オーディオ信号のビット長は２４ビットであ
り、第１のサンプルデータが〔符号、２２〜１６ビッ
ト〕、第２のサンプルデータが〔符号、１６〜１０ビッ
ト〕、第３のサンプルデータが〔符号、１１〜５ビッ
ト〕、第４のサンプルデータが〔符号、６〜０ビット〕
に４分割されて、ピッチ演算メモリ５０に記憶される。
Ｙ系はそれぞれ２４ビット長のチャネル３のオーディオ
信号とチャネル４のオーディオ信号とを加算したステレ
オ信号であり、加算したステレオ・オーディオ信号のビ
ット長は２４ビットであり、第１のサンプルデータが
〔符号、２２〜１６ビット〕、第２のサンプルデータが
〔符号、１６〜１０ビット〕、第３のサンプルデータが
〔符号、１１〜５ビット〕、第４のサンプルデータが
〔符号、６〜０ビット〕に４分割されて、ピッチ演算メ
モリ５０に記憶される。In Table 1, the X system is a stereo signal obtained by adding the audio signal of channel 1 and the audio signal of channel 2 each having a length of 24 bits, and the bit length of the added stereo audio signal is 24 bits, The first sample data is [sign, 22 to 16 bits], the second sample data is [sign, 16 to 10 bits], the third sample data is [sign, 11 to 5 bits], and the fourth sample data. Is [sign, 6-0 bits]
Is divided into four and stored in the pitch calculation memory 50.
The Y system is a stereo signal in which an audio signal of channel 3 and an audio signal of channel 4 each having a length of 24 bits are added, the bit length of the added stereo audio signal is 24 bits, and the first sample data is [ Code, 22 to 16 bits], the second sample data is [code, 16 to 10 bits], the third sample data is [code, 11 to 5 bits], and the fourth sample data is [code, 6 to 0]. Bit] and stored in the pitch calculation memory 50.

【００３５】ピッチ演算メモリ５０から、上述のように
して読み出した８ビットのサンプルデータがピッチ検出
回路２８に印加されてピッチ検出に用いられる。The 8-bit sample data read as described above from the pitch calculation memory 50 is applied to the pitch detection circuit 28 and used for pitch detection.

【００３６】図１２の波形図について、ピッチ検出検索
区間１のオーディオ信号は２４ビットのダイナミックレ
ンジ一杯に最大振幅があり、その他の連続する振幅の小
さい波形部分が８ビット以内にある。したがって、実質
的にビットシフトを行わず、最上位８ビットのサンプル
データを用いて、ピッチ検出を行うことになる。図１２
のピッチ検出検索区間２のオーディオ信号は最大値が小
さい。この場合はビットシフトした最下位８ビットのサ
ンプルデータを用いてピッチ検出を行うことになる。In the waveform diagram of FIG. 12, the audio signal in the pitch detection search section 1 has the maximum amplitude in the full 24-bit dynamic range, and the other continuous waveform portions with small amplitude are within 8 bits. Therefore, the bit detection is not substantially performed, and the pitch detection is performed using the most significant 8 bits of sample data. 12
The maximum value of the audio signal in the pitch detection search section 2 is small. In this case, pitch detection is performed using the bit-shifted least significant 8 bits of sample data.

【００３７】本発明の第３実施例について述べる。以上
の述べたビット削減処理をしてピッチ検出を行い、ピッ
チコレクション処理を行う場合、オーディオ信号の状態
によってピッチ検出ができなくなるときがある。たとえ
ば、ステレオ信号でない場合に（モノトーン・オーディ
オ信号の場合に）、原オーディオ信号そのものに全くピ
ッチ成分が存在しない場合は、ピッチ検出はできない。
また、たとえば、図１４に図解したように、チャネル２
のオーディオ信号が途中で反転した場合、ステレオ信号
処理のために、図１０のステレオ信号ピッチ検出回路４
０における加算器４６で、チャネル１のオーディオ信号
とチャネル２のオーディオ信号とを加算すると、逆位相
同士が相殺してしまい、図１５に示したように、途中か
らオーディオ信号成分が消失する。このようなオーディ
オ信号を用いては、ピッチ検出検索区間２においては、
ピッチ検出ができなくなる。第３実施例はこのような場
合を救済する。A third embodiment of the present invention will be described. When pitch detection is performed by performing the bit reduction processing described above and pitch correction processing is performed, there are cases where pitch detection cannot be performed depending on the state of the audio signal. For example, if it is not a stereo signal (in the case of a monotone audio signal), pitch detection cannot be performed if the original audio signal itself has no pitch component.
Also, for example, as illustrated in FIG.
When the audio signal of FIG. 10 is inverted on the way, the stereo signal pitch detection circuit 4 of FIG.
When the audio signal of channel 1 and the audio signal of channel 2 are added by the adder 46 of 0, the opposite phases cancel each other out, and the audio signal component disappears from the middle as shown in FIG. With such an audio signal, in the pitch detection search section 2,
The pitch cannot be detected. The third embodiment relieves such a case.

【００３８】図１５に示すように、途中でピッチ検出が
できないようなオーディオ信号が存在したとしても、オ
ーディオ信号の連続性を考慮すると、前のピッチ検出検
索区間のオーディオ信号と連続性を維持している訳であ
るから、そのような場合は正常に検出された前のピッチ
検出検索区間のピッチ幅を用いる。そのため、図１０の
ステレオ信号ピッチ検出回路４０におけるピッチ演算器
５２に代えて、図１６に示したピッチ演算器６２とピッ
チ情報保持回路６４からなる、ピッチ演算保持回路６０
を設ける。ピッチ演算器６２において、ピッチが正常に
検出できた場合、ピッチ情報保持回路６４を接点ａ側に
切り換えて、そのピッチ幅情報をピッチ演算保持回路６
０から出力する。もし、ピッチ演算器６２において、ピ
ッチが正常に検出できない場合、ピッチ情報保持回路６
４を接点ｂ側に切り換えて、前回のピッチ検出検索区間
において算出したピッチ幅情報を、ピッチ情報保持回路
６４を介して、ピッチ演算保持回路６０から出力する。
つまり、前回のピッチ検出検索区間で求めたピッチ幅情
報を出力する。これにより、瞬時的にピッチ成分が存在
しなくなっても、ピッチコレクション処理を継続するこ
とができる。As shown in FIG. 15, even if there is an audio signal whose pitch cannot be detected on the way, in consideration of the continuity of the audio signal, the continuity with the audio signal of the preceding pitch detection search section is maintained. Therefore, in such a case, the pitch width of the previous pitch detection search section that is normally detected is used. Therefore, in place of the pitch calculator 52 in the stereo signal pitch detection circuit 40 of FIG. 10, a pitch calculator holding circuit 60 including a pitch calculator 62 and a pitch information holding circuit 64 shown in FIG.
To provide. When the pitch is normally detected in the pitch calculator 62, the pitch information holding circuit 64 is switched to the contact a side, and the pitch width information is stored in the pitch calculation holding circuit 6.
Output from 0. If the pitch calculator 62 cannot detect the pitch normally, the pitch information holding circuit 6
4 is switched to the contact b side, and the pitch width information calculated in the previous pitch detection search section is output from the pitch calculation holding circuit 60 via the pitch information holding circuit 64.
That is, the pitch width information obtained in the previous pitch detection search section is output. As a result, the pitch correction process can be continued even if the pitch component does not exist instantaneously.

【００３９】図１６に示したピッチ演算保持回路６０
は、図７に示したピッチ検出回路２８に適用することも
できる。The pitch calculation holding circuit 60 shown in FIG.
Can also be applied to the pitch detection circuit 28 shown in FIG.

【００４０】本発明の実施に際しては、上述したものに
限定されず、その他種々の構成をとることができる。た
とえば、以上述べた実施例は独立に適用できるうえ、こ
れらを任意に組み合わせることができる。The embodiments of the present invention are not limited to those described above, and various other structures can be adopted. For example, the embodiments described above can be applied independently, and they can be arbitrarily combined.

【００４１】また、上述した実施例においては、原ディ
ジタル・オーディオ信号が２４ビットである場合に、８
ビットのサンプルデータにビットシフトする例を示した
が、このビットシフトの関係は任意にとることができ
る。たとえば、２４ビットの量子化オーディオ信号を、
符号＋９ビット、合計１０ビットのサンプルデータとし
て、ビットシフトすることもできる。この場合、たとえ
ば、第１のサンプルデータ＝〔符号＋２２〜１４ビット
のデータ〕、第２のサンプルデータ＝〔符号＋１２〜４
ビットのデータ〕、第３のサンプルデータ＝〔符号＋８
〜０ビットのデータ〕のように３分割する。あるいは、
さらにビット数を削減するため、もちろん、２４ビット
の量子化オーディオ信号を、６ビット（符号＋５ビッ
ト）のサンプルデータとしてビットシフトし、１オーデ
ィオ信号当たり、５サンプルデータに分割することもで
きる。勿論、１６ビットの量子化オーディオ信号を８ビ
ット、６ビットなどにビットシフトした結果を用いるこ
ともできる。In the above embodiment, if the original digital audio signal is 24 bits, 8
Although the example of performing the bit shift to the bit sample data is shown, the relationship of the bit shift can be arbitrarily set. For example, a 24-bit quantized audio signal
It is also possible to perform bit shift as sample data of 10 bits in total including sign + 9 bits. In this case, for example, the first sample data = [code + 22 to 14-bit data], the second sample data = [code + 12 to 4]
Bit data], third sample data = [sign + 8]
~ 0-bit data]. Alternatively,
In order to further reduce the number of bits, of course, a 24-bit quantized audio signal can be bit-shifted as 6-bit (sign + 5 bit) sample data and divided into 5 sample data per audio signal. Of course, the result of bit-shifting the 16-bit quantized audio signal to 8 bits, 6 bits, or the like can also be used.

【００４２】なお、本発明は図１に示したプログラムプ
レーシステムに関連づけて述べたが、本発明の実施に際
しては、プログラムプレーシステムに限定されるもので
はない。Although the present invention has been described in connection with the program play system shown in FIG. 1, the present invention is not limited to the program play system.

【００４３】[0043]

【発明の効果】本発明においては、ビットシフトを行う
ので、振幅が小さなディジタル・オーディオ信号につい
ても、少ないビット数でピッチ検出ができる。According to the present invention, since bit shifting is performed, pitch detection can be performed with a small number of bits even for a digital audio signal having a small amplitude.

【００４４】また本発明においては、ステレオ・ディジ
タル・オーディオ信号についてビットシフト前に２チャ
ネルのオーディオ信号の加算を行うので、ピッチ検出の
精度が高くなる。Further, in the present invention, since the two channels of audio signals are added to the stereo digital audio signal before the bit shift, the accuracy of pitch detection becomes high.

【００４５】さらに本発明においては、オーディオ信号
のピッチ成分が瞬時的に消失するような場合に、オーデ
ィオ信号の連続性を考慮して、前のピッチ検出検索区間
において求めたピッチ幅を用いることにより、そのよう
な不具合のオーディオ信号が存在しても、ピッチコレク
ション処理の救済を図ることができる。Further, in the present invention, when the pitch component of the audio signal disappears instantaneously, the pitch width obtained in the previous pitch detection search section is used in consideration of the continuity of the audio signal. Even in the presence of such a defective audio signal, the pitch correction process can be relieved.

[Brief description of drawings]

【図１】図１は本発明のオーディオ信号処理方法とその
装置が適用される１例としてのプログラムプレーシステ
ムの構成を示す図である。FIG. 1 is a diagram showing a configuration of a program play system as an example to which an audio signal processing method and apparatus of the present invention are applied.

【図２】図２はピッチコレクション処理を示すグラフで
あり、図２（Ａ）はピッチコレクション処理前の原オー
ディオ信号の波形図であり、図２（Ｂ）はピッチコレク
ション処理後のオーディオ信号の波形図である。2 is a graph showing pitch correction processing, FIG. 2 (A) is a waveform diagram of an original audio signal before pitch correction processing, and FIG. 2 (B) is an audio signal after pitch correction processing. It is a waveform diagram.

【図３】図３はピッチコレクション処理を行うためのピ
ッチ幅を検出するため、オーディオ信号の自己相関を計
算する様子を示すグラフである。FIG. 3 is a graph showing how an autocorrelation of an audio signal is calculated in order to detect a pitch width for performing pitch correction processing.

【図４】図４はピッチコレクション処理を行うためのピ
ッチ幅を検出することを示すグラフである。FIG. 4 is a graph showing detection of a pitch width for performing a pitch correction process.

【図５】図５はピッチコレクション処理の対象となるデ
ィジタル・オーディオ信号の波形図である。FIG. 5 is a waveform diagram of a digital audio signal which is a target of pitch correction processing.

【図６】図６はピッチコレクション処理の対象となるデ
ィジタル・オーディオ信号の波形図であり、ダイナミッ
クレンジに対して最大振幅が小さい場合の波形図であ
る。FIG. 6 is a waveform diagram of a digital audio signal that is a target of pitch correction processing, and is a waveform diagram when the maximum amplitude is small with respect to the dynamic range.

【図７】図７は本発明のオーディオ信号処理装置の第１
実施例の構成図である。FIG. 7 shows a first audio signal processing apparatus according to the present invention.
It is a block diagram of an Example.

【図８】図８はピッチコレクション処理におけるビット
シフト処理を図解する図であり、図８（Ａ）は振幅が小
さい場合のディジタル・オーディオ信号の波形図であ
り、図８（Ｂ）はビットシフトした結果を示す波形図で
ある。8 is a diagram illustrating a bit shift process in a pitch correction process, FIG. 8 (A) is a waveform diagram of a digital audio signal when the amplitude is small, and FIG. 8 (B) is a bit shift process. It is a wave form diagram which shows the result.

【図９】図９は本発明のオーディオ信号処理装置におい
て２４ビットのディジタル・オーディオ信号を８ビット
のディジタル・オーディオ信号（サンプルデータ）にビ
ットシフトした結果を示すグラフである。FIG. 9 is a graph showing a result of bit-shifting a 24-bit digital audio signal into an 8-bit digital audio signal (sample data) in the audio signal processing device of the present invention.

【図１０】図１０は本発明のオーディオ信号処理装置の
第２実施例の構成図である。FIG. 10 is a configuration diagram of a second embodiment of an audio signal processing device of the present invention.

【図１１】図１１はピッチコレクション処理の対象とな
るディジタル・オーディオ信号の波形図であり、ビット
シフトしたデータに頻繁にリミッタがかかる場合の波形
図である。FIG. 11 is a waveform diagram of a digital audio signal that is a target of pitch correction processing, and is a waveform diagram when a limiter is frequently applied to bit-shifted data.

【図１２】図１２は適応的にピッチを検出するためのビ
ットシフトを図解するグラフである。FIG. 12 is a graph illustrating a bit shift for adaptively detecting pitch.

【図１３】図１３は図１２に図解したビットシフトの処
理を示すグラフである。FIG. 13 is a graph showing the bit shift processing illustrated in FIG.

【図１４】図１４はピッチ検出ができない場合の処理を
示すグラフである。FIG. 14 is a graph showing processing when pitch detection is not possible.

【図１５】図１５は図１４に示したディジタル・オーデ
ィオ信号のピッチ検出ができない場合の救済処理結果を
示すグラフである。FIG. 15 is a graph showing a rescue processing result when the pitch of the digital audio signal shown in FIG. 14 cannot be detected.

【図１６】図１６は本発明のオーディオ信号処理装置に
おける第３実施例としてのピッチ幅処理回路の構成図で
ある。FIG. 16 is a configuration diagram of a pitch width processing circuit as a third embodiment in the audio signal processing device of the invention.

[Explanation of symbols]

２・・ビデオテープ４・・エラー訂正回路６・・フレームシンクロナイザー８・・ピッチコレクション処理回路１０・・メモリ２０・・オーディオ信号処理装置２２・・データ制御回路２４・・オーディオデータ・メモリ２６・・ピッチ演算データ・メモリ２８・・ピッチ検出回路３０・・メモリアドレスカウンター３２・・最大ピーク検出回路４０・・ステレオ信号ピッチ検出回路４２・・データ制御回路４４・・Ｓ／Ｐ変換回路４６・・加算器４８・・ビットシフター５０・・ピッチ演算メモリ５２・・ピッチ演算器６０・・ピッチ演算保持回路６２・・ピッチ演算器６４・・ピッチ情報保持回路 2 ... Video tape 4 ... Error correction circuit 6 ... Frame synchronizer 8 ... Pitch correction processing circuit 10 ... Memory 20 ... Audio signal processing device 22 ... Data control circuit 24 ... Audio data memory 26 ... -Pitch calculation data memory 28-Pitch detection circuit 30-Memory address counter 32-Maximum peak detection circuit 40-Stereo signal pitch detection circuit 42-Data control circuit 44-S / P conversion circuit 46- Adder 48-bit shifter 50-pitch calculation memory 52-pitch calculator 60-pitch calculation holding circuit 62-pitch calculator 64-pitch information holding circuit

Claims

[Claims]

1. An audio signal processing method for detecting a pitch of a digital audio signal and performing a pitch correction process using the pitch, wherein each of the digital audio signals has a plurality of bits while preserving a code. The sample data is divided and held, the maximum value is detected for the plurality of digital audio signals in a certain pitch detection search section, and one of the held sample data is detected based on the detected maximum value. An audio signal processing method, comprising: selecting, and using the selected sample data, performing pitch detection in the pitch detection search section.

2. The pitch detection calculates an autocorrelation of the selected sample data, and detects an interval showing the maximum intensity as a pitch width.
The described audio signal processing method.

3. The audio signal processing method according to claim 1, wherein, when the lower data exceeds a predetermined value, the lower data is limited to the predetermined value.

4. The audio signal according to claim 1, wherein when the digital audio signal is a stereo audio signal, the bit shift processing is performed after adding the digital audio signals of related channels as a stereo signal. Processing method.

5. When the data indicating the pitch component of the audio signal used for pitch detection is lost, pitch correction processing is performed using the pitch width obtained in the previous pitch detection search section. Any one of the audio signal processing methods.

6. An audio signal processing device for performing pitch correction processing of a digital audio signal, wherein a first memory means for storing an input audio signal as it is and a predetermined code while holding the input audio signal signal respectively. Second memory means for holding as a plurality of sample data divided into a plurality of bits, a maximum value detection stage for detecting the maximum value of the audio signal within a certain pitch detection search section, and a maximum value detection means for detecting the maximum value. Based on the maximum value, using appropriate sample data of the plurality of sample data for the one audio signal from the second memory means, using the sample data reading means, and the read sample data, Pitch calculation means for detecting pitch width, first memory means, second memory means The audio signal processing apparatus having the maximum value detecting means, and control means for controlling the means and pitch computing means read sample data.

7. A pitch correction processing means for reading an audio signal stored in said first memory means based on said detected pitch width, further comprising:
The audio signal processing device according to claim 6.

8. The audio signal processing according to claim 6, wherein the pitch calculation means calculates an autocorrelation of the read sample data, and detects an interval showing the maximum intensity of the autocorrelation as a pitch width. apparatus.

9. The audio according to claim 6, wherein the control means and the second memory means limit the read sample data to a predetermined value or less when the read sample data exceeds the predetermined value. Signal processing device.

10. The control means, when the digital audio signal is a stereo audio signal, adds means for adding digital audio signals of channels related as stereo signals, and holds a sign of the addition result. 10. The audio signal processing device according to claim 6, further comprising a bit shift unit that performs a bit shift process as a plurality of sample data divided into predetermined bits and applies the bit shift process to the second memory unit.

11. The pitch calculating means outputs the pitch width obtained in the preceding pitch detection search section when the data indicating the pitch component of the audio signal used for pitch detection is lost. 10. The audio signal processing device according to any one of 10.