JP2001148159A

JP2001148159A - Audio processing device and method, and computer-readable recording medium recorded with program to make computer execute audio processing method

Info

Publication number: JP2001148159A
Application number: JP32994999A
Authority: JP
Inventors: Hiroshi Segawa; 浩瀬川
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 1999-11-19
Filing date: 1999-11-19
Publication date: 2001-05-29

Abstract

PROBLEM TO BE SOLVED: To provide an audio encoding processor which can reproduce sound without causing sudden changes in sound level or generating noise due to such changes. SOLUTION: This audio processor 30 includes a dual port memory 31, which stores the audio data supplied from an audio input pin; a register 32, which holds a pause indication signal; a register 33, which holds an audio-encoding indication signal; a processor 35, which applies the audio encoding processing to the audio data stored in the memory 31; an audio frame pulse generator 36, which generates an audio frame pulse and generates an interrupt to the processor 35 to start the audio encoding processing; a FIFO (first in, first out) memory 34, which stores the audio encoding data processed by the processor 35 and outputs these data in FIFO order via an audio-processing data output pin; and a memory 37, which holds various parameters.

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

[Technical Field of the Invention] The present invention relates to an audio processing apparatus and method, and to a computer-readable recording medium on which a program for causing a computer to execute the audio processing method is recorded. In particular, it relates to an audio processing apparatus and method that do not generate noise during audio reproduction, and to a computer-readable recording medium on which a program for causing a computer to execute the audio processing method is recorded.

[0002]

[Prior Art] Conventionally, there are recording devices that use an encoding device for digital audio PCM (Pulse Code Modulation) data. The audio encoding processes used include MPEG-1 audio compression, which is internationally standardized by MPEG (Moving Picture Experts Group), and AC-3 compression, developed by Dolby.

[0003] A recording device naturally supports a pause function that temporarily stops the recording operation. For example, to play back music recorded on two different audio sources and record it continuously, the user pauses the recording operation when recording from the first audio source ends, switches to the next audio source, and then resumes recording.

[0004] An example of a recording operation using the pause function will be described with reference to FIG. 11. In the figure, the audio frames to be processed (hereinafter "frames") are numbered A0 to A8. When the audio encoding instruction signal is L (Low), the recording device performs audio encoding frame by frame; when the pause instruction signal is L, audio encoding is stopped regardless of the value of the audio encoding instruction signal. In the example of FIG. 11, the pause instruction signal becomes L once frame A2 has been processed, and the pause takes effect. Frames A3 and A4, which are input while the pause instruction signal is L, are not encoded. Assuming the pause instruction signal becomes H (High) at the time frame A5 is processed, audio encoding resumes for frame A5 and the following frames.

[0005] With this pause function, the frames are reproduced in the order A0, A1, A2, A5, A6, and so on during audio playback.
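The conventional pause behaviour described for FIG. 11 can be illustrated with a minimal sketch. The function name `encode_with_pause` and the per-frame list of pause levels are assumptions made for illustration; they do not appear in the patent, and the actual encoding step is omitted.

```python
def encode_with_pause(frames, pause_levels):
    """Return the frames that are encoded.

    A frame is skipped while the pause instruction signal is 'L';
    the audio encoding instruction is assumed to be asserted throughout.
    """
    return [frame for frame, level in zip(frames, pause_levels) if level == "H"]


frames = [f"A{i}" for i in range(9)]                     # A0 .. A8
pause = ["H", "H", "H", "L", "L", "H", "H", "H", "H"]    # L while A3 and A4 are input
recorded = encode_with_pause(frames, pause)
print(recorded)
```

Frames A2 and A5 end up adjacent in the recording, which is where a level difference between them becomes audible.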

[0006]

[Problems to Be Solved by the Invention] However, there may be a difference in reproduced sound level before and after the recording operation is paused. When this difference is large, that is, when the difference in reproduced sound level between frames A2 and A5 in the example of FIG. 11 is large, noise occurs during audio playback, which has been a problem.

[0007] As another example, in television programs the audio is switched from stereo to monaural at the timing of a switch to a CM (Commercial Message) or the like. Some devices use this audio switch as a trigger to pause and resume video recording. However, the video signal and the audio signal do not always correspond exactly. Consequently, if the pause operation is triggered by the audio switch, ordinary audio encoding can, because of the offset between the video and audio signals, cause the audio corresponding to the video after the pause to be reproduced while the video before the pause is playing, or the reverse.

[0008] The present invention has been made to solve the above problems. Its object is to provide an audio encoding processing apparatus and method that can reproduce audio without an abrupt change in sound level during playback and without the noise that accompanies such a change, and a computer-readable recording medium on which a program for causing a computer to execute the audio processing method is recorded.

[0009] Another object of the present invention is to provide an audio encoding processing apparatus and method that can avoid a mismatch in reproduction timing when a plurality of devices are used, and a computer-readable recording medium on which a program for causing a computer to execute the audio processing method is recorded.

[0010]

[Means for Solving the Problems] The audio processing apparatus according to claim 1 encodes audio data frame by frame on the basis of a first signal, which selectively takes either a first or a second value and instructs encoding of the audio data, and a second signal, which selectively takes either a third or a fourth value and instructs a pause of that encoding. The audio processing apparatus includes: a memory having a first port through which audio data is input frame by frame and a second port through which audio data is output; audio data encoding means, connected to the memory, which, when audio data is input to the memory, reads the audio data from the memory with a delay and, based on the history of the first and second signals, selectively executes either a process that encodes the audio data or a process that does not; and sound level control means, coupled to the audio data encoding means, which, based on the history of the second signal, controls the gain of the audio data encoded by the audio data encoding means so as to eliminate the sound level difference between the frames before and after a period in which the audio data encoding means does not encode the audio data.

[0011] The sound level difference between the audio frame data before and after the pause of audio data encoding is eliminated. Audio can therefore be reproduced without an abrupt change in sound level during playback and without the noise that accompanies such a change.

[0012] In the invention according to claim 2, in addition to the configuration of claim 1, the sound level control means includes: first fade-out means, coupled to the audio data encoding means, which, based on the history of the second signal, reduces the gain of the sound level of the audio data to a reference value during a first predetermined frame period preceding the frame at which the audio data encoding means stops encoding; and fade-in means, coupled to the audio data encoding means, which, based on the history of the second signal, increases the gain of the sound level of the audio data from the reference value during a second predetermined frame period starting at the frame at which the audio data encoding means resumes encoding.

[0013] The gain of the sound level of the audio data falls to the reference value by the time audio data encoding is paused, and the gain of the sound level of the audio data rises from the reference value starting with the frame at which the pause is released. Audio can therefore be reproduced without an abrupt change in sound level during playback and without the noise that accompanies such a change.
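As a rough illustration of the fade-out and fade-in described here, the sketch below scales the samples of one frame by a gain ramp. The linear ramp shape, the function names `fade_out` and `fade_in`, and the representation of a frame as a list of at least two float samples are all assumptions; the patent only states that the gain is reduced to, or increased from, a reference value.

```python
def fade_out(samples, reference=0.0):
    """Scale one frame so the gain falls linearly from 1.0 to `reference`."""
    n = len(samples)  # assumed to be at least 2
    return [s * (1.0 + (reference - 1.0) * i / (n - 1)) for i, s in enumerate(samples)]


def fade_in(samples, reference=0.0):
    """Scale one frame so the gain rises linearly from `reference` to 1.0."""
    n = len(samples)  # assumed to be at least 2
    return [s * (reference + (1.0 - reference) * i / (n - 1)) for i, s in enumerate(samples)]


last_before_pause = fade_out([1.0] * 5)   # e.g. the frame just before the pause
first_after_pause = fade_in([1.0] * 5)    # e.g. the frame where encoding resumes
print(last_before_pause, first_after_pause)
```

Because the faded-out frame ends at the reference gain and the faded-in frame starts at the same gain, the two frames join without a level jump.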

[0014] In the invention according to claim 3, in addition to the configuration of claim 2, the first fade-out means includes second fade-out means, coupled to the audio data encoding means, which, based on the history of the second signal, reduces the gain of the sound level of the audio data to the reference value in the one frame immediately before the audio data encoding means stops encoding.

[0015] In the invention according to claim 4, in addition to the configuration of claim 2, the first fade-out means includes: second fade-out means, coupled to the audio data encoding means, which, based on the history of the second signal, reduces the gain of the sound level of the audio data to the reference value according to a predetermined method for a predetermined frame that is two or more frames before the frame at which the audio data encoding means stops encoding; and mute means, coupled to the audio data encoding means, which, based on the history of the second signal, holds at the reference value the gain of the sound level of the audio data in the frames that come after the predetermined frame and before the audio data encoding means stops encoding.

[0016] For example, in television programs the audio is switched from stereo to monaural at the timing of a switch to a CM or the like. Some devices use this audio switch as a trigger to pause and resume video recording. However, the video signal and the audio signal do not always correspond exactly. Consequently, if the pause operation is triggered by the audio switch, ordinary audio encoding can, because of the offset between the video and audio signals, cause the audio corresponding to the video after the pause to be reproduced while the video before the pause is playing, or the reverse. With this invention, however, the sound level is held at the reference value for several frames before the pause takes place. Therefore, if the reference value is set low, the audio signal corresponding to the video after the pause is no longer produced while the video before the pause is playing, and a mismatch in reproduction timing when a plurality of devices are used can be avoided.
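The fade-out-then-mute arrangement of claim 4 can be pictured as a per-frame plan. This is a hedged sketch only: the function `gain_schedule`, its parameters, and the string labels are hypothetical, and the resume side (fade-in) is left out.

```python
def gain_schedule(num_frames, stop_index, fade_frame):
    """Label each frame with the treatment it receives before a pause.

    stop_index: index of the first frame that is not encoded.
    fade_frame: index of the predetermined frame that is faded out;
                it must lie two or more frames before stop_index.
    """
    if stop_index - fade_frame < 2:
        raise ValueError("fade_frame must be at least two frames before stop_index")
    plan = []
    for i in range(num_frames):
        if i >= stop_index:
            plan.append("not encoded")
        elif i == fade_frame:
            plan.append("fade-out")   # gain reduced to the reference value
        elif i > fade_frame:
            plan.append("mute")       # gain held at the reference value
        else:
            plan.append("normal")
    return plan


plan = gain_schedule(num_frames=7, stop_index=5, fade_frame=3)
print(plan)
```

With a low reference value, the muted frames act as a guard interval that absorbs a small offset between the video and audio signals.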

[0017] The audio processing method according to claim 5 encodes audio data frame by frame on the basis of a first signal, which selectively takes either a first or a second value and instructs encoding of the audio data, and a second signal, which selectively takes either a third or a fourth value and instructs a pause of that encoding. The audio processing method includes the steps of: delaying the input audio data; controlling, based on the delayed audio data and the history of the second signal, the gain of the audio data to be encoded so as to eliminate the sound level difference between the frames before and after a period in which the audio data is not encoded; and selectively executing, based on the delayed audio data and the history of the first and second signals, either a process that encodes the audio data or a process that does not.

[0018] The sound level difference between the audio frame data before and after the pause of audio data encoding is eliminated. Audio can therefore be reproduced without an abrupt change in sound level during playback and without the noise that accompanies such a change.

[0019] In the invention according to claim 6, in addition to the configuration of claim 5, the step of controlling the gain includes the steps of: reducing, based on the delayed audio data and the history of the second signal, the gain of the sound level of the audio data to a reference value during a first predetermined frame period preceding the frame at which encoding of the audio data stops; and increasing, based on the delayed audio data and the history of the second signal, the gain of the sound level of the audio data from the reference value during a second predetermined frame period starting at the frame at which encoding of the audio data resumes.

[0020] The gain of the sound level of the audio data falls to the reference value by the time audio data encoding is paused, and the gain of the sound level of the audio data rises from the reference value starting with the frame at which the pause is released. Audio can therefore be reproduced without an abrupt change in sound level during playback and without the noise that accompanies such a change.

[0021] In the invention according to claim 7, in addition to the configuration of claim 6, the step of reducing the gain to the reference value includes the step of reducing, based on the delayed audio data and the history of the second signal, the gain of the sound level of the audio data to the reference value in the one frame immediately before encoding of the audio data stops.

[0022] In the invention according to claim 8, in addition to the configuration of claim 6, the step of reducing the gain to the reference value includes the steps of: reducing, based on the delayed audio data and the history of the second signal, the gain of the sound level of the audio data to the reference value according to a predetermined method for a predetermined frame that is two or more frames before the frame at which encoding of the audio data stops; and holding at the reference value, based on the delayed audio data and the history of the second signal, the gain of the sound level of the audio data in the frames that come after the predetermined frame and before encoding of the audio data stops.

[0023] For example, in television programs the audio is switched from stereo to monaural at the timing of a switch to a CM or the like. Some devices use this audio switch as a trigger to pause and resume video recording. However, the video signal and the audio signal do not always correspond exactly. Consequently, if the pause operation is triggered by the audio switch, ordinary audio encoding can, because of the offset between the video and audio signals, cause the audio corresponding to the video after the pause to be reproduced while the video before the pause is playing, or the reverse. With this invention, however, the sound level is held at the reference value for several frames before the pause takes place. Therefore, if the reference value is set low, the audio signal corresponding to the video after the pause is no longer produced while the video before the pause is playing, and a mismatch in reproduction timing when a plurality of devices are used can be avoided.

[0024] The computer-readable recording medium according to claim 9 records a program for causing a computer to execute an audio processing method that encodes audio data frame by frame on the basis of a first signal, which selectively takes either a first or a second value and instructs encoding of the audio data, and a second signal, which selectively takes either a third or a fourth value and instructs a pause of that encoding. The audio processing method includes the steps of: delaying the input audio data; controlling, based on the delayed audio data and the history of the second signal, the gain of the audio data to be encoded so as to eliminate the sound level difference between the frames before and after a period in which the audio data is not encoded; and selectively executing, based on the delayed audio data and the history of the first and second signals, either a process that encodes the audio data or a process that does not.

[0025] The sound level difference between the audio frame data before and after the pause of audio data encoding is eliminated. Audio can therefore be reproduced without an abrupt change in sound level during playback and without the noise that accompanies such a change.

[0026] In the invention according to claim 10, in addition to the configuration of claim 9, the step of controlling the gain includes the steps of: reducing, based on the delayed audio data and the history of the second signal, the gain of the sound level of the audio data to a reference value during a first predetermined frame period preceding the frame at which encoding of the audio data stops; and increasing, based on the delayed audio data and the history of the second signal, the gain of the sound level of the audio data from the reference value during a second predetermined frame period starting at the frame at which encoding of the audio data resumes.

[0027] The gain of the sound level of the audio data falls to the reference value by the time audio data encoding is paused, and the gain of the sound level of the audio data rises from the reference value starting with the frame at which the pause is released. Audio can therefore be reproduced without an abrupt change in sound level during playback and without the noise that accompanies such a change.

[0028] In the invention according to claim 11, in addition to the configuration of claim 10, the step of reducing the gain to the reference value includes the step of reducing, based on the delayed audio data and the history of the second signal, the gain of the sound level of the audio data to the reference value in the one frame immediately before encoding of the audio data stops.

[0029] In the invention according to claim 12, in addition to the configuration of claim 10, the step of reducing the gain to the reference value includes the steps of: reducing, based on the delayed audio data and the history of the second signal, the gain of the sound level of the audio data to the reference value according to a predetermined method for a predetermined frame that is two or more frames before the frame at which encoding of the audio data stops; and holding at the reference value, based on the delayed audio data and the history of the second signal, the gain of the sound level of the audio data in the frames that come after the predetermined frame and before encoding of the audio data stops.

[0030] For example, in television programs the audio is switched from stereo to monaural at the timing of a switch to a CM or the like. Some devices use this audio switch as a trigger to pause and resume video recording. However, the video signal and the audio signal do not always correspond exactly. Consequently, if the pause operation is triggered by the audio switch, ordinary audio encoding can, because of the offset between the video and audio signals, cause the audio corresponding to the video after the pause to be reproduced while the video before the pause is playing, or the reverse. With this invention, however, the sound level is held at the reference value for several frames before the pause takes place. Therefore, if the reference value is set low, the audio signal corresponding to the video after the pause is no longer produced while the video before the pause is playing, and a mismatch in reproduction timing when a plurality of devices are used can be avoided.

[0031]

[Embodiments of the Invention] [Embodiment 1] Referring to FIG. 1, an audio processing apparatus 30 according to an embodiment of the present invention includes: a two-port memory 31, one port of which is connected to an audio input pin and the other to a bus described below, and which stores the audio data supplied from the audio input pin; a register 32 that holds a pause instruction signal for instructing a pause of audio data encoding; a register 33 that holds an audio encoding instruction signal for instructing encoding of the audio data; a processor 35 that applies audio encoding to the audio data stored in the two-port memory 31; an audio frame pulse generator 36, connected to the processor 35, which generates an audio frame pulse and, in response to the audio frame pulse, interrupts the processor 35 to start audio encoding; a FIFO (First In First Out) memory 34 that stores the encoded audio data processed by the processor 35 and outputs it in first-in, first-out order from an audio processing data output pin; a memory 37 that holds various parameters; and a bus that interconnects the two-port memory 31, register 32, register 33, FIFO memory 34, processor 35, and memory 37.

[0032] Register 32 holds 1 when the pause instruction signal is H and 0 when it is L. Register 33 holds 1 when the audio encoding instruction signal is H and 0 when it is L.

[0033] Referring to FIGS. 2 and 3, each part of the audio processing apparatus 30 operates as follows. When the audio frame pulse generated by the audio frame pulse generator 36 interrupts the processor 35, the processor 35 determines whether the audio encoding instruction signal stored in register 33 is L, that is, whether audio encoding has been instructed (S2). If audio encoding has not been instructed (NO in S2), the process ends. If it has been instructed (YES in S2), the processor 35 increments the audio frame counter stored in memory 37 by one (S4). The audio frame counter is assumed to be set to -1 in the initial state.

[0034] The processor 35 determines whether the audio frame counter is 2 or more (S6). If the audio frame counter is 0 or 1 (NO in S6), audio encoding is not yet performed, so the process ends.

[0035] Once the audio frame counter reaches 2 or more (YES in S6), the processor 35 reads one frame of audio data from the two-port memory 31 (S8). While the audio frame counter is 0 or 1, no audio data is read, and the audio data input during that time is held in the two-port memory 31. The process of S8 therefore always reads the audio data from two frames earlier, and the subsequent audio encoding is applied to audio data that is two frames behind the audio data currently being input.
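The two-frame delay produced by steps S4 to S8 can be sketched as follows. The class `FrameDelay`, its method name, and the use of a `deque` to stand in for the two-port memory 31 are illustrative assumptions; the S2 check is omitted by assuming the audio encoding instruction is always asserted.

```python
from collections import deque


class FrameDelay:
    """Models the frame counter and the delayed read from the two-port memory."""

    def __init__(self):
        self.counter = -1        # initial value stated in the text
        self.memory = deque()    # stand-in for two-port memory 31

    def on_frame_pulse(self, incoming_frame):
        """Handle one audio frame pulse; return the frame to encode, or None."""
        self.memory.append(incoming_frame)   # input side keeps storing frames
        self.counter += 1                    # S4
        if self.counter < 2:                 # S6: counter is 0 or 1
            return None
        return self.memory.popleft()         # S8: frame from two pulses earlier


delay = FrameDelay()
outputs = [delay.on_frame_pulse(f"A{i}") for i in range(5)]
print(outputs)
```

The first two pulses return nothing, and from the third pulse onward each call yields the frame that arrived two pulses before it.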

[0036] Referring to FIG. 4, the processor 35 assigns the value of pause instruction flag 1, which represents the state of the pause instruction signal one frame earlier, to pause instruction flag 2, which represents the state of the pause instruction signal two frames earlier. It then assigns the value of pause instruction flag 0, which represents the current state of the pause instruction signal, to pause instruction flag 1. Finally, it assigns the value of the pause instruction signal stored in the register 32 to pause instruction flag 0 (S10). Pause instruction flags 0 to 2 are stored in the memory 37.
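The flag update of S10 behaves like a three-stage shift register. A minimal Python sketch follows; the class name `PauseFlags`, the method name `shift_in`, and the initial value 1 (H, not paused) for all three flags are assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class PauseFlags:
    """Hypothetical holder for pause instruction flags 0 to 2 (memory 37)."""
    flag0: int = 1   # current pause instruction signal (1 = H, 0 = L)
    flag1: int = 1   # signal one frame earlier
    flag2: int = 1   # signal two frames earlier

    def shift_in(self, pause_signal):
        """S10: age the flags by one frame and latch the new signal.

        Returns the previous (flag0, flag1, flag2) so callers can detect
        transitions such as flag 1 going from 1 to 0.
        """
        previous = (self.flag0, self.flag1, self.flag2)
        self.flag2 = self.flag1
        self.flag1 = self.flag0
        self.flag0 = pause_signal
        return previous


flags = PauseFlags()
for level in (1, 0, 0, 1):
    flags.shift_in(level)
print(flags)
```

The order of the three assignments matters: flag 2 must be overwritten before flag 1, and flag 1 before flag 0, or a value would be lost.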

[0037] The processor 35 determines whether pause instruction flag 1 changed from 1 to 0 during the process of S10 (S12). The frame about to be encoded is two frames behind the frame currently being input. A change of pause instruction flag 1 from 1 to 0 therefore indicates that the pause instruction signal goes to L, and the audio encoding process is paused, right after the frame about to be encoded. For example, referring to FIG. 5, when the frame about to be encoded is frame A2, the pause instruction signal is L at frame A3, which immediately follows frame A2. Pause instruction flag 1 therefore changes from 1 to 0.

[0038] When pause instruction flag 1 has changed from 1 to 0 (YES in S12), encoding is paused immediately after the frame about to be encoded, as described above. The processor 35 therefore performs a process that gradually reduces the gain of the audio level of that frame until it reaches a reference value (for example, 0) (hereinafter referred to as the "fade-out process") (S14).
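The specification does not say how the gain is reduced within the frame. As one possible illustration, the sketch below applies a linear ramp from unity gain down to the reference value across the samples of a single frame; the function name `fade_out` and the linear shape are assumptions.

```python
def fade_out(samples, reference_gain=0.0):
    """Scale one frame so its gain falls linearly from 1.0 to reference_gain.

    Hypothetical helper: the linear ramp is an assumption, since the text
    only requires a gradual decrease to a reference value.
    """
    n = len(samples)
    if n < 2:
        return [s * reference_gain for s in samples]
    faded = []
    for i, sample in enumerate(samples):
        gain = 1.0 + (reference_gain - 1.0) * i / (n - 1)
        faded.append(sample * gain)
    return faded


print(fade_out([1.0, 1.0, 1.0, 1.0, 1.0]))
```

With the default reference value of 0, the last sample of the frame is fully silenced, so the frame that follows the pause can start from silence as well.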

[0039] After the fade-out process, or when pause instruction flag 1 has not changed from 1 to 0 (NO in S12), the processor 35 determines whether pause instruction flag 2 has changed from 0 to 1 (S16). A change of pause instruction flag 2 from 0 to 1 indicates that the pause instruction signal for the frame immediately before the frame about to be processed was L, so the audio encoding process was paused, but the pause instruction signal for the frame about to be processed is H, so the audio encoding process resumes from the current frame. For example, referring to FIG. 5, when the frame about to be encoded is A5, the pause instruction signal goes to H from frame A5 and the pause is released. Pause instruction flag 2 therefore changes from 0 to 1.

[0040] When pause instruction flag 2 has changed from 0 to 1 (YES in S16), the processor 35 performs a process that gradually increases the gain of the audio level of the frame about to be encoded from the reference value (for example, 0) (hereinafter referred to as the "fade-in process") (S18).
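A matching illustration for the fade-in process is a ramp that starts at the reference value and rises to unity gain over one frame. As before, the function name `fade_in` and the linear ramp are assumptions; the specification only calls for a gradual increase from the reference value.

```python
def fade_in(samples, reference_gain=0.0):
    """Scale one frame so its gain rises linearly from reference_gain to 1.0.

    Hypothetical helper; the ramp shape is not specified in the text.
    """
    n = len(samples)
    if n < 2:
        return list(samples)
    step = (1.0 - reference_gain) / (n - 1)
    return [sample * (reference_gain + step * i)
            for i, sample in enumerate(samples)]


print(fade_in([2.0, 2.0, 2.0, 2.0, 2.0]))
```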

[0041] After the fade-in process, or when pause instruction flag 2 has not changed from 0 to 1 (NO in S16), the processor 35 determines whether pause instruction flag 2 is 1 (S20).

[0042] When pause instruction flag 2 is 1 (YES in S20), the pause instruction signal for the frame about to be processed is H and the audio encoding process is not paused, so the audio encoding process is performed (S22). The processor 35 writes the encoded data to the FIFO memory 34 (S24). This completes the processing of the frame. As in the related art, the audio encoding process is, for example, MPEG1 audio compression.
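The branch structure of S12 to S24 can be summarised as a small decision function. The sketch below is an assumption-laden illustration: the function name `decide_actions_embodiment1` and the string labels are hypothetical, and the flag values before and after the S10 update are passed in explicitly.

```python
def decide_actions_embodiment1(prev_flag1, prev_flag2, flag1, flag2):
    """Return the ordered list of steps taken for the frame being processed.

    prev_* are the flag values before the S10 update, flag1 and flag2 the
    values after it (1 = H, 0 = L).
    """
    actions = []
    if prev_flag1 == 1 and flag1 == 0:   # S12: pause starts after this frame
        actions.append("fade_out")       # S14
    if prev_flag2 == 0 and flag2 == 1:   # S16: pause released at this frame
        actions.append("fade_in")        # S18
    if flag2 == 1:                       # S20: this frame is not paused
        actions.append("encode")         # S22
        actions.append("write_fifo")     # S24
    return actions


# Frame followed by a pause, first frame after a pause, and a paused frame.
print(decide_actions_embodiment1(1, 1, 0, 1))
print(decide_actions_embodiment1(1, 0, 1, 1))
print(decide_actions_embodiment1(0, 1, 0, 0))
```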

[0043] Thereafter, the processes shown in FIGS. 2 and 3 are performed each time the audio frame pulse generator 36 generates an audio frame pulse.

[0044] The audio level of the encoded data obtained by the audio processing device 30 will now be described with reference to the specific example shown in FIG. 5.

[0045] In the example of FIG. 5, the pause instruction signal is L at the timing when frames A3 and A4 are input to the two-port memory 31, and recording is paused. Following the process described above, when frames A0, A1, A6 and A7 are processed, neither pause instruction flag 1 nor pause instruction flag 2 changes in value and pause instruction flag 2 is 1, so the audio encoding process is executed in the normal manner (S22).

[0046] When frame A2 is processed, as described above, pause instruction flag 1 has changed from 1 to 0 (YES in S12 of FIG. 3) and pause instruction flag 2 is 1 (YES in S20), so the fade-out process is performed and then the audio encoding process is performed (S14, S22).

[0047] When frame A5 is processed, likewise, pause instruction flag 2 has changed from 0 to 1 as described above (YES in S16 of FIG. 3) and pause instruction flag 2 is 1 (YES in S20), so the fade-in process is performed and then the audio encoding process is performed (S18, S22).

[0048] Consequently, the gain of the audio level in each frame is, for example, as shown in FIG. 5: the fade-out process is applied to the frame immediately before the pause is instructed, the fade-in process is applied to the frame at which the pause is released, and the audio encoding process is then performed. The resulting encoded audio data is stored in the order of frames A0, A1, A2, A5, A6 and A7, and is reproduced in this order during audio reproduction. During reproduction, the audio fades out in frame A2 and fades in in frame A5, so no abrupt change in audio level occurs across the point where recording was paused, and the audio can be reproduced without noise.
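The FIG. 5 example can be replayed with a short simulation of the per-pulse flow. The sketch below is illustrative only: the function name `simulate_embodiment1`, the initial flag value 1 (H), and the two trailing H pulses used to flush the last two delayed frames are assumptions, not statements from the specification.

```python
def simulate_embodiment1(pause_levels):
    """Return (frame label, treatment) for every frame that gets encoded.

    pause_levels[k] is the pause instruction signal while frame k is being
    input: 1 for H (recording), 0 for L (paused).
    """
    flag0 = flag1 = flag2 = 1            # assumed initial flag values
    counter = -1                         # initial counter value from the text
    stored = []
    # Two extra pulses (assumed H) let the last two delayed frames drain.
    for pulse, level in enumerate(list(pause_levels) + [1, 1]):
        counter += 1                     # S4
        if counter < 2:                  # S6: delay not yet filled
            continue
        frame = pulse - 2                # S8: frame input two pulses earlier
        prev1, prev2 = flag1, flag2
        flag2, flag1, flag0 = flag1, flag0, level   # S10
        treatments = []
        if prev1 == 1 and flag1 == 0:    # S12 -> S14
            treatments.append("fade_out")
        if prev2 == 0 and flag2 == 1:    # S16 -> S18
            treatments.append("fade_in")
        if flag2 == 1:                   # S20 -> S22, S24
            stored.append((f"A{frame}", "+".join(treatments) or "normal"))
    return stored


# Pause instruction signal is L while frames A3 and A4 are input.
for label, treatment in simulate_embodiment1([1, 1, 1, 0, 0, 1, 1, 1]):
    print(label, treatment)
```

Under these assumptions the stored sequence skips A3 and A4, with the fade-out landing on A2 and the fade-in on A5, which matches the order described for FIG. 5.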

[0049] The audio processing device 30 described above can be implemented by a computer. Referring to FIG. 6, the audio processing device 30 includes a computer 41; a keyboard 45 and a mouse 46 for giving instructions to the computer 41; a display 42 for displaying results computed by the computer 41 and the like; and a magnetic tape device 43, a CD-ROM (Compact Disc-Read Only Memory) device 47, and a communication modem 49, each for reading programs to be executed by the computer 41.

[0050] The program for the audio encoding process described with reference to FIGS. 2 and 3 is recorded on a magnetic tape 44 or a CD-ROM 48, which are recording media readable by the computer 41, and is read by the magnetic tape device 43 or the CD-ROM device 47, respectively. Alternatively, it is read by the communication modem 49 via a communication line.

[0051] Referring to FIG. 7, the computer 41 includes a CPU (Central Processing Unit) 50 for executing the program read via the magnetic tape device 43, the CD-ROM device 47 or the communication modem 49; a ROM (Read Only Memory) 51 for storing other programs and data required for the operation of the computer 41; a RAM (Random Access Memory) 52 for storing the program, parameters used during program execution, computation results and the like; and a magnetic disk 53 for storing programs, data and the like.

[0052] The RAM 52 serves as the two-port memory 31, the register 32, the register 33, the FIFO memory 34 and the memory 37, and the CPU 50 serves as the processor 35 and the audio frame pulse generator 36.

[0053] The program read by the magnetic tape device 43, the CD-ROM device 47 or the communication modem 49 is executed by the CPU 50, and the audio encoding process is performed.

[0054] As described above, according to the audio processing device 30 of the present embodiment, even when recording is paused, the fade-out process and the fade-in process are performed before and after the pause and the audio encoding process is then performed. The audio can therefore be reproduced without the noise that accompanies an abrupt change in audio level.

[0055] [Second Embodiment] The audio encoding apparatus according to the present embodiment has the same hardware configuration as the audio processing device 30 according to the first embodiment described with reference to FIG. 1. A detailed description is therefore not repeated here. The processor 35 of the present embodiment performs the fade-out process at an earlier timing than the processor 35 of the first embodiment.

[0056] Referring to FIGS. 8 and 9, each unit of the audio processing device 30 according to the present embodiment operates as follows. The processes of S32 to S40 shown in FIG. 8 are the same as the processes of S2 to S10 shown in FIG. 2. A detailed description is therefore not repeated here.

[0057] Referring to FIG. 9, the processor 35 determines whether pause instruction flag 0 changed from 1 to 0 during the process of S40 (S42). The frame about to be processed is two frames behind the frame currently being input. A change of pause instruction flag 0 from 1 to 0 therefore indicates that the pause instruction signal goes to L, and the audio encoding process is paused, two frames after the frame about to be encoded. For example, referring to FIG. 10, when the frame about to be encoded is A1, the pause takes effect at frame A3, two frames later, so pause instruction flag 0 changes from 1 to 0.

[0058] When pause instruction flag 0 has changed from 1 to 0 (YES in S42), the pause begins two frames after the frame about to be encoded, as described above, so the fade-out process is applied to the frame about to be encoded (S44). The fade-out process is thus applied to the frame two frames before the pause instruction signal goes to L.

[0059] After the fade-out process, or when pause instruction flag 0 has not changed from 1 to 0 (NO in S42), the processor 35 determines whether pause instruction flag 1 is 0 (S46). Pause instruction flag 1 being 0 indicates that the pause instruction signal for the frame immediately after the frame about to be processed is L. For example, referring to FIG. 10, when the frame about to be encoded is frame A2 or A3, recording is paused at the immediately following frame A3 or A4, respectively, so pause instruction flag 1 is 0.

[0060] When pause instruction flag 1 is 0 (YES in S46), a process that holds the gain of the audio level of the frame about to be encoded at the reference value (for example, 0) (hereinafter referred to as the "mute process") is performed (S48).

[0061] After the mute process, or when pause instruction flag 1 is 1 (NO in S46), the processes of S50 to S58 are performed. The processes of S50 to S58 are the same as the processes of S16 to S24 described with reference to FIG. 3. A detailed description is therefore not repeated here.
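For comparison with the first embodiment, the branches S42 to S58 can be written as the following decision function. The function name `decide_actions_embodiment2` and the action labels are hypothetical; the mapping of S50 to S58 onto S16 to S24 follows the statement above that the two sequences are the same.

```python
def decide_actions_embodiment2(prev_flag0, prev_flag2, flag0, flag1, flag2):
    """Return the ordered list of steps for the frame being processed.

    prev_* are flag values before the S40 update, the others are the values
    after it (1 = H, 0 = L).
    """
    actions = []
    if prev_flag0 == 1 and flag0 == 0:   # S42: pause starts two frames later
        actions.append("fade_out")       # S44
    if flag1 == 0:                       # S46: next frame is paused
        actions.append("mute")           # S48
    if prev_flag2 == 0 and flag2 == 1:   # S50: pause released at this frame
        actions.append("fade_in")        # S52
    if flag2 == 1:                       # S54: this frame is not paused
        actions.append("encode")         # S56
        actions.append("write_fifo")     # S58
    return actions


# Two frames before a pause, one frame before it, and the first frame after it.
print(decide_actions_embodiment2(1, 1, 0, 1, 1))
print(decide_actions_embodiment2(0, 1, 0, 0, 1))
print(decide_actions_embodiment2(1, 0, 1, 1, 1))
```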

[0062] Thereafter, the processes shown in FIGS. 8 and 9 are performed each time the audio frame pulse generator 36 generates an audio frame pulse.

[0063] The audio level of the encoded data obtained by the audio processing device 30 will now be described with reference to the specific example shown in FIG. 10.

[0064] In the example of FIG. 10, the pause instruction signal is L at the timing when frames A3 and A4 are being input to the two-port memory 31, and recording is paused. Following the process described above, when frames A0, A6 and A7 are processed, neither pause instruction flag 0 nor pause instruction flag 2 changes in value, pause instruction flag 1 is 1, and pause instruction flag 2 is 1, so the audio encoding process is executed in the normal manner (S56).

[0065] When frame A1 is processed, as described above, pause instruction flag 0 has changed from 1 to 0 (YES in S42 of FIG. 9) and pause instruction flag 2 is 1 (YES in S54), so the fade-out process is performed and then the audio encoding process is performed (S44, S56).

[0066] When frames A2 and A3 are processed, as described above, pause instruction flag 1 is 0 (YES in S46) and pause instruction flag 2 is 1 (YES in S54), so the mute process is performed and then the audio encoding process is performed (S48, S56).

[0067] When frame A5 is processed, likewise, pause instruction flag 2 has changed from 0 to 1 as described above (YES in S50 of FIG. 9) and pause instruction flag 2 is 1 (YES in S54), so the fade-in process is performed and then the audio encoding process is performed (S52, S56).

[0068] Consequently, the gain of the audio level in each frame is, for example, as shown in FIG. 10: the fade-out process is applied to the frame two frames before the pause is instructed, the fade-in process is applied to the frame at which the pause is released, and the mute process is applied to the frames in between. The audio encoding process is then performed. The resulting encoded audio data is stored in the order of frames A0, A1, A2, A5, A6 and A7, and is reproduced in this order during audio reproduction. During reproduction, the audio fades out in frame A1, the audio level is 0 in frame A2, and the audio then fades in in frame A5. No abrupt change in audio level therefore occurs across the point where recording was paused, and the audio can be reproduced without noise.
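The FIG. 10 example can be checked numerically by tracking, for each stored frame, the gain at its start and at its end. The sketch below is a simplified model: the function name `gain_envelope_embodiment2`, the initial flag value 1 (H), the two trailing H pulses that flush the delay, and the use of 1.0 and 0.0 as normal and reference gains are assumptions.

```python
def gain_envelope_embodiment2(pause_levels):
    """Return (frame label, start gain, end gain) for each encoded frame.

    pause_levels[k] is 1 (H) or 0 (L) while frame k is being input.
    """
    flag0 = flag1 = flag2 = 1            # assumed initial flag values
    stored = []
    for pulse, level in enumerate(list(pause_levels) + [1, 1]):
        if pulse < 2:                    # counter is 0 or 1: delay filling
            continue
        prev0, prev2 = flag0, flag2
        flag2, flag1, flag0 = flag1, flag0, level   # S40
        start, end = 1.0, 1.0            # normal gain
        if prev0 == 1 and flag0 == 0:    # S42 -> S44: fade out
            end = 0.0
        if flag1 == 0:                   # S46 -> S48: mute
            start, end = 0.0, 0.0
        if prev2 == 0 and flag2 == 1:    # S50 -> S52: fade in
            start = 0.0
        if flag2 == 1:                   # S54 -> S56, S58
            stored.append((f"A{pulse - 2}", start, end))
    return stored


envelope = gain_envelope_embodiment2([1, 1, 1, 0, 0, 1, 1, 1])
for label, start, end in envelope:
    print(label, start, end)

# Each stored frame starts at the gain where the previous one ended.
continuous = all(envelope[i][2] == envelope[i + 1][1]
                 for i in range(len(envelope) - 1))
print(continuous)
```

In this model the end gain of every stored frame equals the start gain of the next stored frame, including across the splice from A2 to A5, which is the property that avoids an audible level jump.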

[0069] In television programs in particular, the audio is switched from stereo to monaural at the timing of a switch to, for example, a CM (Commercial Message). This audio switching can be used as a trigger to pause recording and to release the pause. However, the video signal and the audio signal do not always correspond exactly. Consequently, when the pause operation is triggered by audio switching, ordinary audio encoding may, because of the offset between the video signal and the audio signal, cause the audio signal corresponding to the video after the pause to be reproduced while the video before the pause is being played back, or the reverse. In the audio processing device 30 according to the present embodiment, the fade-out process is performed two frames before the frame at which the pause instruction signal goes to L, and the mute process is performed one frame before. The audio signal corresponding to the video after the pause is therefore no longer generated while the video before the pause is being played back, and mismatches in reproduction timing when multiple devices are used can be avoided.

[0070] The audio processing device 30 described above can be implemented by a computer. The method of implementing the audio processing device 30 by a computer is the same as that of the first embodiment described with reference to FIGS. 6 and 7. A detailed description is therefore not repeated here.

[0071] In the audio processing device 30, the fade-out process is performed in the frame two frames before the frame at which the pause instruction signal goes to L. Alternatively, the fade-out process may be performed in a frame three or more frames before, with the mute process performed in the frames that follow.

[0072] As described above, according to the audio processing device 30 of the present embodiment, audio can be reproduced without the noise that accompanies an abrupt change in audio level during reproduction of the audio signal. In addition, even when the audio encoding process is performed with multiple devices synchronized, mismatches in reproduction timing between the devices can be avoided.

[0073] The embodiments disclosed herein should be considered illustrative in all respects and not restrictive. The scope of the present invention is defined not by the above description but by the claims, and is intended to include all modifications within the meaning and scope equivalent to the claims.

[0074]

[Effects of the Invention] According to the invention described in claims 1 to 12, audio can be reproduced without an abrupt change in audio level during reproduction and without the noise that accompanies such a change.

[0075] According to the invention described in claims 4, 8 and 12, mismatches in reproduction timing when multiple devices are used can be avoided.

[Brief description of the drawings]

FIG. 1 is a block diagram showing the configuration of an audio processing device according to Embodiments 1 and 2 of the present invention.

FIG. 2 is a flowchart of the audio encoding process according to Embodiment 1.

FIG. 3 is a flowchart of the audio encoding process according to Embodiment 1.

FIG. 4 is a diagram for explaining the process of updating the values of the pause instruction flags.

FIG. 5 is a diagram for explaining the audio level after the audio encoding process according to Embodiment 1.

FIG. 6 is an external view of a computer that implements the audio processing device.

FIG. 7 is an internal block diagram of a computer that implements the audio processing device.

FIG. 8 is a flowchart of the audio encoding process according to Embodiment 2.

FIG. 9 is a flowchart of the audio encoding process according to Embodiment 2.

FIG. 10 is a diagram for explaining the audio level after the audio encoding process according to Embodiment 2.

FIG. 11 is a diagram for explaining a conventional audio encoding process.

[Explanation of symbols]

30: audio processing device; 31: two-port memory; 32, 33: registers; 34: FIFO memory; 35: processor; 36: audio frame pulse generator; 37: memory.

Claims

[Claims]

1. An audio processing apparatus for encoding audio data frame by frame based on a first signal, which selectively takes one of a first value and a second value and instructs encoding of the audio data, and a second signal, which selectively takes one of a third value and a fourth value and instructs a pause of the encoding of the audio data, the apparatus comprising: a memory having a first port and a second port from which audio data is output; audio data encoding means, connected to the memory, for reading the audio data from the memory with a delay after the audio data is input to the memory, and for selectively executing, based on the histories of the first and second signals, either a process of encoding the audio data or a process of not encoding the audio data; and audio level control means, coupled to the audio data encoding means, for controlling, based on the history of the second signal, a gain of the audio data encoded by the audio data encoding means so as to eliminate a difference in audio level of the audio data between the frames before and after a period during which the audio data encoding means does not encode audio data.

2. The audio processing apparatus according to claim 1, wherein the audio level control means includes: first fade-out means, coupled to the audio data encoding means, for reducing, based on the history of the second signal, the gain of the audio level of the audio data to a reference value in a first predetermined frame period before the frame at which the audio data encoding means stops encoding; and fade-in means, coupled to the audio data encoding means, for increasing, based on the history of the second signal, the gain of the audio level of the audio data from the reference value in a second predetermined frame period after the frame at which the audio data encoding means resumes encoding.

3. The audio processing apparatus according to claim 2, wherein the first fade-out means includes second fade-out means, coupled to the audio data encoding means, for reducing, based on the history of the second signal, the gain of the audio level of the audio data to the reference value in the one frame immediately before the audio data encoding means stops encoding.

4. The audio processing apparatus according to claim 2, wherein the first fade-out means includes: second fade-out means, coupled to the audio data encoding means, for reducing, based on the history of the second signal, the gain of the audio level of the audio data to the reference value in accordance with a predetermined method over a predetermined frame two or more frames before the frame at which the audio data encoding means stops encoding; and mute means for maintaining, based on the history of the second signal, the gain of the audio level of the audio data at the reference value in the frames after the predetermined frame and before the audio data encoding means stops encoding.

5. An audio processing method for encoding audio data frame by frame based on a first signal, which selectively takes one of a first value and a second value and instructs encoding of the audio data, and a second signal, which selectively takes one of a third value and a fourth value and instructs a pause of the encoding of the audio data, the method comprising the steps of: delaying input audio data; controlling, based on the delayed audio data and the history of the second signal, a gain of the audio data to be encoded so as to eliminate a difference in audio level of the audio data between the frames before and after a period during which the audio data is not encoded; and selectively executing, based on the delayed audio data and the histories of the first and second signals, either a process of encoding the audio data or a process of not encoding the audio data.

6. The audio processing method according to claim 5, wherein the step of controlling the gain includes the steps of: reducing, based on the delayed audio data and the history of the second signal, the gain of the audio level of the audio data to a reference value in a first predetermined frame period before the frame at which encoding of the audio data is stopped; and increasing, based on the delayed audio data and the history of the second signal, the gain of the audio level of the audio data from the reference value in a second predetermined frame period after the frame at which encoding of the audio data is resumed.

7. The audio processing method according to claim 6, wherein the step of reducing the gain to the reference value includes the step of reducing, based on the delayed audio data and the history of the second signal, the gain of the audio level of the audio data to the reference value in the one frame immediately before encoding of the audio data is stopped.

8. The audio processing method according to claim 1, wherein the step of reducing the gain to the reference value includes the steps of: reducing, based on the delayed audio data and the history of the second signal, the gain of the audio level of the audio data to the reference value in accordance with a predetermined method over a predetermined frame two or more frames before the frame at which encoding of the audio data is stopped; and maintaining, based on the delayed audio data and the history of the second signal, the gain of the audio level of the audio data at the reference value in the frames after the predetermined frame and before encoding of the audio data is stopped.

9. A computer-readable recording medium on which is recorded a program for causing a computer to execute an audio processing method for encoding audio data frame by frame based on a first signal, which selectively takes one of a first value and a second value and instructs encoding of the audio data, and a second signal, which selectively takes one of a third value and a fourth value and instructs a pause of the encoding of the audio data, the audio processing method comprising the steps of: delaying input audio data; controlling, based on the delayed audio data and the history of the second signal, a gain of the audio data to be encoded so as to eliminate a difference in audio level of the audio data between the frames before and after a period during which the audio data is not encoded; and selectively executing, based on the delayed audio data and the histories of the first and second signals, either a process of encoding the audio data or a process of not encoding the audio data.

10. The computer-readable recording medium according to claim 9, wherein the step of controlling the gain includes the steps of: reducing, based on the delayed audio data and the history of the second signal, the gain of the audio level of the audio data to a reference value in a first predetermined frame period before the frame at which encoding of the audio data is stopped; and increasing, based on the delayed audio data and the history of the second signal, the gain of the audio level of the audio data from the reference value in a second predetermined frame period after the frame at which encoding of the audio data is resumed.

11. The computer-readable recording medium according to claim 10, wherein the step of reducing the gain to the reference value includes the step of reducing, based on the history of the second signal, the gain of the audio level of the audio data to the reference value in the one frame immediately before encoding of the audio data is stopped.

12. The computer-readable recording medium according to claim 10, wherein the step of reducing the gain to the reference value includes the steps of: reducing, based on the delayed audio data and the history of the second signal, the gain of the audio level of the audio data to the reference value in accordance with a predetermined method over a predetermined frame two or more frames before the frame at which encoding of the audio data is stopped; and maintaining, based on the delayed audio data and the history of the second signal, the gain of the audio level of the audio data at the reference value in the frames after the predetermined frame and before encoding of the audio data is stopped.