JP2000151531A

JP2000151531A - Device and method for correcting audio data

Info

Publication number: JP2000151531A
Application number: JP10315886A
Authority: JP
Inventors: Koichi Horiuchi; 浩一堀内; Takao Matsumoto; 孝夫松本; Aki Yoneda; 亜旗米田
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1998-11-06
Filing date: 1998-11-06
Publication date: 2000-05-30

Abstract

PROBLEM TO BE SOLVED: To provide a device and a method for correcting audio data that has missing portions, so that audio data and video data can be encoded and multiplexed in synchronization with each other. SOLUTION: A data quantity measurement means 11 measures a data quantity 1i per unit time of received audio data. A data quantity comparison means 12 compares a preset data quantity 1t per unit time with the data quantity 1i. A correction data insert means 13 outputs the received audio data unchanged when the two quantities are equal. When the data quantity 1i is less than the data quantity 1t, it inserts correction data equal in amount to the difference between the data quantity 1t and the data quantity 1i, distributed uniformly, into the audio data received after the point at which the data quantity 1i was judged to be smaller, and outputs the result.

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

[Technical Field of the Invention] The present invention relates to an audio data correction apparatus and method for correcting audio data that has missing portions, and more particularly to an audio data correction apparatus and method that corrects such audio data so that audio data and video data can be encoded and multiplexed in synchronization with the video data.

[0002]

[Prior Art] In recent years, so-called multimedia personal computers have come onto the market at low prices, making it easy to play back audio and video on a PC. Audio and video are also now distributed over the Internet. Because audio and video data are very large, they are generally not used in raw form but are encoded to reduce the data volume. Data containing both audio data and video data is multiplexed so that the two can be played back (decoded) in synchronization. MPEG is a well-known example of such encoding and multiplexing. Because playback (decoding) requires relatively little processing, it is now mostly done in software.

[0003] Conventionally in this field, because encoding requires an enormous amount of processing, audio data and video data have generally been encoded and multiplexed in one of two ways: in real time, as the data is captured, using hardware equipped with a dedicated LSI (an expansion board or a peripheral device external to the PC); or in software, on audio data and video data captured beforehand, taking several times real time. With hardware equipped with a dedicated LSI, the audio data is captured by the dedicated LSI, so no audio data is lost. With software, the audio data only needs to be prepared in advance, so again no audio data is lost. Because audio data with missing portions was never used, no audio data correction apparatus or method for correcting such data was needed.

[0004]

[Problems to Be Solved by the Invention] Under these circumstances, improvements in CPU performance are making it possible to encode and multiplex audio data and video data in software in real time. In the world of PCs with high-performance CPUs, real-time software is expected to become mainstream, just as playback (decoding) has come to be done mainly in software. Real-time software here also includes software that offloads part of the encoding and multiplexing to hardware.

[0005] When encoding and multiplexing are performed in real time in software, audio data is brought into the PC through an expansion board, called a capture board, that converts an analog audio signal into digital audio data. This board normally also converts digital audio data into an analog audio signal for output to speakers, and is installed in the PC as an audio board that handles both audio input and output. Unlike conventional hardware equipped with a dedicated LSI, it is used for purposes other than capturing audio data for encoding and multiplexing. In addition, the capture step, in which the digitized audio data is transferred to the PC's internal memory or to hardware that shares the processing, is performed by software.

[0006] When audio data is captured in this way, the software capture step sometimes cannot keep up with real time, and audio data is missed. For example, if the audio data A(1), A(2), A(3), ..., A(N-1), A(N) is to be captured and A(2) is missed, the captured audio data is A(1), A(3), ..., A(N-1), A(N), with A(2) missing.

[0007] Audio data carries no time information indicating when each piece of data was captured, so on receiving the audio data A(1), A(3), ..., A(N-1), A(N) one cannot tell whether anything is missing, nor which audio data is missing. If this audio data is multiplexed with video data, the audio and video fall out of synchronization from the time of the loss onward. When the loss is very small the offset is hard for a person to perceive, but as losses accumulate it becomes clearly perceptible. In addition, there is no audio just before the end of the video.
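The way small losses add up to a perceptible offset can be illustrated with simple arithmetic. The following Python sketch is only an illustration: the 48 kHz sample rate, the loss of 480 samples per second, and the function name `av_offset_ms` are assumptions and do not come from this document.

```python
def av_offset_ms(missing_samples: int, sample_rate_hz: int) -> float:
    """Time by which the audio runs ahead of the video, in milliseconds,
    when `missing_samples` samples have been lost so far."""
    return 1000.0 * missing_samples / sample_rate_hz


# Assumed figures: 48 kHz capture, 480 samples lost in every second.
SAMPLE_RATE_HZ = 48_000
LOST_PER_SECOND = 480

# Offset after 1 s, 10 s and 60 s of capture.
offsets = [av_offset_ms(LOST_PER_SECOND * t, SAMPLE_RATE_HZ) for t in (1, 10, 60)]
```

Under these assumed figures the offset grows linearly with capture time, which is why a loss that is inaudible at first becomes noticeable later.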

[0008] In view of the above problems, an object of the present invention is to provide an audio data correction apparatus and method that corrects audio data with missing portions so that audio data and video data can be encoded and multiplexed in synchronization with the video data.

[0009]

[Means for Solving the Problems] An audio data correction apparatus of the present invention comprises: means for measuring a first data amount per unit time of input audio data; means for comparing a preset second data amount per unit time with the first data amount; and means for outputting the input audio data when the first data amount is equal to the second data amount and, when the first data amount is less than the second data amount, inserting correction data equal in amount to the difference between the second data amount and the first data amount, evenly distributed, into the input audio data received after the point at which the data amount was judged to be small, and outputting the result.

[0010] An audio data correction apparatus of the present invention also comprises: means for measuring a first data amount per unit time of input audio data; means for comparing a preset second data amount per unit time with the first data amount; means for temporarily storing the input audio data; and means for outputting the input audio data held in the storing means when the first data amount is equal to the second data amount and, when the first data amount is less than the second data amount, inserting correction data equal in amount to the difference between the second data amount and the first data amount, evenly distributed, into the input audio data held in the storing means, and outputting the result.

[0011] An audio data correction apparatus of the present invention also comprises, instead of inserting correction data, means for upsampling and re-interpolating the input audio data so as to increase the data amount by the difference between the second data amount and the first data amount.

[0012]

[Embodiments of the Invention] Embodiments of the present invention are described below with reference to the drawings.

[0013] (Embodiment 1) FIG. 1 is a block diagram showing an example of the configuration of an audio data correction apparatus according to Embodiment 1 of the present invention. In FIG. 1, data amount measuring means 11 measures the data amount per unit time of the input audio data (hereinafter, data amount 1i). Data amount comparing means 12 compares a preset data amount per unit time (hereinafter, data amount 1t) with the data amount 1i measured by the data amount measuring means 11. Correction data inserting means 13 outputs the input audio data when the comparison by the data amount comparing means 12 finds the two equal; when the comparison finds that the data amount 1i is smaller, it inserts correction data equal in amount to the difference between the data amount 1t and the data amount 1i, evenly distributed, into the input audio data received after the point at which the data amount 1i was judged to be smaller, and outputs the result. Audio data encoding device 14 encodes the corrected audio data. Video data encoding means 15 encodes the input video data. Multiplexing means 16 multiplexes the encoded audio data and the encoded video data in synchronization and outputs audio/video encoded data.

[0014] The operation of the audio data correction apparatus is described in more detail using the example of FIG. 3. In FIG. 3, 31 is the original audio data, 32 is the data that was lost, 33 is the input audio data supplied to the audio data correction apparatus, and 34 is the corrected audio data output by the audio data correction apparatus. The figure shows one portion of continuously input audio data.

[0015] If nothing is lost, audio data arrives at a constant data rate, and this rate is known in advance. The data amount per unit time measured by the data amount measuring means 11 is therefore always constant as long as no audio data is lost. Let this data amount be m. The data amount comparing means 12 compares the data amount measured by the data amount measuring means 11 with m. If they are equal, nothing has been lost, and the input audio data is output as is.
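The measurement and comparison described above can be sketched in a few lines of Python. This is a minimal illustration, not the apparatus itself: the function name `unit_deficit`, the 10 ms measurement unit, and the 48 kHz rate that gives m = 480 are all assumptions. The case of a measured amount larger than m is not discussed in the text, so the sketch simply rejects it.

```python
def unit_deficit(measured: int, expected: int) -> int:
    """Compare the measured data amount of one unit with the expected
    amount m and return the shortfall (0 means nothing was lost)."""
    if measured > expected:
        # Not covered by the source text; treated as an error here.
        raise ValueError("measured amount exceeds the expected amount")
    return expected - measured


# Assumed figures: 48 kHz mono audio measured in 10 ms units, so m = 480.
m = 480
measured_amounts = [480, 480, 476, 480]
deficits = [unit_deficit(n, m) for n in measured_amounts]
```

A deficit of zero corresponds to the pass-through case; a positive deficit marks a unit in which data was lost.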

[0016] If the data amount measured by the data amount measuring means 11 is smaller, it can be concluded that data has been lost. Let the data amount measured at this time be m'. Then m - m' of data has been lost, and the data must be increased by m - m'. Because it cannot be determined which data was lost, m - m' of correction data is inserted, evenly distributed, into the input audio data of the measurement unit that follows the one in which the loss was detected, and the result is output. The correction data is distributed evenly so that the changes to the input audio data are spread over several places and the change at any one place is kept to a minimum, which makes it hard for a person to perceive that correction data has been inserted.
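One possible way to spread the m - m' insertions evenly is sketched below in Python. The rule used here (placing each insertion at the centre of one of `deficit` equal-width slices of the unit) is an assumption chosen for illustration, as are the function names and the use of 0 as the inserted value; the text only asks that the insertions be spread out rather than concentrated.

```python
def spread_positions(n: int, deficit: int) -> list[int]:
    """Indices (0 <= index < n for n >= 1) before which a correction sample
    is inserted, taken at the centres of `deficit` equal-width slices."""
    return [((2 * k + 1) * n) // (2 * deficit) for k in range(deficit)]


def insert_evenly(samples: list[int], deficit: int, fill: int = 0) -> list[int]:
    """Return `samples` with `deficit` copies of `fill` spread through it."""
    positions = spread_positions(len(samples), deficit)
    out: list[int] = []
    k = 0
    for i, sample in enumerate(samples):
        # Several insertions can share one index when deficit > len(samples).
        while k < deficit and positions[k] == i:
            out.append(fill)
            k += 1
        out.append(sample)
    # Only reached with pending insertions when `samples` is empty.
    out.extend([fill] * (deficit - k))
    return out


# Eight samples with a deficit of two; -1 marks the inserted data.
corrected = insert_evenly(list(range(8)), 2, fill=-1)
```

In this example the two inserted values land before the third and the seventh original samples, so no two insertions are adjacent.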

[0017] By correcting input audio data with missing portions in this way, encoded data in which the video data and the audio data are synchronized can be created. In addition, because no processing that delays the input audio data itself is needed, the delay introduced by correcting the input audio data can be kept small. A small delay is useful in applications where the delay should be as short as possible, such as videophones and surveillance cameras.

[0018] In Embodiment 1 the correction data is inserted into the input audio data of the measurement unit that follows the one in which the loss was detected, but it may be inserted anywhere in the input audio data after the measurement unit in which the loss was detected, and may span several measurement units. Also, although the insertion positions are distributed evenly, they need not be strictly even as long as they are not extremely concentrated.

[0019] (Embodiment 2) FIG. 2 is a block diagram showing an example of the configuration of an audio data correction apparatus according to Embodiment 2 of the present invention. In FIG. 2, data amount measuring means 21 measures the data amount per unit time of the input audio data (hereinafter, data amount 2i). Data amount comparing means 22 compares a preset data amount per unit time (hereinafter, data amount 2t) with the data amount 2i measured by the data amount measuring means 21. Buffer 27 temporarily stores the input audio data. Correction data inserting means 23 outputs the input audio data stored in the buffer 27 when the comparison by the data amount comparing means 22 finds the two equal; when the comparison finds that the data amount 2i is smaller, it inserts correction data equal in amount to the difference between the data amount 2t and the data amount 2i, evenly distributed, into the input audio data stored in the buffer 27, and outputs the result. Audio data encoding device 24 encodes the corrected audio data. Video data encoding means 25 encodes the input video data. Multiplexing means 26 multiplexes the encoded audio data and the encoded video data in synchronization and outputs audio/video encoded data.

[0020] The operation of the audio data correction apparatus is described in more detail using the example of FIG. 4. In FIG. 4, 41 is the original audio data, 42 is the data that was lost, 43 is the input audio data supplied to the audio data correction apparatus, and 44 is the corrected audio data output by the audio data correction apparatus. The figure shows one portion of continuously input audio data.

[0021] If nothing is lost, audio data arrives at a constant data rate, and this rate is known in advance. The data amount per unit time measured by the data amount measuring means 21 is therefore always constant as long as no audio data is lost. Let this data amount be m. The data amount comparing means 22 compares the data amount measured by the data amount measuring means 21 with m. If they are equal, nothing has been lost, and the input audio data stored in the buffer 27 is output as is.

[0022] If the data amount measured by the data amount measuring means 21 is smaller, it can be concluded that data has been lost. Let the data amount measured at this time be m'. Then m - m' of data has been lost, and the data must be increased by m - m'. Because it cannot be determined which data was lost, m - m' of correction data is inserted, evenly distributed, into the input audio data stored in the buffer 27, which includes the measurement unit in which the loss was detected, and the result is output. The correction data is distributed evenly so that the changes to the input audio data are spread over several places and the change at any one place is kept to a minimum, which makes it hard for a person to perceive that correction data has been inserted.
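The buffered variant can be sketched as follows, under the assumption that the buffer holds exactly one measurement unit at a time, so that each unit is topped up to m before it leaves the buffer. The function names, the slice-centre placement rule, and the fill value 0 are illustrative assumptions, not details taken from the text.

```python
def insert_evenly(samples: list[int], deficit: int, fill: int = 0) -> list[int]:
    """Spread `deficit` copies of `fill` through `samples` (slice-centre rule)."""
    n = len(samples)
    positions = [((2 * k + 1) * n) // (2 * deficit) for k in range(deficit)]
    out: list[int] = []
    k = 0
    for i, sample in enumerate(samples):
        while k < deficit and positions[k] == i:
            out.append(fill)
            k += 1
        out.append(sample)
    out.extend([fill] * (deficit - k))  # only when `samples` is empty
    return out


def correct_buffered(units: list[list[int]], m: int, fill: int = 0) -> list[list[int]]:
    """Hold each unit in the buffer, measure it, and pad it up to m samples
    before output, so the correction stays inside the unit that lost data."""
    corrected = []
    for buffered_unit in units:
        deficit = max(m - len(buffered_unit), 0)  # m - m'
        corrected.append(insert_evenly(buffered_unit, deficit, fill))
    return corrected


# The second unit arrived with only 2 of the expected 4 samples.
result = correct_buffered([[1, 2, 3, 4], [5, 6], [7, 8, 9, 10]], m=4)
```

With this arrangement every output unit has exactly m samples, at the cost of holding one unit in the buffer before it can be output.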

[0023] By correcting input audio data with missing portions in this way, encoded data in which the video data and the audio data are synchronized can be created. Because the correction data is spread over a wide range, its insertion is even harder to perceive. Furthermore, if the storage unit of the buffer is made the same as the measurement unit of the data amount, correction data is inserted only into the measurement unit in which the loss occurred, so the effect of the correction data is confined to the smallest possible range. The input audio data in all other measurement units remains fully synchronized, and encoded data that is synchronized as a whole can be created.

[0024] In Embodiment 2 the correction data is inserted across all of the input audio data stored in the buffer 27, which includes the measurement unit in which the loss was detected, but it may be inserted into only some of the measurement units in the buffer. Also, although the insertion positions are distributed evenly, they need not be strictly even as long as they are not extremely concentrated.

[0025] (Embodiment 3) In Embodiment 3 of the present invention, the correction data inserted by the correction data inserting means 13 of FIG. 1 described in Embodiment 1 and by the correction data inserting means 23 of FIG. 2 described in Embodiment 2 is one of the following: silence data, the immediately preceding audio data, the immediately following audio data, or data interpolated between the immediately preceding and immediately following audio data.

[0026] Because it is not known what the lost input audio data was, some substitute data must be inserted as correction data. This correction data is inherently not the true data, so it may be perceived as noise or as an abnormal sound; data that is hard for a person to perceive is desirable. Silence data is a momentary break in the audio and is therefore hard to perceive. The immediately preceding or immediately following data is a momentary continuation of the same sound and is even harder to perceive. Data interpolated between the immediately preceding and immediately following audio data is the sound inferred from the surrounding audio and is harder still to perceive.
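The four kinds of correction data can be compared side by side in a short Python sketch. The mode names, the function names, and the use of the integer midpoint as the interpolated value are assumptions made for illustration; the text does not specify how the interpolation is computed. The sketch assumes a non-empty list of integer PCM samples.

```python
def correction_value(prev: int, nxt: int, mode: str) -> int:
    """Value of one inserted sample for each kind of correction data."""
    if mode == "silence":
        return 0
    if mode == "previous":
        return prev
    if mode == "next":
        return nxt
    if mode == "interpolate":
        return (prev + nxt) // 2  # assumed: integer midpoint of the neighbours
    raise ValueError(f"unknown mode: {mode}")


def insert_at(samples: list[int], index: int, mode: str) -> list[int]:
    """Insert one correction sample before `index` (samples must be non-empty)."""
    prev = samples[index - 1] if index > 0 else samples[0]
    nxt = samples[index] if index < len(samples) else samples[-1]
    return samples[:index] + [correction_value(prev, nxt, mode)] + samples[index:]


pcm = [10, 20, 30, 40]
variants = {
    mode: insert_at(pcm, 2, mode)
    for mode in ("silence", "previous", "next", "interpolate")
}
```

On a smoothly rising signal such as this one, the silence variant produces the largest jump at the insertion point and the interpolated variant the smallest, which matches the ordering of perceptibility given in the text.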

[0027] (Embodiment 4) In Embodiment 4 of the present invention, the correction data inserting means 13 of FIG. 1 described in Embodiment 1 and the correction data inserting means 23 of FIG. 2 described in Embodiment 2 do not insert correction data; instead, they upsample the input audio data so as to increase the data amount by the lost amount m - m', thereby correcting the audio data.

[0028] Because the input audio data is upsampled, the corrected audio data is the input audio data stretched in time by the amount of data that was lost, and the correction is hard to perceive.
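A minimal sketch of this stretching is shown below in Python, assuming plain linear interpolation between neighbouring samples. The function name `stretch` and the choice of linear interpolation are assumptions; the text does not name a resampling method, and a practical implementation would more likely use a proper resampling filter.

```python
def stretch(samples: list[float], target_len: int) -> list[float]:
    """Resample `samples` (m' values) onto `target_len` (m) evenly spaced
    points covering the same time span, using linear interpolation."""
    n = len(samples)
    if n == 0:
        raise ValueError("cannot stretch an empty unit")
    if n == 1 or target_len == 1:
        return [float(samples[0])] * target_len
    out = []
    for j in range(target_len):
        # Position of output sample j on the input time axis.
        x = j * (n - 1) / (target_len - 1)
        i = min(int(x), n - 2)  # left neighbour, clamped at the last interval
        frac = x - i
        out.append(samples[i] * (1 - frac) + samples[i + 1] * frac)
    return out


# A unit that arrived with m' = 3 samples is stretched to m = 5.
stretched = stretch([0, 10, 20], 5)
```

The first and last samples are preserved and the new samples fall between the original ones, so the unit keeps its shape while regaining its expected length.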

[0029] (Embodiment 5) FIG. 5 is a flowchart showing an example of the configuration of an audio data correction method according to Embodiment 5 of the present invention. In FIG. 5, data amount measuring step 52 measures the data amount per unit time of the input audio data (hereinafter, data amount 5i). Data amount comparing step 53 compares a preset data amount per unit time (hereinafter, data amount 5t) with the data amount 5i measured in the data amount measuring step 52. If the comparison in the data amount comparing step 53 finds the two equal (5i = 5t), data output step 56 outputs the input audio data. If the comparison in the data amount comparing step 53 finds that the data amount 5i is smaller (5i < 5t), correction data inserting step 55 inserts correction data equal in amount to the difference between the data amount 5t and the data amount 5i, evenly distributed, into the input audio data received after the point at which the data amount 5i was judged to be smaller, and the data output step 56 outputs that data. These steps are repeated until the input audio data ends.
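The loop formed by these steps can be sketched as a Python generator. This is an illustration under several assumptions: each element of `units` is the data received in one unit time, the deficit found in one unit is inserted into the next unit, the insertion positions follow a slice-centre rule, the fill value is 0, and a deficit found in the final unit is flushed as a trailing block. None of these specifics are fixed by the text, and the function names are hypothetical.

```python
from typing import Iterable, Iterator


def insert_evenly(samples: list[int], deficit: int, fill: int = 0) -> list[int]:
    """Spread `deficit` copies of `fill` through `samples` (slice-centre rule)."""
    n = len(samples)
    positions = [((2 * k + 1) * n) // (2 * deficit) for k in range(deficit)]
    out: list[int] = []
    k = 0
    for i, sample in enumerate(samples):
        while k < deficit and positions[k] == i:
            out.append(fill)
            k += 1
        out.append(sample)
    out.extend([fill] * (deficit - k))  # only when `samples` is empty
    return out


def correct_stream(units: Iterable[list[int]], m: int, fill: int = 0) -> Iterator[list[int]]:
    """Measure, compare, insert and output, repeated until the input ends."""
    carry = 0  # deficit detected in the previous unit, not yet inserted
    for unit in units:
        corrected = insert_evenly(unit, carry, fill)  # step 55 (no-op if carry == 0)
        carry = max(m - len(unit), 0)                 # steps 52 and 53
        yield corrected                               # step 56
    if carry:
        yield [fill] * carry  # assumed: flush a deficit found in the last unit


# The second unit arrived with 2 of 4 samples; the third unit absorbs the deficit.
output = list(correct_stream([[1, 2, 3, 4], [5, 6], [7, 8, 9, 10]], m=4))
```

Because each unit is output as soon as it has been measured, this variant adds no buffering delay; the correction simply appears one unit later than the loss.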

[0030] The operation of the audio data correction method is described in more detail using the example of FIG. 3. If nothing is lost, audio data arrives at a constant data rate, and this rate is known in advance. The data amount per unit time measured in the data amount measuring step 52 is therefore always constant as long as no audio data is lost. Let this data amount be m. The data amount comparing step 53 compares the data amount measured in the data amount measuring step 52 with m. If they are equal, nothing has been lost, and the input audio data is output as is.

[0031] If the data amount measured in the data amount measuring step 52 is smaller, it can be concluded that data has been lost. Let the data amount measured at this time be m'. Then m - m' of data has been lost, and the data must be increased by m - m'. Because it cannot be determined which data was lost, m - m' of correction data is inserted, evenly distributed, into the input audio data of the measurement unit that follows the one in which the loss was detected, and the result is output. The correction data is distributed evenly so that the changes to the input audio data are spread over several places and the change at any one place is kept to a minimum, which makes it hard for a person to perceive that correction data has been inserted.

[0032] By correcting input audio data with missing portions in this way, encoded data in which the video data and the audio data are synchronized can be created. In addition, because no processing that delays the input audio data itself is needed, the delay introduced by correcting the input audio data can be kept small. A small delay is useful in applications where the delay should be as short as possible, such as videophones and surveillance cameras.

[0033] In Embodiment 5 the correction data is inserted into the input audio data of the measurement unit that follows the one in which the loss was detected, but it may be inserted anywhere in the input audio data after the measurement unit in which the loss was detected, and may span several measurement units. Also, although the insertion positions are distributed evenly, they need not be strictly even as long as they are not extremely concentrated.

[0034] (Embodiment 6) FIG. 6 is a flowchart showing an example of the configuration of an audio data correction method according to Embodiment 6 of the present invention. In FIG. 6, data temporary storing step 67 temporarily stores the input audio data in a buffer. Data amount measuring step 62 measures the data amount per unit time of the input audio data (hereinafter, data amount 6i). Data amount comparing step 63 compares a preset data amount per unit time (hereinafter, data amount 6t) with the data amount 6i measured in the data amount measuring step 62. If the comparison in the data amount comparing step 63 finds the two equal (6i = 6t), data output step 66 outputs the input audio data. If the comparison in the data amount comparing step 63 finds that the data amount 6i is smaller (6i < 6t), correction data inserting step 65 inserts correction data equal in amount to the difference between the data amount 6t and the data amount 6i, evenly distributed, into the input audio data stored in the data temporary storing step 67, and the data output step 66 outputs that data. These steps are repeated until the input audio data ends.

[0035] The operation of the audio data correction method is described in more detail using the example of FIG. 4. If nothing is lost, audio data arrives at a constant data rate, and this rate is known in advance. The data amount per unit time measured in the data amount measuring step 62 is therefore always constant as long as no audio data is lost. Let this data amount be m. The data amount comparing step 63 compares the data amount measured in the data amount measuring step 62 with m. If they are equal, nothing has been lost, and the input audio data stored in the data temporary storing step 67 is output as is.

[0036] If the data amount measured in the data amount measurement step 62 is smaller, it can be determined that a loss has occurred. Let the measured data amount be m'. Then m - m' of data has been lost, and the data must be increased by m - m'. Because it cannot be determined which data was lost, m - m' of correction data is inserted, evenly distributed, into the input audio data stored in the data temporary storage step 67, which includes the measurement unit determined to have lost data, and the result is output. The correction data is distributed evenly because spreading the changes to the input audio data over several places, and keeping the change at each place to a minimum, makes it harder for a listener to perceive that correction data has been inserted.
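One way to distribute m - m' correction samples evenly over the buffered data is sketched below. The function name `insert_evenly`, the integer-ratio placement rule, and the default fill value of 0 are assumptions for illustration; the text only requires that the insertion positions be spread out rather than concentrated.

```python
from typing import List


def insert_evenly(samples: List[int], deficit: int, fill: int = 0) -> List[int]:
    """Return a copy of `samples` with `deficit` fill values spread evenly.

    samples: buffered input audio data (length m' in the text's notation)
    deficit: number of lost samples to make up (m - m')
    fill:    correction value to insert (assumed here to be a constant)
    """
    if deficit <= 0:
        return list(samples)
    n = len(samples)
    out: List[int] = []
    inserted = 0
    for i, s in enumerate(samples):
        out.append(s)
        # Number of correction samples that should have been emitted
        # once i + 1 of the n input samples have been copied.
        due = (i + 1) * deficit // n
        while inserted < due:
            out.append(fill)
            inserted += 1
    # Only reached with an empty buffer: nothing to interleave with.
    out.extend([fill] * (deficit - inserted))
    return out
```

With six buffered samples and a deficit of two, one correction sample lands after the third input sample and one after the sixth, so no two insertions are adjacent.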

[0037] Correcting input audio data that has losses in this way makes it possible to create encoded data in which the video data and the audio data are synchronized. In addition, because the correction data is distributed over a wide range, the insertion becomes even harder to perceive. Furthermore, if the storage unit of the buffer is made the same as the measurement unit of the data amount, correction data is inserted only into the measurement unit in which the loss occurred. The effect of the correction data is thus confined to the smallest possible range, the input audio data of all other measurement units remains fully synchronized, and encoded data that is synchronized as a whole can be created.

[0038] In Embodiment 6, the correction data is inserted across the whole of the input audio data stored in the data temporary storage step 67, which includes the measurement unit determined to have lost data; however, it may be inserted into only some of those measurement units. Also, although the insertion positions of the correction data are distributed evenly, they need not be strictly even as long as they are not extremely biased.

[0039] (Embodiment 7) In Embodiment 7 of the present invention, the correction data inserted in the correction data insertion step 55 of FIG. 5 described in Embodiment 5, and in the correction data insertion step 65 of FIG. 6 described in Embodiment 6, is one of the following: silent data, the immediately preceding audio data, the immediately following audio data, or data interpolated from the immediately preceding and immediately following audio data.

[0040] Because there is no way of knowing what the lost input audio data was, some substitute data must be inserted as correction data. Since this correction data is inherently not the true data, it may be perceived as noise or an abnormal sound, so data that is hard for a listener to perceive is desirable. Silent data is a momentary break in the audio and is therefore hard to perceive. The immediately preceding or following data is a momentary continuation of the same sound and is therefore even harder to perceive. Data interpolated from the immediately preceding and following audio data is audio inferred from the surrounding audio and is therefore harder still to perceive.
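The four candidate kinds of correction data can be illustrated for a single inserted sample as follows. This is a hedged sketch: the function name `correction_value`, the mode strings, the representation of silence as 0.0 (which assumes signed, zero-centered samples), and the use of the midpoint as the interpolated value are all assumptions; the text does not specify an interpolation formula.

```python
def correction_value(prev: float, nxt: float, mode: str) -> float:
    """Pick the value of one inserted correction sample.

    prev: the sample immediately before the insertion point
    nxt:  the sample immediately after the insertion point
    mode: "silence", "previous", "next", or "interpolate" (hypothetical names)
    """
    if mode == "silence":
        return 0.0                 # silent data (assumes zero-centered samples)
    if mode == "previous":
        return prev                # repeat the immediately preceding data
    if mode == "next":
        return nxt                 # repeat the immediately following data
    if mode == "interpolate":
        return (prev + nxt) / 2.0  # assumed: linear midpoint of the neighbors
    raise ValueError(f"unknown mode: {mode}")
```

Between neighboring samples 2.0 and 4.0, the four modes yield 0.0, 2.0, 4.0, and 3.0 respectively; the interpolated value produces the smallest step relative to both neighbors.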

[0041] (Embodiment 8) In Embodiment 8 of the present invention, in the correction data insertion step 55 of FIG. 5 described in Embodiment 5 and the correction data insertion step 65 of FIG. 6 described in Embodiment 6, the input audio data is upsampled instead of having correction data inserted, so that the data amount increases by the lost amount m - m', thereby correcting the audio data.

[0042] Because the input audio data is upsampled, the corrected audio data is the input audio data stretched in time by the lost data amount, and the correction is hard to perceive.
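A simple way to stretch m' received samples to the expected m samples is to resample them, as sketched below. The function name `stretch` and the choice of linear interpolation are assumptions for illustration; the text does not name a resampling method, and a practical implementation would typically use a band-limited resampler.

```python
from typing import List, Sequence


def stretch(samples: Sequence[float], target_len: int) -> List[float]:
    """Resample `samples` to `target_len` points by linear interpolation.

    samples:    received audio data (m' samples)
    target_len: expected number of samples per unit (m)
    """
    n = len(samples)
    if n == 0 or target_len <= 0:
        return []
    if n == 1 or target_len == 1:
        return [float(samples[0])] * target_len
    out: List[float] = []
    # Map output index k onto a fractional position in the input.
    scale = (n - 1) / (target_len - 1)
    for k in range(target_len):
        pos = k * scale
        i = min(int(pos), n - 2)   # left neighbor, clamped at the last segment
        frac = pos - i
        out.append(samples[i] * (1.0 - frac) + samples[i + 1] * frac)
    return out
```

Because the first and last samples are preserved and the intermediate points are spread over the whole unit, the change is distributed across every sample rather than concentrated at a few insertion points.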

[0043] In all of Embodiments 1 to 8 above, only the cases where the measured data amount is equal to or smaller than the preset amount have been described. Depending on a shift in the timing at which the data amount is measured, however, the measured data amount may also be larger. In that case, the surplus data may be deleted at evenly distributed positions, or the data may be downsampled so that the data amount decreases by the surplus.
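The surplus case can be handled symmetrically to insertion by dropping samples at evenly spaced positions. The sketch below is illustrative only; the function name `delete_evenly` and the integer-ratio rule for choosing which samples to drop are assumptions, not details given in the text.

```python
from typing import List


def delete_evenly(samples: List[int], surplus: int) -> List[int]:
    """Return a copy of `samples` with `surplus` samples removed at even spacing.

    samples: buffered input audio data for one unit
    surplus: number of samples by which the unit exceeds the preset amount
    """
    n = len(samples)
    if surplus <= 0:
        return list(samples)
    if surplus >= n:
        return []
    out: List[int] = []
    removed = 0
    for i, s in enumerate(samples):
        # Number of samples that should have been dropped after i + 1 inputs.
        due = (i + 1) * surplus // n
        if removed < due:
            removed += 1           # drop this sample
        else:
            out.append(s)
    return out
```

With eight samples and a surplus of two, the fourth and eighth samples are dropped, leaving six samples with the deletions spaced apart.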

【００４４】[0044]

[Effects of the Invention] As described above, the present invention comprises means for measuring a first data amount per unit time of input audio data, means for comparing the first data amount with a preset second data amount per unit time, and means for inserting correction data into the input audio data and outputting it. When the first data amount is equal to the second data amount, the input audio data is output. When the first data amount is smaller than the second data amount, correction data amounting to the difference between the second data amount and the first data amount is inserted, evenly distributed, into the input audio data following the point at which the data amount was determined to be small, and the result is output. This corrects input audio data that has losses and makes it possible to create encoded data in which the video data and the audio data are synchronized.

[0045] The present invention also comprises means for measuring a first data amount per unit time of input audio data, means for comparing the first data amount with a preset second data amount per unit time, means for temporarily storing the input audio data, and means for inserting correction data into the input audio data and outputting it. When the first data amount is equal to the second data amount, the input audio data stored in the storing means is output. When the first data amount is smaller than the second data amount, correction data amounting to the difference between the second data amount and the first data amount is inserted, evenly distributed, into the input audio data stored in the storing means, and the result is output. This corrects input audio data that has losses and makes it possible to create encoded data in which the video data and the audio data are synchronized.

[0046] Further, in the present invention, the means for inserting correction data into the input audio data and outputting it may, instead of inserting correction data, upsample and re-interpolate the input audio data so that the data amount increases by the difference between the second data amount and the first data amount. This corrects input audio data that has losses and makes it possible to create encoded data in which the video data and the audio data are synchronized.

[Brief description of the drawings]

FIG. 1 is a block diagram showing an example of the configuration of Embodiment 1 of the present invention.

FIG. 2 is a block diagram showing an example of the configuration of Embodiment 2 of the present invention.

FIG. 3 is an explanatory diagram illustrating an example of the operation of Embodiment 1 of the present invention.

FIG. 4 is an explanatory diagram illustrating an example of the operation of Embodiment 2 of the present invention.

FIG. 5 is a flowchart showing an example of the configuration of Embodiment 5 of the present invention.

FIG. 6 is a flowchart showing an example of the configuration of Embodiment 6 of the present invention.

[Explanation of symbols]

11, 21: Data amount measurement means
12, 22: Data amount comparison means
13, 23: Correction data insertion means
27: Buffer
52, 62: Data amount measurement step
53, 63: Data amount comparison step
55, 65: Correction data insertion step
56, 66: Data output step
67: Data temporary storage step

Continued from the front page
(72) Inventor: Aki Yoneda, c/o Matsushita Electric Industrial Co., Ltd., 1006 Oaza Kadoma, Kadoma-shi, Osaka
F-terms (reference): 5C063 AB05 AC05 BA08 CA14 CA20 CA40; 5K028 EE03 EE08 KK32 SS03 SS24

Claims


1. An audio data correction device comprising: means for measuring a first data amount per unit time of input audio data; means for comparing the first data amount with a preset second data amount per unit time; and means for outputting the input audio data when the first data amount is equal to the second data amount, and, when the first data amount is smaller than the second data amount, inserting correction data amounting to the difference between the second data amount and the first data amount, evenly distributed, into the input audio data following the point at which the data amount was determined to be small, and outputting the result.

2. An audio data correction device comprising: means for measuring a first data amount per unit time of input audio data; means for comparing the first data amount with a preset second data amount per unit time; means for temporarily storing the input audio data; and means for outputting the input audio data stored in the storing means when the first data amount is equal to the second data amount, and, when the first data amount is smaller than the second data amount, inserting correction data amounting to the difference between the second data amount and the first data amount, evenly distributed, into the input audio data stored in the storing means, and outputting the result.

3. The audio data correction device according to claim 1 or 2, wherein the correction data is any of silent data, the immediately preceding audio data, the immediately following audio data, or data interpolated from the immediately preceding and immediately following audio data.

4. The audio data correction device according to claim 1 or 2, wherein, instead of inserting the correction data, the input audio data is upsampled and re-interpolated so that the data amount increases by the difference between the second data amount and the first data amount.

5. An audio data correction method comprising: a step of measuring a first data amount per unit time of input audio data; a step of comparing the first data amount with a preset second data amount per unit time; and a step of outputting the input audio data when the first data amount is equal to the second data amount, and, when the first data amount is smaller than the second data amount, inserting correction data amounting to the difference between the second data amount and the first data amount, evenly distributed, into the input audio data following the point at which the data amount was determined to be small, and outputting the result.

6. An audio data correction method comprising: a step of measuring a first data amount per unit time of input audio data; a step of comparing the first data amount with a preset second data amount per unit time; a step of temporarily storing the input audio data; and a step of outputting the input audio data stored in the storing step when the first data amount is equal to the second data amount, and, when the first data amount is smaller than the second data amount, inserting correction data amounting to the difference between the second data amount and the first data amount, evenly distributed, into the input audio data stored in the storing step, and outputting the result.

7. The audio data correction method according to claim 5 or 6, wherein the correction data is any of silent data, the immediately preceding audio data, the immediately following audio data, or data interpolated from the immediately preceding and immediately following audio data.

8. The audio data correction method according to claim 5 or 6, wherein, instead of inserting the correction data, the input audio data is upsampled and re-interpolated so that the data amount increases by the difference between the second data amount and the first data amount.