JP2011172153A

JP2011172153A - Media encoding and transmitting apparatus

Info

Publication number: JP2011172153A
Application number: JP2010036124A
Authority: JP
Inventors: Hiroshi Nakada; 浩志中田
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 2010-02-22
Filing date: 2010-02-22
Publication date: 2011-09-01

Abstract

<P>PROBLEM TO BE SOLVED: To prevent band utilization efficiency from being deteriorated, while maintaining media reproduction quality, and to prevent a transmission error even when using a transmission line limited in band of the line. <P>SOLUTION: A media encoding and transmitting apparatus includes an importance level discriminating section 13, an encoding rate determining section 14, and the like. The importance level discriminating section 13 discriminates an importance level of media data to be encoded. With the importance level discriminated by the importance level discriminating section 13 as a criterion, the encoding rate determining section 14 determines an encoding rate E<SB>rate</SB>when compression-encoding the media data. <P>COPYRIGHT: (C)2011,JPO&INPIT

Description

この発明は、メディアデータを圧縮符号化して、圧縮符号化後のメディアデータであるメディア符号化データをネットワーク経由でリアルタイムに伝送するメディア符号化伝送装置に関するものである。 The present invention relates to a media encoding / transmission apparatus that compresses and encodes media data and transmits media encoded data that is media data after compression encoding in real time via a network.

例えば、映像データや音声データなどのメディアデータを圧縮符号化し、圧縮符号化後のメディアデータであるメディア符号化データをネットワーク経由でリアルタイムに伝送するメディア符号化伝送装置が開発されている。
ただし、メディア符号化伝送装置から送出されたメディア符号化データが受信側に到達するまでの間に、ネットワークを構成する様々な中継器や、特性が異なる回線を経由するため、伝送の途中で発生した電気的ノイズなどの影響でメディア符号化データが破損したり、伝送データの輻輳により中継器でメディア符号化データが破棄されたりして、メディア符号化データに伝送誤りが発生することがある。 For example, media encoding and transmission devices have been developed that compress and encode media data such as video data and audio data, and transmit media encoded data that is media data after compression encoding in real time via a network.
However, it occurs in the middle of transmission because the media encoded data sent from the media encoding transmission device passes through various repeaters constituting the network and circuits with different characteristics until it reaches the receiving side. The encoded media data may be damaged due to the influence of electrical noise or the like, or the encoded media may be discarded by the repeater due to congestion of the transmitted data, resulting in a transmission error in the encoded media data.

伝送誤りが発生した場合の対策として、受信側が伝送誤りの発生を検知すると、メディア符号化データの再送を送信側に要求し、送信側から再送されたメディア符号化データを用いて、破損しているメディア符号化データの修復を行う再送処理が一般的に行われている。
ただし、このような再送処理の場合、送信側が受信側から再送要求を受信した時点から、メディア符号化データの再送を開始するため、破損しているメディア符号化データの修復が完了するまでに遅延が発生し、メディア符号化データのリアルタイム伝送には適していない。 As a countermeasure when a transmission error occurs, when the receiving side detects the occurrence of a transmission error, it requests the transmitting side to retransmit the media encoded data, and the media encoded data retransmitted from the transmitting side is used to cause damage. In general, a retransmission process is performed to repair existing media encoded data.
However, in the case of such retransmission processing, since the transmission side starts retransmission of the media encoded data from the time when the transmission side receives the retransmission request from the reception side, there is a delay until the repair of the damaged media encoded data is completed. This is not suitable for real-time transmission of media encoded data.

破損しているメディア符号化データを遅延なく誤り訂正を行う方式として、受信側において、伝送誤りを訂正することが可能な誤り訂正用データを予めメディア符号化データに付加して送信する前方誤り訂正（ＦｏｒｗａｒｄＥｒｒｏｒＣｏｒｒｅｃｔｉｏｎ：ＦＥＣ）の方式がある（例えば、非特許文献１を参照）。
前方誤り訂正方式を用いる場合、メディア符号化データを構成しているメディアデータ伝送用パケットを送信する際、そのメディアデータ伝送用パケットの一定間隔毎（例えば、１０パケット毎）に、ＦＥＣパケットを１個挿入して送信する。
ＦＥＣパケットの内容は、ＦＥＣパケットを送信する直前の例えば１０個のメディアデータ伝送用パケットのＸＯＲ（ｅＸｃｌｕｓｉｖｅＯＲ：排他的論理和）をとったものである。 Forward error correction in which error correction data capable of correcting transmission errors is added to media encoded data in advance and transmitted on the receiving side as a method for error correction of damaged media encoded data without delay (Forward Error Correction: FEC) is available (for example, see Non-Patent Document 1).
When the forward error correction method is used, when transmitting media data transmission packets constituting media encoded data, one FEC packet is set at regular intervals (for example, every 10 packets) of the media data transmission packets. Insert and send.
The content of the FEC packet is an XOR (eXclusive OR) of, for example, 10 media data transmission packets immediately before transmitting the FEC packet.

受信側では、メディアデータ伝送用パケットに連続的に付けられている番号等を参照することでパケットの欠落を検知すると、欠落していない残りの９個のメディアデータ伝送用パケットとＦＥＣパケットを合わせた合計１０個のパケットのＸＯＲをとることで、欠落しているメディアデータ伝送用パケットを復旧させることができる。
しかし、前方誤り訂正方式を用いる場合、伝送誤りの発生の有無に関わらず、常時、一定量のＦＥＣパケットを伝送する必要があるため、本来のメディア符号化データを伝送するための帯域が減少して、伝送効率が低下してしまうことになる。 When the receiving side detects a missing packet by referring to the number continuously attached to the media data transmission packet, the remaining nine media data transmission packets and the FEC packet that are not missing are combined. Further, by taking XOR of a total of 10 packets, the missing media data transmission packet can be recovered.
However, when the forward error correction method is used, it is necessary to always transmit a certain amount of FEC packets regardless of whether or not a transmission error occurs, so that the band for transmitting original media encoded data is reduced. As a result, transmission efficiency is reduced.

そこで、メディアデータの重要度に応じて誤り訂正用データのデータ量を制御することで、帯域利用効率の低下を防止するメディア符号化伝送装置が開発されている（例えば、特許文献１を参照）。
このようなメディア符号化伝送装置では、メディアデータの重要度（例えば、ＭＰＥＧのピクチャタイプ）に応じて、誤り訂正用データのデータ長を変更している。
誤り訂正用データのデータ長が長い程、誤り訂正能力が高くなるため、重要なメディアデータに付加する誤り訂正用データのデータ長を長くし、それ以外のメディアデータに付加する誤り訂正用データのデータ長を短くすることで、メディアの再生品質を維持しながら、帯域利用効率の低下を防止している。 In view of this, a media encoding and transmission apparatus has been developed that prevents a reduction in bandwidth utilization efficiency by controlling the data amount of error correction data according to the importance of media data (see, for example, Patent Document 1). .
In such a media encoding / transmission apparatus, the data length of the error correction data is changed according to the importance of the media data (for example, MPEG picture type).
The longer the data length of the error correction data, the higher the error correction capability. Therefore, the data length of the error correction data added to important media data is lengthened, and the error correction data to be added to other media data By shortening the data length, the bandwidth utilization efficiency is prevented from decreasing while maintaining the media playback quality.

特開２００８−９２２１４号公報（段落番号［００１６］）JP 2008-92214 A (paragraph number [0016])

ＩＥＴＦＲＦＣ２７３３ “ＡｎＲＴＰＰａｙｌｏａｄＦｏｒｍａｔｆｏｒＧｅｎｅｒｉｃＦｏｒｗａｒｄＥｒｒｏｒＣｏｒｒｅｃｔｉｏｎ”IETF RFC 2733 “An RTP Payload Format for Generic Forward Error Correction”

従来のメディア符号化伝送装置は以上のように構成されているので、メディアデータの重要度に応じて、誤り訂正用データのデータ長を制御することで、帯域利用効率の低下を防止している。しかし、メディアデータのデータ量が常時一定量であっても、誤り訂正用データのデータ長がメディアデータの重要度に応じて変化するので、メディアデータと誤り訂正用データを合わせた全体のデータ長が時間毎に変動する。このため、例えば、衛星通信や無線通信のように、回線の帯域に制限がある伝送路を使用する場合、全体のデータ長が長くなって伝送量が増加したときに、回線の帯域の制限を越えてしまって、伝送誤りが発生することがある課題があった。 Since the conventional media encoding / transmission apparatus is configured as described above, it controls the data length of the error correction data according to the importance of the media data, thereby preventing a reduction in bandwidth utilization efficiency. . However, even if the amount of media data is always constant, the data length of the error correction data changes according to the importance of the media data, so the total data length of the media data and the error correction data combined. Fluctuates from hour to hour. For this reason, for example, when using a transmission line with limited line bandwidth, such as satellite communication or wireless communication, when the entire data length increases and the transmission amount increases, the line bandwidth is limited. There is a problem that a transmission error may occur due to overrun.

この発明は上記のような課題を解決するためになされたもので、メディアの再生品質を維持しながら、帯域利用効率の低下を防止することができるとともに、回線の帯域に制限がある伝送路を使用する場合でも、伝送誤りの発生を防止することができるメディア符号化伝送装置を得ることを目的とする。 The present invention has been made to solve the above-described problems, and while maintaining the reproduction quality of media, it is possible to prevent a decrease in bandwidth utilization efficiency and to provide a transmission line with a limited bandwidth of the line. An object of the present invention is to obtain a media encoding transmission apparatus that can prevent transmission errors even when used.

この発明に係るメディア符号化伝送装置は、符号化対象のメディアデータの重要度を判別する重要度判別手段と、重要度判別手段により判別された重要度を基準にして、そのメディアデータを圧縮符号化する際の符号化レートを決定するとともに、圧縮符号化後のメディアデータに付加する誤り訂正符号の付加レートを決定するレート決定手段と、レート決定手段により決定された符号化レートでメディアデータを圧縮符号化して、圧縮符号化後のメディアデータであるメディア符号化データを出力するメディアデータ符号化手段と、メディアデータ符号化手段から出力されたメディア符号化データに対して、レート決定手段により決定された付加レートで誤り訂正符号を付加し、誤り訂正符号付きのメディア符号化データを出力する誤り訂正符号付加手段と、誤り訂正符号付加手段から出力された誤り訂正符号付きのメディア符号化データをネットワークに送出するデータ送出手段とを設け、データ送出手段から送出された誤り訂正符号付きのメディア符号化データの伝送レートが許容範囲内から逸脱している場合、レート補正手段がレート決定手段に対して符号化レート及び付加レートの補正を指示して、伝送レートを許容範囲内に収めるようにしたものである。 A media encoding transmission apparatus according to the present invention includes an importance level determining unit that determines the level of importance of media data to be encoded, and compressing the media data based on the level of importance determined by the level of importance determining unit. A rate determining means for determining an additional rate of an error correction code to be added to the media data after compression encoding, and a media rate at the encoding rate determined by the rate determining means. Media data encoding means for compressing and outputting media encoded data, which is media data after compression encoding, and media encoded data output from the media data encoding means determined by the rate determining means Error correction code is added at the specified additional rate and media encoded data with error correction code is output. Code adding means and data sending means for sending the media encoded data with error correction code output from the error correcting code adding means to the network are provided, and media encoding with error correcting code sent from the data sending means is provided When the data transmission rate deviates from the allowable range, the rate correction means instructs the rate determination means to correct the encoding rate and the additional rate so that the transmission rate falls within the allowable range. It is.

この発明によれば、符号化対象のメディアデータの重要度を判別する重要度判別手段と、重要度判別手段により判別された重要度を基準にして、そのメディアデータを圧縮符号化する際の符号化レートを決定するとともに、圧縮符号化後のメディアデータに付加する誤り訂正符号の付加レートを決定するレート決定手段と、レート決定手段により決定された符号化レートでメディアデータを圧縮符号化して、圧縮符号化後のメディアデータであるメディア符号化データを出力するメディアデータ符号化手段と、メディアデータ符号化手段から出力されたメディア符号化データに対して、レート決定手段により決定された付加レートで誤り訂正符号を付加し、誤り訂正符号付きのメディア符号化データを出力する誤り訂正符号付加手段と、誤り訂正符号付加手段から出力された誤り訂正符号付きのメディア符号化データをネットワークに送出するデータ送出手段とを設け、データ送出手段から送出された誤り訂正符号付きのメディア符号化データの伝送レートが許容範囲内から逸脱している場合、レート補正手段がレート決定手段に対して符号化レート及び付加レートの補正を指示して、伝送レートを許容範囲内に収めるように構成したので、メディアの再生品質を維持しながら、帯域利用効率の低下を防止することができるとともに、回線の帯域に制限がある伝送路を使用する場合でも、伝送誤りの発生を防止することができる効果がある。 According to the present invention, the importance determining means for determining the importance of the media data to be encoded, and the code for compressing and encoding the media data on the basis of the importance determined by the importance determining means A rate determination means for determining an addition rate of an error correction code to be added to the media data after compression encoding, and compression encoding the media data at the encoding rate determined by the rate determination means, Media data encoding means for outputting media encoded data that is media data after compression encoding, and the media encoded data output from the media data encoding means at an additional rate determined by the rate determining means Error correction code adding means for adding error correction code and outputting media encoded data with error correction code, and error Data transmission means for sending the media encoded data with error correction code output from the positive code adding means to the network, and the transmission rate of the media encoded data with error correction code sent from the data transmission means is allowed When the rate deviates from the range, the rate correction means instructs the rate determination means to correct the encoding rate and the additional rate so that the transmission rate falls within the allowable range. While maintaining the above, it is possible to prevent a decrease in bandwidth utilization efficiency and to prevent the occurrence of transmission errors even when a transmission line with a limited bandwidth is used.

この発明の実施の形態１によるメディア符号化伝送装置を示す構成図である。It is a block diagram which shows the media coding transmission apparatus by Embodiment 1 of this invention. この発明の実施の形態１によるメディア符号化伝送装置の処理内容を示すフローチャートである。It is a flowchart which shows the processing content of the media coding transmission apparatus by Embodiment 1 of this invention. 符号化レートＥ_ｒａｔｅで圧縮符号化されたメディアデータ及び付加レートＡ_ｒａｔｅで付加された誤り訂正符号を示す説明図である。It is explanatory drawing which shows the media data compression-encoded with the encoding rate E _rate , and the error correction code added with the addition rate A _rate . この発明の実施の形態２によるメディア符号化伝送装置を示す構成図である。It is a block diagram which shows the media coding transmission apparatus by Embodiment 2 of this invention. この発明の実施の形態３によるメディア符号化伝送装置を示す構成図である。It is a block diagram which shows the media coding transmission apparatus by Embodiment 3 of this invention. 映像音声多重部２２により多重化された映像符号化データ及び音声符号化データと、映像音声整列部２３により並び替えが行われた映像符号化データ及び音声符号化データと、誤り訂正符号が付加された映像符号化データ及び音声符号化データとを示す説明図である。Video encoded data and audio encoded data multiplexed by the video / audio multiplexing unit 22, video encoded data and audio encoded data rearranged by the video / audio aligning unit 23, and an error correction code are added. It is explanatory drawing which shows the video coding data and audio | voice coding data which were read. この発明の実施の形態９によるメディア符号化伝送装置を示す構成図である。It is a block diagram which shows the media coding transmission apparatus by Embodiment 9 of this invention.

実施の形態１．
図１はこの発明の実施の形態１によるメディア符号化伝送装置を示す構成図である。
図１において、メディア符号化伝送装置１はメディアソース２から例えば映像データや音声データなどのメディアデータを入力し、そのメディアデータを圧縮符号化して、圧縮符号化後のメディアデータであるメディア符号化データをネットワーク３に送出する装置である。
メディアソース２は例えば映像データや音声データなどのメディアデータを出力することが可能な装置であり、例えば、ビデオカメラやＶＴＲなどが該当する。
ネットワーク３は例えば無線ＬＡＮや衛星通信回線などが該当する。 Embodiment 1 FIG.
1 is a block diagram showing a media encoding / transmission apparatus according to Embodiment 1 of the present invention.
In FIG. 1, a media encoding / transmission apparatus 1 receives media data such as video data and audio data from a media source 2, compresses and encodes the media data, and encodes media data that is media data after compression encoding. This is a device for sending data to the network 3.
The media source 2 is a device that can output media data such as video data and audio data, and corresponds to, for example, a video camera or a VTR.
The network 3 corresponds to, for example, a wireless LAN or a satellite communication line.

メディア圧縮符号化処理部１１は特徴量抽出処理部１２及び符号化処理部１６から構成されており、メディアソース２からメディアデータを入力すると、そのメディアデータの特徴量を抽出して、その特徴量を重要度判別部１３に出力するとともに、そのメディアデータを圧縮符号化して、圧縮符号化後のメディアデータであるメディア符号化データを誤り訂正符号付加部１７に出力する処理を実施する。 The media compression encoding processing unit 11 includes a feature amount extraction processing unit 12 and an encoding processing unit 16. When media data is input from the media source 2, the feature amount of the media data is extracted and the feature amount is extracted. Is output to the importance determination unit 13, and the media data is compressed and encoded, and the media encoded data that is the media data after the compression encoding is output to the error correction code adding unit 17.

特徴量抽出処理部１２はメディアソース２から符号化対象のメディアデータを入力するインタフェース（例えば、ビデオ入力端子や音声入力端子）を備えており、入力した符号化対象のメディアデータの特徴量（例えば、動きベクトル、ＤＣＴ（ＤｉｓｃｒｅｔｅＣｏｓｉｎｅＴｒａｎｓｆｏｒｍ：離散コサイン変換）係数、音声データの有無など）を抽出して、その特徴量を重要度判別部１３及び符号化処理部１６に出力するとともに、そのメディアデータを符号化処理部１６に出力する処理を実施する。
重要度判別部１３は特徴量抽出処理部１２から出力された特徴量を参照して、符号化対象のメディアデータの重要度を判別する処理を実施する。
なお、特徴量抽出処理部１２及び重要度判別部１３から重要度判別手段が構成されている。 The feature amount extraction processing unit 12 includes an interface (for example, a video input terminal or an audio input terminal) for inputting media data to be encoded from the media source 2, and features (for example, feature amounts of the input media data to be encoded) , Motion vectors, DCT (Discrete Cosine Transform) coefficients, presence / absence of voice data, etc.) are extracted, and their feature values are output to the importance level discrimination unit 13 and the encoding processing unit 16, and the media data Is output to the encoding processing unit 16.
The importance determination unit 13 refers to the feature amount output from the feature amount extraction processing unit 12 and performs a process of determining the importance of the media data to be encoded.
The feature amount extraction processing unit 12 and the importance level determination unit 13 constitute importance level determination means.

符号化レート決定部１４は重要度判別部１３により判別された重要度を基準にして、特徴量抽出処理部１２により入力されたメディアデータを圧縮符号化する際の符号化レートを決定する処理を実施する。
誤り訂正符号付加レート決定部１５は重要度判別部１３により判別された重要度を基準にして、符号化処理部１６により圧縮符号化されたメディアデータに付加する誤り訂正符号の付加レートを決定する処理を実施する。
なお、符号化レート決定部１４及び誤り訂正符号付加レート決定部１５からレート決定手段が構成されている。 The encoding rate determination unit 14 performs a process of determining an encoding rate when the media data input by the feature amount extraction processing unit 12 is compression-encoded based on the importance determined by the importance determination unit 13. carry out.
The error correction code addition rate determination unit 15 determines the addition rate of the error correction code added to the media data compressed and encoded by the encoding processing unit 16 based on the importance determined by the importance determination unit 13. Implement the process.
The coding rate determination unit 14 and the error correction code addition rate determination unit 15 constitute a rate determination unit.

符号化処理部１６は符号化レート決定部１４により決定された符号化レートで、特徴量抽出処理部１２から出力された特徴量に応じてメディアデータを圧縮符号化し、圧縮符号化後のメディアデータであるメディア符号化データを出力する処理を実施する。なお、符号化処理部１６はメディアデータ符号化手段を構成している。
誤り訂正符号付加部１７は符号化処理部１６から出力されたメディア符号化データに対して、誤り訂正符号付加レート決定部１５により決定された付加レートで誤り訂正符号を付加し、誤り訂正符号付きのメディア符号化データを出力する処理を実施する。なお、誤り訂正符号付加部１７は誤り訂正符号付加手段を構成している。 The encoding processing unit 16 compresses and encodes the media data according to the feature amount output from the feature amount extraction processing unit 12 at the encoding rate determined by the encoding rate determination unit 14, and the media data after the compression encoding The process which outputs the media coding data which is is implemented. Note that the encoding processing unit 16 constitutes media data encoding means.
The error correction code adding unit 17 adds an error correction code to the media encoded data output from the encoding processing unit 16 at the additional rate determined by the error correction code additional rate determining unit 15 and includes the error correction code. The process of outputting the encoded media data is executed. The error correction code adding unit 17 constitutes an error correction code adding means.

データ送出部１８はネットワーク３に対するインタフェース（例えば、無線ＬＡＮアダプタ、無線通信機器など）を備えており、誤り訂正符号付加部１７から出力された誤り訂正符号付きのメディア符号化データをネットワーク３に送出する処理を実施する。なお、データ送出部１８はデータ送出手段を構成している。
伝送レート補正部１９はデータ送出部１８から送出された誤り訂正符号付きのメディア符号化データの伝送レートが予め設定されている許容範囲内から逸脱している場合、符号化レート決定部１４及び誤り訂正符号付加レート決定部１５に対して、符号化レート及び付加レートの補正を指示して、その伝送レートを許容範囲内に収める処理を実施する。なお、伝送レート補正部１９はレート補正手段を構成している。 The data transmission unit 18 includes an interface (for example, a wireless LAN adapter, a wireless communication device, etc.) to the network 3, and transmits the media encoded data with the error correction code output from the error correction code addition unit 17 to the network 3. Perform the process. The data sending unit 18 constitutes data sending means.
When the transmission rate of the media encoded data with the error correction code sent from the data sending unit 18 deviates from the preset allowable range, the transmission rate correcting unit 19 and the coding rate determining unit 14 and the error The correction code addition rate determination unit 15 is instructed to correct the encoding rate and the addition rate, and performs processing for keeping the transmission rate within an allowable range. The transmission rate correction unit 19 constitutes a rate correction unit.

図１の例では、メディア符号化伝送装置の構成要素である特徴量抽出処理部１２、重要度判別部１３、符号化レート決定部１４、誤り訂正符号付加レート決定部１５、符号化処理部１６、誤り訂正符号付加部１７、データ送出部１８及び伝送レート補正部１９のそれぞれが専用のハードウェア（例えば、ＣＰＵを実装している半導体集積回路、あるいは、ワンチップマイコンなど）で構成されているものを想定しているが、メディア符号化伝送装置がコンピュータで構成される場合、特徴量抽出処理部１２、重要度判別部１３、符号化レート決定部１４、誤り訂正符号付加レート決定部１５、符号化処理部１６、誤り訂正符号付加部１７、データ送出部１８及び伝送レート補正部１９の処理内容を記述しているプログラムをコンピュータのメモリに格納し、当該コンピュータのＣＰＵが当該メモリに格納されているプログラムを実行するようにしてもよい。
図２はこの発明の実施の形態１によるメディア符号化伝送装置の処理内容を示すフローチャートである。 In the example of FIG. 1, the feature amount extraction processing unit 12, the importance level determination unit 13, the encoding rate determination unit 14, the error correction code addition rate determination unit 15, and the encoding processing unit 16 that are components of the media encoding transmission apparatus. Each of the error correction code adding unit 17, the data sending unit 18, and the transmission rate correcting unit 19 is configured by dedicated hardware (for example, a semiconductor integrated circuit on which a CPU is mounted, or a one-chip microcomputer). However, when the media encoding / transmission apparatus is configured by a computer, the feature amount extraction processing unit 12, the importance level determination unit 13, the encoding rate determination unit 14, the error correction code addition rate determination unit 15, A program describing the processing contents of the encoding processing unit 16, the error correction code adding unit 17, the data sending unit 18 and the transmission rate correcting unit 19 is stored in a computer program. Stored in the re, CPU of the computer may execute a program stored in the memory.
FIG. 2 is a flowchart showing the processing contents of the media encoding / transmission apparatus according to Embodiment 1 of the present invention.

次に動作について説明する。
メディア圧縮符号化処理部１１の特徴量抽出処理部１２は、メディアソース２から符号化対象のメディアデータ（映像データ、音声データ）を入力すると（ステップＳＴ１）、そのメディアデータの特徴量を抽出する（ステップＳＴ２）。
メディアデータの特徴量としては、例えば、メディアデータのピクチャタイプ、映像データが圧縮符号化される際に用いられる動きベクトル、映像データが圧縮符号化される際に算出されるＤＣＴ係数、メディアデータにおける音声データの有無などが考えられる。 Next, the operation will be described.
When the feature data extraction processing unit 12 of the media compression coding processing unit 11 inputs media data (video data, audio data) to be coded from the media source 2 (step ST1), the feature value extraction processing unit 12 extracts the feature value of the media data. (Step ST2).
Examples of the feature amount of the media data include, for example, the picture type of the media data, the motion vector used when the video data is compressed and encoded, the DCT coefficient calculated when the video data is compressed and encoded, and the media data The presence or absence of audio data is considered.

特徴量を抽出する具体的な手順として、例えば、「ＩＳＯ／ＩＥＣＭＰＥＧ（ＭｏｖｉｎｇＰｉｃｔｕｒｅＥｘｐｅｒｔｓＧｒｏｕｐ）−１／２／４」や、「ＩＴＵ−Ｔ勧告Ｈ．２６１／２６２／２６３／２６４」に規定されている圧縮符号化技術を利用すればよいので、ここでは詳細な説明を省略する。
特徴量抽出処理部１２は、メディアデータの特徴量を抽出すると、その特徴量を重要度判別部１３及び符号化処理部１６に出力するとともに、そのメディアデータを符号化処理部１６に出力する。 As specific procedures for extracting feature quantities, for example, “ISO / IEC MPEG (Moving Picture Experts Group) -1/2/4” and “ITU-T Recommendation H.261 / 262/263/264” are specified. Therefore, detailed description thereof will be omitted here.
When the feature amount extraction processing unit 12 extracts the feature amount of the media data, the feature amount extraction processing unit 12 outputs the feature amount to the importance determination unit 13 and the encoding processing unit 16 and outputs the media data to the encoding processing unit 16.

重要度判別部１３は、特徴量抽出処理部１２からメディアデータの特徴量を受けると、その特徴量を参照して、符号化対象のメディアデータの重要度を判別する（ステップＳＴ３）。
メディアデータの重要度は、特徴量に基づいて判別するが（重要度の具体的な判別処理は、実施の形態４〜８で説明する）、メディアデータの重要度は、「符号化対象のメディアデータの欠落が、メディア再生の品質に及ぼす影響の度合い」を示す指標であり、影響度が大きい程、重要度が高い。 When receiving the feature amount of the media data from the feature amount extraction processing unit 12, the importance level determination unit 13 refers to the feature amount to determine the importance level of the media data to be encoded (step ST3).
The importance of the media data is determined based on the feature amount (the specific determination process of the importance will be described in Embodiments 4 to 8), but the importance of the media data is “the medium to be encoded” This is an index indicating the “degree of influence of data loss on the quality of media playback”. The greater the degree of influence, the higher the importance.

符号化レート決定部１４は、重要度判別部１３がメディアデータの重要度を判別すると、その重要度を基準にして、特徴量抽出処理部１２により入力されたメディアデータを圧縮符号化する際の符号化レートＥ_ｒａｔｅを決定し、その符号化レートＥ_ｒａｔｅを符号化処理部１６に出力する（ステップＳＴ４）。
符号化レートＥ_ｒａｔｅの具体的な決定処理は後述する。 When the importance determination unit 13 determines the importance of the media data, the encoding rate determination unit 14 compresses and encodes the media data input by the feature amount extraction processing unit 12 based on the importance. The encoding rate E _rate is determined, and the encoding rate E _rate is output to the encoding processing unit 16 (step ST4).
Specific determination processing of the encoding rate E _rate will be described later.

符号化処理部１６は、符号化レート決定部１４から符号化レートＥ_ｒａｔｅを受けると、その符号化レートＥ_ｒａｔｅで、特徴量抽出処理部１２から出力された特徴量に応じてメディアデータを圧縮符号化し、圧縮符号化後のメディアデータであるメディア符号化データを誤り訂正符号付加部１７に出力する（ステップＳＴ５）。
メディアデータを圧縮符号化する具体的な手順として、例えば、「ＩＳＯ／ＩＥＣＭＰＥＧ（ＭｏｖｉｎｇＰｉｃｔｕｒｅＥｘｐｅｒｔｓＧｒｏｕｐ）−１／２／４」や、「ＩＴＵ−Ｔ勧告Ｈ．２６１／２６２／２６３／２６４」に規定されている圧縮符号化技術を利用すればよいので、ここでは詳細な説明を省略する。 When the encoding processing unit 16 receives the encoding rate E _rate from the encoding rate determining unit 14, the encoding processing unit 16 compresses the media data at the encoding rate E _rate in accordance with the feature amount output from the feature amount extraction processing unit 12. The encoded encoded media data, which is media data after compression encoding, is output to the error correction code adding unit 17 (step ST5).
Specific procedures for compressing and encoding media data include, for example, “ISO / IEC MPEG (Moving Picture Experts Group) -1/2/4” and “ITU-T Recommendation H.261 / 262/263/264”. Therefore, detailed description thereof is omitted here.

誤り訂正符号付加レート決定部１５は、重要度判別部１３がメディアデータの重要度を判別すると、その重要度を基準にして、符号化処理部１６により圧縮符号化されたメディアデータに付加する誤り訂正符号の付加レートＡ_ｒａｔｅを決定し、誤り訂正符号の付加レートＡ_ｒａｔｅを誤り訂正符号付加部１７に出力する（ステップＳＴ６）。
付加レートＡ_ｒａｔｅの具体的な決定処理は後述する。 When the importance level determination unit 13 determines the importance level of the media data, the error correction code addition rate determination unit 15 adds an error to the media data compressed and encoded by the encoding processing unit 16 based on the importance level. The correction code addition rate A _rate is determined, and the error correction code addition rate A _rate is output to the error correction code addition unit 17 (step ST6).
Specific processing for determining the additional rate A _rate will be described later.

誤り訂正符号付加部１７は、符号化処理部１６からメディア符号化データを受けると、そのメディア符号化データが伝送中に破損した場合に、受信側が破損したメディア符号化データを復旧させる際に利用する誤り訂正符号を生成する。
即ち、誤り訂正符号付加部１７は、符号化処理部１６から出力された付加レートＡ_ｒａｔｅに対応するデータ長の誤り訂正符号を生成し、その誤り訂正符号をメディア符号化データに付加する（ステップＳＴ７）。
誤り訂正符号を付加する具体的な手順については、例えば、リードソロモン符号の手順や、「ＩＥＴＦＲＦＣ２７３３ “ＡｎＲＴＰＰａｙｌｏａｄＦｏｒｍａｔｆｏｒＧｅｎｅｒｉｃＦｏｒｗａｒｄＥｒｒｏｒＣｏｒｒｅｃｔｉｏｎ”」に規定されているＦＥＣパケット挿入の手順を利用すればよいので、ここでは詳細な説明を省略する。 When the error correction code adding unit 17 receives the media encoded data from the encoding processing unit 16 and the media encoded data is damaged during transmission, the error correcting code adding unit 17 is used to restore the damaged media encoded data on the receiving side. An error correction code to be generated is generated.
That is, the error correction code adding unit 17 generates an error correction code having a data length corresponding to the additional rate A _rate output from the encoding processing unit 16, and adds the error correction code to the media encoded data (step). ST7).
For specific procedures for adding an error correction code, for example, the Reed-Solomon code procedure or the FEC packet insertion procedure defined in “IETF RFC2733“ An RTP Payload Format for Generic Forward Error Correction ”” is used. Detailed description will be omitted here.

ここで、図３は符号化レートＥ_ｒａｔｅで圧縮符号化されたメディアデータ及び付加レートＡ_ｒａｔｅで付加された誤り訂正符号を示す説明図である。
以下、図３を参照して、重要度判別部１３により判別された重要度に応じて、符号化レートＥ_ｒａｔｅ及び付加レートＡ_ｒａｔｅを決定する処理内容を具体的に説明する。
図３において、縦軸は誤り訂正符号付きのメディア符号化データ（メディア符号化データ＋誤り訂正符号）の伝送レートを表している。
横軸は誤り訂正符号付きのメディア符号化データの伝送時間を表しており、面積は伝送可能なメディア符号化データ及び誤り訂正符号の情報量を表している。 Here, FIG. 3 is an explanatory diagram showing the media data compressed and encoded at the encoding rate E _rate and the error correction code added at the additional rate A _rate .
Hereinafter, with reference to FIG. 3, processing contents for determining the encoding rate E _rate and the additional rate A _rate according to the importance determined by the importance determination unit 13 will be specifically described.
In FIG. 3, the vertical axis represents the transmission rate of media encoded data with error correction code (media encoded data + error correction code).
The horizontal axis represents the transmission time of media encoded data with an error correction code, and the area represents the amount of media encoded data and error correction code information that can be transmitted.

また、図３の例では、メディアデータの重要度が「高／中／低」の３段階の場合を示している。
図３の上側の図は、メディアデータの重要度が考慮されていない場合の誤り訂正符号とメディア符号化データの配分であり、メディアデータの重要度と関係なく、常に、誤り訂正符号とメディア符号化データのレート配分が一定である。
図３の下側の図は、メディアデータの重要度が考慮されている場合の誤り訂正符号とメディア符号化データの配分である。 Further, the example of FIG. 3 shows a case where the importance of media data is three stages of “high / medium / low”.
The upper diagram in FIG. 3 shows the distribution of error correction codes and media encoded data when the importance of the media data is not taken into account. The error correction code and the media code are always allocated regardless of the importance of the media data. The rate distribution of data is constant.
The lower diagram in FIG. 3 shows the distribution of error correction codes and media encoded data when the importance of the media data is taken into consideration.

重要度が高いメディアデータは、伝送誤りによって欠落すると、メディアの再生品質に与える影響が大きいので、仮に欠落しても受信側で正確に訂正できるように誤り訂正能力を上げる必要がある。
そこで、誤り訂正符号付加部１７は、メディアデータの重要度が高い場合、重要度が低い場合よりも、誤り訂正符号の付加レートＡ_ｒａｔｅを増やしている（図３の下側の図を参照）。
しかし、全体の伝送レートを一定に維持するには、誤り訂正符号の付加レートＡ_ｒａｔｅの増加に伴ってメディアデータの符号化レートＥ_ｒａｔｅを減少させる必要があるので、符号化レート決定部１４がメディアデータの符号化レートＥ_ｒａｔｅを下げている。 If media data with high importance is lost due to a transmission error, it has a great influence on the reproduction quality of the media. Therefore, even if it is lost, it is necessary to improve the error correction capability so that it can be accurately corrected on the receiving side.
Therefore, the error correction code adding unit 17 increases the error correction code addition rate A _rate when the importance of the media data is higher than when the importance is low (see the lower diagram in FIG. 3). .
However, in order to maintain the entire transmission rate constant, it is necessary to decrease the encoding rate E _rate of the media data as the additional rate A _rate of the error correction code increases. The encoding rate E _rate of the media data is lowered.

ただし、誤り訂正符号の付加レートＡ_ｒａｔｅを増す前後で、同等の情報量を伝送可能にするため、全体の伝送時間のうち、重要度が高いメディアデータの符号化データに割り当てる伝送時間を長くして、伝送可能な情報量を確保している。
重要度が高いメディアデータの符号化データに割り当てる伝送時間を長くした分、重要度が「中／低」のメディアデータの符号化データに割り当てる伝送時間を短くすることで、全体の伝送時間を維持している。
なお、伝送時間の短縮によって、重要度が「中／低」のメディアデータの伝送可能な情報量が不足する場合には、重要度が低いメディアデータの符号化データに付加する誤り訂正符号の付加レートＡ_ｒａｔｅを減少させて情報量を確保するようにする。 However, in order to make it possible to transmit the same amount of information before and after increasing the additional rate A _rate of the error correction code, the transmission time allocated to the encoded data of the media data having a high importance is increased in the entire transmission time. The amount of information that can be transmitted is secured.
Maintaining the overall transmission time by shortening the transmission time allocated to encoded data of medium / low importance media data by increasing the transmission time allocated to encoded data of media data with high importance is doing.
If the amount of information that can be transmitted with medium / low importance media data is insufficient due to a reduction in transmission time, an error correction code is added to the encoded data of media data with low importance. The amount of information is secured by decreasing the rate A _rate .

データ送出部１８は、誤り訂正符号付加部１７から誤り訂正符号付きのメディア符号化データを受けると、誤り訂正符号付きのメディア符号化データをネットワーク３に送出する（ステップＳＴ８）。
上述したように、符号化レートＥ_ｒａｔｅ及び付加レートＡ_ｒａｔｅがメディアデータの重要度に応じて決定されているので、データ送出部１８から送出される誤り訂正符号付きのメディア符号化データの伝送レートは一定に制御される。
しかし、実際には、メディアデータの内容等によって伝送レートに誤差が生じることがある。 Upon receiving the media encoded data with the error correction code from the error correction code adding unit 17, the data transmission unit 18 transmits the media encoded data with the error correction code to the network 3 (step ST8).
As described above, since the encoding rate E _rate and the additional rate A _rate are determined according to the importance of the media data, the transmission rate of the media encoded data with the error correction code sent from the data sending unit 18 Is controlled to be constant.
However, actually, an error may occur in the transmission rate depending on the contents of the media data.

そこで、伝送レート補正部１９は、データ送出部１８から送出された誤り訂正符号付きのメディア符号化データの伝送レートを監視し、その伝送レートが予め設定されている許容範囲内から逸脱していれば（ステップＳＴ９）、符号化レート決定部１４及び誤り訂正符号付加レート決定部１５に対して、符号化レートＥ_ｒａｔｅ及び付加レートＡ_ｒａｔｅの補正を指示して、その伝送レートを許容範囲内に収めるようにする（ステップＳＴ１０）。
具体的には、以下の通りである。 Therefore, the transmission rate correction unit 19 monitors the transmission rate of the media encoded data with the error correction code sent from the data sending unit 18, and the transmission rate deviates from the preset allowable range. (Step ST9), the encoding rate determination unit 14 and the error correction code additional rate determination unit 15 are instructed to correct the encoding rate E _rate and the additional rate A _rate , and the transmission rate falls within the allowable range. (Step ST10).
Specifically, it is as follows.

伝送レート補正部１９は、データ送出部１８から送出された誤り訂正符号付きのメディア符号化データの伝送レートを監視して、そのメディア符号化データの全体の伝送レートと、そのメディア符号化データの重要度別の伝送レートを分析する。
伝送レート補正部１９は、そのメディア符号化データの全体の伝送レートが許容範囲を超えて変動している場合、変動が特定の重要度に係る誤り訂正符号付きのメディア符号化データ（例えば、重要度が高いメディアデータが圧縮符号化されている符号化データ及び誤り訂正符号）にのみ発生しているのか、特定の重要度に関わらず全体的に変動している（全ての重要度に係る誤り訂正符号付きのメディア符号化データに変動が発生している）のかを確認する。 The transmission rate correction unit 19 monitors the transmission rate of the media encoded data with the error correction code sent from the data sending unit 18, and the overall transmission rate of the media encoded data and the media encoded data Analyze the transmission rate by importance.
When the overall transmission rate of the media encoded data fluctuates beyond an allowable range, the transmission rate correction unit 19 performs media encoded data (for example, important data with an error correction code having a specific importance). Whether high-level media data occurs only in encoded data and error-correcting codes that have been compression-encoded or fluctuated as a whole regardless of the specific importance (errors related to all importance) Check if the media encoded data with the correction code has changed).

伝送レート補正部１９は、変動が特定の重要度に係る誤り訂正符号付きのメディア符号化データにのみ発生している場合、該当する重要度に対応する符号化レートＥ_ｒａｔｅの補正指示を符号化レート決定部１４に出力し、また、該当する重要度に対応する付加レートＡ_ｒａｔｅの補正を誤り訂正符号付加レート決定部１５に出力する。
これにより、符号化レート決定部１４が該当する重要度に対応する符号化レートＥ_ｒａｔｅを下げる補正を実施し、誤り訂正符号付加レート決定部１５が該当する重要度に対応する付加レートＡ_ｒａｔｅを下げる補正を実施する。 The transmission rate correction unit 19 encodes a correction instruction for the encoding rate E _rate corresponding to the corresponding importance when the fluctuation occurs only in the media encoded data with the error correction code related to the specific importance. The output is output to the rate determination unit 14, and the correction of the additional rate A _rate corresponding to the corresponding importance is output to the error correction code additional rate determination unit 15.
As a result, the encoding rate determination unit 14 performs correction to lower the encoding rate E _rate corresponding to the corresponding importance, and the error correction code addition rate determination unit 15 _sets the additional rate A _rate corresponding to the corresponding importance. Implement a correction to lower.

伝送レート補正部１９は、特定の重要度に関わらず全体的に変動している場合、全ての重要度に対応する符号化レートＥ_ｒａｔｅの補正指示を符号化レート決定部１４に出力し、また、全ての重要度に対応する付加レートＡ_ｒａｔｅの補正指示を誤り訂正符号付加レート決定部１５に出力する。
これにより、符号化レート決定部１４が全ての重要度に対応する符号化レートＥ_ｒａｔｅを下げる補正を実施し、誤り訂正符号付加レート決定部１５が全ての重要度に対応する付加レートＡ_ｒａｔｅを下げる補正を実施する。 The transmission rate correction unit 19 outputs, to the encoding rate determination unit 14, a correction instruction for the encoding rate E _rate corresponding to all the importance levels when the overall fluctuation occurs regardless of the specific importance level. The correction instruction of the additional rate A _rate corresponding to all the importance levels is output to the error correction code additional rate determining unit 15.
As a result, the encoding rate determination unit 14 performs correction to lower the encoding rate E _rate corresponding to all the importance levels, and the error correction code additional rate determination unit 15 _sets the additional rate A _rate corresponding to all the importance levels. Implement a correction to lower.

以上で明らかなように、この実施の形態１によれば、符号化対象のメディアデータの重要度を判別する重要度判別部１３と、重要度判別部１３により判別された重要度を基準にして、そのメディアデータを圧縮符号化する際の符号化レートＥ_ｒａｔｅを決定する符号化レート決定部１４と、重要度判別部１３により判別された重要度を基準にして、圧縮符号化後のメディアデータに付加する誤り訂正符号の付加レートＡ_ｒａｔｅを決定する誤り訂正符号付加レート決定部１５と、符号化レート決定部１４により決定された符号化レートＥ_ｒａｔｅでメディアデータを圧縮符号化して、圧縮符号化後のメディアデータであるメディア符号化データを出力する符号化処理部１６と、符号化処理部１６から出力されたメディア符号化データに対して、誤り訂正符号付加レート決定部１５により決定された付加レートＡ_ｒａｔｅで誤り訂正符号を付加し、誤り訂正符号付きのメディア符号化データを出力する誤り訂正符号付加部１７とを設け、データ送出部１８が誤り訂正符号付加部１７から出力された誤り訂正符号付きのメディア符号化データをネットワーク３に送出するように構成したので、メディアの再生品質を維持しながら、帯域利用効率の低下を防止することができる効果を奏する。 As is apparent from the above, according to the first embodiment, the importance determination unit 13 that determines the importance of the media data to be encoded, and the importance determined by the importance determination unit 13 are used as a reference. The media data after compression encoding based on the importance determined by the encoding rate determination unit 14 and the importance determination unit 13 for determining the encoding rate E _rate when the media data is compressed and encoded. An error correction code addition rate determination unit 15 that determines an addition rate A _rate of the error correction code to be added to the media, and compression-encodes the media data at the encoding rate E _rate determined by the encoding rate determination unit 14 Encoding processing unit 16 that outputs media encoded data that is media data after conversion, and media encoded data output from encoding processing unit 16 , Adds an error correcting code added rate A _rate determined by the error correction code addition rate determining section 15 is provided with an error correction code addition unit 17 for outputting the medium coded data with the error correction code, the data delivery unit 18 is configured to send the media encoded data with the error correction code output from the error correction code adding unit 17 to the network 3, thereby preventing reduction in bandwidth utilization efficiency while maintaining the reproduction quality of the media. There is an effect that can be.

また、データ送出部１８から送出された誤り訂正符号付きのメディア符号化データの伝送レートが許容範囲内から逸脱している場合、伝送レート補正部１９が符号化レート決定部１４及び誤り訂正符号付加レート決定部１５に対して、符号化レートＥ_ｒａｔｅ及び付加レートＡ_ｒａｔｅの補正を指示して、伝送レートを許容範囲内に収めるように構成したので、回線の帯域に制限がある伝送路を使用する場合でも、伝送誤りの発生を防止することができる効果を奏する。 When the transmission rate of the media encoded data with the error correction code transmitted from the data transmission unit 18 deviates from the allowable range, the transmission rate correction unit 19 adds the encoding rate determination unit 14 and the error correction code addition. Since the rate determining unit 15 is instructed to correct the encoding rate E _rate and the additional rate A _rate and the transmission rate falls within the allowable range, a transmission path with a limited line bandwidth is used. Even in this case, there is an effect that it is possible to prevent occurrence of transmission errors.

実施の形態２．
図４はこの発明の実施の形態２によるメディア符号化伝送装置を示す構成図であり、図において、図１と同一符号は同一又は相当部分を示すので説明を省略する。
映像圧縮符号化処理部１１ａは映像特徴量抽出処理部１２ａ及び映像符号化処理部１６ａから構成されており、メディアソース２からメディアデータである映像データを入力すると、その映像データの特徴量を抽出して、その特徴量を映像重要度判別部１３ａに出力するとともに、その映像データを圧縮符号化して、圧縮符号化後の映像データである映像符号化データを映像誤り訂正符号付加部１７ａに出力する処理を実施する。
音声圧縮符号化処理部１１ｂは音声特徴量抽出処理部１２ｂ及び音声符号化処理部１６ｂから構成されており、メディアソース２からメディアデータである音声データを入力すると、その音声データの特徴量を抽出して、その特徴量を音声重要度判別部１３ｂに出力するとともに、その音声データを圧縮符号化して、圧縮符号化後の音声データである音声符号化データを音声誤り訂正符号付加部１７ｂに出力する処理を実施する。 Embodiment 2. FIG.
4 is a block diagram showing a media encoding / transmission apparatus according to Embodiment 2 of the present invention. In the figure, the same reference numerals as those in FIG.
The video compression coding processing unit 11a is composed of a video feature amount extraction processing unit 12a and a video coding processing unit 16a. When video data as media data is input from the media source 2, the feature amount of the video data is extracted. Then, the feature amount is output to the video importance determination unit 13a, and the video data is compressed and encoded, and the video encoded data which is the video data after the compression encoding is output to the video error correction code adding unit 17a. Perform the process.
The voice compression coding processing unit 11b is composed of a voice feature amount extraction processing unit 12b and a voice coding processing unit 16b. When voice data as media data is input from the media source 2, the feature amount of the voice data is extracted. Then, the feature amount is output to the speech importance determination unit 13b, the speech data is compressed and encoded, and the speech encoded data that is the speech data after compression encoding is output to the speech error correction code adding unit 17b. Perform the process.

映像特徴量抽出処理部１２ａはメディアソース２から符号化対象の映像データを入力するインタフェース（例えば、ビデオ入力端子）を備えており、入力した符号化対象の映像データの特徴量（例えば、動きベクトル、ＤＣＴ係数など）を抽出して、その特徴量を映像重要度判別部１３ａ及び映像符号化処理部１６ａに出力するとともに、その映像データを映像符号化処理部１６ａに出力する処理を実施する。
音声特徴量抽出処理部１２ｂはメディアソース２から符号化対象の音声データを入力するインタフェース（例えば、音声入力端子）を備えており、入力した符号化対象の音声データの特徴量を抽出して、その特徴量を音声重要度判別部１３ｂ及び音声符号化処理部１６ｂに出力するとともに、その音声データを音声符号化処理部１６ｂに出力する処理を実施する。 The video feature quantity extraction processing unit 12a includes an interface (for example, a video input terminal) for inputting video data to be encoded from the media source 2, and features (for example, motion vectors) of the input video data to be encoded. , DCT coefficients, etc.) are extracted, and the feature amount is output to the video importance determination unit 13a and the video encoding processing unit 16a, and the video data is output to the video encoding processing unit 16a.
The audio feature amount extraction processing unit 12b includes an interface (for example, an audio input terminal) that inputs audio data to be encoded from the media source 2, and extracts the feature amount of the input audio data to be encoded, The feature amount is output to the speech importance determination unit 13b and the speech encoding processing unit 16b, and the process of outputting the speech data to the speech encoding processing unit 16b is performed.

映像重要度判別部１３ａは映像特徴量抽出処理部１２ａから出力された特徴量を参照して、符号化対象の映像データの重要度を判別する処理を実施する。
音声重要度判別部１３ｂは音声特徴量抽出処理部１２ｂから出力された特徴量を参照して、符号化対象の音声データの重要度を判別する処理を実施する。
なお、映像特徴量抽出処理部１２ａ、音声特徴量抽出処理部１２ｂ、映像重要度判別部１３ａ及び音声重要度判別部１３ｂから重要度判別手段が構成されている。 The video importance level determination unit 13a refers to the feature amount output from the video feature amount extraction processing unit 12a and performs a process of determining the importance level of video data to be encoded.
The speech importance determination unit 13b refers to the feature amount output from the speech feature amount extraction processing unit 12b, and performs a process of determining the importance of the speech data to be encoded.
The video feature amount extraction processing unit 12a, the audio feature amount extraction processing unit 12b, the video importance level determination unit 13a, and the audio importance level determination unit 13b constitute importance level determination means.

映像符号化レート決定部１４ａは映像重要度判別部１３ａにより判別された重要度を基準にして、映像特徴量抽出処理部１２ａにより入力された映像データを圧縮符号化する際の符号化レートを決定する処理を実施する。
音声符号化レート決定部１４ｂは音声重要度判別部１３ｂにより判別された重要度を基準にして、音声特徴量抽出処理部１２ｂにより入力された音声データを圧縮符号化する際の符号化レートを決定する処理を実施する。 The video coding rate determination unit 14a determines a coding rate for compressing and coding the video data input by the video feature amount extraction processing unit 12a based on the importance determined by the video importance determination unit 13a. Perform the process.
The speech coding rate determination unit 14b determines a coding rate when the speech data input by the speech feature amount extraction processing unit 12b is compression-encoded based on the importance determined by the speech importance determination unit 13b. Perform the process.

映像誤り訂正符号付加レート決定部１５ａは映像重要度判別部１３ａにより判別された重要度を基準にして、映像符号化処理部１６ａにより圧縮符号化された映像データに付加する誤り訂正符号の付加レートを決定する処理を実施する。
音声誤り訂正符号付加レート決定部１５ｂは音声重要度判別部１３ｂにより判別された重要度を基準にして、音声符号化処理部１６ｂにより圧縮符号化された音声データに付加する誤り訂正符号の付加レートを決定する処理を実施する。
なお、映像符号化レート決定部１４ａ、音声符号化レート決定部１４ｂ、映像誤り訂正符号付加レート決定部１５ａ及び音声誤り訂正符号付加レート決定部１５ｂからレート決定手段が構成されている。 The video error correction code addition rate determination unit 15a uses the importance determined by the video importance determination unit 13a as a reference, and the error correction code addition rate added to the video data compressed and encoded by the video encoding processing unit 16a. The process of determining is performed.
The voice error correction code addition rate determination unit 15b uses the importance determined by the voice importance determination unit 13b as a reference, and the addition rate of the error correction code added to the voice data compressed and encoded by the voice encoding processing unit 16b. The process of determining is performed.
The video coding rate determination unit 14a, the audio coding rate determination unit 14b, the video error correction code addition rate determination unit 15a, and the audio error correction code addition rate determination unit 15b constitute a rate determination unit.

映像符号化処理部１６ａは映像符号化レート決定部１４ａにより決定された符号化レートで、映像特徴量抽出処理部１２ａから出力された特徴量に応じて映像データを圧縮符号化し、圧縮符号化後の映像データである映像符号化データを出力する処理を実施する。
音声符号化処理部１６ｂは音声符号化レート決定部１４ｂにより決定された符号化レートで、音声特徴量抽出処理部１２ｂから出力された特徴量に応じて音声データを圧縮符号化し、圧縮符号化後の音声データである音声符号化データを出力する処理を実施する。
なお、映像符号化処理部１６ａ及び音声符号化処理部１６ｂからメディアデータ符号化手段が構成されている。 The video encoding processing unit 16a compresses and encodes video data according to the feature amount output from the video feature amount extraction processing unit 12a at the encoding rate determined by the video encoding rate determination unit 14a. The process which outputs the video coding data which is the video data of this is implemented.
The audio encoding processing unit 16b compresses and encodes audio data according to the feature amount output from the audio feature amount extraction processing unit 12b at the encoding rate determined by the audio encoding rate determination unit 14b. The process which outputs the audio | voice coding data which are audio | voice data of this is implemented.
The video encoding processing unit 16a and the audio encoding processing unit 16b constitute media data encoding means.

映像誤り訂正符号付加部１７ａは映像符号化処理部１６ａから出力された映像符号化データに対して、映像誤り訂正符号付加レート決定部１５ａにより決定された付加レートで誤り訂正符号を付加し、誤り訂正符号付きの映像符号化データを出力する処理を実施する。
音声誤り訂正符号付加部１７ｂは音声符号化処理部１６ｂから出力された音声符号化データに対して、音声誤り訂正符号付加レート決定部１５ｂにより決定された付加レートで誤り訂正符号を付加し、誤り訂正符号付きの音声符号化データを出力する処理を実施する。
なお、映像誤り訂正符号付加部１７ａ及び音声誤り訂正符号付加部１７ｂから誤り訂正符号付加手段が構成されている。 The video error correction code addition unit 17a adds an error correction code to the video encoded data output from the video encoding processing unit 16a at the additional rate determined by the video error correction code addition rate determination unit 15a, and generates an error. A process of outputting video encoded data with a correction code is performed.
The voice error correction code addition unit 17b adds an error correction code to the voice encoded data output from the voice coding processing unit 16b at the additional rate determined by the voice error correction code addition rate determination unit 15b. A process of outputting speech encoded data with a correction code is performed.
The video error correcting code adding unit 17a and the audio error correcting code adding unit 17b constitute error correcting code adding means.

データ送出部２０はネットワーク３に対するインタフェース（例えば、無線ＬＡＮアダプタ、無線通信機器など）を備えており、映像誤り訂正符号付加部１７ａから出力された誤り訂正符号付きの映像符号化データ及び音声誤り訂正符号付加部１７ｂから出力された誤り訂正符号付きの音声符号化データをネットワーク３に送出する処理を実施する。なお、データ送出部２０はデータ送出手段を構成している。
伝送レート補正部２１はデータ送出部２０から送出された誤り訂正符号付きのメディア符号化データ（誤り訂正符号付きの映像符号化データ＋誤り訂正符号付きの音声符号化データ）の伝送レートが予め設定されている許容範囲内から逸脱している場合、映像符号化レート決定部１４ａ及び音声符号化レート決定部１４ｂに対して符号化レートの補正を指示するとともに、映像誤り訂正符号付加レート決定部１５ａ及び音声誤り訂正符号付加レート決定部１５ｂに対して付加レートの補正を指示して、その伝送レートを許容範囲内に収める処理を実施する。なお、伝送レート補正部２１はレート補正手段を構成している。 The data transmission unit 20 includes an interface (for example, a wireless LAN adapter, a wireless communication device, etc.) to the network 3, and the video encoded data with error correction code and audio error correction output from the video error correction code adding unit 17a. A process of transmitting the voice encoded data with the error correction code output from the code adding unit 17b to the network 3 is performed. The data sending unit 20 constitutes data sending means.
The transmission rate correction unit 21 sets in advance the transmission rate of media encoded data with error correction code (video encoded data with error correction code + audio encoded data with error correction code) transmitted from the data transmission unit 20. If it deviates from the allowable range, the video coding rate determination unit 14a and the audio coding rate determination unit 14b are instructed to correct the coding rate, and the video error correction code addition rate determination unit 15a. And the correction of the additional rate is instructed to the voice error correcting code additional rate determining unit 15b, and the process of keeping the transmission rate within the allowable range is performed. The transmission rate correction unit 21 constitutes a rate correction unit.

図４の例では、メディア符号化伝送装置の構成要素である映像特徴量抽出処理部１２ａ、音声特徴量抽出処理部１２ｂ、映像重要度判別部１３ａ、音声重要度判別部１３ｂ、映像符号化レート決定部１４ａ、音声符号化レート決定部１４ｂ、映像誤り訂正符号付加レート決定部１５ａ、音声誤り訂正符号付加レート決定部１５ｂ、映像符号化処理部１６ａ、音声符号化処理部１６ｂ、映像誤り訂正符号付加部１７ａ、音声誤り訂正符号付加部１７ｂ、データ送出部２０及び伝送レート補正部２１のそれぞれが専用のハードウェア（例えば、ＣＰＵを実装している半導体集積回路、あるいは、ワンチップマイコンなど）で構成されているものを想定しているが、メディア符号化伝送装置がコンピュータで構成される場合、映像特徴量抽出処理部１２ａ、音声特徴量抽出処理部１２ｂ、映像重要度判別部１３ａ、音声重要度判別部１３ｂ、映像符号化レート決定部１４ａ、音声符号化レート決定部１４ｂ、映像誤り訂正符号付加レート決定部１５ａ、音声誤り訂正符号付加レート決定部１５ｂ、映像符号化処理部１６ａ、音声符号化処理部１６ｂ、映像誤り訂正符号付加部１７ａ、音声誤り訂正符号付加部１７ｂ、データ送出部２０及び伝送レート補正部２１の処理内容を記述しているプログラムをコンピュータのメモリに格納し、当該コンピュータのＣＰＵが当該メモリに格納されているプログラムを実行するようにしてもよい。 In the example of FIG. 4, the video feature amount extraction processing unit 12a, the audio feature amount extraction processing unit 12b, the video importance level determination unit 13a, the audio importance level determination unit 13b, and the video encoding rate, which are components of the media encoding transmission apparatus. Determination unit 14a, audio encoding rate determination unit 14b, video error correction code addition rate determination unit 15a, audio error correction code addition rate determination unit 15b, video encoding processing unit 16a, audio encoding processing unit 16b, video error correction code Each of the adding unit 17a, the voice error correcting code adding unit 17b, the data sending unit 20, and the transmission rate correcting unit 21 is a dedicated hardware (for example, a semiconductor integrated circuit mounted with a CPU or a one-chip microcomputer). If the media encoding transmission device is configured by a computer, a video feature amount extraction processing unit is assumed. 2a, audio feature amount extraction processing unit 12b, video importance level determination unit 13a, audio importance level determination unit 13b, video encoding rate determination unit 14a, audio encoding rate determination unit 14b, video error correction code addition rate determination unit 15a, Audio error correction code addition rate determination unit 15b, video encoding processing unit 16a, audio encoding processing unit 16b, video error correction code addition unit 17a, audio error correction code addition unit 17b, data transmission unit 20 and transmission rate correction unit 21 It is also possible to store a program describing the processing contents of the above in a memory of a computer and execute the program stored in the memory by the CPU of the computer.

上記実施の形態１では、メディアデータを圧縮符号化して伝送するものについて示したが、通常、メディアデータには、映像データや音声データなど、複数種類のデータが重畳されている場合が多い。
しかし、映像データから見て重要度が高いシーンが、音声データにおいても重要度が高いとは限らない。逆に、音声データから見て重要度が高いシーンが、映像データにおいても重要度が高いとは限らない。
そこで、この実施の形態２では、映像データと音声データの重要度を個別に判別して、映像データに係る符号化レートＩＥ_ｒａｔｅ及び付加レートＩＡ_ｒａｔｅと、音声データに係る符号化レートＶＥ_ｒａｔｅ及び付加レートＶＡ_ｒａｔｅとを別個に決定するようにする。 In the first embodiment, media data is compressed and encoded and transmitted. However, in many cases, media data is often superimposed with a plurality of types of data such as video data and audio data.
However, a scene having a high importance level from the viewpoint of video data does not necessarily have a high importance level in audio data. On the other hand, a scene having a high importance level from the viewpoint of audio data does not necessarily have a high importance level in video data.
Therefore, in the second embodiment, the importance levels of the video data and the audio data are individually determined, and the encoding rate IE _rate and additional rate IA _rate related to the video data, and the encoding rate VE _rate related to the audio data, and The additional rate VA _rate is determined separately.

次に動作について説明する。
映像圧縮符号化処理部１１ａの映像特徴量抽出処理部１２ａは、メディアソース２から符号化対象の映像データを入力すると、図１の特徴量抽出処理部１２と同様に、その映像データの特徴量を抽出する。
映像特徴量抽出処理部１２ａは、映像データの特徴量を抽出すると、その特徴量を映像重要度判別部１３ａ及び映像符号化処理部１６ａに出力するとともに、その映像データを映像符号化処理部１６ａに出力する。 Next, the operation will be described.
When the video feature amount extraction processing unit 12a of the video compression encoding processing unit 11a receives the video data to be encoded from the media source 2, the feature amount of the video data is the same as the feature amount extraction processing unit 12 of FIG. To extract.
When the video feature amount extraction processing unit 12a extracts the feature amount of the video data, the video feature amount extraction processing unit 12a outputs the feature amount to the video importance level determination unit 13a and the video encoding processing unit 16a, and also outputs the video data to the video encoding processing unit 16a. Output to.

音声圧縮符号化処理部１１ｂの音声特徴量抽出処理部１２ｂは、メディアソース２から符号化対象の音声データを入力すると、図１の特徴量抽出処理部１２と同様に、その音声データの特徴量を抽出する。
音声特徴量抽出処理部１２ｂは、音声データの特徴量を抽出すると、その特徴量を音声重要度判別部１３ｂ及び音声符号化処理部１６ｂに出力するとともに、その音声データを音声符号化処理部１６ｂに出力する。 When the speech feature amount extraction processing unit 12b of the speech compression encoding processing unit 11b receives the speech data to be encoded from the media source 2, the feature amount of the speech data is the same as the feature amount extraction processing unit 12 of FIG. To extract.
When the speech feature amount extraction processing unit 12b extracts the feature amount of the speech data, the speech feature amount extraction processing unit 12b outputs the feature amount to the speech importance level determination unit 13b and the speech encoding processing unit 16b, and the speech data processing unit 16b outputs the speech data. Output to.

映像重要度判別部１３ａは、映像特徴量抽出処理部１２ａから映像データの特徴量を受けると、図１の重要度判別部１３と同様に、その特徴量を参照して、符号化対象の映像データの重要度を判別する。
音声重要度判別部１３ｂは、音声特徴量抽出処理部１２ｂから音声データの特徴量を受けると、図１の重要度判別部１３と同様に、その特徴量を参照して、符号化対象の音声データの重要度を判別する。 When receiving the feature quantity of the video data from the video feature quantity extraction processing section 12a, the video importance level judging section 13a refers to the feature quantity, as with the importance level judgment section 13 in FIG. Determine the importance of the data.
When receiving the feature amount of the voice data from the voice feature amount extraction processing unit 12b, the voice importance level discrimination unit 13b refers to the feature amount and, like the importance level discrimination unit 13 in FIG. Determine the importance of the data.

映像符号化レート決定部１４ａは、映像重要度判別部１３ａが映像データの重要度を判別すると、図１の符号化レート決定部１４と同様に、その重要度を基準にして、映像特徴量抽出処理部１２ａにより入力された映像データを圧縮符号化する際の符号化レートＩＥ_ｒａｔｅを決定し、その符号化レートＩＥ_ｒａｔｅを映像符号化処理部１６ａに出力する。
音声符号化レート決定部１４ｂは、音声重要度判別部１３ｂが音声データの重要度を判別すると、図１の符号化レート決定部１４と同様に、その重要度を基準にして、音声特徴量抽出処理部１２ｂにより入力された音声データを圧縮符号化する際の符号化レートＶＥ_ｒａｔｅを決定し、その符号化レートＶＥ_ｒａｔｅを音声符号化処理部１６ｂに出力する。 When the video importance level determination unit 13a determines the importance level of the video data, the video coding rate determination unit 14a extracts the video feature amount based on the importance level as in the coding rate determination unit 14 of FIG. An encoding rate IE _rate for compressing and encoding the video data input by the processing unit 12a is determined, and the encoding rate IE _rate is output to the video encoding processing unit 16a.
When the speech importance level determination unit 13b determines the importance level of the speech data, the speech coding rate determination unit 14b extracts speech feature values based on the importance level, as in the coding rate determination unit 14 of FIG. A coding rate VE _rate for compressing and coding the speech data input by the processing unit 12b is determined, and the coding rate VE _rate is output to the speech coding processing unit 16b.

映像符号化処理部１６ａは、映像符号化レート決定部１４ａから符号化レートＩＥ_ｒａｔｅを受けると、図１の符号化処理部１６と同様に、その符号化レートＩＥ_ｒａｔｅで、映像特徴量抽出処理部１２ａから出力された特徴量に応じて映像データを圧縮符号化し、圧縮符号化後の映像データである映像符号化データを映像誤り訂正符号付加部１７ａに出力する。
音声符号化処理部１６ｂは、音声符号化レート決定部１４ｂから符号化レートＶＥ_ｒａｔｅを受けると、図１の符号化処理部１６と同様に、その符号化レートＶＥ_ｒａｔｅで、音声特徴量抽出処理部１２ｂから出力された特徴量に応じて音声データを圧縮符号化し、圧縮符号化後の音声データである音声符号化データを音声誤り訂正符号付加部１７ｂに出力する。 When receiving the encoding rate IE _rate from the video encoding rate determining unit 14a, the video encoding processing unit 16a performs the video feature amount extraction process at the encoding rate IE _rate , as in the encoding processing unit 16 of FIG. The video data is compression-encoded according to the feature amount output from the unit 12a, and the video encoded data which is the video data after the compression encoding is output to the video error correction code adding unit 17a.
When the speech encoding processing unit 16b receives the encoding rate VE _rate from the speech encoding rate determining unit 14b, the speech encoding amount extraction process is performed at the encoding rate VE _rate as in the encoding processing unit 16 of FIG. The audio data is compression-encoded according to the feature amount output from the unit 12b, and the audio encoded data that is the audio data after compression encoding is output to the audio error correction code adding unit 17b.

映像誤り訂正符号付加レート決定部１５ａは、映像重要度判別部１３ａが映像データの重要度を判別すると、図１の誤り訂正符号付加レート決定部１５と同様に、その重要度を基準にして、映像符号化処理部１６ａにより圧縮符号化された映像データに付加する誤り訂正符号の付加レートＩＡ_ｒａｔｅを決定し、誤り訂正符号の付加レートＩＡ_ｒａｔｅを映像誤り訂正符号付加部１７ａに出力する。
音声誤り訂正符号付加レート決定部１５ｂは、音声重要度判別部１３ｂが音声データの重要度を判別すると、図１の誤り訂正符号付加レート決定部１５と同様に、その重要度を基準にして、音声符号化処理部１６ｂにより圧縮符号化された音声データに付加する誤り訂正符号の付加レートＶＡ_ｒａｔｅを決定し、誤り訂正符号の付加レートＶＡ_ｒａｔｅを音声誤り訂正符号付加部１７ｂに出力する。 When the video importance determination unit 13a determines the importance of the video data, the video error correction code addition rate determination unit 15a uses the importance as a reference in the same manner as the error correction code addition rate determination unit 15 in FIG. The addition rate IA _rate of the error correction code added to the video data compressed and encoded by the video encoding processing unit 16a is determined, and the addition rate IA _rate of the error correction code is output to the video error correction code addition unit 17a.
The voice error correction code addition rate determination unit 15b, when the voice importance determination unit 13b determines the importance of the voice data, like the error correction code addition rate determination unit 15 in FIG. The addition rate VA _rate of the error correction code to be added to the audio data compressed and encoded by the audio encoding processing unit 16b is determined, and the addition rate VA _rate of the error correction code is output to the audio error correction code addition unit 17b.

映像誤り訂正符号付加部１７ａは、映像符号化処理部１６ａから映像符号化データを受けると、その映像符号化データが伝送中に破損した場合に、受信側が破損した映像符号化データを復旧させる際に利用する誤り訂正符号を生成する。
即ち、映像誤り訂正符号付加部１７ａは、映像符号化処理部１６ａから出力された付加レートＩＡ_ｒａｔｅに対応するデータ長の誤り訂正符号を生成し、その誤り訂正符号を映像符号化データに付加する。 When the video error correction code adding unit 17a receives the video encoded data from the video encoding processing unit 16a, when the video encoded data is damaged during transmission, the video error correcting code adding unit 17a restores the damaged video encoded data on the receiving side. An error correction code used in the above is generated.
That is, the video error correction code adding unit 17a generates an error correction code having a data length corresponding to the additional rate IA _rate output from the video encoding processing unit 16a, and adds the error correction code to the video encoded data. .

音声誤り訂正符号付加部１７ｂは、音声符号化処理部１６ｂから音声符号化データを受けると、その音声符号化データが伝送中に破損した場合に、受信側が破損した音声符号化データを復旧させる際に利用する誤り訂正符号を生成する。
即ち、音声誤り訂正符号付加部１７ｂは、音声符号化処理部１６ｂから出力された付加レートＶＡ_ｒａｔｅに対応するデータ長の誤り訂正符号を生成し、その誤り訂正符号を音声符号化データに付加する。 When the speech error correction code adding unit 17b receives the speech encoded data from the speech encoding processing unit 16b, when the speech encoded data is damaged during transmission, the speech error correcting code adding unit 17b restores the damaged speech encoded data on the receiving side. An error correction code used in the above is generated.
That is, the voice error correction code adding unit 17b generates an error correction code having a data length corresponding to the additional rate VA _rate output from the voice encoding processing unit 16b, and adds the error correction code to the voice encoded data. .

データ送出部２０は、映像誤り訂正符号付加部１７ａから誤り訂正符号付きの映像符号化データを受け、音声誤り訂正符号付加部１７ｂから誤り訂正符号付きの音声符号化データを受けると、誤り訂正符号付きのメディア符号化データ（誤り訂正符号付きの映像符号化データ＋誤り訂正符号付きの音声符号化データ）をネットワーク３に送出する。
上述したように、符号化レートＩＥ_ｒａｔｅ，ＶＥ_ｒａｔｅ及び付加レートＩＡ_ｒａｔｅ，ＶＡ_ｒａｔｅが映像データ／音声データの重要度に応じて決定されているので、データ送出部２０から送出される誤り訂正符号付きのメディア符号化データの伝送レートは一定に制御される。
しかし、実際には、メディアデータの内容等によって伝送レートに誤差が生じることがある。 When the data sending unit 20 receives the encoded video data with the error correcting code from the video error correcting code adding unit 17a and receives the encoded audio data with the error correcting code from the audio error correcting code adding unit 17b, the error correcting code Attached media encoded data (video encoded data with error correction code + audio encoded data with error correction code) is sent to the network 3.
As described above, since the encoding rates IE _rate and VE _rate and the additional rates IA _rate and VA _rate are determined according to the importance of the video data / audio data, the error correction code sent from the data sending unit 20 The transmission rate of the attached media encoded data is controlled to be constant.
However, actually, an error may occur in the transmission rate depending on the contents of the media data.

そこで、伝送レート補正部２１は、データ送出部２０から送出された誤り訂正符号付きのメディア符号化データの伝送レートを監視し、その伝送レートが予め設定されている許容範囲内から逸脱していれば、映像符号化レート決定部１４ａ及び音声符号化レート決定部１４ｂに対して符号化レートＩＥ_ｒａｔｅ，ＶＥ_ｒａｔｅの補正を指示するとともに、映像誤り訂正符号付加レート決定部１５ａ及び音声誤り訂正符号付加レート決定部１５ｂに対して付加レートＩＡ_ｒａｔｅ，ＶＡ_ｒａｔｅの補正を指示して、その伝送レートを許容範囲内に収めるようにする。
具体的には、以下の通りである。 Therefore, the transmission rate correction unit 21 monitors the transmission rate of the media encoded data with the error correction code sent from the data sending unit 20, and the transmission rate deviates from a preset allowable range. For example, the video encoding rate determining unit 14a and the audio encoding rate determining unit 14b are instructed to correct the encoding rates IE _rate and VE _rate , and the video error correcting code adding rate determining unit 15a and the audio error correcting code are added. The rate determination unit 15b is instructed to correct the additional rates IA _rate and VA _rate so that the transmission rate falls within the allowable range.
Specifically, it is as follows.

伝送レート補正部２１は、データ送出部２０から送出された誤り訂正符号付きのメディア符号化データの伝送レートを監視して、そのメディア符号化データの全体の伝送レートと、そのメディア符号化データの重要度別の伝送レートを分析する。
伝送レート補正部２１は、そのメディア符号化データの全体の伝送レートが許容範囲を超えて変動している場合、変動が特定の重要度に係る誤り訂正符号付きのメディア符号化データ（例えば、重要度が高い映像データ／音声データが圧縮符号化されている符号化データ及び誤り訂正符号）にのみ発生しているのか、特定の重要度に関わらず全体的に変動している（全ての重要度に係る誤り訂正符号付きのメディア符号化データに変動が発生している）のかを確認する。 The transmission rate correction unit 21 monitors the transmission rate of the media encoded data with the error correction code sent from the data sending unit 20, and determines the overall transmission rate of the media encoded data and the media encoded data. Analyze the transmission rate by importance.
When the overall transmission rate of the media encoded data fluctuates beyond an allowable range, the transmission rate correction unit 21 media encoded data with an error correction code having a specific importance (for example, important Whether video data / audio data with a high degree is generated only in encoded data and error correction code), it fluctuates as a whole regardless of the specific importance (all importance levels) Whether or not there is a change in the media encoded data with the error correction code.

伝送レート補正部２１は、変動が特定の重要度に係る誤り訂正符号付きの映像符号化データにのみ発生している場合、該当する重要度に対応する符号化レートＩＥ_ｒａｔｅの補正指示を映像符号化レート決定部１４ａに出力し、また、該当する重要度に対応する付加レートＩＡ_ｒａｔｅの補正を映像誤り訂正符号付加レート決定部１５ａに出力する。
これにより、映像符号化レート決定部１４ａが該当する重要度に対応する符号化レートＩＥ_ｒａｔｅを下げる補正を実施し、映像誤り訂正符号付加レート決定部１５ａが該当する重要度に対応する付加レートＩＡ_ｒａｔｅを下げる補正を実施する。 When the fluctuation occurs only in the video encoded data with the error correction code related to the specific importance, the transmission rate correction unit 21 gives an instruction to correct the encoding rate IE _rate corresponding to the corresponding importance. And the correction of the additional rate IA _rate corresponding to the corresponding importance is output to the video error correction code additional rate determining unit 15a.
As a result, the video coding rate determination unit 14a performs correction to lower the coding rate IE _rate corresponding to the corresponding importance, and the video error correction code additional rate determination unit 15a performs the additional rate IA corresponding to the corresponding importance. Perform correction to lower the _rate .

伝送レート補正部２１は、変動が特定の重要度に係る誤り訂正符号付きの音声符号化データにのみ発生している場合、該当する重要度に対応する符号化レートＶＥ_ｒａｔｅの補正指示を音声符号化レート決定部１４ｂに出力し、また、該当する重要度に対応する付加レートＶＡ_ｒａｔｅの補正を音声誤り訂正符号付加レート決定部１５ｂに出力する。
これにより、音声符号化レート決定部１４ｂが該当する重要度に対応する符号化レートＶＥ_ｒａｔｅを下げる補正を実施し、音声誤り訂正符号付加レート決定部１５ｂが該当する重要度に対応する付加レートＶＡ_ｒａｔｅを下げる補正を実施する。 When the fluctuation occurs only in the voice encoded data with the error correction code related to the specific importance, the transmission rate correction unit 21 gives a correction instruction of the coding rate VE _rate corresponding to the corresponding importance to the voice code. And the correction of the additional rate VA _rate corresponding to the corresponding importance is output to the voice error correction code additional rate determining unit 15b.
As a result, the speech coding rate determination unit 14b performs correction to lower the coding rate VE _rate corresponding to the corresponding importance, and the speech error correction code addition rate determination unit 15b performs the additional rate VA corresponding to the corresponding importance. Perform correction to lower the _rate .

伝送レート補正部２１は、特定の重要度に関わらず全体的に変動している場合、全ての重要度に対応する符号化レートＩＥ_ｒａｔｅ，ＶＥ_ｒａｔｅの補正指示を映像符号化レート決定部１４ａ及び音声符号化レート決定部１４ｂに出力し、また、全ての重要度に対応する付加レートＩＡ_ｒａｔｅ，ＶＡ_ｒａｔｅの補正指示を映像誤り訂正符号付加レート決定部１５ａ及び音声誤り訂正符号付加レート決定部１５ｂに出力する。
これにより、映像符号化レート決定部１４ａ及び音声符号化レート決定部１４ｂが全ての重要度に対応する符号化レートＩＥ_ｒａｔｅ，ＶＥ_ｒａｔｅを下げる補正を実施し、映像誤り訂正符号付加レート決定部１５ａ及び音声誤り訂正符号付加レート決定部１５ｂが全ての重要度に対応する付加レートＩＡ_ｒａｔｅ，ＶＡ_ｒａｔｅを下げる補正を実施する。 When the transmission rate correction unit 21 changes as a whole regardless of the specific importance level, the transmission rate correction unit 21 sends the encoding rate IE _rate and VE _rate correction instructions corresponding to all the importance levels to the video encoding rate determination unit 14a and It outputs to the audio coding rate determination unit 14b, and gives instructions for correcting the additional rates IA _rate and VA _rate corresponding to all the importance levels. Output to.
As a result, the video coding rate determination unit 14a and the audio coding rate determination unit 14b perform correction to lower the coding rates IE _rate and VE _rate corresponding to all the importance levels, and the video error correction code addition rate determination unit 15a. The voice error correction code addition rate determination unit 15b performs correction to lower the addition rates IA _rate and VA _rate corresponding to all the importance levels.

以上で明らかなように、この実施の形態２によれば、映像データと音声データの重要度を個別に判別して、映像データに係る符号化レートＩＥ_ｒａｔｅ及び付加レートＩＡ_ｒａｔｅと、音声データに係る符号化レートＶＥ_ｒａｔｅ及び付加レートＶＡ_ｒａｔｅとを別個に決定するように構成したので、同一シーンの映像データと音声データの重要度が異なる場合でも、メディアの再生品質を維持しながら、帯域利用効率の低下を防止することができるとともに、回線の帯域に制限がある伝送路を使用する場合でも、伝送誤りの発生を防止することができる効果を奏する。 As is apparent from the above, according to the second embodiment, the importance levels of the video data and the audio data are individually determined, and the encoding rate IE _rate and the additional rate IA _rate related to the video data are added to the audio data. Since the encoding rate VE _rate and the additional rate VA _rate are determined separately, even when the importance of the video data and audio data of the same scene is different, the band usage is maintained while maintaining the reproduction quality of the media. It is possible to prevent a reduction in efficiency and to prevent the occurrence of transmission errors even when using a transmission line with a limited line bandwidth.

実施の形態３．
上記実施の形態２では、映像データと音声データの重要度を個別に判別して、映像データに係る符号化レートＩＥ_ｒａｔｅ及び付加レートＩＡ_ｒａｔｅと、音声データに係る符号化レートＶＥ_ｒａｔｅ及び付加レートＶＡ_ｒａｔｅとを別個に決定するものについて示したが、例えば、ＭＰＥＧシステムなどのように、映像データと音声データを符号化した後、誤り訂正符号を付加する前に、映像符号化データと音声符号化データを多重化する場合、上記実施の形態２を適用することができない。
この実施の形態３では、映像データと音声データを符号化した後、誤り訂正符号を付加する前に、映像符号化データと音声符号化データを多重化する場合でも、適用可能な実施形態を開示する。 Embodiment 3 FIG.
In the second embodiment, the importance levels of the video data and the audio data are individually determined, and the encoding rate IE _rate and additional rate IA _rate related to the video data, and the encoding rate VE _rate and additional rate related to the audio data are determined. Although the VA _rate is determined separately, for example, the video data and the audio code are encoded after the video data and the audio data are encoded and then the error correction code is added. In the case where multiplexed data is multiplexed, the second embodiment cannot be applied.
The third embodiment discloses an applicable embodiment even when video encoded data and audio encoded data are multiplexed after video data and audio data are encoded and before an error correction code is added. To do.

図５はこの発明の実施の形態３によるメディア符号化伝送装置を示す構成図であり、図において、図４と同一符号は同一又は相当部分を示すので説明を省略する。
映像音声多重部２２は映像符号化処理部１６ａから出力された映像符号化データと音声符号化処理部１６ｂから出力された音声符号化データを多重化し、メディア符号化データ（映像符号化データ＋音声符号化データ）を映像音声整列部２３に出力する処理を実施する。なお、映像音声多重部２２はデータ多重化手段を構成している。
映像音声整列部２３は映像音声多重部２２から出力されたメディア符号化データ内の映像符号化データと音声符号化データの並び替え処理を実施する。なお、映像音声整列部２３はデータ整列手段を構成している。 5 is a block diagram showing a media encoding / transmission apparatus according to Embodiment 3 of the present invention. In the figure, the same reference numerals as those in FIG.
The video / audio multiplexing unit 22 multiplexes the video encoded data output from the video encoding processing unit 16a and the audio encoded data output from the audio encoding processing unit 16b, and media encoded data (video encoded data + audio A process of outputting the encoded data) to the video / audio sorting unit 23 is performed. The video / audio multiplexing unit 22 constitutes data multiplexing means.
The video / audio sorting unit 23 performs a rearrangement process of the video encoded data and the audio encoded data in the media encoded data output from the video / audio multiplexing unit 22. The video / audio sorting unit 23 constitutes data sorting means.

映像誤り訂正符号付加部２４ａは映像音声整列部２３により並び替えが行われたメディア符号化データ内の映像符号化データに対して、映像誤り訂正符号付加レート決定部１５ａにより決定された付加レートで誤り訂正符号を付加する処理を実施する。
音声誤り訂正符号付加部２４ｂは映像音声整列部２３により並び替えが行われたメディア符号化データ内の音声符号化データに対して、音声誤り訂正符号付加レート決定部１５ｂにより決定された付加レートで誤り訂正符号を付加する処理を実施する。
なお、映像誤り訂正符号付加部２４ａ及び音声誤り訂正符号付加部２４ｂから誤り訂正符号付加手段が構成されている。 The video error correction code adding unit 24a applies to the video encoded data in the media encoded data rearranged by the video / audio aligning unit 23 at the additional rate determined by the video error correcting code additional rate determining unit 15a. A process of adding an error correction code is performed.
The audio error correction code adding unit 24b applies to the audio encoded data in the media encoded data rearranged by the video / audio aligning unit 23 at the additional rate determined by the audio error correcting code additional rate determining unit 15b. A process of adding an error correction code is performed.
The video error correction code addition unit 24a and the audio error correction code addition unit 24b constitute error correction code addition means.

図５の例では、メディア符号化伝送装置の構成要素である映像特徴量抽出処理部１２ａ、音声特徴量抽出処理部１２ｂ、映像重要度判別部１３ａ、音声重要度判別部１３ｂ、映像符号化レート決定部１４ａ、音声符号化レート決定部１４ｂ、映像誤り訂正符号付加レート決定部１５ａ、音声誤り訂正符号付加レート決定部１５ｂ、映像符号化処理部１６ａ、音声符号化処理部１６ｂ、映像音声多重部２２、映像音声整列部２３、映像誤り訂正符号付加部２４ａ、音声誤り訂正符号付加部２４ｂ、データ送出部２０及び伝送レート補正部２１のそれぞれが専用のハードウェア（例えば、ＣＰＵを実装している半導体集積回路、あるいは、ワンチップマイコンなど）で構成されているものを想定しているが、メディア符号化伝送装置がコンピュータで構成される場合、映像特徴量抽出処理部１２ａ、音声特徴量抽出処理部１２ｂ、映像重要度判別部１３ａ、音声重要度判別部１３ｂ、映像符号化レート決定部１４ａ、音声符号化レート決定部１４ｂ、映像誤り訂正符号付加レート決定部１５ａ、音声誤り訂正符号付加レート決定部１５ｂ、映像符号化処理部１６ａ、音声符号化処理部１６ｂ、映像音声多重部２２、映像音声整列部２３、映像誤り訂正符号付加部２４ａ、音声誤り訂正符号付加部２４ｂ、データ送出部２０及び伝送レート補正部２１の処理内容を記述しているプログラムをコンピュータのメモリに格納し、当該コンピュータのＣＰＵが当該メモリに格納されているプログラムを実行するようにしてもよい。 In the example of FIG. 5, the video feature amount extraction processing unit 12a, the audio feature amount extraction processing unit 12b, the video importance level determination unit 13a, the audio importance level determination unit 13b, and the video encoding rate, which are components of the media encoding transmission device. Determination unit 14a, audio encoding rate determination unit 14b, video error correction code addition rate determination unit 15a, audio error correction code addition rate determination unit 15b, video encoding processing unit 16a, audio encoding processing unit 16b, video / audio multiplexing unit 22, each of the video / audio alignment unit 23, the video error correction code addition unit 24a, the audio error correction code addition unit 24b, the data transmission unit 20, and the transmission rate correction unit 21 is mounted with dedicated hardware (for example, a CPU). It is assumed that it is composed of a semiconductor integrated circuit or a one-chip microcomputer. In this case, the video feature extraction processing unit 12a, the audio feature extraction processing unit 12b, the video importance determination unit 13a, the audio importance determination unit 13b, the video encoding rate determination unit 14a, and the audio encoding rate determination unit 14b, video error correction code addition rate determination unit 15a, audio error correction code addition rate determination unit 15b, video encoding processing unit 16a, audio encoding processing unit 16b, video / audio multiplexing unit 22, video / audio alignment unit 23, video error A program describing the processing contents of the correction code adding unit 24a, the voice error correcting code adding unit 24b, the data sending unit 20, and the transmission rate correcting unit 21 is stored in a computer memory, and the CPU of the computer stores the program in the memory. You may make it run the program currently performed.

次に動作について説明する。
映像音声多重部２２は、映像符号化処理部１６ａが映像データを圧縮符号化して映像符号化データを出力し、音声符号化処理部１６ｂが音声データを圧縮符号化して音声符号化データを出力すると、その映像符号化データと音声符号化データを多重化し、メディア符号化データ（映像符号化データ＋音声符号化データ）を映像音声整列部２３に出力する。
ここで、図６は映像音声多重部２２により多重化された映像符号化データ及び音声符号化データと、映像音声整列部２３により並び替えが行われた映像符号化データ及び音声符号化データと、誤り訂正符号が付加された映像符号化データ及び音声符号化データとを示す説明図である。 Next, the operation will be described.
When the video encoding processing unit 16a compresses and encodes video data and outputs video encoded data, and the audio encoding processing unit 16b compresses and encodes audio data and outputs audio encoded data, the video and audio multiplexing unit 22 The video encoded data and the audio encoded data are multiplexed, and the media encoded data (video encoded data + audio encoded data) is output to the video / audio aligning unit 23.
Here, FIG. 6 shows the video encoded data and audio encoded data multiplexed by the video / audio multiplexing unit 22, the video encoded data and audio encoded data rearranged by the video / audio aligning unit 23, It is explanatory drawing which shows the video coding data and audio | voice coding data to which the error correction code was added.

映像音声整列部２３は、映像音声多重部２２から図６に示すようなメディア符号化データ（映像符号化データ＋音声符号化データ）を受けると、そのメディア符号化データ内の映像符号化データと音声符号化データの並び替え処理を実施する。
具体的には、以下に示すような並び替え処理を実施する。 Upon receiving the media encoded data (video encoded data + audio encoded data) as shown in FIG. 6 from the video / audio multiplexing unit 22, the video / audio aligning unit 23 receives the video encoded data in the media encoded data, A rearrangement process of speech encoded data is performed.
Specifically, the rearrangement process as shown below is performed.

例えば、映像符号化データに対して、５パケット毎に１つの誤り訂正符号を付加するように指定されている場合、映像符号化データのパケット［１］〜［５］に対して、１つの誤り訂正符号を付加する。
図６の例では、映像符号化データのパケット［１］〜［５］の次に２つの映像符号化データのパケット［６］［７］が入力されているが、５つのパケットが入力される前に、音声符号化データのパケット［８］が入力されているため、映像符号化データの５つのパケットに対して、１つの誤り訂正符号を付加することができない。
そこで、音声符号化データのパケット［８］の次に入力された映像符号化データのパケット［９］〜［１１］を、音声符号化データのパケット［８］の前に入れ替える並び替え処理を行って、映像符号化データの５つのパケット［６］［７］［９］［１０］［１１］に対して、１つの誤り訂正符号を付加することができるようにしている。 For example, when it is specified that one error correction code is added every 5 packets for encoded video data, one error is generated for encoded video data packets [1] to [5]. A correction code is added.
In the example of FIG. 6, two video encoded data packets [6] and [7] are input after the video encoded data packets [1] to [5], but five packets are input. Since the audio encoded data packet [8] has been input before, one error correction code cannot be added to the 5 video encoded data packets.
Therefore, a rearrangement process is performed in which the video encoded data packets [9] to [11] input next to the audio encoded data packet [8] are replaced before the audio encoded data packet [8]. Thus, one error correction code can be added to the five packets [6] [7] [9] [10] [11] of the encoded video data.

ただし、伝送にＭＰＥＧシステムを利用する場合、メディア符号化データの中に、送信側と受信側の時刻を同期させるための信号（ＰＣＲ：ＰｒｏｇｒａｍＣｌｏｃｋＲｅｆｅｒｅｎｃｅ）が含まれている。
このＰＣＲが含まれているパケットの送信順序を並べ替えると、受信側において、時刻同期がずれてしまうため、並べ替えの対象となるパケットの中にＰＣＲが含まれている場合、受信側での時刻同期がずれないように、ＰＣＲの補正も合わせて行うことで、受信側での時刻同期を守るようにしている。 However, when an MPEG system is used for transmission, media encoded data includes a signal (PCR: Program Clock Reference) for synchronizing the time on the transmission side and the reception side.
If the transmission order of packets containing this PCR is rearranged, the time synchronization will be shifted on the receiving side. Therefore, if the PCR is included in the packets to be rearranged, The time synchronization on the receiving side is protected by correcting the PCR so that the time synchronization is not shifted.

映像誤り訂正符号付加部２４ａは、映像音声整列部２３がメディア符号化データ内の映像符号化データと音声符号化データの並び替えを行うと、並び替えが行われたメディア符号化データ内の映像符号化データに対して、映像誤り訂正符号付加レート決定部１５ａにより決定された付加レートＩＡ_ｒａｔｅで誤り訂正符号を付加する。
図６では、例えば、映像符号化データのパケット［１］〜［５］に対して、１つの誤り訂正符号を付加し、また、映像符号化データのパケット［６］［７］［９］［１０］［１１］に対して、１つの誤り訂正符号を付加している。 When the video / audio sorting unit 23 rearranges the video encoded data and the audio encoded data in the media encoded data, the video error correction code adding unit 24a performs the video in the media encoded data subjected to the rearrangement. An error correction code is added to the encoded data at the addition rate IA _rate determined by the video error correction code addition rate determination unit 15a.
In FIG. 6, for example, one error correction code is added to video encoded data packets [1] to [5], and video encoded data packets [6] [7] [9] [9] [ 10] [11], one error correction code is added.

音声誤り訂正符号付加部２４ｂは、映像音声整列部２３がメディア符号化データ内の映像符号化データと音声符号化データの並び替えを行うと、並び替えが行われたメディア符号化データ内の音声符号化データに対して、音声誤り訂正符号付加レート決定部１５ｂにより決定された付加レートＶＡ_ｒａｔｅで誤り訂正符号を付加する。
図６の例では、音声符号化データのパケット［８］に対して、１つの誤り訂正符号を付加している。
データ送出部２０及び伝送レート補正部２１の処理内容は、上記実施の形態２と同様であるため説明を省略する。 When the video / audio sorting unit 23 rearranges the video encoded data and the audio encoded data in the media encoded data, the audio error correcting code adding unit 24b performs the audio in the media encoded data subjected to the rearrangement. An error correction code is added to the encoded data at the addition rate VA _rate determined by the voice error correction code addition rate determination unit 15b.
In the example of FIG. 6, one error correction code is added to the voice encoded data packet [8].
Since the processing contents of the data sending unit 20 and the transmission rate correction unit 21 are the same as those in the second embodiment, description thereof is omitted.

以上で明らかなように、この実施の形態３によれば、映像音声整列部２３が映像音声多重部２２から出力されたメディア符号化データ内の映像符号化データと音声符号化データの並び替えを行うように構成したので、ＭＰＥＧシステムなどのように、映像データと音声データを符号化した後、誤り訂正符号を付加する前に、映像符号化データと音声符号化データを多重化する場合でも適用することができる。 As apparent from the above, according to the third embodiment, the video / audio sorting unit 23 rearranges the video encoded data and the audio encoded data in the media encoded data output from the video / audio multiplexing unit 22. Since it is configured to be applied, it is applicable even when video encoded data and audio encoded data are multiplexed after encoding video data and audio data and before adding an error correction code as in an MPEG system, etc. can do.

実施の形態４．
この実施の形態４では、図１の特徴量抽出部１２（図４及び図５の映像特徴量抽出部１２ａ、音声特徴量抽出部１２ｂを含む）が符号化対象のメディアデータの特徴量として、そのメディアデータがヘッダ部を構成している情報であるのか、データ部を構成している情報であるのかを確認し、図１の重要度判別部１３（図４及び図５の映像重要度判別部１３ａ、音声重要度判別部１３ｂを含む）が、符号化対象のメディアデータがヘッダ部を構成している情報であれば、そのメディアデータの重要度が「高」であると判別し、符号化対象のメディアデータがデータ部を構成している情報であれば、そのメディアデータの重要度が「中」又は「低」であると判別するようにする。 Embodiment 4 FIG.
In the fourth embodiment, the feature quantity extraction unit 12 in FIG. 1 (including the video feature quantity extraction unit 12a and the audio feature quantity extraction unit 12b in FIGS. 4 and 5) is used as the feature quantity of the media data to be encoded. It is confirmed whether the media data is information constituting the header part or information constituting the data part, and the importance level discrimination unit 13 in FIG. 1 (video importance level discrimination in FIGS. 4 and 5). Section 13a and voice importance level determination section 13b), if the media data to be encoded is information constituting the header section, it is determined that the importance level of the media data is “high”. If the media data to be converted is information constituting a data part, it is determined that the importance of the media data is “medium” or “low”.

具体的には、以下の通りである。
メディアデータのデータ部には、例えば、映像データや音声データなど、実際に伝送したい情報が格納され、ヘッダ部には、データ部に格納されている情報の種別やデータ構成などが格納される。
ヘッダ部の種類には、例えば、ＭＰＥＧシステムでは、映像の縦横の画素数や、フレームレートなどが格納されているシーケンスヘッダや、シーケンスの先頭からの時間などが格納されたＧＯＰ（ＧｒｏｕｐｏｆＰｉｃｔｕｒｅ）ヘッダなどがある。
ヘッダ部が欠落すると、データ部に格納されている情報の種別や構成が分からなくなり、データ部が正常に受信されても、メディアデータの復号ができなくなってしまうため、ヘッダ部はデータ部よりも重要度が高い。 Specifically, it is as follows.
For example, information to be actually transmitted such as video data and audio data is stored in the data portion of the media data, and the type and data configuration of the information stored in the data portion are stored in the header portion.
As the type of header portion, for example, in the MPEG system, a GOP (Group of Pictures) in which a sequence header in which the number of vertical and horizontal pixels of a video, a frame rate, and the like are stored, a time from the beginning of the sequence, and the like are stored. There are headers.
If the header part is missing, the type and configuration of the information stored in the data part will not be known, and even if the data part is received normally, it will not be possible to decode the media data. High importance.

そこで、特徴量抽出部１２は、符号化対象のメディアデータの特徴量として、そのメディアデータがヘッダ部を構成している情報であるのか、データ部を構成している情報であるのかを確認する。
重要度判別部１３は、符号化対象のメディアデータがヘッダ部を構成している情報であれば、そのメディアデータの重要度が「高」であると判別し、符号化対象のメディアデータがデータ部を構成している情報であれば、そのメディアデータの重要度が「中」又は「低」であると判別する。 Therefore, the feature quantity extraction unit 12 confirms whether the media data is information constituting the header part or information constituting the data part as the feature quantity of the media data to be encoded. .
The importance determination unit 13 determines that the importance of the media data is “high” if the media data to be encoded is information constituting the header portion, and the media data to be encoded is data. If the information constitutes a section, it is determined that the importance of the media data is “medium” or “low”.

以上で明らかなように、この実施の形態４によれば、符号化対象のメディアデータの重要度を判別する際、そのメディアデータがヘッダ部を構成している情報であるのか、データ部を構成している情報であるのかを確認し、そのメディアデータがヘッダ部を構成している情報である場合の重要度を、データ部を構成している情報である場合の重要度より高く設定するように構成したので、ヘッダ部の誤り訂正能力を高めて、復号メディアの品質を高めることができる効果を奏する。 As is apparent from the above, according to the fourth embodiment, when determining the importance of the media data to be encoded, whether the media data is information constituting the header portion or the data portion is configured. To determine whether the media data is the information constituting the header part, and to set the importance higher than the importance of the information constituting the data part. Thus, there is an effect that the error correction capability of the header part can be increased and the quality of the decoding media can be improved.

実施の形態５．
この実施の形態５では、符号化対象のメディアデータが映像データである場合、図１の特徴量抽出部１２（図４及び図５の映像特徴量抽出部１２ａを含む）が符号化対象の映像データの特徴量として、その映像データのピクチャタイプを確認し、図１の重要度判別部１３（図４及び図５の映像重要度判別部１３ａを含む）が、そのピクチャタイプを基準にして、その映像データの重要度を判別するようにする。 Embodiment 5 FIG.
In the fifth embodiment, when the media data to be encoded is video data, the feature quantity extraction unit 12 in FIG. 1 (including the video feature quantity extraction unit 12a in FIGS. 4 and 5) is the video to be encoded. As the data feature amount, the picture type of the video data is confirmed, and the importance level determination unit 13 in FIG. 1 (including the video importance level determination unit 13a in FIGS. 4 and 5) uses the picture type as a reference, The importance of the video data is determined.

具体的には、以下の通りである。
例えば、映像データの圧縮符号化にＭＰＥＧ−２を利用する場合、Ｉピクチャが欠落すると、そのＩピクチャを参照するＰピクチャやＢピクチャの復号にも影響を及ぼしてしまうため、Ｉピクチャは、ＰピクチャやＢピクチャよりも重要度が高いデータであるといえる。
また、Ｐピクチャが欠落すると、そのＰピクチャを参照するＢピクチャの復号にも影響を及ぼしてしまうため、Ｐピクチャは、Ｂピクチャよりも重要度が高いデータであるといえる。 Specifically, it is as follows.
For example, when MPEG-2 is used for compressing and encoding video data, if an I picture is lost, decoding of a P picture or a B picture that refers to the I picture is also affected. It can be said that the data is more important than the picture and the B picture.
In addition, if a P picture is missing, it also affects the decoding of a B picture that refers to the P picture, and therefore it can be said that the P picture is data that is more important than the B picture.

そこで、特徴量抽出部１２は、符号化対象の映像データの特徴量として、その映像データのピクチャタイプを確認する。
重要度判別部１３は、特徴量抽出部１２が映像データのピクチャタイプを確認すると、そのピクチャタイプを基準にして、その映像データの重要度を判別する。
例えば、映像データのピクチャタイプがＩピクチャであれば、その映像データの重要度が「高」であると判別し、映像データのピクチャタイプがＰピクチャであれば、その映像データの重要度が「中」であると判別し、映像データのピクチャタイプがＢピクチャであれば、その映像データの重要度が「低」であると判別する。 Therefore, the feature amount extraction unit 12 confirms the picture type of the video data as the feature amount of the video data to be encoded.
When the feature amount extraction unit 12 confirms the picture type of the video data, the importance level determination unit 13 determines the importance level of the video data on the basis of the picture type.
For example, if the picture type of the video data is I picture, it is determined that the importance of the video data is “high”. If the picture type of the video data is P picture, the importance of the video data is “ If the picture type of the video data is a B picture, it is determined that the importance of the video data is “low”.

以上で明らかなように、この実施の形態５によれば、符号化対象のメディアデータが映像データである場合、その映像データのピクチャタイプを確認し、そのピクチャタイプを基準にして、その映像データの重要度を判別するように構成したので、復号映像の画質を高めることができる効果を奏する。 As apparent from the above, according to the fifth embodiment, when the media data to be encoded is video data, the picture type of the video data is confirmed, and the video data is based on the picture type. Therefore, the image quality of the decoded video can be improved.

実施の形態６．
この実施の形態６では、符号化対象のメディアデータが映像データである場合、図１の特徴量抽出部１２（図４及び図５の映像特徴量抽出部１２ａを含む）が符号化対象の映像データの特徴量として、その映像データが圧縮符号化される際に用いられる動きベクトルを確認し、図１の重要度判別部１３（図４及び図５の映像重要度判別部１３ａを含む）が、その動きベクトルを基準にして、その映像データの重要度を判別するようにする。 Embodiment 6 FIG.
In the sixth embodiment, when the media data to be encoded is video data, the feature quantity extraction unit 12 in FIG. 1 (including the video feature quantity extraction unit 12a in FIGS. 4 and 5) is the video to be encoded. As a data feature amount, a motion vector used when the video data is compression-encoded is confirmed, and the importance determination unit 13 in FIG. 1 (including the video importance determination unit 13a in FIGS. 4 and 5). The importance of the video data is determined based on the motion vector.

具体的には、以下の通りである。
例えば、ＭＰＥＧなどの映像圧縮符号化方式では、圧縮符号化手法の一つとして、動き補償フレーム間予測が行われる。
動き補償フレーム間予測が行われるとき、大きな移動量（動きベクトル）を持っているピクチャは、参照画像からの変化が大きいことを意味するので、このピクチャが欠落すると、このピクチャを参照するピクチャの復号映像に影響を与えることになる。
このため、大きな動きベクトルを持つピクチャは、大きな動きベクトルを持たないピクチャよりも重要度が高いといえる。
これより、例えば、ピクチャの持つ動きベクトルの大きさの平均値が大きいピクチャの重要度を高くすることによって、ＰピクチャやＢピクチャの中でも、重要度が高いピクチャと重要度が低いピクチャを判定して、より細かく符号化レートや誤り訂正符号の付加レートを制御することができる。 Specifically, it is as follows.
For example, in a video compression encoding method such as MPEG, motion compensation interframe prediction is performed as one of the compression encoding methods.
When motion compensation inter-frame prediction is performed, a picture having a large movement amount (motion vector) means that the change from the reference picture is large. Therefore, if this picture is missing, the picture that refers to this picture This will affect the decoded video.
For this reason, it can be said that a picture with a large motion vector is more important than a picture without a large motion vector.
Thus, for example, by increasing the importance of a picture having a large average value of motion vector sizes, a picture having a high importance and a picture having a low importance are determined among P pictures and B pictures. Thus, the encoding rate and the error correction code addition rate can be controlled more finely.

そこで、特徴量抽出部１２は、符号化対象の映像データの特徴量として、その映像データが圧縮符号化される際に用いられる動きベクトルを確認する。動きベクトルを確認するに際して、特徴量抽出部１２が動きベクトルを探索する動きベクトル探索部を実装するようにしてもよいし、図１の符号化処理部１６（図４及び図５の映像符号化処理部１６ａを含む）から圧縮符号化される際に用いられる動きベクトルを取得するようにしてもよい。
重要度判別部１３は、特徴量抽出部１２により確認された動きベクトルを基準にして、映像データの重要度を判別する。
例えば、動きベクトルの大きさの平均値が大きいピクチャ（映像データ）である程、当該ピクチャの重要度を高く設定する。 Therefore, the feature quantity extraction unit 12 confirms a motion vector used when the video data is compression-coded as the feature quantity of the video data to be encoded. When confirming a motion vector, the feature quantity extraction unit 12 may implement a motion vector search unit that searches for a motion vector, or the encoding processing unit 16 of FIG. 1 (video encoding of FIGS. 4 and 5). The motion vector used when the compression encoding is performed may be acquired from the processing unit 16a).
The importance level determination unit 13 determines the importance level of the video data based on the motion vector confirmed by the feature amount extraction unit 12.
For example, the importance of the picture is set higher as the picture (video data) has a larger average value of the motion vectors.

以上で明らかなように、この実施の形態６によれば、符号化対象のメディアデータが映像データである場合、その映像データが圧縮符号化される際に用いられる動きベクトルを基準にして、その映像データの重要度を判別するように構成したので、符号化レートや付加レートをきめ細かく制御することができるようになり、その結果、復号映像の画質を高めることができる効果を奏する。 As is apparent from the above, according to the sixth embodiment, when the media data to be encoded is video data, the motion data used when the video data is compressed and encoded is used as a reference. Since the configuration is such that the importance of the video data is discriminated, the encoding rate and the additional rate can be finely controlled, and as a result, the image quality of the decoded video can be improved.

なお、この実施の形態６では、動きベクトルの大きさの平均値が大きいピクチャ（映像データ）である程、当該ピクチャの重要度を高く設定するものについて示したが、これに限るものではなく、例えば、ある閾値以上の大きさを持つ動きベクトルの数や、ある方向を向いている動きベクトルの数などに応じて、映像データの重要度を判別するようにしてもよい。 In the sixth embodiment, the higher the importance of the picture is set as the picture (video data) having a larger average value of the motion vectors is shown. However, the present invention is not limited to this. For example, the importance of the video data may be determined according to the number of motion vectors having a magnitude equal to or greater than a certain threshold, the number of motion vectors pointing in a certain direction, and the like.

実施の形態７．
この実施の形態７では、符号化対象のメディアデータが映像データである場合、図１の特徴量抽出部１２（図４及び図５の映像特徴量抽出部１２ａを含む）が符号化対象の映像データの特徴量として、その映像データが圧縮符号化される際に算出されるＤＣＴ係数（離散コサイン変換係数）を確認し、図１の重要度判別部１３（図４及び図５の映像重要度判別部１３ａを含む）が、そのＤＣＴ係数を基準にして、その映像データの重要度を判別するようにする。 Embodiment 7 FIG.
In the seventh embodiment, when the media data to be encoded is video data, the feature quantity extraction unit 12 in FIG. 1 (including the video feature quantity extraction unit 12a in FIGS. 4 and 5) is the video to be encoded. The DCT coefficient (discrete cosine transform coefficient) calculated when the video data is compression-encoded is confirmed as the data feature amount, and the importance level discrimination unit 13 in FIG. 1 (video importance level in FIGS. 4 and 5) is confirmed. (Including the discriminator 13a) discriminates the importance of the video data on the basis of the DCT coefficient.

具体的には、以下の通りである。
例えば、ＭＰＥＧなどの映像圧縮符号化方式では、空間方向の冗長度を削減するために、符号化対象の映像データをＤＣＴ変換して、その変換結果であるＤＣＴ係数を量子化する。
このとき、ＤＣＴ係数のＡＣ成分に大きな値（特に、高周波成分に大きな値）を持つピクチャは、ピクチャ内の画像が複雑であることを意味するので、ＤＣＴ係数の分布を計算し、高周波成分に多くの値を持つピクチャの重要度を高く設定すれば、複雑な画像の誤り訂正能力を強化することができる。 Specifically, it is as follows.
For example, in a video compression encoding system such as MPEG, in order to reduce redundancy in the spatial direction, video data to be encoded is DCT-converted and the DCT coefficient that is the conversion result is quantized.
At this time, a picture having a large value (especially a large value for the high frequency component) in the AC component of the DCT coefficient means that the image in the picture is complicated. Therefore, the distribution of the DCT coefficient is calculated, If the importance of a picture having many values is set high, the error correction capability of a complex image can be enhanced.

そこで、特徴量抽出部１２は、符号化対象の映像データの特徴量として、その映像データが圧縮符号化される際に算出されるＤＣＴ係数を確認する。ＤＣＴ係数を確認するに際して、特徴量抽出部１２が、例えば、符号化対象の映像データと参照画像の映像データとの差分データをＤＣＴ変換してＤＣＴ係数を算出するＤＣＴ器を実装するようにしてもよいし、図１の符号化処理部１６（図４及び図５の映像符号化処理部１６ａを含む）から圧縮符号化される際に算出されるＤＣＴ係数を取得するようにしてもよい。
重要度判別部１３は、特徴量抽出部１２により確認されたＤＣＴ係数を基準にして、映像データの重要度を判別する。
例えば、ＤＣＴ係数のＡＣ成分に大きな値を有するピクチャ（映像データ）である程、当該ピクチャの重要度を高く設定する。 Therefore, the feature amount extraction unit 12 confirms the DCT coefficient calculated when the video data is compression-encoded as the feature amount of the video data to be encoded. When confirming the DCT coefficient, the feature amount extraction unit 12 is mounted with, for example, a DCT device that calculates the DCT coefficient by DCT conversion of difference data between the video data to be encoded and the video data of the reference image. Alternatively, the DCT coefficient calculated when the compression encoding is performed from the encoding processing unit 16 of FIG. 1 (including the video encoding processing unit 16a of FIGS. 4 and 5) may be acquired.
The importance level determination unit 13 determines the importance level of the video data based on the DCT coefficient confirmed by the feature amount extraction unit 12.
For example, the importance of the picture is set higher as the picture (video data) has a larger value in the AC component of the DCT coefficient.

以上で明らかなように、この実施の形態７によれば、符号化対象のメディアデータが映像データである場合、その映像データの特徴量として、その映像データが圧縮符号化される際に算出されるＤＣＴ係数を確認し、そのＤＣＴ係数を基準にして、その映像データの重要度を判別するように構成したので、符号化レートや付加レートをきめ細かく制御することができるようになり、その結果、復号映像の画質を高めることができる効果を奏する。 As is apparent from the above, according to the seventh embodiment, when the media data to be encoded is video data, it is calculated as the feature amount of the video data when the video data is compressed and encoded. The DCT coefficient is determined and the importance of the video data is determined based on the DCT coefficient, so that the encoding rate and the additional rate can be finely controlled. There is an effect that the image quality of the decoded video can be improved.

実施の形態８．
この実施の形態８では、符号化対象のメディアデータの重要度を判別する際、図１の特徴量抽出部１２が、そのメディアデータが音声データを含んでいるか否かを確認し、図１の重要度判別部１３が、音声データを含んでいる場合のメディアデータの重要度を、音声データを含んでいない場合のメディアデータの重要度より高いものと判別するようにする。 Embodiment 8 FIG.
In the eighth embodiment, when determining the importance of the media data to be encoded, the feature quantity extraction unit 12 in FIG. 1 confirms whether or not the media data includes audio data. The importance determination unit 13 determines that the importance of the media data when the audio data is included is higher than the importance of the media data when the audio data is not included.

具体的には、以下の通りである。
映像データは音声データと一緒に使用されることが多く、映像データは音声データが重畳されて伝送されることが多い。
音声データは、単位時間当たりのデータ量が映像データと比べて非常に小さいため、映像データであれば、復号映像の一部分の画質が劣化する程度のデータ量の欠落でも、音声データであれば、音声を復号することができないほどの欠落になる可能性がある。
つまり、１つのデータ欠落が復号音声に与える影響は、１つのデータ欠落が復号映像に与える影響より大きい。
このため、音声データを含むメディアデータの重要度を高く判定すれば、音声データの伝送誤り訂正能力を上げて、復号音声の品質を維持することができる。 Specifically, it is as follows.
Video data is often used together with audio data, and video data is often transmitted with audio data superimposed thereon.
Since the amount of audio data per unit time is very small compared to video data, if it is video data, even if it is audio data even if there is a lack of data amount that degrades the image quality of a part of the decoded video, There is a possibility of missing such that the voice cannot be decoded.
That is, the influence of one data loss on decoded audio is larger than the influence of one data loss on decoded video.
Therefore, if the importance of media data including audio data is determined to be high, the transmission error correction capability of audio data can be increased and the quality of decoded audio can be maintained.

そこで、特徴量抽出部１２は、符号化対象のメディアデータの特徴量として、そのメディアデータが音声データを含んでいるか否かを確認する。
重要度判別部１３は、符号化対象のメディアデータが音声データを含んでいる場合のメディアデータの重要度を、音声データを含んでいない場合のメディアデータの重要度より高く設定する。 Therefore, the feature quantity extraction unit 12 checks whether the media data includes audio data as the feature quantity of the media data to be encoded.
The importance determination unit 13 sets the importance of the media data when the media data to be encoded includes audio data higher than the importance of the media data when it does not include audio data.

以上で明らかなように、この実施の形態８によれば、符号化対象のメディアデータの重要度を判別する際、そのメディアデータが音声データを含んでいるか否かを確認し、音声データを含んでいる場合のメディアデータの重要度を、音声データを含んでいない場合のメディアデータの重要度より高く設定するように構成したので、音声データの伝送誤り訂正能力を上げて、復号音声の品質を維持することができる効果を奏する。 As is apparent from the above, according to the eighth embodiment, when determining the importance of the media data to be encoded, it is confirmed whether or not the media data includes audio data. Since the importance of media data is set higher than the importance of media data when audio data is not included, the transmission error correction capability of audio data is increased and the quality of decoded audio is increased. There is an effect that can be maintained.

実施の形態９．
上記実施の形態１〜８では、受信側において、誤り訂正を行うことができるようにするために、誤り訂正符号を付加しているが、誤り訂正符号は冗長なデータであり、伝送誤りが発生していなければ、誤り訂正符号が使用する帯域をメディア符号化データに割り当てた方が、より高画質なメディアデータを伝送することができる。
そのため、伝送状況が良好であれば、誤り訂正符号の付加レートを下げて伝送効率を上げ、伝送状況が劣化していれば、誤り訂正符号の付加レートを上げて誤り訂正能力を上げることが望ましい。 Embodiment 9 FIG.
In the above first to eighth embodiments, an error correction code is added to enable error correction on the receiving side. However, the error correction code is redundant data and a transmission error occurs. Otherwise, media data with higher image quality can be transmitted if the band used by the error correction code is allocated to the media encoded data.
Therefore, if the transmission condition is good, it is desirable to increase the transmission efficiency by reducing the error correction code addition rate, and increase the error correction capability by increasing the error correction code addition rate if the transmission condition is degraded. .

図７はこの発明の実施の形態９によるメディア符号化伝送装置を示す構成図であり、図において、図４と同一符号は同一又は相当部分を示すので説明を省略する。
回線状況取得部２５は例えば受信側装置などのネットワーク３内の他の機器から回線状況（ネットワークの通信状況）を収集する処理を実施する。なお、回線状況取得部２５は通信状況収集手段を構成している。 FIG. 7 is a block diagram showing a media encoding / transmission apparatus according to Embodiment 9 of the present invention. In the figure, the same reference numerals as those in FIG.
The line status acquisition unit 25 performs a process of collecting the line status (network communication status) from other devices in the network 3 such as a receiving device. The line status acquisition unit 25 constitutes a communication status collection unit.

伝送レート補正部２６は図４の伝送レート補正部２１と同様の処理を実施するほか、回線状況取得部２５により収集された回線状況に応じた符号化レートに決定する指示を映像符号化レート決定部１４ａ及び音声符号化レート決定部１４ｂに行うとともに、その回線状況に応じた付加レートに決定する指示を映像誤り訂正符号付加レート決定部１５ａ及び音声誤り訂正符号付加レート決定部１５ｂに行う処理を実施する。なお、伝送レート補正部２６はレート補正手段を構成している。
図７では、回線状況取得部２５及び伝送レート補正部２６を図４のメディア符号化伝送装置に実装している例を示しているが、回線状況取得部２５及び伝送レート補正部２６を図１又は図５のメディア符号化伝送装置に実装するようにしてもよい。 The transmission rate correction unit 26 performs the same processing as the transmission rate correction unit 21 of FIG. 4 and determines an instruction to determine the encoding rate according to the line status collected by the line status acquisition unit 25. 14a and the audio coding rate determining unit 14b, and processing for instructing the video error correcting code additional rate determining unit 15a and the audio error correcting code additional rate determining unit 15b to determine an additional rate according to the line status. carry out. The transmission rate correction unit 26 constitutes a rate correction unit.
7 shows an example in which the line status acquisition unit 25 and the transmission rate correction unit 26 are mounted on the media encoding transmission apparatus of FIG. 4, the line status acquisition unit 25 and the transmission rate correction unit 26 are shown in FIG. Or you may make it mount in the media encoding transmission apparatus of FIG.

図７の例では、メディア符号化伝送装置の構成要素である映像特徴量抽出処理部１２ａ、音声特徴量抽出処理部１２ｂ、映像重要度判別部１３ａ、音声重要度判別部１３ｂ、映像符号化レート決定部１４ａ、音声符号化レート決定部１４ｂ、映像誤り訂正符号付加レート決定部１５ａ、音声誤り訂正符号付加レート決定部１５ｂ、映像符号化処理部１６ａ、音声符号化処理部１６ｂ、映像誤り訂正符号付加部１７ａ、音声誤り訂正符号付加部１７ｂ、データ送出部２０、回線状況取得部２５及び伝送レート補正部２６のそれぞれが専用のハードウェア（例えば、ＣＰＵを実装している半導体集積回路、あるいは、ワンチップマイコンなど）で構成されているものを想定しているが、メディア符号化伝送装置がコンピュータで構成される場合、映像特徴量抽出処理部１２ａ、音声特徴量抽出処理部１２ｂ、映像重要度判別部１３ａ、音声重要度判別部１３ｂ、映像符号化レート決定部１４ａ、音声符号化レート決定部１４ｂ、映像誤り訂正符号付加レート決定部１５ａ、音声誤り訂正符号付加レート決定部１５ｂ、映像符号化処理部１６ａ、音声符号化処理部１６ｂ、映像誤り訂正符号付加部１７ａ、音声誤り訂正符号付加部１７ｂ、データ送出部２０、回線状況取得部２５及び伝送レート補正部２６の処理内容を記述しているプログラムをコンピュータのメモリに格納し、当該コンピュータのＣＰＵが当該メモリに格納されているプログラムを実行するようにしてもよい。 In the example of FIG. 7, the video feature amount extraction processing unit 12a, the audio feature amount extraction processing unit 12b, the video importance level determination unit 13a, the audio importance level determination unit 13b, and the video encoding rate, which are components of the media encoding transmission apparatus. Determination unit 14a, audio encoding rate determination unit 14b, video error correction code addition rate determination unit 15a, audio error correction code addition rate determination unit 15b, video encoding processing unit 16a, audio encoding processing unit 16b, video error correction code Each of the addition unit 17a, the voice error correction code addition unit 17b, the data transmission unit 20, the line status acquisition unit 25, and the transmission rate correction unit 26 has dedicated hardware (for example, a semiconductor integrated circuit on which a CPU is mounted, or One-chip microcomputer, etc.) is assumed, but when the media encoding transmission device is configured with a computer, Image feature extraction processing unit 12a, audio feature extraction processing unit 12b, video importance determination unit 13a, audio importance determination unit 13b, video encoding rate determination unit 14a, audio encoding rate determination unit 14b, video error correction code Additional rate determination unit 15a, audio error correction code addition rate determination unit 15b, video encoding processing unit 16a, audio encoding processing unit 16b, video error correction code addition unit 17a, audio error correction code addition unit 17b, data transmission unit 20 The program describing the processing contents of the line status acquisition unit 25 and the transmission rate correction unit 26 may be stored in a computer memory, and the CPU of the computer may execute the program stored in the memory. .

次に動作について説明する。
回線状況取得部２５は、例えば、受信側装置などのネットワーク３内の他の機器から回線状況を収集し、その回線状況を伝送レート補正部２６に通知する。
具体的には、例えば、伝送路がＩＰネットワークである場合、「ＩＥＴＦＲＦＣ３５５０ “ＲＴＰ：ＡＴｒａｓｎｐｏｒｔＰｒｏｔｏｃｏｌｆｏｒＲｅａｌ−ＴｉｍｅＡｐｐｌｉｃａｔｉｏｎｓ”」に規定されているＲＴＣＰ（Ｒｅａｌ−ＴｉｍｅＴｒａｎｓｆｅｒＣｏｎｔｒｏｌＰｒｏｔｏｃｏｌ）を利用することで、回線状況を収集することができる。
即ち、ＲＴＣＰを利用する場合、回線状況取得部２５が、送信側装置であるメディア符号化伝送装置１のレポート（ＳＲ：ＳｅｎｄｅｒＲｅｐｏｒｔ）に対する受信側装置からのレポート（ＲＲ：ＲｅｃｅｉｖｅｒＲｅｐｏｒｔ）を受け取ることにより、ネットワーク３の回線状況として、ネットワーク３におけるパケット欠落率、累積欠落パケット数やパケット間隔ジッタなどを収集することができる。 Next, the operation will be described.
For example, the line status acquisition unit 25 collects the line status from other devices in the network 3 such as a receiving side device and notifies the transmission rate correction unit 26 of the line status.
Specifically, for example, when the transmission path is an IP network, RTCP (Real-Time Transfer Control Protocol) defined in “IETF RFC3550“ RTP: A Tranport Protocol for Real-Time Applications ”” is used. The line status can be collected.
That is, when RTCP is used, the line status acquisition unit 25 receives a report (RR: Receiver Report) from the receiving side apparatus with respect to a report (SR: Sender Report) of the media encoding transmission apparatus 1 that is the transmitting side apparatus. Thus, as the line status of the network 3, the packet loss rate, the cumulative number of lost packets, the packet interval jitter, etc. in the network 3 can be collected.

伝送レート補正部２６は、伝送レートを一定に維持するために、図４の伝送レート補正部２１と同様の処理を実施するが、ネットワーク３の回線状況に応じて、符号化レートと付加レートを制御する。
例えば、伝送レート補正部２６は、回線状況取得部２５から通知されたネットワーク３の回線状況が、パケットの欠落がない旨を示していれば、誤り訂正符号の付加レートＩＡ_ｒａｔｅ，ＶＡ_ｒａｔｅを下げる補正指示を映像誤り訂正符号付加レート決定部１５ａ及び音声誤り訂正符号付加レート決定部１５ｂに出力して、伝送効率が上がるようにする。
一方、回線状況取得部２５から通知されたネットワーク３の回線状況が、パケットの欠落が発生している旨を示していれば、誤り訂正符号の付加レートＩＡ_ｒａｔｅ，ＶＡ_ｒａｔｅを上げる補正指示を映像誤り訂正符号付加レート決定部１５ａ及び音声誤り訂正符号付加レート決定部１５ｂに出力して、誤り訂正能力が上がるようにする。 The transmission rate correction unit 26 performs the same processing as the transmission rate correction unit 21 in FIG. 4 in order to maintain the transmission rate constant, but the encoding rate and the additional rate are set according to the line status of the network 3. Control.
For example, if the line status of the network 3 notified from the line status acquisition unit 25 indicates that there is no packet loss, the transmission rate correction unit 26 decreases the error correction code addition rates IA _rate and VA _rate . The correction instruction is output to the video error correction code addition rate determination unit 15a and the audio error correction code addition rate determination unit 15b so that the transmission efficiency is increased.
On the other hand, if the line status of the network 3 notified from the line status acquisition unit 25 indicates that a packet is missing, a correction instruction for increasing the additional rates IA _rate and VA _rate of the error correction code is displayed. The error correction code addition rate determination unit 15a and the voice error correction code addition rate determination unit 15b are output to improve the error correction capability.

以上で明らかなように、この実施の形態９によれば、ネットワーク３の回線状況を収集する回線状況取得部２５を設け、伝送レート補正部２６が、回線状況取得部２５により収集された回線状況に応じた付加レートＩＡ_ｒａｔｅ，ＶＡ_ｒａｔｅに決定する指示を映像誤り訂正符号付加レート決定部１５ａ及び音声誤り訂正符号付加レート決定部１５ｂに行うように構成したので、ネットワーク３の回線状況が良好であれば、伝送効率を上げることができる一方、ネットワーク３の回線状況が劣化していれば、誤り訂正能力を上げることができる効果を奏する。 As is apparent from the above, according to the ninth embodiment, the line status acquisition unit 25 that collects the line status of the network 3 is provided, and the transmission rate correction unit 26 collects the line status collected by the line status acquisition unit 25. Since the video error correction code addition rate determination unit 15a and the voice error correction code addition rate determination unit 15b are instructed to determine the addition rates IA _rate and VA _rate according to the network, the network status of the network 3 is good. If there is, the transmission efficiency can be increased. On the other hand, if the line status of the network 3 is deteriorated, the error correction capability can be increased.

実施の形態１０．
上記実施の形態９では、ネットワーク３の回線状況に応じて誤り訂正符号の付加レートＩＡ_ｒａｔｅ，ＶＡ_ｒａｔｅを増減させるものについて示したが、この場合、ネットワーク３の回線状況が悪くなると、全体的に誤り訂正符号の符号量が増えるため、メディア符号化データの符号量が減り、メディアデータの重要度に関係なく、一律にメディア品質が低下することになる。 Embodiment 10 FIG.
In the ninth embodiment, an example in which the addition rates IA _rate and VA _rate of the error correction code are increased or decreased according to the line status of the network 3 has been described. Since the code amount of the error correction code is increased, the code amount of the media encoded data is decreased, and the media quality is uniformly reduced regardless of the importance of the media data.

そこで、この実施の形態１０では、ネットワーク３の回線状況が劣化している場合、伝送レート補正部２６が、メディアデータの重要度の再判別を映像重要度判別部１３ａ及び音声重要度判別部１３ｂに指示するようにしている。
これにより、映像重要度判別部１３ａは、映像データの重要度を再判別し、音声重要度判別部１３ｂは、音声データの重要度を再判別する。 Therefore, in the tenth embodiment, when the line status of the network 3 is deteriorated, the transmission rate correction unit 26 re-discriminates the importance of the media data by the video importance determining unit 13a and the audio importance determining unit 13b. I'm trying to tell you.
As a result, the video importance level determination unit 13a redetermines the importance level of the video data, and the audio importance level discrimination unit 13b redetermines the importance level of the audio data.

この結果、回線状況が劣化している場合、誤り訂正符号の付加レートＩＡ_ｒａｔｅ，ＶＡ_ｒａｔｅを上げた分だけ、重要度が高いメディアデータの伝送時間を更に長く設定して、重要度が低いメディアデータの伝送時間を短くすることで、重要度が高いメディアデータの伝送をより確実に行うことが可能な最適なレート配分を実現することができる。 As a result, when the line condition is deteriorated, the transmission time of media data with high importance is set longer by the amount corresponding to the increase of the additional rates IA _rate and VA _rate of the error correction code, and the medium with low importance is set. By shortening the data transmission time, it is possible to realize an optimum rate distribution capable of more reliably transmitting media data having a high degree of importance.

１メディア符号化伝送装置、２メディアソース、３ネットワーク、１１メディア圧縮符号化処理部、１１ａ映像圧縮符号化処理部、１１ｂ音声圧縮符号化処理部、１２特徴量抽出処理部（重要度判別手段）、１２ａ映像特徴量抽出処理部（重要度判別手段）、１２ｂ音声特徴量抽出処理部（重要度判別手段）、１３重要度判別部（重要度判別手段）、１３ａ映像重要度判別部（重要度判別手段）、１３ｂ音声重要度判別部（重要度判別手段）、１４符号化レート決定部（レート決定手段）、１４ａ映像符号化レート決定部（レート決定手段）、１４ｂ音声符号化レート決定部（レート決定手段）、１５誤り訂正符号付加レート決定部（レート決定手段）、１５ａ映像誤り訂正符号付加レート決定部（レート決定手段）、１５ｂ音声誤り訂正符号付加レート決定部（レート決定手段）、１６符号化処理部（メディアデータ符号化手段）、１６ａ映像符号化処理部（メディアデータ符号化手段）、１６ｂ音声符号化処理部（メディアデータ符号化手段）、１７誤り訂正符号付加部（誤り訂正符号付加手段）、１７ａ，２４ａ映像誤り訂正符号付加部（誤り訂正符号付加手段）、１７ｂ，２４ｂ音声誤り訂正符号付加部（誤り訂正符号付加手段）、１８，２０データ送出部（データ送出手段）、１９，２１，２６伝送レート補正部（レート補正手段）、２２映像音声多重部（データ多重化手段）、２３映像音声整列部（データ整列手段）、２５回線状況取得部（通信状況収集手段）。 DESCRIPTION OF SYMBOLS 1 Media encoding transmission apparatus, 2 Media source, 3 Network, 11 Media compression encoding process part, 11a Video compression encoding process part, 11b Voice compression encoding process part, 12 Feature-value extraction process part (importance discrimination means) , 12a Video feature amount extraction processing unit (importance level determination unit), 12b Audio feature amount extraction processing unit (importance level determination unit), 13 Importance level determination unit (importance level determination unit), 13a Video importance level determination unit (importance level) Discriminating means), 13b audio importance discriminating section (importance discriminating means), 14 encoding rate determining section (rate determining means), 14a video encoding rate determining section (rate determining means), 14b audio encoding rate determining section ( Rate determination means), 15 error correction code addition rate determination section (rate determination means), 15a video error correction code addition rate determination section (rate determination means) , 15b Audio error correction code addition rate determining unit (rate determining unit), 16 encoding processing unit (media data encoding unit), 16a video encoding processing unit (media data encoding unit), 16b audio encoding processing unit ( Media data encoding means), 17 error correction code addition section (error correction code addition means), 17a, 24a Video error correction code addition section (error correction code addition means), 17b, 24b Audio error correction code addition section (error correction) Sign adding means), 18, 20 data sending section (data sending means), 19, 21, 26 transmission rate correcting section (rate correcting means), 22 video / audio multiplexing section (data multiplexing means), 23 video / audio aligning section ( Data alignment means), 25 line status acquisition unit (communication status collection means).

Claims

An importance level discriminating unit that discriminates the degree of importance of media data to be encoded, and an encoding rate for compressing and encoding the media data based on the importance level discriminated by the importance level discriminating unit. And a rate determining means for determining an additional rate of an error correction code to be added to the media data after compression encoding, and compression encoding by encoding the media data at the encoding rate determined by the rate determining means. An error at an additional rate determined by the rate determining means with respect to the media encoded data output from the media data encoding means; Error correction code adding means for adding a correction code and outputting media encoded data with an error correction code; Data transmission means for sending media encoded data with error correction code output from error correction code adding means to the network, and transmission rate of media encoded data with error correction code sent from the data transmission means are allowed A media encoding / transmission apparatus comprising: rate correction means for instructing the rate determination means to correct the encoding rate and the additional rate when deviating from the range, and to keep the transmission rate within an allowable range .

The importance discriminating means for discriminating the importance of the video data which is the media data to be encoded, while determining the importance of the audio data which is the media data to be encoded, and the video discriminated by the importance discriminating means. Based on the importance of the data, the coding rate for compressing and encoding the video data is determined, and the addition rate of the error correction code to be added to the video data after compression encoding is determined. Determine the encoding rate when compressing and encoding the audio data based on the importance of the audio data determined by the degree determination means, and add an error correction code to be added to the audio data after compression encoding A rate determining means for determining a rate, and a video after compression encoding the video data at the encoding rate determined by the rate determining means; Video encoded data, which is data, and at the same time, the audio data is compressed and encoded at the encoding rate determined by the rate determining means, and audio encoded data which is audio data after compression encoding is output. An error correction code is added to the media data encoding means and the encoded video data output from the media data encoding means at an additional rate determined by the rate determination means, and a video code with an error correction code is added. On the other hand, an error correction code is added to the audio encoded data output from the media data encoding means at an additional rate determined by the rate determining means, and an audio code with an error correction code is output. Error correction code adding means for outputting encoded data, and video coding data with an error correction code output from the error correction code adding means. Data transmission means for transmitting voice encoded data with a data and error correction code to a network, and transmission of video encoded data with error correction code and voice encoded data with error correction code transmitted from the data transmission means If the rate deviates from the allowable range, the media code is provided with rate correction means for instructing the rate determination means to correct the encoding rate and the additional rate, and to keep the transmission rate within the allowable range. Transmission equipment.

The importance discriminating means for discriminating the importance of the video data which is the media data to be encoded, while determining the importance of the audio data which is the media data to be encoded, and the video discriminated by the importance discriminating means. Based on the importance of the data, the coding rate for compressing and encoding the video data is determined, and the addition rate of the error correction code to be added to the video data after compression encoding is determined. Determine the encoding rate when compressing and encoding the audio data based on the importance of the audio data determined by the degree determination means, and add an error correction code to be added to the audio data after compression encoding A rate determining means for determining a rate, and a video after compression encoding the video data at the encoding rate determined by the rate determining means; Video encoded data, which is data, and at the same time, the audio data is compressed and encoded at the encoding rate determined by the rate determining means, and audio encoded data which is audio data after compression encoding is output. Media data encoding means, and media encoding in which video encoded data and audio encoded data output from the media data encoding means are multiplexed, and the video encoded data and audio encoded data are multiplexed Data multiplexing means for outputting data, data alignment means for rearranging video encoded data and audio encoded data in the media encoded data output from the data multiplexing means, and alignment by the data alignment means The rate determining means applies the video encoded data and the audio encoded data in the media encoded data subjected to the replacement. An error correction code adding means for adding an error correction code at the determined addition rate; a data sending means for sending media encoded data to which an error correction code has been added by the error correction code adding means to the network; and the data When the transmission rate of the media encoded data with the error correction code sent from the sending means is out of the allowable range, the rate determining means is instructed to correct the coding rate and the additional rate, and A media encoding transmission apparatus comprising rate correction means for keeping a transmission rate within an allowable range.

The importance determining means, when determining the importance of the media data to be encoded, confirms whether the media data is information constituting the header part or information constituting the data part. The importance level when the media data is information constituting the header portion is set higher than the importance level when the media data is information constituting the data portion. 4. The media encoding transmission apparatus according to claim 1.

The importance level determination means is characterized in that, when the media data to be encoded is video data, the picture type of the video data is confirmed, and the importance level of the video data is determined based on the picture type. The media encoding transmission apparatus according to any one of claims 1 to 3.

The importance determining means determines the importance of the video data based on a motion vector used when the video data is compression encoded when the media data to be encoded is video data. The media encoding transmission apparatus according to any one of claims 1 to 3, wherein the media encoding transmission apparatus is any one of claims 1 to 3.

The importance determining means determines the importance of the video data based on a discrete cosine transform coefficient calculated when the video data is compressed and encoded when the media data to be encoded is video data. The media encoding transmission apparatus according to any one of claims 1 to 3, characterized in that:

When determining the importance of the media data to be encoded, the importance determination means checks whether the media data includes audio data, and determines the importance of the media data when the audio data includes the audio data. 2. The media encoding transmission apparatus according to claim 1, wherein the media encoding transmission device is set to be higher than the importance of media data when no audio data is included.

A communication status collection unit that collects the communication status of the network is provided, and the rate determination unit instructs the rate determination unit to determine an additional rate according to the communication status of the network collected by the communication status collection unit. The media encoding transmission apparatus according to any one of claims 1 to 8.

The rate correction unit instructs the importance level determination unit to redeterminate the importance of the media data when the network communication status collected by the communication status collection unit is deteriorated from the allowable status. 9. The media encoding transmission apparatus according to 9.