JP2003348549A

JP2003348549A - Apparatus and method for generating moving image data with voice

Info

Publication number: JP2003348549A
Application number: JP2002148823A
Authority: JP
Inventors: Hiroki Osaku; 弘樹大削; Koichi Shimizu; 浩一清水; Hidenori Ishida; 秀徳石田
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 2002-05-23
Filing date: 2002-05-23
Publication date: 2003-12-05

Abstract

<P>PROBLEM TO BE SOLVED: To make compact and light in weight an apparatus for generating moving image data with voice. <P>SOLUTION: In an image acquiring part 10, image data associated with an object are acquired and in a voice acquiring part 20, voice data associated with voice uttered by a user of the like are acquired. Then, the image data and the voice data are inputted to a data generating part 40 and in the data generating part 40, the image data and the voice data are compression-coded in the time division manner to generate the moving image data with voice in which the compression-coded image data and the compression-coded voice data are multiplexed. <P>COPYRIGHT: (C)2004,JPO

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、携帯式通信装置等
における音声付き動画像データ生成技術に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a technique for generating moving image data with sound in a portable communication device or the like.

【０００２】[0002]

【従来の技術】従来より、携帯電話等の携帯式通信装置
では、ＣＣＤを有する撮像部から画像データを取得する
とともに、マイクから音声データを取得することによっ
て、音声付き動画像データを生成し、音声付き動画像デ
ータを送受信することができる。2. Description of the Related Art Conventionally, a portable communication device such as a cellular phone obtains moving image data with sound by acquiring image data from an imaging unit having a CCD and acquiring sound data from a microphone. Video data with sound can be transmitted and received.

【０００３】例えば、特開平９−５５９３５号公報に示
す音声付き動画像データ生成装置では、まず、画像デー
タと音声データとを取得する。そして、画像データの圧
縮符号化と、音声データの圧縮符号化とを、それぞれ別
々の圧縮符号化装置において行い、その後、圧縮符号化
された画像データと、圧縮符号化された音声データとを
マルチプレクス装置で多重化して、音声付き動画像デー
タを生成することができる。[0003] For example, in a moving image data with sound generator disclosed in Japanese Patent Application Laid-Open No. 9-55935, first, image data and sound data are acquired. Then, the compression encoding of the image data and the compression encoding of the audio data are respectively performed in separate compression encoding devices, and thereafter, the compression encoded image data and the compression encoded audio data are multiplexed. The video data with sound can be generated by multiplexing in the plex device.

【０００４】[0004]

【発明が解決しようとする課題】しかしながら、上述し
た音声付き動画像データ生成装置では、画像データの圧
縮符号化を行う装置と、音声データの圧縮符号化を行う
装置とを別々に備え、さらに、圧縮符号化された画像デ
ータと圧縮符号化された音声データとを多重化する装置
も備えているため、音声付き動画像データ生成装置の回
路規模が大きくなる。SUMMARY OF THE INVENTION However, the above-described moving picture data with audio data generating apparatus is provided with a device for compressing and encoding image data and a device for compressing and encoding audio data separately. Since a device for multiplexing the compression-encoded image data and the compression-encoded audio data is also provided, the circuit scale of the moving image data generation device with audio becomes large.

【０００５】したがって、上述した音声付き動画像デー
タ生成装置を搭載した携帯式通信装置全体の回路規模も
大きくなるため、携帯式通信装置に対して恒常的に要求
される小型軽量化を図る上で弊害となっている。[0005] Therefore, the circuit size of the entire portable communication device equipped with the above-described moving image data generating device with sound also becomes large, and in order to reduce the size and weight constantly required for the portable communication device. It is an evil.

【０００６】本発明は、上記課題に鑑みてなされたもの
であり、音声付き動画像データ生成装置の小型軽量化を
図ることを第１の目的とする。SUMMARY OF THE INVENTION The present invention has been made in view of the above problems, and a first object of the present invention is to reduce the size and weight of a moving image data generating apparatus with sound.

【０００７】本発明の第２の目的は、音声付き動画像デ
ータ生成装置の小型軽量化を図りつつ、通信相手側での
再生時に発生する音声の途切れを抑制可能な音声付き動
画像データ生成装置を提供することである。A second object of the present invention is to reduce the size and weight of a moving picture data generating apparatus with sound and to suppress a break in the sound generated at the time of reproduction on the communication partner side. It is to provide.

【０００８】[0008]

【課題を解決するための手段】上記の課題を解決するた
めに、請求項１の発明は、画像データを取得する画像取
得手段と、音声データを取得する音声取得手段と、前記
画像データの圧縮符号化と前記音声データの圧縮符号化
とを時分割で行うことによって、圧縮符号化された画像
データと、圧縮符号化された音声データとが多重化され
た音声付き動画像データを生成するデータ生成手段とを
備えることを特徴とする。According to an aspect of the present invention, there is provided an image acquiring apparatus for acquiring image data, audio acquiring means for acquiring audio data, and compression of the image data. Data that generates moving image data with sound by multiplexing compression-coded image data and compression-coded sound data by performing coding and compression coding of the sound data in a time-division manner. Generating means.

【０００９】また、請求項２の発明は、請求項１に記載
の音声付き動画像データ生成装置であって、前記データ
生成手段における前記画像データの圧縮符号化に対する
圧縮負荷と所定の閾値とを比較する負荷検出手段と、前
記圧縮負荷が前記所定の閾値より大きい場合には、前記
音声データを一時記憶データとして一時的に記憶する記
憶手段とをさらに備えることを特徴とする。According to a second aspect of the present invention, there is provided the moving picture data with sound according to the first aspect, wherein a compression load and a predetermined threshold value for the compression encoding of the image data in the data generating means are set. It is characterized by further comprising load detecting means for comparison, and storage means for temporarily storing the audio data as temporary storage data when the compression load is larger than the predetermined threshold.

【００１０】また、請求項３の発明は、請求項２に記載
の音声付き動画像データ生成装置であって、前記圧縮負
荷が前記所定の閾値より小さい場合に、前記記憶手段に
前記一時記憶データが記憶されていれば前記データ生成
手段において前記一時記憶データの圧縮符号化を行い、
前記記憶手段に前記一時記憶データが記憶されていなけ
れば前記データ生成手段において前記音声取得手段で新
たに取得される前記音声データの圧縮符号化を行うよう
に、前記データ生成手段を制御する制御手段、をさらに
備えることを特徴とする。According to a third aspect of the present invention, in the moving image data generating apparatus with audio according to the second aspect, when the compression load is smaller than the predetermined threshold, the temporary storage data is stored in the storage means. If stored, the data generating means performs compression encoding of the temporary storage data,
Control means for controlling the data generation means such that the data generation means performs compression encoding of the audio data newly acquired by the audio acquisition means if the temporary storage data is not stored in the storage means; , Is further provided.

【００１１】また、請求項４の発明は、請求項２に記載
の音声付き動画像データ生成装置であって、前記記憶手
段が、前記音声取得手段で取得される前記音声データを
必ず一時記憶データとして一時的に記憶し、前記音声付
き動画像データ生成装置が、前記圧縮負荷が前記所定の
閾値より小さい場合に、前記記憶手段に記憶された順に
前記一時記憶データを読み出して圧縮符号化を行うよう
に前記データ生成手段を制御する制御手段をさらに備え
ることを特徴とする。According to a fourth aspect of the present invention, there is provided the moving image data generating apparatus with audio according to the second aspect, wherein the storage means always stores the audio data acquired by the audio acquisition means in the temporary storage data. When the compression load is smaller than the predetermined threshold, the moving image data with audio data reads out the temporary storage data in the order stored in the storage unit and performs compression encoding. And a control unit for controlling the data generation unit.

【００１２】また、請求項５の発明は、請求項１に記載
の音声付き動画像データ生成装置であって、前記データ
生成手段における前記画像データの圧縮符号化に対する
圧縮負荷と所定の閾値とを比較する負荷検出手段と、前
記圧縮負荷が所定の閾値より大きい場合には、前記画像
データの圧縮符号化における圧縮率を所定の基準圧縮率
よりも高めるように前記データ生成手段を制御する圧縮
率制御手段とをさらに備えることを特徴とする。According to a fifth aspect of the present invention, in the moving image data generating apparatus with audio according to the first aspect, a compression load and a predetermined threshold value for the compression encoding of the image data in the data generating means are set. Load detecting means for comparing, and when the compression load is greater than a predetermined threshold, a compression rate for controlling the data generating means so as to increase a compression rate in the compression encoding of the image data to be higher than a predetermined reference compression rate. And a control unit.

【００１３】また、請求項６の発明は、請求項２から請
求項５のいずれかに記載の音声付き動画像データ生成装
置であって、前記負荷検出手段が、前記画像取得手段か
ら前記画像データに関する情報を取得し、当該情報に基
づいて前記圧縮負荷を予測することを特徴とする。According to a sixth aspect of the present invention, there is provided the moving image data generating apparatus with sound according to any one of the second to fifth aspects, wherein the load detecting means is configured to output the image data from the image obtaining means. And acquiring the compression load based on the acquired information.

【００１４】また、請求項７の発明は、請求項１に記載
の音声付き動画像データ生成装置であって、ユーザーの
操作に基づいて、前記画像データの圧縮符号化における
圧縮率を所定の基準圧縮率よりも高めるように前記デー
タ生成手段を制御する圧縮率制御手段をさらに備えるこ
とを特徴とする。According to a seventh aspect of the present invention, there is provided the moving image data generating apparatus with sound according to the first aspect, wherein a compression ratio in the compression encoding of the image data is set to a predetermined standard based on a user operation. The image processing apparatus further includes a compression ratio control unit that controls the data generation unit so as to increase the compression ratio higher than the compression ratio.

【００１５】また、請求項８の発明は、画像データを取
得するステップと、音声データを取得するステップと、
前記画像データの圧縮符号化と前記音声データの圧縮符
号化とを時分割で行うことによって、圧縮符号化された
画像データと、圧縮符号化された音声データとが多重化
された音声付き動画像データを生成するステップとを備
えることを特徴とする。[0015] Further, according to the invention of claim 8, a step of obtaining image data, a step of obtaining audio data,
By performing the compression encoding of the image data and the compression encoding of the audio data in a time-division manner, a moving image with audio in which the compression-encoded image data and the compression-encoded audio data are multiplexed. Generating data.

【００１６】[0016]

【発明の実施の形態】以下、本発明を適用した音声付き
動画像データ生成装置の実施例を図面を参照しながら説
明する。BRIEF DESCRIPTION OF THE DRAWINGS FIG. 1 is a block diagram showing an embodiment of a moving image data generating apparatus with sound to which the present invention is applied.

【００１７】＜実施の形態１．＞＜１−１．データ生成装置１の機能ブロック＞図１は、
実施の形態１に係る音声付き動画像データを生成する音
声付き動画像データ生成装置（以下、「データ生成装
置」と称す）１の機能ブロック図である。<Embodiment 1>><1-1. Functional Block of Data Generation Apparatus 1> FIG.
FIG. 2 is a functional block diagram of a moving image data with sound generation device (hereinafter, referred to as a “data generation device”) 1 that generates moving image data with sound according to the first embodiment;

【００１８】図１に示すように、データ生成装置１は、
画像取得部１０、音声取得部２０、音声取得部２０から
データ生成部４０への音声データの出力を制御する音声
データ制御部３０、および画像データと音声データの圧
縮符号化を行うことによって音声付き動画像データを生
成するデータ生成部４０等によって構成される。なお、
図示を省略するが、データ生成装置１の外部には、操作
部が設けられており、ユーザーが操作部を操作すること
によって、画像取得部１０における画像データの取得お
よび音声取得部２０における音声データの取得を開始ま
たは終了する。このデータ生成装置１は、例えば携帯電
話等のモバイル機器に組み込んで使用するほか、他の電
子機器にも利用可能である。As shown in FIG. 1, the data generating device 1
The image acquisition unit 10, the audio acquisition unit 20, the audio data control unit 30 that controls the output of audio data from the audio acquisition unit 20 to the data generation unit 40, and an audio unit by performing compression encoding of image data and audio data. It is configured by a data generation unit 40 that generates moving image data. In addition,
Although not shown, an operation unit is provided outside the data generation device 1, and when the user operates the operation unit, the image data is acquired by the image acquisition unit 10 and the audio data is acquired by the audio acquisition unit 20. Start or end acquisition of. The data generating device 1 can be used by being incorporated in a mobile device such as a mobile phone, for example, and can also be used in other electronic devices.

【００１９】画像取得部１０は、被写体に係る画像デー
タを取得する画像取得手段を構成するものであり、撮像
部１０１と、Ａ／Ｄ変換器１０２とを備える。The image acquiring section 10 constitutes an image acquiring means for acquiring image data of a subject, and includes an image pickup section 101 and an A / D converter 102.

【００２０】撮像部１０１は、レンズとＣＣＤとを備
え、ＣＣＤがレンズにより結像された被写体の光像を、
ＲＧＢの色成分のアナログ画像信号に光電変換して出力
する。Ａ／Ｄ変換器１０２は、撮像部１０１から出力さ
れるアナログ画像信号の各画素信号をデジタル信号に変
換することによって、デジタルデータ化されたフルカラ
ーの画像データを取得して出力する。なお、ここでは、
ＣＣＤが毎秒３０フレーム分のアナログ画像信号を出力
するため、画像取得部１０は毎秒３０フレーム分の画像
データを取得して出力する。The imaging unit 101 includes a lens and a CCD, and the CCD captures an optical image of a subject formed by the lens.
The signal is photoelectrically converted into an analog image signal of RGB color components and output. The A / D converter 102 obtains digital color full-color image data by converting each pixel signal of the analog image signal output from the imaging unit 101 into a digital signal, and outputs the digital data. Here,
Since the CCD outputs an analog image signal for 30 frames per second, the image acquisition unit 10 acquires and outputs image data for 30 frames per second.

【００２１】音声取得部２０は、ユーザー等が発する音
声に係る音声データを取得する音声取得手段を構成する
ものであり、マイク１０３とＡ／Ｄ変換器１０４とを備
える。The voice obtaining section 20 constitutes voice obtaining means for obtaining voice data relating to a voice uttered by a user or the like, and includes a microphone 103 and an A / D converter 104.

【００２２】マイク１０３は、ユーザー等が発する音声
をアナログ音声信号に変換して出力する。Ａ／Ｄ変換器
１０４は、マイク１０３から出力されるアナログ音声信
号をデジタル信号に変換することによって、デジタルデ
ータ化された音声データを取得して出力する。The microphone 103 converts a voice uttered by a user or the like into an analog voice signal and outputs it. The A / D converter 104 obtains digital audio data by converting an analog audio signal output from the microphone 103 into a digital signal, and outputs the digital audio data.

【００２３】音声データ制御部３０は、負荷検出部１０
６、音声データを一時的に記憶するメモリ１０７、メモ
リ１０７に記憶される音声データの容量を検出するメモ
リ検出部１０８、音声取得部２０から出力される音声デ
ータの出力先を切り換えるスイッチ（ＳＷ）回路１０
９、およびＳＷ回路１０９における音声データの出力先
の切り換えを制御する出力制御部１１０等によって構成
される。The voice data control unit 30 includes the load detection unit 10
6. Memory 107 for temporarily storing audio data, memory detection unit 108 for detecting the volume of audio data stored in memory 107, and switch (SW) for switching the output destination of audio data output from audio acquisition unit 20 Circuit 10
9 and an output control unit 110 for controlling the switching of the output destination of the audio data in the SW circuit 109.

【００２４】なお、メモリ１０７を除く音声データ制御
部３０の各部の機能は、専用の電子回路で構成されて実
現されるものでも良いし、マイクロコンピュータ(ＣＰ
Ｕ)においてプログラムによって実現されるようなもの
であっても良い。The function of each part of the audio data control unit 30 except for the memory 107 may be realized by a dedicated electronic circuit or realized by a microcomputer (CP).
It may be realized by a program in U).

【００２５】負荷検出部１０６は、データ生成部４０に
おける画像データおよび音声データの圧縮符号化に対す
る負荷（以下、「圧縮負荷」と称す）と所定の閾値とを
比較する負荷検出手段を構成するものであり、画像デー
タの圧縮符号化に対する圧縮負荷（以下、「画像圧縮負
荷」と称す）が所定の閾値より大きい場合は、その旨を
示す信号（以下、「高負荷信号」と称す）を出力制御部
１１０に出力する。なお、画像圧縮負荷が所定の閾値と
同じ場合は、高負荷信号を出力するか否かのいずれの判
断分岐に含めてもよい（以下の各実施の形態においても
同様）。また、ここでは、後述する１つのパケットを生
成するためのデータ量より、画像データと音声データと
を区別することができる。The load detecting section 106 constitutes load detecting means for comparing a load (hereinafter, referred to as a "compression load") for compressing and encoding image data and audio data in the data generating section 40 with a predetermined threshold value. If the compression load for the compression encoding of the image data (hereinafter, referred to as “image compression load”) is larger than a predetermined threshold, a signal indicating the fact (hereinafter, referred to as “high load signal”) is output. Output to control unit 110. When the image compression load is the same as the predetermined threshold, the image compression load may be included in any of the determination branches as to whether to output the high load signal (the same applies to the following embodiments). Here, the image data and the audio data can be distinguished from each other based on the data amount for generating one packet described later.

【００２６】負荷検出部１０６における圧縮負荷の検出
は、例えば、後述する１つの画像パケットまたは音声パ
ケットを生成するための画像データごとに、データ生成
部４０における圧縮符号化前のデータ量と圧縮符号化後
のデータ量との比を求めることによって実施される。The detection of the compression load by the load detection unit 106 is performed, for example, for each image data for generating one image packet or audio packet, which will be described later, by the data amount before compression encoding in the data generation unit 40 and the compression code. This is performed by determining the ratio to the data amount after the conversion.

【００２７】具体的には、ＭＰＥＧ１やＭＰＥＧ２等の
圧縮符号化規則にしたがって画像データの圧縮符号化を
行う場合には、被写体の動きが大きくなると、圧縮符号
化前のデータ量に対して、圧縮符号化後のデータ量が相
対的に増大する。そして、（圧縮符号化後のデータ量）
／（圧縮符号化前のデータ量）の値が所定の閾値より大
きい場合に、負荷検出部１０６が高負荷信号を出力制御
部１１０に出力する。More specifically, when image data is compression-encoded according to compression-encoding rules such as MPEG1 and MPEG2, if the movement of a subject increases, the amount of data before compression-encoding is reduced. The data amount after encoding relatively increases. And (the amount of data after compression encoding)
When the value of / (data amount before compression encoding) is larger than a predetermined threshold, the load detection unit 106 outputs a high load signal to the output control unit 110.

【００２８】また、負荷検出部１０６は、データ生成部
４０における画像データおよび音声データの圧縮符号化
の開始後、圧縮負荷を一定期間全く検出しなくなると、
一連の音声付き動画像データを生成するための画像デー
タおよび音声データの圧縮符号化が完了した旨を示す信
号（以下、「符号化完了信号」）を出力制御部１１０に
出力する。If the load detection unit 106 does not detect any compression load for a certain period of time after the start of the compression encoding of the image data and the audio data in the data generation unit 40,
A signal indicating that compression encoding of image data and audio data for generating a series of moving image data with audio has been completed (hereinafter, “encoding completion signal”) is output to output control section 110.

【００２９】メモリ１０７は、負荷検出部１０６によっ
て検出される画像圧縮負荷が所定の閾値より大きい場合
に、所定の期間（ここでは、０．５秒間）ごとに音声取
得部２０で取得される音声データの全部若しくは一部を
一時記憶データとして一時的に記憶する記憶手段を構成
するものである。When the image compression load detected by the load detection unit 106 is larger than a predetermined threshold, the memory 107 stores the audio acquired by the audio acquisition unit 20 every predetermined period (here, 0.5 seconds). It constitutes storage means for temporarily storing all or a part of the data as temporary storage data.

【００３０】メモリ検出部１０８は、メモリ１０７に記
憶された一時記憶データの容量を検出し、メモリ１０７
に一時記憶データが記憶されている場合には、その旨を
示す信号（以下、「データ有り信号」と称す）を出力制
御部１１０に出力する。The memory detecting section 108 detects the capacity of the temporarily stored data stored in the memory 107,
If the temporary storage data is stored in the output control unit 110, a signal indicating this (hereinafter, referred to as a “data presence signal”) is output to the output control unit 110.

【００３１】ＳＷ（スイッチ）回路１０９は、音声取得
部２０から音声データを受け取り、出力制御部１１０の
制御の下で、受け取った音声データをデータ生成部４０
またはメモリ１０７に出力する。また、ＳＷ回路１０９
は、出力制御部１１０の制御の下で、メモリ１０７に記
憶される一時記憶データを早期に記憶されたものから順
に読み出して、データ生成部４０に出力する。The SW (switch) circuit 109 receives audio data from the audio acquisition unit 20 and, under the control of the output control unit 110, converts the received audio data into the data generation unit 40.
Alternatively, the data is output to the memory 107. Also, the SW circuit 109
Under the control of the output control unit 110, the temporary storage data stored in the memory 107 is sequentially read out from the one stored earlier and output to the data generation unit 40.

【００３２】出力制御部１１０は、負荷検出部１０６か
ら入力される高負荷信号と、メモリ検出部１０８から入
力されるデータ有り信号とに基づいて、ＳＷ回路１０９
における音声データの出力先の切り換えを制御する。The output control unit 110 determines the SW circuit 109 based on the high load signal input from the load detection unit 106 and the data presence signal input from the memory detection unit 108.
The switching of the output destination of the audio data in is controlled.

【００３３】具体的には、出力制御部１１０は、データ
生成部４０の画像圧縮負荷が所定の閾値より大きい場合
には、負荷検出部１０６から入力される高負荷信号に対
応して、ＳＷ回路１０９がデータ生成部４０に音声デー
タを出力することなく、音声データをメモリ１０７に出
力するように制御する。Specifically, when the image compression load of the data generation unit 40 is larger than a predetermined threshold, the output control unit 110 responds to the high load signal input from the load detection unit 106 by using the SW circuit. The control unit 109 controls the audio data to be output to the memory 107 without outputting the audio data to the data generation unit 40.

【００３４】また、出力制御部１１０は、データ生成部
４０の画像圧縮負荷と所定の閾値とを比較し、画像圧縮
負荷が当該閾値より小さい場合には、メモリ１０７に一
時記憶データが記憶されていれば、メモリ検出部１０８
から入力されるデータ有り信号に対応して、ＳＷ回路１
０９がメモリ１０７から一時記憶データを読み出してデ
ータ生成部４０に出力するとともに、音声取得部２０か
ら当該ＳＷ回路１０９に新たに入力される音声データを
メモリ１０７に出力するように制御する。一方、メモリ
１０７に一時記憶データが記憶されていなければ、ＳＷ
回路１０９が音声取得部２０から当該ＳＷ回路１０９に
新たに入力される音声データをデータ生成部４０に出力
するように制御する。The output control section 110 compares the image compression load of the data generation section 40 with a predetermined threshold value. If the image compression load is smaller than the threshold value, the memory 107 stores temporary storage data. If so, the memory detection unit 108
Circuit 1 corresponding to the data presence signal input from
09 reads out the temporarily stored data from the memory 107 and outputs it to the data generation unit 40, and controls so that the voice data newly input to the SW circuit 109 from the voice acquisition unit 20 is output to the memory 107. On the other hand, if the temporary storage data is not stored in the memory 107,
The circuit 109 controls the audio data newly input to the SW circuit 109 from the audio acquisition unit 20 to be output to the data generation unit 40.

【００３５】また、出力制御部１１０は、音声付き動画
像データの生成時に、負荷検出部１０６から符号化完了
信号が入力されると、一連の音声付き動画像データの生
成が完了したものと判別し、音声データの出力制御を終
了する。When generating the moving image data with sound and receiving the encoding completion signal from the load detecting unit 106 at the time of generating the moving image data with sound, the output control unit 110 determines that the generation of the series of moving image data with sound is completed. Then, the output control of the audio data ends.

【００３６】したがって、画像圧縮負荷が所定の閾値よ
り大きい場合は、音声取得部２０からＳＷ回路１０９に
入力された音声データを、ＳＷ回路１０９からデータ生
成部４０に出力させず、メモリ１０７に出力させ、メモ
リ１０７にその音声データを一時記憶データとして一時
的に記憶するように出力制御部１１０が制御する。Therefore, when the image compression load is larger than the predetermined threshold, the audio data input from the audio acquisition unit 20 to the SW circuit 109 is not output from the SW circuit 109 to the data generation unit 40 but is output to the memory 107. Then, the output control unit 110 controls the memory 107 to temporarily store the audio data as temporary storage data.

【００３７】また、画像圧縮負荷が所定の閾値より小さ
い場合は、メモリ１０７に一時記憶データが記憶されて
いれば、ＳＷ回路１０９が、メモリ１０７から一時記憶
データを読み出してデータ生成部４０に出力し、データ
生成部４０において、一時記憶データの圧縮符号化を行
うように出力制御部１１０が制御する。一方、メモリ１
０７に一時記憶データが記憶されていなければ、ＳＷ回
路１０９が、音声取得部２０で新たに取得されて当該Ｓ
Ｗ回路１０９に入力される音声データをデータ生成部４
０に出力し、データ生成部４０において、その音声デー
タの圧縮符号化を行うように出力制御部１１０が制御す
る。When the image compression load is smaller than the predetermined threshold, if the temporarily stored data is stored in the memory 107, the SW circuit 109 reads the temporarily stored data from the memory 107 and outputs it to the data generator 40. Then, the output control unit 110 controls the data generation unit 40 to perform compression encoding of the temporarily stored data. On the other hand, memory 1
If the temporary storage data is not stored in 07, the SW circuit 109 is newly acquired by the voice
The voice data input to the W circuit 109 is output to the data generation unit 4
0, and the output control unit 110 controls the data generation unit 40 to compress and encode the audio data.

【００３８】なお、音声データ制御部３０からデータ生
成部４０への音声データの出力制御のタイミング等の詳
細については、音声付き動画像データの生成、および音
声データのデータ生成部４０への出力制御フローについ
て説明する際に詳述する。For details of the timing of the output control of the audio data from the audio data control unit 30 to the data generation unit 40, see the description of the generation of the moving image data with audio and the output control of the audio data to the data generation unit 40. This will be described in detail when describing the flow.

【００３９】データ生成部４０は、図示を省略するが、
ＣＰＵ、画像データの圧縮符号化を行う際に画像データ
を一時的に記憶する画像メモリ、当該データ生成部４０
に入力された音声データを一時的に記憶して圧縮符号化
のタイミングを調節するためのＲＡＭ、データ生成部４
０における各種処理を統括制御するプログラムを格納す
るＲＯＭとを有し、時分割多重によって多重化された音
声付き動画像データを生成するデータ生成手段として構
成される。The data generator 40 is not shown, but
CPU, image memory for temporarily storing image data when performing compression encoding of image data, data generation unit 40
RAM for temporarily storing audio data input to the RAM and adjusting the timing of compression encoding, and a data generation unit 4
And a ROM for storing a program for comprehensively controlling various processes in the O.0, and is configured as data generation means for generating audio-added moving image data multiplexed by time division multiplexing.

【００４０】また、データ生成部４０は、画像データに
関しては、ＭＰＥＧ１やＭＰＥＧ２といった圧縮符号化
規則に基づいて圧縮符号化を行い、音声データに関して
は、ＭＰＥＧ勧告に従い圧縮符号化を行うことによっ
て、音声付き動画像データを作成し、送信部１３０を介
して、通信相手の携帯式通信装置等に音声付き動画像デ
ータを送信する。The data generating section 40 performs compression coding on image data based on compression coding rules such as MPEG1 and MPEG2, and performs compression coding on audio data according to the MPEG recommendation. The moving image data with sound is created, and the moving image data with sound is transmitted to the portable communication device or the like of the communication partner via the transmission unit 130.

【００４１】また、データ生成部４０では、１回目の画
像データの圧縮符号化の開始時間を基準として、２回目
以降の画像データの圧縮符号化を０．５×Ｎ秒後（Ｎは
任意の自然数）から開始し、ＲＡＭに記憶された音声デ
ータの圧縮符号化を０．４＋０．５×Ｎ秒後（Ｎは０以
上の整数）から開始すべくＲＡＭに記憶された音声デー
タをＣＰＵに出力するように制御される。音声付き動画
像データの生成については、後程詳述する。Further, the data generation unit 40 performs the compression encoding of the second and subsequent image data 0.5 × N seconds later (where N is an arbitrary number) based on the start time of the first compression encoding of the image data. (Natural number), and outputs the audio data stored in the RAM to the CPU to start compression encoding of the audio data stored in the RAM after 0.4 + 0.5 × N seconds (N is an integer of 0 or more). Is controlled. Generation of moving image data with sound will be described later in detail.

【００４２】＜１−２．音声付き動画像データの生成＞＜１−２−１．音声付き動画像データのデータ構造＞図
２は、データ生成装置１において生成される、画像デー
タと音声データとが時分割多重によって多重化された音
声付き動画像データのデータ構造を示す模式図であり、
データ生成部４０は、図２に示す音声付き動画像データ
を前から順に（図中では左から順に）生成する。<1-2. Generation of Moving Image Data with Sound><1-2-1. Data Structure of Moving Picture Data with Sound> FIG. 2 is a schematic diagram showing a data structure of moving picture data with sound generated by the data generation device 1 and obtained by multiplexing image data and sound data by time division multiplexing. Yes,
The data generation unit 40 generates the moving image data with sound shown in FIG. 2 in order from the front (in the figure, from the left).

【００４３】図２に示すように、音声付き動画像データ
は、パックの集まり（Ｐｋ１，Ｐｋ２，Ｐｋ３，・・
・）で構成される。また、１つのパックは、シーケンス
ヘッダＳＨと、複数のパケットＰｔ１，Ｐｔ２，・・
・，ＰｔＮとから構成され、さらに、１つのパケット
は、パケットヘッダＰＨと、画像または音声の個別のス
トリームを分割したエレメンタリーストリームＥＳとか
ら構成される。As shown in FIG. 2, moving image data with sound is composed of a group of packs (Pk1, Pk2, Pk3,...).
・) One pack includes a sequence header SH and a plurality of packets Pt1, Pt2,.
, PtN, and one packet is composed of a packet header PH and an elementary stream ES obtained by dividing an individual image or audio stream.

【００４４】シーケンスヘッダＳＨには、画像と音声と
の同期再生を図るための基準時間（ＳＲＣ）、画面の大
きさ、画面のアスペクト比、画像の表示周期等の情報が
格納される。また、パケットヘッダＰＨには、画像と音
声との同期再生を図るための時間情報ＰＴＳ，ＤＴＳ、
エレメンタリーストリームＥＳ中のデータの特性情報が
格納される。The sequence header SH stores information such as a reference time (SRC) for synchronizing image and sound reproduction, a screen size, a screen aspect ratio, an image display cycle, and the like. In the packet header PH, time information PTS, DTS for synchronizing image and sound reproduction,
The characteristic information of the data in the elementary stream ES is stored.

【００４５】そして、１つのエレメンタリーストリーム
ＥＳには、画像のみ、または音声のみに関するデータが
格納される。つまり、１つのパケットは画像データのみ
に関するデータが格納されるパケット（以下、「画像パ
ケット」と称す）、または音声データのみに関するデー
タが格納されるパケット（以下、「音声パケット」と称
す）となる。Then, one elementary stream ES stores data relating to only images or only audio. That is, one packet is a packet in which data relating to only image data is stored (hereinafter, referred to as “image packet”) or a packet in which data relating to only audio data is stored (hereinafter, referred to as “audio packet”). .

【００４６】なお、通信相手側の装置では、図２に示す
ような画像データと音声データとが多重化された音声付
き動画像データから、画像データと音声データとをそれ
ぞれ分離する。そして、画像データ用の記憶部と、音声
データ用の記憶部とに、それぞれある程度のデータを蓄
積しつつ、シーケンスヘッダＳＨに格納される基準時間
（ＳＲＣ）、パケットヘッダＰＨに格納される時間情報
ＰＴＳ，ＤＴＳに基づいて、同期した画像データと音声
データとから動画像と音声との同期再生を行う。The communication partner apparatus separates image data and audio data from audio-added moving image data in which image data and audio data are multiplexed as shown in FIG. The reference time (SRC) stored in the sequence header SH and the time information stored in the packet header PH are stored while storing a certain amount of data in each of the image data storage unit and the audio data storage unit. Based on the PTS and DTS, synchronous playback of a moving image and audio is performed from synchronized image data and audio data.

【００４７】＜１−２−２．圧縮符号化のタイミング＞
次に、データ生成部４０における画像データと音声デー
タの圧縮符号化のタイミングについて説明する。<1-2-2. Timing of compression encoding>
Next, the timing of compression encoding of image data and audio data in the data generation unit 40 will be described.

【００４８】データ生成部４０は、０．５秒間に取得さ
れる画像データに相当する１５フレーム分の画像データ
を画像メモリに一時的に記憶しつつ、画像データの圧縮
符号化を行う。一方、画像データの圧縮符号化の間、当
該データ生成部４０に入力される音声データをＲＡＭに
記憶しつつ、動画像に途切れが生じないように、画像デ
ータの圧縮符号化の合間に、ＲＡＭに記憶された音声デ
ータの圧縮符号化を行うように構成されている。The data generating section 40 compresses and encodes image data while temporarily storing image data for 15 frames corresponding to image data acquired for 0.5 second in the image memory. On the other hand, during the compression encoding of the image data, the audio data input to the data generation unit 40 is stored in the RAM, and the RAM is stored between the compression encoding of the image data so that the moving image is not interrupted. Is configured to perform compression encoding of the audio data stored in the.

【００４９】図３は、データ生成部４０における画像デ
ータおよび音声データの圧縮符号化のタイミングを説明
するタイミングチャートである。なお、ここでは、横軸
が時間を示しており、１回目の画像データの圧縮符号化
の時間帯Ｐ１の開始時間を基準（０［ｓｅｃ］）として
時間を付している。FIG. 3 is a timing chart for explaining the timing of compression encoding of image data and audio data in the data generation section 40. Here, the horizontal axis indicates time, and time is given with the start time of the time zone P1 of the first compression encoding of image data as a reference (0 [sec]).

【００５０】また、上述したように、データ生成装置１
では、データ生成部４０における画像データの圧縮符号
化に対する画像圧縮負荷が所定の閾値より小さい場合
と、所定の閾値より大きいときを含む場合とで、データ
生成部４０への音声データの出力制御が異なってくるた
め、データ生成部４０における圧縮符号化のタイミング
も異なる。As described above, the data generation device 1
In the above, the output control of the audio data to the data generation unit 40 is controlled depending on whether the image compression load for the compression encoding of the image data in the data generation unit 40 is smaller than a predetermined threshold and when the image compression load is larger than the predetermined threshold. Since they differ, the timing of compression encoding in the data generation unit 40 also differs.

【００５１】そこで、図３（ａ）はデータ生成部４０に
おける画像データの圧縮符号化に対する画像圧縮負荷が
所定の閾値より小さい場合のタイミングチャートを示
し、図３（ｂ）は画像圧縮負荷が所定の閾値より大きい
ときを含む場合の問題点を説明するタイミングチャート
を示し、さらに、図３（ｃ）は画像圧縮負荷が所定の閾
値より大きいときを含む場合のタイミングチャートを示
している。FIG. 3A shows a timing chart when the image compression load for the compression encoding of the image data in the data generator 40 is smaller than a predetermined threshold value, and FIG. FIG. 3C is a timing chart illustrating a problem in a case where the image compression load is larger than a predetermined threshold, and FIG. 3C is a timing chart in a case where the image compression load is larger than a predetermined threshold.

【００５２】また、図４は、音声付き動画像データのデ
ータ構造を説明する図であり、図４（ａ）は画像圧縮負
荷が所定の閾値より小さい場合に生成されるパケットの
配列について示しており、図４（ｂ）は画像圧縮負荷が
所定の閾値より大きいときを含む場合に生成されるパケ
ットの配列について示している。FIG. 4 is a view for explaining the data structure of moving image data with sound. FIG. 4A shows an arrangement of packets generated when the image compression load is smaller than a predetermined threshold. FIG. 4B shows an arrangement of packets generated when the image compression load includes a case where the image compression load is larger than a predetermined threshold.

【００５３】以下、画像圧縮負荷が所定の閾値より小さ
い場合と、画像圧縮負荷が所定の閾値より大きいときを
含む場合とにおける音声付き動画像データの生成につい
て、図３および図４を参照しつつ説明する。Hereinafter, the generation of moving image data with sound when the image compression load is smaller than the predetermined threshold value and when the image compression load includes a case where the image compression load is larger than the predetermined threshold value will be described with reference to FIGS. 3 and 4. explain.

【００５４】＜１−２−３．画像圧縮負荷が所定の閾値
より小さい場合＞上述したように、データ生成部４０で
は、１回目の画像データの圧縮符号化の時間帯Ｐ１の開
始時間を基準とすると、２回目以降の画像データの圧縮
符号化を０．５×Ｎ秒後（Ｎは任意の自然数）から開始
し、音声データの圧縮符号化を０．４＋０．５×Ｎ秒後
（Ｎは０以上の整数）から開始するように制御されてい
る。したがって、ここでは、図３（ａ）に示すように、
画像データの圧縮符号化と音声データの圧縮符号化とが
交互に時分割で行われる。<1-2-3. When the image compression load is smaller than the predetermined threshold> As described above, the data generation unit 40 sets the start time of the time zone P1 of the compression encoding of the first image data as a reference, The compression encoding is started after 0.5 × N seconds (N is an arbitrary natural number), and the compression encoding of the audio data is started after 0.4 + 0.5 × N seconds (N is an integer of 0 or more). Is controlled. Therefore, here, as shown in FIG.
The compression encoding of the image data and the compression encoding of the audio data are alternately performed in a time division manner.

【００５５】具体的には、図３（ａ）に示すように、画
像データの圧縮符号化の時間帯Ｐ１〜Ｐ４のそれぞれに
おいて、０．５秒間に画像取得部１０で取得される画像
データに相当する１５フレーム分の画像データの圧縮符
号化を行って１つの画像パケットを生成する。一方で
は、音声データの圧縮符号化の時間帯Ｓ１〜Ｓ４のそれ
ぞれにおいて、０．５秒間に音声取得部２０で取得され
る音声データの圧縮符号化を行って１つの音声パケット
を生成する。More specifically, as shown in FIG. 3A, in each of the time zones P1 to P4 of the compression encoding of the image data, the image data acquired by the One image packet is generated by compression-encoding the corresponding 15 frames of image data. On the other hand, in each of the time zones S1 to S4 of the audio data compression encoding, the audio data acquired by the audio acquisition unit 20 is subjected to the compression encoding for 0.5 second to generate one audio packet.

【００５６】なお、ここでは、時間帯Ｓ１〜Ｓ４におい
て圧縮符号化される音声データは、それぞれ、時間帯Ｓ
１〜Ｓ４の直前まで音声取得部２０で取得される０．５
秒間の音声に相当する音声データとなっている。Here, the audio data to be compression-encoded in the time zones S1 to S4 respectively correspond to the time zones S1 to S4.
0.5 obtained by the voice obtaining unit 20 immediately before 1 to S4
The voice data is equivalent to the voice of seconds.

【００５７】つまり、データ生成部４０では、異なる時
間帯Ｐ１〜Ｐ４，Ｓ１〜Ｓ４のうち、異なる時間帯Ｐ１
〜Ｐ４のそれぞれにおいて、画像取得部１０から入力さ
れる画像データの圧縮符号化、および音声取得部２０か
らＳＷ回路１０９を介して入力される音声データの圧縮
符号化のうちの一方である画像データの圧縮符号化のみ
を選択的に行う。また、異なる時間帯Ｓ１〜Ｓ４のそれ
ぞれにおいて、画像取得部１０から入力される画像デー
タの圧縮符号化、および音声取得部２０からＳＷ回路１
０９を介して入力される音声データの圧縮符号化のうち
の他方である音声データの圧縮符号化のみを選択的に行
う。In other words, the data generation section 40 selects different time zones P1 out of the different time zones P1 to P4 and S1 to S4.
In each of P4 to P4, image data that is one of compression encoding of image data input from the image acquisition unit 10 and compression encoding of audio data input from the audio acquisition unit 20 via the SW circuit 109 Is selectively performed. In each of the different time zones S1 to S4, compression encoding of the image data input from the image acquisition unit 10 and the SW circuit 1
Only the compression encoding of the audio data which is the other of the compression encoding of the audio data input via the input terminal 09 is selectively performed.

【００５８】したがって、ここでは、例えば図４（ａ）
に示すように、圧縮符号化された画像データと音声デー
タとが多重化され、画像パケットＰＰ１，音声パケット
ＳＰ１，画像パケットＰＰ２，音声パケットＳＰ２，画
像パケットＰＰ３，音声パケットＳＰ３，画像パケット
ＰＰ４，音声パケットＳＰ４，・・・の順に、画像パケ
ットと音声パケットとが交互に配列する音声付き動画像
データを時分割で生成する。Therefore, here, for example, FIG.
As shown in the figure, the compression-encoded image data and audio data are multiplexed, and the image packet PP1, audio packet SP1, image packet PP2, audio packet SP2, image packet PP3, audio packet SP3, image packet PP4, audio The moving image data with sound in which the image packet and the sound packet are alternately arranged in the order of the packets SP4,.

【００５９】＜１−２−４．画像圧縮負荷が所定の閾値
より大きいときを含む場合＞ＭＰＥＧ１やＭＰＥＧ２等
の圧縮符号化規則に従った圧縮符号化では、被写体の動
きが著しく大きな場合等は、データ生成部４０における
画像データの圧縮符号化にかかる負荷が大きくなる。し
たがって、もしも、画像取得部１０から出力される画像
データと、音声取得部２０から出力される音声データと
が、そのままデータ生成部４０に入力されると、例え
ば、データ生成部４０における画像データの圧縮符号化
にかかる負荷が著しく大きな場合は、図３（ｂ）に示す
ように、画像データの圧縮符号化の時間帯Ｐ２が長くな
り、時間帯Ｐ２が、ＲＡＭから出力される音声データを
圧縮符号化すべき時間帯Ｓ２にまで重なる状態となる。<1-2-4. The case where the image compression load is greater than a predetermined threshold> In the compression encoding according to the compression encoding rules such as MPEG1 and MPEG2, when the motion of the subject is extremely large, the data generation unit 40 compresses the image data. The load on encoding increases. Therefore, if the image data output from the image acquisition unit 10 and the audio data output from the audio acquisition unit 20 are directly input to the data generation unit 40, for example, When the load on the compression encoding is extremely large, as shown in FIG. 3B, the time zone P2 of the compression encoding of the image data becomes long, and the time zone P2 compresses the audio data output from the RAM. The state overlaps with the time zone S2 to be encoded.

【００６０】図３（ｂ）に示すような状態になると、デ
ータ生成部４０では動画像が途切れないように音声付き
動画像データを生成するため、時間帯Ｓ２において圧縮
符号化すべき音声データが０．９秒にＲＡＭから出力さ
れても、時間帯Ｐ２における画像データの圧縮符号化が
行われているため、時間帯Ｓ２において圧縮符号化すべ
き音声データが圧縮符号化されずに消滅してしまい、通
信相手側での再生時に音声の途切れを生じるような音声
付き動画像データを生成してしまう。In the state shown in FIG. 3B, the data generation unit 40 generates moving image data with sound so that the moving image is not interrupted. Even if the image data is output from the RAM at .9 seconds, since the image data is compressed and encoded in the time zone P2, the audio data to be compression-encoded in the time zone S2 disappears without being compressed and encoded. The moving image data with sound that generates a break in the sound during reproduction on the communication partner side is generated.

【００６１】しかし、本発明の実施の形態１に係るデー
タ生成装置１では、ある画像パケットを生成する際に、
画像圧縮負荷が所定の閾値より大きくなると、出力制御
部１１０の制御の下で、生成中の画像パケットの次に音
声パケットを生成すべく音声取得部２０からデータ生成
部４０に出力する音声データをＳＷ回路１０９からデー
タ生成部４０に出力せず、その音声データをメモリ１０
７に出力して、一時記憶データとしてメモリ１０７に一
時的に記憶する。そして、その後に画像パケットを生成
する際に、画像圧縮負荷が所定の閾値より小さくなる
と、メモリ１０７に一時的に記憶した一時記憶データを
データ生成部４０に出力するように構成されている。However, in the data generating apparatus 1 according to the first embodiment of the present invention, when generating a certain image packet,
When the image compression load exceeds a predetermined threshold, under the control of the output control unit 110, audio data output from the audio acquisition unit 20 to the data generation unit 40 to generate an audio packet next to the image packet being generated is output. The audio data is not output from the SW circuit 109 to the data generation unit 40, but is output to the memory 10
7 and temporarily stored in the memory 107 as temporary storage data. Then, when an image packet is subsequently generated, if the image compression load becomes smaller than a predetermined threshold, the temporary storage data temporarily stored in the memory 107 is output to the data generation unit 40.

【００６２】具体的には、例えば、図３（ｃ）に示すよ
うに、画像データの圧縮符号化の時間帯Ｐ２が長くなり
すぎるような場合には、時間帯Ｐ２に画像圧縮負荷が所
定の閾値より大きくなるため、時間帯Ｓ２において圧縮
符号化すべき音声データを一時記憶データとしてメモリ
１０７に一時的に記憶する。Specifically, for example, as shown in FIG. 3C, when the time zone P2 of the compression encoding of the image data becomes too long, the image compression load is set to a predetermined value in the time zone P2. Since the value becomes larger than the threshold value, audio data to be compression-encoded in the time zone S2 is temporarily stored in the memory 107 as temporary storage data.

【００６３】その後、画像圧縮負荷が所定の閾値より小
さくなった時間帯Ｐ３の後に、メモリ１０７に記憶した
音声データをＳＷ回路１０９を介してデータ生成部４０
に出力し、時間帯ＳＡ２において時間帯Ｓ２で圧縮符号
化すべき音声データの圧縮符号化を行う。また、このと
き、時間帯Ｓ３で圧縮符号化すべき音声データを一時記
憶データとしてメモリ１０７に一時的に記憶し、時間帯
Ｐ４の後に、メモリ１０７に記憶した一時記憶データを
ＳＷ回路１０９を介してデータ生成部４０に出力し、時
間帯ＳＡ３において時間帯Ｓ３で圧縮符号化すべき音声
データの圧縮符号化を行う。Thereafter, after the time period P3 during which the image compression load has become smaller than the predetermined threshold, the audio data stored in the memory 107 is transferred to the data generator 40 via the SW circuit 109.
To perform compression encoding of audio data to be compression-encoded in the time zone S2 in the time zone SA2. At this time, audio data to be compression-encoded in the time zone S3 is temporarily stored in the memory 107 as temporary storage data, and after the time zone P4, the temporarily stored data stored in the memory 107 is transferred via the SW circuit 109. The data is output to the data generation unit 40, and compression encoding of audio data to be compression-coded in the time zone S3 is performed in the time zone SA3.

【００６４】なお、ここでは、実際に時間帯Ｐ２におい
て画像圧縮負荷が高くなった後に、出力制御部１１０が
音声データをＳＷ回路１０９からメモリ１０７に出力す
るように制御するため、時間帯Ｓ２において圧縮符号化
すべき音声データのうち、音声データをＳＷ回路１０９
からメモリ１０７に出力するように制御するまでに、既
にデータ生成部４０に出力された音声データは消失する
が、その消失部分は比較的少なくて済むため、音声デー
タの消失を減少させることができる。Here, the output control unit 110 controls the audio data to be output from the SW circuit 109 to the memory 107 after the image compression load actually increases in the time zone P2. Of the audio data to be compression-encoded, the audio data is
The audio data already output to the data generation unit 40 is lost before the control is performed to output the data to the memory 107. However, since the lost part is relatively small, the loss of the audio data can be reduced. .

【００６５】したがって、ここでは、データ生成部４０
から出力される音声付き動画像データは、図４（ｂ）に
示すように、画像パケットＰＰ１、音声パケットＳＰ
１、画像パケットＰＰ２、画像パケットＰＰ３、音声パ
ケットＳＰ２、画像パケットＰＰ４、音声パケットＳＰ
３、・・・の順にパケットが配列された状態となる。Therefore, here, the data generation unit 40
As shown in FIG. 4B, the video data with audio output from the
1, image packet PP2, image packet PP3, audio packet SP2, image packet PP4, audio packet SP
The packets are arranged in the order of 3,.

【００６６】＜１−３．音声データのデータ生成部４０
への出力制御フロー＞次に、音声データのデータ生成部
４０への出力制御フローについて説明する。<1-3. Data generator 40 for audio data
Output Control Flow> Next, an output control flow of audio data to the data generation unit 40 will be described.

【００６７】図５は、データ生成装置１における音声デ
ータのデータ生成部４０への出力制御を説明するフロー
チャートである。なお、ここでは、出力制御部１１０に
よる制御に基づいて音声データのデータ生成部４０への
出力が制御される。FIG. 5 is a flowchart for explaining the output control of the audio data to the data generator 40 in the data generator 1. Here, the output of the audio data to the data generator 40 is controlled based on the control by the output controller 110.

【００６８】まず、音声付き動画像データを生成すべ
く、ユーザーの操作に基づいて、画像取得部１０におけ
る画像データの取得と、音声取得部２０における音声デ
ータの取得とを開始することによって、音声データ制御
部３０、つまりは、出力制御部１１０における音声デー
タの出力制御が開始し、ステップＳＴ１に進む。なお、
ここでは、出力制御部１１０における音声データの出力
制御が開始した時点では、音声取得部２０からＳＷ回路
１０９に新たに入力される音声データをデータ生成部４
０に入力するように制御されているものとする。First, in order to generate the moving image data with sound, the acquisition of the image data by the image acquisition unit 10 and the acquisition of the sound data by the audio acquisition unit 20 are started based on the operation of the user. The output control of the audio data in the data control unit 30, that is, the output control unit 110 is started, and the process proceeds to step ST1. In addition,
Here, when the output control of the audio data in the output control unit 110 is started, the audio data newly input from the audio acquisition unit 20 to the SW circuit 109 is output to the data generation unit 4.
It is assumed that it is controlled to input 0.

【００６９】ステップＳＴ１では、データ生成部４０に
おける圧縮符号化の負荷が高いか否かを判別する。ここ
では、負荷検出部１０６から高負荷信号が出力制御部１
１０に入力されていると、データ生成部４０における圧
縮符号化の負荷が高いと判別されて、ステップＳＴ２に
進み、また、負荷検出部１０６から高負荷信号が出力制
御部１１０に入力されていなければ、データ生成部４０
における圧縮符号化の負荷は低いと判別されて、ステッ
プＳＴ３に進む。In step ST1, it is determined whether or not the load of the compression encoding in the data generation unit 40 is high. Here, a high load signal is output from the load detection unit 106 to the output control unit 1.
If it has been input to the control unit 10, it is determined that the load of the compression encoding in the data generation unit 40 is high, and the process proceeds to step ST <b> 2, and the high load signal from the load detection unit 106 must be input to the output control unit 110. For example, the data generation unit 40
Is determined to be low, and the process proceeds to step ST3.

【００７０】ステップＳＴ２では、現在生成中の画像パ
ケットの後に音声パケットを生成すべく音声取得部２０
からＳＷ回路１０９に入力される音声データをメモリ１
０７に一時記憶データとして記憶するように制御し、さ
らに、次の音声パケットを生成すべく音声取得部２０か
らＳＷ回路１０９に音声データが入力される直前に、ス
テップＳＴ６に進む。In step ST2, the audio acquisition unit 20 generates an audio packet after the image packet currently being generated.
From the memory 1 to the SW 1
07 is stored as temporary storage data, and the process proceeds to step ST6 immediately before audio data is input from the audio acquisition unit 20 to the SW circuit 109 to generate the next audio packet.

【００７１】ステップＳＴ３では、メモリ１０７内に一
時記憶データが記憶されているか否かを判別する。ここ
では、メモリ検出部１０８からデータ有り信号が出力制
御部１１０に入力されていると、メモリ１０７内に一時
記憶データが記憶されていると判別されて、ステップＳ
Ｔ４に進み、また、メモリ検出部１０８からデータ有り
信号が出力制御部１１０に入力されていなければ、メモ
リ１０７内に一時記憶データが記憶されていないと判別
されて、ステップＳＴ５に進む。In step ST3, it is determined whether or not temporary storage data is stored in memory 107. Here, when the data presence signal is input from the memory detection unit 108 to the output control unit 110, it is determined that the temporarily stored data is stored in the memory 107, and the process proceeds to step S
The process proceeds to T4, and if a data presence signal has not been input from the memory detection unit 108 to the output control unit 110, it is determined that temporary storage data is not stored in the memory 107, and the process proceeds to step ST5.

【００７２】ステップＳＴ４では、メモリ１０７に記憶
された一時記憶データのうち、最も早い時期に記憶され
た一時記憶データをＳＷ回路１０９を介してデータ生成
部４０に出力するとともに、音声取得部２０から新たに
入力される音声データをメモリ１０７に記憶するように
制御し、ステップＳＴ６に進む。なお、ここで出力され
る一時記憶データは、１つの音声パケットを生成するた
めの音声データにあたる。In step ST 4, among the temporary storage data stored in the memory 107, the temporary storage data stored at the earliest time is output to the data generator 40 via the SW circuit 109, and Control is performed so that newly input voice data is stored in the memory 107, and the process proceeds to step ST6. Note that the temporarily stored data output here corresponds to audio data for generating one audio packet.

【００７３】ステップＳＴ５では、音声取得部２０から
ＳＷ回路１０９に新たに入力される音声データをデータ
生成部４０に出力するように制御し、ステップＳＴ６に
進む。In step ST5, control is performed so that voice data newly input from the voice obtaining unit 20 to the SW circuit 109 is output to the data generating unit 40, and the process proceeds to step ST6.

【００７４】ステップＳＴ６では、一連の音声付き動画
像データの生成が完了したか否かについて判別する。こ
こでは、負荷検出部１０６から出力制御部１１０に符号
化完了信号が入力されていると、一連の音声付き動画像
データの生成が完了したものと判別し、音声データの出
力制御を終了する。また、負荷検出部１０６から出力制
御部１１０に符号化完了信号が入力されていなければ、
一連の音声付き動画像データの生成は完了していないも
のと判別し、ステップＳＴ１に戻って、ステップＳＴ１
からステップＳＴ６のデータ処理を繰り返す。At step ST6, it is determined whether or not the generation of a series of moving image data with sound has been completed. Here, when the encoding completion signal has been input from the load detection unit 106 to the output control unit 110, it is determined that the generation of a series of moving image data with audio has been completed, and the output control of the audio data ends. If the encoding completion signal has not been input from the load detection unit 106 to the output control unit 110,
It is determined that the generation of a series of moving image data with sound has not been completed, and the process returns to step ST1 to return to step ST1.
And the data processing of step ST6 is repeated.

【００７５】以上、実施の形態１に係るデータ生成装置
１では、１つのデータ生成部４０において、画像データ
の圧縮符号化と音声データの圧縮符号化とを時分割で行
うことによって、音声付き動画像データを生成する。し
たがって、音声付き動画像データを生成する回路の小型
化を図ることができるため、音声付き動画像データ生成
装置の小型軽量化を図ることができる。As described above, in the data generating apparatus 1 according to the first embodiment, one data generating unit 40 performs the compression encoding of the image data and the compression encoding of the audio data in a time-division manner, so that Generate image data. Therefore, the circuit for generating the moving image data with sound can be reduced in size, so that the size of the moving image data generating device with sound can be reduced in size and weight.

【００７６】また、データ生成部４０における画像圧縮
負荷が所定の閾値より大きい場合には、音声データをＳ
Ｗ回路１０９からデータ生成部４０に出力せず、一時記
憶データとしてメモリ１０７に一時的に記憶する。した
がって、画像圧縮負荷が高くなった場合にも音声データ
の消失を防ぐことができ、音声付き動画像データ生成装
置の小型軽量化を図りつつ、通信相手側での再生時にお
ける音声の途切れを抑制することができる。If the image compression load on the data generator 40 is larger than a predetermined threshold, the audio data is
It is not output from the W circuit 109 to the data generator 40 but is temporarily stored in the memory 107 as temporary storage data. Therefore, even if the image compression load is increased, the loss of audio data can be prevented, and the size of the moving image data generation device with audio can be reduced in size and weight, and the interruption of audio during reproduction on the communication partner side can be suppressed. can do.

【００７７】また、データ生成部４０における画像圧縮
負荷が所定の閾値より小さい場合には、メモリ１０７に
一時記憶データが記憶されていれば、ＳＷ回路１０９
が、メモリ１０７から一時記憶データを読み出してデー
タ生成部４０に出力する。そして、データ生成部４０に
おいて、その一時記憶データの圧縮符号化を行う。一
方、メモリ１０７に一時記憶データが記憶されていなけ
れば、ＳＷ回路１０９が、音声取得部２０で新たに取得
される音声データをデータ生成部４０に出力する。そし
て、データ生成部４０において、その音声データの圧縮
符号化を行う。したがって、画像データと音声データの
圧縮符号化を効率良く行うことができる。When the image compression load in the data generation unit 40 is smaller than a predetermined threshold, if the temporarily stored data is stored in the memory 107, the SW circuit 109
Reads the temporarily stored data from the memory 107 and outputs it to the data generator 40. Then, the data generation unit 40 performs compression encoding of the temporarily stored data. On the other hand, if the temporary storage data is not stored in the memory 107, the SW circuit 109 outputs the audio data newly acquired by the audio acquisition unit 20 to the data generation unit 40. Then, the data generator 40 performs compression encoding of the audio data. Therefore, compression encoding of image data and audio data can be performed efficiently.

【００７８】＜実施の形態２．＞＜２−１．データ生成装置２の機能ブロック＞図６は、
実施の形態２に係る音声付き動画像データを生成するデ
ータ生成装置２の機能ブロック図である。実施の形態２
に係るデータ生成装置２では、実施の形態１に係るデー
タ生成装置１におけるメモリ１０７、メモリ検出部１０
８、およびＳＷ回路１０９をＦＩＦＯ形式のスタックメ
モリ（以下「ＦＩＦＯ」と略称する）１１１に置換し、
出力制御部１１０を出力制御部１１０Ｆに置換して、音
声データ制御部３０の回路規模を小型化した構成となっ
ている。そして、その構成の変更に伴って、データ生成
装置１とは、音声データ制御部３０からデータ生成部４
０への音声データの出力制御フローが異なる。<Embodiment 2>><2-1. Functional Block of Data Generation Device 2> FIG.
FIG. 13 is a functional block diagram of a data generation device 2 that generates moving image data with sound according to the second embodiment. Embodiment 2
In the data generation device 2 according to the first embodiment, the memory 107 and the memory detection unit 10 in the data generation device 1 according to the first embodiment are used.
8, and the SW circuit 109 are replaced with a FIFO type stack memory (hereinafter abbreviated as “FIFO”) 111,
The output control unit 110 is replaced with an output control unit 110F, and the circuit scale of the audio data control unit 30 is reduced. Then, with the change of the configuration, the data generation device 1 and the data generation unit 4
The output control flow of audio data to 0 is different.

【００７９】なお、ここでは、ＦＩＦＯ１１１を除く音
声データ制御部３０の各部の機能は、専用の電子回路で
構成されて実現されていても良いし、ＣＰＵにおいてプ
ログラムによって実現されるようなものであっても良
い。Here, the function of each section of the audio data control section 30 except for the FIFO 111 may be realized by a dedicated electronic circuit or realized by a program in the CPU. May be.

【００８０】以下、実施の形態１に係るデータ生成装置
１と異なる点のみを説明し、データ生成装置１と同様で
ある部分には同様な符号を付す等して説明を省略する。Hereinafter, only points different from the data generating apparatus 1 according to the first embodiment will be described, and the same parts as those of the data generating apparatus 1 will be denoted by the same reference numerals and the description will be omitted.

【００８１】図６に示すように、音声取得部２０から出
力される音声データはＦＩＦＯ１１１に入力される。As shown in FIG. 6, audio data output from the audio acquisition unit 20 is input to the FIFO 111.

【００８２】ＦＩＦＯ１１１は、記憶手段として機能
し、音声取得部２０から入力される音声データを必ず一
時記憶データとして一時的に記憶し、出力制御部１１０
の制御にしたがって、記憶されている一時記憶データの
うち最も古い一時記憶データ、つまりは、ＦＩＦＯ１１
１に記憶された順に一時記憶データをデータ生成部４０
へ出力する。The FIFO 111 functions as a storage unit, and always stores the audio data input from the audio acquisition unit 20 temporarily as temporary storage data.
Of the stored temporary storage data, that is, the FIFO 11
1 in the order in which they are stored in the data generation unit 40.
Output to

【００８３】出力制御部１１０Ｆは、負荷検出部１０６
から入力される高負荷信号に基づいて、ＦＩＦＯ１１１
からデータ生成部４０への音声データの出力を制御す
る。The output control unit 110F is connected to the load detection unit 106
FIFO 111 based on the high load signal input from
The output of audio data to the data generation unit 40 is controlled.

【００８４】具体的には、出力制御部１１０Ｆがデータ
生成部４０の画像圧縮負荷が所定の閾値より大きい場合
には、出力制御部１１０ＦがＦＩＦＯ１１１からデータ
生成部４０へ音声データを出力させないように制御し
て、音声データを一時記憶データとしてＦＩＦＯ１１１
に一時的に記憶させた状態に保持する。More specifically, when the image compression load of the data generation unit 40 is larger than a predetermined threshold, the output control unit 110F prevents the output control unit 110F from outputting audio data from the FIFO 111 to the data generation unit 40. Controlling the audio data as temporary storage data in the FIFO 111
Is temporarily stored.

【００８５】一方、画像圧縮負荷が所定の閾値より小さ
い場合には、出力制御部１１０Ｆが、ＦＩＦＯ１１１か
ら記憶された順に一時記憶データをデータ生成部４０に
出力し、データ生成部４０において圧縮符号化を行うよ
うに制御する。On the other hand, if the image compression load is smaller than the predetermined threshold value, the output control unit 110F outputs the temporarily stored data to the data generation unit 40 in the order in which they were stored from the FIFO 111, and the data generation unit 40 performs compression encoding. Is controlled to be performed.

【００８６】＜２−２．データ生成装置２におけるデー
タ処理＞図７は、データ生成装置２における音声データ
の出力制御を説明するフローチャートである。なお、こ
こでは、出力制御部１１０Ｆによる制御に基づいて音声
データのデータ生成部４０への出力が制御される。<2-2. Data Processing in Data Generating Apparatus 2> FIG. 7 is a flowchart illustrating output control of audio data in the data generating apparatus 2. Here, the output of the audio data to the data generation unit 40 is controlled based on the control by the output control unit 110F.

【００８７】まず、実施の形態１と同様に、音声付き動
画像データを生成すべく、ユーザーの操作に基づいて、
画像取得部１０における画像データの取得と、音声取得
部２０における音声データの取得とを開始すると、出力
制御部１１０Ｆにおける音声データの出力制御が開始
し、ステップＳＴ２１に進む。First, in the same manner as in the first embodiment, in order to generate moving image data with audio, based on a user operation,
When the acquisition of the image data by the image acquisition unit 10 and the acquisition of the audio data by the audio acquisition unit 20 are started, the output control of the audio data by the output control unit 110F is started, and the process proceeds to step ST21.

【００８８】ステップＳＴ２１では、データ生成部４０
における圧縮符号化の負荷が高いか否かを判別する。こ
こでの判別は、１つの音声パケットを生成するための
０．５秒間の音声に相当する音声データがＦＩＦＯ１１
１に入力される直前ごとに実施される。具体的には、判
別時に負荷検出部１０６から高負荷信号が出力制御部１
１０Ｆに入力されていると、データ生成部４０における
圧縮符号化の負荷が高いと判別されて、ステップＳＴ２
２に進み、また、判別時に負荷検出部１０６から高負荷
信号が出力制御部１１０Ｆに入力されていなければ、デ
ータ生成部４０における圧縮符号化の負荷は低いと判別
されて、ステップＳＴ２３に進む。At step ST21, the data generation unit 40
It is determined whether or not the load of the compression encoding is high. Here, the audio data corresponding to the audio for 0.5 seconds for generating one audio packet is stored in the FIFO 11.
It is performed every time immediately before input to 1. Specifically, at the time of determination, a high load signal is output from the load detection unit 106 to the output control unit 1.
If the data is input to 10F, it is determined that the load of the compression encoding in the data generation unit 40 is high, and step ST2
If the high load signal is not input from the load detection unit 106 to the output control unit 110F at the time of the determination, the load of the compression encoding in the data generation unit 40 is determined to be low, and the process proceeds to step ST23.

【００８９】ステップＳＴ２２では、音声取得部２０か
らＦＩＦＯ１１１に入力されて一時的に記憶される音声
データである一時記憶データをデータ生成部４０に出力
せず、ステップＳＴ２４に進む。In step ST22, the temporary storage data, which is the audio data that is input from the audio acquisition unit 20 to the FIFO 111 and temporarily stored, is not output to the data generation unit 40, and the process proceeds to step ST24.

【００９０】ステップＳＴ２３では、音声取得部２０か
らＦＩＦＯ１１１に入力されて一時的に記憶される一時
記憶データのうち、最も古い一時記憶データをデータ生
成部４０へ出力するように設定する。なお、ステップＳ
Ｔ２３の出力設定ごとに出力されるように設定される一
時記憶データは、１つの音声パケットを構成するための
０．５秒間の音声に相当する音声データにあたる。In step ST23, the oldest temporary storage data among the temporary storage data input from the voice acquisition unit 20 to the FIFO 111 and temporarily stored is set to be output to the data generation unit 40. Step S
The temporary storage data set to be output for each output setting of T23 corresponds to audio data corresponding to 0.5 seconds of audio for forming one audio packet.

【００９１】ステップＳＴ２４では、一連の音声付き動
画像データの生成が完了したか否かについて判別する。
ここでは、負荷検出部１０６から出力制御部１１０Ｆに
符号化完了信号が入力されていると、一連の音声付き動
画像データの生成が完了したものと判別し、音声データ
の出力制御を終了する。また、負荷検出部１０６から出
力制御部１１０Ｆに符号化完了信号が入力されていなけ
れば、一連の音声付き動画像データの生成は完了してい
ないものと判別し、ステップＳＴ２１に戻って、ステッ
プＳＴ２１からステップＳＴ２４のデータ処理を繰り返
す。In step ST24, it is determined whether or not the generation of a series of moving image data with sound has been completed.
Here, when the encoding completion signal is input from the load detection unit 106 to the output control unit 110F, it is determined that the generation of a series of moving image data with sound has been completed, and the output control of the sound data ends. If the encoding completion signal has not been input from the load detection unit 106 to the output control unit 110F, it is determined that the generation of a series of moving image data with sound has not been completed, and the process returns to step ST21 and returns to step ST21. And the data processing of step ST24 is repeated.

【００９２】以上、実施の形態２に係るデータ生成装置
２では、音声取得部２０で取得した音声データを必ずＦ
ＩＦＯ１１１に一時記憶データとして一時的に記憶す
る。そして、データ生成部４０における画像圧縮負荷を
所定の閾値と比較し、それが閾値より小さい場合に、出
力制御部１１０Ｆの制御の下で、ＦＩＦＯ１１１から記
憶された順に一時記憶データを読み出して、データ生成
部４０において、一時記憶データの圧縮符号化が行われ
る。したがって、音声データ制御部３０の回路規模の小
型化により、音声付き動画像データ生成装置の小型軽量
化をさらに図ることができる。また、音声データの消失
を防ぐことができ、通信相手側での再生時における音声
の途切れを抑制することもできる。As described above, in the data generating apparatus 2 according to the second embodiment, the audio data acquired by the audio
The data is temporarily stored in the IFO 111 as temporary storage data. Then, the image compression load in the data generation unit 40 is compared with a predetermined threshold value. If the load value is smaller than the threshold value, the temporary storage data is read out from the FIFO 111 in the order stored from the FIFO 111 under the control of the output control unit 110F. In the generation unit 40, compression encoding of the temporarily stored data is performed. Therefore, the downsizing of the circuit scale of the audio data control unit 30 can further reduce the size and weight of the moving image data generation device with audio. Further, loss of audio data can be prevented, and interruption of audio during reproduction on the communication partner side can be suppressed.

【００９３】＜実施の形態３．＞図８は、実施の形態３
に係る音声付き動画像データを生成するデータ生成装置
３の機能ブロック図である。実施の形態１に係るデータ
生成装置１においては、データ生成部４０の画像圧縮負
荷を負荷検出部１０６がデータ生成部４０から直接検出
していたが、実施の形態３に係るデータ生成装置３にお
いては、画像取得部１０から画像データに関する情報を
取得して、データ生成部４０の画像圧縮負荷を事前に予
測する。<Embodiment 3> FIG. 8 shows the third embodiment.
FIG. 2 is a functional block diagram of a data generation device 3 that generates moving image data with sound according to the first embodiment. In the data generation device 1 according to the first embodiment, the load detection unit 106 directly detects the image compression load of the data generation unit 40 from the data generation unit 40. However, in the data generation device 3 according to the third embodiment, Acquires information about image data from the image acquisition unit 10 and predicts the image compression load of the data generation unit 40 in advance.

【００９４】以下、実施の形態１に係るデータ生成装置
１と異なる点のみを説明し、データ生成装置１と同様で
ある部分には同様な符号を付す等して説明を省略する。Hereinafter, only points different from the data generating apparatus 1 according to the first embodiment will be described, and the same parts as those of the data generating apparatus 1 will be denoted by the same reference numerals and the description thereof will be omitted.

【００９５】図８に示すように、画像取得部１０のＡ／
Ｄ変換器１０２は、実施の形態１と同様にデータ生成部
４０に画像データを出力するとともに、同じ画像データ
を負荷検出部１０６Ｐにも出力する。As shown in FIG. 8, the A /
The D converter 102 outputs image data to the data generator 40 as in the first embodiment, and also outputs the same image data to the load detector 106P.

【００９６】負荷検出部１０６Ｐは、Ａ／Ｄ変換器１０
２から画像データを取得し、データ生成部４０における
ＭＰＥＧ１やＭＰＥＧ２といった圧縮符号化規則に基づ
く圧縮符号化と同様に、動き補償フレーム間予測を行
う。そして、圧縮符号化の対象となるフレーム間の差分
を求めることによって、差分を求める対象となった画像
データがデータ生成部４０において圧縮符号化される際
の画像圧縮負荷を予測する。つまり、負荷検出部１０６
Ｐは、事前に画像圧縮負荷を予測的に検出する負荷検出
手段（負荷予測手段）を構成するものであり、予測的に
検出した画像圧縮負荷と所定の閾値とを比較して、その
画像圧縮負荷が所定の閾値より大きくなる場合は、その
旨を示す高負荷信号を出力制御部１１０Ｐに出力する。The load detector 106P is provided with the A / D converter 10
2, the motion compensation inter-frame prediction is performed in the same manner as the compression encoding based on the compression encoding rules such as MPEG1 and MPEG2 in the data generation unit 40. Then, by calculating the difference between the frames to be subjected to the compression encoding, the image compression load when the image data for which the difference is to be calculated is compression-encoded by the data generation unit 40 is predicted. That is, the load detection unit 106
P constitutes a load detecting means (load estimating means) for predictively detecting the image compression load in advance, and compares the image compression load detected predictively with a predetermined threshold value to determine the image compression load. When the load becomes larger than the predetermined threshold, a high load signal indicating that fact is output to the output control unit 110P.

【００９７】また、出力制御部１１０Ｐは、実施の形態
１における出力制御部１１０とほぼ同様のものである。
また、データ生成装置３における音声データのデータ生
成部４０への出力制御フローに関するフローチャートつ
いても、図５に示すデータ生成装置１における音声デー
タの出力制御を説明するフローチャートとほぼ同様にな
る。The output control section 110P is substantially the same as the output control section 110 in the first embodiment.
Also, the flowchart regarding the output control flow of the audio data to the data generation unit 40 in the data generation device 3 is substantially the same as the flowchart for explaining the output control of the audio data in the data generation device 1 shown in FIG.

【００９８】しかし、図５に示すフローチャートにおけ
るステップＳＴ６における判別では、負荷検出部１０６
から出力制御部１１０に符号化完了信号が入力されたか
否かで一連の音声付き動画像データの生成が完了したか
否かを判別したが、ここでは、メモリ１０７に音声デー
タが記憶されていないためにメモリ検出部１０８からデ
ータ有り信号が入力されず、かつ、ユーザーの操作に基
づいて音声取得部２０で音声データが取得されない場合
に、出力制御部１１０Ｐが、一連の音声付き動画像デー
タの生成が完了したものと判別する。However, in the determination in step ST6 in the flowchart shown in FIG.
It is determined whether or not the generation of a series of moving image data with sound has been completed based on whether or not the encoding completion signal has been input to the output control unit 110 from here, but here, no sound data is stored in the memory 107. Therefore, when the data presence signal is not input from the memory detection unit 108 and the audio data is not acquired by the audio acquisition unit 20 based on the operation of the user, the output control unit 110P outputs a series of moving image data with audio. It is determined that the generation has been completed.

【００９９】なお、ここでも、実施の形態１と同様に、
メモリ１０７を除く音声データ制御部３０の各部の機能
は、専用の電子回路で構成されて実現されていても良い
し、ＣＰＵにおいてプログラムによって実現されるよう
なものであっても良い。Here, as in the first embodiment,
The functions of each unit of the audio data control unit 30 except for the memory 107 may be realized by being constituted by a dedicated electronic circuit, or may be realized by a program in a CPU.

【０１００】以上、実施の形態３に係るデータ生成装置
３では、負荷検出部１０６Ｐが、画像取得部１０のＡ／
Ｄ変換器１０２から画像データに関する情報として画像
データを取得する。そして、負荷検出部１０６Ｐが、そ
の画像データがデータ生成部４０において圧縮符号化さ
れる際の画像圧縮負荷を予測し、事前に画像圧縮負荷が
所定の閾値より大きくなるか否かを検出する。したがっ
て、データ生成部４０における画像圧縮負荷が高くなる
際には、遅滞なくメモリ１０７に音声データを一時記憶
データとして一時的に記憶させることができるため、通
信相手側での再生時における音声の途切れを抑制するた
めの制御を早急に行うことができ、音声の途切れをさら
に低減することができる。As described above, in the data generation device 3 according to the third embodiment, the load detection unit 106P
Image data is acquired from the D converter 102 as information relating to the image data. Then, the load detection unit 106P predicts an image compression load when the image data is compression-encoded in the data generation unit 40, and detects in advance whether the image compression load becomes larger than a predetermined threshold. Therefore, when the image compression load on the data generation unit 40 becomes high, the audio data can be temporarily stored in the memory 107 as the temporary storage data without delay, so that the audio is interrupted during reproduction on the communication partner side. The control for suppressing the noise can be promptly performed, and the interruption of the sound can be further reduced.

【０１０１】＜実施の形態４．＞＜４−１．データ生成装置４の機能ブロック＞図９は、
実施の形態４に係る音声付き動画像データを生成するデ
ータ生成装置４の機能ブロック図である。実施の形態１
に係るデータ生成装置１では、音声データのデータ生成
部４０への出力制御を行い、データ生成部４０における
画像圧縮負荷が高い場合は、音声データをメモリ１０７
に一時的に記憶することで、音声データの消失を防ぎ、
回路規模の小型化を図りつつ、通信相手側の再生時にお
ける音声の途切れを抑制していたが、実施の形態４に係
るデータ生成装置４では、音声データのデータ生成部４
０への出力制御を行う音声データ制御部３０を設けず、
データ生成部４０における画像圧縮負荷が高い場合は、
データ生成部４０での画像データの圧縮符号化における
圧縮率を所定の基準圧縮率よりも高めることによって、
回路規模の小型化を図りつつ、音声データの消失を防い
でいる。<Embodiment 4>><4-1. Functional Block of Data Generation Apparatus 4> FIG.
FIG. 14 is a functional block diagram of a data generation device 4 that generates moving image data with sound according to Embodiment 4. Embodiment 1
In the data generation device 1 according to the above, the output control of the audio data to the data generation unit 40 is performed, and when the image compression load on the data generation unit 40 is high, the audio data is
By temporarily storing the audio data,
Although the interruption of audio during reproduction on the communication partner side was suppressed while miniaturizing the circuit scale, the data generation device 4 according to the fourth embodiment uses the data generation unit 4 for audio data.
Without providing the audio data control unit 30 for controlling the output to 0,
When the image compression load on the data generation unit 40 is high,
By increasing the compression ratio in the compression encoding of the image data in the data generation unit 40 from a predetermined reference compression ratio,
The circuit size is reduced while preventing the loss of audio data.

【０１０２】以下、実施の形態１に係るデータ生成装置
１と異なる点のみを説明し、データ生成装置１と同様で
ある部分には同様な符号を付する等して説明を省略す
る。Hereinafter, only points different from the data generating apparatus 1 according to the first embodiment will be described, and the same parts as those of the data generating apparatus 1 will be denoted by the same reference numerals and the description thereof will be omitted.

【０１０３】図９に示すように、音声取得部２０から出
力される音声データはデータ生成部４０に直接入力さ
れ、圧縮符号化のタイミングを調節するためのＲＡＭに
一時的に記憶される。そして、データ生成部４０におい
て、図３（ａ）に示したようなタイミングで画像データ
と音声データの圧縮符号化が行われる。As shown in FIG. 9, audio data output from the audio acquisition unit 20 is directly input to the data generation unit 40, and is temporarily stored in a RAM for adjusting the timing of compression encoding. Then, the data generation unit 40 performs the compression encoding of the image data and the audio data at the timing shown in FIG.

【０１０４】負荷検出部１０６Ｄは、実施の形態１の負
荷検出部１０６と同様に、データ生成部４０における画
像圧縮負荷と所定の閾値とを比較する負荷検出手段を構
成するものであり、画像圧縮負荷が所定の閾値より大き
い場合は、その旨を示す高負荷信号を圧縮率制御部１１
４に出力する。The load detecting section 106D constitutes a load detecting means for comparing the image compression load in the data generating section 40 with a predetermined threshold value, similarly to the load detecting section 106 of the first embodiment. If the load is larger than the predetermined threshold, a high load signal indicating that fact is sent to the compression ratio controller 11.
4 is output.

【０１０５】また、負荷検出部１０６Ｄは、実施の形態
１の負荷検出部１０６と同様に、データ生成部４０にお
ける画像データおよび音声データの圧縮符号化を開始
後、圧縮負荷を一定時間全く検出しない場合には、一連
の音声付き動画像データを生成するための画像データと
音声データの圧縮符号化が完了した旨を示す符号化完了
信号を圧縮率制御部１１４に出力する。Further, similarly to the load detection unit 106 of the first embodiment, the load detection unit 106D does not detect any compression load for a certain period of time after the data generation unit 40 starts compression encoding of image data and audio data. In this case, an encoding completion signal indicating that the compression encoding of the video data and audio data for generating a series of moving image data with audio is completed is output to the compression ratio control unit 114.

【０１０６】圧縮率制御部１１４は、負荷検出部１０６
Ｄから入力される高負荷信号に基づいて、データ生成部
４０での画像データの圧縮符号化における圧縮率の切り
換えを制御する。The compression ratio control unit 114 is
Based on the high load signal input from D, switching of the compression ratio in the compression encoding of the image data in the data generation unit 40 is controlled.

【０１０７】具体的には、データ生成部４０の画像圧縮
負荷が所定の閾値より大きい場合には、負荷検出部１０
６Ｄから圧縮率制御部１１４に高負荷信号が入力され、
圧縮率制御部１１４がデータ生成部４０での圧縮符号化
における圧縮率を所定の基準圧縮率よりも高めるように
制御し、また、データ生成部４０の画像圧縮負荷が所定
の閾値より小さい場合には、負荷検出部１０６Ｄから圧
縮率制御部１１４に高負荷信号が入力されないため、圧
縮率制御部１１４がデータ生成部４０での圧縮符号化に
おける圧縮率を高める制御を行わない。Specifically, when the image compression load of the data generation unit 40 is larger than a predetermined threshold, the load detection unit 10
A high load signal is input from 6D to the compression ratio control unit 114,
The compression rate control unit 114 controls the compression rate in the compression encoding in the data generation unit 40 to be higher than a predetermined reference compression rate, and when the image compression load of the data generation unit 40 is smaller than a predetermined threshold value. Since the high load signal is not input from the load detection unit 106D to the compression ratio control unit 114, the compression ratio control unit 114 does not perform control to increase the compression ratio in the compression encoding in the data generation unit 40.

【０１０８】また、圧縮率制御部１１４は、負荷検出部
１０６Ｄから符号化完了信号が入力されると、圧縮率の
切り換えの制御を終了する。When the encoding completion signal is input from load detection section 106D, compression rate control section 114 ends the control of switching the compression rate.

【０１０９】次に、データ生成部４０での圧縮符号化に
おける圧縮率の切り換えの内容と効果についてさらに具
体的に説明する。Next, the contents and effects of switching the compression ratio in the compression encoding in the data generation section 40 will be described more specifically.

【０１１０】例えば、圧縮率が高くない所定の基準圧縮
率の場合には、データ生成部４０において、画像取得部
１０から入力される画像データの全画素を圧縮符号化の
対象とする。また、圧縮率が所定の基準圧縮率よりも高
い場合には、データ生成部４０において、画像取得部１
０から入力される画像データの全画素データのうち画素
のマトリクス配列において水平・垂直方向にそれぞれ１
／２ずつ間引いた画素データを圧縮符号化の対象とする
ようにする。For example, in the case where the compression ratio is a predetermined reference compression ratio that is not high, the data generation unit 40 sets all the pixels of the image data input from the image acquisition unit 10 as compression encoding targets. If the compression ratio is higher than a predetermined reference compression ratio, the data generation unit 40 causes the image acquisition unit 1
0 in the horizontal and vertical directions in the pixel matrix arrangement of all the pixel data of the input image data.
Pixel data decimated by / 2 is to be subjected to compression encoding.

【０１１１】そして、上述のように圧縮率を高めること
によって、画像の質は低下するものの、圧縮符号化の対
象となる画像データが少なくなるため、データ生成部４
０における画像データの圧縮符号化の処理スピードを高
めることができる。As the compression rate is increased as described above, the quality of the image is reduced, but the amount of image data to be compression-coded is reduced.
0 can increase the processing speed of the compression encoding of the image data.

【０１１２】したがって、被写体の動きが著しく大きな
場合等は、データ生成部４０における画像データの圧縮
符号化にかかる負荷が大きくなり、画像データの圧縮符
号化に要する時間が長くなる傾向となるが、データ生成
部４０での画像データの圧縮符号化にかかる負荷が高い
場合には、データ生成部４０での画像データの圧縮符号
化における圧縮率を高め、図３（ｂ）に示すように、画
像データの圧縮符号化の時間帯が長くなり過ぎるのを防
ぐため、画像の質は低下するが、音声データの消失を防
ぐことができる。Therefore, when the movement of the subject is extremely large, the load on the data generator 40 for compressing and encoding the image data increases, and the time required for compressing and encoding the image data tends to increase. When the load on the compression encoding of the image data in the data generation unit 40 is high, the compression rate in the compression encoding of the image data in the data generation unit 40 is increased, and as shown in FIG. In order to prevent the time zone of data compression and encoding from becoming too long, the quality of the image is reduced, but the loss of audio data can be prevented.

【０１１３】なお、ここでは、負荷検出部１０６Ｄおよ
び圧縮率制御部１１４の機能は、専用の電子回路で構成
されて実現されていても良いし、それぞれ別個の、また
は、１つのＣＰＵにおいてプログラムによって実現され
るようなものであっても良い。さらに、ここでは、デー
タ生成部４０、負荷検出部１０６Ｄ、および圧縮率制御
部１１４の機能が、１つのＣＰＵにおいてプログラムに
よって実現されるようなものであっても良い。Here, the functions of the load detection unit 106D and the compression ratio control unit 114 may be implemented by a dedicated electronic circuit, or may be realized by separate or one CPU by a program. What is realized may be sufficient. Further, here, the functions of the data generation unit 40, the load detection unit 106D, and the compression ratio control unit 114 may be realized by a single CPU by a program.

【０１１４】＜４−２．データ生成装置４における圧縮
率の制御フロー＞図１０は、圧縮率制御部１１４の制御
に基づく、データ生成装置４の圧縮率の制御フローを示
すフローチャートである。<4-2. FIG. 10 is a flowchart showing a control flow of the compression ratio of the data generation device 4 based on the control of the compression ratio control unit 114.

【０１１５】まず、実施の形態１と同様に、音声付き動
画像データを生成すべく、ユーザーの操作に基づいて、
画像取得部１０における画像データの取得と、音声取得
部２０における音声データの取得とを開始すると、圧縮
率制御部１１４における圧縮率の制御が開始し、ステッ
プＳＴ４１に進む。First, in the same manner as in the first embodiment, in order to generate moving image data with sound, based on a user operation,
When the acquisition of image data by the image acquisition unit 10 and the acquisition of audio data by the audio acquisition unit 20 are started, control of the compression ratio by the compression ratio control unit 114 is started, and the process proceeds to step ST41.

【０１１６】ステップＳＴ４１では、データ生成部４０
における圧縮符号化の負荷が高いか否かを判別する。こ
こでは、例えば、０．１秒間隔でステップＳＴ４１の判
別を行い、判別時に負荷検出部１０６Ｄから高負荷信号
が圧縮率制御部１１４に入力されていると、データ生成
部４０における圧縮符号化の負荷が高いと判別されて、
ステップＳＴ４２に進み、また、判別時に負荷検出部１
０６Ｄから高負荷信号が圧縮率制御部１１４に入力され
ていなければ、データ生成部４０における圧縮符号化の
負荷は低いと判別されて、ステップＳＴ４３に進む。In step ST41, the data generation unit 40
It is determined whether or not the load of the compression encoding is high. Here, for example, the determination in step ST41 is performed at 0.1 second intervals, and when a high load signal is input from the load detection unit 106D to the compression ratio control unit 114 at the time of the determination, the compression encoding in the data generation unit 40 is performed. It is determined that the load is high,
The process proceeds to step ST42, and the load detector 1
If a high-load signal has not been input to the compression rate control unit 114 from 06D, it is determined that the load of compression encoding in the data generation unit 40 is low, and the process proceeds to step ST43.

【０１１７】ステップＳＴ４２では、データ生成部４０
での画像データの圧縮符号化における圧縮率を基準圧縮
率よりも高く設定し、ステップＳＴ４４に進む。In step ST42, the data generation unit 40
In step ST44, the compression ratio in the compression encoding of the image data is set higher than the reference compression ratio.

【０１１８】ステップＳＴ４３では、データ生成部４０
での画像データの圧縮符号化における圧縮率を基準圧縮
率に相当する低い圧縮率に設定し、ステップＳＴ４４に
進む。In step ST43, the data generation unit 40
Is set to a low compression ratio corresponding to the reference compression ratio, and the process proceeds to step ST44.

【０１１９】ステップＳＴ４４では、一連の音声付き動
画像データの生成が完了したか否かについて判別する。
ここでは、負荷検出部１０６Ｄから圧縮率制御部１１４
に符号化完了信号が入力されていると、一連の音声付き
動画像データの生成が完了したものと判別し、圧縮率の
制御を終了する。また、負荷検出部１０６Ｄから圧縮率
制御部１１４に符号化完了信号が入力されていなけれ
ば、一連の音声付き動画像データの生成は完了していな
いものと判別し、ステップＳＴ４１に戻って、ステップ
ＳＴ４１からステップＳＴ４４のデータ処理を繰り返
す。In step ST44, it is determined whether or not a series of generation of moving image data with sound has been completed.
Here, from the load detection unit 106D to the compression ratio control unit 114
When the encoding completion signal has been input to the CPU, it is determined that the generation of a series of moving image data with sound has been completed, and the control of the compression ratio ends. If the encoding completion signal has not been input from the load detection unit 106D to the compression ratio control unit 114, it is determined that the generation of a series of moving image data with sound has not been completed, and the process returns to step ST41. The data processing from ST41 to ST44 is repeated.

【０１２０】以上、実施の形態４に係るデータ生成装置
４では、データ生成部４０における画像圧縮負荷が所定
の閾値よりも大きい場合には、圧縮率制御部１１４が、
データ生成部４０での画像データの圧縮符号化における
圧縮率を所定の基準圧縮率よりも高める。したがって、
音声データを一時的に記憶するための記憶手段等を設け
なくとも、音声データの消失を防ぐことができるため、
音声付き動画像データ生成装置の小型軽量化をさらに図
りつつ、通信相手側での再生時における音声の途切れを
抑制することができる。As described above, in the data generation device 4 according to the fourth embodiment, when the image compression load on the data generation unit 40 is larger than a predetermined threshold, the compression ratio control unit 114
The compression ratio in the compression encoding of the image data in the data generation unit 40 is made higher than a predetermined reference compression ratio. Therefore,
Even without providing a storage means for temporarily storing the audio data, it is possible to prevent the loss of the audio data,
It is possible to further reduce the size and weight of the moving image data generating apparatus with sound and suppress interruption of sound during reproduction on the communication partner side.

【０１２１】＜実施の形態５．＞図１１は、実施の形態
５に係る音声付き動画像データを生成するデータ生成装
置５の機能ブロック図である。実施の形態４に係るデー
タ生成装置４では、データ生成部４０における画像圧縮
負荷が高い場合は、圧縮率制御部１１４の制御の下で、
データ生成部４０での画像データの圧縮符号化における
圧縮率を高めていたが、実施の形態５に係るデータ生成
装置５では、データ生成部４０における画像圧縮負荷が
高くなるような使用状況に応じて、ユーザーがデータ生
成部４０での画像データの圧縮符号化における圧縮率を
基準圧縮率よりも高めるように設定する。<Embodiment 5> FIG. 11 is a functional block diagram of a data generation device 5 for generating moving image data with sound according to the fifth embodiment. In the data generation device 4 according to the fourth embodiment, when the image compression load on the data generation unit 40 is high, under the control of the compression ratio control unit 114,
Although the compression rate in the compression encoding of the image data in the data generation unit 40 has been increased, the data generation device 5 according to the fifth embodiment responds to a use situation where the image compression load on the data generation unit 40 increases. Then, the user sets the compression ratio in the compression encoding of the image data in the data generation unit 40 to be higher than the reference compression ratio.

【０１２２】以下、実施の形態４に係るデータ生成装置
４と異なる点のみを説明し、データ生成装置４と同様で
ある部分には同様な符号を付す等して説明を省略する。Hereinafter, only points different from the data generating apparatus 4 according to the fourth embodiment will be described, and the same parts as those of the data generating apparatus 4 will be denoted by the same reference numerals and the description thereof will be omitted.

【０１２３】図１１に示すように、データ生成装置５の
外部にユーザーが種々の操作を行うことによって、デー
タ生成部４０での画像データの圧縮符号化における圧縮
率を変更することができる操作部１１５が設けられてい
る。As shown in FIG. 11, the user can perform various operations outside the data generator 5 to change the compression ratio in the compression encoding of image data in the data generator 40. 115 are provided.

【０１２４】操作部１１５では、ユーザーは種々の操作
を行うことによって、被写体の動きの小さな場面を撮影
する”ＴＶ電話モード”と、被写体の動きの大きな場面
を撮影する”スポーツモード”の２つの撮影モードを選
択することができ、操作部１１５で選択された撮影モー
ドを特定するための信号（以下、「モード特定信号」と
称す）を圧縮率制御部１１４Ｙに出力する。The operation section 115 allows the user to perform various operations to obtain a "TV phone mode" for photographing a scene with a small movement of the subject, and a "sports mode" for photographing a scene with a large movement of the subject. A shooting mode can be selected, and a signal for specifying the shooting mode selected by the operation unit 115 (hereinafter, referred to as a “mode specifying signal”) is output to the compression ratio control unit 114Y.

【０１２５】圧縮率制御部１１４Ｙは、操作部１１５か
ら入力されるモード特定信号に基づいて、データ生成部
４０での画像データの圧縮符号化における圧縮率の切り
換えを制御する。The compression ratio control unit 114Y controls switching of the compression ratio in the compression coding of the image data in the data generation unit 40 based on the mode specifying signal input from the operation unit 115.

【０１２６】具体的には、ユーザーが操作部１１５を操
作して、”スポーツモード”が撮影モードとして選択さ
れると、操作部１１５から”スポーツモード”が撮影モ
ードとして選択された旨を示すモード特定信号が圧縮率
制御部１１４Ｙに出力される。このとき、圧縮率制御部
１１４Ｙがデータ生成部４０での画像データの圧縮符号
化における圧縮率を高めるように制御する。つまり、ユ
ーザーの操作ないしは選択に基づいて、圧縮率制御部１
１４Ｙが、データ生成部４０での画像データの圧縮符号
化における圧縮率を基準圧縮率よりも高くするように制
御する。More specifically, when the user operates the operation unit 115 to select “sports mode” as the shooting mode, the operation unit 115 indicates that “sports mode” is selected as the shooting mode. The specific signal is output to compression ratio control section 114Y. At this time, the compression ratio control unit 114Y controls to increase the compression ratio in the compression encoding of the image data in the data generation unit 40. That is, the compression ratio control unit 1 is operated based on a user operation or selection.
14Y controls the compression ratio in the compression encoding of the image data in the data generation unit 40 to be higher than the reference compression ratio.

【０１２７】一方、ユーザーが操作部１１５を操作し
て、”ＴＶ電話モード”が撮影モードとして選択される
と、操作部１１５から”ＴＶ電話モード”が撮影モード
として選択された旨を示すモード特定信号が圧縮率制御
部１１４Ｙに出力される。このときは、圧縮率制御部１
１４Ｙがデータ生成部４０での画像データの圧縮符号化
における圧縮率を高める制御を行わない。On the other hand, when the user operates the operation unit 115 to select “TV phone mode” as the shooting mode, the mode identification indicating that “TV phone mode” is selected as the shooting mode from the operation unit 115 The signal is output to compression ratio control section 114Y. At this time, the compression ratio control unit 1
14Y does not perform control to increase the compression ratio in the compression encoding of image data in the data generation unit 40.

【０１２８】なお、ここでは、圧縮率制御部１１４Ｙの
機能は、専用の電子回路で構成されて実現されていても
良いし、ＣＰＵにおいてプログラムによって実現される
ようなものであっても良い。さらに、ここでは、データ
生成部４０、および圧縮率制御部１１４Ｙの機能が、１
つのＣＰＵにおいてプログラムによって実現されるよう
なものであっても良い。Here, the function of the compression ratio control section 114Y may be realized by a dedicated electronic circuit, or may be realized by a program in a CPU. Further, here, the functions of the data generation unit 40 and the compression ratio control unit 114Y are 1
It may be realized by a program in one CPU.

【０１２９】また、データ生成部４０での圧縮符号化に
おける圧縮率の切り換えの内容と効果については、実施
の形態４に係るデータ生成装置４と同様となるため、こ
こでは、説明を省略する。The content and effect of switching the compression ratio in the compression encoding in the data generation unit 40 are the same as those of the data generation device 4 according to the fourth embodiment, and therefore the description is omitted here.

【０１３０】以上、実施の形態５に係るデータ生成装置
５では、ユーザーの操作に基づいて、圧縮率制御部１１
４Ｙがデータ生成部４０での画像データの圧縮符号化に
おける圧縮率を所定の基準圧縮率よりも高める。つま
り、データ生成部４０における画像圧縮負荷が高くなる
前に、データ生成部４０での画像データの圧縮符号化に
おける圧縮率を高めることによって、音声データの消失
を防ぐ。したがって、使用状況に合わせた設定をユーザ
ーが事前に行うことによって、通信相手側での再生時に
おける音声の途切れを確実に回避することができる。ま
た、回路規模のさらなる小型化による音声付き動画像デ
ータ生成装置の小型軽量化を図ることができる。As described above, in the data generating apparatus 5 according to the fifth embodiment, the compression rate control unit 11
4Y increases the compression ratio in the compression encoding of the image data in the data generation unit 40 from a predetermined reference compression ratio. That is, loss of audio data is prevented by increasing the compression ratio in the compression encoding of image data in the data generation unit 40 before the image compression load in the data generation unit 40 increases. Therefore, by performing the setting in advance according to the use situation by the user, it is possible to reliably avoid the interruption of the sound during the reproduction on the communication partner side. Further, the size of the moving image data generation device with sound can be reduced in size and weight by further reducing the circuit size.

【０１３１】なお、ここでは、ユーザーの設定に基づい
て、使用状況に合わせて、通信相手側での再生時におけ
る音声の途切れを確実に回避することができるが、ユー
ザーの設定に基づいて一律に画像と音声の質が決定され
るため、例えば、”スポーツモード”が撮影モードとし
て設定されている状態で、被写体の動きが小さくなって
も、圧縮率は高い状態のまま保持されるため、常に画像
の質はある程度良好でない状態となってしまう。In this case, it is possible to reliably avoid interruption of the sound at the time of reproduction on the communication partner side in accordance with the use condition based on the user's setting. Since the quality of the image and the sound is determined, for example, in a state where the “sports mode” is set as the shooting mode, even if the movement of the subject becomes small, the compression ratio is maintained in a high state. The quality of the image is not good to some extent.

【０１３２】したがって、画像の質をなるべく確保する
といった観点からすると、被写体の動きに合わせて圧縮
率が切り換えられる実施の形態４に係るデータ生成装置
４の方が優れていると言えるが、音声の途切れを確実に
回避するといった観点からすると、データ生成部４０に
おける画像圧縮負荷が実際に高くなってから圧縮率を切
り換える実施の形態４に係るデータ生成装置４よりも、
実施の形態５に係るデータ生成装置５の方が優れている
と言える。Therefore, from the viewpoint of ensuring the quality of the image as much as possible, it can be said that the data generation device 4 according to the fourth embodiment, in which the compression ratio is switched according to the movement of the subject, is superior, From the viewpoint of reliably avoiding interruption, the data generation unit 40 can switch the compression ratio after the image compression load actually increases, compared to the data generation device 4 according to the fourth embodiment.
It can be said that the data generation device 5 according to the fifth embodiment is superior.

【０１３３】＜変形例．＞以上、この発明の実施の形態
について説明したが、この発明は上記説明した内容のも
のに限定されるものではない。<Modifications> The embodiments of the present invention have been described above, but the present invention is not limited to the above-described contents.

【０１３４】◎例えば、上述した実施の形態１から３で
は、メモリ１０７やＦＩＦＯ１１１を除く音声データ制
御部３０の機能と、データ生成部４０の機能とは別々の
構成によって実現されていたが、これに限られるもので
はなく、メモリ１０７やＦＩＦＯ１１１を除く音声デー
タ制御部３０の機能は、データ生成部４０内のＣＰＵに
おいてプログラムによって実現されるようなものであっ
ても良い。For example, in the above-described first to third embodiments, the function of the audio data control unit 30 except for the memory 107 and the FIFO 111 and the function of the data generation unit 40 are realized by separate configurations. The functions of the audio data control unit 30 except for the memory 107 and the FIFO 111 may be realized by a program in the CPU in the data generation unit 40.

【０１３５】◎また、上述した実施の形態２では、負荷
検出部１０６がデータ生成部４０から圧縮負荷を直接検
出していたが、これに限られるものではなく、実施の形
態３に係るデータ生成装置３の負荷検出部１０６Ｐのよ
うに、画像取得部１０のＡ／Ｄ変換器１０２から画像デ
ータを取得して、画像圧縮負荷を予測的に検出するよう
なものであっても良い。In the second embodiment, the load detector 106 directly detects the compression load from the data generator 40. However, the present invention is not limited to this. Like the load detection unit 106P of the device 3, image data may be acquired from the A / D converter 102 of the image acquisition unit 10 to predictably detect the image compression load.

【０１３６】◎また、上述した実施の形態４では、負荷
検出部１０６Ｄがデータ生成部４０から圧縮負荷を直接
検出していたが、これに限られるものではなく、実施の
形態３に係るデータ生成装置３の負荷検出部１０６Ｐの
ように、画像取得部１０のＡ／Ｄ変換器１０２から画像
データを取得して、画像圧縮負荷を予測的に検出するよ
うなものであっても良い。In the fourth embodiment, the load detector 106D directly detects the compression load from the data generator 40. However, the present invention is not limited to this. Like the load detection unit 106P of the device 3, image data may be acquired from the A / D converter 102 of the image acquisition unit 10 to predictably detect the image compression load.

【０１３７】◎また、上述した実施の形態１から３で
は、一度、データ生成部４０における画像圧縮負荷が高
くなると、音声データを一時記憶データとして一時的に
記憶して、画像圧縮負荷が低くなったときに、一時記憶
データの圧縮符号化をするため、一連の音声付き動画像
データの生成中に、複数回にわたって画像圧縮負荷が高
くなると、複数回にわたって音声データの圧縮符号化が
累積して後回しとなり、音声データの送信が遅くなり過
ぎてしまうが、これに限られるものではなく、画像圧縮
負荷が低くなったときに、圧縮符号化が累積して後回し
となった音声データの圧縮符号化をできるだけまとめて
行うようなものであっても良い。このような構成とする
ことで、音声データの送信が遅くなり過ぎる状態を抑制
することができる。In the above-described first to third embodiments, once the image compression load on the data generation unit 40 increases, the audio data is temporarily stored as temporary storage data, and the image compression load decreases. When the image compression load increases several times during the generation of a series of moving image data with audio, the compression encoding of the audio data is accumulated multiple times. It is postponed, and the transmission of audio data will be too slow, but this is not limited to this, and when the image compression load becomes low, compression encoding of audio data that is postponed due to accumulation of compression encoding May be performed collectively as much as possible. With such a configuration, it is possible to suppress a state in which transmission of audio data is too slow.

【０１３８】◎また、上述した実施の形態では、１つの
画像パケットおよび音声パケットを生成するための画像
データおよび音声データが０．５秒間に取得されるもの
であったが、これに限られるものではなく、例えば、１
／６秒間に取得される画像データおよび音声データから
１つの画像パケットおよび音声パケットをそれぞれ生成
するようなものであっても良い。このような構成とする
ことで、データ生成部４０における画像圧縮負荷が高く
なったときに、音声データの圧縮符号化が後回しとなる
時間が短くなるため、音声データの送信が遅くなり過ぎ
る状態をより回避することができる。In the above-described embodiment, the image data and the audio data for generating one image packet and the audio packet are acquired in 0.5 seconds, but the present invention is not limited to this. But, for example, 1
For example, one image packet and one audio packet may be generated from image data and audio data acquired in / 6 seconds. With such a configuration, when the image compression load on the data generation unit 40 becomes high, the time during which the compression encoding of the audio data is postponed becomes short, so that the transmission of the audio data becomes too slow. More can be avoided.

【０１３９】[0139]

【発明の効果】以上説明したように、請求項１の発明に
よれば、画像データの圧縮符号化と音声データの圧縮符
号化とを時分割で行うことによって、音声付き動画像デ
ータを生成するような構成とすることによって、音声付
き動画像データ生成装置の小型軽量化を図ることができ
る。As described above, according to the first aspect of the present invention, the moving picture data with sound is generated by performing the compression coding of the image data and the compression coding of the sound data in a time division manner. With such a configuration, it is possible to reduce the size and weight of the moving image data generation device with sound.

【０１４０】また、請求項２の発明によれば、画像デー
タの圧縮符号化に対する負荷が所定の閾値より大きい場
合には、音声データを一時的に記憶するような構成とす
ることによって、音声データの消失を防ぐことができる
ため、音声付き動画像データ生成装置の小型軽量化を図
りつつ、通信相手側での再生時における音声の途切れを
抑制することができる。According to the second aspect of the present invention, when the load on the compression encoding of image data is larger than a predetermined threshold, the audio data is temporarily stored. Since it is possible to prevent the disappearance of the audio data, it is possible to reduce the size and weight of the moving image data generating apparatus with sound and to suppress interruption of sound during reproduction on the communication partner side.

【０１４１】また、請求項３の発明によれば、画像デー
タの圧縮符号化の負荷が所定の閾値より小さい場合に
は、記憶手段に音声データが一時的に記憶されていれ
ば、その音声データの圧縮符号化を行い、また、記憶手
段に音声データが一時的に記憶されていなければ、音声
取得手段で新たに取得される音声データの圧縮符号化を
行うような構成とすることによって、画像データと音声
データの圧縮符号化を効率良く行うことができる。According to the third aspect of the present invention, when the load of the compression encoding of the image data is smaller than the predetermined threshold, if the audio data is temporarily stored in the storage means, By performing a compression encoding of audio data which is newly acquired by the audio acquisition unit if the audio data is not temporarily stored in the storage unit, Data and audio data can be efficiently compressed and encoded.

【０１４２】また、請求項４の発明によれば、音声取得
手段で取得した音声データを必ず記憶手段に一時的に記
憶し、画像データの圧縮符号化に対する負荷が所定の閾
値よりも小さい場合に、記憶手段に記憶された順に音声
データの圧縮符号化を行うような構成とすることによっ
て、回路規模の小型化による音声付き動画像データ生成
装置の小型軽量化をさらに図りつつ、通信相手側での再
生時における音声の途切れを抑制することができる。According to the fourth aspect of the present invention, the audio data acquired by the audio acquisition means is always temporarily stored in the storage means, and when the load on the compression encoding of the image data is smaller than a predetermined threshold value. The configuration in which the compression encoding of the audio data is performed in the order stored in the storage means makes it possible for the communication partner side to further reduce the size and weight of the audio-equipped moving image data generation device by reducing the circuit size. Can be prevented from being interrupted at the time of reproduction of.

【０１４３】また、請求項５の発明によれば、画像デー
タの圧縮符号化に対する負荷が所定の閾値より大きい場
合には、画像データの圧縮符号化における圧縮率を所定
の基準圧縮率よりも高めるような構成とすることによっ
て、音声データを一時的に記憶するための記憶手段等を
設けなくとも、音声データの消失を防ぐことができるた
め、音声付き動画像データ生成装置の小型軽量化をさら
に図りつつ、通信相手側での再生時における音声の途切
れを抑制することができる。According to the fifth aspect of the present invention, when the load on the compression encoding of the image data is larger than the predetermined threshold, the compression ratio in the compression encoding of the image data is made higher than the predetermined reference compression ratio. With such a configuration, it is possible to prevent the loss of the audio data without providing a storage unit or the like for temporarily storing the audio data. At the same time, it is possible to suppress breaks in audio during reproduction on the communication partner side.

【０１４４】また、請求項６の発明によれば、画像取得
手段から画像データに関する情報を取得して、画像デー
タの圧縮符号化に対する負荷を予測するような構成とす
ることによって、事前に、画像データの圧縮符号化に対
する負荷が高くなることを検出し、通信相手側での再生
時における音声の途切れを抑制するための制御を早急に
行うことができるため、音声の途切れをさらに低減する
ことができる。According to the sixth aspect of the present invention, the information regarding the image data is acquired from the image acquiring means, and the load on the compression encoding of the image data is predicted. Detecting an increase in the load on data compression and encoding, it is possible to quickly perform control to suppress breaks in audio during playback on the communication partner side, so that breaks in audio can be further reduced. it can.

【０１４５】また、請求項７の発明によれば、ユーザー
の操作に基づいて、画像データの圧縮符号化における圧
縮率を所定の基準圧縮率よりも高めるような構成とする
ことによって、画像データの圧縮符号化に対する負荷が
高くなる前に、ユーザーが画像データの圧縮符号化にお
ける圧縮率を高めることができるため、通信相手側での
再生時における音声の途切れを確実に回避することがで
きるとともに、回路規模のさらなる小型化による音声付
き動画像データ生成装置の小型軽量化を図ることができ
る。According to the seventh aspect of the present invention, the compression ratio in the compression encoding of the image data is made higher than a predetermined reference compression ratio based on the operation of the user. Before the load on the compression encoding becomes high, the user can increase the compression ratio in the compression encoding of the image data, so that it is possible to reliably avoid the interruption of the sound at the time of reproduction on the communication partner side, It is possible to reduce the size and weight of the moving image data generation device with sound by further reducing the circuit size.

【０１４６】また、請求項８の発明によれば、請求項１
に記載の発明と同様の効果を得ることができる。Further, according to the invention of claim 8, claim 1
The same effect as the invention described in (1) can be obtained.

[Brief description of the drawings]

【図１】実施の形態１に係るデータ生成装置の機能ブ
ロック図である。FIG. 1 is a functional block diagram of a data generation device according to a first embodiment.

【図２】音声付き動画像データのデータ構造を示す模
式図である。FIG. 2 is a schematic diagram showing a data structure of moving image data with sound.

【図３】圧縮符号化のタイミングを説明するタイミン
グチャートである。FIG. 3 is a timing chart illustrating the timing of compression encoding.

【図４】音声付き動画像データのデータ構造を説明す
る図である。FIG. 4 is a diagram illustrating a data structure of moving image data with sound.

【図５】音声データの出力制御を説明するフローチャ
ートである。FIG. 5 is a flowchart illustrating output control of audio data.

【図６】実施の形態２に係るデータ生成装置の機能ブ
ロック図である。FIG. 6 is a functional block diagram of a data generation device according to a second embodiment.

【図７】音声データの出力制御を説明するフローチャ
ートである。FIG. 7 is a flowchart illustrating output control of audio data.

【図８】実施の形態３に係るデータ生成装置の機能ブ
ロック図である。FIG. 8 is a functional block diagram of a data generation device according to a third embodiment.

【図９】実施の形態４に係るデータ生成装置の機能ブ
ロック図である。FIG. 9 is a functional block diagram of a data generation device according to a fourth embodiment.

【図１０】圧縮率の制御フローを示すフローチャート
である。FIG. 10 is a flowchart illustrating a control flow of a compression ratio.

【図１１】実施の形態５に係るデータ生成装置の機能
ブロック図である。FIG. 11 is a functional block diagram of a data generation device according to a fifth embodiment.

[Explanation of symbols]

１，２，３，４，５音声付き動画像データ生成装置
（データ生成装置）、１０画像取得部、２０音声取
得部、３０音声データ制御部、４０データ生成部、
１０１撮像部、１０３マイク、１０６，１０６Ｄ，
１０６Ｐ負荷検出部、１０７メモリ、１０８メモ
リ検出部、１０９スイッチ（ＳＷ）回路、１１０，１
１０Ｆ，１１０Ｐ出力制御部、１１１ＦＩＦＯ、１
１４，１１４Ｙ圧縮率制御部、１１５操作部。1, 2, 3, 4, 5 moving image data generation device with sound (data generation device), 10 image acquisition unit, 20 sound acquisition unit, 30 sound data control unit, 40 data generation unit,
101 imaging unit, 103 microphone, 106, 106D,
106P load detector, 107 memory, 108 memory detector, 109 switch (SW) circuit, 110, 1
10F, 110P output control unit, 111 FIFO, 1
14, 114Y Compression ratio control unit, 115 operation unit.

───────────────────────────────────────────────────── フロントページの続き (51)Int.Cl.⁷ 識別記号ＦＩテーマコート゛(参考）Ｈ０４Ｎ 7/24 (72)発明者石田秀徳東京都千代田区丸の内二丁目２番３号三菱電機株式会社内Ｆターム(参考） 5C022 AA12 AC69 AC72 5C059 MA00 PP04 RA08 RB02 RC04 RC32 RE03 SS07 SS30 TA06 TA46 TC18 TC38 TD05 TD12 UA02 UA32 UA38 5C063 AB07 AC01 AC05 CA11 CA20 DA05 DA13 5J064 AA04 BC01 BC02 BC06 BC22 BC26 BD02 5K101 LL12 NN06 NN07 NN15 NN18 SS07 SS08 TT06 UU19 ──────────────────────────────────────────────────続き Continued on the front page (51) Int.Cl. ⁷ Identification symbol FI Theme coat ゛ (Reference) H04N 7/24 (72) Inventor Hidenori Ishida 2-3-2 Marunouchi, Chiyoda-ku, Tokyo Mitsubishi Electric Corporation F-term (reference) 5C022 AA12 AC69 AC72 5C059 MA00 PP04 RA08 RB02 RC04 RC32 RE03 SS07 SS30 TA06 TA46 TC18 TC38 TD05 TD12 UA02 UA32 UA38 5C063 AB07 AC01 AC05 CA11 CA20 DA05 DA13 5J064 AA04 BC01 BC02 BC06 BC12 BC02 BC06 NN18 SS07 SS08 TT06 UU19

Claims

[Claims]

An image acquisition unit for acquiring image data; an audio acquisition unit for acquiring audio data; and a compression encoding of the image data and a compression encoding of the audio data being performed in a time-division manner. A moving image data with sound generation apparatus, comprising: data generating means for generating moving image data with sound in which encoded image data and compression-encoded sound data are multiplexed.

2. The video data generating apparatus with sound according to claim 1, wherein: a load detecting unit that compares a compression load for the compression encoding of the image data in the data generating unit with a predetermined threshold value; A storage unit for temporarily storing the audio data as temporary storage data when the compression load is larger than the predetermined threshold value;

3. The video data generating apparatus with sound according to claim 2, wherein the temporary storage data is stored in the storage unit when the compression load is smaller than the predetermined threshold. A compression encoding of the temporary storage data is performed by a data generation unit, and the compression encoding of the audio data newly acquired by the audio acquisition unit by the data generation unit if the temporary storage data is not stored in the storage unit. And a control unit for controlling the data generation unit so as to perform the conversion.

4. The moving image data generation device with sound according to claim 2, wherein the storage unit temporarily stores the audio data acquired by the audio acquisition unit as temporary storage data. The audio-moving image data generating device, when the compression load is smaller than the predetermined threshold, reads out the temporary storage data in the order stored in the storage unit and performs compression encoding. A moving image data generating apparatus with sound, further comprising control means for controlling.

5. The moving image data with sound generation apparatus according to claim 1, wherein: a load detection unit that compares a compression load for compression encoding of the image data in the data generation unit with a predetermined threshold value; A compression ratio control unit that controls the data generation unit so that a compression ratio in the compression encoding of the image data is higher than a predetermined reference compression ratio when the compression load is larger than a predetermined threshold. A moving image data generation device with sound, characterized in that:

6. The moving image data with sound generation device according to claim 2, wherein the load detecting means acquires information on the image data from the image acquiring means, and A video data generating apparatus with sound, wherein the compression load is predicted based on information.

7. The moving image data with sound generation apparatus according to claim 1, wherein a compression ratio in the compression encoding of the image data is set higher than a predetermined reference compression ratio based on a user operation. A moving image data generation device with sound, further comprising a compression ratio control unit for controlling the data generation unit.

8. A method of obtaining image data, a step of obtaining audio data, and a step of performing compression encoding of the image data and compression encoding of the audio data in a time division manner. A method for generating moving image data with sound, comprising: generating moving image data with sound in which image data and sound data compressed and encoded are multiplexed.