JP3466683B2

JP3466683B2 - Multimedia communication device and multimedia communication method

Info

Publication number: JP3466683B2
Application number: JP35169493A
Authority: JP
Inventors: 正寿大谷
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 1993-12-29
Filing date: 1993-12-29
Publication date: 2003-11-17
Anticipated expiration: 2018-11-17
Also published as: JPH07203072A

Description

Detailed Description of the Invention

[0001]

[Field of Industrial Application] The present invention relates to a multimedia communication device and a multimedia communication method, typified by AV (audio-visual) communication devices such as videophones and video conference systems, that can multiplex multimedia information such as audio information, image information, and data for two-way communication.

[0002]

[Prior Art] In recent years, communication services over ISDN (Integrated Services Digital Network) lines have entered practical use, and AV services such as videophones and video conference systems that use such digital lines have attracted attention. The service definitions, protocol definitions, multimedia multiplexing frame structure, moving-image coding scheme, and so on for AV services are specified in CCITT (International Telegraph and Telephone Consultative Committee) Recommendations H.320, H.242, H.221, H.230, H.261, and others. H.221 specifies the frame structure for AV services on 64 kbps to 1920 kbps channels.

[0003] FIG. 6 shows the H.221 frame structure on a single 64 kbps channel. The numbers 1 to 8 on the horizontal axis are bit numbers and the numbers 1 to 80 on the vertical axis are octet numbers; 80 octets make up one frame. The FAS (frame synchronization signal) in the figure is used to control frame synchronization and multiframe synchronization, to monitor communication quality, and to carry alarm notifications and similar control.
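The frame figures above imply a fixed frame length and duration. A minimal Python sketch of that arithmetic, using only the numbers stated in the paragraph (the constant and function names are illustrative, not taken from the recommendation):

```python
# Figures stated in paragraph [0003].
BITS_PER_OCTET = 8         # bit numbers 1 to 8
OCTETS_PER_FRAME = 80      # octet numbers 1 to 80
CHANNEL_RATE_BPS = 64_000  # one 64 kbps channel


def frame_bits() -> int:
    """Number of bits in one frame."""
    return BITS_PER_OCTET * OCTETS_PER_FRAME


def frame_duration_ms() -> float:
    """Time needed to send one frame on the 64 kbps channel, in milliseconds."""
    return frame_bits() * 1000 / CHANNEL_RATE_BPS


def frames_per_second() -> float:
    """Number of frames sent each second."""
    return CHANNEL_RATE_BPS / frame_bits()
```

With these values one frame holds 640 bits, so each frame occupies 10 ms of channel time.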

[0004] FIG. 7(a) shows the bit assignment of the FAS over one multiframe, which consists of 8 submultiframes (1 submultiframe = 2 frames). The BAS (bit-rate allocation signal) is used to indicate terminal capabilities, to specify the actual bit-rate allocation for each medium within the frame, and to carry various other controls and notifications. As shown in FIG. 7(b), the BAS is transmitted in even frames, and the corresponding error-correction bits are transmitted in odd frames.
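The multiframe layout described above can be sketched in Python as follows. This is only an illustration: the text does not say how frames are numbered, so counting frames from 0 (making frames 0, 2, 4, ... the "even" ones) is an assumption, and the function and key names are hypothetical.

```python
FRAMES_PER_SUBMULTIFRAME = 2
SUBMULTIFRAMES_PER_MULTIFRAME = 8
FRAMES_PER_MULTIFRAME = FRAMES_PER_SUBMULTIFRAME * SUBMULTIFRAMES_PER_MULTIFRAME


def describe_frame(n: int) -> dict:
    """Locate frame n (counted from 0, an assumption) within the multiframe structure."""
    position = n % FRAMES_PER_MULTIFRAME
    return {
        "multiframe": n // FRAMES_PER_MULTIFRAME,
        "submultiframe": position // FRAMES_PER_SUBMULTIFRAME,
        # Even frames carry the BAS; odd frames carry its error-correction bits.
        "bas_position_carries": "BAS" if position % 2 == 0 else "error correction",
    }
```

Under this numbering a multiframe spans 16 frames, and each submultiframe pairs one BAS frame with one frame of error-correction bits.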

[0005] H.242 specifies communication procedures between AV terminals, such as the capability exchange sequence and the mode switching sequence, which use the in-channel BAS. H.320 specifies the system aspects of AV services in general. H.230 specifies, as supplementary information for the functions required by AV services, various controls and notifications that require transmission frame synchronization or an urgent response. H.261 specifies the coding/decoding scheme for moving-image information at rates of p x 64 kbps (p = 1 to 30).

[0006] FIG. 8 shows the basic sequence for multimedia communication of images, audio, data (all user information other than images and audio), and the like in accordance with the above recommendations. First, when the multimedia communication device that wants to start communication (hereinafter, the calling device) is activated, out-of-channel call control is started and the called device is notified of the incoming call. On an ISDN line, the call setup sequence on the D channel (Dch) is started.

[0007] The called device first determines, in step S801 of FIG. 8, whether the incoming call is a videophone call. On an ISDN line, it uses the BC (bearer capability) information element, the HLC (high-layer compatibility) information element, the LLC (low-layer compatibility) information element, and the like in the SETUP message of Dch call control to determine whether the call is a telephone call, a videophone call, or an entirely different kind of call.

[0008] If step S801 determines that the call is not a videophone call, the processing ends without further action. If it determines that the call is a videophone call, the process proceeds to step S802, where Dch call setup is performed and the first connection is established. Next, in step S803, frame synchronization is established on the connection that was set up (corresponding to a B channel, H channel, etc. in ISDN) by FAS search and detection and by sending and detecting A bit = 0. After synchronization is established, the process proceeds to step S804, where the capabilities of the remote device are determined through the capability exchange sequence based on BAS transmission and reception detection. Next, in step S805, the multimedia multiplexing allocation on the first connection established in step S802 is decided, the mode switching sequence is carried out by sending and receiving BAS commands, and multiplexed multimedia communication begins.

[0009] In practice, when additional connections are going to be established afterwards, communication at this stage is often limited to audio information only. For example, communication starts in a mode with 56 kbps audio and H.261 video off. The process then proceeds to step S806; if both the local device and the remote device are judged to have additional-connection capability, the process proceeds to step S807 and an additional connection is established. The process then proceeds to step S808, where FAS search and detection and the A bit are used on the additional connection to establish frame synchronization, multiframe synchronization, and synchronization with the first channel. The process then returns to step S806 to determine again whether another additional connection is needed.

[0010] When no further additional connection is needed, the process proceeds to step S809, where an operating mode suited to the use of all channels is decided and the mode switching sequence is carried out by sending and receiving BAS commands; for example, H.261 video transmission is turned on and the audio coding is changed to the coding best suited to both terminals. Multiplexed multimedia communication is then performed, and the processing ends. Note that the BAS-command mode switching sequence of step S809 may instead be performed each time, immediately after the synchronization step S808 for each additional connection.
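The called-side flow of steps S801 to S809 can be summarized with a short Python sketch. It only lists the order of the steps as the text describes them; the function name and its two inputs are hypothetical, and `extra_connections` stands in for the number of times step S806 finds that both devices can add another connection.

```python
from typing import List


def called_side_sequence(is_videophone_call: bool, extra_connections: int) -> List[str]:
    """Return the FIG. 8 steps in the order the description walks through them."""
    steps = ["S801"]                     # is the incoming call a videophone call?
    if not is_videophone_call:
        return steps                     # not a videophone call: end without action
    steps += ["S802",                    # Dch call setup, first connection
              "S803",                    # frame synchronization (FAS, A bit = 0)
              "S804",                    # capability exchange via BAS
              "S805"]                    # mode switching, multiplexed communication starts
    for _ in range(extra_connections):
        steps += ["S806",                # additional connection possible on both sides
                  "S807",                # establish the additional connection
                  "S808"]                # synchronize it with the first channel
    steps += ["S806",                    # no further connection needed
              "S809"]                    # mode switching for the use of all channels
    return steps
```

The variant mentioned at the end of the paragraph, in which the mode switching is repeated after every S808, would add an S809 entry inside the loop.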

[0011] In a videophone or video conference system, a response message that carries video together with audio differs from the audio-only response message of an ordinary telephone or of an answering-machine (message recording) mode. Such a message is sent with the audio and video information multiplexed on the transmission path, but which multiplexing will be used cannot be determined until the capability exchange sequence with the actually connected remote device has been executed. Moreover, because H.261 video coding uses interframe predictive coding, simply recording the encoded audio and video information as a response message does not guarantee that it can always be used. Simple approaches are possible: sending the response message from an analog video/audio storage medium such as a VTR (video tape recorder), or using audio and video information that has simply been digitized onto a digital storage medium such as an HD (hard disk) or MOD (magneto-optical disk) as the response message. As for audio information that is simply encoded, stored, and sent as a response message, the method of decoding it and then encoding it again for transmission is already used to some extent even on analog telephones.

[0012]

[Problems to Be Solved by the Invention] However, in the conventional example above, when an analog storage medium such as a VTR is used, one of several response messages cannot be selected dynamically and immediately at playback time, and the device itself becomes large and expensive.

[0013] When a digital storage medium such as an HD is used, storing moving-image information requires an enormous storage capacity.

[0014] Furthermore, when encoded moving-image information is simply stored as it is, decoding and display on the remote device do not work properly even if the information is sent.

[0015] The present invention has been made in view of these problems of the prior art. Its object is to provide a multimedia communication device and a multimedia communication method that allow a videophone answering-machine response message to function in the same way as the answering-machine response message of an ordinary telephone, while limiting the growth in storage capacity needed to handle moving-image information at the same time.

[0016]

[Means for Solving the Problems] To achieve the above object, the multimedia communication device of the present invention has audio encoding/decoding means for encoding audio information and decoding the encoded audio information; video encoding/decoding means for encoding video information and decoding the encoded video information; multiplexing means for multiplexing multimedia information such as data together with the audio information and video information encoded by the audio encoding/decoding means and the video encoding/decoding means; and demultiplexing means for separating the multimedia information multiplexed by the multiplexing means so that each medium can be decoded. The device is characterized by comprising: answering-machine control means that, on an incoming call during the user's absence, automatically accepts the call, starts transmitting the audio and video information for a response message recorded in advance in the recording unit to notify the remote terminal that the device is in answering-machine mode, and then controls recording of the audio information, video information, and the like received from the remote terminal in the recording unit; recording control means that controls recording, in advance in the recording unit, of the audio and video information for the response message used in answering-machine mode, such that when this information is recorded, the audio information is recorded as the output of the audio encoding/decoding means and the video information as the output of the video encoding/decoding means, both in encoded form, and such that the video information for the response message is encoded at the maximum video transmission rate that the multimedia communication device can transmit before being recorded in the recording unit; first input control means that, when the audio and video information for the response message recorded in the recording unit is transmitted, controls input of the encoded audio information directly into the multiplexing means; and second input control means that, when the audio and video information for the response message recorded in the recording unit is transmitted, controls input of the encoded video information into the video encoding/decoding means, where it is decoded and then encoded to match the capability of the remote terminal before being input to the multiplexing means.

Likewise, to achieve the above object, the multimedia communication method of the present invention is a method of communicating by means of a multimedia communication device having audio encoding/decoding means for encoding audio information and decoding the encoded audio information; video encoding/decoding means for encoding video information and decoding the encoded video information; multiplexing means for multiplexing multimedia information such as data together with the audio information and video information encoded by the audio encoding/decoding means and the video encoding/decoding means; and demultiplexing means for separating the multimedia information multiplexed by the multiplexing means so that each medium can be decoded. The method is characterized by comprising: an answering-machine control step of, on an incoming call during the user's absence, automatically accepting the call, starting transmission of the audio and video information for a response message recorded in advance in the recording unit to notify the remote terminal that the device is in answering-machine mode, and then controlling recording of the audio information, video information, and the like received from the remote terminal in the recording unit; a recording control step of controlling recording, in advance in the recording unit, of the audio and video information for the response message used in answering-machine mode, such that when this information is recorded, the audio information is recorded as the output of the audio encoding/decoding means and the video information as the output of the video encoding/decoding means, both in encoded form, and such that the video information for the response message is encoded at the maximum video transmission rate that the multimedia communication device can transmit before being recorded in the recording unit; a first input control step of, when the audio and video information for the response message recorded in the recording unit is transmitted, controlling input of the encoded audio information directly into the multiplexing means; and a second input control step of, when the audio and video information for the response message recorded in the recording unit is transmitted, controlling input of the encoded video information into the video encoding/decoding means, where it is decoded and then encoded to match the capability of the remote terminal before being input to the multiplexing means.
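The difference between the first and second input control can be sketched in Python as follows. This is a minimal illustration rather than the claimed implementation: `StoredMessage`, `decode_video`, `encode_video`, and `prepare_for_mux` are hypothetical names, and the two codec functions are placeholders that only pass data through so the routing is visible.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass
class StoredMessage:
    audio_bitstream: bytes  # recorded as output by the audio codec (encoded form)
    video_bitstream: bytes  # recorded as output by the video codec at the maximum rate


def decode_video(bitstream: bytes) -> bytes:
    """Placeholder for the video decoder (hypothetical): returns the 'pictures'."""
    return bitstream


def encode_video(pictures: bytes, rate_kbps: float) -> Tuple[float, bytes]:
    """Placeholder for the video encoder (hypothetical): tags output with its rate."""
    return (rate_kbps, pictures)


def prepare_for_mux(msg: StoredMessage, peer_video_rate_kbps: float):
    """Build the two multiplexer inputs for sending a stored response message."""
    # First input control: the encoded audio goes to the multiplexer unchanged.
    audio_in = msg.audio_bitstream
    # Second input control: decode the stored video, then encode it again
    # to match what the remote terminal can handle.
    pictures = decode_video(msg.video_bitstream)
    video_in = encode_video(pictures, peer_video_rate_kbps)
    return audio_in, video_in
```

Only the video path passes through the codec again; the audio path avoids a second encoding step.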

[0017]

[0018]

[0019]

[0020]

[0021]

[0022]

[Embodiments] Embodiments of the present invention are described below with reference to FIGS. 1 to 5.

[0023] [First Embodiment] FIG. 1 is a block diagram showing the configuration of a multimedia communication device according to a first embodiment of the present invention. In the figure, 1 is a handset, one of the audio input/output means; 2 is a microphone, one of the audio input means; 3 is a speaker, one of the audio output means; and 4 is an audio interface (I/F) unit. Under instructions from the system control unit 15 described later, the audio interface unit 4 provides a function for switching among the handset 1, microphone 2, and speaker 3 as audio input/output means; an on/off-hook detection function for detecting whether the handset 1 is on-hook or off-hook; an echo cancellation function for removing echo when the microphone 2 and speaker 3 are used as audio input/output means; and a tone generation function for the dial tone, ringback tone, busy tone, incoming-call ring tone, and the like. 5 is an audio encoding/decoding unit. Under instructions from the system control unit 15 described later, it has a function of A/D-converting and encoding the transmit audio signal (information) and a function of decoding the receive audio signal with D/A conversion, in accordance with an audio signal (information) coding/decoding algorithm such as 64 kbps PCM (A-law), 64 kbps PCM (μ-law), 64 kbps/56 kbps/48 kbps SB-ADPCM, 32 kbps ADPCM, LD-CELP, 16 kbps, or 8 kbps.

[0024] 6 is an audio/video input/output unit such as a VTR (video tape recorder); 7 is a camera, one of the video input means, for capturing the user's own image and the like; 8 is a document camera, one of the image input means, for capturing pictures, drawings, and the like; 9 is a display unit that displays the input image from the camera 7 or the document camera 8, the image received from the remote party, operation screens, and the like; and 10 is a video interface (I/F) unit. Under instructions from the system control unit 15 described later, the video interface unit 10 provides a function for switching between the camera 7 and the document camera 8 as image input means, a function for switching the display among the input image, the received image, and the operation screen, and an image-signal compositing function for showing them in a split display on the display unit 9.

[0025] 11 is a video encoding/decoding unit (image codec unit). In accordance with CCITT Recommendation H.261, it has a function of A/D-converting and encoding the transmit video signal (information) and a function of decoding the receive video signal (information) with D/A conversion.

[0026] 12 is a data terminal that sends and receives data, and 13 is a data interface unit, which passes transmit data from the data terminal 12 and the system control unit 15 to the multiplexing/demultiplexing unit 16 and passes received data to the data terminal 12 or the system control unit 15.

[0027] 14 is an operation unit, such as a keyboard, used to enter control information for controlling the device as a whole. 15 is a system control unit comprising a CPU, ROM, RAM, auxiliary storage, and the like. It monitors the state of each unit and controls the device as a whole; calculates the transmission rate to allocate to each medium from the entered control information, the state of the line in use, and so on, and makes the final mode decision and performs mode control; creates operation and display screens according to the state; and executes application programs such as the man-machine interface.

[0028] 16 is a multiplexing/demultiplexing unit. In accordance with CCITT Recommendation H.221, it multiplexes, in units of transmit frames, the audio signal from the audio encoding/decoding unit 5, the video signal from the video encoding/decoding unit 11, the data from the data interface unit 13, the data from the system control unit 15, and control information under CCITT Recommendations H.221, H.242, and the like. It also separates each received frame into its constituent media and passes them to the audio encoding/decoding unit 5, the video encoding/decoding unit 11, the data interface unit 13, the system control unit 15, and so on. 17 is a line interface unit that controls the line 19 in accordance with the ISDN user-network interface. 18 is a storage unit that stores various kinds of control information.

[0029] FIG. 2 is a block diagram showing the flow of audio and video information between the analog input/output units and the multiplexing/demultiplexing unit of the multimedia communication device according to this embodiment. In the figure, 101 is an analog audio interface (I/F) unit that controls the analog audio signals input to and output from the device; 102 is a first A/D conversion unit that converts the input analog audio signal into a digital audio signal; 103 is an audio encoding unit that encodes the audio signal digitized by the first A/D conversion unit 102; 104 is a first D/A conversion unit that converts the digital audio signal decoded by the audio decoding unit 105, described next, into an analog audio signal; 105 is an audio decoding unit that decodes the received encoded audio signal; 106 is an analog video interface (I/F) unit that controls the analog video signals input to and output from the device; 107 is a second A/D conversion unit that converts the input analog video signal into a digital video signal; 108 is a video encoding unit that encodes the video signal digitized by the second A/D conversion unit 107; 109 is a second D/A conversion unit that converts the digital video signal decoded by the video decoding unit 110, described next, into an analog video signal; and 110 is a video decoding unit that decodes the received encoded video signal.

[0030] 111 is a multiplexing unit that, under instructions from the system control unit 15 of FIG. 1, multiplexes the encoded audio signal from the audio encoding unit 103 and the encoded video signal from the video encoding unit 108 and sends the result to the line interface unit 17 of FIG. 1; 112 is a demultiplexing unit that separates the signal received from the line interface unit 17 of FIG. 1 and forwards the audio signal to the audio decoding unit 105 and the video signal to the video decoding unit 110; 113 is a digital recording unit, consisting of a magneto-optical disk, HD (hard disk), or the like, capable of digitally storing video and audio signals; 114 is an analog storage unit, such as a VTR, that can supply audio and video signals in synchronization through an analog I/F and, conversely, can store in synchronization the audio and video signals output through the analog I/F; and 115 is a digital video processing unit that composites and edits various digital video signals. 116 is a first frame memory controlled by the digital video processing unit 115 in order to control the output from the second A/D conversion unit 107 and the output to the video encoding unit 108, and 117 is a second frame memory controlled by the video decoding unit 115.

[0031] FIG. 3 is a block diagram showing the internal configuration of the video encoding unit 108 and the video decoding unit 110 of FIG. 2 when they conform to CCITT Recommendation H.261. In the figure, 201 is a source coder that encodes the input digital video signal by motion-compensated interframe prediction, DCT, and quantization; 202 is a video multiplex coder that performs multiplexing with variable-length coding according to a four-layer hierarchy of frame, GOB, MB, and block; 203 is a transmit buffer that temporarily stores the output signal of the video multiplex coder 202; 204 is a transmission coder that performs coding using BCH (511, 493) error-correction coded frames; 205 is a coding control unit that controls the video encoding unit 108 as a whole; 206 is a source decoder that decodes the code received from the video encoding unit 108 of FIG. 2; 207 is a video multiplex decoder; 208 is a receive buffer that temporarily stores the output signal of the transmission decoder 209; and 209 is a transmission decoder.
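The BCH (511, 493) figures quoted for the transmission coder 204 fix the share of each coded block that is available for video data. A small Python sketch of that arithmetic follows; it uses only the two numbers in the text, ignores any additional framing bits a real H.261 bitstream may carry, and its names are illustrative.

```python
from fractions import Fraction

BLOCK_BITS = 511  # bits per BCH code block
DATA_BITS = 493   # information bits per block

PARITY_BITS = BLOCK_BITS - DATA_BITS        # error-correction bits per block
CODE_RATE = Fraction(DATA_BITS, BLOCK_BITS)  # fraction of each block that is data


def data_rate_bps(coded_rate_bps: int) -> float:
    """Data rate left over when the coded stream runs at coded_rate_bps."""
    return float(coded_rate_bps * CODE_RATE)
```

Each block therefore carries 18 parity bits, which is roughly 3.5% of the coded stream.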

[0032] The operation of the multimedia communication device configured as above is described in detail below with reference to the flowchart of FIG. 4.

【００３３】[0033]

First, when creation of an answering-machine response message is instructed from the operation unit 14 of FIG. 1, step S401 determines (selects) the audio encoding mode to be used for creating the response message, choosing the one that is optimal in terms of both compression efficiency and playback sound quality. For example, suppose the apparatus supports a proprietary 8 Kbps mode, 16 Kbps LD-CELP, PCM A-law, and PCM μ-law. Considering compression efficiency alone, the proprietary 8 Kbps mode is the most effective; but if it has a problem with playback sound quality, in particular if the quality becomes extremely poor once the decoded signal is converted to PCM, 16 Kbps LD-CELP is selected.
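The selection rule of step S401 can be sketched as follows. This is only an illustrative sketch: the `AudioMode` type, the `quality_ok` flag, and the 64 Kbps figure used for the PCM modes are assumptions introduced here, not details given in the description.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AudioMode:
    name: str
    bitrate_kbps: int
    # Hypothetical flag: True if playback quality is acceptable
    # after the stored data is decoded back to PCM.
    quality_ok: bool


def select_audio_mode(modes):
    """Pick the lowest-bitrate mode whose playback quality is acceptable."""
    acceptable = [m for m in modes if m.quality_ok]
    if not acceptable:
        raise ValueError("no supported mode has acceptable playback quality")
    return min(acceptable, key=lambda m: m.bitrate_kbps)


# Modes named in the description; bitrates for PCM are assumed.
SUPPORTED = [
    AudioMode("proprietary-8k", 8, quality_ok=False),
    AudioMode("LD-CELP-16k", 16, quality_ok=True),
    AudioMode("PCM A-law", 64, quality_ok=True),
    AudioMode("PCM mu-law", 64, quality_ok=True),
]
```

With the proprietary 8 Kbps mode marked as having unacceptable quality, `select_audio_mode(SUPPORTED)` returns the 16 Kbps LD-CELP entry, matching the example in the text.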

【００３４】[0034]

Next, in step S402, the video encoding mode and the maximum assignable transfer rate (maximum transfer rate) for video encoding are selected. For example, if both CIF and QCIF are available, CIF is selected. If the maximum transfer rate is 2B and the only audio mode is PCM, the apparatus selects either 68.8 Kbps, the maximum when video is sent together with audio, or 128 Kbps / 124.8 Kbps, the maximum rate for video alone. Then, in step S403, based on the selection made in step S402, clocks are supplied to the encoding units 103 and 108 of FIG. 2 to start them. This can be done by starting the multiplexing unit 111 of FIG. 2 in a pseudo manner. A completely separate clock generator may of course be used instead.
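The rates quoted for step S402 are consistent with the following arithmetic. The description does not give this derivation; the sketch assumes 1.6 Kbps of H.221 FAS and BAS overhead per B channel and 56 Kbps for PCM audio carried inside the H.221 frame. All names are hypothetical.

```python
B_CHANNEL_BPS = 64_000   # one ISDN B channel
FAS_BAS_BPS = 1_600      # assumed H.221 FAS + BAS overhead per B channel
PCM_FRAMED_BPS = 56_000  # assumed PCM audio rate inside an H.221 frame


def max_video_rate_bps(channels, audio_bps=0, framed=True):
    """Bit rate left for video after framing overhead and audio."""
    total = channels * B_CHANNEL_BPS
    if framed:
        total -= channels * FAS_BAS_BPS
    return total - audio_bps


print(max_video_rate_bps(2, PCM_FRAMED_BPS))  # video sent together with PCM audio
print(max_video_rate_bps(2))                  # video only, framed
print(max_video_rate_bps(2, framed=False))    # video only, unframed
```

Under these assumptions the three calls give 68.8 Kbps, 124.8 Kbps, and 128 Kbps, the three figures named in the text for a 2B connection.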

【００３５】[0035]

Next, in step S404, the maximum duration of the response message is shown on the display unit 9 of FIG. 1 to inform the user and, if necessary, is changed by input from the operation unit 14 of FIG. 1. Then, in step S405, the audio and video files for the response message are opened and their various attributes are recorded (stored). Next, in step S406, the user is notified that response message creation is about to start, using the display unit 9, the handset 1, or the speaker 3 of FIG. 1, and, depending on the case, the apparatus waits for an instruction from the user via the operation unit 14.

【００３６】[0036]

When response message creation starts, the process proceeds to step S407, where the output of the video signal multiplex encoder 202 shown in FIG. 3, inside the video encoding unit 108 of FIG. 2, is monitored while waiting for the start of an encoded frame, specifically for detection of a "frame start code". When the frame start code is detected, the process proceeds to step S408, where the user is notified that storage is starting, using the display unit 9, the handset 1, or the speaker 3 of FIG. 1.
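Waiting for the frame start code in step S407 can be illustrated with a bit-level search. The sketch assumes that the "frame start code" is the 20-bit H.261 picture start code and that it may fall on any bit position, since H.261 streams are not byte-aligned. The function name is hypothetical, and a real device would scan the encoder output incrementally rather than a complete buffer.

```python
# H.261 picture start code: fifteen 0 bits, a 1 bit, then four 0 bits.
PSC = "00000000000000010000"


def find_frame_start(data: bytes) -> int:
    """Return the bit offset of the first picture start code, or -1 if absent."""
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits.find(PSC)
```

Storage would then begin at the returned bit offset, so that the recorded video starts on a frame boundary.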

【００３７】[0037]

Then, in step S409, transfer to and storage in the digital storage unit 113 of FIG. 2 are started. For audio, the output of the audio encoding unit 103 of FIG. 2 is stored; for video, the output of the video signal multiplex encoder 202 shown in FIG. 3, inside the video encoding unit 108 of FIG. 2, is stored beginning from the head of the frame detected in step S407. Next, in step S411, the process monitors for either a timeout of the maximum response-message creation time started in step S403 or an instruction to end response message creation. When either is detected, the process proceeds to step S412, closes the response message files, and ends this processing operation.
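The storage loop of steps S409 to S412 can be sketched as below. This is a simplified model under stated assumptions: encoded data arrives as an iterable of chunks, the "file" is a plain list, and `stop_requested` and `clock` are hypothetical callables standing in for the operation unit and the timer.

```python
import time


def record_response_message(chunks, max_seconds, stop_requested, sink,
                            clock=time.monotonic):
    """Append encoded chunks to sink until timeout or a stop request.

    Returns the number of chunks stored.
    """
    deadline = clock() + max_seconds
    for chunk in chunks:
        # S411: stop on timeout of the maximum time or on a user instruction.
        if clock() >= deadline or stop_requested():
            break
        sink.append(chunk)  # S409: store the already-encoded data as-is
    return len(sink)        # S412 (closing the file) is left to the caller
```

Passing the clock as a parameter keeps the sketch testable without real delays.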

【００３８】[0038]

[Second Embodiment] Next, the operation of a multimedia communication apparatus according to a second embodiment of the present invention is described in detail with reference to the flowchart of FIG. 5. The basic configuration of the multimedia communication apparatus in this embodiment is the same as that of the first embodiment shown in FIGS. 1 to 3, so FIGS. 1 to 3 are reused in the description.

【００３９】[0039]

First, when an incoming call arrives in step S501 of FIG. 5, it is determined whether the apparatus is in answering-machine mode; if it is not, this processing operation ends without doing anything. If it is in answering-machine mode, the process proceeds to step S502, where the call is set up automatically, that is, the first connection is established, and frame synchronization establishment, the capability information exchange sequence using BAS codes, and the mode switching sequence are executed in-channel in accordance with CCITT Recommendations "H.221" and "H.242". Next, in step S503, it is determined whether there is a request to set up an additional connection. If there is, the process proceeds to step S504, where the additional connection is established, frame synchronization and multiframe synchronization are established on the additional connection, and synchronization with the first channel is established, after which the process proceeds to step S505. If there is no such request in step S503, step S504 is skipped and the process proceeds to step S505.
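The branching of steps S501 to S505 can be summarized in a small sketch that returns the sequence of steps taken. The function and parameter names are hypothetical, and the actual signalling performed in each step is reduced to comments.

```python
def answer_incoming_call(answering_mode, additional_connection_requested):
    """Return the flowchart steps executed for one incoming call."""
    steps = ["S501"]  # incoming call: check answering-machine mode
    if not answering_mode:
        return steps  # not in answering-machine mode: do nothing
    # First connection, frame sync, capability exchange, mode switching.
    steps.append("S502")
    steps.append("S503")  # is an additional connection requested?
    if additional_connection_requested:
        # Additional connection, frame/multiframe sync, sync with channel 1.
        steps.append("S504")
    steps.append("S505")
    return steps
```

For example, a call in answering-machine mode with no additional connection passes through S501, S502, S503, and S505.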

【００４０】[0040]

In step S505, based on the result of the capability information exchange sequence, the audio encoding mode and video encoding mode for transmission are determined, the mode switching sequence is executed, and a response message is selected. Next, in step S506, if the stored audio encoding mode of the selected response message differs from the transmission audio encoding mode, one of the following paths of FIG. 2 is set: digital storage unit 113 → audio decoding unit 105 → multiplexing unit 111, or digital storage unit 113 → audio decoding unit 105 → first D/A conversion unit 104 → first A/D conversion unit 102 → audio encoding unit 103 → multiplexing unit 111.
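The path choice of step S506 can be sketched as a lookup. The two conversion paths are those listed in the text; the direct path used when the modes match is an assumption drawn from the claim language (encoded audio input directly to the multiplexing means), and the `via_analog` switch is a hypothetical parameter.

```python
def audio_send_path(stored_mode, send_mode, via_analog=False):
    """Return the chain of FIG. 2 units used to send the stored audio."""
    if stored_mode == send_mode:
        # Assumption: matching modes need no conversion.
        return ["digital storage 113", "multiplexer 111"]
    if via_analog:
        # Decode, pass through the analog domain, and re-encode.
        return ["digital storage 113", "audio decoder 105", "D/A 104",
                "A/D 102", "audio encoder 103", "multiplexer 111"]
    return ["digital storage 113", "audio decoder 105", "multiplexer 111"]
```

Calling `audio_send_path("LD-CELP-16k", "PCM A-law", via_analog=True)` yields the longer path through the first D/A and A/D conversion units.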

【００４１】[0041]

Then, in step S507, the following path is set as the transmission path for the video response message: digital storage unit 113 of FIG. 2 → video decoding unit 110 → digital video processing unit 115 → (second frame memory 117 and first frame memory 116) → video encoding unit 108 → multiplexing unit 111. In more detail, the path is: digital storage unit 113 of FIG. 2 → video signal multiplex decoder 207 of FIG. 3 → information source decoder 206 → digital video processing unit 115 → (second frame memory 117 → first frame memory 116) → information source encoder 201 → video signal multiplex encoder 202 → transmission buffer 203 → transmission encoder 204 → multiplexing unit 111 of FIG. 2. Next, in step S508, the apparatus waits for transmission of the response message to finish. When transmission ends, the process proceeds to step S509, where the normal receive path is restored, the incoming message is received and recorded, and this processing operation ends.

【００４２】[0042]

[Other Embodiments] In the embodiments above, to simplify the description, only the case in which the stored encoded video information is decoded and then re-encoded for transmission was described. The invention is not limited to this, however; the stored encoded video information may be input directly to the multiplexing unit 111 of FIG. 2.

【００４３】[0043]

Also, instead of taking the encoded video information directly from the video encoding unit 108 of FIG. 2, it can be looped back from the multiplexing unit 111 of FIG. 2 to the demultiplexing unit 112 and taken from the video signal multiplex decoder 207 shown in FIG. 3, inside the video decoding unit 110.

【００４４】[0044]

[Effects of the Invention] As described above, according to the present invention, the response message for answering-machine mode can be digitally stored in the recording unit in encoded form, with the audio information and video information synchronized. It can therefore be stored in a much smaller storage capacity, and it can be accessed and played back dynamically.

【００４５】[0045]

【００４６】[0046]

[Brief description of drawings]

FIG. 1 is a block diagram showing the configuration of a multimedia communication apparatus according to an embodiment of the present invention.

FIG. 2 is a block diagram showing the flow of audio information and video information from the analog input/output units to the multiplexing/demultiplexing unit of the multimedia communication apparatus according to the embodiment.

FIG. 3 is a block diagram showing the internal configuration of the video encoding unit and the video decoding unit of FIG. 2 when they conform to CCITT Recommendation "H.261".

FIG. 4 is a flowchart showing the operation of the multimedia communication apparatus according to the embodiment.

FIG. 5 is a flowchart showing the operation of a multimedia communication apparatus according to the second embodiment of the present invention.

FIG. 6 is a frame structure diagram as given in CCITT Recommendation "H.221".

FIG. 7 is a diagram of the bit allocation of FAS and BAS within a multiframe, as given in CCITT Recommendation "H.221".

FIG. 8 is a flowchart showing the basic connection sequence in a conventional videophone device or video conference system conforming to the CCITT "H-series Recommendations".

[Explanation of symbols]

5 audio encoding/decoding unit (audio encoding/decoding means)
11 video encoding/decoding unit (video encoding/decoding means)
15 system control unit (answering-machine recording control means, first to third recording control means, first and second input control means, decoding and synchronization control means, decoding control means)
16 multiplexing/demultiplexing unit
18 recording unit (recording unit)
106 analog storage unit (recording unit)
111 multiplexing unit (multiplexing means)
112 demultiplexing unit (demultiplexing means)
113 digital storage unit (recording unit)
201 information source encoder (encoding means)
202 video signal multiplex encoder (multiplexing means)
206 information source decoder (decoding means)
207 video signal multiplex decoder (multiplexing means)

Claims

(57) [Claims]

1. A multimedia communication device having: audio encoding/decoding means for encoding audio information and for decoding the encoded audio information; video encoding/decoding means for encoding video information and for decoding the encoded video information; multiplexing means for multiplexing multimedia information such as data and the audio information and video information encoded by the audio encoding/decoding means and the video encoding/decoding means; and demultiplexing means for demultiplexing the multimedia information multiplexed by the multiplexing means so that it can be decoded for each medium, the multimedia communication device comprising:
answering-machine recording control means for performing control so that, when an incoming call arrives during absence, the call is accepted automatically, transmission of audio information and video information for a response message recorded in advance in a recording unit is started to notify the partner terminal that the device is in answering-machine recording mode, and thereafter the audio information, video information, and the like received from the partner terminal are recorded in the recording unit;
recording control means for performing control so that the audio information and video information for the response message in the answering-machine recording mode are recorded in the recording unit in advance, wherein, when the audio information and video information for the response message are recorded in the recording unit, the audio information is recorded as the output of the audio encoding/decoding means and the video information is recorded as the output of the video encoding/decoding means, each as information in encoded form, and wherein the video information for the response message is encoded according to the maximum video transfer rate of the multimedia information and recorded in the recording unit;
first input control means for performing control so that, when the audio information and video information for the response message recorded in the recording unit are transmitted, the encoded audio information is input directly to the multiplexing means; and
second input control means for performing control so that, when the audio information and video information for the response message recorded in the recording unit are transmitted, the encoded video information is input to the video encoding/decoding means, decoded, encoded according to the capability of the terminal, and input.

2. A multimedia communication method for communicating using a multimedia communication device having: audio encoding/decoding means for encoding audio information and for decoding the encoded audio information; video encoding/decoding means for encoding video information and for decoding the encoded video information; multiplexing means for multiplexing multimedia information such as data and the audio information and video information encoded by the audio encoding/decoding means and the video encoding/decoding means; and demultiplexing means for demultiplexing the multimedia information multiplexed by the multiplexing means so that it can be decoded for each medium, the multimedia communication method comprising:
an answering-machine recording control step of performing control so that, when an incoming call arrives during absence, transmission of audio information and video information for a response message recorded in advance in a recording unit is started to notify the partner terminal that the device is in answering-machine recording mode, and thereafter the audio information and video information received from the partner terminal are recorded in the recording unit;
a recording control step of performing control so that the audio information and video information for the response message in the answering-machine recording mode are recorded in the recording unit in advance, wherein, when the audio information and video information for the response message are recorded in the recording unit, the audio information is recorded as the output of the audio encoding/decoding means and the video information is recorded as the output of the video encoding/decoding means, each as information in encoded form, and wherein the video information for the response message is encoded according to the maximum video transfer rate of the multimedia information and recorded in the recording unit;
a first input control step of performing control so that, when the audio information and video information for the response message recorded in the recording unit are transmitted, the encoded audio information is input directly to the multiplexing means; and
a second input control step of performing control so that, when the audio information and video information for the response message recorded in the recording unit are transmitted, the encoded video information is input to the video encoding/decoding means, decoded, encoded according to the capability of the terminal, and input.